1
|
Lerminiaux N, Fakharuddin K, Mulvey MR, Mataseje L. Do we still need Illumina sequencing data? Evaluating Oxford Nanopore Technologies R10.4.1 flow cells and the Rapid v14 library prep kit for Gram negative bacteria whole genome assemblies. Can J Microbiol 2024; 70:178-189. [PMID: 38354391 DOI: 10.1139/cjm-2023-0175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2024]
Abstract
The best whole genome assemblies are currently built from a combination of highly accurate short-read sequencing data and long-read sequencing data that can bridge repetitive and problematic regions. Oxford Nanopore Technologies (ONT) produce long-read sequencing platforms and they are continually improving their technology to obtain higher quality read data that is approaching the quality obtained from short-read platforms such as Illumina. As these innovations continue, we evaluated how much ONT read coverage produced by the Rapid Barcoding Kit v14 (SQK-RBK114) is necessary to generate high-quality hybrid and long-read-only genome assemblies for a panel of carbapenemase-producing Enterobacterales bacterial isolates. We found that 30× long-read coverage is sufficient if Illumina data are available, and that more (at least 100× long-read coverage is recommended for long-read-only assemblies. Illumina polishing is still improving single nucleotide variants (SNVs) and INDELs in long-read-only assemblies. We also examined if antimicrobial resistance genes could be accurately identified in long-read-only data, and found that Flye assemblies regardless of ONT coverage detected >96% of resistance genes at 100% identity and length. Overall, the Rapid Barcoding Kit v14 and long-read-only assemblies can be an optimal sequencing strategy (i.e., plasmid characterization and AMR detection) but finer-scale analyses (i.e., SNV) still benefit from short-read data.
Collapse
Affiliation(s)
- Nicole Lerminiaux
- National Microbiology Lab, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Ken Fakharuddin
- National Microbiology Lab, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Michael R Mulvey
- National Microbiology Lab, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Laura Mataseje
- National Microbiology Lab, Public Health Agency of Canada, Winnipeg, MB, Canada
| |
Collapse
|
2
|
Vandeputte M, Coppens S, Bossier P, Vereecke N, Vanrompay D. Genomic mining of Vibrio parahaemolyticus highlights prevalence of antimicrobial resistance genes and new genetic markers associated with AHPND and tdh + /trh + genotypes. BMC Genomics 2024; 25:178. [PMID: 38355437 PMCID: PMC10868097 DOI: 10.1186/s12864-024-10093-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 02/05/2024] [Indexed: 02/16/2024] Open
Abstract
BACKGROUND Acute Hepatopancreatic Necrosis Disease (AHPND) causes significant mortality in shrimp aquaculture. The infection is primarily instigated by Vibrio parahaemolyticus (Vp) strains carrying a plasmid encoding the binary toxin PirAB. Yet, comprehension of supplementary virulence factors associated with this relatively recent disease remains limited. Furthermore, the same holds for gastroenteritis in humans caused by other Vp genotypes. Additionally, given the prevalent use of antibiotics to combat bacterial infections, it becomes imperative to illuminate the presence of antimicrobial resistance genes within these bacteria. RESULTS A subsampled number of 1,036 Vp genomes was screened for the presence of antimicrobial resistance genes, revealing an average prevalence of 5 ± 2 (SD) genes. Additional phenotypic antimicrobial susceptibility testing of three Vp strains (M0904, TW01, and PV1) sequenced in this study demonstrated resistance to ampicillin by all tested strains. Additionally, Vp M0904 showed multidrug resistance (against ampicillin, tetracycline, and trimethoprim-sulfamethoxazole). With a focus on AHPND, a screening of all Vibrio spp. for the presence of pirA and/or pirB indicates an estimated prevalence of 0.6%, including four V. campbellii, four V. owensii, and a Vibrio sp. next to Vp. Their pirAB-encoding plasmids exhibited a highly conserved backbone, with variations primarily in the region of the Tn3 family transposase. Furthermore, an assessment of the subsampled Vp genomes for the presence of known virulence factors showed a correlation between the presence of the Type 3 Secretion System 2 and tdh, while the presence of the Type 6 Secretion System 1 was clade dependent. Furthermore, a genome-wide association study (GWAS) unveiled (new) genes associated with pirA, pirB, tdh, and trh genotypes. Notable associations with the pirAB genotype included outer membrane proteins, immunoglobulin-like domain containing proteins, and toxin-antitoxin systems. For the tdh + /trh + genotypes (containing tdh, trh, or both genes), associations were found with T3SS2 genes, urease-related genes and nickel-transport system genes, and genes involved in a 'minimal' type I-F CRISPR mechanism. CONCLUSIONS This study highlights the prevalence of antimicrobial resistance and virulence genes in Vp, identifying novel genetic markers associated with AHPND and tdh + /trh + genotypes. These findings contribute valuable insights into the genomic basis of these genotypes, with implications for shrimp aquaculture and food safety.
Collapse
Affiliation(s)
- Marieke Vandeputte
- Laboratory of Immunology and Animal Biotechnology, Department of Animal Production and Aquatic Ecology, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.
- Laboratory of Aquaculture & Artemia Reference Center, Department of Animal Production and Aquatic Ecology, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.
| | | | - Peter Bossier
- Laboratory of Aquaculture & Artemia Reference Center, Department of Animal Production and Aquatic Ecology, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium
| | | | - Daisy Vanrompay
- Laboratory of Immunology and Animal Biotechnology, Department of Animal Production and Aquatic Ecology, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium
| |
Collapse
|
3
|
Arredondo-Alonso S, Gladstone R, Pöntinen A, Gama J, Schürch A, Lanza V, Johnsen P, Samuelsen Ø, Tonkin-Hill G, Corander J. Mge-cluster: a reference-free approach for typing bacterial plasmids. NAR Genom Bioinform 2023; 5:lqad066. [PMID: 37435357 PMCID: PMC10331934 DOI: 10.1093/nargab/lqad066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 06/08/2023] [Accepted: 06/26/2023] [Indexed: 07/13/2023] Open
Abstract
Extrachromosomal elements of bacterial cells such as plasmids are notorious for their importance in evolution and adaptation to changing ecology. However, high-resolution population-wide analysis of plasmids has only become accessible recently with the advent of scalable long-read sequencing technology. Current typing methods for the classification of plasmids remain limited in their scope which motivated us to develop a computationally efficient approach to simultaneously recognize novel types and classify plasmids into previously identified groups. Here, we introduce mge-cluster that can easily handle thousands of input sequences which are compressed using a unitig representation in a de Bruijn graph. Our approach offers a faster runtime than existing algorithms, with moderate memory usage, and enables an intuitive visualization, classification and clustering scheme that users can explore interactively within a single framework. Mge-cluster platform for plasmid analysis can be easily distributed and replicated, enabling a consistent labelling of plasmids across past, present, and future sequence collections. We underscore the advantages of our approach by analysing a population-wide plasmid data set obtained from the opportunistic pathogen Escherichia coli, studying the prevalence of the colistin resistance gene mcr-1.1 within the plasmid population, and describing an instance of resistance plasmid transmission within a hospital environment.
Collapse
Affiliation(s)
| | | | - Anna K Pöntinen
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
| | - João A Gama
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Anita C Schürch
- Department of Medical Microbiology, UMC Utrecht, Utrecht, The Netherlands
| | - Val F Lanza
- CIBERINFEC, Madrid, Spain
- Bioinformatics Unit, University Hospital Ramón y Cajal, IRYCIS, Madrid, Spain
| | - Pål Jarle Johnsen
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Ørjan Samuelsen
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Gerry Tonkin-Hill
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
| | - Jukka Corander
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
- Department of Mathematics and Statistics, Helsinki Institute of Information Technology (HIIT), FI-00014 University of Helsinki, Helsinki, Finland
| |
Collapse
|
4
|
Arredondo-Alonso S, Blundell-Hunter G, Fu Z, Gladstone RA, Fillol-Salom A, Loraine J, Cloutman-Green E, Johnsen PJ, Samuelsen Ø, Pöntinen AK, Cléon F, Chavez-Bueno S, De la Cruz MA, Ares MA, Vongsouvath M, Chmielarczyk A, Horner C, Klein N, McNally A, Reis JN, Penadés JR, Thomson NR, Corander J, Taylor PW, McCarthy AJ. Evolutionary and functional history of the Escherichia coli K1 capsule. Nat Commun 2023; 14:3294. [PMID: 37322051 PMCID: PMC10272209 DOI: 10.1038/s41467-023-39052-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 05/26/2023] [Indexed: 06/17/2023] Open
Abstract
Escherichia coli is a leading cause of invasive bacterial infections in humans. Capsule polysaccharide has an important role in bacterial pathogenesis, and the K1 capsule has been firmly established as one of the most potent capsule types in E. coli through its association with severe infections. However, little is known about its distribution, evolution and functions across the E. coli phylogeny, which is fundamental to elucidating its role in the expansion of successful lineages. Using systematic surveys of invasive E. coli isolates, we show that the K1-cps locus is present in a quarter of bloodstream infection isolates and has emerged in at least four different extraintestinal pathogenic E. coli (ExPEC) phylogroups independently in the last 500 years. Phenotypic assessment demonstrates that K1 capsule synthesis enhances E. coli survival in human serum independent of genetic background, and that therapeutic targeting of the K1 capsule re-sensitizes E. coli from distinct genetic backgrounds to human serum. Our study highlights that assessing the evolutionary and functional properties of bacterial virulence factors at population levels is important to better monitor and predict the emergence of virulent clones, and to also inform therapies and preventive medicine to effectively control bacterial infections whilst significantly lowering antibiotic usage.
Collapse
Affiliation(s)
- Sergio Arredondo-Alonso
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
| | | | - Zuyi Fu
- Department of Infectious Disease, Centre for Bacterial Resistance Biology, Imperial College London, London, UK
| | - Rebecca A Gladstone
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
| | - Alfred Fillol-Salom
- Department of Infectious Disease, Centre for Bacterial Resistance Biology, Imperial College London, London, UK
| | | | - Elaine Cloutman-Green
- Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Pål J Johnsen
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Ørjan Samuelsen
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
| | - Anna K Pöntinen
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
| | - François Cléon
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Susana Chavez-Bueno
- University of Missouri Kansas City, Kansas City, USA
- Division of Infectious Diseases, Children's Mercy Hospital Kansas City, UMKC School of Medicine, Kansas City, USA
| | - Miguel A De la Cruz
- Unidad de Investigación Médica en Enfermedades Infecciosas y Parasitarias, Hospital de Pediatría, Centro Médico Nacional Siglo XXI Instituto Mexicano del Seguro Social, Mexico City, Mexico
- Facultad de Medicina, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
| | - Miguel A Ares
- Unidad de Investigación Médica en Enfermedades Infecciosas y Parasitarias, Hospital de Pediatría, Centro Médico Nacional Siglo XXI Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Manivanh Vongsouvath
- Lao-Oxford-Mahosot Hospital-Wellcome Trust Research Unit (LOMWRU), Microbiology Laboratory, Mahosot Hospital, Vientiane, Lao PDR
| | - Agnieszka Chmielarczyk
- Faculty of Medicine, Chair of Microbiology, Jagiellonian University Medical College, Czysta str. 18, 31-121, Kraków, Poland
| | - Carolyne Horner
- British Society for Antimicrobial Chemotherapy, Birmingham, UK
| | - Nigel Klein
- Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Alan McNally
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Joice N Reis
- Laboratory of Pathology and Molecular Biology (LPBM), Gonçalo Moniz Research Institute, Oswaldo Cruz Foundation, Salvador, Brazil
- Faculdade de Farmácia, Universidade Federal da Bahia, Salvador, Brazil
| | - José R Penadés
- Department of Infectious Disease, Centre for Bacterial Resistance Biology, Imperial College London, London, UK
| | - Nicholas R Thomson
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
- London School of Hygiene and Tropical Medicine, London, UK
| | - Jukka Corander
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway.
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK.
- Helsinki Institute of Information Technology, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland.
| | - Peter W Taylor
- School of Pharmacy, University College London, London, UK.
| | - Alex J McCarthy
- Department of Infectious Disease, Centre for Bacterial Resistance Biology, Imperial College London, London, UK.
| |
Collapse
|
5
|
Sanderson ND, Kapel N, Rodger G, Webster H, Lipworth S, Street TL, Peto T, Crook D, Stoesser N. Comparison of R9.4.1/Kit10 and R10/Kit12 Oxford Nanopore flowcells and chemistries in bacterial genome reconstruction. Microb Genom 2023; 9:mgen000910. [PMID: 36748454 PMCID: PMC9973852 DOI: 10.1099/mgen.0.000910] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Complete, accurate, cost-effective, and high-throughput reconstruction of bacterial genomes for large-scale genomic epidemiological studies is currently only possible with hybrid assembly, combining long- (typically using nanopore sequencing) and short-read (Illumina) datasets. Being able to use nanopore-only data would be a significant advance. Oxford Nanopore Technologies (ONT) have recently released a new flowcell (R10.4) and chemistry (Kit12), which reportedly generate per-read accuracies rivalling those of Illumina data. To evaluate this, we sequenced DNA extracts from four commonly studied bacterial pathogens, namely Escherichia coli, Klebsiella pneumoniae, Pseudomonas aeruginosa and Staphylococcus aureus, using Illumina and ONT's R9.4.1/Kit10, R10.3/Kit12, R10.4/Kit12 flowcells/chemistries. We compared raw read accuracy and assembly accuracy for each modality, considering the impact of different nanopore basecalling models, commonly used assemblers, sequencing depth, and the use of duplex versus simplex reads. 'Super accuracy' (sup) basecalled R10.4 reads - in particular duplex reads - have high per-read accuracies and could be used to robustly reconstruct bacterial genomes without the use of Illumina data. However, the per-run yield of duplex reads generated in our hands with standard sequencing protocols was low (typically <10 %), with substantial implications for cost and throughput if relying on nanopore data only to enable bacterial genome reconstruction. In addition, recovery of small plasmids with the best-performing long-read assembler (Flye) was inconsistent. R10.4/Kit12 combined with sup basecalling holds promise as a singular sequencing technology in the reconstruction of commonly studied bacterial genomes, but hybrid assembly (Illumina+R9.4.1 hac) currently remains the highest throughput, most robust, and cost-effective approach to fully reconstruct these bacterial genomes.
Collapse
Affiliation(s)
- Nicholas D. Sanderson
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
- *Correspondence: Nicholas D. Sanderson,
| | - Natalia Kapel
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Gillian Rodger
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Hermione Webster
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Samuel Lipworth
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Teresa L. Street
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Timothy Peto
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Derrick Crook
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Nicole Stoesser
- NIHR OxfordBiomedical Research Centre, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
- NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
- *Correspondence: Nicole Stoesser,
| |
Collapse
|
6
|
Waters EV, Tucker LA, Ahmed JK, Wain J, Langridge GC. Impact of Salmonella genome rearrangement on gene expression. Evol Lett 2022; 6:426-437. [PMID: 36579163 PMCID: PMC9783417 DOI: 10.1002/evl3.305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 10/03/2022] [Accepted: 11/07/2022] [Indexed: 11/20/2022] Open
Abstract
In addition to nucleotide variation, many bacteria also undergo changes at a much larger scale via rearrangement of their genome structure (GS) around long repeat sequences. These rearrangements result in genome fragments shifting position and/or orientation in the genome without necessarily affecting the underlying nucleotide sequence. To date, scalable techniques have not been applied to GS identification, so it remains unclear how extensive this variation is and the extent of its impact upon gene expression. However, the emergence of multiplexed, long-read sequencing overcomes the scale problem, as reads of several thousand bases are routinely produced that can span long repeat sequences to identify the flanking chromosomal DNA, allowing GS identification. Genome rearrangements were generated in Salmonella enterica serovar Typhi through long-term culture at ambient temperature. Colonies with rearrangements were identified via long-range PCR and subjected to long-read nanopore sequencing to confirm genome variation. Four rearrangements were investigated for differential gene expression using transcriptomics. All isolates with changes in genome arrangement relative to the parent strain were accompanied by changes in gene expression. Rearrangements with similar fragment movements demonstrated similar changes in gene expression. The most extreme rearrangement caused a large imbalance between the origin and terminus of replication and was associated with differential gene expression as a factor of distance moved toward or away from the origin of replication. Genome structure variation may provide a mechanism through which bacteria can quickly adapt to new environments and warrants routine assessment alongside traditional nucleotide-level measures of variation.
Collapse
Affiliation(s)
- Emma V. Waters
- Microbes in the Food ChainQuadram Institute BioscienceNorwichNR4 7UQUnited Kingdom
| | - Liam A. Tucker
- Microbes in the Food ChainQuadram Institute BioscienceNorwichNR4 7UQUnited Kingdom
| | - Jana K. Ahmed
- The Wellcome Trust Sanger InstituteCambridgeCB10 1SAUnited Kingdom
| | - John Wain
- Microbes in the Food ChainQuadram Institute BioscienceNorwichNR4 7UQUnited Kingdom,Norwich Medical SchoolUniversity of East AngliaNorwichNR4 7TJUnited Kingdom
| | - Gemma C. Langridge
- Microbes in the Food ChainQuadram Institute BioscienceNorwichNR4 7UQUnited Kingdom
| |
Collapse
|
7
|
Jaudou S, Tran ML, Vorimore F, Fach P, Delannoy S. Evaluation of high molecular weight DNA extraction methods for long-read sequencing of Shiga toxin-producing Escherichia coli. PLoS One 2022; 17:e0270751. [PMID: 35830426 PMCID: PMC9278759 DOI: 10.1371/journal.pone.0270751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 06/16/2022] [Indexed: 11/18/2022] Open
Abstract
Next generation sequencing has become essential for pathogen characterization and typing. The most popular second generation sequencing technique produces data of high quality with very low error rates and high depths. One major drawback of this technique is the short reads. Indeed, short-read sequencing data of Shiga toxin-producing Escherichia coli (STEC) are difficult to assemble because of the presence of numerous mobile genetic elements (MGEs), which contain repeated elements. The resulting draft assemblies are often highly fragmented, which results in a loss of information, especially concerning MGEs or large structural variations. The use of long-read sequencing can circumvent these problems and produce complete or nearly complete genomes. The ONT MinION, for its small size and minimal investment requirements, is particularly popular. The ultra-long reads generated with the MinION can easily span prophages and repeat regions. In order to take full advantage of this technology it requires High Molecular Weight (HMW) DNA of high quality in high quantity. In this study, we have tested three different extraction methods: bead-based, solid-phase and salting-out, and evaluated their impact on STEC DNA yield, quality and integrity as well as performance in MinION long-read sequencing. Both the bead-based and salting-out methods allowed the recovery of large quantities of HMW STEC DNA suitable for MinION library preparation. The DNA extracted using the salting-out method consistently produced longer reads in the subsequent MinION runs, compared with the bead-based methods. While both methods performed similarly in subsequent STEC genome assembly, DNA extraction based on salting-out appeared to be the overall best method to produce high quantity of pure HMW STEC DNA for MinION sequencing.
Collapse
Affiliation(s)
- Sandra Jaudou
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Mai-Lan Tran
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Fabien Vorimore
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Patrick Fach
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Sabine Delannoy
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- * E-mail:
| |
Collapse
|
8
|
Arredondo-Alonso S, Pöntinen AK, Cléon F, Gladstone RA, Schürch AC, Johnsen PJ, Samuelsen Ø, Corander J. A high-throughput multiplexing and selection strategy to complete bacterial genomes. Gigascience 2021; 10:giab079. [PMID: 34891160 PMCID: PMC8673558 DOI: 10.1093/gigascience/giab079] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 09/29/2021] [Accepted: 11/12/2021] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Bacterial whole-genome sequencing based on short-read technologies often results in a draft assembly formed by contiguous sequences. The introduction of long-read sequencing technologies permits those contiguous sequences to be unambiguously bridged into complete genomes. However, the elevated costs associated with long-read sequencing frequently limit the number of bacterial isolates that can be long-read sequenced. Here we evaluated the recently released 96 barcoding kit from Oxford Nanopore Technologies (ONT) to generate complete genomes on a high-throughput basis. In addition, we propose an isolate selection strategy that optimizes a representative selection of isolates for long-read sequencing considering as input large-scale bacterial collections. RESULTS Despite an uneven distribution of long reads per barcode, near-complete chromosomal sequences (assembly contiguity = 0.89) were generated for 96 Escherichia coli isolates with associated short-read sequencing data. The assembly contiguity of the plasmid replicons was even higher (0.98), which indicated the suitability of the multiplexing strategy for studies focused on resolving plasmid sequences. We benchmarked hybrid and ONT-only assemblies and showed that the combination of ONT sequencing data with short-read sequencing data is still highly desirable (i) to perform an unbiased selection of isolates for long-read sequencing, (ii) to achieve an optimal genome accuracy and completeness, and (iii) to include small plasmids underrepresented in the ONT library. CONCLUSIONS The proposed long-read isolate selection ensures the completion of bacterial genomes that span the genome diversity inherent in large collections of bacterial isolates. We show the potential of using this multiplexing approach to close bacterial genomes on a high-throughput basis.
Collapse
Affiliation(s)
- Sergio Arredondo-Alonso
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway
- Parasites and Microbes, Wellcome Sanger Institute, Cambridgeshire CB10 1RQ, UK
| | - Anna K Pöntinen
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway
| | - François Cléon
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, 9037, Tromsø, Norway
| | | | - Anita C Schürch
- Department of Medical Microbiology, UMC Utrecht, 3584 CX, Utrecht, the Netherlands
| | - Pål J Johnsen
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, 9037, Tromsø, Norway
| | - Ørjan Samuelsen
- Department of Pharmacy, Faculty of Health Sciences, UiT The Arctic University of Norway, 9037, Tromsø, Norway
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, 9038, Tromsø, Norway
| | - Jukka Corander
- Department of Biostatistics, University of Oslo, 0317, Oslo, Norway
- Parasites and Microbes, Wellcome Sanger Institute, Cambridgeshire CB10 1RQ, UK
- Department of Mathematics and Statistics, Helsinki Institute of Information Technology (HIIT), FI-00014 University of Helsinki, 02130, Espoo, Helsinki, Finland
| |
Collapse
|