1
|
Derelle R, von Wachsmann J, Mäklin T, Hellewell J, Russell T, Lalvani A, Chindelevitch L, Croucher NJ, Harris SR, Lees JA. Seamless, rapid, and accurate analyses of outbreak genomic data using split k-mer analysis. Genome Res 2024; 34:1661-1673. [PMID: 39406504 PMCID: PMC11529842 DOI: 10.1101/gr.279449.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 09/16/2024] [Indexed: 11/01/2024]
Abstract
Sequence variation observed in populations of pathogens can be used for important public health and evolutionary genomic analyses, especially outbreak analysis and transmission reconstruction. Identifying this variation is typically achieved by aligning sequence reads to a reference genome, but this approach is susceptible to reference biases and requires careful filtering of called genotypes. There is a need for tools that can process this growing volume of bacterial genome data, providing rapid results, but that remain simple so they can be used without highly trained bioinformaticians, expensive data analysis, and long-term storage and processing of large files. Here we describe split k-mer analysis (SKA2), a method that supports both reference-free and reference-based mapping to quickly and accurately genotype populations of bacteria using sequencing reads or genome assemblies. SKA2 is highly accurate for closely related samples, and in outbreak simulations, we show superior variant recall compared with reference-based methods, with no false positives. SKA2 can also accurately map variants to a reference and be used with recombination detection methods to rapidly reconstruct vertical evolutionary history. SKA2 is many times faster than comparable methods and can be used to add new genomes to an existing call set, allowing sequential use without the need to reanalyze entire collections. With an inherent absence of reference bias, high accuracy, and a robust implementation, SKA2 has the potential to become the tool of choice for genotyping bacteria. SKA2 is implemented in Rust and is freely available as open-source software.
Collapse
Affiliation(s)
- Romain Derelle
- NIHR Health Protection Research Unit in Respiratory Infections, National Heart and Lung Institute, Imperial College London, London W21PG, United Kingdom
| | - Johanna von Wachsmann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Tommi Mäklin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
- Department of Mathematics and Statistics, University of Helsinki, Helsinki 00014, Finland
| | - Joel Hellewell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Timothy Russell
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London WC1E 7HT, United Kingdom
| | - Ajit Lalvani
- NIHR Health Protection Research Unit in Respiratory Infections, National Heart and Lung Institute, Imperial College London, London W21PG, United Kingdom
| | - Leonid Chindelevitch
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W12 0BZ, United Kingdom
| | - Nicholas J Croucher
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W12 0BZ, United Kingdom
| | - Simon R Harris
- Bill and Melinda Gates Foundation, Westminster, London SW1E 6AJ, United Kingdom
| | - John A Lees
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom;
| |
Collapse
|
2
|
Versmessen N, Mispelaere M, Vandekerckhove M, Hermans C, Boelens J, Vranckx K, Van Nieuwerburgh F, Vaneechoutte M, Hulpiau P, Cools P. Average Nucleotide Identity and Digital DNA-DNA Hybridization Analysis Following PromethION Nanopore-Based Whole Genome Sequencing Allows for Accurate Prokaryotic Typing. Diagnostics (Basel) 2024; 14:1800. [PMID: 39202288 PMCID: PMC11353866 DOI: 10.3390/diagnostics14161800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/31/2024] [Accepted: 08/13/2024] [Indexed: 09/03/2024] Open
Abstract
Whole-genome sequencing (WGS) is revolutionizing clinical bacteriology. However, bacterial typing remains investigated by reference techniques with inherent limitations. This stresses the need for alternative methods providing robust and accurate sequence type (ST) classification. This study optimized and evaluated a GridION nanopore sequencing protocol, adapted for the PromethION platform. Forty-eight Escherichia coli clinical isolates with diverse STs were sequenced to assess two alternative typing methods and resistance profiling applications. Multi-locus sequence typing (MLST) was used as the reference typing method. Genomic relatedness was assessed using Average Nucleotide Identity (ANI) and digital DNA-DNA Hybridization (DDH), and cut-offs for discriminative strain resolution were evaluated. WGS-based antibiotic resistance prediction was compared to reference Minimum Inhibitory Concentration (MIC) assays. We found ANI and DDH cut-offs of 99.3% and 94.1%, respectively, which correlated well with MLST classifications and demonstrated potentially higher discriminative resolution than MLST. WGS-based antibiotic resistance prediction showed categorical agreements of ≥ 93% with MIC assays for amoxicillin, ceftazidime, amikacin, tobramycin, and trimethoprim-sulfamethoxazole. Performance was suboptimal (68.8-81.3%) for amoxicillin-clavulanic acid, cefepime, aztreonam, and ciprofloxacin. A minimal sequencing coverage of 12× was required to maintain essential genomic features and typing accuracy. Our protocol allows the integration of PromethION technology in clinical laboratories, with ANI and DDH proving to be accurate and robust alternative typing methods, potentially offering superior resolution. WGS-based antibiotic resistance prediction holds promise for specific antibiotic classes.
Collapse
Affiliation(s)
- Nick Versmessen
- Laboratory Bacteriology Research, Department of Diagnostic Sciences, Faculty of Medicine and Health Sciences, Ghent University, 9000 Ghent, Belgium
- Department of Diagnostic Sciences, Faculty of Medicine and Health Sciences, Ghent University, 9000 Ghent, Belgium
| | - Marieke Mispelaere
- Department of Bio-Medical Sciences, HOWEST University of Applied Sciences, 8000 Bruges, Belgium
| | | | - Cedric Hermans
- Department of Bio-Medical Sciences, HOWEST University of Applied Sciences, 8000 Bruges, Belgium
| | - Jerina Boelens
- Department of Diagnostic Sciences, Faculty of Medicine and Health Sciences, Ghent University, 9000 Ghent, Belgium
- Department of Laboratory Medicine, Ghent University Hospital, 9000 Ghent, Belgium
| | | | - Filip Van Nieuwerburgh
- NXTGNT, Department of Pharmaceutics, Faculty of Pharmaceutical Sciences, Ghent University, 9000 Ghent, Belgium
| | - Mario Vaneechoutte
- Laboratory Bacteriology Research, Department of Diagnostic Sciences, Faculty of Medicine and Health Sciences, Ghent University, 9000 Ghent, Belgium
| | - Paco Hulpiau
- Department of Bio-Medical Sciences, HOWEST University of Applied Sciences, 8000 Bruges, Belgium
| | - Piet Cools
- Laboratory Bacteriology Research, Department of Diagnostic Sciences, Faculty of Medicine and Health Sciences, Ghent University, 9000 Ghent, Belgium
- Department of Diagnostic Sciences, Faculty of Medicine and Health Sciences, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
3
|
Islam MR, Sharma MK, KhunKhun R, Shandro C, Sekirov I, Tyrrell GJ, Soualhine H. Whole genome sequencing-based identification of human tuberculosis caused by animal-lineage Mycobacterium orygis. J Clin Microbiol 2023; 61:e0026023. [PMID: 37877705 PMCID: PMC10662373 DOI: 10.1128/jcm.00260-23] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 09/11/2023] [Indexed: 10/26/2023] Open
Abstract
A recently described member of the Mycobacterium tuberculosis complex (MTBC) is Mycobacterium orygis, which can cause disease primarily in animals but also in humans. Although M. orygis has been reported from different geographic regions around the world, due to a lack of proper identification techniques, the contribution of this emerging pathogen to the global burden of zoonotic tuberculosis is not fully understood. In the present work, we report single nucleotide polymorphism (SNP) analysis using whole genome sequencing (WGS) that can accurately identify M. orygis and differentiate it from other members of the MTBC species. WGS-based SNP analysis was performed for 61 isolates from different provinces in Canada that were identified as M. orygis. A total of 56 M. orygis sequences from the public databases were also included in the analysis. Several unique SNPs in the gyrB, PPE55, Rv2042c, leuS, mmpL6, and mmpS6 genes were used to determine their effectiveness as genetic markers for the identification of M. orygis. To the best of our knowledge, five of these SNPs, viz., gyrB 277 (A→G), gyrB 1478 (T→C), leuS 1064 (A→T), mmpL6 486 (T→C), and mmpS6 334 (C→G), are reported for the first time in this study. Our results also revealed several SNPs specific to other species within MTBC. The phylogenetic analysis shows that the studied genomes were genetically diverse and clustered with M. orygis sequences of human and animal origin reported from different geographic locations. Therefore, the present study provides a new insight into the high-confidence identification of M. orygis from MTBC species based on WGS data, which can be useful for reference and diagnostic laboratories.
Collapse
Affiliation(s)
- Md Rashedul Islam
- National Reference Centre for Mycobacteriology, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Meenu K. Sharma
- National Reference Centre for Mycobacteriology, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
- Department of Medical Microbiology and Infectious Diseases, Max Rady College of Medicine, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Rupinder KhunKhun
- BC Center for Disease Control Public Health Laboratory, Vancouver, British Columbia, Canada
| | - Cary Shandro
- Provincial Laboratory for Public Health, Alberta Precision Labs, Edmonton, Alberta, Canada
| | - Inna Sekirov
- BC Center for Disease Control Public Health Laboratory, Vancouver, British Columbia, Canada
| | - Gregory J. Tyrrell
- Provincial Laboratory for Public Health, Alberta Precision Labs, Edmonton, Alberta, Canada
| | - Hafid Soualhine
- National Reference Centre for Mycobacteriology, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
- Department of Medical Microbiology and Infectious Diseases, Max Rady College of Medicine, University of Manitoba, Winnipeg, Manitoba, Canada
| |
Collapse
|
4
|
García-Martín AB, Roder T, Schmitt S, Zeeh F, Bruggmann R, Perreten V. Whole-genome analyses reveal a novel prophage and cgSNPs-derived sublineages of Brachyspira hyodysenteriae ST196. BMC Genomics 2022; 23:131. [PMID: 35168548 PMCID: PMC8845278 DOI: 10.1186/s12864-022-08347-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 01/25/2022] [Indexed: 11/17/2022] Open
Abstract
Background Brachyspira (B.) hyodysenteriae is a fastidious anaerobe spirochete that can cause swine dysentery, a severe mucohaemorragic colitis that affects pig production and animal welfare worldwide. In Switzerland, the population of B. hyodysenteriae is characterized by the predominance of macrolide-lincosamide-resistant B. hyodysenteriae isolates of sequence type (ST) ST196, prompting us to obtain deeper insights into the genomic structure and variability of ST196 using pangenome and whole genome variant analyses. Results The draft genome of 14 B. hyodysenteriae isolates of ST196, sampled during a 7-year period from geographically distant pig herds, was obtained by whole-genome sequencing (WGS) and compared to the complete genome of the B. hyodysenteriae isolate Bh743-7 of ST196 used as reference. Variability results revealed the existence of 30 to 52 single nucleotide polymorphisms (SNPs), resulting in eight sublineages of ST196. The pangenome analysis led to the identification of a novel prophage, pphBhCH20, of the Siphoviridae family in a single isolate of ST196, which suggests that horizontal gene transfer events may drive changes in genomic structure. Conclusions This study contributes to the catalogue of publicly available genomes and provides relevant bioinformatic tools and information for further comparative genomic analyses for B. hyodysenteriae. It reveals that Swiss B. hyodysenteriae isolates of the same ST may have evolved independently over time by point mutations and acquisition of larger genetic elements. In line with this, the third type of mobile genetic element described so far in B. hyodysenteriae, the novel prophage pphBhCH20, has been identified in a single isolate of B. hyodysenteriae of ST196. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08347-5.
Collapse
Affiliation(s)
- Ana Belén García-Martín
- Division of Molecular Bacterial Epidemiology and Infectious Diseases, Institute of Veterinary Bacteriology, Vetsuisse Faculty, University of Bern, Bern, Switzerland.,Graduate School for Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland
| | - Thomas Roder
- Graduate School for Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland.,Interfaculty Bioinformatics Unit and Swiss Institute of Bioinformatics, University of Bern, Bern, Switzerland
| | - Sarah Schmitt
- Section of Veterinary Bacteriology, Institute for Food Safety and Hygiene, Vetsuisse Faculty, University of Zurich, Zurich, Switzerland
| | - Friederike Zeeh
- Clinic for Swine, Department of Clinical Veterinary Medicine, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - Rémy Bruggmann
- Interfaculty Bioinformatics Unit and Swiss Institute of Bioinformatics, University of Bern, Bern, Switzerland
| | - Vincent Perreten
- Division of Molecular Bacterial Epidemiology and Infectious Diseases, Institute of Veterinary Bacteriology, Vetsuisse Faculty, University of Bern, Bern, Switzerland. .,Institute of Veterinary Bacteriology, University of Bern, Länggassstrasse 122, CH-3012, Bern, Switzerland.
| |
Collapse
|
5
|
Asadi L, Croxen M, Heffernan C, Dhillon M, Paulsen C, Egedahl ML, Tyrrell G, Doroshenko A, Long R. How much do smear-negative patients really contribute to tuberculosis transmissions? Re-examining an old question with new tools. EClinicalMedicine 2022; 43:101250. [PMID: 35036885 PMCID: PMC8743225 DOI: 10.1016/j.eclinm.2021.101250] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 12/02/2021] [Accepted: 12/09/2021] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Sputum smear microscopy is a common surrogate for tuberculosis infectiousness. Previous estimates that smear-negative patients contribute 13-20% of transmissions and are, on average, 20 to 25% as infectious as smear-positive cases are understood to be high. Herein, we use an ideal real-world setting, a comprehensive dataset, and new high-resolution techniques to more accurately estimate the true transmission risk of smear-negative cases. METHODS We treated all adult culture-positive pulmonary TB patients diagnosed in the province of Alberta, Canada from 2003 to 2016 as potential transmitters. The primary data sources were the Alberta TB Registry and the Provincial Laboratory for Public Health. We measured, as primary outcomes, the proportion of transmissions attributable to smear-negative sources and the relative transmission rate. First, we replicated previous studies by using molecular (DNA) fingerprint clustering. Then, using a prospectively collected registry of TB contacts, we defined transmission events as active TB amongst identified contacts who either had a 100% DNA fingerprint match to the source case or a clinical diagnosis. We supplemented our analysis with genome sequencing on temporally and geographically linked DNA fingerprint clusters of cases not identified as contacts. FINDINGS There were 1176 cases, 563 smear-negative and 613 smear-positive, and 23,131 contacts. Replicating previous studies, the proportion of transmissions attributable to smear-negative source cases was 16% (95% CI, 12-19%) and the relative transmission rate was 0.19 (95% CI, 0.14-0.26). With our combined approach, the proportion of transmission was 8% (95% CI, 3-14%) and the relative transmission rate became 0.10 (95% CI, 0.05-0.19). INTERPRETATION When we examined the same outcomes as in previous studies but refined transmission ascertainment with the addition of conventional epidemiology and genomics, we found that smear-negative cases were ∼50% less infectious than previously thought. FUNDING Alberta Innovates Health Solutions.
Collapse
Affiliation(s)
- Leyla Asadi
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
| | - Matthew Croxen
- The Department of Laboratory Medicine and Pathology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB, Canada
| | - Courtney Heffernan
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
| | - Mannat Dhillon
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
| | - Catherine Paulsen
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
| | - Mary Lou Egedahl
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
| | - Greg Tyrrell
- The Department of Laboratory Medicine and Pathology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB, Canada
| | - Alexander Doroshenko
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
| | - Richard Long
- The Department of Medicine, Faculty of Medicine and Dentistry, University of Alberta, Room 8334A, 3rd Floor, Aberhart Centre, 11402 University Avenue NW, Edmonton, Edmonton, AB T6G 2J3, Canada
- Corresponding author.
| |
Collapse
|