1
|
Mixão V, Pinto M, Brendebach H, Sobral D, Dourado Santos J, Radomski N, Majgaard Uldall AS, Bomba A, Pietsch M, Bucciacchio A, de Ruvo A, Castelli P, Iwan E, Simon S, Coipan CE, Linde J, Petrovska L, Kaas RS, Grimstrup Joensen K, Holtsmark Nielsen S, Kiil K, Lagesen K, Di Pasquale A, Gomes JP, Deneke C, Tausch SH, Borges V. Multi-country and intersectoral assessment of cluster congruence between pipelines for genomics surveillance of foodborne pathogens. Nat Commun 2025; 16:3961. [PMID: 40295532 PMCID: PMC12038046 DOI: 10.1038/s41467-025-59246-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Accepted: 04/15/2025] [Indexed: 04/30/2025] Open
Abstract
Different laboratories employ different Whole-Genome Sequencing (WGS) pipelines for Food and Waterborne disease (FWD) surveillance, casting doubt on the comparability of their results and hindering optimal communication at intersectoral and international levels. Through a collaborative effort involving eleven European institutes spanning the food, animal, and human health sectors, we aimed to assess the inter-pipeline clustering congruence across all resolution levels and perform an in-depth comparative analysis of cluster composition at outbreak level for four important foodborne pathogens: Listeria monocytogenes, Salmonella enterica, Escherichia coli, and Campylobacter jejuni. We found a general concordance between allele-based pipelines for all species, except for C. jejuni, where the different resolution power of allele-based schemas led to marked discrepancies. Still, we identified non-negligible differences in outbreak detection and demonstrated how a threshold flexibilization favors the detection of similar outbreak signals by different laboratories. These results, together with the observation that different traditional typing groups (e.g., serotypes) exhibit a remarkably different genetic diversity, represent valuable information for future outbreak case-definitions and WGS-based nomenclature design. This study reinforces the need, while demonstrating the feasibility, of conducting continuous pipeline comparability assessments, and opens good perspectives for a smoother international and intersectoral cooperation towards an efficient One Health FWD surveillance.
Collapse
Affiliation(s)
- Verónica Mixão
- Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
| | - Miguel Pinto
- Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
| | - Holger Brendebach
- National Study Center for Sequencing, Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
| | - Daniel Sobral
- Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
| | - João Dourado Santos
- Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
| | - Nicolas Radomski
- National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise (IZSAM), Teramo, Italy
| | | | - Arkadiusz Bomba
- Department of Omics Analyses, National Veterinary Research Institute (PIWet), Puławy, Poland
| | - Michael Pietsch
- Unit of Enteropathogenic Bacteria and Legionella, Robert Koch Institute (RKI), Wernigerode, Germany
| | - Andrea Bucciacchio
- National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise (IZSAM), Teramo, Italy
| | - Andrea de Ruvo
- National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise (IZSAM), Teramo, Italy
- Computer Science, Gran Sasso Science Institute, L'Aquila, Italy
| | - Pierluigi Castelli
- National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise (IZSAM), Teramo, Italy
| | - Ewelina Iwan
- Department of Omics Analyses, National Veterinary Research Institute (PIWet), Puławy, Poland
| | - Sandra Simon
- Unit of Enteropathogenic Bacteria and Legionella, Robert Koch Institute (RKI), Wernigerode, Germany
| | - Claudia E Coipan
- Department for Infectious Diseases, Epidemiology and Surveillance, National Institute for Public Health and the Environment (RIVM), Bilthoven, The Netherlands
| | - Jörg Linde
- Institute of Bacterial Infections and Zoonoses, Friedrich-Loeffler-Institute (FLI), Jena, Germany
| | | | - Rolf Sommer Kaas
- National Food Institute, Technical University of Denmark (DTU), Lyngby, Denmark
| | | | - Sofie Holtsmark Nielsen
- Department of Bacteria, Parasites & Fungi, Statens Serum Institut (SSI), Copenhagen, Denmark
| | - Kristoffer Kiil
- Department of Bacteria, Parasites & Fungi, Statens Serum Institut (SSI), Copenhagen, Denmark
| | - Karin Lagesen
- Section for Epidemiology, Norwegian Veterinary Institute (NVI), Ås, Norway
| | - Adriano Di Pasquale
- National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise (IZSAM), Teramo, Italy
| | - João Paulo Gomes
- Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
- Veterinary and Animal Research Center (CECAV), Faculty of Veterinary Medicine, Lusófona University, Lisbon, Portugal
| | - Carlus Deneke
- National Study Center for Sequencing, Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
| | - Simon H Tausch
- National Study Center for Sequencing, Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
| | - Vítor Borges
- Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal.
| |
Collapse
|
2
|
Nemati A, Gigliucci F, Morabito S, Badouei MA. Virulence plasmids in edema disease: Insights from whole-genome analysis of porcine O139:H1 Shiga toxin-producing Escherichia coli (STEC) strains. Front Cell Infect Microbiol 2025; 15:1528408. [PMID: 40182763 PMCID: PMC11965690 DOI: 10.3389/fcimb.2025.1528408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2024] [Accepted: 03/04/2025] [Indexed: 04/05/2025] Open
Abstract
This study investigates the plasmid sequences of porcine O139:H1 Shiga toxin-producing Escherichia coli (STEC) responsible for Edema Disease (ED). Whole-genome analysis reveals significant similarities between these strains and known plasmids, notably pW1316-2, which harbors key virulence genes like hemolysin (hlyA, hlyB) and adhesion factors (aidA-I, faeE). These genes contribute to the cytotoxicity and host colonization associated with ED. Additionally, similarities to plasmids from Shigella flexneri 2a highlight potential associations in virulence gene regulation, particularly via the Hha-H-NS complex. The identification of sequences resembling plasmid pB71 raises serious concerns about the emergence of highly pathogenic strains, as it includes tetracycline resistance genes (tetA, tetC, tetR). This research emphasizes the role of plasmid-like sequences in ED pathogenesis, indicating important implications for swine industry management and public health.
Collapse
Affiliation(s)
- Ali Nemati
- European Union Reference Laboratory (EURL) for Escherichia coli including Shiga toxin-producing E. coli (STEC), Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
- Department of Pathobiology, Faculty of Veterinary Medicine, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Federica Gigliucci
- European Union Reference Laboratory (EURL) for Escherichia coli including Shiga toxin-producing E. coli (STEC), Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | - Stefano Morabito
- European Union Reference Laboratory (EURL) for Escherichia coli including Shiga toxin-producing E. coli (STEC), Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | - Mahdi Askari Badouei
- Department of Pathobiology, Faculty of Veterinary Medicine, Ferdowsi University of Mashhad, Mashhad, Iran
| |
Collapse
|
3
|
Shelenkov A, Slavokhotova A, Mikhaylova Y, Akimkin V. Genomic typing, antimicrobial resistance gene, virulence factor and plasmid replicon database for the important pathogenic bacteria Klebsiella pneumoniae. BMC Microbiol 2025; 25:3. [PMID: 39762743 PMCID: PMC11702089 DOI: 10.1186/s12866-024-03720-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Accepted: 12/19/2024] [Indexed: 01/11/2025] Open
Abstract
BACKGROUND The infections of bacterial origin represent a significant problem to the public healthcare worldwide both in clinical and community settings. Recent decade was marked by limiting treatment options for bacterial infections due to growing antimicrobial resistance (AMR) acquired and transferred by various bacterial species, especially the ones causing healthcare-associated infections, which has become a dangerous issue noticed by the World Health Organization. Numerous reports shown that the spread of AMR is often driven by several species-specific lineages usually called the 'global clones of high risk'. Thus, it is essential to track the isolates belonging to such clones and investigate the mechanisms of their pathogenicity and AMR acquisition. Currently, the whole genome-based analysis is more and more often used for these purposes, including the epidemiological surveillance and analysis of mobile elements involved in resistance transfer. However, in spite of the exponential growth of available bacterial genomes, their representation usually lack uniformity and availability of supporting metadata, which creates a bottleneck for such investigations. DESCRIPTION In this database, we provide the results of a thorough genomic analysis of 61,857 genomes of a highly dangerous bacterial pathogen Klebsiella pneumoniae. Important isolate typing information including multilocus sequence typing (MLST) types (STs), assignment of the isolates to known global clones, capsular (KL) and lipooligosaccharide (O) types, the presence of CRISPR-Cas systems, and cgMLST profiles are given, and the information regarding the presence of AMR, virulence genes and plasmid replicons within the genomes is provided. CONCLUSION This database is freely available under CC BY-NC-SA at https://doi.org/10.5281/zenodo.11069018 . The database will facilitate selection of the proper reference isolate sets for any types of genome-based investigations. It will be helpful for investigations in the field of K. pneumoniae genomic epidemiology, as well as antimicrobial resistance analysis and the development of prevention measures against this important pathogen.
Collapse
Affiliation(s)
- Andrey Shelenkov
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, Moscow, 111123, Russia.
| | - Anna Slavokhotova
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, Moscow, 111123, Russia
| | - Yulia Mikhaylova
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, Moscow, 111123, Russia
| | - Vasiliy Akimkin
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, Moscow, 111123, Russia
| |
Collapse
|
4
|
Merda D, Vila-Nova M, Bonis M, Boutigny AL, Brauge T, Cavaiuolo M, Cunty A, Regnier A, Sayeb M, Vingadassalon N, Yvon C, Chesnais V. Unraveling the impact of genome assembly on bacterial typing: a one health perspective. BMC Genomics 2024; 25:1059. [PMID: 39516732 PMCID: PMC11545336 DOI: 10.1186/s12864-024-10982-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Accepted: 10/30/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND In the context of pathogen surveillance, it is crucial to ensure interoperability and harmonized data. Several surveillance systems are designed to compare bacteria and identify outbreak clusters based on core genome MultiLocus Sequence Typing (cgMLST). Among the different approaches available to generate bacterial cgMLST, our research used an assembly-based approach (chewBBACA tool). METHODS Simulations of short-read sequencing were conducted for 5 genomes of 27 pathogens of interest in animal, plant, and human health to evaluate the repeatability and reproducibility of cgMLST. Various quality parameters, such as read quality and depth of sequencing were applied, and several read simulations and genome assemblies were repeated using three tools: SPAdes, Unicycler and Shovill. In vitro sequencing were also used to evaluate assembly impact on cgMLST results, for six bacterial species: Bacillus thuringiensis, Listeria monocytogenes, Salmonella enterica, Staphylococcus aureus, Vibrio parahaemolyticus and Xylella fastidiosa. RESULTS The results highlighted variability in cgMLST, which not only related to the assembly tools, but also induced by the intrinsic composition of the genomes themselves. This variability observed in simulated sequencing was further validated with real data for six of the bacterial pathogens studied. CONCLUSION This highlights that the intrinsic genome composition affects assembly and resulting cgMLST profiles, and that variability in bioinformatics tools can induce a bias in cgMLST profiles. In conclusion, we propose that the completeness of cgMLST schemes should be considered when clustering strains.
Collapse
Affiliation(s)
- Déborah Merda
- Université Paris Est, ANSES, Laboratory for Food Safety, SPAAD unit, Maisons-Alfort, F-94701, France.
| | - Meryl Vila-Nova
- Université Paris Est, ANSES, Laboratory for Food Safety, SPAAD unit, Maisons-Alfort, F-94701, France
| | - Mathilde Bonis
- Université Paris Est, ANSES, Laboratory for Food Safety, SBCL unit, Maisons-Alfort, F-94701, France
| | - Anne-Laure Boutigny
- ANSES, Plant Health Laboratory, Bacteriology Virology GMO Unit, 7 rue Jean Dixméras, Angers cedex 01, 49044, France
| | - Thomas Brauge
- ANSES, Laboratory for Food Safety, Bacteriology and Parasitology of Fishery and Aquaculture Products Unit (B3PA), Boulevard du Bassin Napoléon, Boulogne-sur-Mer, France
| | - Marina Cavaiuolo
- Université Paris Est, ANSES, Laboratory for Food Safety, SBCL unit, Maisons-Alfort, F-94701, France
| | - Amandine Cunty
- ANSES, Plant Health Laboratory, Bacteriology Virology GMO Unit, 7 rue Jean Dixméras, Angers cedex 01, 49044, France
| | - Antoine Regnier
- ANSES, Laboratory for Food Safety, Bacteriology and Parasitology of Fishery and Aquaculture Products Unit (B3PA), Boulevard du Bassin Napoléon, Boulogne-sur-Mer, France
| | - Maroua Sayeb
- Université Paris Est, ANSES, Laboratory for Food Safety, SEL unit, Maisons-Alfort, F-94701, France
| | - Noémie Vingadassalon
- Université Paris Est, ANSES, Laboratory for Food Safety, SBCL unit, Maisons-Alfort, F-94701, France
| | - Claire Yvon
- Université Paris Est, ANSES, Laboratory for Food Safety, SEL unit, Maisons-Alfort, F-94701, France
| | - Virginie Chesnais
- Université Paris Est, ANSES, Laboratory for Food Safety, SPAAD unit, Maisons-Alfort, F-94701, France
| |
Collapse
|
5
|
Fernández-Palacios P, Galán-Sánchez F, Casimiro-Soriguer CS, Jurado-Tarifa E, Arroyo F, Lara M, Chaves JA, Dopazo J, Rodríguez-Iglesias MA. Genotypic characterization and antimicrobial susceptibility of human Campylobacter jejuni isolates in Southern Spain. Microbiol Spectr 2024; 12:e0102824. [PMID: 39162511 PMCID: PMC11449230 DOI: 10.1128/spectrum.01028-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 07/09/2024] [Indexed: 08/21/2024] Open
Abstract
Campylobacter jejuni is the main cause of bacterial gastroenteritis and a public health problem worldwide. Little information is available on the genotypic characteristics of human C. jejuni in Spain. This study is based on an analysis of the resistome, virulome, and phylogenetic relationship, antibiogram prediction, and antimicrobial susceptibility of 114 human isolates of C. jejuni from a tertiary hospital in southern Spain from October 2020 to June 2023. The isolates were sequenced using Illumina technology, and a bioinformatic analysis was subsequently performed. The susceptibility of C. jejuni isolates to ciprofloxacin, tetracycline, and erythromycin was also tested. The resistance rates for each antibiotic were 90.3% for ciprofloxacin, 66.7% for tetracycline, and 0.88% for erythromycin. The fluoroquinolone resistance rate obtained is well above the European average (69.1%). CC-21 (n = 23), ST-572 (n = 13), and ST-6532 (n = 13) were the most prevalent clonal complexes (CCs) and sequence types (STs). In the virulome, the cadF, ciaB, and cdtABC genes were detected in all the isolates. A prevalence of 20.1% was obtained for the genes wlaN and cstIII, which are related to the pathogenesis of Guillain-Barré syndrome (GBS). The prevalence of the main antimicrobial resistance markers detected were CmeABC (92.1%), RE-cmeABC (7.9%), the T86I substitution in gyrA (88.9%), blaOXA-61 (72.6%), tet(O) (65.8%), and ant (6)-Ia (17.1%). High antibiogram prediction rates (>97%) were obtained, except for in the case of the erythromycin-resistant phenotype. This study contributes significantly to the knowledge of C. jejuni genomics for the prevention, treatment, and control of infections caused by this pathogen.IMPORTANCEDespite being the pathogen with the greatest number of gastroenteritis cases worldwide, Campylobacter jejuni remains a poorly studied microorganism. A sustained increase in fluoroquinolone resistance in human isolates is a problem when treating Campylobacter infections. The development of whole genome sequencing (WGS) techniques has allowed us to better understand the genotypic characteristics of this pathogen and relate them to antibiotic resistance phenotypes. These techniques complement the data obtained from the phenotypic analysis of C. jejuni isolates. The zoonotic transmission of C. jejuni through the consumption of contaminated poultry supports approaching the study of this pathogen through "One Health" approach. In addition, due to the limited information on the genomic characteristics of C. jejuni in Spain, this study provides important data and allows us to compare the results with those obtained in other countries.
Collapse
Affiliation(s)
| | | | - Carlos S Casimiro-Soriguer
- Plataforma Andaluza de Medicina Computacional, Fundación Pública Andaluza Progreso y Salud, Sevilla, Spain
| | - Estefanía Jurado-Tarifa
- Instituto de Investigación e Innovación Biomédica de Cádiz (INIBICA), Hospital Universitario Puerta del Mar, Cádiz, Spain
| | - Federico Arroyo
- UGC Microbiología, Hospital Universitario Puerta del Mar, Cádiz, Spain
| | - María Lara
- Plataforma Andaluza de Medicina Computacional, Fundación Pública Andaluza Progreso y Salud, Sevilla, Spain
| | - J Alberto Chaves
- Subdirección de Protección de la Salud, Consejería de Salud y Familias, Sevilla, Spain
| | - Joaquín Dopazo
- Plataforma Andaluza de Medicina Computacional, Fundación Pública Andaluza Progreso y Salud, Sevilla, Spain
| | - Manuel A Rodríguez-Iglesias
- UGC Microbiología, Hospital Universitario Puerta del Mar, Cádiz, Spain
- Departamento de Biomedicina, Biotecnología y Salud Pública, Universidad de Cádiz, Cádiz, Spain
| |
Collapse
|
6
|
Cossi MVC, Polveiro RC, Yamatogi RS, Camargo AC, Nero LA. Multi-locus sequence typing, antimicrobials resistance and virulence profiles of Salmonella enterica isolated from bovine carcasses in Minas Gerais state, Brazil. Braz J Microbiol 2024; 55:1773-1781. [PMID: 38702536 PMCID: PMC11153481 DOI: 10.1007/s42770-024-01341-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 04/08/2024] [Indexed: 05/06/2024] Open
Abstract
The aim of this study was to identify virulence and antimicrobial resistance profiles and determine the sequence type (ST) by multilocus sequence typing (MLST) of Salmonella enterica isolates from bovine carcasses from slaughterhouse located in Minas Gerais state, Brazil, and its relationship with bovine isolates obtained on the American continent based on sequence type profile. The MLST results were compared with all Salmonella STs associated with cattle on American continent, and a multi-locus sequence tree (MS tree) was built. Among the 17 S. enterica isolates, five ST profiles identified, and ST10 were the most frequent, grouping seven (41.2%) isolates. The isolates presented 11 different profiles of virulence genes, and six different antibiotics resistance profiles. The survey on Enterobase platform showed 333 Salmonella STs from American continent, grouped into four different clusters. Most of the isolates in the present study (13/17), were concentrated in a single cluster (L4) composed by 74 STs. As a conclusion, five different STs were identified, with ST10 being the most common. The isolates showed great diversity of virulence genes and antibiotics resistance profiles. Most of the isolates of this study were grouped into a single cluster composed by 74 STs formed by bovine isolates obtained on the American continent.
Collapse
Affiliation(s)
| | - Richard Costa Polveiro
- Departamento de Veterinária, Universidade Federal de Viçosa, Campus Universitário, Viçosa, Minas Gerais, 36570-000, Brazil
| | - Ricardo Seiti Yamatogi
- Departamento de Veterinária, Universidade Federal de Viçosa, Campus Universitário, Viçosa, Minas Gerais, 36570-000, Brazil
| | - Anderson Carlos Camargo
- Departamento de Veterinária, Universidade Federal de Viçosa, Campus Universitário, Viçosa, Minas Gerais, 36570-000, Brazil
| | - Luís Augusto Nero
- Departamento de Veterinária, Universidade Federal de Viçosa, Campus Universitário, Viçosa, Minas Gerais, 36570-000, Brazil
| |
Collapse
|
7
|
Guzinski J, Tang Y, Chattaway MA, Dallman TJ, Petrovska L. Development and validation of a random forest algorithm for source attribution of animal and human Salmonella Typhimurium and monophasic variants of S. Typhimurium isolates in England and Wales utilising whole genome sequencing data. Front Microbiol 2024; 14:1254860. [PMID: 38533130 PMCID: PMC10963456 DOI: 10.3389/fmicb.2023.1254860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 12/22/2023] [Indexed: 03/28/2024] Open
Abstract
Source attribution has traditionally involved combining epidemiological data with different pathogen characterisation methods, including 7-gene multi locus sequence typing (MLST) or serotyping, however, these approaches have limited resolution. In contrast, whole genome sequencing data provide an overview of the whole genome that can be used by attribution algorithms. Here, we applied a random forest (RF) algorithm to predict the primary sources of human clinical Salmonella Typhimurium (S. Typhimurium) and monophasic variants (monophasic S. Typhimurium) isolates. To this end, we utilised single nucleotide polymorphism diversity in the core genome MLST alleles obtained from 1,061 laboratory-confirmed human and animal S. Typhimurium and monophasic S. Typhimurium isolates as inputs into a RF model. The algorithm was used for supervised learning to classify 399 animal S. Typhimurium and monophasic S. Typhimurium isolates into one of eight distinct primary source classes comprising common livestock and pet animal species: cattle, pigs, sheep, other mammals (pets: mostly dogs and horses), broilers, layers, turkeys, and game birds (pheasants, quail, and pigeons). When applied to the training set animal isolates, model accuracy was 0.929 and kappa 0.905, whereas for the test set animal isolates, for which the primary source class information was withheld from the model, the accuracy was 0.779 and kappa 0.700. Subsequently, the model was applied to assign 662 human clinical cases to the eight primary source classes. In the dataset, 60/399 (15.0%) of the animal and 141/662 (21.3%) of the human isolates were associated with a known outbreak of S. Typhimurium definitive type (DT) 104. All but two of the 141 DT104 outbreak linked human isolates were correctly attributed by the model to the primary source classes identified as the origin of the DT104 outbreak. A model that was run without the clonal DT104 animal isolates produced largely congruent outputs (training set accuracy 0.989 and kappa 0.985; test set accuracy 0.781 and kappa 0.663). Overall, our results show that RF offers considerable promise as a suitable methodology for epidemiological tracking and source attribution for foodborne pathogens.
Collapse
Affiliation(s)
- Jaromir Guzinski
- Animal and Plant Health Agency, Bacteriology Department, Addlestone, United Kingdom
| | - Yue Tang
- Animal and Plant Health Agency, Bacteriology Department, Addlestone, United Kingdom
| | - Marie Anne Chattaway
- Gastrointestinal Bacteria Reference Unit, UK Health Security Agency, London, United Kingdom
| | - Timothy J. Dallman
- Gastrointestinal Bacteria Reference Unit, UK Health Security Agency, London, United Kingdom
| | - Liljana Petrovska
- Animal and Plant Health Agency, Bacteriology Department, Addlestone, United Kingdom
| |
Collapse
|
8
|
Biguenet A, Bordy A, Atchon A, Hocquet D, Valot B. Introduction and benchmarking of pyMLST: open-source software for assessing bacterial clonality using core genome MLST. Microb Genom 2023; 9. [PMID: 37966168 DOI: 10.1099/mgen.0.001126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2023] Open
Abstract
Core genome multilocus sequence typing (cgMLST) has gained in popularity for bacterial typing since whole-genome sequencing (WGS) has become affordable. We introduce here pyMLST, a new complete, stand-alone, free and open source pipeline for cgMLST analysis. pyMLST can create or import a core genome database. For each gene, the first allele is aligned against the bacterial genome of interest using BLAT. Incomplete genes are aligned using MAFT. All data are stored in a SQLite database. pyMLST accepts assembly genomes or raw data (with the option pyMLST-KMA) as input. To evaluate our new tool, we selected three genome collections of major bacterial pathogens (Escherichia coli, Pseudomonas aeruginosa and Staphylococcus aureus) and compared them with pyMLST, pyMLST-KMA, ChewBBACA, SeqSphere and the variant calling approach. We compared the sensitivity, precision and false-positive rate for each method with those of the variant calling approach. Minimal spanning trees were generated with each type of software to evaluate their interest in the context of a bacterial outbreak. We found that pyMLST-KMA is a convenient screening method to avoid assembling large bacterial collections. Our data showed that pyMLST (free, open source, available in Galaxy and pipeline ready) performed similarly to the commercial SeqSphere and performed better than ChewBBACA and pyMLST-KMA.
Collapse
Affiliation(s)
- Adrien Biguenet
- CHU de Besançon, Hygiène Hospitalière, F-25030 Besançon, France
- Université de Franche-Comté, CNRS, Chrono-environnement, F-25000 Besançon, France
| | - Augustin Bordy
- Université de Franche-Comté, CNRS, Chrono-environnement, F-25000 Besançon, France
| | - Alban Atchon
- Bioinformatique et Big Data Au Service de La Santé, Université de Franche-Comté, F-25000 Besançon, France
| | - Didier Hocquet
- CHU de Besançon, Hygiène Hospitalière, F-25030 Besançon, France
- Université de Franche-Comté, CNRS, Chrono-environnement, F-25000 Besançon, France
| | - Benoit Valot
- Université de Franche-Comté, CNRS, Chrono-environnement, F-25000 Besançon, France
- Bioinformatique et Big Data Au Service de La Santé, Université de Franche-Comté, F-25000 Besançon, France
| |
Collapse
|
9
|
Satyam R, Ahmad S, Raza K. Comparative genomic assessment of members of genus Tenacibaculum: an exploratory study. Mol Genet Genomics 2023; 298:979-993. [PMID: 37225902 DOI: 10.1007/s00438-023-02031-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 05/04/2023] [Indexed: 05/26/2023]
Abstract
Tenacibaculosis is an ulcerative skin disorder that affects finfish. It is caused by members of the genus Tenacibaculum, resulting in eccentric behavioural changes, including anorexia, lethargy, and abnormal swimming patterns that often result in mortality. Currently, species suspected of causing fish mortality include T. ovolyticum, T. gallaicum, T. discolor, T. finnmarkense, T. mesophilum, T. soleae, T. dicentrarchi, and T. maritimum. However, pathogenic members and the mechanisms involved in disease causation, progression, and transmission are limited due to the inadequate sequencing efforts in the past decade. In this study, we use a comparative genomics approach to investigate the characteristic features of 26 publicly available genomes of Tenacibaculum and report our observations. We propose the reclassification of "T. litoreum HSC 22" to the singaporense species and assignment of "T. sp. 4G03" to the species discolor (species with quotation marks have not been appropriately named). We also report the co-occurrence of several antimicrobial resistance/virulence genes and genes private to a few members. Finally, we mine several non-B DNA forming regions, operons, tandem repeats, high-confidence putative effector proteins, and sortase that might play a pivotal role in bacterial evolution, transcription, and pathogenesis.
Collapse
Affiliation(s)
- Rohit Satyam
- Computational Intelligence and Bioinformatics Laboratory, Department of Computer Science, Jamia Millia Islamia, New Delhi, 110025, India
| | - Shaban Ahmad
- Computational Intelligence and Bioinformatics Laboratory, Department of Computer Science, Jamia Millia Islamia, New Delhi, 110025, India
| | - Khalid Raza
- Computational Intelligence and Bioinformatics Laboratory, Department of Computer Science, Jamia Millia Islamia, New Delhi, 110025, India.
| |
Collapse
|
10
|
Knijn A, Michelacci V, Gigliucci F, Tozzoli R, Chiani P, Minelli F, Scavia G, Ventola E, Morabito S. IRIDA-ARIES Genomics, a key player in the One Health surveillance of diseases caused by infectious agents in Italy. Front Public Health 2023; 11:1151568. [PMID: 37361153 PMCID: PMC10289303 DOI: 10.3389/fpubh.2023.1151568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 05/12/2023] [Indexed: 06/28/2023] Open
Abstract
Pathogen genomics is transforming surveillance of infectious diseases, deepening our understanding of evolution and diffusion of etiological agents, host-pathogen interactions and antimicrobial resistance. This discipline is playing an important role in the development of One Health Surveillance with public health experts of various disciplines integrating methods applied to pathogen research, monitoring, management and prevention of outbreaks. Especially with the notion that foodborne diseases may not be transmitted by food only, the ARIES Genomics project aimed to deliver an Information System for the collection of genomic and epidemiological data to enable genomics-based surveillance of infectious epidemics, foodborne outbreaks and diseases at the animal-human interface. Keeping in mind that the users of the system comprised persons with expertise in a wide variety of domains, the system was expected to be used with a low learning curve directly by the persons target of the analyses' results, keeping the information exchange chains as short as possible. As a result, the IRIDA-ARIES platform (https://irida.iss.it/) provides an intuitive web-based interface for multisectoral data collection and bioinformatic analyses. In practice, the user creates a sample and uploads the Next-generation sequencing reads, then an analysis pipeline is launched automatically performing a series of typing and clustering operations fueling the information flow. Instances of IRIDA-ARIES host the Italian national surveillance system for infections by Listeria monocytogenes (Lm) and the surveillance system for infections by Shigatoxin-producing Escherichia coli (STEC). As of today, the platform does not provide tools to manage epidemiological investigations but serves as an instrument of aggregation for risk monitoring, capable of triggering alarms on possible critical situations that might go unnoticed otherwise.
Collapse
Affiliation(s)
- Arnold Knijn
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Shelenkov A, Mikhaylova Y, Voskanyan S, Egorova A, Akimkin V. Whole-Genome Sequencing Revealed the Fusion Plasmids Capable of Transmission and Acquisition of Both Antimicrobial Resistance and Hypervirulence Determinants in Multidrug-Resistant Klebsiella pneumoniae Isolates. Microorganisms 2023; 11:1314. [PMID: 37317293 DOI: 10.3390/microorganisms11051314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/11/2023] [Accepted: 05/15/2023] [Indexed: 06/16/2023] Open
Abstract
Klebsiella pneumoniae, a member of the Enterobacteriaceae family, has become a dangerous pathogen accountable for a large fraction of the various infectious diseases in both clinical and community settings. In general, the K. pneumoniae population has been divided into the so-called classical (cKp) and hypervirulent (hvKp) lineages. The former, usually developing in hospitals, can rapidly acquire resistance to a wide spectrum of antimicrobial drugs, while the latter is associated with more aggressive but less resistant infections, mostly in healthy humans. However, a growing number of reports in the last decade have confirmed the convergence of these two distinct lineages into superpathogen clones possessing the properties of both, and thus imposing a significant threat to public health worldwide. This process is associated with horizontal gene transfer, in which plasmid conjugation plays a very important role. Therefore, the investigation of plasmid structures and the ways plasmids spread within and between bacterial species will provide benefits in developing prevention measures against these powerful pathogens. In this work, we investigated clinical multidrug-resistant K. pneumoniae isolates using long- and short-read whole-genome sequencing, which allowed us to reveal fusion IncHI1B/IncFIB plasmids in ST512 isolates capable of simultaneously carrying hypervirulence (iucABCD, iutA, prmpA, peg-344) and resistance determinants (armA, blaNDM-1 and others), and to obtain insights into their formation and transmission mechanisms. Comprehensive phenotypic, genotypic and phylogenetic analysis of the isolates, as well as of their plasmid repertoire, was performed. The data obtained will facilitate epidemiological surveillance of high-risk K. pneumoniae clones and the development of prevention strategies against them.
Collapse
Affiliation(s)
- Andrey Shelenkov
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia
| | - Yulia Mikhaylova
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia
| | - Shushanik Voskanyan
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia
| | - Anna Egorova
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia
| | - Vasiliy Akimkin
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia
| |
Collapse
|
12
|
Long-Read Whole Genome Sequencing Elucidates the Mechanisms of Amikacin Resistance in Multidrug-Resistant Klebsiella pneumoniae Isolates Obtained from COVID-19 Patients. Antibiotics (Basel) 2022; 11:antibiotics11101364. [PMID: 36290022 PMCID: PMC9598329 DOI: 10.3390/antibiotics11101364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 09/29/2022] [Accepted: 10/04/2022] [Indexed: 11/28/2022] Open
Abstract
Klebsiella pneumoniae is a Gram-negative, encapsulated, non-motile bacterium, which represents a global challenge to public health as one of the major causes of healthcare-associated infections worldwide. In the recent decade, the World Health Organization (WHO) noticed a critically increasing rate of carbapenem-resistant K. pneumoniae occurrence in hospitals. The situation with extended-spectrum beta-lactamase (ESBL) producing bacteria further worsened during the COVID-19 pandemic, due to an increasing number of patients in intensive care units (ICU) and extensive, while often inappropriate, use of antibiotics including carbapenems. In order to elucidate the ways and mechanisms of antibiotic resistance spreading within the K. pneumoniae population, whole genome sequencing (WGS) seems to be a promising approach, and long-read sequencing is especially useful for the investigation of mobile genetic elements carrying antibiotic resistance genes, such as plasmids. We have performed short- and long read sequencing of three carbapenem-resistant K. pneumoniae isolates obtained from COVID-19 patients in a dedicated ICU of a multipurpose medical center, which belonged to the same clone according to cgMLST analysis, in order to understand the differences in their resistance profiles. We have revealed the presence of a small plasmid carrying aph(3′)-VIa gene providing resistance to amikacin in one of these isolates, which corresponded perfectly to its phenotypic resistance profile. We believe that the results obtained will facilitate further elucidating of antibiotic resistance mechanisms for this important pathogen, and highlight the need for continuous genomic epidemiology surveillance of clinical K. pneumoniae isolates.
Collapse
|
13
|
Delineating Mycobacterium abscessus population structure and transmission employing high-resolution core genome multilocus sequence typing. Nat Commun 2022; 13:4936. [PMID: 35999208 PMCID: PMC9399081 DOI: 10.1038/s41467-022-32122-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 07/19/2022] [Indexed: 11/08/2022] Open
Abstract
Mycobacterium abscessus is an emerging multidrug-resistant non-tuberculous mycobacterium that causes a wide spectrum of infections and has caused several local outbreaks worldwide. To facilitate standardized prospective molecular surveillance, we established a novel core genome multilocus sequence typing (cgMLST) scheme. Whole genome sequencing data of 1991 isolates were employed to validate the scheme, re-analyze global population structure and set genetic distance thresholds for cluster detection and taxonomic identification. We confirmed and amended the nomenclature of the main dominant circulating clones and found that these also correlate well with traditional 7-loci MLST. Dominant circulating clones could be linked to a corresponding reference genome with less than 250 alleles while 99% of pairwise comparisons between epidemiologically linked isolates were below 25 alleles and 90% below 10 alleles. These thresholds can be used to guide further epidemiological investigations. Overall, the scheme will help to unravel the apparent global spread of certain clonal complexes and as yet undiscovered transmission routes.
Collapse
|
14
|
Hernández-Díaz EA, Vázquez-Garcidueñas MS, Negrete-Paz AM, Vázquez-Marrufo G. Comparative Genomic Analysis Discloses Differential Distribution of Antibiotic Resistance Determinants between Worldwide Strains of the Emergent ST213 Genotype of Salmonella Typhimurium. Antibiotics (Basel) 2022; 11:925. [PMID: 35884180 PMCID: PMC9312005 DOI: 10.3390/antibiotics11070925] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 07/06/2022] [Accepted: 07/07/2022] [Indexed: 12/17/2022] Open
Abstract
Salmonella enterica constitutes a global public health concern as one of the main etiological agents of human gastroenteritis. The Typhimurium serotype is frequently isolated from human, animal, food, and environmental samples, with its sequence type 19 (ST19) being the most widely distributed around the world as well as the founder genotype. The replacement of the ST19 genotype with the ST213 genotype that has multiple antibiotic resistance (MAR) in human and food samples was first observed in Mexico. The number of available genomes of ST213 strains in public databases indicates its fast worldwide dispersion, but its public health relevance is unknown. A comparative genomic analysis conducted as part of this research identified the presence of 44 genes, 34 plasmids, and five point mutations associated with antibiotic resistance, distributed across 220 genomes of ST213 strains, indicating the MAR phenotype. In general, the grouping pattern in correspondence to the presence/absence of genes/plasmids that confer antibiotic resistance cluster the genomes according to the geographical origin where the strain was isolated. Genetic determinants of antibiotic resistance group the genomes of North America (Canada, Mexico, USA) strains, and suggest a dispersion route to reach the United Kingdom and, from there, the rest of Europe, then Asia and Oceania. The results obtained here highlight the worldwide public health relevance of the ST213 genotype, which contains a great diversity of genetic elements associated with MAR.
Collapse
Affiliation(s)
- Elda Araceli Hernández-Díaz
- Centro Multidisciplinario de Estudios en Biotecnología, Facultad de Medicina Veterinaria y Zootecnia, Universidad Michoacana de San Nicolás de Hidalgo, Km 9.5 Carretera Morelia-Zinapécuaro, Col. La Palma Tarímbaro, Morelia 58893, Michoacán, Mexico; (E.A.H.-D.); (A.M.N.-P.)
| | - Ma. Soledad Vázquez-Garcidueñas
- División de Estudios de Posgrado, Facultad de Ciencias Médicas y Biológicas “Dr. Ignacio Chávez”, Universidad Michoacana de San Nicolás de Hidalgo, Ave. Rafael Carrillo esq. Dr. Salvador González Herrejón, Col. Cuauhtémoc, Morelia 58020, Michoacán, Mexico;
| | - Andrea Monserrat Negrete-Paz
- Centro Multidisciplinario de Estudios en Biotecnología, Facultad de Medicina Veterinaria y Zootecnia, Universidad Michoacana de San Nicolás de Hidalgo, Km 9.5 Carretera Morelia-Zinapécuaro, Col. La Palma Tarímbaro, Morelia 58893, Michoacán, Mexico; (E.A.H.-D.); (A.M.N.-P.)
| | - Gerardo Vázquez-Marrufo
- Centro Multidisciplinario de Estudios en Biotecnología, Facultad de Medicina Veterinaria y Zootecnia, Universidad Michoacana de San Nicolás de Hidalgo, Km 9.5 Carretera Morelia-Zinapécuaro, Col. La Palma Tarímbaro, Morelia 58893, Michoacán, Mexico; (E.A.H.-D.); (A.M.N.-P.)
| |
Collapse
|
15
|
Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping. Life (Basel) 2022; 12:life12050670. [PMID: 35629339 PMCID: PMC9147691 DOI: 10.3390/life12050670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 04/24/2022] [Accepted: 04/25/2022] [Indexed: 12/02/2022] Open
Abstract
Epidemiological surveillance of bacterial pathogens requires real-time data analysis with a fast turnaround, while aiming at generating two main outcomes: (1) species-level identification and (2) variant mapping at different levels of genotypic resolution for population-based tracking and surveillance, in addition to predicting traits such as antimicrobial resistance (AMR). Multi-locus sequence typing (MLST) aids this process by identifying sequence types (ST) based on seven ubiquitous genome-scattered loci. In this paper, we selected one assembly-dependent and one assembly-free method for ST mapping and applied them with the default settings and ST schemes they are distributed with, and systematically assessed their accuracy and scalability across a wide array of phylogenetically divergent Public Health-relevant bacterial pathogens with available MLST databases. Our data show that the optimal k-mer length for stringMLST is species-specific and that genome-intrinsic and -extrinsic features can affect the performance and accuracy of the program. Although suitable parameters could be identified for most organisms, there were instances where this program may not be directly deployable in its current format. Next, we integrated stringMLST into our freely available and scalable hierarchical-based population genomics platform, ProkEvo, and further demonstrated how the implementation facilitates automated, reproducible bacterial population analysis.
Collapse
|
16
|
Palma F, Mangone I, Janowicz A, Moura A, Chiaverini A, Torresi M, Garofolo G, Criscuolo A, Brisse S, Di Pasquale A, Cammà C, Radomski N. In vitro and in silico parameters for precise cgMLST typing of Listeria monocytogenes. BMC Genomics 2022; 23:235. [PMID: 35346021 PMCID: PMC8961897 DOI: 10.1186/s12864-022-08437-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 02/28/2022] [Indexed: 02/02/2023] Open
Abstract
Background Whole genome sequencing analyzed by core genome multi-locus sequence typing (cgMLST) is widely used in surveillance of the pathogenic bacteria Listeria monocytogenes. Given the heterogeneity of available bioinformatics tools to define cgMLST alleles, our aim was to identify parameters influencing the precision of cgMLST profiles. Methods We used three L. monocytogenes reference genomes from different phylogenetic lineages and assessed the impact of in vitro (i.e. tested genomes, successive platings, replicates of DNA extraction and sequencing) and in silico parameters (i.e. targeted depth of coverage, depth of coverage, breadth of coverage, assembly metrics, cgMLST workflows, cgMLST completeness) on cgMLST precision made of 1748 core loci. Six cgMLST workflows were tested, comprising assembly-based (BIGSdb, INNUENDO, GENPAT, SeqSphere and BioNumerics) and assembly-free (i.e. kmer-based MentaLiST) allele callers. Principal component analyses and generalized linear models were used to identify the most impactful parameters on cgMLST precision. Results The isolate’s genetic background, cgMLST workflows, cgMLST completeness, as well as depth and breadth of coverage were the parameters that impacted most on cgMLST precision (i.e. identical alleles against reference circular genomes). All workflows performed well at ≥40X of depth of coverage, with high loci detection (> 99.54% for all, except for BioNumerics with 97.78%) and showed consistent cluster definitions using the reference cut-off of ≤7 allele differences. Conclusions This highlights that bioinformatics workflows dedicated to cgMLST allele calling are largely robust when paired-end reads are of high quality and when the sequencing depth is ≥40X. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08437-4.
Collapse
|
17
|
Shelenkov A, Mikhaylova Y, Petrova L, Gaidukova I, Zamyatin M, Akimkin V. Genomic Characterization of Clinical Acinetobacter baumannii Isolates Obtained from COVID-19 Patients in Russia. Antibiotics (Basel) 2022; 11:346. [PMID: 35326809 PMCID: PMC8944674 DOI: 10.3390/antibiotics11030346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 02/22/2022] [Accepted: 03/03/2022] [Indexed: 11/18/2022] Open
Abstract
The coronavirus disease 2019 (COVID-19) pandemic has already affected all realms of public healthcare and, in particular, has led to increasing use of various antibiotics to treat possible bacterial coinfections even in cases for which such infections were not confirmed clinically. This could lead to an increase in the fraction and severity of multidrug-resistant bacterial isolates in healthcare facilities, especially in intensive care units (ICU). However, detailed epidemiological investigations, possibly including whole genome sequencing (WGS), are required to confirm the increase in antibiotic resistance and changes, if any, in the population and clonal structures of bacterial pathogens. In this study, we performed a comprehensive genomic and phenotypic characterization of selected multidrug-resistant A. baumannii isolates obtained from the patients of a dedicated COVID-19 ICU in Moscow, Russia. Hybrid short- and long-read sequencing allowed us to obtain complete profiles of genomic antimicrobial resistance and virulence determinants, as well as to reveal the plasmid structure. We demonstrated the genomic similarity in terms of cgMLST profiles of the isolates studied with a clone previously identified in the same facility. We believe that the data provided will contribute to better understanding the changes imposed by the COVID-19 pandemic on the population structure and the antimicrobial resistance of bacterial pathogens in healthcare facilities.
Collapse
Affiliation(s)
- Andrey Shelenkov
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia; (Y.M.); (V.A.)
| | - Yulia Mikhaylova
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia; (Y.M.); (V.A.)
| | - Lyudmila Petrova
- National Medical and Surgical Center named after N.I. Pirogov, Nizhnyaya Pervomayskaya Str., 70, 105203 Moscow, Russia; (L.P.); (I.G.); (M.Z.)
| | - Irina Gaidukova
- National Medical and Surgical Center named after N.I. Pirogov, Nizhnyaya Pervomayskaya Str., 70, 105203 Moscow, Russia; (L.P.); (I.G.); (M.Z.)
| | - Mikhail Zamyatin
- National Medical and Surgical Center named after N.I. Pirogov, Nizhnyaya Pervomayskaya Str., 70, 105203 Moscow, Russia; (L.P.); (I.G.); (M.Z.)
| | - Vasiliy Akimkin
- Central Research Institute of Epidemiology, Novogireevskaya Str., 3a, 111123 Moscow, Russia; (Y.M.); (V.A.)
| |
Collapse
|
18
|
Ben Khedher M, Ghedira K, Rolain JM, Ruimy R, Croce O. Application and Challenge of 3rd Generation Sequencing for Clinical Bacterial Studies. Int J Mol Sci 2022; 23:1395. [PMID: 35163319 PMCID: PMC8835973 DOI: 10.3390/ijms23031395] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 01/20/2022] [Accepted: 01/24/2022] [Indexed: 02/04/2023] Open
Abstract
Over the past 25 years, the powerful combination of genome sequencing and bioinformatics analysis has played a crucial role in interpreting information encoded in bacterial genomes. High-throughput sequencing technologies have paved the way towards understanding an increasingly wide range of biological questions. This revolution has enabled advances in areas ranging from genome composition to how proteins interact with nucleic acids. This has created unprecedented opportunities through the integration of genomic data into clinics for the diagnosis of genetic traits associated with disease. Since then, these technologies have continued to evolve, and recently, long-read sequencing has overcome previous limitations in terms of accuracy, thus expanding its applications in genomics, transcriptomics and metagenomics. In this review, we describe a brief history of the bacterial genome sequencing revolution and its application in public health and molecular epidemiology. We present a chronology that encompasses the various technological developments: whole-genome shotgun sequencing, high-throughput sequencing, long-read sequencing. We mainly discuss the application of next-generation sequencing to decipher bacterial genomes. Secondly, we highlight how long-read sequencing technologies go beyond the limitations of traditional short-read sequencing. We intend to provide a description of the guiding principles of the 3rd generation sequencing applications and ongoing improvements in the field of microbial medical research.
Collapse
Affiliation(s)
- Mariem Ben Khedher
- Bacteriology Laboratory, Archet 2 Hospital, CHU Nice, 06000 Nice, France
- Institute for Research on Cancer and Aging Nice (IRCAN), CNRS, INSERM, Université Côte d’Azur, 06108 Nice, France
| | - Kais Ghedira
- Laboratory of Bioinformatics, Biomathematics and Biostatistics, Institute Pasteur of Tunis, Tunis 1002, Tunisia;
| | - Jean-Marc Rolain
- IRD, APHM, MEPHI, IHU-Méditerranée Infection, Aix Marseille Université, 13005 Marseille, France;
| | - Raymond Ruimy
- Bacteriology Laboratory, Archet 2 Hospital, CHU Nice, 06000 Nice, France
- Centre Méditerranéen de Médecine Moléculaire (C3M), INSERM, Université Côte D’Azur, 06108 Nice, France
| | - Olivier Croce
- Institute for Research on Cancer and Aging Nice (IRCAN), CNRS, INSERM, Université Côte d’Azur, 06108 Nice, France
| |
Collapse
|
19
|
Labbé G, Kruczkiewicz P, Robertson J, Mabon P, Schonfeld J, Kein D, Rankin MA, Gopez M, Hole D, Son D, Knox N, Laing CR, Bessonov K, Taboada EN, Yoshida C, Ziebell K, Nichani A, Johnson RP, Van Domselaar G, Nash JHE. Rapid and accurate SNP genotyping of clonal bacterial pathogens with BioHansel. Microb Genom 2021; 7. [PMID: 34554082 PMCID: PMC8715432 DOI: 10.1099/mgen.0.000651] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Hierarchical genotyping approaches can provide insights into the source, geography and temporal distribution of bacterial pathogens. Multiple hierarchical SNP genotyping schemes have previously been developed so that new isolates can rapidly be placed within pre-computed population structures, without the need to rebuild phylogenetic trees for the entire dataset. This classification approach has, however, seen limited uptake in routine public health settings due to analytical complexity and the lack of standardized tools that provide clear and easy ways to interpret results. The BioHansel tool was developed to provide an organism-agnostic tool for hierarchical SNP-based genotyping. The tool identifies split k-mers that distinguish predefined lineages in whole genome sequencing (WGS) data using SNP-based genotyping schemes. BioHansel uses the Aho-Corasick algorithm to type isolates from assembled genomes or raw read sequence data in a matter of seconds, with limited computational resources. This makes BioHansel ideal for use by public health agencies that rely on WGS methods for surveillance of bacterial pathogens. Genotyping results are evaluated using a quality assurance module which identifies problematic samples, such as low-quality or contaminated datasets. Using existing hierarchical SNP schemes for Mycobacterium tuberculosis and Salmonella Typhi, we compare the genotyping results obtained with the k-mer-based tools BioHansel and SKA, with those of the organism-specific tools TBProfiler and genotyphi, which use gold-standard reference-mapping approaches. We show that the genotyping results are fully concordant across these different methods, and that the k-mer-based tools are significantly faster. We also test the ability of the BioHansel quality assurance module to detect intra-lineage contamination and demonstrate that it is effective, even in populations with low genetic diversity. We demonstrate the scalability of the tool using a dataset of ~8100 S. Typhi public genomes and provide the aggregated results of geographical distributions as part of the tool’s output. BioHansel is an open source Python 3 application available on PyPI and Conda repositories and as a Galaxy tool from the public Galaxy Toolshed. In a public health context, BioHansel enables rapid and high-resolution classification of bacterial pathogens with low genetic diversity.
Collapse
Affiliation(s)
- Geneviève Labbé
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | | | - James Robertson
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Philip Mabon
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Justin Schonfeld
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Daniel Kein
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Marisa A Rankin
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Matthew Gopez
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Darian Hole
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - David Son
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Natalie Knox
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada.,Department of Medical Microbiology & Infectious Diseases, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Chad R Laing
- National Centres for Animal Disease Lethbridge Laboratory, Canadian Food Inspection Agency, Lethbridge, AB, Canada
| | - Kyrylo Bessonov
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Eduardo N Taboada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Catherine Yoshida
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Kim Ziebell
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Anil Nichani
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Roger P Johnson
- National Microbiology Laboratory, Public Health Agency of Canada, Guelph, Ontario, Canada
| | - Gary Van Domselaar
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada.,National Centres for Animal Disease Lethbridge Laboratory, Canadian Food Inspection Agency, Lethbridge, AB, Canada
| | - John H E Nash
- National Microbiology Laboratory, Public Health Agency of Canada, Toronto, Ontario, Canada
| |
Collapse
|
20
|
Gigliucci F, van Hoek AHAM, Chiani P, Knijn A, Minelli F, Scavia G, Franz E, Morabito S, Michelacci V. Genomic Characterization of hlyF-positive Shiga Toxin-Producing Escherichia coli, Italy and the Netherlands, 2000-2019. Emerg Infect Dis 2021; 27:853-861. [PMID: 33622476 PMCID: PMC7920663 DOI: 10.3201/eid2703.203110] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Shiga toxin–producing Escherichia coli (STEC) O80:H2 has emerged in Europe as a cause of hemolytic uremic syndrome associated with bacteremia. STEC O80:H2 harbors the mosaic plasmid pR444_A, which combines several virulence genes, including hlyF and antimicrobial resistance genes. pR444_A is found in some extraintestinal pathogenic E. coli (ExPEC) strains. We identified and characterized 53 STEC strains with ExPEC-associated virulence genes isolated in Italy and the Netherlands during 2000–2019. The isolates belong to 2 major populations: 1 belongs to sequence type 301 and harbors diverse stx2 subtypes, the intimin variant eae-ξ, and pO157-like and pR444_A plasmids; 1 consists of strains belonging to various sequence types, some of which lack the pO157 plasmid, the locus of enterocyte effacement, and the antimicrobial resistance–encoding region. Our results showed that STEC strains harboring ExPEC-associated virulence genes can include multiple serotypes and that the pR444_A plasmid can be acquired and mobilized by STEC strains.
Collapse
|
21
|
Gabbassov E, Moreno-Molina M, Comas I, Libbrecht M, Chindelevitch L. SplitStrains, a tool to identify and separate mixed Mycobacterium tuberculosis infections from WGS data. Microb Genom 2021; 7. [PMID: 34165419 PMCID: PMC8461467 DOI: 10.1099/mgen.0.000607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
The occurrence of multiple strains of a bacterial pathogen such as M. tuberculosis or C. difficile within a single human host, referred to as a mixed infection, has important implications for both healthcare and public health. However, methods for detecting it, and especially determining the proportion and identities of the underlying strains, from WGS (whole-genome sequencing) data, have been limited. In this paper we introduce SplitStrains, a novel method for addressing these challenges. Grounded in a rigorous statistical model, SplitStrains not only demonstrates superior performance in proportion estimation to other existing methods on both simulated as well as real M. tuberculosis data, but also successfully determines the identity of the underlying strains. We conclude that SplitStrains is a powerful addition to the existing toolkit of analytical methods for data coming from bacterial pathogens and holds the promise of enabling previously inaccessible conclusions to be drawn in the realm of public health microbiology.
Collapse
Affiliation(s)
- Einar Gabbassov
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
- *Correspondence: Einar Gabbassov,
| | | | - Iñaki Comas
- Instituto de Biomedicina de Valencia, Valencia, Spain
| | - Maxwell Libbrecht
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | - Leonid Chindelevitch
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College, London, UK
- *Correspondence: Leonid Chindelevitch,
| |
Collapse
|
22
|
Deneke C, Uelze L, Brendebach H, Tausch SH, Malorny B. Decentralized Investigation of Bacterial Outbreaks Based on Hashed cgMLST. Front Microbiol 2021; 12:649517. [PMID: 34220740 PMCID: PMC8244591 DOI: 10.3389/fmicb.2021.649517] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Accepted: 03/25/2021] [Indexed: 02/05/2023] Open
Abstract
Whole-genome sequencing (WGS)-based outbreak investigation has proven to be a valuable method for the surveillance of bacterial pathogens. Its utility has been successfully demonstrated using both gene-by-gene (cgMLST or wgMLST) and single-nucleotide polymorphism (SNP)-based approaches. Among the obstacles of implementing a WGS-based routine surveillance is the need for an exchange of large volumes of sequencing data, as well as a widespread reluctance to share sequence and metadata in public repositories, together with a lacking standardization of suitable bioinformatic tools and workflows. To address these issues, we present chewieSnake, an intuitive and simple-to-use cgMLST workflow. ChewieSnake builds on the allele calling software chewBBACA and extends it by the concept of allele hashing. The resulting hashed allele profiles can be readily compared between laboratories without the need of a central allele nomenclature. The workflow fully automates the computation of the allele distance matrix, cluster membership, and phylogeny and summarizes all important findings in an interactive HTML report. Furthermore, chewieSnake can join allele profiles generated at different laboratories and identify shared clusters, including a stable and intercommunicable cluster nomenclature, thus facilitating a joint outbreak investigation. We demonstrate the feasibility of the proposed approach with a thorough method comparison using publically available sequencing data for Salmonella enterica. However, chewieSnake is readily applicable to all bacterial taxa, provided that a suitable cgMLST scheme is available. The workflow is freely available as an open-source tool and can be easily installed via conda or docker.
Collapse
Affiliation(s)
- Carlus Deneke
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Laura Uelze
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Holger Brendebach
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Simon H Tausch
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Burkhard Malorny
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| |
Collapse
|
23
|
Closed Genome and Plasmid Sequences of Legionella pneumophila AW-13-4, Isolated from a Hot Water Loop System of a Large Occupational Building. Microbiol Resour Announc 2021; 10:10/1/e01276-20. [PMID: 33414354 PMCID: PMC8407730 DOI: 10.1128/mra.01276-20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Unused water in unoccupied buildings can become stagnant, with reductions in temperature and levels of disinfectant resulting in increased microbial growth. We report the closed and complete genome and plasmid of Legionella pneumophila strain AW-13-4 (serogroup 1), which was isolated from a hot water loop system of a large building.
Collapse
|
24
|
Shaidullina E, Shelenkov A, Yanushevich Y, Mikhaylova Y, Shagin D, Alexandrova I, Ershova O, Akimkin V, Kozlov R, Edelstein M. Antimicrobial Resistance and Genomic Characterization of OXA-48- and CTX-M-15-Co-Producing Hypervirulent Klebsiella pneumoniae ST23 Recovered from Nosocomial Outbreak. Antibiotics (Basel) 2020; 9:antibiotics9120862. [PMID: 33287207 PMCID: PMC7761672 DOI: 10.3390/antibiotics9120862] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 11/27/2020] [Accepted: 12/01/2020] [Indexed: 12/19/2022] Open
Abstract
Multidrug resistance (MDR) and hypervirulence (hv) have been long considered distinct evolutionary traits for Klebsiella pneumoniae (Kp), a versatile human pathogen. The recent emergence of Kp strains combining these traits poses a serious global threat. In this article, we describe the phenotypic and genomic characteristics of an MDR hvKp isolate, MAR14-456, representative of a nosocomial outbreak in Moscow, Russia, that was recovered from a postoperative wound in a patient who later developed multiple abscesses, fatal sepsis, and septic shock. Broth microdilution testing revealed decreased susceptibility of MAR14-456 to carbapenems (MICs 0.5–2 mg/L) and a high-level resistance to most β-lactams, β-lactam-β-lactamase-inhibitor combinations, and non-β-lactam antibiotics, except ceftazidime-avibactam, amikacin, tigecycline, and colistin. Whole-genome sequencing using Illumina MiSeq and ONT MinION systems allowed to identify and completely assemble two conjugative resistance plasmids, a typical ‘European’ epidemic IncL/M plasmid that carries the gene of OXA-48 carbapenemase, and an IncFIIK plasmid that carries the gene of CTX-M-15 ESBL and other resistance genes. MLST profile, capsular, lipopolysaccharide, virulence genes encoded on chromosome and IncHI1B/FIB plasmid, and the presence of apparently functional type I-E* CRISPR-Cas system were all characteristic of hvKp ST23, serotype K1-O1v2. Phylogenetic analysis showed the closest relatedness of MAR14-456 to ST23 isolates from China. This report highlights the threat of multiple resistance acquisition by hvKp strain and its spread as a nosocomial pathogen.
Collapse
Affiliation(s)
- Elvira Shaidullina
- Institute of Antimicrobial Chemotherapy, Smolensk State Medical University, 214019 Smolensk, Russia; (E.S.); (R.K.); (M.E.)
- Institute of Fundamental Medicine and Biology, Kazan Federal University, 420012 Kazan, Russia
| | - Andrey Shelenkov
- Central Research Institute of Epidemiology, Rospotrebnadzor, 111123 Moscow, Russia; (Y.Y.); (Y.M.); (D.S.); (V.A.)
- Correspondence:
| | - Yuri Yanushevich
- Central Research Institute of Epidemiology, Rospotrebnadzor, 111123 Moscow, Russia; (Y.Y.); (Y.M.); (D.S.); (V.A.)
| | - Yulia Mikhaylova
- Central Research Institute of Epidemiology, Rospotrebnadzor, 111123 Moscow, Russia; (Y.Y.); (Y.M.); (D.S.); (V.A.)
| | - Dmitriy Shagin
- Central Research Institute of Epidemiology, Rospotrebnadzor, 111123 Moscow, Russia; (Y.Y.); (Y.M.); (D.S.); (V.A.)
- Pirogov Russian National Research Medical University, 117997 Moscow, Russia
| | - Irina Alexandrova
- N.N. Burdenko National Scientific and Practical Center for Neurosurgery, 125047 Moscow, Russia; (I.A.); (O.E.)
| | - Olga Ershova
- N.N. Burdenko National Scientific and Practical Center for Neurosurgery, 125047 Moscow, Russia; (I.A.); (O.E.)
| | - Vasiliy Akimkin
- Central Research Institute of Epidemiology, Rospotrebnadzor, 111123 Moscow, Russia; (Y.Y.); (Y.M.); (D.S.); (V.A.)
| | - Roman Kozlov
- Institute of Antimicrobial Chemotherapy, Smolensk State Medical University, 214019 Smolensk, Russia; (E.S.); (R.K.); (M.E.)
| | - Mikhail Edelstein
- Institute of Antimicrobial Chemotherapy, Smolensk State Medical University, 214019 Smolensk, Russia; (E.S.); (R.K.); (M.E.)
| |
Collapse
|
25
|
Huang J, Zhang S, Zhang S, Zhao Z, Cao Y, Chen M, Li B. A Comparative Study of Fluoroquinolone-Resistant Escherichia coli Lineages Portrays Indistinguishable Pathogenicity- and Survivability-Associated Phenotypic Characteristics Between ST1193 and ST131. Infect Drug Resist 2020; 13:4167-4175. [PMID: 33244246 PMCID: PMC7685377 DOI: 10.2147/idr.s277681] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 10/19/2020] [Indexed: 12/12/2022] Open
Abstract
Background Sequence type 1193 is a new such lineage among fluoroquinolone-resistant Escherichia coli, which has risen dramatically within the last several years. However, reasons for rapid emergence and successful spread of E. coli ST1193 remain unclear. The aim of this study was to compare the pathogenicity and survivability features of E. coli ST1193 with global epidemic lineage, ST131. Methods A total of 30 E. coli were used in this study. Isolates were divided into two groups, ST1193 (n=15) and ST131 (n=15). Adhesion and invasion to T24 cells and resistance to serum were quantified and compared among two groups. Biofilm formation capacity was assessed by crystal violet assay. Macrocolony formation was assessed on macrocolony formation plates. Resistance to hydrogen peroxide was performed by broth microdilution. RAW264.7 cells were used to assess the anti-phagocytic function of different isolates. Results Adhesion and invasion assays revealed that E. coli ST1193 could adhere and invade T24 cells (p <0.05). 93.3% of E. coli ST1193 could form biofilms. The majority of E. coli ST1193 (66.7%) possessed no curli/no cellulose on macrocolony formation plates. E. coli ST1193 showed significant growth in serum and hydrogen peroxide and illustrated higher anti-phagocytic function to RAW264.7 cells (p <0.05). Group analysis showed that E. coli ST1193 was similar to ST131 in pathogenicity- and survivability-associated phenotypic characteristics (p >0.05). Conclusion Our study provided more insights into pathogenicity and survivability features of E. coli ST1193, which was similar to ST131. Our study could be of great importance in understanding the emergence of global spread E. coli ST1193. Strategic and continued surveillance should be carried out to prevent the infections caused by E. coli ST1193.
Collapse
Affiliation(s)
- Jiangqing Huang
- Department of Clinical Laboratory, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, People's Republic of China
| | - Shengcen Zhang
- Department of Clinical Laboratory, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, People's Republic of China
| | - Shuyu Zhang
- Department of Laboratory Medicine, Fujian Medical University, Fuzhou, Fujian 350001, People's Republic of China
| | - Zhichang Zhao
- Department of Pharmacy, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, People's Republic of China
| | - Yingping Cao
- Department of Clinical Laboratory, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, People's Republic of China
| | - Min Chen
- Department of Laboratory Medicine, Fujian Medical University, Fuzhou, Fujian 350001, People's Republic of China
| | - Bin Li
- Department of Clinical Laboratory, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, People's Republic of China
| |
Collapse
|
26
|
Espitia-Navarro HF, Chande AT, Nagar SD, Smith H, Jordan IK, Rishishwar L. STing: accurate and ultrafast genomic profiling with exact sequence matches. Nucleic Acids Res 2020; 48:7681-7689. [PMID: 32619234 PMCID: PMC7430640 DOI: 10.1093/nar/gkaa566] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Revised: 06/16/2020] [Accepted: 07/01/2020] [Indexed: 11/30/2022] Open
Abstract
Genome-enabled approaches to molecular epidemiology have become essential to public health agencies and the microbial research community. We developed the algorithm STing to provide turn-key solutions for molecular typing and gene detection directly from next generation sequence data of microbial pathogens. Our implementation of STing uses an innovative k-mer search strategy that eliminates the computational overhead associated with the time-consuming steps of quality control, assembly, and alignment, required by more traditional methods. We compared STing to six of the most widely used programs for genome-based molecular typing and demonstrate its ease of use, accuracy, speed and efficiency. STing shows superior accuracy and performance for standard multilocus sequence typing schemes, along with larger genome-scale typing schemes, and it enables rapid automated detection of antimicrobial resistance and virulence factor genes. STing determines the sequence type of traditional 7-gene MLST with 100% accuracy in less than 10 seconds per isolate. We hope that the adoption of STing will help to democratize microbial genomics and thereby maximize its benefit for public health.
Collapse
Affiliation(s)
- Hector F Espitia-Navarro
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca 760043, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA 30332, USA
| | - Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca 760043, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA 30332, USA
| | - Shashwat D Nagar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca 760043, Colombia
| | - Heather Smith
- School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, USA.,Department of Mathematics and Computer Science, Davidson College, Davidson, NC 28035, USA
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca 760043, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA 30332, USA
| | - Lavanya Rishishwar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca 760043, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA 30332, USA
| |
Collapse
|
27
|
Lepuschitz S, Weinmaier T, Mrazek K, Beisken S, Weinberger J, Posch AE. Analytical Performance Validation of Next-Generation Sequencing Based Clinical Microbiology Assays Using a K-mer Analysis Workflow. Front Microbiol 2020; 11:1883. [PMID: 32849463 PMCID: PMC7422695 DOI: 10.3389/fmicb.2020.01883] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 07/17/2020] [Indexed: 12/13/2022] Open
Abstract
Next-generation sequencing (NGS) enables clinical microbiology assays such as molecular typing of bacterial isolates which is now routinely applied for infection control and epidemiology. Additionally, feasibility for NGS-based identification of antimicrobial resistance (AMR) markers as well as genetic prediction of antibiotic susceptibility testing results has been demonstrated. Various bioinformatics approaches enabling NGS-based clinical microbiology assays exist, but standardized, computationally efficient and scalable sample-to-results workflows including validated quality control parameters are still lacking. Bioinformatics analysis workflows based on k-mers have been shown to allow for fast and efficient analysis of large genomics data sets as obtained from microbial sequencing applications. We here demonstrate applicability of k-mer based clinical microbiology assays for whole-genome sequencing (WGS) including variant calling, taxonomic identification, bacterial typing as well as AMR marker detection. The wet-lab and dry-lab workflows were developed and validated in line with Clinical Laboratory Improvement Act (CLIA) guidelines for laboratory-developed tests (LDTs) on multi-drug resistant ESKAPE pathogens. The developed k-mer based workflow demonstrated ≥99.39% repeatability, ≥99.09% reproducibility and ≥99.76% accuracy for variant calling and applied assays as determined by intra-day and inter-day triplicate measurements. The limit of detection (LOD) across assays was found to be at 20× sequencing depth and 15× for AMR marker detection. Thorough benchmarking of the k-mer based workflow revealed analytical performance criteria are comparable to state-of-the-art alignment based workflows across clinical microbiology assays. Diagnostic sensitivity and specificity for multilocus sequence typing (MLST) and phylogenetic analysis were 100% for both approaches. For AMR marker detection, sensitivity and specificity were 95.29 and 99.78% for the k-mer based workflow as compared to 95.17 and 99.77% for the alignment-based approach. Summarizing, results illustrate that k-mer based analysis workflows enable a broad range of clinical microbiology assays, potentially not only for WGS-based typing and AMR gene detection but also genetic prediction of antibiotic susceptibility testing results.
Collapse
|
28
|
Zuppi M, Tozzoli R, Chiani P, Quiros P, Martinez-Velazquez A, Michelacci V, Muniesa M, Morabito S. Investigation on the Evolution of Shiga Toxin-Converting Phages Based on Whole Genome Sequencing. Front Microbiol 2020; 11:1472. [PMID: 32754128 PMCID: PMC7366253 DOI: 10.3389/fmicb.2020.01472] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 06/05/2020] [Indexed: 12/25/2022] Open
Abstract
Bacteriophages are pivotal elements in the dissemination of virulence genes. The main virulence determinants of Shiga Toxin producing E. coli, Shiga Toxins (Stx), are encoded by genes localized in the genome of lambdoid bacteriophages. Stx comprise two antigenically different types, Stx1 and Stx2, further divided into subtypes. Among these, certain Stx2 subtypes appear to be more commonly occurring in the most severe forms of the STEC disease, haemorrhagic colitis and haemolytic uremic syndrome (HUS). This study aimed at obtaining insights on the evolution of Stx2 bacteriophages, due to their relevance in public health, and we report here on the analysis of the genomic structure of Stx2 converting phages in relation with the known reservoir of the E. coli strains harboring them. Stx2-converting phages conveying the genes encoding different stx2 subtypes have been isolated from STEC strains and their whole genomes have been sequenced, analyzed and compared to those of other Stx2 phages available in the public domain. The phages' regions containing the stx2 genes have been analyzed in depth allowing to make inference on the possible mechanisms of selection and maintenance of certain Stx2 phages in the reservoir. The "stx regions" of different stx2 gene subtypes grouped into three different evolutionary lines in the comparative analysis, reflecting the frequency with which these subtypes are found in different animal niches, suggesting that the colonization of specific reservoir by STEC strains could be influenced by the Stx phage that they carry. Noteworthy, we could identify the presence of nanS-p gene exclusively in the "stx regions" of the phages identified in STEC strains commonly found in cattle. As a matter of fact, this gene encodes an esterase capable of metabolizing sialic acids produced by submaxillary glands of bovines and present in great quantities in their gastrointestinal tract.
Collapse
Affiliation(s)
- Michele Zuppi
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | - Rosangela Tozzoli
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | - Paola Chiani
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | - Pablo Quiros
- Department of Genetics, Microbiology and Statistics, University of Barcelona, Barcelona, Spain
| | - Adan Martinez-Velazquez
- Department of Genetics, Microbiology and Statistics, University of Barcelona, Barcelona, Spain
| | - Valeria Michelacci
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | - Maite Muniesa
- Department of Genetics, Microbiology and Statistics, University of Barcelona, Barcelona, Spain
| | - Stefano Morabito
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| |
Collapse
|
29
|
Waker E, Ambrozkiewicz F, Kulecka M, Paziewska A, Skubisz K, Cybula P, Targoński Ł, Mikula M, Walewski J, Ostrowski J. High Prevalence of Genetically Related Clostridium Difficile Strains at a Single Hemato-Oncology Ward Over 10 Years. Front Microbiol 2020; 11:1618. [PMID: 32793147 PMCID: PMC7384382 DOI: 10.3389/fmicb.2020.01618] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Accepted: 06/22/2020] [Indexed: 12/19/2022] Open
Abstract
Aims: Clostridium difficile (C. difficile) infection (CDI) is the main cause of healthcare-associated infectious diarrhea. We used whole-genome sequencing (WGS) to measure the prevalence and genetic variability of C. difficile at a single hemato-oncology ward over a 10 year period. Methods: Between 2008 and 2018, 2077 stool samples were obtained from diarrheal patients hospitalized at the Department of Lymphoma; of these, 618 were positive for toxin A/B. 140 isolates were then subjected to WGS on Ion Torrent PGM sequencer. Results: 36 and 104 isolates were recovered from 36 to 46 patients with single and multiple CDIs, respectively. Of these, 131 strains were toxigenic. Toxin gene profiles tcdA(+);tcdB(+);cdtA/cdtB(+) and tcdA(+);tcdB(+);cdtA/cdtB(-) were identified in 122 and nine strains, respectively. No isolates showed reduced susceptibility to metronidazole and vancomycin. All tested strains were resistant to ciprofloxacin, and 72.9, 42.9, and 72.9% of strains were resistant to erythromycin, clindamycin, or moxifloxacin, respectively. Multi-locus sequence typing (MLST) identified 23 distinct sequence types (STs) and two unidentified strains. Strains ST1 and ST42 represented 31 and 30.1% of all strains tested, respectively. However, while ST1 was detected across nearly all years studied, ST42 was detected only from 2009 to 2011. Conclusion: The high proportion of infected patients in 2008-2011 may be explained by the predominance of more transmissible and virulent C. difficile strains. Although this retrospective study was not designed to define outbreaks of C. difficile, the finding that most isolates exhibited high levels of genetic relatedness suggests nosocomial acquisition.
Collapse
Affiliation(s)
- Edyta Waker
- Department of Clinical Microbiology, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
| | - Filip Ambrozkiewicz
- Department of Genetics, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
| | - Maria Kulecka
- Department of Genetics, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
- Department of Gastroenterology, Hepatology and Clinical Oncology, Centre for Postgraduate Medical Education, Warsaw, Poland
| | - Agnieszka Paziewska
- Department of Genetics, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
- Department of Gastroenterology, Hepatology and Clinical Oncology, Centre for Postgraduate Medical Education, Warsaw, Poland
| | - Karolina Skubisz
- Department of Gastroenterology, Hepatology and Clinical Oncology, Centre for Postgraduate Medical Education, Warsaw, Poland
| | - Patrycja Cybula
- Department of Gastroenterology, Hepatology and Clinical Oncology, Centre for Postgraduate Medical Education, Warsaw, Poland
| | - Łukasz Targoński
- Department of Lymphoproliferative Diseases, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
| | - Michał Mikula
- Department of Genetics, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
| | - Jan Walewski
- Department of Lymphoproliferative Diseases, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
| | - Jerzy Ostrowski
- Department of Genetics, Maria Skłodowska-Curie National Research Institute of Oncology, Warsaw, Poland
- Department of Gastroenterology, Hepatology and Clinical Oncology, Centre for Postgraduate Medical Education, Warsaw, Poland
- *Correspondence: Jerzy Ostrowski,
| |
Collapse
|
30
|
Martín-Vide C, Vega-Rodríguez MA, Wheeler T. PathOGiST: A Novel Method for Clustering Pathogen Isolates by Combining Multiple Genotyping Signals. ALGORITHMS FOR COMPUTATIONAL BIOLOGY 2020. [PMCID: PMC7197062 DOI: 10.1007/978-3-030-42266-0_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In this paper we study the problem of clustering bacterial isolates into epidemiologically related groups from next-generation sequencing data. Existing methods for this problem mainly use a single genotyping signal, and either use a distance-based method with a pre-specified number of clusters, or a phylogenetic tree-based method with a pre-specified threshold. We propose PathOGiST, an algorithmic framework for clustering bacterial isolates by leveraging multiple genotypic signals and calibrated thresholds. PathOGiST uses different genotypic signals, clusters the isolates based on these individual signals with correlation clustering, and combines the clusterings based on the individual signals through consensus clustering. We implemented and tested PathOGiST on three different bacterial pathogens - Escherichia coli, Yersinia pseudotuberculosis, and Mycobacterium tuberculosis - and we conclude by discussing further avenues to explore.
Collapse
|
31
|
Seth-Smith HMB, Bonfiglio F, Cuénod A, Reist J, Egli A, Wüthrich D. Evaluation of Rapid Library Preparation Protocols for Whole Genome Sequencing Based Outbreak Investigation. Front Public Health 2019; 7:241. [PMID: 31508405 PMCID: PMC6719548 DOI: 10.3389/fpubh.2019.00241] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Accepted: 08/12/2019] [Indexed: 12/18/2022] Open
Abstract
Whole genome sequencing (WGS) has become the new gold standard for bacterial outbreak investigation, due to the high resolution available for typing. While sequencing is currently predominantly performed on Illumina devices, the preceding library preparation can be performed using various protocols. Enzymatic fragmentation library preparation protocols are fast, have minimal hands-on time, and work with small quantities of DNA. The aim of our study was to compare three library preparation protocols for molecular typing: Nextera XT (Illumina); Nextera Flex (Illumina); and QIAseq FX (Qiagen). We selected 12 ATCC strains from human Gram-positive and Gram-negative pathogens with %G+C-content ranging from 27% (Fusobacterium nucleatum) to 73% (Micrococcus luteus), each having a high quality complete genome assembly available, to allow in-depth analysis of the resulting Illumina sequence data quality. Additionally, we selected isolates from previously analyzed cases of vancomycin-resistant Enterococcus faecium (VRE) (n = 7) and a local outbreak of Klebsiella aerogenes (n = 5). The number of protocol steps and time required were compared, in order to test the suitability for routine laboratory work. Data analyses were performed with standard tools commonly used in outbreak situations: Ridom SeqSphere+ for cgMLST; CLC genomics workbench for SNP analysis; and open source programs. Nextera Flex and QIAseq FX were found to be less sensitive than Nextera XT to variable %G+C-content, resulting in an almost uniform distribution of read-depth. Therefore, low coverage regions are reduced to a minimum resulting in a more complete representation of the genome. Thus, with these two protocols, more alleles were detected in the cgMLST analysis, producing a higher resolution of closely related isolates. Furthermore, they result in a more complete representation of accessory genes. In particular, the high data quality and relative simplicity of the workflow of Nextera Flex stood out in this comparison. This thorough comparison within an ISO/IEC 17025 accredited environment will be of interest to those aiming to optimize their clinical microbiological genome sequencing.
Collapse
Affiliation(s)
- Helena M B Seth-Smith
- Division of Clinical Bacteriology and Mycology, University Hospital Basel, Basel, Switzerland.,Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland.,DBM Bioinformatics Core Facility, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Ferdinando Bonfiglio
- Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland.,Personalized Health Basel, University of Basel, Basel, Switzerland
| | - Aline Cuénod
- Division of Clinical Bacteriology and Mycology, University Hospital Basel, Basel, Switzerland.,Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland
| | - Josiane Reist
- Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland
| | - Adrian Egli
- Division of Clinical Bacteriology and Mycology, University Hospital Basel, Basel, Switzerland.,Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland
| | - Daniel Wüthrich
- Division of Clinical Bacteriology and Mycology, University Hospital Basel, Basel, Switzerland.,Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland.,DBM Bioinformatics Core Facility, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
32
|
Draft Genome Sequences of Seven Legionella pneumophila Isolates from a Hot Water System of a Large Building. Microbiol Resour Announc 2019; 8:8/18/e00384-19. [PMID: 31048385 PMCID: PMC6498240 DOI: 10.1128/mra.00384-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Public health data show that a significant fraction of the nation's waterborne disease outbreaks are attributable to premise plumbing. We report the draft genome sequences of seven Legionella pneumophila serogroup 1 isolates from hot water lines of a large building. Genomic analysis identified the isolates as belonging to sequence type 1.
Collapse
|
33
|
Su M, Satola SW, Read TD. Genome-Based Prediction of Bacterial Antibiotic Resistance. J Clin Microbiol 2019; 57:e01405-18. [PMID: 30381421 PMCID: PMC6425178 DOI: 10.1128/jcm.01405-18] [Citation(s) in RCA: 209] [Impact Index Per Article: 34.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 10/23/2018] [Indexed: 01/02/2023] Open
Abstract
Clinical microbiology has long relied on growing bacteria in culture to determine antimicrobial susceptibility profiles, but the use of whole-genome sequencing for antibiotic susceptibility testing (WGS-AST) is now a powerful alternative. This review discusses the technologies that made this possible and presents results from recent studies to predict resistance based on genome sequences. We examine differences between calling antibiotic resistance profiles by the simple presence or absence of previously known genes and single-nucleotide polymorphisms (SNPs) against approaches that deploy machine learning and statistical models. Often, the limitations to genome-based prediction arise from limitations of accuracy of culture-based AST in addition to an incomplete knowledge of the genetic basis of resistance. However, we need to maintain phenotypic testing even as genome-based prediction becomes more widespread to ensure that the results do not diverge over time. We argue that standardization of WGS-AST by challenge with consistently phenotyped strain sets of defined genetic diversity is necessary to compare the efficacy of methods of prediction of antibiotic resistance based on genome sequences.
Collapse
Affiliation(s)
- Michelle Su
- Department of Infectious Diseases, Emory University, Atlanta, Georgia, USA
- Antimicrobial Resistance and Therapeutic Discovery Training Program, Emory University, Atlanta, Georgia, USA
- Antibiotic Resistance Center, Emory University, Atlanta, Georgia, USA
| | - Sarah W Satola
- Department of Infectious Diseases, Emory University, Atlanta, Georgia, USA
- Antibiotic Resistance Center, Emory University, Atlanta, Georgia, USA
- Emory Investigational Clinical Microbiology Laboratory, Emory University, Atlanta, Georgia, USA
| | - Timothy D Read
- Department of Infectious Diseases, Emory University, Atlanta, Georgia, USA
- Antibiotic Resistance Center, Emory University, Atlanta, Georgia, USA
- Emory Investigational Clinical Microbiology Laboratory, Emory University, Atlanta, Georgia, USA
| |
Collapse
|
34
|
Page AJ, Keane JA. Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus. PeerJ 2018; 6:e5233. [PMID: 30083440 PMCID: PMC6074768 DOI: 10.7717/peerj.5233] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 06/25/2018] [Indexed: 02/05/2023] Open
Abstract
Genome sequencing is rapidly being adopted in reference labs and hospitals for bacterial outbreak investigation and diagnostics where time is critical. Seven gene multi-locus sequence typing is a standard tool for broadly classifying samples into sequence types (STs), allowing, in many cases, to rule a sample out of an outbreak, or allowing for general characteristics about a bacterial strain to be inferred. Long-read sequencing technologies, such as from Oxford Nanopore, can produce read data within minutes of an experiment starting, unlike short-read sequencing technologies which require many hours/days. However, the error rates of raw uncorrected long read data are very high. We present Krocus which can predict a ST directly from uncorrected long reads, and which was designed to consume read data as it is produced, providing results in minutes. It is the only tool which can do this from uncorrected long reads. We tested Krocus on over 700 isolates sequenced using long-read sequencing technologies from Pacific Biosciences and Oxford Nanopore. It provides STs for isolates on average within 90 s, with a sensitivity of 94% and specificity of 97% on real sample data, directly from uncorrected raw sequence reads. The software is written in Python and is available under the open source license GNU GPL version 3.
Collapse
Affiliation(s)
- Andrew J Page
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK.,Pathogen Informatics, Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Jacqueline A Keane
- Pathogen Informatics, Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK
| |
Collapse
|
35
|
Petit RA, Read TD. Staphylococcus aureus viewed from the perspective of 40,000+ genomes. PeerJ 2018; 6:e5261. [PMID: 30013858 PMCID: PMC6046195 DOI: 10.7717/peerj.5261] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 06/28/2018] [Indexed: 12/31/2022] Open
Abstract
Low-cost Illumina sequencing of clinically-important bacterial pathogens has generated thousands of publicly available genomic datasets. Analyzing these genomes and extracting relevant information for each pathogen and the associated clinical phenotypes requires not only resources and bioinformatic skills but organism-specific knowledge. In light of these issues, we created Staphopia, an analysis pipeline, database and application programming interface, focused on Staphylococcus aureus, a common colonizer of humans and a major antibiotic-resistant pathogen responsible for a wide spectrum of hospital and community-associated infections. Written in Python, Staphopia's analysis pipeline consists of submodules running open-source tools. It accepts raw FASTQ reads as an input, which undergo quality control filtration, error correction and reduction to a maximum of approximately 100× chromosome coverage. This reduction significantly reduces total runtime without detrimentally affecting the results. The pipeline performs de novo assembly-based and mapping-based analysis. Automated gene calling and annotation is performed on the assembled contigs. Read-mapping is used to call variants (single nucleotide polymorphisms and insertion/deletions) against a reference S. aureus chromosome (N315, ST5). We ran the analysis pipeline on more than 43,000 S. aureus shotgun Illumina genome projects in the public European Nucleotide Archive database in November 2017. We found that only a quarter of known multi-locus sequence types (STs) were represented but the top 10 STs made up 70% of all genomes. methicillin-resistant S. aureus (MRSA) were 64% of all genomes. Using the Staphopia database we selected 380 high quality genomes deposited with good metadata, each from a different multi-locus ST, as a non-redundant diversity set for studying S. aureus evolution. In addition to answering basic science questions, Staphopia could serve as a potential platform for rapid clinical diagnostics of S. aureus isolates in the future. The system could also be adapted as a template for other organism-specific databases.
Collapse
Affiliation(s)
- Robert A. Petit
- Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Atlanta, GA, USA
| | - Timothy D. Read
- Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Atlanta, GA, USA
| |
Collapse
|
36
|
Silva M, Machado MP, Silva DN, Rossi M, Moran-Gilad J, Santos S, Ramirez M, Carriço JA. chewBBACA: A complete suite for gene-by-gene schema creation and strain identification. Microb Genom 2018. [PMID: 29543149 PMCID: PMC5885018 DOI: 10.1099/mgen.0.000166] [Citation(s) in RCA: 205] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Gene-by-gene approaches are becoming increasingly popular in bacterial genomic epidemiology and outbreak detection. However, there is a lack of open-source scalable software for schema definition and allele calling for these methodologies. The chewBBACA suite was designed to assist users in the creation and evaluation of novel whole-genome or core-genome gene-by-gene typing schemas and subsequent allele calling in bacterial strains of interest. chewBBACA performs the schema creation and allele calls on complete or draft genomes resulting from de novo assemblers. The chewBBACA software uses Python 3.4 or higher and can run on a laptop or in high performance clusters making it useful for both small laboratories and large reference centers. ChewBBACA is available at https://github.com/B-UMMI/chewBBACA.
Collapse
Affiliation(s)
- Mickael Silva
- 1Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Miguel P Machado
- 1Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Diogo N Silva
- 1Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Mirko Rossi
- 2Department of Food Hygiene and Environmental Health, Faculty of Veterinary Medicine, University of Helsinki, Helsinki, Finland
| | - Jacob Moran-Gilad
- 3School of Public Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.,4Public Health Services, Ministry of Health, Jerusalem, Israel
| | - Sergio Santos
- 1Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Mario Ramirez
- 1Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - João André Carriço
- 1Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| |
Collapse
|