1
|
Jones G, Nodari CS, Fabre L, de Valk H, Noel H, Cointe A, Bonacorsi S, Weill FX, Le Strat Y. Lessons from 5 Years of Routine Whole-Genome Sequencing for Epidemiologic Surveillance of Shiga Toxin-Producing Escherichia coli, France, 2018-2022. Emerg Infect Dis 2025; 31:117-128. [PMID: 40359096 PMCID: PMC12078545 DOI: 10.3201/eid3113.241950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2025] Open
Abstract
Whole-genome sequencing (WGS) is routine for surveillance of Shiga toxin-producing Escherichia coli human isolates in France. Protocols use EnteroBase hierarchical clustering at <5 allelic differences (HC5) as screening for cluster detection. We assessed current implementation after 5 years for 1,002 sequenced isolates. From genomic distances of serotypes O26:H11, O157:H7, O80:H2, and O103:H2, we determined statistical thresholds for cluster determination and compared those with HC5 clusters. Thresholds varied by serotype, 5-16 allelic distances and 15-20 single-nucleotide polymorphisms, showing limits of a single-threshold approach. We confirmed validity of HC5 screening for 3 serotypes because statistical thresholds had limited effect on isolate clustering (high sensitivity and specificity). For O80:H2, results suggest that HC5 is less reliable, and other approaches should be explored. Public health officials should regularly assess WGS used for Shiga toxin-producing E. coli surveillance to account for serotype and genomic evolution and to interpret WGS-linked isolates in light of epidemiologic data.
Collapse
|
2
|
Samper-Cativiela C, Torre-Fuentes L, Diéguez-Roda B, Maex M, Ugarte-Ruiz M, Carrizo P, Hernández M, Höfle Ú, Sáez JL, de Frutos C, Agüero M, Moreno MÁ, Domínguez L, Herrera-León S, Alvarez J. Molecular epidemiology of Salmonella Enteritidis in humans and animals in Spain. Antimicrob Agents Chemother 2025; 69:e0073824. [PMID: 40029002 PMCID: PMC11963599 DOI: 10.1128/aac.00738-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 01/11/2025] [Indexed: 03/05/2025] Open
Abstract
Salmonella Enteritidis, the most prevalent serovar-causing human gastroenteritis, has been traditionally linked to poultry sources. Although antimicrobial resistance (AMR) is not common in this serovar, increasing levels of resistance to fluoroquinolones and ampicillin have been reported in the last few years. Here, 298 isolates retrieved from different sources (human, livestock, wildlife, food, and environment) and years (2002-2021) in Spain were analyzed to evaluate their diversity, the distribution of AMR-conferring genes (ARGs), and mutations and reconstruct the epidemiology of infection due to this serovar. Isolates were clustered in two major clades (I and II), with strains in clade I (including 61.5% of all human isolates) displaying a pan-susceptible phenotype and not carrying AMR determinants. In contrast, clade II included 80.7% of isolates from animal/food/environmental sources, with the majority (69.8%) harboring mutations in the quinolone resistance determinant regions (QRDR). ARGs, although rare, were mostly found in clade II strains that also carried plasmid replicons, among which IncX1 was the most common. Although higher levels of phenotypic resistance were found in animal isolates, extended-spectrum beta-lactamase, plasmid-mediated AmpC, and carbapenemase-encoding genes were only found among human isolates. In summary, the majority of human and animal isolates from Spanish sources in our collection were classified in different phylogenetic branches, suggesting that additional sources are contributing to the occurrence of foodborne infections in Spain. Furthermore, the different distributions of virulence factors and ARGs in isolates from different sources and their association with specific plasmids suggest the presence of different dynamics contributing to the selection of resistant strains.
Collapse
Affiliation(s)
- Clara Samper-Cativiela
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
- Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Spain
| | - Laura Torre-Fuentes
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
| | | | - Margo Maex
- Division of Human Bacterial Diseases, Sciensano, Uccle, Belgium
| | - María Ugarte-Ruiz
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
| | - Paula Carrizo
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
- Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Spain
| | - Marta Hernández
- Departamento de Anatomía Patológica, Microbiología, Medicina Preventiva y Salud Pública, Medicina Legal y Forense. Facultad de Medicina, Universidad de Valladolid Facultad de Medicina, Valladolid, Spain
| | - Úrsula Höfle
- IREC, Instituto de Investigación en Recursos Cinegéticos, Ciudad Real, Spain
| | - José Luis Sáez
- Subdirección General de Sanidad e Higiene Animal y Trazabilidad, Dirección General de la Producción Agraria, Ministerio de Agricultura, Pesca y Alimentación, Madrid, Spain
| | - Cristina de Frutos
- Laboratorio Central de Veterinaria, Ministerio de Agricultura, Pesca y Alimentación, Algete, Spain
| | - Montserrat Agüero
- Laboratorio Central de Veterinaria, Ministerio de Agricultura, Pesca y Alimentación, Algete, Spain
| | - Miguel Ángel Moreno
- Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Spain
| | - Lucas Domínguez
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
- Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Spain
| | - Silvia Herrera-León
- Laboratorio de Referencia e Investigación en Enfermedades Bacterianas Transmitidas por Alimentos, Instituto de Salud Carlos III, Madrid, Spain
| | - Julio Alvarez
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
- Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Spain
| |
Collapse
|
3
|
Chanamé Pinedo LE, Franz E, Dallman TJ, Coipan CE, Wolthuis R, Veldman KT, Mughini-Gras L, Pijnacker R, van den Beld MJ. Genomic epidemiology of Salmonella Enteritidis human infections in the Netherlands, 2019 to 2023. Microb Genom 2025; 11:001394. [PMID: 40266678 PMCID: PMC12044193 DOI: 10.1099/mgen.0.001394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Accepted: 03/07/2025] [Indexed: 04/24/2025] Open
Abstract
Salmonella enterica serotype Enteritidis (SE) is a common foodborne pathogen that can cause human salmonellosis. Identifying closely related cases is essential to control the pathogen through, e.g. outbreak investigation, but it is often challenging due to the low genetic diversity of SE, particularly with traditional typing methods. This study aimed to investigate the population structure of SE genomes collected during routine surveillance in the Netherlands using whole-genome sequencing (WGS), their clustering, temporal distribution and the association between epidemiological and phenotypic antimicrobial resistance (AMR) factors and the persistence of SE clusters. We also investigated the distribution of genotypic AMR markers among these isolates. The study collection comprised 1,669 unique SE isolates from human infections collected from Dutch surveillance between 2019 and 2023, and their relatedness was derived using core-genome multi-locus sequence typing and Hamming distances. Based on the results, the 216 clusters comprised 1,085 sequences, in addition to 584 sequences depicted as singletons. These clusters predominantly fell within three major lineages, of which two were the previously described Global and Atlantic lineages. Of these clusters, approximately a third persisted for more than 1 year during the 5-year study period. However, no statistically significant associations were found between epidemiological factors, such as age, gender and travel history, or phenotypic AMR and the persistence of SE clusters. The most common AMR genetic markers observed were related to antimicrobial classes of (fluor)quinolones, β-lactamases and aminoglycosides. This study provides a better understanding of the genomic epidemiology of SE in the Netherlands based on WGS. Further analysis that includes samples from the food-chain supply, along with higher resolution methods during a post-Coronavirus Disease of 2019 (COVID-19) period, may provide more insights into the possible causes of the persistence of SE clusters.
Collapse
Affiliation(s)
- Linda E. Chanamé Pinedo
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Eelco Franz
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Timothy J. Dallman
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Claudia E. Coipan
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Roxanne Wolthuis
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Kees T. Veldman
- Wageningen Bioveterinary Research (WBVR), part of Wageningen University and Research, Lelystad, Netherlands
| | - Lapo Mughini-Gras
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Roan Pijnacker
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Maaike J.C. van den Beld
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| |
Collapse
|
4
|
Chen R, Yang L, Pajor MS, Wiedmann M, Orsi RH. Salmonella associated with agricultural animals exhibit diverse evolutionary rates and show evidence of recent clonal expansion. mBio 2024; 15:e0191324. [PMID: 39287448 PMCID: PMC11492988 DOI: 10.1128/mbio.01913-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Accepted: 08/18/2024] [Indexed: 09/19/2024] Open
Abstract
Most foodborne salmonellosis outbreaks are linked to agricultural animal products with a few serovars accounting for most Salmonella isolated from specific animal products, suggesting an adaptation to the corresponding animal hosts and their respective environments. Here, we utilized whole-genome sequence (WGS) data to analyze the evolution and population genetics of seven serovars frequently isolated from ground beef (Montevideo, Cerro, and Dublin), chicken (Kentucky, Infantis, and Enteritidis), and turkey (Reading) in the United States. In addition, publicly available metadata were used to characterize major clades within each serovar with regard to public health significance. Except for Dublin, all serovars were polyphyletic, comprising 2-6 phylogenetic groups. Further partitioning of the phylogenies identified 25 major clades, including 12 associated with animal or environmental niches. These 12 clades differed in evolutionary parameters (e.g., substitution rates) as well as public health relevant characteristics (e.g., association with human illness, antimicrobial resistance). Overall, our results highlight several critical trends: (i) the Salmonella generation time appears to be more dependent on source than serovar and (ii) all serovars contain clades and sub-clades that are estimated to have emerged after the year 1940 and that are enriched for isolates associated with humans, agricultural animals, antimicrobial resistance (AMR), and/or specific geographical regions. These findings suggest that serotyping alone does not provide enough resolution to differentiate isolates that may have evolved independently, present distinct geographic distribution and host association, and possibly have distinct public health significance. IMPORTANCE Non-typhoidal Salmonella are major foodborne bacterial pathogens estimated to cause more than one million illnesses, thousands of hospitalizations, and hundreds of deaths annually in the United States. More than 70% of Salmonella outbreaks in the United States have been associated with agricultural animals. Certain serovars include persistent strains that have repeatedly contaminated beef, chicken, and turkey, causing outbreaks and sporadic cases over many years. These persistent strains represent a particular challenge to public health, as they are genetically clonal and widespread, making it difficult to differentiate distinct outbreak and contamination events using whole-genome sequence (WGS)-based subtyping methods (e.g., core genome allelic typing). Our results indicate that a phylogenetic approach is needed to investigate persistent strains and suggest that the association between a Salmonella serovar and an agricultural animal is driven by the expansion of clonal subtypes that likely became adapted to specific animals and associated environments.
Collapse
Affiliation(s)
- Ruixi Chen
- Department of Food
Science, Cornell University,
Ithaca, New York, USA
| | - Linghuan Yang
- Department of Food
Science, Cornell University,
Ithaca, New York, USA
| | | | - Martin Wiedmann
- Department of Food
Science, Cornell University,
Ithaca, New York, USA
| | - Renato H. Orsi
- Department of Food
Science, Cornell University,
Ithaca, New York, USA
| |
Collapse
|
5
|
Schadron T, van den Beld M, Mughini-Gras L, Franz E. Use of whole genome sequencing for surveillance and control of foodborne diseases: status quo and quo vadis. Front Microbiol 2024; 15:1460335. [PMID: 39345263 PMCID: PMC11427404 DOI: 10.3389/fmicb.2024.1460335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Accepted: 08/27/2024] [Indexed: 10/01/2024] Open
Abstract
Improvements in sequencing quality, availability, speed and costs results in an increased presence of genomics in infectious disease applications. Nevertheless, there are still hurdles in regard to the optimal use of WGS for public health purposes. Here, we discuss the current state ("status quo") and future directions ("quo vadis") based on literature regarding the use of genomics in surveillance, hazard characterization and source attribution of foodborne pathogens. The future directions include the application of new techniques, such as machine learning and network approaches that may overcome the current shortcomings. These include the use of fixed genomic distances in cluster delineation, disentangling similarity or lack thereof in source attribution, and difficulties ascertaining function in hazard characterization. Although, the aforementioned methods can relatively easily be applied technically, an overarching challenge is the inference and biological/epidemiological interpretation of these large amounts of high-resolution data. Understanding the context in terms of bacterial isolate and host diversity allows to assess the level of representativeness in regard to sources and isolates in the dataset, which in turn defines the level of certainty associated with defining clusters, sources and risks. This also marks the importance of metadata (clinical, epidemiological, and biological) when using genomics for public health purposes.
Collapse
Affiliation(s)
- Tristan Schadron
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Maaike van den Beld
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Lapo Mughini-Gras
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Eelco Franz
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| |
Collapse
|
6
|
Leeper MM, Tolar BM, Griswold T, Vidyaprakash E, Hise KB, Williams GM, Im SB, Chen JC, Pouseele H, Carleton HA. Evaluation of whole and core genome multilocus sequence typing allele schemes for Salmonella enterica outbreak detection in a national surveillance network, PulseNet USA. Front Microbiol 2023; 14:1254777. [PMID: 37808298 PMCID: PMC10558246 DOI: 10.3389/fmicb.2023.1254777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 09/04/2023] [Indexed: 10/10/2023] Open
Abstract
Salmonella enterica is a leading cause of bacterial foodborne and zoonotic illnesses in the United States. For this study, we applied four different whole genome sequencing (WGS)-based subtyping methods: high quality single-nucleotide polymorphism (hqSNP) analysis, whole genome multilocus sequence typing using either all loci [wgMLST (all loci)] and only chromosome-associated loci [wgMLST (chrom)], and core genome multilocus sequence typing (cgMLST) to a dataset of isolate sequences from 9 well-characterized Salmonella outbreaks. For each outbreak, we evaluated the genomic and epidemiologic concordance between hqSNP and allele-based methods. We first compared pairwise genomic differences using all four methods. We observed discrepancies in allele difference ranges when using wgMLST (all loci), likely caused by inflated genetic variation due to loci found on plasmids and/or other mobile genetic elements in the accessory genome. Therefore, we excluded wgMLST (all loci) results from any further comparisons in the study. Then, we created linear regression models and phylogenetic tanglegrams using the remaining three methods. K-means analysis using the silhouette method was applied to compare the ability of the three methods to partition outbreak and sporadic isolate sequences. Our results showed that pairwise hqSNP differences had high concordance with cgMLST and wgMLST (chrom) allele differences. The slopes of the regressions for hqSNP vs. allele pairwise differences were 0.58 (cgMLST) and 0.74 [wgMLST (chrom)], and the slope of the regression was 0.77 for cgMLST vs. wgMLST (chrom) pairwise differences. Tanglegrams showed high clustering concordance between methods using two statistical measures, the Baker's gamma index (BGI) and cophenetic correlation coefficient (CCC), where 9/9 (100%) of outbreaks yielded BGI values ≥ 0.60 and CCCs were ≥ 0.97 across all nine outbreaks and all three methods. K-means analysis showed separation of outbreak and sporadic isolate groups with average silhouette widths ≥ 0.87 for outbreak groups and ≥ 0.16 for sporadic groups. This study demonstrates that Salmonella isolates clustered in concordance with epidemiologic data using three WGS-based subtyping methods and supports using cgMLST as the primary method for national surveillance of Salmonella outbreak clusters.
Collapse
Affiliation(s)
- Molly M. Leeper
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Beth M. Tolar
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Taylor Griswold
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Eshaw Vidyaprakash
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Kelley B. Hise
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Grant M. Williams
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Sung B. Im
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Jessica C. Chen
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | | | - Heather A. Carleton
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, United States
| |
Collapse
|
7
|
Mashe T, Thilliez G, Chaibva BV, Leekitcharoenphon P, Bawn M, Nyanzunda M, Robertson V, Tarupiwa A, Al-Khanaq H, Baker D, Gosa M, Kock MM, Midzi S, Witson ML, Jorge M, Jensen JD, Aarestrup FM, Weill FX, Hendriksen RS, Ehlers MM, Kingsley RA. Highly drug resistant clone of Salmonella Kentucky ST198 in clinical infections and poultry in Zimbabwe. NPJ ANTIMICROBIALS AND RESISTANCE 2023; 1:6. [PMID: 39843602 PMCID: PMC11721084 DOI: 10.1038/s44259-023-00003-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 03/27/2023] [Indexed: 01/24/2025]
Abstract
A highly multidrug-resistant strain of Salmonella enterica serotype Kentucky (S. Kentucky) of sequence type (ST)198 emerged in North Africa and has since spread widely. To investigate the genetic diversity and phylogenetic relationship of S. Kentucky in Zimbabwe and identify potential sources of infection, the whole-genome sequence of 37 S. Kentucky strains isolated from human clinical infections and from poultry farms between 2017 and 2020 was determined. Of 37 S. Kentucky isolates, 36 were ST198 and one was ST152. All ST198 isolates had between six and fifteen antimicrobial resistance (AMR) genes, and 92% carried at least ten AMRs. All ST198 isolates harbored the Salmonella genomic island K-Israel variant (SGI1-KIV) integrated into the chromosome with aac(3)-ld, aac(6)-laa, aadA7, blaTEM-1, sul1, and tetA genes, with occasional sporadic loss of one or more genes noted from five isolates. All ST198 isolates also had mutations in the quinolone resistance-determining region of the gyrA and parC genes. The blaCTX-M-14.1 and fosA3 genes were present in 92% of the ST198 isolates, conferring resistance to extended-spectrum cephalosporins and fosfomycin, respectively, were present on an IncHI2 plasmid with the aadA2b, aadA1, aph(3')-Ib, aph(6')-Id, cmlA1 and sul3 AMR genes. S. Kentucky ST198 isolates from Zimbabwe formed a closely related phylogenetic clade that emerged from a previously reported global epidemic population. The close genetic relationship and population structure of the human clinical and poultry isolates of ST198 in Zimbabwe are consistent with poultry being an important source of highly resistant strains of S. Kentucky in Zimbabwe.
Collapse
Affiliation(s)
- Tapfumanei Mashe
- University of Pretoria, Pretoria, South Africa.
- National Microbiology Reference Laboratory, Harare, Zimbabwe.
- World Health Organization, Harare, Zimbabwe.
| | | | | | | | - Matt Bawn
- Quadram Institute Bioscience, Norwich, UK
- Earlham Insitute, Norwich, UK
- University of Leeds, Leeds, UK
| | | | | | - Andrew Tarupiwa
- National Microbiology Reference Laboratory, Harare, Zimbabwe
| | | | - Dave Baker
- Quadram Institute Bioscience, Norwich, UK
| | | | - Marleen M Kock
- University of Pretoria, Pretoria, South Africa
- National Health Laboratory Service, Pretoria, South Africa
| | | | | | | | | | | | | | | | - Marthie M Ehlers
- University of Pretoria, Pretoria, South Africa
- National Health Laboratory Service, Pretoria, South Africa
| | - Robert A Kingsley
- Quadram Institute Bioscience, Norwich, UK.
- University of East Anglia, Norwich, UK.
| |
Collapse
|
8
|
Smith AM, Erasmus LK, Tau NP, Smouse SL, Ngomane HM, Disenyeng B, Whitelaw A, Lawrence CA, Sekwadi P, Thomas J. Enteric fever cluster identification in South Africa using genomic surveillance of Salmonella enterica serovar Typhi. Microb Genom 2023; 9. [PMID: 37339282 DOI: 10.1099/mgen.0.001044] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2023] Open
Abstract
The National Institute for Communicable Diseases in South Africa participates in national laboratory-based surveillance for human isolates of Salmonella species. Laboratory analysis includes whole-genome sequencing (WGS) of isolates. We report on WGS-based surveillance of Salmonella enterica serovar Typhi (Salmonella Typhi) in South Africa from 2020 through 2021. We describe how WGS analysis identified clusters of enteric fever in the Western Cape Province of South Africa and describe the epidemiological investigations associated with these clusters. A total of 206 Salmonella Typhi isolates were received for analysis. Genomic DNA was isolated from bacteria and WGS was performed using Illumina NextSeq technology. WGS data were investigated using multiple bioinformatics tools, including those available at the Centre for Genomic Epidemiology, EnteroBase and Pathogenwatch. Core-genome multilocus sequence typing was used to investigate the phylogeny of isolates and identify clusters. Three major clusters of enteric fever were identified in the Western Cape Province; cluster one (n=11 isolates), cluster two (n=13 isolates), and cluster three (n=14 isolates). To date, no likely source has been identified for any of the clusters. All isolates associated with the clusters, showed the same genotype (4.3.1.1.EA1) and resistome (antimicrobial resistance genes: bla TEM-1B, catA1, sul1, sul2, dfrA7). The implementation of genomic surveillance of Salmonella Typhi in South Africa has enabled rapid detection of clusters indicative of possible outbreaks. Cluster identification allows for targeted epidemiological investigations and a timely, coordinated public health response.
Collapse
Affiliation(s)
- Anthony Marius Smith
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
- Department of Medical Microbiology, Faculty of Health Sciences, University of Pretoria, Pretoria, South Africa
| | - Linda Kathleen Erasmus
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| | - Nomsa Pauline Tau
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| | - Shannon Lucrecia Smouse
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| | - Hlengiwe Mimmy Ngomane
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| | - Bolele Disenyeng
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| | - Andrew Whitelaw
- Department of Pathology, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
- National Health Laboratory Service, Tygerberg Hospital, Cape Town, South Africa
| | - Charlene Ann Lawrence
- Communicable Disease Control, Service Priorities Coordination, Department of Health, Cape Town, South Africa
| | - Phuti Sekwadi
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| | - Juno Thomas
- Centre for Enteric Diseases, National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa
| |
Collapse
|
9
|
Usein CR, Oprea M, Ciontea AS, Dinu S, Cristea D, Zota LC, Kotila S. A Snapshot of the Genetic Diversity of Salmonella Enteritidis Population Involved in Human Infections in Romania Taken in the European Epidemiological Context. Pathogens 2021; 10:pathogens10111490. [PMID: 34832645 PMCID: PMC8621327 DOI: 10.3390/pathogens10111490] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 11/12/2021] [Accepted: 11/12/2021] [Indexed: 11/16/2022] Open
Abstract
In the absence of consistent national molecular typing data to enhance the surveillance of Salmonella Enteritidis, it was considered useful to collect baseline information on the genetic diversity and antibiotic susceptibility of strains isolated in Romania between January 2016 and April 2020 and compare them to strains described in major international outbreaks of the same period. A collection of 245 clinical isolates were genotyped by a standardised multiple-locus variable-number of tandem repeats analysis (MLVA) 5-loci protocol and screened for antimicrobial resistance against 15 compounds. Twenty strains were further subjected to whole genome sequencing (WGS) and compared to epidemiologically relevant high-throughput sequencing data available in European databases. Twenty-seven MLVA genotypes were identified, of which three, commonly reported in Europe between 2016–2020, covered 72% of the collection. Antibiotic resistance was detected in 30% of the strains, with resistance to nalidixic acid and ciprofloxacin as the most common phenotype, and also associated with two prevalent MLVA clones. WGS-derived multilocus sequence typing (MLST) revealed a single sequence type (ST11) further resolved into 10 core-genome MLST complex types. The minimum spanning tree constructed from the cgMLST data clustered Romanian and international strains, which shared more than 95% of the core genes, revealing links with a contemporaneous multi-country outbreak. This study could be regarded as a forerunner to the advent of using this integrative approach in the public health practice at a national level and thus contribute to the concerted actions at a European level to stop outbreaks.
Collapse
Affiliation(s)
- Codruta-Romanita Usein
- “Cantacuzino” National Military Medical Institute for Research and Development, 050096 Bucharest, Romania; (A.S.C.); (S.D.); (D.C.)
- Correspondence: (C.-R.U.); (M.O.)
| | - Mihaela Oprea
- “Cantacuzino” National Military Medical Institute for Research and Development, 050096 Bucharest, Romania; (A.S.C.); (S.D.); (D.C.)
- Correspondence: (C.-R.U.); (M.O.)
| | - Adriana Simona Ciontea
- “Cantacuzino” National Military Medical Institute for Research and Development, 050096 Bucharest, Romania; (A.S.C.); (S.D.); (D.C.)
| | - Sorin Dinu
- “Cantacuzino” National Military Medical Institute for Research and Development, 050096 Bucharest, Romania; (A.S.C.); (S.D.); (D.C.)
| | - Daniela Cristea
- “Cantacuzino” National Military Medical Institute for Research and Development, 050096 Bucharest, Romania; (A.S.C.); (S.D.); (D.C.)
| | - Lavinia Cipriana Zota
- National Center for Surveillance and Control of Communicable Diseases, National Institute of Public Health, 050463 Bucharest, Romania;
| | - Saara Kotila
- European Centre for Disease Prevention and Control, 16973 Solna, Sweden;
| |
Collapse
|
10
|
Wang SH, Wu K, Chu T, Fernandes SL, Zhou Q, Zhang YD, Sun J. SOSPCNN: Structurally Optimized Stochastic Pooling Convolutional Neural Network for Tetralogy of Fallot recognition. WIRELESS COMMUNICATIONS & MOBILE COMPUTING 2021; 2021:1-17. [PMID: 35573891 PMCID: PMC7612722 DOI: 10.1155/2021/5792975] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 06/05/2021] [Indexed: 05/22/2023]
Abstract
Aim This study proposes a new artificial intelligence model based on cardiovascular computed tomography for more efficient and precise recognition of Tetralogy of Fallot (TOF). Methods Our model is a structurally optimized stochastic pooling convolutional neural network (SOSPCNN), which combines stochastic pooling, structural optimization, and convolutional neural network. In addition, multiple-way data augmentation is used to overcome overfitting. Grad-CAM is employed to provide explainability to the proposed SOSPCNN model. Meanwhile, both desktop and web apps are developed based on this SOSPCNN model. Results The results on ten runs of 10-fold cross-validation show that our SOSPCNN model yields a sensitivity of 92.25±2.19, a specificity of 92.75±2.49, a precision of 92.79±2.29, an accuracy of 92.50±1.18, an F1 score of 92.48±1.17, an MCC of 85.06±2.38, an FMI of 92.50±1.17, and an AUC of 0.9587. Conclusion The SOSPCNN method performed better than three state-of-the-art TOF recognition approaches.
Collapse
Affiliation(s)
- Shui-Hua Wang
- School of Informatics, University of Leicester, Leicester, LE1 7RH, UK
| | - Kaihong Wu
- The Affiliated Children's Hospital of Nanjing Medical University, Nanjing, China
| | - Tianshu Chu
- Nanjing Yirongda Institute of Intelligent Medicine and Additive Manufacturing, Nanjing, China
| | - Steven L Fernandes
- Department of Computer Science, Design & Journalism, Creighton University, Omaha, Nebraska, USA
| | - Qinghua Zhou
- School of Informatics, University of Leicester, Leicester, LE1 7RH, UK
| | - Yu-Dong Zhang
- School of Informatics, University of Leicester, Leicester, LE1 7RH, UK
- Nanjing Yirongda Institute of Intelligent Medicine and Additive Manufacturing, Nanjing, China
| | - Jian Sun
- The Affiliated Children's Hospital of Nanjing Medical University, Nanjing, China
| |
Collapse
|
11
|
Deneke C, Uelze L, Brendebach H, Tausch SH, Malorny B. Decentralized Investigation of Bacterial Outbreaks Based on Hashed cgMLST. Front Microbiol 2021; 12:649517. [PMID: 34220740 PMCID: PMC8244591 DOI: 10.3389/fmicb.2021.649517] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Accepted: 03/25/2021] [Indexed: 02/05/2023] Open
Abstract
Whole-genome sequencing (WGS)-based outbreak investigation has proven to be a valuable method for the surveillance of bacterial pathogens. Its utility has been successfully demonstrated using both gene-by-gene (cgMLST or wgMLST) and single-nucleotide polymorphism (SNP)-based approaches. Among the obstacles of implementing a WGS-based routine surveillance is the need for an exchange of large volumes of sequencing data, as well as a widespread reluctance to share sequence and metadata in public repositories, together with a lacking standardization of suitable bioinformatic tools and workflows. To address these issues, we present chewieSnake, an intuitive and simple-to-use cgMLST workflow. ChewieSnake builds on the allele calling software chewBBACA and extends it by the concept of allele hashing. The resulting hashed allele profiles can be readily compared between laboratories without the need of a central allele nomenclature. The workflow fully automates the computation of the allele distance matrix, cluster membership, and phylogeny and summarizes all important findings in an interactive HTML report. Furthermore, chewieSnake can join allele profiles generated at different laboratories and identify shared clusters, including a stable and intercommunicable cluster nomenclature, thus facilitating a joint outbreak investigation. We demonstrate the feasibility of the proposed approach with a thorough method comparison using publically available sequencing data for Salmonella enterica. However, chewieSnake is readily applicable to all bacterial taxa, provided that a suitable cgMLST scheme is available. The workflow is freely available as an open-source tool and can be easily installed via conda or docker.
Collapse
Affiliation(s)
- Carlus Deneke
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Laura Uelze
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Holger Brendebach
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Simon H Tausch
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Burkhard Malorny
- Department Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| |
Collapse
|
12
|
High-Resolution Genomic Comparisons within Salmonella enterica Serotypes Derived from Beef Feedlot Cattle: Parsing the Roles of Cattle Source, Pen, Animal, Sample Type, and Production Period. Appl Environ Microbiol 2021; 87:e0048521. [PMID: 33863705 DOI: 10.1128/aem.00485-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Salmonella enterica is a major foodborne pathogen, and contaminated beef products have been identified as one of the primary sources of Salmonella-related outbreaks. Pathogenicity and antibiotic resistance of Salmonella are highly serotype and subpopulation specific, which makes it essential to understand high-resolution Salmonella population dynamics in cattle. Time of year, source of cattle, pen, and sample type (i.e., feces, hide, or lymph nodes) have previously been identified as important factors influencing the serotype distribution of Salmonella (e.g., Anatum, Lubbock, Cerro, Montevideo, Kentucky, Newport, and Norwich) that were isolated from a longitudinal sampling design in a research feedlot. In this study, we performed high-resolution genomic comparisons of Salmonella isolates within each serotype using both single-nucleotide polymorphism-based maximum-likelihood phylogeny and hierarchical clustering of core-genome multilocus sequence typing. The importance of the aforementioned features in clonal Salmonella expansion was further explored using a supervised machine learning algorithm. In addition, we identified and compared the resistance genes, plasmids, and pathogenicity island profiles of the isolates within each subpopulation. Our findings indicate that clonal expansion of Salmonella strains in cattle was mainly influenced by the randomization of block and pen, as well as the origin/source of the cattle, i.e., regardless of sampling time and sample type (i.e., feces, lymph node, or hide). Further research is needed concerning the role of the feedlot pen environment prior to cattle placement to better understand carryover contributions of existing strains of Salmonella and their bacteriophages. IMPORTANCE Salmonella serotypes isolated from outbreaks in humans can also be found in beef cattle and feedlots. Virulence factors and antibiotic resistance are among the primary defense mechanisms of Salmonella, and are often associated with clonal expansion. This makes understanding the subpopulation dynamics of Salmonella in cattle critical for effective mitigation. There remains a gap in the literature concerning subpopulation dynamics within Salmonella serotypes in feedlot cattle from the beginning of feeding up until slaughter. Here, we explore Salmonella population dynamics within each serotype using core-genome phylogeny and hierarchical classifications. We used machine learning to quantitatively parse the relative importance of both hierarchical and longitudinal clustering among cattle host samples. Our results reveal that Salmonella populations in cattle are highly clonal over a 6-month study period and that clonal dissemination of Salmonella in cattle is mainly influenced spatially by experimental block and pen, as well by the geographical origin of the cattle.
Collapse
|
13
|
Zhou Z, Charlesworth J, Achtman M. HierCC: A multi-level clustering scheme for population assignments based on core genome MLST. Bioinformatics 2021; 37:3645-3646. [PMID: 33823553 PMCID: PMC8545296 DOI: 10.1093/bioinformatics/btab234] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 03/30/2021] [Accepted: 04/06/2021] [Indexed: 12/04/2022] Open
Abstract
Motivation Routine infectious disease surveillance is increasingly based on large-scale whole-genome sequencing databases. Real-time surveillance would benefit from immediate assignments of each genome assembly to hierarchical population structures. Here we present pHierCC, a pipeline that defines a scalable clustering scheme, HierCC, based on core genome multi-locus typing that allows incremental, static, multi-level cluster assignments of genomes. We also present HCCeval, which identifies optimal thresholds for assigning genomes to cohesive HierCC clusters. HierCC was implemented in EnteroBase in 2018 and has since genotyped >530 000 genomes from Salmonella, Escherichia/Shigella, Streptococcus, Clostridioides, Vibrio and Yersinia. Availability and implementation https://enterobase.warwick.ac.uk/ and Source code and instructions: https://github.com/zheminzhou/pHierCC Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhemin Zhou
- Warwick Medical School, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, United Kingdom
| | - Jane Charlesworth
- Warwick Medical School, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, United Kingdom
| | - Mark Achtman
- Warwick Medical School, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, United Kingdom
| |
Collapse
|
14
|
Barretto C, Rincón C, Portmann AC, Ngom-Bru C. Whole Genome Sequencing Applied to Pathogen Source Tracking in Food Industry: Key Considerations for Robust Bioinformatics Data Analysis and Reliable Results Interpretation. Genes (Basel) 2021; 12:275. [PMID: 33671973 PMCID: PMC7919020 DOI: 10.3390/genes12020275] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 01/28/2021] [Accepted: 02/08/2021] [Indexed: 12/31/2022] Open
Abstract
Whole genome sequencing (WGS) has arisen as a powerful tool to perform pathogen source tracking in the food industry thanks to several developments in recent years. However, the cost associated to this technology and the degree of expertise required to accurately process and understand the data has limited its adoption at a wider scale. Additionally, the time needed to obtain actionable information is often seen as an impairment for the application and use of the information generated via WGS. Ongoing work towards standardization of wet lab including sequencing protocols, following guidelines from the regulatory authorities and international standardization efforts make the technology more and more accessible. However, data analysis and results interpretation guidelines are still subject to initiatives coming from distinct groups and institutions. There are multiple bioinformatics software and pipelines developed to handle such information. Nevertheless, little consensus exists on a standard way to process the data and interpret the results. Here, we want to present the constraints we face in an industrial setting and the steps we consider necessary to obtain high quality data, reproducible results and a robust interpretation of the obtained information. All of this, in a time frame allowing for data-driven actions supporting factories and their needs.
Collapse
Affiliation(s)
- Caroline Barretto
- Institute of Food Safety and Analytical Sciences, Nestlé Research, 1000 Lausanne 26, Switzerland; (C.R.); (A.-C.P.); (C.N.-B.)
| | | | | | | |
Collapse
|
15
|
Tassinari E, Bawn M, Thilliez G, Charity O, Acton L, Kirkwood M, Petrovska L, Dallman T, Burgess CM, Hall N, Duffy G, Kingsley RA. Whole-genome epidemiology links phage-mediated acquisition of a virulence gene to the clonal expansion of a pandemic Salmonella enterica serovar Typhimurium clone. Microb Genom 2020; 6:mgen000456. [PMID: 33112226 PMCID: PMC7725340 DOI: 10.1099/mgen.0.000456] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 10/01/2020] [Indexed: 01/10/2023] Open
Abstract
Epidemic and pandemic clones of bacterial pathogens with distinct characteristics continually emerge, replacing those previously dominant through mechanisms that remain poorly characterized. Here, whole-genome-sequencing-powered epidemiology linked horizontal transfer of a virulence gene, sopE, to the emergence and clonal expansion of a new epidemic Salmonella enterica serovar Typhimurium (S. Typhimurium) clone. The sopE gene is sporadically distributed within the genus Salmonella and rare in S. enterica Typhimurium lineages, but was acquired multiple times during clonal expansion of the currently dominant pandemic monophasic S. Typhimurium sequence type (ST) 34 clone. Ancestral state reconstruction and time-scaled phylogenetic analysis indicated that sopE was not present in the common ancestor of the epidemic clade, but later acquisition resulted in increased clonal expansion of sopE-containing clones that was temporally associated with emergence of the epidemic, consistent with increased fitness. The sopE gene was mainly associated with a temperate bacteriophage mTmV, but recombination with other bacteriophage and apparent horizontal gene transfer of the sopE gene cassette resulted in distribution among at least four mobile genetic elements within the monophasic S. enterica Typhimurium ST34 epidemic clade. The mTmV prophage lysogenic transfer to other S. enterica serovars in vitro was limited, but included the common pig-associated S. enterica Derby (S. Derby). This may explain mTmV in S. Derby co-circulating on farms with monophasic S. Typhimurium ST34, highlighting the potential for further transfer of the sopE virulence gene in nature. We conclude that whole-genome epidemiology pinpoints potential drivers of evolutionary and epidemiological dynamics during pathogen emergence, and identifies targets for subsequent research in epidemiology and bacterial pathogenesis.
Collapse
Affiliation(s)
- Eleonora Tassinari
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
- Teagasc Food Research Centre, Ashtown, Dublin 15, Ireland
| | - Matt Bawn
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
- Earlham Institute, Norwich Research Park, Norwich, UK
| | - Gaetan Thilliez
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | - Oliver Charity
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | - Luke Acton
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | - Mark Kirkwood
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
| | | | - Timothy Dallman
- Gastrointestinal Bacteria Reference Unit, National Infection Service, Public Health England, London, UK
| | | | - Neil Hall
- Earlham Institute, Norwich Research Park, Norwich, UK
| | | | - Robert A. Kingsley
- Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
- University of East Anglia, Norwich, UK
| |
Collapse
|
16
|
Evaluation of an ensemble-based distance statistic for clustering MLST datasets using epidemiologically defined clusters of cyclosporiasis. Epidemiol Infect 2020; 148:e172. [PMID: 32741426 PMCID: PMC7439293 DOI: 10.1017/s0950268820001697] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Outbreaks of cyclosporiasis, a food-borne illness caused by the coccidian parasite Cyclospora cayetanensis have increased in the USA in recent years, with approximately 2300 laboratory-confirmed cases reported in 2018. Genotyping tools are needed to inform epidemiological investigations, yet genotyping Cyclospora has proven challenging due to its sexual reproductive cycle which produces complex infections characterized by high genetic heterogeneity. We used targeted amplicon deep sequencing and a recently described ensemble-based distance statistic that accommodates heterogeneous (mixed) genotypes and specimens with partial genotyping data, to genotype and cluster 648 C. cayetanensis samples submitted to CDC in 2018. The performance of the ensemble was assessed by comparing ensemble-identified genetic clusters to analogous clusters identified independently based on common food exposures. Using these epidemiologic clusters as a gold standard, the ensemble facilitated genetic clustering with 93.8% sensitivity and 99.7% specificity. Hence, we anticipate that this procedure will greatly complement epidemiologic investigations of cyclosporiasis.
Collapse
|