1
|
Grote A, Hendin N, Amit S, Adani B, Rahav G, Adler A, Livny J, Gal-Mor O, Earl AM. Genetic diversity of Salmonella enterica during acute human infections. Gut Microbes 2025; 17:2491666. [PMID: 40260673 PMCID: PMC12026202 DOI: 10.1080/19490976.2025.2491666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/25/2024] [Revised: 03/27/2025] [Accepted: 04/05/2025] [Indexed: 04/24/2025] Open
Abstract
The ubiquitous bacterial pathogen Salmonella enterica is the causative agent of both enteric fever and gastroenteritis. Despite its significant global health burden, we lack an understanding of its genetic diversity during acute infection, with ramifications for treatment and prevention. Here, we investigated within-host infection diversity of acute salmonellosis using whole-genome sequencing of blood or stool isolates obtained from 23 different patients. We found that intestinal infections exhibited greater genetic variation than blood infections, including in their plasmid content. While same-patient isolates were separated by 10 single nucleotide polymorphisms or less, they often differed in the carriage of genes or alleles, including those associated with antibiotic resistance or virulence. Given the longstanding emphasis on single colony isolation in clinical and laboratory microbiology, these findings have implications for how we both study evolution and transmission and how we treat salmonellosis in an age of increasing antibiotic resistance.
Collapse
Affiliation(s)
- Alexandra Grote
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Natav Hendin
- The Infectious Diseases unit, Sheba Medical Center, Tel-Hashomer, Israel
| | - Sharon Amit
- Microbiology Laboratory, Sheba Medical Center, Tel-Hashomer, Israel
| | - Boaz Adani
- The Infectious Diseases unit, Sheba Medical Center, Tel-Hashomer, Israel
| | - Galia Rahav
- The Infectious Diseases unit, Sheba Medical Center, Tel-Hashomer, Israel
| | - Amos Adler
- Clinical Microbiology, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Jonathan Livny
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ohad Gal-Mor
- Infectious Diseases Research Laboratory, Sheba Medical Center, Tel-Hashomer, Israel
- Department of Clinical Microbiology and Immunology, Tel-Aviv University, Tel-Aviv, Israel
| | - Ashlee M. Earl
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
2
|
Kavanagh NL, Kinnevey PM, Brennan GI, O’Connell B, Goering RV, Coleman DC. Co-carriage of diverse vancomycin-resistant Enterococcus faecium ST80-lineages by 70% of patients in an Irish hospital. JAC Antimicrob Resist 2025; 7:dlaf065. [PMID: 40309497 PMCID: PMC12039289 DOI: 10.1093/jacamr/dlaf065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2025] [Accepted: 04/15/2025] [Indexed: 05/02/2025] Open
Abstract
Background Vancomycin-resistant Enterococcus faecium (VREfm) are significant nosocomial pathogens. Irish VREfm comprise diverse vanA-encoding ST80-complex type (CT) lineages. Recent studies indicate that within-patient VREfm diversity could confound surveillance. This study investigated the intra-host VREfm genetic diversity among colonized Irish hospital patients. Methods Rectal VREfm (n = 150) from 10 patients (15 isolates each) were investigated by WGS, core-genome MLST and split k-mer (SKA)-SNP analysis. Plasmids and vanA-transposons from 39 VREfm representative of CTs identified were resolved by hybrid assembly of short-read (Illumina) and long-read (Oxford Nanopore Technologies) sequences. Plasmid relatedness was assessed based on Mash distances. Thirty vancomycin-susceptible E. faecium (VSEfm) from four VREfm-positive patients were also investigated. Results All isolates were clade A1 and most were ST80 (VREfm, 147/150; VSEfm, 25/30). Seventy-percent of patients (7/10) harboured either two (n = 4), three (n = 2) or four (n = 1) VREfm CTs. Individual patient isolate pairs from different CTs differed significantly (median SKA-SNPs 2933), but differences were minimal between isolate pairs of the same CT (median SKA-SNPs 0). In total, 193 plasmids were identified in 39 VREfm investigated. Near-identical plasmids (≥99.5% average nucleotide identity) were identified in divergent CTs from multiple patients. Most VREfm (28/39, 72%) harboured vanA on closely related transferable, linear plasmids. Divergent CTs within individual patients harboured either indistinguishable vanA-transposons or vanA-transposons with distinct organizational iterations. Four VSEfm from different CTs investigated harboured similar plasmids to VREfm. Conclusion VREfm within-host diversity is highly prevalent in Irish hospital patients, which complicates surveillance. Linear plasmids play an important role in the emergence of Irish VREfm.
Collapse
Affiliation(s)
- Nicole L Kavanagh
- Microbiology Research Unit, Division of Oral Biosciences, Dublin Dental University Hospital, University of Dublin, Trinity College Dublin, Lincoln Place, Dublin D02 F859, Ireland
| | - Peter M Kinnevey
- Microbiology Research Unit, Division of Oral Biosciences, Dublin Dental University Hospital, University of Dublin, Trinity College Dublin, Lincoln Place, Dublin D02 F859, Ireland
| | - Grainne I Brennan
- Department of Clinical Microbiology, St. James’s Hospital, Dublin, Ireland
- National MRSA Reference Laboratory, St. James’s Hospital, Dublin, Ireland
| | - Brian O’Connell
- Department of Clinical Microbiology, St. James’s Hospital, Dublin, Ireland
- National MRSA Reference Laboratory, St. James’s Hospital, Dublin, Ireland
| | - Richard V Goering
- Department of Medical Microbiology and Immunology, Creighton University School of Medicine, Omaha, NE, USA
| | - David C Coleman
- Microbiology Research Unit, Division of Oral Biosciences, Dublin Dental University Hospital, University of Dublin, Trinity College Dublin, Lincoln Place, Dublin D02 F859, Ireland
| |
Collapse
|
3
|
Sobkowiak B, Cudahy P, Chitwood MH, Clark TG, Colijn C, Grandjean L, Walter KS, Crudu V, Cohen T. A new method for detecting mixed Mycobacterium tuberculosis infection and reconstructing constituent strains provides insights into transmission. Genome Med 2025; 17:8. [PMID: 39871355 PMCID: PMC11771024 DOI: 10.1186/s13073-025-01430-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 01/03/2025] [Indexed: 01/29/2025] Open
Abstract
BACKGROUND Mixed infection with multiple strains of the same pathogen in a single host can present clinical and analytical challenges. Whole genome sequence (WGS) data can identify signals of multiple strains in samples, though the precision of previous methods can be improved. Here, we present MixInfect2, a new tool to accurately detect mixed samples from Mycobacterium tuberculosis short-read WGS data. We then evaluate three approaches for reconstructing the underlying mixed constituent strain sequences. This allows these samples to be included in downstream analysis to gain insights into the epidemiology and transmission of mixed infections. METHODS We employed a Gaussian mixture model to cluster allele frequencies at mixed sites (hSNPs) in each sample to identify signals of multiple strains. Building upon our previous tool, MixInfect, we increased the accuracy of classifying in vitro mixed samples through multiple improvements to the bioinformatic pipeline. Major and minor proportion constituent strains were reconstructed using three approaches and assessed by comparing the estimated sequence to the known constituent strain sequence. Lastly, mixed infections in a real-world Mycobacterium tuberculosis population from Moldova were detected with MixInfect2 and clusters of recent transmission that included major and minor constituent strains were built. RESULTS All 36/36 in vitro mixed and 12/12 non-mixed samples were correctly classified with MixInfect2, and major strain proportions were estimated with high accuracy (within 3% of the true strain proportion), outperforming previous tools. Reconstructed major strain sequences closely matched the true constituent sequence by taking the allele at the highest frequency at hSNPs, while the best-performing approach to reconstruct the minor proportion strain sequence was identifying the closest non-mixed isolate in the same population, though no approach was effective when the minor strain proportion was at 5%. Finally, fewer mixed infections were identified in Moldova than previous estimates (6.6% vs 17.4%) and we found multiple instances where the constituent strains of mixed samples were present in transmission clusters. CONCLUSIONS MixInfect2 accurately detects samples with evidence of mixed infection from short-read WGS data and provides an excellent estimate of the mixture proportions. While there are limitations in reconstructing the constituent strain sequences of mixed samples, we present recommendations for the best approach to include these isolates in further analyses.
Collapse
Affiliation(s)
- Benjamin Sobkowiak
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA.
- Department of Infection, Immunity and Inflammation, Institute of Child Health, University College London, London, UK.
| | - Patrick Cudahy
- Division of Infectious Diseases, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Melanie H Chitwood
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA
| | - Taane G Clark
- Faculty of Infectious and Tropical Diseases, School of Hygiene and Tropical Medicine, London, UK
- Faculty of Epidemiology and Public Health, School of Hygiene and Tropical Medicine, London, UK
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, 8888 University Drive West, Burnaby, BC, Canada
| | - Louis Grandjean
- Department of Infection, Immunity and Inflammation, Institute of Child Health, University College London, London, UK
| | | | - Valeriu Crudu
- Phthisiopneumology Institute, Strada Constantin Vârnav 13, Chisinau, Republic of Moldova
| | - Ted Cohen
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA
| |
Collapse
|
4
|
Tran-Kiem C, Bedford T. Estimating the reproduction number and transmission heterogeneity from the size distribution of clusters of identical pathogen sequences. Proc Natl Acad Sci U S A 2024; 121:e2305299121. [PMID: 38568971 PMCID: PMC11009662 DOI: 10.1073/pnas.2305299121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 02/26/2024] [Indexed: 04/05/2024] Open
Abstract
Quantifying transmission intensity and heterogeneity is crucial to ascertain the threat posed by infectious diseases and inform the design of interventions. Methods that jointly estimate the reproduction number R and the dispersion parameter k have however mainly remained limited to the analysis of epidemiological clusters or contact tracing data, whose collection often proves difficult. Here, we show that clusters of identical sequences are imprinted by the pathogen offspring distribution, and we derive an analytical formula for the distribution of the size of these clusters. We develop and evaluate an inference framework to jointly estimate the reproduction number and the dispersion parameter from the size distribution of clusters of identical sequences. We then illustrate its application across a range of epidemiological situations. Finally, we develop a hypothesis testing framework relying on clusters of identical sequences to determine whether a given pathogen genetic subpopulation is associated with increased or reduced transmissibility. Our work provides tools to estimate the reproduction number and transmission heterogeneity from pathogen sequences without building a phylogenetic tree, thus making it easily scalable to large pathogen genome datasets.
Collapse
Affiliation(s)
- Cécile Tran-Kiem
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA98109
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA98109
- HHMI, Seattle, WA98109
| |
Collapse
|
5
|
McHugh MP, Pettigrew KA, Taori S, Evans TJ, Leanord A, Gillespie SH, Templeton KE, Holden MTG. Consideration of within-patient diversity highlights transmission pathways and antimicrobial resistance gene variability in vancomycin-resistant Enterococcus faecium. J Antimicrob Chemother 2024; 79:656-668. [PMID: 38323373 PMCID: PMC11090465 DOI: 10.1093/jac/dkae023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 01/02/2024] [Indexed: 02/08/2024] Open
Abstract
BACKGROUND WGS is increasingly being applied to healthcare-associated vancomycin-resistant Enterococcus faecium (VREfm) outbreaks. Within-patient diversity could complicate transmission resolution if single colonies are sequenced from identified cases. OBJECTIVES Determine the impact of within-patient diversity on transmission resolution of VREfm. MATERIALS AND METHODS Fourteen colonies were collected from VREfm positive rectal screens, single colonies were collected from clinical samples and Illumina WGS was performed. Two isolates were selected for Oxford Nanopore sequencing and hybrid genome assembly to generate lineage-specific reference genomes. Mapping to closely related references was used to identify genetic variations and closely related genomes. A transmission network was inferred for the entire genome set using Phyloscanner. RESULTS AND DISCUSSION In total, 229 isolates from 11 patients were sequenced. Carriage of two or three sequence types was detected in 27% of patients. Presence of antimicrobial resistance genes and plasmids was variable within genomes from the same patient and sequence type. We identified two dominant sequence types (ST80 and ST1424), with two putative transmission clusters of two patients within ST80, and a single cluster of six patients within ST1424. We found transmission resolution was impaired using fewer than 14 colonies. CONCLUSIONS Patients can carry multiple sequence types of VREfm, and even within related lineages the presence of mobile genetic elements and antimicrobial resistance genes can vary. VREfm within-patient diversity could be considered in future to aid accurate resolution of transmission networks.
Collapse
Affiliation(s)
- Martin P McHugh
- School of Medicine, University of St Andrews, St Andrews, UK
- Medical Microbiology, Department of Laboratory Medicine, Royal Infirmary of Edinburgh, Edinburgh, UK
| | | | - Surabhi Taori
- Medical Microbiology, Department of Laboratory Medicine, Royal Infirmary of Edinburgh, Edinburgh, UK
| | - Thomas J Evans
- School of Infection and Immunity, University of Glasgow, Glasgow, UK
| | - Alistair Leanord
- School of Infection and Immunity, University of Glasgow, Glasgow, UK
- Scottish Microbiology Reference Laboratories, Glasgow Royal Infirmary, Glasgow, UK
| | | | - Kate E Templeton
- Medical Microbiology, Department of Laboratory Medicine, Royal Infirmary of Edinburgh, Edinburgh, UK
| | | |
Collapse
|
6
|
Abstract
Bacterial pathogens undergo remarkable adaptive change in response to the selective forces they encounter during host colonization and infection. Studies performed over the past few decades have demonstrated that many general evolutionary processes can be discerned during the course of host adaptation, including genetic diversification of lineages, clonal succession events, convergent evolution, and balanced fitness trade-offs. In some cases, elevated mutation rates resulting from mismatch repair or proofreading deficiencies accelerate evolution, and active mobile genetic elements or phages may facilitate genome plasticity. The host immune response provides another critical component of the fitness landscapes guiding adaptation, and selection operating on pathogens at this level may lead to immune evasion and the establishment of chronic infection. This review summarizes recent advances in this field, with a special focus on different forms of bacterial genome plasticity in the context of infection, and considers clinical consequences of adaptive changes for the host.
Collapse
Affiliation(s)
- John P Dekker
- Bacterial Pathogenesis and Antimicrobial Resistance Unit, Laboratory of Clinical Immunology and Microbiology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland, USA;
- National Institutes of Health Clinical Center, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
7
|
Senghore M, Read H, Oza P, Johnson S, Passarelli-Araujo H, Taylor BP, Ashley S, Grey A, Callendrello A, Lee R, Goddard MR, Lumley T, Hanage WP, Wiles S. Inferring bacterial transmission dynamics using deep sequencing genomic surveillance data. Nat Commun 2023; 14:6397. [PMID: 37907520 PMCID: PMC10618251 DOI: 10.1038/s41467-023-42211-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Accepted: 09/27/2023] [Indexed: 11/02/2023] Open
Abstract
Identifying and interrupting transmission chains is important for controlling infectious diseases. One way to identify transmission pairs - two hosts in which infection was transmitted from one to the other - is using the variation of the pathogen within each single host (within-host variation). However, the role of such variation in transmission is understudied due to a lack of experimental and clinical datasets that capture pathogen diversity in both donor and recipient hosts. In this work, we assess the utility of deep-sequenced genomic surveillance (where genomic regions are sequenced hundreds to thousands of times) using a mouse transmission model involving controlled spread of the pathogenic bacterium Citrobacter rodentium from infected to naïve female animals. We observe that within-host single nucleotide variants (iSNVs) are maintained over multiple transmission steps and present a model for inferring the likelihood that a given pair of sequenced samples are linked by transmission. In this work we show that, beyond the presence and absence of within-host variants, differences arising in the relative abundance of iSNVs (allelic frequency) can infer transmission pairs more precisely. Our approach further highlights the critical role bottlenecks play in reserving the within-host diversity during transmission.
Collapse
Affiliation(s)
- Madikay Senghore
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA.
| | - Hannah Read
- Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
| | - Priyali Oza
- Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
| | - Sarah Johnson
- Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
| | - Hemanoel Passarelli-Araujo
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Minas Gerais, Brazil
| | - Bradford P Taylor
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
| | - Stephen Ashley
- Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
| | - Alex Grey
- Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
| | - Alanna Callendrello
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
| | - Robyn Lee
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
- University of Toronto Dalla Lana School of Public Health, Toronto, ON, Canada
| | - Matthew R Goddard
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
- School of Life and Environmental Sciences, University of Lincoln, Lincoln, UK
| | - Thomas Lumley
- Department of Statistics, University of Auckland, Auckland, New Zealand
| | - William P Hanage
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
| | - Siouxsie Wiles
- Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand.
- Te Pūnaha Matatini, Centre of Research Excellence in Complex Systems, Auckland, New Zealand.
| |
Collapse
|
8
|
Hackman J, Sheppard C, Phelan J, Jones-Warner W, Sobkowiak B, Shah S, Litt D, Fry NK, Toizumi M, Yoshida LM, Hibberd M, Miller E, Flasche S, Hué S. Phylogenetic inference of pneumococcal transmission from cross-sectional data, a pilot study. Wellcome Open Res 2023; 8:427. [PMID: 38638914 PMCID: PMC11024593 DOI: 10.12688/wellcomeopenres.19219.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/03/2023] [Indexed: 04/20/2024] Open
Abstract
Background: Inference on pneumococcal transmission has mostly relied on longitudinal studies which are costly and resource intensive. Therefore, we conducted a pilot study to test the ability to infer who infected whom from cross-sectional pneumococcal sequences using phylogenetic inference. Methods: Five suspected transmission pairs, for which there was epidemiological evidence of who infected whom, were selected from a household study. For each pair, Streptococcus pneumoniae full genomes were sequenced from nasopharyngeal swabs collected on the same day. The within-host genetic diversity of the pneumococcal population was used to infer the transmission direction and then cross-validated with the direction suggested by the epidemiological records. Results: The pneumococcal genomes clustered into the five households from which the samples were taken. The proportion of concordantly inferred transmission direction generally increased with increasing minimum genome fragment size and single nucleotide polymorphisms. We observed a larger proportion of unique polymorphic sites in the source bacterial population compared to that of the recipient in four of the five pairs, as expected in the case of a transmission bottleneck. The only pair that did not exhibit this effect was also the pair that had consistent discordant transmission direction compared to the epidemiological records suggesting potential misdirection as a result of false-negative sampling. Conclusions: This pilot provided support for further studies to test if the direction of pneumococcal transmission can be reliably inferred from cross-sectional samples if sequenced with sufficient depth and fragment length.
Collapse
Affiliation(s)
- Jada Hackman
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
- School of Tropical Medicine and Global Health, Nagasaki University, Nagasaki, Japan
| | - Carmen Sheppard
- Vaccine Preventable Bacteria Section, UK Health Security Agency, London, UK
| | - Jody Phelan
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
| | - William Jones-Warner
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
| | - Ben Sobkowiak
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
| | - Sonal Shah
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
| | - David Litt
- Vaccine Preventable Bacteria Section, UK Health Security Agency, London, UK
| | - Norman K. Fry
- Vaccine Preventable Bacteria Section, UK Health Security Agency, London, UK
- Immunisation & Countermeasures Division, UK Health Security Agency, London, UK
| | - Michiko Toizumi
- School of Tropical Medicine and Global Health, Nagasaki University, Nagasaki, Japan
- Department of Paediatric Infectious Diseases, Nagasaki University, Nagasaki, Japan
| | - Lay-Myint Yoshida
- School of Tropical Medicine and Global Health, Nagasaki University, Nagasaki, Japan
- Department of Paediatric Infectious Diseases, Nagasaki University, Nagasaki, Japan
| | - Martin Hibberd
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
| | - Elizabeth Miller
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
| | - Stefan Flasche
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
| | - Stéphane Hué
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
| |
Collapse
|
9
|
Torres Ortiz A, Kendall M, Storey N, Hatcher J, Dunn H, Roy S, Williams R, Williams C, Goldstein RA, Didelot X, Harris K, Breuer J, Grandjean L. Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks. eLife 2023; 12:e84384. [PMID: 37732733 PMCID: PMC10602588 DOI: 10.7554/elife.84384] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Accepted: 09/20/2023] [Indexed: 09/22/2023] Open
Abstract
Accurate inference of who infected whom in an infectious disease outbreak is critical for the delivery of effective infection prevention and control. The increased resolution of pathogen whole-genome sequencing has significantly improved our ability to infer transmission events. Despite this, transmission inference often remains limited by the lack of genomic variation between the source case and infected contacts. Although within-host genetic diversity is common among a wide variety of pathogens, conventional whole-genome sequencing phylogenetic approaches exclusively use consensus sequences, which consider only the most prevalent nucleotide at each position and therefore fail to capture low-frequency variation within samples. We hypothesized that including within-sample variation in a phylogenetic model would help to identify who infected whom in instances in which this was previously impossible. Using whole-genome sequences from SARS-CoV-2 multi-institutional outbreaks as an example, we show how within-sample diversity is partially maintained among repeated serial samples from the same host, it can transmitted between those cases with known epidemiological links, and how this improves phylogenetic inference and our understanding of who infected whom. Our technique is applicable to other infectious diseases and has immediate clinical utility in infection prevention and control.
Collapse
Affiliation(s)
- Arturo Torres Ortiz
- Department of Infectious Diseases, Imperial College LondonLondonUnited Kingdom
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| | - Michelle Kendall
- Department of Statistics, University of WarwickCoventryUnited Kingdom
| | - Nathaniel Storey
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
| | - James Hatcher
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
| | - Helen Dunn
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
| | - Sunando Roy
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| | | | | | | | - Xavier Didelot
- Department of Statistics, University of WarwickCoventryUnited Kingdom
| | - Kathryn Harris
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
- Department of Virology, East & South East London Pathology Partnership, Royal London Hospital, Barts Health NHS TrustLondonUnited Kingdom
| | - Judith Breuer
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| | - Louis Grandjean
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| |
Collapse
|
10
|
Batisti Biffignandi G, Bellinzona G, Petazzoni G, Sassera D, Zuccotti GV, Bandi C, Baldanti F, Comandatore F, Gaiarsa S. P-DOR, an easy-to-use pipeline to reconstruct bacterial outbreaks using genomics. Bioinformatics 2023; 39:btad571. [PMID: 37701995 PMCID: PMC10533420 DOI: 10.1093/bioinformatics/btad571] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/24/2023] [Accepted: 09/12/2023] [Indexed: 09/14/2023] Open
Abstract
SUMMARY Bacterial Healthcare-Associated Infections (HAIs) are a major threat worldwide, which can be counteracted by establishing effective infection control measures, guided by constant surveillance and timely epidemiological investigations. Genomics is crucial in modern epidemiology but lacks standard methods and user-friendly software, accessible to users without a strong bioinformatics proficiency. To overcome these issues we developed P-DOR, a novel tool for rapid bacterial outbreak characterization. P-DOR accepts genome assemblies as input, it automatically selects a background of publicly available genomes using k-mer distances and adds it to the analysis dataset before inferring a Single-Nucleotide Polymorphism (SNP)-based phylogeny. Epidemiological clusters are identified considering the phylogenetic tree topology and SNP distances. By analyzing the SNP-distance distribution, the user can gauge the correct threshold. Patient metadata can be inputted as well, to provide a spatio-temporal representation of the outbreak. The entire pipeline is fast and scalable and can be also run on low-end computers. AVAILABILITY AND IMPLEMENTATION P-DOR is implemented in Python3 and R and can be installed using conda environments. It is available from GitHub https://github.com/SteMIDIfactory/P-DOR under the GPL-3.0 license.
Collapse
Affiliation(s)
| | - Greta Bellinzona
- Department of Biology and Biotechnology, University of Pavia, Pavia, 27100, Italy
| | - Greta Petazzoni
- Department of Medical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, 27100, Italy
- Microbiology and Virology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, 27100, Italy
| | - Davide Sassera
- Department of Biology and Biotechnology, University of Pavia, Pavia, 27100, Italy
- Fondazione IRCCS Policlinico San Matteo, Pavia, 27100, Italy
| | - Gian Vincenzo Zuccotti
- Department of Biomedical and Clinical Sciences, Pediatric Clinical Research Center Romeo ed Enrica Invernizzi, University of Milan, Milan, 20157, Italy
- Pediatric Department, Buzzi Children’s Hospital, Milan, 20154, Italy
| | - Claudio Bandi
- Department of Biosciences, Pediatric Clinical Research Center Romeo ed Enrica Invernizzi, University of Milan, Milan, 20133, Italy
| | - Fausto Baldanti
- Department of Medical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, 27100, Italy
- Microbiology and Virology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, 27100, Italy
| | - Francesco Comandatore
- Department of Biomedical and Clinical Sciences, Pediatric Clinical Research Center Romeo ed Enrica Invernizzi, University of Milan, Milan, 20157, Italy
| | - Stefano Gaiarsa
- Microbiology and Virology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, 27100, Italy
| |
Collapse
|
11
|
Taouk ML, Steinig E, Taiaroa G, Savic I, Tran T, Higgins N, Tran S, Lee A, Braddick M, Moso MA, Chow EPF, Fairley CK, Towns J, Chen MY, Caly L, Lim CK, Williamson DA. Intra- and interhost genomic diversity of monkeypox virus. J Med Virol 2023; 95:e29029. [PMID: 37565686 PMCID: PMC10952654 DOI: 10.1002/jmv.29029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 07/21/2023] [Accepted: 07/25/2023] [Indexed: 08/12/2023]
Abstract
The impact and frequency of infectious disease outbreaks demonstrate the need for timely genomic surveillance to inform public health responses. In the largest known outbreak of mpox, genomic surveillance efforts have primarily focused on high-incidence nations in Europe and the Americas, with a paucity of data from South-East Asia and the Western Pacific. Here we analyzed 102 monkeypox virus (MPXV) genomes sampled from 56 individuals in Melbourne, Australia. All genomes fell within the 2022 MPXV outbreak lineage (B.1), with likely onward local transmission detected. We observed within-host diversity and instances of co-infection, and highlight further examples of structural variation and apolipoprotein B editing complex-driven micro-evolution in the current MPXV outbreak. Updating our understanding of MPXV emergence and diversification will inform public health measures and enable monitoring of the virus' evolutionary trajectory throughout the mpox outbreak.
Collapse
Affiliation(s)
- Mona L. Taouk
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Eike Steinig
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - George Taiaroa
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Ivana Savic
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Thomas Tran
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Nasra Higgins
- Victorian Department of HealthMelbourneVictoriaAustralia
| | - Stephanie Tran
- Victorian Department of HealthMelbourneVictoriaAustralia
| | - Alvin Lee
- Victorian Department of HealthMelbourneVictoriaAustralia
| | | | - Michael A. Moso
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Eric P. F. Chow
- Melbourne Sexual Health CentreAlfred HealthMelbourneVictoriaAustralia
- Central Clinical School, Faculty of Medicine, Nursing and Health SciencesMonash UniversityMelbourneVictoriaAustralia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global HealthThe University of MelbourneMelbourneVictoriaAustralia
| | - Christopher K. Fairley
- Melbourne Sexual Health CentreAlfred HealthMelbourneVictoriaAustralia
- Central Clinical School, Faculty of Medicine, Nursing and Health SciencesMonash UniversityMelbourneVictoriaAustralia
| | - Janet Towns
- Melbourne Sexual Health CentreAlfred HealthMelbourneVictoriaAustralia
| | - Marcus Y. Chen
- Melbourne Sexual Health CentreAlfred HealthMelbourneVictoriaAustralia
| | - Leon Caly
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Chuan K. Lim
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| | - Deborah A. Williamson
- Department of Infectious DiseasesThe University of Melbourne at the Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
- Victorian Infectious Diseases Reference LaboratoryThe Royal Melbourne Hospital at The Peter Doherty Institute for Infection and ImmunityMelbourneVictoriaAustralia
| |
Collapse
|
12
|
Ke Z, Vikalo H. Graph-Based Reconstruction and Analysis of Disease Transmission Networks Using Viral Genomic Data. J Comput Biol 2023. [PMID: 37347892 DOI: 10.1089/cmb.2022.0373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/24/2023] Open
Abstract
Understanding the patterns of viral disease transmissions helps establish public health policies and aids in controlling and ending a disease outbreak. Classical methods for studying disease transmission dynamics that rely on epidemiological data, such as times of sample collection and duration of exposure intervals, struggle to provide desired insight due to limited informativeness of such data. A more precise characterization of disease transmissions may be acquired from sequencing data that reveal genetic distance between viral genomes in patient samples. Indeed, genetic distance between viral strains present in hosts contains valuable information about transmission history, thus motivating the design of methods that rely on genomic data to reconstruct a directed disease transmission network, detect transmission clusters, and identify significant network nodes (e.g., super-spreaders). In this article, we present a novel end-to-end framework for the analysis of viral transmissions utilizing viral genomic (sequencing) data. The proposed framework groups infected hosts into transmission clusters based on the reconstructed viral strains infecting them; the genetic distance between a pair of hosts is calculated using Earth Mover's Distance, and further used to infer transmission direction between the hosts. To quantify the significance of a host in the transmission network, the importance score is calculated by a graph convolutional autoencoder. The viral transmission network is represented by a directed minimum spanning tree utilizing the Edmond's algorithm modified to incorporate constraints on the importance scores of the hosts. The proposed framework outperforms state-of-the-art techniques for the analysis of viral transmission dynamics in several experiments on semiexperimental as well as experimental data.
Collapse
Affiliation(s)
- Ziqi Ke
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas, USA
| | - Haris Vikalo
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas, USA
| |
Collapse
|
13
|
Determination and quantification of microbial communities and antimicrobial resistance on food through host DNA-depleted metagenomics. Food Microbiol 2023; 110:104162. [DOI: 10.1016/j.fm.2022.104162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 09/28/2022] [Accepted: 10/09/2022] [Indexed: 11/05/2022]
|
14
|
Djeghout B, Bloomfield SJ, Rudder S, Elumogo N, Mather AE, Wain J, Janecko N. Comparative genomics of Campylobacter jejuni from clinical campylobacteriosis stool specimens. Gut Pathog 2022; 14:45. [PMID: 36476389 PMCID: PMC9727990 DOI: 10.1186/s13099-022-00520-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 11/22/2022] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Campylobacter jejuni is a pervasive pathogen of major public health concern with a complex ecology requiring accurate and informative approaches to define pathogen diversity during outbreak investigations. Source attribution analysis may be confounded if the genetic diversity of a C. jejuni population is not adequately captured in a single specimen. The aim of this study was to determine the genomic diversity of C. jejuni within individual stool specimens from four campylobacteriosis patients. Direct plating and pre-culture filtration of one stool specimen per patient was used to culture multiple isolates per stool specimen. Whole genome sequencing and pangenome level analysis were used to investigate genomic diversity of C. jejuni within a patient. RESULTS A total 92 C. jejuni isolates were recovered from four patients presenting with gastroenteritis. The number of isolates ranged from 13 to 30 per patient stool. Three patients yielded a single C. jejuni multilocus sequence type: ST-21 (n = 26, patient 4), ST-61 (n = 30, patient 1) and ST-2066 (n = 23, patient 2). Patient 3 was infected with two different sequence types [ST-51 (n = 12) and ST-354 (n = 1)]. Isolates belonging to the same sequence type from the same patient specimen shared 12-43 core non-recombinant SNPs and 0-20 frameshifts with each other, and the pangenomes of each sequence type consisted of 1406-1491 core genes and 231-264 accessory genes. However, neither the mutation nor the accessory genes were connected to a specific functional gene category. CONCLUSIONS Our findings show that the C. jejuni population recovered from an individual patient's stool are genetically diverse even within the same ST and may have shared common ancestors before specimens were obtained. The population is unlikely to have evolved from a single isolate at the time point of initial patient infection, leading us to conclude that patients were likely infected with a heterogeneous C. jejuni population. The diversity of the C. jejuni population found within individual stool specimens can inform future methodological approaches to attribution and outbreak investigations.
Collapse
Affiliation(s)
- Bilal Djeghout
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK
| | - Samuel J. Bloomfield
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK
| | - Steven Rudder
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK
| | - Ngozi Elumogo
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK ,grid.416391.80000 0004 0400 0120Eastern Pathology Alliance, Norfolk and Norwich University Hospital, Norwich, NR4 7UY UK
| | - Alison E. Mather
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK ,grid.8273.e0000 0001 1092 7967Faculty of Medicine and Health Sciences, University of East Anglia, Norwich, NR4 7TJ UK
| | - John Wain
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK ,grid.8273.e0000 0001 1092 7967Faculty of Medicine and Health Sciences, University of East Anglia, Norwich, NR4 7TJ UK
| | - Nicol Janecko
- grid.40368.390000 0000 9347 0159Quadram Institute Bioscience, Rosalind Franklin Rd, Norwich Research Park, Norwich, NR4 7UQ UK
| |
Collapse
|
15
|
Brown TS, Robinson DA, Buckee CO, Mathema B. Connecting the dots: understanding how human mobility shapes TB epidemics. Trends Microbiol 2022; 30:1036-1044. [PMID: 35597716 PMCID: PMC10068677 DOI: 10.1016/j.tim.2022.04.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 04/18/2022] [Accepted: 04/19/2022] [Indexed: 01/13/2023]
Abstract
Tuberculosis (TB) remains a leading infectious cause of death worldwide. Reducing TB infections and TB-related deaths rests ultimately on stopping forward transmission from infectious to susceptible individuals. Critical to this effort is understanding how human host mobility shapes the transmission and dispersal of new or existing strains of Mycobacterium tuberculosis (Mtb). Important questions remain unanswered. What kinds of mobility, over what temporal and spatial scales, facilitate TB transmission? How do human mobility patterns influence the dispersal of novel Mtb strains, including emergent drug-resistant strains? This review summarizes the current state of knowledge on mobility and TB epidemic dynamics, using examples from three topic areas, including inference of genetic and spatial clustering of infections, delineating source-sink dynamics, and mapping the dispersal of novel TB strains, to examine scientific questions and methodological issues within this topic. We also review new data sources for measuring human mobility, including mobile phone-associated movement data, and discuss important limitations on their use in TB epidemiology.
Collapse
Affiliation(s)
- Tyler S Brown
- Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Infectious Diseases Division, Massachusetts General Hospital, Boston, MA, USA
| | - D Ashley Robinson
- Department of Microbiology and Immunology, University of Mississippi Medical Center, Jackson, MS, USA
| | - Caroline O Buckee
- Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Barun Mathema
- Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA.
| |
Collapse
|
16
|
Alamil M, Thébaud G, Berthier K, Soubeyrand S. Characterizing viral within-host diversity in fast and non-equilibrium demo-genetic dynamics. Front Microbiol 2022; 13:983938. [PMID: 36274731 PMCID: PMC9581327 DOI: 10.3389/fmicb.2022.983938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/08/2022] [Indexed: 11/13/2022] Open
Abstract
High-throughput sequencing has opened the route for a deep assessment of within-host genetic diversity that can be used, e.g., to characterize microbial communities and to infer transmission links in infectious disease outbreaks. The performance of such characterizations and inferences cannot be analytically assessed in general and are often grounded on computer-intensive evaluations. Then, being able to simulate within-host genetic diversity across time under various demo-genetic assumptions is paramount to assess the performance of the approaches of interest. In this context, we built an original model that can be simulated to investigate the temporal evolution of genotypes and their frequencies under various demo-genetic assumptions. The model describes the growth and the mutation of genotypes at the nucleotide resolution conditional on an overall within-host viral kinetics, and can be tuned to generate fast non-equilibrium demo-genetic dynamics. We ran simulations of this model and computed classic diversity indices to characterize the temporal variation of within-host genetic diversity (from high-throughput amplicon sequences) of virus populations under three demographic kinetic models of viral infection. Our results highlight how demographic (viral load) and genetic (mutation, selection, or drift) factors drive variations in within-host diversity during the course of an infection. In particular, we observed a non-monotonic relationship between pathogen population size and genetic diversity, and a reduction of the impact of mutation on diversity when a non-specific host immune response is activated. The large variation in the diversity patterns generated in our simulations suggests that the underlying model provides a flexible basis to produce very diverse demo-genetic scenarios and test, for instance, methods for the inference of transmission links during outbreaks.
Collapse
Affiliation(s)
- Maryam Alamil
- INRAE, BioSP, Avignon, France
- Department of Mathematics and Computer Science, Alfaisal University, Riyadh, Saudi Arabia
- *Correspondence: Maryam Alamil ;
| | - Gaël Thébaud
- PHIM Plant Health Institute, INRAE, Univ Montpellier, CIRAD, Institut Agro, IRD, Montpellier, France
| | | | | |
Collapse
|
17
|
Lundgren E, Romero-Severson E, Albert J, Leitner T. Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification. PLoS Comput Biol 2022; 18:e1009741. [PMID: 36026480 PMCID: PMC9455879 DOI: 10.1371/journal.pcbi.1009741] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 09/08/2022] [Accepted: 08/02/2022] [Indexed: 01/07/2023] Open
Abstract
To identify and stop active HIV transmission chains new epidemiological techniques are needed. Here, we describe the development of a multi-biomarker augmentation to phylogenetic inference of the underlying transmission history in a local population. HIV biomarkers are measurable biological quantities that have some relationship to the amount of time someone has been infected with HIV. To train our model, we used five biomarkers based on real data from serological assays, HIV sequence data, and target cell counts in longitudinally followed, untreated patients with known infection times. The biomarkers were modeled with a mixed effects framework to allow for patient specific variation and general trends, and fit to patient data using Markov Chain Monte Carlo (MCMC) methods. Subsequently, the density of the unobserved infection time conditional on observed biomarkers were obtained by integrating out the random effects from the model fit. This probabilistic information about infection times was incorporated into the likelihood function for the transmission history and phylogenetic tree reconstruction, informed by the HIV sequence data. To critically test our methodology, we developed a coalescent-based simulation framework that generates phylogenies and biomarkers given a specific or general transmission history. Testing on many epidemiological scenarios showed that biomarker augmented phylogenetics can reach 90% accuracy under idealized situations. Under realistic within-host HIV-1 evolution, involving substantial within-host diversification and frequent transmission of multiple lineages, the average accuracy was at about 50% in transmission clusters involving 5-50 hosts. Realistic biomarker data added on average 16 percentage points over using the phylogeny alone. Using more biomarkers improved the performance. Shorter temporal spacing between transmission events and increased transmission heterogeneity reduced reconstruction accuracy, but larger clusters were not harder to get right. More sequence data per infected host also improved accuracy. We show that the method is robust to incomplete sampling and that adding biomarkers improves reconstructions of real HIV-1 transmission histories. The technology presented here could allow for better prevention programs by providing data for locally informed and tailored strategies.
Collapse
Affiliation(s)
- Erik Lundgren
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Ethan Romero-Severson
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Jan Albert
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Microbiology, Karolinska University Hospital, Stockholm, Sweden
| | - Thomas Leitner
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- * E-mail:
| |
Collapse
|
18
|
Wegener A, Duim B, van der Graaf-van Bloois L, Zomer AL, Visser CE, Spaninks M, Timmerman AJ, Wagenaar JA, Broens EM. Within-Household Transmission and Bacterial Diversity of Staphylococcus pseudintermedius. Pathogens 2022; 11:pathogens11080850. [PMID: 36014971 PMCID: PMC9415945 DOI: 10.3390/pathogens11080850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 07/26/2022] [Accepted: 07/26/2022] [Indexed: 12/05/2022] Open
Abstract
Staphylococcus pseudintermedius can be transmitted between dogs and their owners and can cause opportunistic infections in humans. Whole genome sequencing was applied to identify the relatedness between isolates from human infections and isolates from dogs in the same households. Genome SNP diversity and distribution of plasmids and antimicrobial resistance genes identified related and unrelated isolates in both households. Our study shows that within-host bacterial diversity is present in S. pseudintermedius, demonstrating that multiple isolates from each host should preferably be sequenced to study transmission dynamics.
Collapse
Affiliation(s)
- Alice Wegener
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
| | - Birgitta Duim
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
- Correspondence:
| | - Linda van der Graaf-van Bloois
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
| | - Aldert L. Zomer
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
| | - Caroline E. Visser
- Department of Medical Microbiology & Infection Prevention, Amsterdam UMC Location AMC, Amsterdam, 1105 AZ Amsterdam, The Netherlands;
| | - Mirlin Spaninks
- Department Population Health Sciences, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands;
| | - Arjen J. Timmerman
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
| | - Jaap A. Wagenaar
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
- Wageningen Bioveterinary Research, 8221 RA Lelystad, The Netherlands
| | - Els M. Broens
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 CL Utrecht, The Netherlands; (A.W.); (L.v.d.G.-v.B.); (A.L.Z.); (A.J.T.); (J.A.W.); (E.M.B.)
| |
Collapse
|
19
|
Ortiz AT, Kendall M, Storey N, Hatcher J, Dunn H, Roy S, Williams R, Williams C, Goldstein RA, Didelot X, Harris K, Breuer J, Grandjean L. Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.06.07.495142. [PMID: 35702156 PMCID: PMC9196117 DOI: 10.1101/2022.06.07.495142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Accurate inference of who infected whom in an infectious disease outbreak is critical for the delivery of effective infection prevention and control. The increased resolution of pathogen whole-genome sequencing has significantly improved our ability to infer transmission events. Despite this, transmission inference often remains limited by the lack of genomic variation between the source case and infected contacts. Although within-host genetic diversity is common among a wide variety of pathogens, conventional whole-genome sequencing phylogenetic approaches to reconstruct outbreaks exclusively use consensus sequences, which consider only the most prevalent nucleotide at each position and therefore fail to capture low frequency variation within samples. We hypothesized that including within-sample variation in a phylogenetic model would help to identify who infected whom in instances in which this was previously impossible. Using whole-genome sequences from SARS-CoV-2 multi-institutional outbreaks as an example, we show how within-sample diversity is stable among repeated serial samples from the same host, is transmitted between those cases with known epidemiological links, and how this improves phylogenetic inference and our understanding of who infected whom. Our technique is applicable to other infectious diseases and has immediate clinical utility in infection prevention and control.
Collapse
Affiliation(s)
| | - Michelle Kendall
- Department of Statistics, University of Warwick, Coventry, CV4 7AL
| | - Nathaniel Storey
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
| | - James Hatcher
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
| | - Helen Dunn
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
| | - Sunando Roy
- Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH
| | - Rachel Williams
- UCL Genomics, Institute of Child Health, UCL, London WC1N 1EH
| | | | | | - Xavier Didelot
- Department of Statistics, University of Warwick, Coventry, CV4 7AL
| | - Kathryn Harris
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
- Department of Virology, East South East London Pathology Partnership, Royal London Hospital, Barts Health NHS Trust, London E12ES
| | - Judith Breuer
- Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH
| | - Louis Grandjean
- Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH
| |
Collapse
|
20
|
Foster-Nyarko E, Pallen MJ. The microbial ecology of Escherichia coli in the vertebrate gut. FEMS Microbiol Rev 2022; 46:fuac008. [PMID: 35134909 PMCID: PMC9075585 DOI: 10.1093/femsre/fuac008] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 01/31/2022] [Accepted: 02/01/2022] [Indexed: 11/13/2022] Open
Abstract
Escherichia coli has a rich history as biology's 'rock star', driving advances across many fields. In the wild, E. coli resides innocuously in the gut of humans and animals but is also a versatile pathogen commonly associated with intestinal and extraintestinal infections and antimicrobial resistance-including large foodborne outbreaks such as the one that swept across Europe in 2011, killing 54 individuals and causing approximately 4000 infections and 900 cases of haemolytic uraemic syndrome. Given that most E. coli are harmless gut colonizers, an important ecological question plaguing microbiologists is what makes E. coli an occasionally devastating pathogen? To address this question requires an enhanced understanding of the ecology of the organism as a commensal. Here, we review how our knowledge of the ecology and within-host diversity of this organism in the vertebrate gut has progressed in the 137 years since E. coli was first described. We also review current approaches to the study of within-host bacterial diversity. In closing, we discuss some of the outstanding questions yet to be addressed and prospects for future research.
Collapse
Affiliation(s)
- Ebenezer Foster-Nyarko
- Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, United Kingdom
- Norwich Medical School, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, United Kingdom
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| | - Mark J Pallen
- Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, United Kingdom
- School of Veterinary Medicine, University of Surrey, Guildford, Surrey, GU2 7AL, United Kingdom
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TU, United Kingdom
| |
Collapse
|
21
|
Methods Combining Genomic and Epidemiological Data in the Reconstruction of Transmission Trees: A Systematic Review. Pathogens 2022; 11:pathogens11020252. [PMID: 35215195 PMCID: PMC8875843 DOI: 10.3390/pathogens11020252] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 02/08/2022] [Accepted: 02/11/2022] [Indexed: 11/17/2022] Open
Abstract
In order to better understand transmission dynamics and appropriately target control and preventive measures, studies have aimed to identify who-infected-whom in actual outbreaks. Numerous reconstruction methods exist, each with their own assumptions, types of data, and inference strategy. Thus, selecting a method can be difficult. Following PRISMA guidelines, we systematically reviewed the literature for methods combing epidemiological and genomic data in transmission tree reconstruction. We identified 22 methods from the 41 selected articles. We defined three families according to how genomic data was handled: a non-phylogenetic family, a sequential phylogenetic family, and a simultaneous phylogenetic family. We discussed methods according to the data needed as well as the underlying sequence mutation, within-host evolution, transmission, and case observation. In the non-phylogenetic family consisting of eight methods, pairwise genetic distances were estimated. In the phylogenetic families, transmission trees were inferred from phylogenetic trees either simultaneously (nine methods) or sequentially (five methods). While a majority of methods (17/22) modeled the transmission process, few (8/22) took into account imperfect case detection. Within-host evolution was generally (7/8) modeled as a coalescent process. These practical and theoretical considerations were highlighted in order to help select the appropriate method for an outbreak.
Collapse
|
22
|
Senghore M, Chaguza C, Bojang E, Tientcheu PE, Bancroft RE, Lo SW, Gladstone RA, McGee L, Worwui A, Foster-Nyarko E, Ceesay F, Okoi CB, Klugman KP, Breiman RF, Bentley SD, Adegbola R, Antonio M, Hanage WP, Kwambana-Adams BA. Widespread sharing of pneumococcal strains in a rural African setting: proximate villages are more likely to share similar strains that are carried at multiple timepoints. Microb Genom 2022; 8. [PMID: 35119356 PMCID: PMC8942022 DOI: 10.1099/mgen.0.000732] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The transmission dynamics of Streptococcus pneumoniae in sub-Saharan Africa are poorly understood due to a lack of adequate epidemiological and genomic data. Here we leverage a longitudinal cohort from 21 neighbouring villages in rural Africa to study how closely related strains of S. pneumoniae are shared among infants. We analysed 1074 pneumococcal genomes isolated from 102 infants from 21 villages. Strains were designated for unique serotype and sequence-type combinations, and we arbitrarily defined strain sharing where the pairwise genetic distance between strains could be accounted for by the mean within host intra-strain diversity. We used non-parametric statistical tests to assess the role of spatial distance and prolonged carriage on strain sharing using a logistic regression model. We recorded 458 carriage episodes including 318 (69.4 %) where the carried strain was shared with at least one other infant. The odds of strain sharing varied significantly across villages (χ2=47.5, df=21, P-value <0.001). Infants in close proximity to each other were more likely to be involved in strain sharing, but we also show a considerable amount of strain sharing across longer distances. Close geographic proximity (<5 km) between shared strains was associated with a significantly lower pairwise SNP distance compared to strains shared over longer distances (P-value <0.005). Sustained carriage of a shared strain among the infants was significantly more likely to occur if they resided in villages within a 5 km radius of each other (P-value <0.005, OR 3.7). Conversely, where both infants were transiently colonized by the shared strain, they were more likely to reside in villages separated by over 15 km (P-value <0.05, OR 1.5). PCV7 serotypes were rare (13.5 %) and were significantly less likely to be shared (P-value <0.001, OR −1.07). Strain sharing was more likely to occur over short geographical distances, especially where accompanied by sustained colonization. Our results show that strain sharing is a useful proxy for studying transmission dynamics in an under-sampled population with limited genomic data. This article contains data hosted by Microreact.
Collapse
Affiliation(s)
- Madikay Senghore
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia.,Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA
| | - Chrispin Chaguza
- Infection Genomics, Wellcome Sanger Institute, Hinxton, UK.,Darwin College, University of Cambridge, Silver Street, Cambridge, UK.,Department of Clinical Infection, Microbiology and Immunology, Institute of Infection and Global Health, University of Liverpool, Liverpool, UK
| | - Ebrima Bojang
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Peggy-Estelle Tientcheu
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Rowan E Bancroft
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Stephanie W Lo
- Infection Genomics, Wellcome Sanger Institute, Hinxton, UK
| | | | - Lesley McGee
- Respiratory Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Archibald Worwui
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Ebenezer Foster-Nyarko
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Fatima Ceesay
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Catherine Bi Okoi
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Keith P Klugman
- Rollins School Public Health, Emory University, Atlanta, USA
| | - Robert F Breiman
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA
| | | | - Richard Adegbola
- Immunisation and Global Health Consulting, RAMBICON, Lagos, Nigeria
| | - Martin Antonio
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia.,Microbiology and Infection Unit, Warwick Medical School, University of Warwick, Coventry, UK
| | - William P Hanage
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA
| | - Brenda A Kwambana-Adams
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia.,NIHR Global Health Research Unit on Mucosal Pathogens, Division of Infection and Immunity, University College London, London, UK
| |
Collapse
|
23
|
Forde TL, Dennis TPW, Aminu OR, Harvey WT, Hassim A, Kiwelu I, Medvecky M, Mshanga D, Van Heerden H, Vogel A, Zadoks RN, Mmbaga BT, Lembo T, Biek R. Population genomics of Bacillus anthracis from an anthrax hyperendemic area reveals transmission processes across spatial scales and unexpected within-host diversity. Microb Genom 2022; 8:000759. [PMID: 35188453 PMCID: PMC8942019 DOI: 10.1099/mgen.0.000759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 12/10/2021] [Indexed: 11/18/2022] Open
Abstract
Genomic sequencing has revolutionized our understanding of bacterial disease epidemiology, but remains underutilized for zoonotic pathogens in remote endemic settings. Anthrax, caused by the spore-forming bacterium Bacillus anthracis, remains a threat to human and animal health and rural livelihoods in low- and middle-income countries. While the global genomic diversity of B. anthracis has been well-characterized, there is limited information on how its populations are genetically structured at the scale at which transmission occurs, critical for understanding the pathogen's evolution and transmission dynamics. Using a uniquely rich dataset, we quantified genome-wide SNPs among 73 B. anthracis isolates derived from 33 livestock carcasses sampled over 1 year throughout the Ngorongoro Conservation Area, Tanzania, a region hyperendemic for anthrax. Genome-wide SNPs distinguished 22 unique B. anthracis genotypes (i.e. SNP profiles) within the study area. However, phylogeographical structure was lacking, as identical SNP profiles were found throughout the study area, likely the result of the long and variable periods of spore dormancy and long-distance livestock movements. Significantly, divergent genotypes were obtained from spatio-temporally linked cases and even individual carcasses. The high number of SNPs distinguishing isolates from the same host is unlikely to have arisen during infection, as supported by our simulation models. This points to an unexpectedly wide transmission bottleneck for B. anthracis, with an inoculum comprising multiple variants being the norm. Our work highlights that inferring transmission patterns of B. anthracis from genomic data will require analytical approaches that account for extended and variable environmental persistence, as well as co-infection.
Collapse
Affiliation(s)
- Taya L. Forde
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Tristan P. W. Dennis
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - O. Rhoda Aminu
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - William T. Harvey
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Ayesha Hassim
- Department of Veterinary Tropical Diseases, Faculty of Veterinary Science, University of Pretoria, Onderstepoort, South Africa
| | - Ireen Kiwelu
- Kilimanjaro Clinical Research Institute, Kilimanjaro Christian Medical Centre, Moshi, Tanzania
| | - Matej Medvecky
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | | | - Henriette Van Heerden
- Department of Veterinary Tropical Diseases, Faculty of Veterinary Science, University of Pretoria, Onderstepoort, South Africa
| | - Adeline Vogel
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Ruth N. Zadoks
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
- Present address: Sydney School of Veterinary Science, University of Sydney, Sydney, Australia
| | - Blandina T. Mmbaga
- Kilimanjaro Clinical Research Institute, Kilimanjaro Christian Medical Centre, Moshi, Tanzania
- Kilimanjaro Christian Medical University College, Moshi, Tanzania
| | - Tiziana Lembo
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Roman Biek
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| |
Collapse
|
24
|
Gallego-García P, Varela N, Estévez-Gómez N, De Chiara L, Fernández-Silva I, Valverde D, Sapoval N, Treangen TJ, Regueiro B, Cabrera-Alvargonzález JJ, del Campo V, Pérez S, Posada D. OUP accepted manuscript. Virus Evol 2022; 8:veac008. [PMID: 35242361 PMCID: PMC8889950 DOI: 10.1093/ve/veac008] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 12/21/2021] [Accepted: 02/04/2022] [Indexed: 11/23/2022] Open
Abstract
A detailed understanding of how and when severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission occurs is crucial for designing effective prevention measures. Other than contact tracing, genome sequencing provides information to help infer who infected whom. However, the effectiveness of the genomic approach in this context depends on both (high enough) mutation and (low enough) transmission rates. Today, the level of resolution that we can obtain when describing SARS-CoV-2 outbreaks using just genomic information alone remains unclear. In order to answer this question, we sequenced forty-nine SARS-CoV-2 patient samples from ten local clusters in NW Spain for which partial epidemiological information was available and inferred transmission history using genomic variants. Importantly, we obtained high-quality genomic data, sequencing each sample twice and using unique barcodes to exclude cross-sample contamination. Phylogenetic and cluster analyses showed that consensus genomes were generally sufficient to discriminate among independent transmission clusters. However, levels of intrahost variation were low, which prevented in most cases the unambiguous identification of direct transmission events. After filtering out recurrent variants across clusters, the genomic data were generally compatible with the epidemiological information but did not support specific transmission events over possible alternatives. We estimated the effective transmission bottleneck size to be one to two viral particles for sample pairs whose donor–recipient relationship was likely. Our analyses suggest that intrahost genomic variation in SARS-CoV-2 might be generally limited and that homoplasy and recurrent errors complicate identifying shared intrahost variants. Reliable reconstruction of direct SARS-CoV-2 transmission based solely on genomic data seems hindered by a slow mutation rate, potential convergent events, and technical artifacts. Detailed contact tracing seems essential in most cases to study SARS-CoV-2 transmission at high resolution.
Collapse
Affiliation(s)
| | - Nair Varela
- CINBIO, Universidade de Vigo, Vigo 36310, Spain
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
| | - Nuria Estévez-Gómez
- CINBIO, Universidade de Vigo, Vigo 36310, Spain
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
| | - Loretta De Chiara
- CINBIO, Universidade de Vigo, Vigo 36310, Spain
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
| | - Iria Fernández-Silva
- Department of Biochemistry, Genetics, and Immunology, Universidade de Vigo, Vigo 36310, Spain
| | - Diana Valverde
- CINBIO, Universidade de Vigo, Vigo 36310, Spain
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
- Department of Biochemistry, Genetics, and Immunology, Universidade de Vigo, Vigo 36310, Spain
| | | | | | - Benito Regueiro
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
- Department of Microbiology, Complexo Hospitalario Universitario de Vigo (CHUVI), Sergas, Vigo 36213, Spain
- Microbiology and Parasitology Department, Medicine and Odontology, Universidade de Santiago, Santiago de Compostela 15782, Spain
| | - Jorge Julio Cabrera-Alvargonzález
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
- Department of Microbiology, Complexo Hospitalario Universitario de Vigo (CHUVI), Sergas, Vigo 36213, Spain
| | - Víctor del Campo
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO
- Department of Preventive Medicine, Complexo Hospitalario Universitario de Vigo (CHUVI), Sergas, Vigo 36213, Spain
| | | | | |
Collapse
|
25
|
Bernaquez I, Gaudreau C, Pilon PA, Bekal S. Evaluation of whole-genome sequencing-based subtyping methods for the surveillance of Shigella spp. and the confounding effect of mobile genetic elements in long-term outbreaks. Microb Genom 2021; 7. [PMID: 34730485 PMCID: PMC8743557 DOI: 10.1099/mgen.0.000672] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Many public health laboratories across the world have implemented whole-genome sequencing (WGS) for the surveillance and outbreak detection of foodborne pathogens. PulseNet-affiliated laboratories have determined that most single-strain foodborne outbreaks are contained within 0–10 multi-locus sequence typing (MLST)-based allele differences and/or core genome single-nucleotide variants (SNVs). In addition to being a food- and travel-associated outbreak pathogen, most
Shigella
spp. cases occur through continuous person-to-person transmission, predominantly involving men who have sex with men (MSM), leading to long-term and recurrent outbreaks. Continuous transmission patterns coupled to genetic evolution under antibiotic treatment pressure require an assessment of existing WGS-based subtyping methods and interpretation criteria for cluster inclusion/exclusion. An evaluation of 4 WGS-based subtyping methods [SNVPhyl, coreMLST, core genome MLST (cgMLST) and whole-genome MLST (wgMLST)] was performed on 9 foodborne-, travel- and MSM-related retrospective outbreaks from a collection of 91
Shigella flexneri
and 232
Shigella sonnei
isolates to determine the methods’ epidemiological concordance, discriminatory power, robustness and ability to generate stable interpretation criteria. The discriminatory powers were ranked as follows: coreMLST<SNVPhyl<cgMLST<wgMLST (range: 0.970–1.000). The genetic differences observed for non-MSM-related
Shigella
spp. outbreaks respect the standard 0–10 allele/SNV guideline; however, mobile genetic element (MGE)-encoded loci caused inflated genetic variation and discrepant phylogenies for prolonged MSM-related
S. sonnei
outbreaks via wgMLST. The
S. sonnei
correlation coefficients of wgMLST were also the lowest at 0.680, 0.703 and 0.712 for SNVPhyl, coreMLST and cgMLST, respectively. Plasmid maintenance, mobilization and conjugation-associated genes were found to be the main source of genetic distance inflation in addition to prophage-related genes. Duplicated alleles arising from the repeated nature of IS elements were also responsible for many false cg/wgMLST differences. The coreMLST approach was shown to be the most robust, followed by SNVPhyl and wgMLST for inter-laboratory comparability. Our results highlight the need for validating species-specific subtyping methods based on microbial genome plasticity and outbreak dynamics in addition to the importance of filtering confounding MGEs for cluster detection.
Collapse
Affiliation(s)
- Isabelle Bernaquez
- Laboratoire de santé publique du Québec, Sainte-Anne-de-Bellevue, QC, H9X 3R5, Canada
| | - Christiane Gaudreau
- Microbiologie médicale et infectiologie, Centre Hospitalier de l’Université de Montréal (CHUM), Montreal, QC, H2X 3E4, Canada
- Département de microbiologie, infectiologie et immunologie, Université de Montréal, Montreal, QC, H3C 3J7, Canada
| | - Pierre A. Pilon
- Direction régionale de santé publique, Centre intégré universitaire de santé et de services sociaux du Centre-Sud-de-l’île-de-Montréal, Montreal, QC, H2L 4M1, Canada
- Département de médecine sociale et préventive, Université de Montréal, Montreal, QC, H3N 1X9, Canada
| | - Sadjia Bekal
- Laboratoire de santé publique du Québec, Sainte-Anne-de-Bellevue, QC, H9X 3R5, Canada
- Département de microbiologie, infectiologie et immunologie, Université de Montréal, Montreal, QC, H3C 3J7, Canada
- *Correspondence: Sadjia Bekal,
| |
Collapse
|
26
|
Mäklin T, Kallonen T, Alanko J, Samuelsen Ø, Hegstad K, Mäkinen V, Corander J, Heinz E, Honkela A. Bacterial genomic epidemiology with mixed samples. Microb Genom 2021; 7:000691. [PMID: 34779765 PMCID: PMC8743562 DOI: 10.1099/mgen.0.000691] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 09/13/2021] [Indexed: 11/18/2022] Open
Abstract
Genomic epidemiology is a tool for tracing transmission of pathogens based on whole-genome sequencing. We introduce the mGEMS pipeline for genomic epidemiology with plate sweeps representing mixed samples of a target pathogen, opening the possibility to sequence all colonies on selective plates with a single DNA extraction and sequencing step. The pipeline includes the novel mGEMS read binner for probabilistic assignments of sequencing reads, and the scalable pseudoaligner Themisto. We demonstrate the effectiveness of our approach using closely related samples in a nosocomial setting, obtaining results that are comparable to those based on single-colony picks. Our results lend firm support to more widespread consideration of genomic epidemiology with mixed infection samples.
Collapse
Affiliation(s)
- Tommi Mäklin
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Teemu Kallonen
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Jarno Alanko
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| | - Ørjan Samuelsen
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
- Department of Pharmacy, UT The Arctic University of Norway, Tromsø, Norway
| | - Kristin Hegstad
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
- Research group for Host-Microbe Interactions, Department of Medical Biology, Faculty of Health Sciences, UT The Arctic University of Norway, Tromsø, Norway
| | - Veli Mäkinen
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| | - Jukka Corander
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Eva Heinz
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Liverpool School of Tropical Medicine, Liverpool, UK
| | - Antti Honkela
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| |
Collapse
|
27
|
Mäklin T, Kallonen T, David S, Boinett CJ, Pascoe B, Méric G, Aanensen DM, Feil EJ, Baker S, Parkhill J, Sheppard SK, Corander J, Honkela A. High-resolution sweep metagenomics using fast probabilistic inference. Wellcome Open Res 2021; 5:14. [PMID: 34746439 PMCID: PMC8543175 DOI: 10.12688/wellcomeopenres.15639.2] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/30/2021] [Indexed: 01/13/2023] Open
Abstract
Determining the composition of bacterial communities beyond the level of a genus or species is challenging because of the considerable overlap between genomes representing close relatives. Here, we present the mSWEEP pipeline for identifying and estimating the relative sequence abundances of bacterial lineages from plate sweeps of enrichment cultures. mSWEEP leverages biologically grouped sequence assembly databases, applying probabilistic modelling, and provides controls for false positive results. Using sequencing data from major pathogens, we demonstrate significant improvements in lineage quantification and detection accuracy. Our pipeline facilitates investigating cultures comprising mixtures of bacteria, and opens up a new field of plate sweep metagenomics.
Collapse
Affiliation(s)
- Tommi Mäklin
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Teemu Kallonen
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Sophia David
- Centre for Genomic Pathogen Surveillance, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Christine J. Boinett
- Hospital for Tropical Diseases, Wellcome Trust Major Overseas Programme, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
- Centre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK
| | - Ben Pascoe
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Guillaume Méric
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - David M. Aanensen
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
- Department of Infectious Disease Epidemiology, Imperial College London, London, UK
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Edward J. Feil
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Stephen Baker
- Hospital for Tropical Diseases, Wellcome Trust Major Overseas Programme, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
- Centre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK
| | - Julian Parkhill
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Samuel K. Sheppard
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Jukka Corander
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Antti Honkela
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Department of Public Health, University of Helsinki, Helsinki, Finland
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| |
Collapse
|
28
|
Gomes-Neto JC, Pavlovikj N, Cano C, Abdalhamid B, Al-Ghalith GA, Loy JD, Knights D, Iwen PC, Chaves BD, Benson AK. Heuristic and Hierarchical-Based Population Mining of Salmonella enterica Lineage I Pan-Genomes as a Platform to Enhance Food Safety. FRONTIERS IN SUSTAINABLE FOOD SYSTEMS 2021. [DOI: 10.3389/fsufs.2021.725791] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The recent incorporation of bacterial whole-genome sequencing (WGS) into Public Health laboratories has enhanced foodborne outbreak detection and source attribution. As a result, large volumes of publicly available datasets can be used to study the biology of foodborne pathogen populations at an unprecedented scale. To demonstrate the application of a heuristic and agnostic hierarchical population structure guided pan-genome enrichment analysis (PANGEA), we used populations of S. enterica lineage I to achieve two main objectives: (i) show how hierarchical population inquiry at different scales of resolution can enhance ecological and epidemiological inquiries; and (ii) identify population-specific inferable traits that could provide selective advantages in food production environments. Publicly available WGS data were obtained from NCBI database for three serovars of Salmonella enterica subsp. enterica lineage I (S. Typhimurium, S. Newport, and S. Infantis). Using the hierarchical genotypic classifications (Serovar, BAPS1, ST, cgMLST), datasets from each of the three serovars showed varying degrees of clonal structuring. When the accessory genome (PANGEA) was mapped onto these hierarchical structures, accessory loci could be linked with specific genotypes. A large heavy-metal resistance mobile element was found in the Monophasic ST34 lineage of S. Typhimurium, and laboratory testing showed that Monophasic isolates have on average a higher degree of copper resistance than the Biphasic ones. In S. Newport, an extra sugE gene copy was found among most isolates of the ST45 lineage, and laboratory testing of multiple isolates confirmed that isolates of S. Newport ST45 were on average less sensitive to the disinfectant cetylpyridimium chloride than non-ST45 isolates. Lastly, data-mining of the accessory genomic content of S. Infantis revealed two cryptic Ecotypes with distinct accessory genomic content and distinct ecological patterns. Poultry appears to be the major reservoir for Ecotype 1, and temporal analysis further suggested a recent ecological succession, with Ecotype 2 apparently being displaced by Ecotype 1. Altogether, the use of a heuristic hierarchical-based population structure analysis that includes bacterial pan-genomes (core and accessory genomes) can (1) improve genomic resolution for mapping populations and accessing epidemiological patterns; and (2) define lineage-specific informative loci that may be associated with survival in the food chain.
Collapse
|
29
|
Tonkin-Hill G, Martincorena I, Amato R, Lawson ARJ, Gerstung M, Johnston I, Jackson DK, Park N, Lensing SV, Quail MA, Gonçalves S, Ariani C, Spencer Chapman M, Hamilton WL, Meredith LW, Hall G, Jahun AS, Chaudhry Y, Hosmillo M, Pinckert ML, Georgana I, Yakovleva A, Caller LG, Caddy SL, Feltwell T, Khokhar FA, Houldcroft CJ, Curran MD, Parmar S, The COVID-19 Genomics UK (COG-UK) Consortium, Alderton A, Nelson R, Harrison EM, Sillitoe J, Bentley SD, Barrett JC, Torok ME, Goodfellow IG, Langford C, Kwiatkowski D, Wellcome Sanger Institute COVID-19 Surveillance Team. Patterns of within-host genetic diversity in SARS-CoV-2. eLife 2021; 10:e66857. [PMID: 34387545 PMCID: PMC8363274 DOI: 10.7554/elife.66857] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 07/22/2021] [Indexed: 12/15/2022] Open
Abstract
Monitoring the spread of SARS-CoV-2 and reconstructing transmission chains has become a major public health focus for many governments around the world. The modest mutation rate and rapid transmission of SARS-CoV-2 prevents the reconstruction of transmission chains from consensus genome sequences, but within-host genetic diversity could theoretically help identify close contacts. Here we describe the patterns of within-host diversity in 1181 SARS-CoV-2 samples sequenced to high depth in duplicate. 95.1% of samples show within-host mutations at detectable allele frequencies. Analyses of the mutational spectra revealed strong strand asymmetries suggestive of damage or RNA editing of the plus strand, rather than replication errors, dominating the accumulation of mutations during the SARS-CoV-2 pandemic. Within- and between-host diversity show strong purifying selection, particularly against nonsense mutations. Recurrent within-host mutations, many of which coincide with known phylogenetic homoplasies, display a spectrum and patterns of purifying selection more suggestive of mutational hotspots than recombination or convergent evolution. While allele frequencies suggest that most samples result from infection by a single lineage, we identify multiple putative examples of co-infection. Integrating these results into an epidemiological inference framework, we find that while sharing of within-host variants between samples could help the reconstruction of transmission chains, mutational hotspots and rare cases of superinfection can confound these analyses.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Naomi Park
- Wellcome Sanger InstituteHinxtonUnited Kingdom
| | | | | | | | | | | | | | - Luke W Meredith
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Grant Hall
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Aminu S Jahun
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Yasmin Chaudhry
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Myra Hosmillo
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Malte L Pinckert
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Iliana Georgana
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Anna Yakovleva
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Laura G Caller
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Sarah L Caddy
- Department of Medicine, University of CambridgeCambridgeUnited Kingdom
| | - Theresa Feltwell
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Fahad A Khokhar
- Department of Medicine, University of CambridgeCambridgeUnited Kingdom
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of CambridgeCambridgeUnited Kingdom
| | | | | | | | | | | | | | - Ewan M Harrison
- Wellcome Sanger InstituteHinxtonUnited Kingdom
- European Bioinformatics InstituteHinxtonUnited Kingdom
| | | | | | | | - M Estee Torok
- Department of Medicine, University of CambridgeCambridgeUnited Kingdom
| | - Ian G Goodfellow
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | | | - Dominic Kwiatkowski
- Wellcome Sanger InstituteHinxtonUnited Kingdom
- Nuffield Department of Medicine, University of OxfordOxfordUnited Kingdom
| | | |
Collapse
|
30
|
Baik Y, Modongo C, Moonan PK, Click ES, Tobias JL, Boyd R, Finlay A, Oeltmann JE, Shin SS, Zetola NM. Possible Transmission Mechanisms of Mixed Mycobacterium tuberculosis Infection in High HIV Prevalence Country, Botswana. Emerg Infect Dis 2021; 26:953-960. [PMID: 32310078 PMCID: PMC7181944 DOI: 10.3201/eid2605.191638] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Tuberculosis caused by concurrent infection with multiple Mycobacteriumtuberculosis strains (i.e., mixed infection) challenges clinical and epidemiologic paradigms. We explored possible transmission mechanisms of mixed infection in a population-based, molecular epidemiology study in Botswana during 2012–2016. We defined mixed infection as multiple repeats of alleles at >2 loci within a discrete mycobacterial interspersed repetitive unit–variable-number tandem-repeat (MIRU-VNTR) result. We compared mixed infection MIRU-VNTR results with all study MIRU-VNTR results by considering all permutations at each multiple allele locus; matched MIRU-VNTR results were considered evidence of recently acquired strains and nonmatched to any other results were considered evidence of remotely acquired strains. Among 2,051 patients, 34 (1.7%) had mixed infection, of which 23 (68%) had recently and remotely acquired strains. This finding might support the mixed infection mechanism of recent transmission and simultaneous remote reactivation. Further exploration is needed to determine proportions of transmission mechanisms in settings where mixed infections are prevalent.
Collapse
|
31
|
Robert A, Funk S, Kucharski AJ. o2geosocial: Reconstructing who-infected-whom from routinely collected surveillance data. F1000Res 2021; 10:31. [PMID: 36998981 PMCID: PMC10044721.2 DOI: 10.12688/f1000research.28073.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/28/2021] [Indexed: 11/20/2022] Open
Abstract
Reconstructing the history of individual transmission events between cases is key to understanding what factors facilitate the spread of an infectious disease. Since conducting extended contact-tracing investigations can be logistically challenging and costly, statistical inference methods have been developed to reconstruct transmission trees from onset dates and genetic sequences. However, these methods are not as effective if the mutation rate of the virus is very slow, or if sequencing data is sparse. We developed the package o2geosocial to combine variables from routinely collected surveillance data with a simple transmission process model. The model reconstructs transmission trees when full genetic sequences are unavailable, or uninformative. Our model incorporates the reported age-group, onset date, location and genotype of infected cases to infer probabilistic transmission trees. The package also includes functions to summarise and visualise the inferred cluster size distribution. The results generated by o2geosocial can highlight regions where importations repeatedly caused large outbreaks, which may indicate a higher regional susceptibility to infections. It can also be used to generate the individual number of secondary transmissions, and show the features associated with individuals involved in high transmission events. The package is available for download from the Comprehensive R Archive Network (CRAN) and GitHub.
Collapse
Affiliation(s)
- Alexis Robert
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, WC1E 7HT, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, WC1E 7HT, UK
| | - Sebastian Funk
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, WC1E 7HT, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, WC1E 7HT, UK
| | - Adam J Kucharski
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, WC1E 7HT, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, WC1E 7HT, UK
| |
Collapse
|
32
|
Dawson D, Rasmussen D, Peng X, Lanzas C. Inferring environmental transmission using phylodynamics: a case-study using simulated evolution of an enteric pathogen. J R Soc Interface 2021; 18:20210041. [PMID: 34102084 DOI: 10.1098/rsif.2021.0041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Indirect (environmental) and direct (host-host) transmission pathways cannot easily be distinguished when they co-occur in epidemics, particularly when they occur on similar time scales. Phylodynamic reconstruction is a potential approach to this problem that combines epidemiological information (temporal, spatial information) with pathogen whole-genome sequencing data to infer transmission trees of epidemics. However, factors such as differences in mutation and transmission rates between host and non-host environments may obscure phylogenetic inference from these methods. In this study, we used a network-based transmission model that explicitly models pathogen evolution to simulate epidemics with both direct and indirect transmission. Epidemics were simulated according to factorial combinations of direct/indirect transmission proportions, host mutation rates and conditions of environmental pathogen growth. Transmission trees were then reconstructed using the phylodynamic approach SCOTTI (structured coalescent transmission tree inference) and evaluated. We found that although insufficient diversity sets a lower bound on when accurate phylodynamic inferences can be made, transmission routes and assumed pathogen lifestyle affected pathogen population structure and subsequently influenced both reconstruction success and the likelihood of direct versus indirect pathways being reconstructed. We conclude that prior knowledge of the likely ecology and population structure of pathogens in host and non-host environments is critical to fully using phylodynamic techniques.
Collapse
Affiliation(s)
- Daniel Dawson
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - David Rasmussen
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.,Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC, USA
| | - Xinxia Peng
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.,Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - Cristina Lanzas
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
33
|
Azarian T. The Importance of Pathogen Whole-Genome Sequencing in Evaluating Interventions to Reduce the Spread of Multidrug-Resistant Organisms in the Healthcare Setting. Clin Infect Dis 2021; 72:1888-1890. [PMID: 32505133 DOI: 10.1093/cid/ciaa724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 06/03/2020] [Indexed: 11/14/2022] Open
Affiliation(s)
- Taj Azarian
- Burnett School of Biomedical Sciences, University of Central Florida, Orlando, Florida, USA
| |
Collapse
|
34
|
Nigsch A, Robbe-Austerman S, Stuber TP, Pavinski Bitar PD, Gröhn YT, Schukken YH. Who infects whom?-Reconstructing infection chains of Mycobacterium avium ssp. paratuberculosis in an endemically infected dairy herd by use of genomic data. PLoS One 2021; 16:e0246983. [PMID: 33983941 PMCID: PMC8118464 DOI: 10.1371/journal.pone.0246983] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 04/30/2021] [Indexed: 12/18/2022] Open
Abstract
Recent evidence of circulation of multiple strains within herds and mixed infections of cows marks the beginning of a rethink of our knowledge on Mycobacterium avium ssp. paratuberculosis (MAP) epidemiology. Strain typing opens new ways to investigate MAP transmission. This work presents a method for reconstructing infection chains in a setting of endemic Johne’s disease on a well-managed dairy farm. By linking genomic data with demographic field data, strain-specific differences in spreading patterns could be quantified for a densely sampled dairy herd. Mixed infections of dairy cows with MAP are common, and some strains spread more successfully. Infected cows remain susceptible for co-infections with other MAP genotypes. The model suggested that cows acquired infection from 1–4 other cows and spread infection to 0–17 individuals. Reconstructed infection chains supported the hypothesis that high shedding animals that started to shed at an early age and showed a progressive infection pattern represented a greater risk for spreading MAP. Transmission of more than one genotype between animals was recorded. In this farm with a good MAP control management program, adult-to-adult contact was proposed as the most important transmission route to explain the reconstructed networks. For each isolate, at least one more likely ancestor could be inferred. Our study results help to capture underlying transmission processes and to understand the challenges of tracing MAP spread within a herd. Only the combination of precise longitudinal field data and bacterial strain type information made it possible to trace infection in such detail.
Collapse
Affiliation(s)
- Annette Nigsch
- Department of Animal Sciences, Wageningen University, Wageningen, The Netherlands
- * E-mail:
| | - Suelee Robbe-Austerman
- USDA APHIS National Veterinary Services Laboratories, Ames, Iowa, United States of America
| | - Tod P. Stuber
- USDA APHIS National Veterinary Services Laboratories, Ames, Iowa, United States of America
| | - Paulina D. Pavinski Bitar
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, United States of America
| | - Yrjö T. Gröhn
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, United States of America
| | - Ynte H. Schukken
- Department of Animal Sciences, Wageningen University, Wageningen, The Netherlands
- Royal GD, Deventer, The Netherlands
| |
Collapse
|
35
|
Colonization with multidrug-resistant Enterobacteriaceae among infants: an observational study in southern Sri Lanka. Antimicrob Resist Infect Control 2021; 10:72. [PMID: 33931120 PMCID: PMC8086278 DOI: 10.1186/s13756-021-00938-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 04/21/2021] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND The timing of and risk factors for intestinal colonization with multidrug-resistant Enterobacteriaceae (MDRE) are still poorly understood in areas with high MDRE carriage. We determined the prevalence, timing, and risk factors associated with MDRE intestinal colonization among infants in southern Sri Lanka. METHODS Women and their newborn children were enrolled within 48 h after delivery in southern Sri Lanka. Rectal swabs were collected from women and infants at enrollment and 4-6 weeks later. Enterobacteriaceae were isolated and identified as MDRE (positive for extended-spectrum β-lactamases or carbapenem resistant) using standard microbiologic procedures. We used exact methods (Fisher's exact and Kruskal-Wallis tests) and multivariable logistic regression to identify sociodemographic and clinical features associated with MDRE intestinal colonization. Whole-genome sequencing was performed on selected MDRE isolates to identify phylogroups and antibiotic resistance-encoding genes were identified with NCBI's AMRfinder tool. RESULTS Overall, 199 post-partum women and 199 infants were enrolled; 148/199 (74.4%) women and 151/199 (75.9%) infants were reassessed later in the community. Twenty-four/199 (12.1%) women and 3/199 (1.5%) infants displayed intestinal colonization with MDRE at enrollment, while 26/148 (17.6%) women and 24/151 (15.9%) infants displayed intestinal colonization with MDRE at the reassessment. While there were no risk factors associated with infant colonization at enrollment, multivariable analysis indicated that risk factors for infant colonization at reassessment included mother colonized at enrollment (aOR = 3.62) or reassessment (aOR = 4.44), delivery by Cesarean section (aOR = 2.91), and low birth weight (aOR = 5.39). Of the 20 MDRE isolates from infants that were sequenced, multilocus sequence typing revealed that 6/20 (30%) were clustered on the same branch as MDRE isolates found in the respective mothers. All sequenced isolates for mothers (47) and infants (20) had at least one ESBL-producing gene. Genes encoding fosfomycin resistance were found in 33/47 (70%) of mothers' isolates and 16/20 (80%) of infants' isolates and genes encoding resistance to colistin were found in one (2%) mother's isolate. CONCLUSIONS Our results suggest that a substantial proportion of infants undergo MDRE intestinal colonization within 6 weeks of birth, potentially due to postnatal rather than intranatal transmission.
Collapse
|
36
|
Valesano AL, Rumfelt KE, Dimcheff DE, Blair CN, Fitzsimmons WJ, Petrie JG, Martin ET, Lauring AS. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. PLoS Pathog 2021; 17:e1009499. [PMID: 33826681 PMCID: PMC8055005 DOI: 10.1371/journal.ppat.1009499] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 04/19/2021] [Accepted: 03/24/2021] [Indexed: 01/12/2023] Open
Abstract
Analysis of SARS-CoV-2 genetic diversity within infected hosts can provide insight into the generation and spread of new viral variants and may enable high resolution inference of transmission chains. However, little is known about temporal aspects of SARS-CoV-2 intrahost diversity and the extent to which shared diversity reflects convergent evolution as opposed to transmission linkage. Here we use high depth of coverage sequencing to identify within-host genetic variants in 325 specimens from hospitalized COVID-19 patients and infected employees at a single medical center. We validated our variant calling by sequencing defined RNA mixtures and identified viral load as a critical factor in variant identification. By leveraging clinical metadata, we found that intrahost diversity is low and does not vary by time from symptom onset. This suggests that variants will only rarely rise to appreciable frequency prior to transmission. Although there was generally little shared variation across the sequenced cohort, we identified intrahost variants shared across individuals who were unlikely to be related by transmission. These variants did not precede a rise in frequency in global consensus genomes, suggesting that intrahost variants may have limited utility for predicting future lineages. These results provide important context for sequence-based inference in SARS-CoV-2 evolution and epidemiology.
Collapse
Affiliation(s)
- Andrew L. Valesano
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Kalee E. Rumfelt
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Derek E. Dimcheff
- Division of Hospital Medicine, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Christopher N. Blair
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - William J. Fitzsimmons
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Joshua G. Petrie
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Emily T. Martin
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Adam S. Lauring
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
37
|
Valesano AL, Rumfelt KE, Dimcheff DE, Blair CN, Fitzsimmons WJ, Petrie JG, Martin ET, Lauring AS. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.01.19.427330. [PMID: 33501443 PMCID: PMC7836113 DOI: 10.1101/2021.01.19.427330] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Analysis of SARS-CoV-2 genetic diversity within infected hosts can provide insight into the generation and spread of new viral variants and may enable high resolution inference of transmission chains. However, little is known about temporal aspects of SARS-CoV-2 intrahost diversity and the extent to which shared diversity reflects convergent evolution as opposed to transmission linkage. Here we use high depth of coverage sequencing to identify within-host genetic variants in 325 specimens from hospitalized COVID-19 patients and infected employees at a single medical center. We validated our variant calling by sequencing defined RNA mixtures and identified a viral load threshold that minimizes false positives. By leveraging clinical metadata, we found that intrahost diversity is low and does not vary by time from symptom onset. This suggests that variants will only rarely rise to appreciable frequency prior to transmission. Although there was generally little shared variation across the sequenced cohort, we identified intrahost variants shared across individuals who were unlikely to be related by transmission. These variants did not precede a rise in frequency in global consensus genomes, suggesting that intrahost variants may have limited utility for predicting future lineages. These results provide important context for sequence-based inference in SARS-CoV-2 evolution and epidemiology.
Collapse
Affiliation(s)
- Andrew L. Valesano
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Kalee E. Rumfelt
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Derek E. Dimcheff
- Division of Hospital Medicine, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Christopher N. Blair
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - William J. Fitzsimmons
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Joshua G. Petrie
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Emily T. Martin
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Adam S. Lauring
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
38
|
McLean K, Balada-Llasat JM, Waalkes A, Pancholi P, Salipante SJ. Whole-genome sequencing of clinical Clostridioides difficile isolates reveals molecular epidemiology and discrepancies with conventional laboratory diagnostic testing. J Hosp Infect 2020; 108:64-71. [PMID: 33227298 DOI: 10.1016/j.jhin.2020.11.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 10/06/2020] [Accepted: 11/16/2020] [Indexed: 12/29/2022]
Abstract
BACKGROUND The high clinical burden of Clostridioides difficile infections merits rapid and sensitive identification of affected individuals. However, effective diagnosis remains challenging. Current best practice guidelines recommend molecular and/or direct toxin detection-based screening for symptomatic individuals, but previous work has called into question the concordance and performance of extant clinical assays. AIM To better correlate the genomic and phenotypic properties of clinical C. difficile isolates with laboratory testing outcomes in both C. difficile-infected patients and asymptomatic carriers. METHODS Whole-genome sequencing of clinical C. difficile isolates collected from an inpatient population at a single healthcare institution was performed, enabling examination of their molecular epidemiology and toxigenic gene content. Genomic findings were compared with clinical testing outcomes, identifying multiple diagnostic discrepancies. FINDINGS Toxigenic culture, considered a 'reference standard', provided perfect sensitivity and specificity in predicting toxigenic gene content, whereas reduced performance was observed for Simplexa C. difficile Direct Assay (100% specificity, 88% sensitivity), Gene Xpert CD/Epi Assay (86% specificity, 83% sensitivity), and Quick Check Complete Tox A/B (100% specificity, 30% sensitivity). Genomic analysis additionally revealed variability in toxin gene sequences among C. difficile strains, phylogenomic equivalency between isolates from affected patients and carriers, and patient carriage with uncommon environmentally derived C. difficile lineages, as well as presenting opportunities for tracing pathogen transmission events. CONCLUSION These results highlight the variable performance of clinical stool-based testing approaches as well as the potential diagnostic utility of whole-genome sequencing as an alternative to conventional testing algorithms.
Collapse
Affiliation(s)
- K McLean
- University of Washington Department of Laboratory Medicine, Seattle, WA, USA
| | - J-M Balada-Llasat
- Ohio State University Wexner Medical Center, Department of Pathology, Columbus, OH, USA
| | - A Waalkes
- University of Washington Department of Laboratory Medicine, Seattle, WA, USA
| | - P Pancholi
- Ohio State University Wexner Medical Center, Department of Pathology, Columbus, OH, USA.
| | - S J Salipante
- University of Washington Department of Laboratory Medicine, Seattle, WA, USA.
| |
Collapse
|
39
|
Montazeri H, Little S, Legha MM, Beerenwinkel N, DeGruttola V. Bayesian reconstruction of transmission trees from genetic sequences and uncertain infection times. Stat Appl Genet Mol Biol 2020; 19:/j/sagmb.ahead-of-print/sagmb-2019-0026/sagmb-2019-0026.xml. [PMID: 33085643 PMCID: PMC8212962 DOI: 10.1515/sagmb-2019-0026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Accepted: 09/16/2020] [Indexed: 11/15/2022]
Abstract
Genetic sequence data of pathogens are increasingly used to investigate transmission dynamics in both endemic diseases and disease outbreaks. Such research can aid in the development of appropriate interventions and in the design of studies to evaluate them. Several computational methods have been proposed to infer transmission chains from sequence data; however, existing methods do not generally reliably reconstruct transmission trees because genetic sequence data or inferred phylogenetic trees from such data contain insufficient information for accurate estimation of transmission chains. Here, we show by simulation studies that incorporating infection times, even when they are uncertain, can greatly improve the accuracy of reconstruction of transmission trees. To achieve this improvement, we propose a Bayesian inference methods using Markov chain Monte Carlo that directly draws samples from the space of transmission trees under the assumption of complete sampling of the outbreak. The likelihood of each transmission tree is computed by a phylogenetic model by treating its internal nodes as transmission events. By a simulation study, we demonstrate that accuracy of the reconstructed transmission trees depends mainly on the amount of information available on times of infection; we show superiority of the proposed method to two alternative approaches when infection times are known up to specified degrees of certainty. In addition, we illustrate the use of a multiple imputation framework to study features of epidemic dynamics, such as the relationship between characteristics of nodes and average number of outbound edges or inbound edges, signifying possible transmission events from and to nodes. We apply the proposed method to a transmission cluster in San Diego and to a dataset from the 2014 Sierra Leone Ebola virus outbreak and investigate the impact of biological, behavioral, and demographic factors.
Collapse
Affiliation(s)
- Hesam Montazeri
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Susan Little
- Department of Medicine, University of California San Diego, California, USA
| | - Mozhgan Mozaffari Legha
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | | |
Collapse
|
40
|
Boskova V, Stadler T. PIQMEE: Bayesian Phylodynamic Method for Analysis of Large Data Sets with Duplicate Sequences. Mol Biol Evol 2020; 37:3061-3075. [PMID: 32492139 PMCID: PMC7530608 DOI: 10.1093/molbev/msaa136] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Next-generation sequencing of pathogen quasispecies within a host yields data sets of tens to hundreds of unique sequences. However, the full data set often contains thousands of sequences, because many of those unique sequences have multiple identical copies. Data sets of this size represent a computational challenge for currently available Bayesian phylogenetic and phylodynamic methods. Through simulations, we explore how large data sets with duplicate sequences affect the speed and accuracy of phylogenetic and phylodynamic analysis within BEAST 2. We show that using unique sequences only leads to biases, and using a random subset of sequences yields imprecise parameter estimates. To overcome these shortcomings, we introduce PIQMEE, a BEAST 2 add-on that produces reliable parameter estimates from full data sets with increased computational efficiency as compared with the currently available methods within BEAST 2. The principle behind PIQMEE is to resolve the tree structure of the unique sequences only, while simultaneously estimating the branching times of the duplicate sequences. Distinguishing between unique and duplicate sequences allows our method to perform well even for very large data sets. Although the classic method converges poorly for data sets of 6,000 sequences when allowed to run for 7 days, our method converges in slightly more than 1 day. In fact, PIQMEE can handle data sets of around 21,000 sequences with 20 unique sequences in 14 days. Finally, we apply the method to a real, within-host HIV sequencing data set with several thousand sequences per patient.
Collapse
Affiliation(s)
- Veronika Boskova
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Switzerland
- Center for Integrative Bioinformatics Vienna, Max Perutz Labs, University of Vienna and Medical University of Vienna, Vienna, Austria
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Switzerland
| |
Collapse
|
41
|
McKew G, Ramsperger M, Cheong E, Gottlieb T, Sintchenko V, O'Sullivan M. Hospital MRSA outbreaks: Multiplex PCR-reverse line blot binary typing as a screening method for WGS, and the role of the environment in transmission. Infect Dis Health 2020; 25:268-276. [PMID: 32616448 DOI: 10.1016/j.idh.2020.05.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 04/30/2020] [Accepted: 05/11/2020] [Indexed: 11/30/2022]
Abstract
BACKGROUND Whole-genome sequencing (WGS) can provide useful information on methicillin-resistant Staphylococcus aureus (MRSA) transmission in hospitals. However, it is expensive and laborious, especially for environmental screening programs which generate large numbers of isolates. Multiplex PCR-reverse line blot binary typing (mPCR-RLB BT) is a rapid, high throughput, inexpensive typing method which could be useful to screen isolates for WGS. This study assessed the strategy of screening isolates with mPCR-RLB BT to reduce the need for WGS; and to assess the role of the environment in transmission. METHODS MRSA transmission in a Burns Unit and its related Intensive Care Unit was studied. mPCR-RLB BT was performed on 238 isolates; this, combined with epidemiological data, was used to choose 97 isolates for WGS. RESULTS Relationships between isolates by WGS demonstrated several outbreaks. There was a significant contribution of environmental isolates to transmission, and several problem areas were identified. There was a substantial cost saving from screening isolates with mPCR-RLB BT. CONCLUSIONS The use of an inexpensive, rapid screening method for MRSA typing is useful to reduce expenditure and time spent on hospital infection control programs, and reductions are likely to be even more considerable in a non-outbreak setting. Environmental screening and WGS are useful to determine exact sources of transmission to focus mitigation strategies.
Collapse
Affiliation(s)
- Genevieve McKew
- Department of Microbiology and Infectious Diseases Concord Repatriation General Hospital, NSW Health Pathology, Concord, 2139, Australia; Faculty of Medicine and Health, The University of Sydney, Camperdown, 2006, Australia.
| | - Marc Ramsperger
- Faculty of Medicine and Health, The University of Sydney, Camperdown, 2006, Australia; Centre for Infectious Diseases & Microbiology Laboratory Services Westmead Hospital, Westmead, 2145, Australia
| | - Elaine Cheong
- Department of Microbiology and Infectious Diseases Concord Repatriation General Hospital, NSW Health Pathology, Concord, 2139, Australia; Faculty of Medicine and Health, The University of Sydney, Camperdown, 2006, Australia
| | - Thomas Gottlieb
- Department of Microbiology and Infectious Diseases Concord Repatriation General Hospital, NSW Health Pathology, Concord, 2139, Australia; Faculty of Medicine and Health, The University of Sydney, Camperdown, 2006, Australia
| | - Vitali Sintchenko
- Faculty of Medicine and Health, The University of Sydney, Camperdown, 2006, Australia; Centre for Infectious Diseases & Microbiology Laboratory Services Westmead Hospital, Westmead, 2145, Australia
| | - Matthew O'Sullivan
- Faculty of Medicine and Health, The University of Sydney, Camperdown, 2006, Australia; Centre for Infectious Diseases & Microbiology Laboratory Services Westmead Hospital, Westmead, 2145, Australia
| |
Collapse
|
42
|
Moustafa AM, Lal A, Planet PJ. Comparative genomics in infectious disease. Curr Opin Microbiol 2020; 53:61-70. [PMID: 32248056 DOI: 10.1016/j.mib.2020.02.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Revised: 02/23/2020] [Accepted: 02/24/2020] [Indexed: 02/07/2023]
Abstract
With more than one million bacterial genome sequences uploaded to public databases in the last 25 years, genomics has become a powerful tool for studying bacterial biology. Here, we review recent approaches that leverage large numbers of whole genome sequences to decipher the spread and pathogenesis of bacterial infectious diseases.
Collapse
Affiliation(s)
- Ahmed M Moustafa
- Division of Pediatric Infectious Diseases, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Arnav Lal
- School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Paul J Planet
- Division of Pediatric Infectious Diseases, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pediatrics, Perelman College of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA.
| |
Collapse
|
43
|
Mak L, Perera D, Lang R, Kossinna P, He J, Gill MJ, Long Q, van Marle G. Evaluation of A Phylogenetic Pipeline to Examine Transmission Networks in A Canadian HIV Cohort. Microorganisms 2020; 8:E196. [PMID: 32023939 PMCID: PMC7074708 DOI: 10.3390/microorganisms8020196] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 01/23/2020] [Accepted: 01/29/2020] [Indexed: 01/08/2023] Open
Abstract
Keywords: HIV; Canada; molecular phylogenetics; viral evolution; person-to-person transmission inference; transmission network; summary statistics.
Collapse
Affiliation(s)
- Lauren Mak
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - Deshan Perera
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - Raynell Lang
- Department of Medicine, Cumming School of Medicine, University of Calgary and Alberta Health Services, Calgary, AB T2N 4N1, Canada
| | - Pathum Kossinna
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - Jingni He
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - M. John Gill
- Department of Medicine, Cumming School of Medicine, University of Calgary and Alberta Health Services, Calgary, AB T2N 4N1, Canada
| | - Quan Long
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
- Department of Medical Genetics, and Mathematics & Statistics, Alberta Children’s Hospital Research Institute, O’Brien Institute for Public Health, University of Calgary, Calgary, AB T2N 4N1, Canada
- Department of Mathematics & Statistics, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Guido van Marle
- Department of Microbiology, Immunology, and Infectious Diseases, Cumming School of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada
| |
Collapse
|
44
|
San JE, Baichoo S, Kanzi A, Moosa Y, Lessells R, Fonseca V, Mogaka J, Power R, de Oliveira T. Current Affairs of Microbial Genome-Wide Association Studies: Approaches, Bottlenecks and Analytical Pitfalls. Front Microbiol 2020; 10:3119. [PMID: 32082269 PMCID: PMC7002396 DOI: 10.3389/fmicb.2019.03119] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 12/24/2019] [Indexed: 12/12/2022] Open
Abstract
Microbial genome-wide association studies (mGWAS) are a new and exciting research field that is adapting human GWAS methods to understand how variations in microbial genomes affect host or pathogen phenotypes, such as drug resistance, virulence, host specificity and prognosis. Several computational tools and methods have been developed or adapted from human GWAS to facilitate the discovery of novel mutations and structural variations that are associated with the phenotypes of interest. However, no comprehensive, end-to-end, user-friendly tool is currently available. The development of a broadly applicable pipeline presents a real opportunity among computational biologists. Here, (i) we review the prominent and promising tools, (ii) discuss analytical pitfalls and bottlenecks in mGWAS, (iii) provide insights into the selection of appropriate tools, (iv) highlight the gaps that still need to be filled and how users and developers can work together to overcome these bottlenecks. Use of mGWAS research can inform drug repositioning decisions as well as accelerate the discovery and development of more effective vaccines and antimicrobials for pressing infectious diseases of global health significance, such as HIV, TB, influenza, and malaria.
Collapse
Affiliation(s)
- James Emmanuel San
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Shakuntala Baichoo
- Department of Digital Technologies, FoICDT, University of Mauritius, Réduit, Mauritius
| | - Aquillah Kanzi
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Yumna Moosa
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Richard Lessells
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Vagner Fonseca
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
- Laboratório de Genética Celular e Molecular, ICB, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - John Mogaka
- Discipline of Public Health, University of Kwazulu-Natal, Durban, South Africa
| | - Robert Power
- St Edmund Hall, Oxford University, Oxford, United Kingdom
| | - Tulio de Oliveira
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
- Department of Global Health, University of Washington, Seattle, WA, United States
| |
Collapse
|
45
|
Mäklin T, Kallonen T, David S, Boinett CJ, Pascoe B, Méric G, Aanensen DM, Feil EJ, Baker S, Parkhill J, Sheppard SK, Corander J, Honkela A. High-resolution sweep metagenomics using fast probabilistic inference. Wellcome Open Res 2020; 5:14. [PMID: 34746439 PMCID: PMC8543175 DOI: 10.12688/wellcomeopenres.15639.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/02/2020] [Indexed: 12/29/2022] Open
Abstract
Determining the composition of bacterial communities beyond the level of a genus or species is challenging because of the considerable overlap between genomes representing close relatives. Here, we present the mSWEEP pipeline for identifying and estimating the relative sequence abundances of bacterial lineages from plate sweeps of enrichment cultures. mSWEEP leverages biologically grouped sequence assembly databases, applying probabilistic modelling, and provides controls for false positive results. Using sequencing data from major pathogens, we demonstrate significant improvements in lineage quantification and detection accuracy. Our pipeline facilitates investigating cultures comprising mixtures of bacteria, and opens up a new field of plate sweep metagenomics.
Collapse
Affiliation(s)
- Tommi Mäklin
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Teemu Kallonen
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Sophia David
- Centre for Genomic Pathogen Surveillance, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Christine J. Boinett
- Hospital for Tropical Diseases, Wellcome Trust Major Overseas Programme, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
- Centre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK
| | - Ben Pascoe
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Guillaume Méric
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - David M. Aanensen
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
- Department of Infectious Disease Epidemiology, Imperial College London, London, UK
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Edward J. Feil
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Stephen Baker
- Hospital for Tropical Diseases, Wellcome Trust Major Overseas Programme, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
- Centre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK
| | - Julian Parkhill
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Samuel K. Sheppard
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Jukka Corander
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Antti Honkela
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Department of Public Health, University of Helsinki, Helsinki, Finland
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| |
Collapse
|
46
|
Crellen T, Turner P, Pol S, Baker S, Nguyen Thi Nguyen T, Stoesser N, Day NPJ, Turner C, Cooper BS. Transmission dynamics and control of multidrug-resistant Klebsiella pneumoniae in neonates in a developing country. eLife 2019; 8:e50468. [PMID: 31793878 PMCID: PMC6977969 DOI: 10.7554/elife.50468] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Accepted: 11/26/2019] [Indexed: 12/11/2022] Open
Abstract
Multidrug-resistant Klebsiella pneumoniae is an increasing cause of infant mortality in developing countries. We aimed to develop a quantitative understanding of the drivers of this epidemic by estimating the effects of antibiotics on nosocomial transmission risk, comparing competing hypotheses about mechanisms of spread, and quantifying the impact of potential interventions. Using a sequence of dynamic models, we analysed data from a one-year prospective carriage study in a Cambodian neonatal intensive care unit with hyperendemic third-generation cephalosporin-resistant K. pneumoniae. All widely-used antibiotics except imipenem were associated with an increased daily acquisition risk, with an odds ratio for the most common combination (ampicillin + gentamicin) of 1.96 (95% CrI 1.18, 3.36). Models incorporating genomic data found that colonisation pressure was associated with a higher transmission risk, indicated sequence type heterogeneity in transmissibility, and showed that within-ward transmission was insufficient to maintain endemicity. Simulations indicated that increasing the nurse-patient ratio could be an effective intervention.
Collapse
Affiliation(s)
- Thomas Crellen
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical MedicineMahidol UniversityBangkokThailand
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
| | - Paul Turner
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical MedicineMahidol UniversityBangkokThailand
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
- Cambodia-Oxford Medical Research UnitAngkor Hospital for ChildrenSiem ReapCambodia
| | - Sreymom Pol
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical MedicineMahidol UniversityBangkokThailand
- Cambodia-Oxford Medical Research UnitAngkor Hospital for ChildrenSiem ReapCambodia
| | - Stephen Baker
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
- Oxford University Clinical Research UnitCentre for Tropical MedicineHo Chi Minh CityViet Nam
| | - To Nguyen Thi Nguyen
- Oxford University Clinical Research UnitCentre for Tropical MedicineHo Chi Minh CityViet Nam
| | - Nicole Stoesser
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
| | - Nicholas PJ Day
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical MedicineMahidol UniversityBangkokThailand
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
| | - Claudia Turner
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical MedicineMahidol UniversityBangkokThailand
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
- Cambodia-Oxford Medical Research UnitAngkor Hospital for ChildrenSiem ReapCambodia
| | - Ben S Cooper
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical MedicineMahidol UniversityBangkokThailand
- Nuffield Department of MedicineUniversity of OxfordOxfordUnited Kingdom
| |
Collapse
|
47
|
Tosas Auguet O, Stabler RA, Betley J, Preston MD, Dhaliwal M, Gaunt M, Ioannou A, Desai N, Karadag T, Batra R, Otter JA, Marbach H, Clark TG, Edgeworth JD. Frequent Undetected Ward-Based Methicillin-Resistant Staphylococcus aureus Transmission Linked to Patient Sharing Between Hospitals. Clin Infect Dis 2019; 66:840-848. [PMID: 29095965 PMCID: PMC5850096 DOI: 10.1093/cid/cix901] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Accepted: 11/16/2017] [Indexed: 12/04/2022] Open
Abstract
Background Recent evidence suggests that hospital transmission of methicillin-resistant Staphylococcus aureus (MRSA) is uncommon in UK centers that have implemented sustained infection control programs. We investigated whether a healthcare-network analysis could shed light on transmission paths currently sustaining MRSA levels in UK hospitals. Methods A cross-sectional observational study was performed in 2 National Health Service hospital groups and a general district hospital in Southeast London. All MRSA patients identified at inpatient, outpatient, and community settings between 1 November 2011 and 29 February 2012 were included. We identified genetically defined MRSA transmission clusters in individual hospitals and across the healthcare network, and examined genetic differentiation of sequence type (ST) 22 MRSA isolates within and between hospitals and inpatient or outpatient and community settings, as informed by average and median pairwise single-nucleotide polymorphisms (SNPs) and SNP-based proportions of nearly identical isolates. Results Two hundred forty-eight of 610 (40.7%) MRSA patients were linked in 90 transmission clusters, of which 27 spanned multiple hospitals. Analysis of a large 32 patient ST22-MRSA cluster showed that 26 of 32 patients (81.3%) had multiple contacts with one another during ward stays at any hospital. No residential, outpatient, or significant community healthcare contacts were identified. Genetic differentiation between ST22 MRSA inpatient isolates from different hospitals was less than between inpatient isolates from the same hospitals (P ≤ .01). Conclusions There is evidence of frequent ward-based transmission of MRSA brought about by frequent patient admissions to multiple hospitals. Limiting in-ward transmission requires sharing of MRSA status data between hospitals.
Collapse
Affiliation(s)
- Olga Tosas Auguet
- Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, King's College London and Guy's and St Thomas' NHS Foundation Trust.,Oxford Health Systems Collaboration, Centre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford
| | - Richard A Stabler
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine
| | - Jason Betley
- Illumina, Cambridge Ltd, Chesterford Research Park, Little Chesterford, Essex
| | - Mark D Preston
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine
| | - Mandeep Dhaliwal
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine
| | - Michael Gaunt
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine
| | - Avgousta Ioannou
- Illumina, Cambridge Ltd, Chesterford Research Park, Little Chesterford, Essex
| | - Nergish Desai
- Department of Medical Microbiology, King's College Hospital NHS Foundation Trust
| | - Tacim Karadag
- Department of Microbiology, University Hospital Lewisham, Lewisham and Greenwich NHS Trust
| | - Rahul Batra
- Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, King's College London and Guy's and St Thomas' NHS Foundation Trust
| | - Jonathan A Otter
- Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, King's College London and Guy's and St Thomas' NHS Foundation Trust.,National Institute for Health Research Health Protection Research Unit in Healthcare-Associated Infections and Antimicrobial Resistance at Imperial College London, and Imperial College Healthcare NHS Trust, Infection Prevention and Control
| | - Helene Marbach
- Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, King's College London and Guy's and St Thomas' NHS Foundation Trust
| | - Taane G Clark
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine.,Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Jonathan D Edgeworth
- Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, King's College London and Guy's and St Thomas' NHS Foundation Trust
| |
Collapse
|
48
|
Amambua-Ngwa A, Jeffries D, Mwesigwa J, Seedy-Jawara A, Okebe J, Achan J, Drakeley C, Volkman S, D'Alessandro U. Long-distance transmission patterns modelled from SNP barcodes of Plasmodium falciparum infections in The Gambia. Sci Rep 2019; 9:13515. [PMID: 31534181 PMCID: PMC6751170 DOI: 10.1038/s41598-019-49991-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 08/12/2019] [Indexed: 02/04/2023] Open
Abstract
Malaria has declined significantly in The Gambia and determining transmission dynamics of Plasmodium falciparum can help targeting control interventions towards elimination. This can be inferred from genetic similarity between parasite isolates from different sites and timepoints. Here, we imposed a P. falciparum life cycle time on a genetic distance likelihood model to determine transmission paths from a 54 SNP barcode of 355 isolates. Samples were collected monthly during the 2013 malaria season from six pairs of villages spanning 300 km from western to eastern Gambia. There was spatial and temporal hierarchy in pairwise genetic relatedness, with the most similar barcodes from isolates within the same households and village. Constrained by travel data, the model detected 60 directional transmission events, with 27% paths linking persons from different regions. We identified 13 infected individuals (4.2% of those genotyped) responsible for 2 to 8 subsequent infections within their communities. These super-infectors were mostly from high transmission villages. When considering paths between isolates from the most distant regions (west vs east) and travel history, there were 3 transmission paths from eastern to western Gambia, all at the peak (October) of the malaria transmission season. No paths with known travel originated from the extreme west to east. Although more than half of all paths were within-village, parasite flow from east to west may contribute to maintain transmission in western Gambia, where malaria transmission is already low. Therefore, interrupting malaria transmission in western Gambia would require targeting eastern Gambia, where malaria prevalence is substantially higher, with intensified malaria interventions.
Collapse
Affiliation(s)
- Alfred Amambua-Ngwa
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia.
| | - David Jeffries
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia
| | - Julia Mwesigwa
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia
| | - Aminata Seedy-Jawara
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia
| | - Joseph Okebe
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia
| | - Jane Achan
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia
| | - Chris Drakeley
- London School of Hygiene and tropical Medicine, London, UK
| | - Sarah Volkman
- Harvard School of Public Health, Boston, Massachusetts, USA
| | - Umberto D'Alessandro
- Medical Research Council Unit The Gambia at London School of Hygiene and Tropical Medicine, Banjul, The Gambia.,London School of Hygiene and tropical Medicine, London, UK
| |
Collapse
|
49
|
Fujikura Y, Hamamoto T, Kanayama A, Kaku K, Yamagishi J, Kawana A. Bayesian reconstruction of a vancomycin-resistant Enterococcus transmission route using epidemiologic data and genomic variants from whole genome sequencing. J Hosp Infect 2019; 103:395-403. [PMID: 31425718 DOI: 10.1016/j.jhin.2019.08.011] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Accepted: 08/12/2019] [Indexed: 11/16/2022]
Abstract
BACKGROUND Outbreaks of vancomycin-resistant enterococcus (VRE) are a serious problem in hospitals. Inferring the transmission route is an important factor to institute appropriate infection control measures; however, the methodology has not been fully established. AIM To reconstruct and evaluate the transmission model using sequence variants extracted from whole genome sequencing (WGS) data and epidemiological information from patients involved in a VRE outbreak. METHODS During a VRE outbreak in our hospital, 23 samples were collected from patients and environmental surfaces and analysed using WGS. By combining genome alignment information with patient epidemiological data, the VRE transmission route was reconstructed using a Bayesian approach. With the transmission model, evaluation and further analyses were performed to identify risk factors that contributed to the outbreak. FINDINGS All VREs were identified as Enterococcus faecium belonging to sequence type 17, which consisted of two VRE genotypes: vanA (N = 8, including one environmental sample) and vanB (N = 15). The reconstruction model using the Bayesian approach showed the transmission direction with posterior probability and revealed transmission through an environmental surface. In addition, some cases acting as VRE spreaders were identified, which can interfere with appropriate infection control. Vancomycin administration was identified as a significant risk factor for spreaders. CONCLUSION A Bayesian approach for transmission route reconstruction using epidemiologic data and genomic variants from WGS can be applied in actual VRE outbreaks. This may contribute to the design and implementation of effective infection control measures.
Collapse
Affiliation(s)
- Y Fujikura
- Department of Medical Risk Management and Infection Control, National Defense Medical College Hospital, Saitama, Japan; Division of Infectious Diseases and Respiratory Medicine, Department of Internal Medicine, National Defense Medical College, Saitama, Japan.
| | - T Hamamoto
- Department of Clinical Laboratory, National Defense Medical College Hospital, Saitama, Japan
| | - A Kanayama
- Division of Infectious Diseases Epidemiology and Control, National Defense Medical College Research Institute, Saitama, Japan
| | - K Kaku
- Division of Infectious Diseases Epidemiology and Control, National Defense Medical College Research Institute, Saitama, Japan
| | - J Yamagishi
- Research Center for Zoonosis Control, Hokkaido University, Sapporo, Japan
| | - A Kawana
- Division of Infectious Diseases and Respiratory Medicine, Department of Internal Medicine, National Defense Medical College, Saitama, Japan
| |
Collapse
|
50
|
Direct transmission of within-host Mycobacterium tuberculosis diversity to secondary cases can lead to variable between-host heterogeneity without de novo mutation: A genomic investigation. EBioMedicine 2019; 47:293-300. [PMID: 31420303 PMCID: PMC6796532 DOI: 10.1016/j.ebiom.2019.08.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Revised: 08/02/2019] [Accepted: 08/04/2019] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Whole genome sequencing (WGS) has enabled the development of new approaches to track Mycobacterium tuberculosis (Mtb) transmission between tuberculosis (TB) cases but its utility may be challenged by the discovery that Mtb diversifies within hosts. Nevertheless, there is limited data on the presence and degree of within-host evolution. METHODS We profiled a well-documented Mtb transmission cluster with three pulmonary TB cases to investigate within-host evolution and describe its impact on recent transmission estimates. We used deep sequencing to track minority allele frequencies (<50·0% abundance) during transmission and standard treatment. FINDINGS Pre-treatment (n = 3) and serial samples collected over 2 months of antibiotic treatment (n = 16) from all three cases were analysed. Consistent with the epidemiological data, zero fixed SNP separated all genomes. However, we identified six subclones between the three cases with an allele frequency ranging from 35·0% to 100·0% across sampling intervals. Five subclones were identified within the index case pre-treatment and shared with one secondary case, while only the dominant clone was observed in the other secondary case. By tracking the frequency of these heterogeneous alleles over the two-month therapy, we observed distinct signatures of drift and negative selection, but limited evidence for de novo mutations, even under drug pressure. INTERPRETATION We document within-host Mtb diversity in an index case, which led to transmission of minority alleles to a secondary case. Incorporating data on heterogeneous alleles may refine our understanding of Mtb transmission dynamics. However, more evidence is needed on the role of transmission bottleneck on observed heterogeneity between cases.
Collapse
|