1
|
Yang Q, Wang X, Han M, Sheng H, Sun Y, Su L, Lu W, Li M, Wang S, Chen J, Cui S, Yang BW. Bacterial genome-wide association studies: exploring the genetic variation underlying bacterial phenotypes. Appl Environ Microbiol 2025:e0251224. [PMID: 40377303 DOI: 10.1128/aem.02512-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/18/2025] Open
Abstract
With the continuous advancements in high-throughput genome sequencing technologies and the development of innovative bioinformatics tools, bacterial genome-wide association studies (BGWAS) have emerged as a transformative approach for investigating the genetic variations underlying diverse bacterial phenotypes at the population genome level. This review provides a comprehensive overview of the application of BGWAS in elucidating genetic determinants of bacterial drug resistance, pathogenicity, host specificity, biofilm formation, and probiotic fermentation characteristics. We systematically summarize the BGWAS workflow, including study design, data analysis pipelines, and the bioinformatics software employed at various stages. Furthermore, we highlight specialized tools tailored for BGWAS and discuss their unique features and applications. We also discuss confounding factors that can influence the accuracy and reliability of BGWAS results, including population structure, linkage disequilibrium, and multiple testing. By incorporating recent advancements, this review serves as a comprehensive reference for researchers utilizing BGWAS to uncover the genetic basis of bacterial phenotypes.
Collapse
Affiliation(s)
- Qiuping Yang
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Xiaoqi Wang
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Mengting Han
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Huanjing Sheng
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Yulu Sun
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Li Su
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Wenjing Lu
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Mei Li
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Siyue Wang
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| | - Jia Chen
- College of Chemical Technology, Shijiazhuang University, Shijiazhuang, China
| | - Shenghui Cui
- National Institutes for Food and Drug Control, Beijing, China
| | - Bao-Wei Yang
- College of Food Science and Engineering, Northwest A&F University, Shaanxi, China
| |
Collapse
|
2
|
Ortega-Sanz I, Rovira J, Megías G, Rivero-Pérez MD, Melero B. Genome-Wide association study to identify genetic markers associated with Campylobacter jejuni motility. Microb Pathog 2025; 205:107657. [PMID: 40318771 DOI: 10.1016/j.micpath.2025.107657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2024] [Revised: 04/18/2025] [Accepted: 04/30/2025] [Indexed: 05/07/2025]
Abstract
The ability of Campylobacter jejuni to survive and persist under harsh conditions is linked to the presence of flagella. This structure promotes the motility of the bacteria towards their optimum environment. The aim of this study was to examine the genetic basis for motility within 136 C. jejuni isolates through two different Genome-Wide Association Studies, gene presence/absence and Single Nucleotide Polymorphisms (SNPs). The motility phenotype was widely distributed across the phylogeny with large intra-lineage swarming performance variabilities. Accessory genes significantly associated with motility were found in four key genomic regions. One of these regions affected the Cj0727-Cj0733 operon, that encodes a putative ABC transporter system for phosphate uptake, while other influenced the capsule biosynthesis locus. Multiple SNPs mostly linked to increased motility were also discovered in clusters of genes, with special relevance to transport and membrane proteins. Therefore, the capsule and membrane composition might influence nutrient transfer, further impacting the protonmotive force that drives flagellar motor rotation in C. jejuni. The study provides novel genetic markers with a potential role in the motility phenotype of the pathogen.
Collapse
Affiliation(s)
- Irene Ortega-Sanz
- Department of Biotechnology and Food Science, University of Burgos, 9001 Burgos, Spain.
| | - Jordi Rovira
- Department of Biotechnology and Food Science, University of Burgos, 9001 Burgos, Spain.
| | - Gregoria Megías
- Microbiology Department of the University Hospital of Burgos (HUBU), Burgos, Spain.
| | | | - Beatriz Melero
- Department of Biotechnology and Food Science, University of Burgos, 9001 Burgos, Spain.
| |
Collapse
|
3
|
Zhou N, Zheng Q, Liu Y, Huang Z, Feng Y, Chen Y, Hu F, Zheng H. Strain diversity and host specificity of the gut symbiont Gilliamella in Apis mellifera, Apis cerana and Bombus terrestris. Microbiol Res 2025; 293:128048. [PMID: 39813751 DOI: 10.1016/j.micres.2025.128048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 12/15/2024] [Accepted: 01/02/2025] [Indexed: 01/18/2025]
Abstract
Social bees, with their specialized gut microbiota and societal transmission between individuals, provide an ideal model for studying host-gut microbiota interactions. While the functional disparities arising from strain-level diversity of gut symbionts and their effects on host health have been studied in Apis mellifera and bumblebees, studies focusing on host-specific investigations of individual strains across different honeybee hosts remain relatively unexplored. In this study, the complete genomic sequences of 17 strains of Gilliamella from A. mellifera, Apis cerana and Bombus terrestris were analyzed. The analysis revealed that the strains of A. mellifera display a more expansive genomic and functional content compared to the strains of A. cerana and B. terrestris. Phylogenetic analysis showed a deep divergence among the Gilliamella strains from different hosts. Additionally, biochemistry tests and antibiotic susceptibility tests revealed that gut strains from A. mellifera exhibited a more extensive pathway for carbohydrate metabolism and a greater resistance to antibiotics than gut strains from A. cerana and B. terrestris. Strains from A. mellifera and A. cerana showed higher colonization efficiency and competitive ability whithin their respective host species, indicating a higher degree of host-specific adaptation of local gut microbiota. In addition, colonization by A. mellifera-derived strain triggers a stronger transcriptional response in the host than A. cerana-derived strain. The variation in the number of differentially expressed genes and the involvement of distinct signaling pathways across these two host species suggest species-specific adaptations to Gilliamella strains. These findings suggest that despite occupying similar niches in the bee gut, strain-level variations can influence microbial functions, and their impact on host physiological functions may vary across different strains.
Collapse
Affiliation(s)
- Nihong Zhou
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China
| | - Qiulan Zheng
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yao Liu
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China
| | - Zhichu Huang
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ye Feng
- Institute of Translational Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yanping Chen
- Bee Research Laboratory, USDA-ARS, Beltsville, MD, USA
| | - Fuliang Hu
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China
| | - Huoqing Zheng
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|
4
|
Burgaya J, Damaris BF, Fiebig J, Galardini M. microGWAS: a computational pipeline to perform large-scale bacterial genome-wide association studies. Microb Genom 2025; 11. [PMID: 39932497 DOI: 10.1099/mgen.0.001349] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2025] Open
Abstract
Identifying genetic variants associated with bacterial phenotypes, such as virulence, host preference and antimicrobial resistance, has great potential for a better understanding of the mechanisms involved in these traits. The availability of large collections of bacterial genomes has made genome-wide association studies (GWAS) a common approach for this purpose. The need to employ multiple software tools for data pre- and postprocessing limits the application of these methods by experienced bioinformaticians. To address this issue, we have developed a pipeline to perform bacterial GWAS from a set of assemblies and annotations, with multiple phenotypes as targets. The associations are run using five sets of genetic variants: unitigs, gene presence/absence, rare variants (i.e. gene burden test), gene-cluster-specific k-mers and all unitigs jointly. All variants passing the association threshold are further annotated to identify overrepresented biological processes and pathways. The results can be further augmented by generating a phylogenetic tree and predicting the presence of antimicrobial resistance and virulence-associated genes. We tested the microGWAS pipeline on a previously reported dataset on Escherichia coli virulence, successfully identifying the causal variants and providing further interpretation of the association results. The microGWAS pipeline integrates state-of-the-art tools to perform bacterial GWAS into a single, user-friendly and reproducible pipeline, allowing for the democratization of these analyses. The pipeline, together with its documentation, can be accessed at https://github.com/microbial-pangenomes-lab/microGWAS.
Collapse
Affiliation(s)
- Judit Burgaya
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany
| | - Bamu F Damaris
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany
| | - Jenny Fiebig
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
| | - Marco Galardini
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany
| |
Collapse
|
5
|
Tsoumtsa Meda L, Lagarde J, Guillier L, Roussel S, Douarre PE. Using GWAS and Machine Learning to Identify and Predict Genetic Variants Associated with Foodborne Bacteria Phenotypic Traits. Methods Mol Biol 2025; 2852:223-253. [PMID: 39235748 DOI: 10.1007/978-1-0716-4100-2_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]
Abstract
One of the main challenges in food microbiology is to prevent the risk of outbreaks by avoiding the distribution of food contaminated by bacteria. This requires constant monitoring of the circulating strains throughout the food production chain. Bacterial genomes contain signatures of natural evolution and adaptive markers that can be exploited to better understand the behavior of pathogen in the food industry. The monitoring of foodborne strains can therefore be facilitated by the use of these genomic markers capable of rapidly providing essential information on isolated strains, such as the source of contamination, risk of illness, potential for biofilm formation, and tolerance or resistance to biocides. The increasing availability of large genome datasets is enhancing the understanding of the genetic basis of complex traits such as host adaptation, virulence, and persistence. Genome-wide association studies have shown very promising results in the discovery of genomic markers that can be integrated into rapid detection tools. In addition, machine learning has successfully predicted phenotypes and classified important traits. Genome-wide association and machine learning tools have therefore the potential to support decision-making circuits intending at reducing the burden of foodborne diseases. The aim of this chapter review is to provide knowledge on the use of these two methods in food microbiology and to recommend their use in the field.
Collapse
Affiliation(s)
- Landry Tsoumtsa Meda
- ACTALIA, La Roche-sur-Foron, France
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Jean Lagarde
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
- INRAE, Unit of Process Optimisation in Food, Agriculture and the Environment (UR OPAALE), Rennes, France
| | | | - Sophie Roussel
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Pierre-Emmanuel Douarre
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France.
| |
Collapse
|
6
|
Liu N, Guan M, Ma B, Chu H, Tian G, Zhang Y, Li C, Zheng W, Wang X. Unraveling genetic mysteries: A comprehensive review of GWAS and DNA insights in animal and plant pathosystems. Int J Biol Macromol 2025; 285:138216. [PMID: 39631605 DOI: 10.1016/j.ijbiomac.2024.138216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Revised: 11/13/2024] [Accepted: 11/28/2024] [Indexed: 12/07/2024]
Abstract
DNA serves as the carrier of genetic information, with sequence variations playing a pivotal role in defining hereditary traits. Genome-Wide Association Studies (GWAS) facilitate the investigation of the links between genetic variations and phenotypes, significantly influencing biological research, particularly in animal and plant pathology. By identifying genetic markers associated with specific traits or diseases, GWAS enhances our understanding of host-pathogen interactions and improves disease-resistant breeding strategies. It has been vital in revealing the genetic basis of disease resistance, pinpointing key genes and DNA loci, which enrich genetic resources for breeding programs and deepen our knowledge of disease resistance mechanisms at the DNA level. Additionally, GWAS contributes to pathogen population genetics, facilitating a thorough exploration of pathogen virulence. Integrating GWAS with marker-assisted selection enhances breeding efficiency and precision in selecting for disease-resistant traits. While previous research has largely focused on host genetics, the genetic variation of pathogens is equally significant. Notably, reports integrating animal and plant pathosystems are still lacking. Given the importance of these systems, this review summarizes key advancements in this field, addresses current challenges, and proposes future directions, thereby offering a vital reference for ongoing research.
Collapse
Affiliation(s)
- Na Liu
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China
| | - Mengxin Guan
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China
| | - Baozhan Ma
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China
| | - Hao Chu
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China
| | - Guangxiang Tian
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China
| | - Yanyan Zhang
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China
| | - Chuang Li
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China; Center of Crop Genome Engineering, College of Agronomy, Henan Agricultural University, 450046 Zhengzhou, China.
| | - Wenming Zheng
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China.
| | - Xu Wang
- Collaborative Innovation Center of Henan Grain Crops/State Key Laboratory of Wheat and Maize Crop Science, College of Life Sciences, Henan Agricultural University, 450046 Zhengzhou, China.
| |
Collapse
|
7
|
Cookson AL, Burgess S, Midwinter AC, Marshall JC, Moinet M, Rogers L, Fayaz A, Biggs PJ, Brightwell G. New Campylobacter Lineages in New Zealand Freshwater: Pathogenesis and Public Health Implications. Environ Microbiol 2024; 26:e70016. [PMID: 39680962 DOI: 10.1111/1462-2920.70016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 11/11/2024] [Accepted: 11/26/2024] [Indexed: 12/18/2024]
Abstract
This study investigated the diversity of thermophilic Campylobacter species isolated from three New Zealand freshwater catchments affected by pastoral and urban activities. Utilising matrix-assisted laser desorption ionisation-time of flight and whole genome sequence analysis, the study identified Campylobacter jejuni (n = 46, 46.0%), C. coli (n = 39, 39%), C. lari (n = 4, 4.0%), and two novel Campylobacter species lineages (n = 11, 11%). Core genome sequence analysis provided evidence of prolonged persistence or continuous faecal shedding of closely related strains. The C. jejuni isolates displayed distinct sequence types (STs) associated with human, ruminant, and environmental sources, whereas the C. coli STs included waterborne ST3302 and ST7774. Recombination events affecting loci implicated in human pathogenesis and environmental persistence were observed, particularly in the cdtABC operon (encoding the cytolethal distending toxin) of non-human C. jejuni STs. A low diversity of antimicrobial resistance genes (aadE-Cc in C. coli), with genotype/phenotype concordance for tetracycline resistance (tetO) in three ST177 isolates, was noted. The data suggest the existence of two types of naturalised waterborne Campylobacter: environmentally persistent strains originating from waterbirds and new environmental species not linked to human campylobacteriosis. Identifying and understanding naturalised Campylobacter species is crucial for accurate waterborne public health risk assessments and the effective allocation of resources for water quality management.
Collapse
Affiliation(s)
- Adrian L Cookson
- AgResearch Limited, Hopkirk Research Institute, Palmerston North, New Zealand
- mEpiLab, School of Veterinary Sciences, Massey University, Palmerston North, New Zealand
| | - Sara Burgess
- mEpiLab, School of Veterinary Sciences, Massey University, Palmerston North, New Zealand
| | - Anne C Midwinter
- mEpiLab, School of Veterinary Sciences, Massey University, Palmerston North, New Zealand
| | - Jonathan C Marshall
- School of Mathematical and Computational Sciences, Massey University, Palmerston North, New Zealand
| | - Marie Moinet
- AgResearch Limited, Hopkirk Research Institute, Palmerston North, New Zealand
| | - Lynn Rogers
- AgResearch Limited, Hopkirk Research Institute, Palmerston North, New Zealand
| | - Ahmed Fayaz
- mEpiLab, School of Veterinary Sciences, Massey University, Palmerston North, New Zealand
| | - Patrick J Biggs
- mEpiLab, School of Veterinary Sciences, Massey University, Palmerston North, New Zealand
- School of Natural Sciences, Massey University, Palmerston North, New Zealand
| | - Gale Brightwell
- AgResearch Limited, Hopkirk Research Institute, Palmerston North, New Zealand
| |
Collapse
|
8
|
Do DT, Yang MR, Vo TNS, Le NQK, Wu YW. Unitig-centered pan-genome machine learning approach for predicting antibiotic resistance and discovering novel resistance genes in bacterial strains. Comput Struct Biotechnol J 2024; 23:1864-1876. [PMID: 38707536 PMCID: PMC11067008 DOI: 10.1016/j.csbj.2024.04.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 04/13/2024] [Accepted: 04/13/2024] [Indexed: 05/07/2024] Open
Abstract
In current genomic research, the widely used methods for predicting antimicrobial resistance (AMR) often rely on prior knowledge of known AMR genes or reference genomes. However, these methods have limitations, potentially resulting in imprecise predictions owing to incomplete coverage of AMR mechanisms and genetic variations. To overcome these limitations, we propose a pan-genome-based machine learning approach to advance our understanding of AMR gene repertoires and uncover possible feature sets for precise AMR classification. By building compacted de Brujin graphs (cDBGs) from thousands of genomes and collecting the presence/absence patterns of unique sequences (unitigs) for Pseudomonas aeruginosa, we determined that using machine learning models on unitig-centered pan-genomes showed significant promise for accurately predicting the antibiotic resistance or susceptibility of microbial strains. Applying a feature-selection-based machine learning algorithm led to satisfactory predictive performance for the training dataset (with an area under the receiver operating characteristic curve (AUC) of > 0.929) and an independent validation dataset (AUC, approximately 0.77). Furthermore, the selected unitigs revealed previously unidentified resistance genes, allowing for the expansion of the resistance gene repertoire to those that have not previously been described in the literature on antibiotic resistance. These results demonstrate that our proposed unitig-based pan-genome feature set was effective in constructing machine learning predictors that could accurately identify AMR pathogens. Gene sets extracted using this approach may offer valuable insights into expanding known AMR genes and forming new hypotheses to uncover the underlying mechanisms of bacterial AMR.
Collapse
Affiliation(s)
- Duyen Thi Do
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| | - Ming-Ren Yang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
- Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
| | - Tran Nam Son Vo
- Department of Business Administration, College of Management, Lunghwa University of Science and Technology, Taoyuan City, Taiwan
| | - Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Yu-Wei Wu
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
- Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, Taiwan
- TMU Research Center for Digestive Medicine, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|
9
|
Zhou H, Du W, Ouyang D, Li Y, Gong Y, Yao Z, Zhong M, Zhong X, Ye X. Simple and accurate genomic classification model for distinguishing between human and pig Staphylococcus aureus. Commun Biol 2024; 7:1171. [PMID: 39294434 PMCID: PMC11410946 DOI: 10.1038/s42003-024-06883-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 09/11/2024] [Indexed: 09/20/2024] Open
Abstract
Staphylococcus aureus (S. aureus) can cause various infections in humans and animals, contributing to high morbidity and mortality. To prevent and control cross-species transmission of S. aureus, it is necessary to understand the host-associated genetic variants. We performed a two-stage genome-wide association study (GWAS) including initial screening and further validation to compare genomic differences between human and pig S. aureus, aiming to identify host-associated determinants. Our multiple GWAS analyses found six consensus significant k-mers associated with host species, providing novel genetic evidence for distinguishing human from pig S. aureus. The best k-mer predictor achieved a high classification accuracy of 98.12% on its own and had extremely high resolution similar to the SNPs-based phylogeny, offering a very simple target for predicting the cross-species transmission risk of S. aureus. The final k-mer model revealed that 90% of S. aureus isolates from farm workers were predicted as livestock origin, suggesting a high risk of cross-species transmission. Bayesian inference revealed different cross-species transmission directions, with the human-to-pig transmission for ST5 and the pig-to-human transmission for ST398. Our findings provide a simple and accurate k-mer model for identifying and predicting the cross-species transmission risk of S. aureus.
Collapse
Affiliation(s)
- Huiliu Zhou
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
| | - Wenyin Du
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
| | - Dejia Ouyang
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
| | - Yuehe Li
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
| | - Yajie Gong
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
| | - Zhenjiang Yao
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
| | - Minghao Zhong
- Department of Prevention and Health Care, The Sixth People's Hospital of Dongguan, Dongguan, China
| | - Xinguang Zhong
- Department of Prevention and Health Care, The Sixth People's Hospital of Dongguan, Dongguan, China.
| | - Xiaohua Ye
- School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China.
| |
Collapse
|
10
|
Middendorf PS, Wijnands LM, Boeren S, Zomer AL, Jacobs-Reitsma WF, den Besten HM, Abee T. Activation of the l-fucose utilization cluster in Campylobacter jejuni induces proteomic changes and enhances Caco-2 cell invasion and fibronectin binding. Heliyon 2024; 10:e34996. [PMID: 39220920 PMCID: PMC11365321 DOI: 10.1016/j.heliyon.2024.e34996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 07/16/2024] [Accepted: 07/21/2024] [Indexed: 09/04/2024] Open
Abstract
Most Campylobacter jejuni isolates carry the fucose utilization cluster (Cj0480c-Cj0489) that supports the metabolism of l-fucose and d-arabinose. In this study we quantified l-fucose and d-arabinose metabolism and metabolite production, and the impact on Caco-2 cell interaction and binding to fibronectin, using C. jejuni NCTC11168 and the closely related human isolate C. jejuni strain 286. When cultured with l-fucose and d-arabinose, both isolates showed increased survival and production of acetate, pyruvate and succinate, and the respective signature metabolites lactate and glycolic acid, in line with an overall upregulation of l-fucose cluster proteins. In vitro Caco-2 cell studies and fibronectin-binding experiments showed a trend towards higher invasion and a significantly higher fibronectin binding efficacy of C. jejuni NCTC11168 cells grown with l-fucose and d-arabinose, while no significant differences were found with C. jejuni 286. Both fibronectin binding proteins, CadF and FlpA, were detected in the two isolates, but were not significantly differentially expressed in l-fucose or d-arabinose grown cells. Comparative proteomics analysis linked the C. jejuni NCTC11168 phenotypes uniquely to the more than 135-fold upregulated protein Cj0608, putative TolC-like component MacC, which, together with the detected Cj0606 and Cj0607 proteins, forms the tripartite secretion system MacABC with putative functions in antibiotic resistance, cell envelope stress response and virulence in Gram negative pathogenic bacteria. Further studies are required to elucidate the role of the MacABC system in C. jejuni cell surface structure modulation and virulence.
Collapse
Affiliation(s)
- Pjotr S. Middendorf
- Food Microbiology, Wageningen University and Research, Wageningen, the Netherlands
- National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Lucas M. Wijnands
- National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Sjef Boeren
- Laboratory of Biochemistry, Wageningen University and Research, Wageningen, the Netherlands
| | - Aldert L. Zomer
- Faculty of Veterinary Medicine, Department of Infectious Diseases and Immunology, Utrecht University, Utrecht, the Netherlands
- WHO Collaborating Center for Campylobacter/OIE Reference Laboratory for Campylobacteriosis, Utrecht, the Netherlands
| | | | | | - Tjakko Abee
- Food Microbiology, Wageningen University and Research, Wageningen, the Netherlands
| |
Collapse
|
11
|
Crestani C, Forde TL, Bell J, Lycett SJ, Oliveira LMA, Pinto TCA, Cobo-Ángel CG, Ceballos-Márquez A, Phuoc NN, Sirimanapong W, Chen SL, Jamrozy D, Bentley SD, Fontaine M, Zadoks RN. Genomic and functional determinants of host spectrum in Group B Streptococcus. PLoS Pathog 2024; 20:e1012400. [PMID: 39133742 PMCID: PMC11341095 DOI: 10.1371/journal.ppat.1012400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 08/22/2024] [Accepted: 07/08/2024] [Indexed: 08/24/2024] Open
Abstract
Group B Streptococcus (GBS) is a major human and animal pathogen that threatens public health and food security. Spill-over and spill-back between host species is possible due to adaptation and amplification of GBS in new niches but the evolutionary and functional mechanisms underpinning those phenomena are poorly known. Based on analysis of 1,254 curated genomes from all major GBS host species and six continents, we found that the global GBS population comprises host-generalist, host-adapted and host-restricted sublineages, which are found across host groups, preferentially within one host group, or exclusively within one host group, respectively, and show distinct levels of recombination. Strikingly, the association of GBS genomes with the three major host groups (humans, cattle, fish) is driven by a single accessory gene cluster per host, regardless of sublineage or the breadth of host spectrum. Moreover, those gene clusters are shared with other streptococcal species occupying the same niche and are functionally relevant for host tropism. Our findings demonstrate (1) the heterogeneity of genome plasticity within a bacterial species of public health importance, enabling the identification of high-risk clones; (2) the contribution of inter-species gene transmission to the evolution of GBS; and (3) the importance of considering the role of animal hosts, and the accessory gene pool associated with their microbiota, in the evolution of multi-host bacterial pathogens. Collectively, these phenomena may explain the adaptation and clonal expansion of GBS in animal reservoirs and the risk of spill-over and spill-back between animals and humans.
Collapse
Affiliation(s)
- Chiara Crestani
- Institute of Biodiversity, Animal Health & Comparative Medicine, University of Glasgow, Glasgow, Scotland, United Kingdom
| | - Taya L. Forde
- Institute of Biodiversity, Animal Health & Comparative Medicine, University of Glasgow, Glasgow, Scotland, United Kingdom
| | - John Bell
- Moredun Research Institute, Penicuik, Scotland, United Kingdom
| | - Samantha J. Lycett
- The Roslin Institute, University of Edinburgh, Midlothian, Scotland, United Kingdom
| | - Laura M. A. Oliveira
- Instituto de Microbiologia Paulo de Goes, Federal University of Rio de Janeiro, Rio de Janeiro, State of Rio de Janeiro, Brazil
| | - Tatiana C. A. Pinto
- Instituto de Microbiologia Paulo de Goes, Federal University of Rio de Janeiro, Rio de Janeiro, State of Rio de Janeiro, Brazil
| | | | | | - Nguyen N. Phuoc
- Faculty of Fisheries, University of Agriculture and Forestry, Hue University, Hue, Vietnam
| | - Wanna Sirimanapong
- Faculty of Veterinary Science, Mahidol University, Nakhon Pathom, Thailand
| | - Swaine L. Chen
- Infectious Diseases Translational Research Programme, Department of Medicine, Division of Infectious Diseases, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Laboratory of Bacterial Genomics, Genome Institute of Singapore, Singapore
| | - Dorota Jamrozy
- Parasites and Microbes Programme, Wellcome Sanger Institute, Hinxton, England, United Kingdom
| | - Stephen D. Bentley
- Parasites and Microbes Programme, Wellcome Sanger Institute, Hinxton, England, United Kingdom
| | | | - Ruth N. Zadoks
- Institute of Biodiversity, Animal Health & Comparative Medicine, University of Glasgow, Glasgow, Scotland, United Kingdom
- Moredun Research Institute, Penicuik, Scotland, United Kingdom
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Camden, NSW, Australia
| |
Collapse
|
12
|
Arrieta-Gisasola A, Martínez-Ballesteros I, Martinez-Malaxetxebarria I, Garrido V, Grilló MJ, Bikandi J, Laorden L. Pan-Genome-Wide Association Study reveals a key role of the salmochelin receptor IroN in the biofilm formation of Salmonella Typhimurium and its monophasic variant 4,[5],12:i:. Int J Food Microbiol 2024; 419:110753. [PMID: 38797020 DOI: 10.1016/j.ijfoodmicro.2024.110753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 04/25/2024] [Accepted: 05/17/2024] [Indexed: 05/29/2024]
Abstract
Salmonella enterica subsp. enterica serovar Typhimurium variant 4,[5],12:i:- (so called S. 4,[5],12:i:-) has rapidly become one of the most prevalent serovars in humans in Europe, with clinical cases associated with foodborne from pork products. The mechanisms, genetic basis and biofilms relevance by which S. 4,[5],12:i:- maintains and spreads its presence in pigs remain unclear. In this study, we examined the genetic basis of biofilm production in 78 strains of S. 4,[5],12:i:- (n = 57) and S. Typhimurium (n = 21), from human gastroenteritis, food products and asymptomatic pigs. The former showed a lower Specific Biofilm Formation index (SBF) and distant phylogenetic clades, suggesting that the ability to form biofilms is not a crucial adaptation for the S. 4,[5],12:i:- emerging success in pigs. However, using a pan-Genome-Wide Association Study (pan-GWAS) we identified genetic determinants of biofilm formation, revealing 167 common orthologous groups and genes associated with the SBF. The analysis of annotated sequences highlighted specific genetic deletions in three chromosomal regions of S. 4,[5],12:i:- correlating with SBF values: i) the complete fimbrial operon stbABCDE widely recognized as the most critical factor involved in Salmonella adherence; ii) the hxlA, hlxB, and pgiA genes, which expression in S. Typhimurium is induced in the tonsils during swine infection, and iii) the entire iroA locus related to the characteristic deletion of the second-phase flagellar genomic region in S. 4,[5],12:i:-. Consequently, we further investigated the role of the iro-genes on biofilm by constructing S. Typhimurium deletion mutants in iroBCDE and iroN. While iroBCDE showed no significant impact, iroN clearly contributed to S. Typhimurium biofilm formation. In conclusion, the pan-GWAS approach allowed us to uncover complex interactions between genetic and phenotypic factors influencing biofilm formation in S. 4,[5],12:i:- and S. Typhimurium.
Collapse
Affiliation(s)
- A Arrieta-Gisasola
- MikroIker Research Group, Department of Immunology, Microbiology and Parasitology, Faculty of Pharmacy, University of the Basque Country (UPV/EHU), Paseo de la Universidad 7, 01006 Vitoria-Gasteiz, Spain; Bioaraba, Microbiology, Infectious Diseases, Antimicrobial Agents, and Gene Therapy, 01006 Vitoria-Gasteiz, Spain
| | - I Martínez-Ballesteros
- MikroIker Research Group, Department of Immunology, Microbiology and Parasitology, Faculty of Pharmacy, University of the Basque Country (UPV/EHU), Paseo de la Universidad 7, 01006 Vitoria-Gasteiz, Spain; Bioaraba, Microbiology, Infectious Diseases, Antimicrobial Agents, and Gene Therapy, 01006 Vitoria-Gasteiz, Spain
| | - I Martinez-Malaxetxebarria
- MikroIker Research Group, Department of Immunology, Microbiology and Parasitology, Faculty of Pharmacy, University of the Basque Country (UPV/EHU), Paseo de la Universidad 7, 01006 Vitoria-Gasteiz, Spain; Bioaraba, Microbiology, Infectious Diseases, Antimicrobial Agents, and Gene Therapy, 01006 Vitoria-Gasteiz, Spain
| | - V Garrido
- Institute of Agrobiotechnology (IdAB; CSIC-Gobierno de Navarra), 31192 Mutilva, Spain
| | - M J Grilló
- Institute of Agrobiotechnology (IdAB; CSIC-Gobierno de Navarra), 31192 Mutilva, Spain
| | - J Bikandi
- MikroIker Research Group, Department of Immunology, Microbiology and Parasitology, Faculty of Pharmacy, University of the Basque Country (UPV/EHU), Paseo de la Universidad 7, 01006 Vitoria-Gasteiz, Spain; Bioaraba, Microbiology, Infectious Diseases, Antimicrobial Agents, and Gene Therapy, 01006 Vitoria-Gasteiz, Spain
| | - L Laorden
- MikroIker Research Group, Department of Immunology, Microbiology and Parasitology, Faculty of Pharmacy, University of the Basque Country (UPV/EHU), Paseo de la Universidad 7, 01006 Vitoria-Gasteiz, Spain; Bioaraba, Microbiology, Infectious Diseases, Antimicrobial Agents, and Gene Therapy, 01006 Vitoria-Gasteiz, Spain.
| |
Collapse
|
13
|
Tam YL, Cameron S, Preston A, Cowley L. GWarrange: a pre- and post- genome-wide association studies pipeline for detecting phenotype-associated genome rearrangement events. Microb Genom 2024; 10:001268. [PMID: 38980151 PMCID: PMC11316554 DOI: 10.1099/mgen.0.001268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Accepted: 06/17/2024] [Indexed: 07/10/2024] Open
Abstract
The use of k-mers to capture genetic variation in bacterial genome-wide association studies (bGWAS) has demonstrated its effectiveness in overcoming the plasticity of bacterial genomes by providing a comprehensive array of genetic variants in a genome set that is not confined to a single reference genome. However, little attempt has been made to interpret k-mers in the context of genome rearrangements, partly due to challenges in the exhaustive and high-throughput identification of genome structure and individual rearrangement events. Here, we present GWarrange, a pre- and post-bGWAS processing methodology that leverages the unique properties of k-mers to facilitate bGWAS for genome rearrangements. Repeat sequences are common instigators of genome rearrangements through intragenomic homologous recombination, and they are commonly found at rearrangement boundaries. Using whole-genome sequences, repeat sequences are replaced by short placeholder sequences, allowing the regions flanking repeats to be incorporated into relatively short k-mers. Then, locations of flanking regions in significant k-mers are mapped back to complete genome sequences to visualise genome rearrangements. Four case studies based on two bacterial species (Bordetella pertussis and Enterococcus faecium) and a simulated genome set are presented to demonstrate the ability to identify phenotype-associated rearrangements. GWarrange is available at https://github.com/DorothyTamYiLing/GWarrange.
Collapse
Affiliation(s)
- Yi Ling Tam
- The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Sarah Cameron
- The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Andrew Preston
- The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Lauren Cowley
- The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| |
Collapse
|
14
|
Barber MF, Fitzgerald JR. Mechanisms of host adaptation by bacterial pathogens. FEMS Microbiol Rev 2024; 48:fuae019. [PMID: 39003250 PMCID: PMC11308195 DOI: 10.1093/femsre/fuae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 07/02/2024] [Accepted: 07/24/2024] [Indexed: 07/15/2024] Open
Abstract
The emergence of new infectious diseases poses a major threat to humans, animals, and broader ecosystems. Defining factors that govern the ability of pathogens to adapt to new host species is therefore a crucial research imperative. Pathogenic bacteria are of particular concern, given dwindling treatment options amid the continued expansion of antimicrobial resistance. In this review, we summarize recent advancements in the understanding of bacterial host species adaptation, with an emphasis on pathogens of humans and related mammals. We focus particularly on molecular mechanisms underlying key steps of bacterial host adaptation including colonization, nutrient acquisition, and immune evasion, as well as suggest key areas for future investigation. By developing a greater understanding of the mechanisms of host adaptation in pathogenic bacteria, we may uncover new strategies to target these microbes for the treatment and prevention of infectious diseases in humans, animals, and the broader environment.
Collapse
Affiliation(s)
- Matthew F Barber
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, United States
- Department of Biology, University of Oregon, Eugene, OR 97403, United States
| | - J Ross Fitzgerald
- The Roslin Institute, University of Edinburgh, Midlothian, EH25 9RG, United Kingdom
| |
Collapse
|
15
|
Taylor AJ, Yahara K, Pascoe B, Ko S, Mageiros L, Mourkas E, Calland JK, Puranen S, Hitchings MD, Jolley KA, Kobras CM, Bayliss S, Williams NJ, van Vliet AHM, Parkhill J, Maiden MCJ, Corander J, Hurst LD, Falush D, Keim P, Didelot X, Kelly DJ, Sheppard SK. Epistasis, core-genome disharmony, and adaptation in recombining bacteria. mBio 2024; 15:e0058124. [PMID: 38683013 PMCID: PMC11237541 DOI: 10.1128/mbio.00581-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 03/26/2024] [Indexed: 05/01/2024] Open
Abstract
Recombination of short DNA fragments via horizontal gene transfer (HGT) can introduce beneficial alleles, create genomic disharmony through negative epistasis, and create adaptive gene combinations through positive epistasis. For non-core (accessory) genes, the negative epistatic cost is likely to be minimal because the incoming genes have not co-evolved with the recipient genome and are frequently observed as tightly linked cassettes with major effects. By contrast, interspecific recombination in the core genome is expected to be rare because disruptive allelic replacement is likely to introduce negative epistasis. Why then is homologous recombination common in the core of bacterial genomes? To understand this enigma, we take advantage of an exceptional model system, the common enteric pathogens Campylobacter jejuni and C. coli that are known for very high magnitude interspecies gene flow in the core genome. As expected, HGT does indeed disrupt co-adapted allele pairings, indirect evidence of negative epistasis. However, multiple HGT events enable recovery of the genome's co-adaption between introgressing alleles, even in core metabolism genes (e.g., formate dehydrogenase). These findings demonstrate that, even for complex traits, genetic coalitions can be decoupled, transferred, and independently reinstated in a new genetic background-facilitating transition between fitness peaks. In this example, the two-step recombinational process is associated with C. coli that are adapted to the agricultural niche.IMPORTANCEGenetic exchange among bacteria shapes the microbial world. From the acquisition of antimicrobial resistance genes to fundamental questions about the nature of bacterial species, this powerful evolutionary force has preoccupied scientists for decades. However, the mixing of genes between species rests on a paradox: 0n one hand, promoting adaptation by conferring novel functionality; on the other, potentially introducing disharmonious gene combinations (negative epistasis) that will be selected against. Taking an interdisciplinary approach to analyze natural populations of the enteric bacteria Campylobacter, an ideal example of long-range admixture, we demonstrate that genes can independently transfer across species boundaries and rejoin in functional networks in a recipient genome. The positive impact of two-gene interactions appears to be adaptive by expanding metabolic capacity and facilitating niche shifts through interspecific hybridization. This challenges conventional ideas and highlights the possibility of multiple-step evolution of multi-gene traits by interspecific introgression.
Collapse
Affiliation(s)
- Aidan J Taylor
- School of Biological Sciences, University of Reading, Reading, United Kingdom
| | - Koji Yahara
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Ben Pascoe
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Seungwon Ko
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Leonardos Mageiros
- Swansea University Medical School, Institute of Life Science, Swansea, United Kingdom
- The Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | | | - Jessica K Calland
- Oslo Centre for Biostatistics and Epidemiology, Oslo University Hospital, Oslo, Norway
| | - Santeri Puranen
- Department of Mathematics and Statistics, Helsinki Institute for Information Technology, University of Helsinki, Helsinki, Finland
| | - Matthew D Hitchings
- Swansea University Medical School, Institute of Life Science, Swansea, United Kingdom
| | - Keith A Jolley
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Carolin M Kobras
- Sir William Dunn School of Pathology, University of Oxford, Oxford, United Kingdom
| | - Sion Bayliss
- Bristol Veterinary School, University of Bristol, Bristol, United Kingdom
| | - Nicola J Williams
- Department of Epidemiology and Population Health, Institute of Infection and Global Health, University of Liverpool, Leahurst Campus, Wirral, United Kingdom
| | | | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | | | - Jukka Corander
- Department of Mathematics and Statistics, Helsinki Institute for Information Technology, University of Helsinki, Helsinki, Finland
- Sir William Dunn School of Pathology, University of Oxford, Oxford, United Kingdom
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, United Kingdom
| | - Laurence D Hurst
- The Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | - Daniel Falush
- The Centre for Microbes, Development and Health, Institut Pasteur of Shanghai, Shanghai, China
| | - Paul Keim
- Department of Biology, University of Oxford, Oxford, United Kingdom
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, USA
- Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona, USA
| | - Xavier Didelot
- Department of Statistics, School of Life Sciences, University of Warwick, Coventry, United Kingdom
| | - David J Kelly
- School of Biosciences, University of Sheffield, Sheffield, United Kingdom
| | | |
Collapse
|
16
|
Frentzel H, Kraemer M, Kelner-Burgos Y, Uelze L, Bodi D. Cereulide production capacities and genetic properties of 31 emetic Bacillus cereus group strains. Int J Food Microbiol 2024; 417:110694. [PMID: 38614024 DOI: 10.1016/j.ijfoodmicro.2024.110694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 04/02/2024] [Accepted: 04/05/2024] [Indexed: 04/15/2024]
Abstract
The highly potent toxin cereulide is a frequent cause of foodborne intoxications. This extremely resistant toxin is produced by Bacillus cereus group strains carrying the plasmid encoded cesHPTABCD gene cluster. It is known that the capacities to produce cereulide vary greatly between different strains but the genetic background of these variations is not clear. In this study, cereulide production capacities were associated with genetic characteristics. For this, cereulide levels in cultures of 31 strains were determined after incubation in tryptic soy broth for 24 h at 24 °C, 30 °C and 37 °C. Whole genome sequencing based data were used for an in-depth characterization of gene sequences related to cereulide production. The taxonomy, population structure and phylogenetic relationships of the strains were evaluated based on average nucleotide identity, multi-locus sequence typing (MLST), core genome MLST and single nucleotide polymorphism analyses. Despite a limited strain number, the approach of a genome wide association study (GWAS) was tested to link genetic variation with cereulide quantities. Our study confirms strain-dependent differences in cereulide production. For most strains, these differences were not explainable by sequence variations in the cesHPTABCD gene cluster or the regulatory genes abrB, spo0A, codY and pagRBc. Likewise, the population structure and phylogeny of the tested strains did not comprehensively reflect the cereulide production capacities. GWAS yielded first hints for associated proteins, while their possible effect on cereulide synthesis remains to be further investigated.
Collapse
Affiliation(s)
- Hendrik Frentzel
- German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Str. 8-10, 10589 Berlin, Germany.
| | - Marco Kraemer
- German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Str. 8-10, 10589 Berlin, Germany
| | - Ylanna Kelner-Burgos
- German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Str. 8-10, 10589 Berlin, Germany
| | - Laura Uelze
- Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG), Sequencing and Genotyping Service Unit, Pfotenhauerstraße 108, 01307 Dresden, Germany
| | - Dorina Bodi
- German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Str. 8-10, 10589 Berlin, Germany
| |
Collapse
|
17
|
Zhernakova DV, Wang D, Liu L, Andreu-Sánchez S, Zhang Y, Ruiz-Moreno AJ, Peng H, Plomp N, Del Castillo-Izquierdo Á, Gacesa R, Lopera-Maya EA, Temba GS, Kullaya VI, van Leeuwen SS, Xavier RJ, de Mast Q, Joosten LAB, Riksen NP, Rutten JHW, Netea MG, Sanna S, Wijmenga C, Weersma RK, Zhernakova A, Harmsen HJM, Fu J. Host genetic regulation of human gut microbial structural variation. Nature 2024; 625:813-821. [PMID: 38172637 PMCID: PMC10808065 DOI: 10.1038/s41586-023-06893-w] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 11/23/2023] [Indexed: 01/05/2024]
Abstract
Although the impact of host genetics on gut microbial diversity and the abundance of specific taxa is well established1-6, little is known about how host genetics regulates the genetic diversity of gut microorganisms. Here we conducted a meta-analysis of associations between human genetic variation and gut microbial structural variation in 9,015 individuals from four Dutch cohorts. Strikingly, the presence rate of a structural variation segment in Faecalibacterium prausnitzii that harbours an N-acetylgalactosamine (GalNAc) utilization gene cluster is higher in individuals who secrete the type A oligosaccharide antigen terminating in GalNAc, a feature that is jointly determined by human ABO and FUT2 genotypes, and we could replicate this association in a Tanzanian cohort. In vitro experiments demonstrated that GalNAc can be used as the sole carbohydrate source for F. prausnitzii strains that carry the GalNAc-metabolizing pathway. Further in silico and in vitro studies demonstrated that other ABO-associated species can also utilize GalNAc, particularly Collinsella aerofaciens. The GalNAc utilization genes are also associated with the host's cardiometabolic health, particularly in individuals with mucosal A-antigen. Together, the findings of our study demonstrate that genetic associations across the human genome and bacterial metagenome can provide functional insights into the reciprocal host-microbiome relationship.
Collapse
Affiliation(s)
- Daria V Zhernakova
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
| | - Daoming Wang
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Pediatrics, Groningen, The Netherlands
| | - Lei Liu
- University of Groningen, University Medical Center Groningen, Department of Medical Microbiology and Infection Prevention, Groningen, The Netherlands
| | - Sergio Andreu-Sánchez
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Pediatrics, Groningen, The Netherlands
| | - Yue Zhang
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Pediatrics, Groningen, The Netherlands
| | - Angel J Ruiz-Moreno
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Pediatrics, Groningen, The Netherlands
| | - Haoran Peng
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
| | - Niels Plomp
- University of Groningen, University Medical Center Groningen, Department of Medical Microbiology and Infection Prevention, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Gastroenterology and Hepatology, Groningen, The Netherlands
| | - Ángela Del Castillo-Izquierdo
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Medical Microbiology and Infection Prevention, Groningen, The Netherlands
| | - Ranko Gacesa
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Gastroenterology and Hepatology, Groningen, The Netherlands
| | - Esteban A Lopera-Maya
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
| | - Godfrey S Temba
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
- Department of Medical Biochemistry and Molecular Biology, Kilimanjaro Christian Medical University College, Moshi, Tanzania
- Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Vesla I Kullaya
- Department of Medical Biochemistry and Molecular Biology, Kilimanjaro Christian Medical University College, Moshi, Tanzania
- Kilimanjaro Clinical Research Institute, Kilimanjaro Christian Medical Center, Moshi, Tanzania
| | - Sander S van Leeuwen
- University of Groningen, University Medical Center Groningen, Department of Laboratory Medicine, Groningen, The Netherlands
| | - Ramnik J Xavier
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Computational and Integrative Biology, Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
| | - Quirijn de Mast
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Leo A B Joosten
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
- Department of Medical Genetics, Iuliu Haţieganu University of Medicine and Pharmacy, Cluj-Napoca, Romania
| | - Niels P Riksen
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Joost H W Rutten
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Mihai G Netea
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, The Netherlands
- Department of Immunology and Metabolism, Life and Medical Sciences Institute, University of Bonn, Bonn, Germany
- Human Genomics Laboratory, Craiova University of Medicine and Pharmacy, Craiova, Romania
| | - Serena Sanna
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- Institute for Genetic and Biomedical Research, National Research Council, Cagliari, Italy
| | - Cisca Wijmenga
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
| | - Rinse K Weersma
- University of Groningen, University Medical Center Groningen, Department of Gastroenterology and Hepatology, Groningen, The Netherlands
| | - Alexandra Zhernakova
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
| | - Hermie J M Harmsen
- University of Groningen, University Medical Center Groningen, Department of Medical Microbiology and Infection Prevention, Groningen, The Netherlands.
| | - Jingyuan Fu
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands.
- University of Groningen, University Medical Center Groningen, Department of Pediatrics, Groningen, The Netherlands.
| |
Collapse
|
18
|
Malekian N, Sainath S, Al-Fatlawi A, Schroeder M. Word-based GWAS harnesses the rich potential of genomic data for E. coli quinolone resistance. Front Microbiol 2023; 14:1276332. [PMID: 38152371 PMCID: PMC10751334 DOI: 10.3389/fmicb.2023.1276332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/16/2023] [Indexed: 12/29/2023] Open
Abstract
Quinolone resistance presents a growing global health threat. We employed word-based GWAS to explore genomic data, aiming to enhance our understanding of this phenomenon. Unlike traditional variant-based GWAS analyses, this approach simultaneously captures multiple genomic factors, including single and interacting resistance mutations and genes. Analyzing a dataset of 92 genomic E. coli samples from a wastewater treatment plant in Dresden, we identified 54 DNA unitigs significantly associated with quinolone resistance. Remarkably, our analysis not only validated known mutations in gyrA and parC genes and the results of our variant-based GWAS but also revealed new (mutated) genes such as mdfA, the AcrEF-TolC multidrug efflux system, ptrB, and hisI, implicated in antibiotic resistance. Furthermore, our study identified joint mutations in 14 genes including the known gyrA gene, providing insights into potential synergistic effects contributing to quinolone resistance. These findings showcase the exceptional capabilities of word-based GWAS in unraveling the intricate genomic foundations of quinolone resistance.
Collapse
Affiliation(s)
- Negin Malekian
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Germany
| | - Srividhya Sainath
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Germany
| | - Ali Al-Fatlawi
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Germany
- ITRDC, University of Kufa, Najaf, Iraq
| | - Michael Schroeder
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), TU Dresden, Dresden, Germany
| |
Collapse
|
19
|
Dutta A, McDonald BA, Croll D. Combined reference-free and multi-reference based GWAS uncover cryptic variation underlying rapid adaptation in a fungal plant pathogen. PLoS Pathog 2023; 19:e1011801. [PMID: 37972199 PMCID: PMC10688896 DOI: 10.1371/journal.ppat.1011801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 11/30/2023] [Accepted: 11/06/2023] [Indexed: 11/19/2023] Open
Abstract
Microbial pathogens often harbor substantial functional diversity driven by structural genetic variation. Rapid adaptation from such standing variation threatens global food security and human health. Genome-wide association studies (GWAS) provide a powerful approach to identify genetic variants underlying recent pathogen adaptation. However, the reliance on single reference genomes and single nucleotide polymorphisms (SNPs) obscures the true extent of adaptive genetic variation. Here, we show quantitatively how a combination of multiple reference genomes and reference-free approaches captures substantially more relevant genetic variation compared to single reference mapping. We performed reference-genome based association mapping across 19 reference-quality genomes covering the diversity of the species. We contrasted the results with a reference-free (i.e., k-mer) approach using raw whole-genome sequencing data in a panel of 145 strains collected across the global distribution range of the fungal wheat pathogen Zymoseptoria tritici. We mapped the genetic architecture of 49 life history traits including virulence, reproduction and growth in multiple stressful environments. The inclusion of additional reference genome SNP datasets provides a nearly linear increase in additional loci mapped through GWAS. Variants detected through the k-mer approach explained a higher proportion of phenotypic variation than a reference genome-based approach and revealed functionally confirmed loci that classic GWAS approaches failed to map. The power of GWAS in microbial pathogens can be significantly enhanced by comprehensively capturing structural genetic variation. Our approach is generalizable to a large number of species and will uncover novel mechanisms driving rapid adaptation of pathogens.
Collapse
Affiliation(s)
- Anik Dutta
- Plant Pathology, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Bruce A. McDonald
- Plant Pathology, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
20
|
Sommer H, Djamalova D, Galardini M. Reduced ambiguity and improved interpretability of bacterial genome-wide associations using gene-cluster-centric k-mers. Microb Genom 2023; 9. [PMID: 37934071 DOI: 10.1099/mgen.0.001129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2023] Open
Abstract
The wide adoption of bacterial genome sequencing and encoding both core and accessory genome variation using k-mers has allowed bacterial genome-wide association studies (GWAS) to identify genetic variants associated with relevant phenotypes such as those linked to infection. Significant limitations still remain because of k-mers being duplicated across gene clusters and as far as the interpretation of association results is concerned, which affects the wider adoption of GWAS methods on microbial data sets. We have developed a simple computational method (panfeed) that explicitly links each k-mer to their gene cluster at base-resolution level, which allows us to avoid biases introduced by a global de Bruijn graph as well as more easily map and annotate associated variants. We tested panfeed on two independent data sets, correctly identifying previously characterized causal variants, which demonstrates the precision of the method, as well as its scalable performance. panfeed is a command line tool written in the python programming language and is available at https://github.com/microbial-pangenomes-lab/panfeed.
Collapse
Affiliation(s)
- Hannes Sommer
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany
| | - Dilfuza Djamalova
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany
| | - Marco Galardini
- Institute for Molecular Bacteriology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School (MHH), Hannover, Germany
| |
Collapse
|
21
|
Bianconi I, Aschbacher R, Pagani E. Current Uses and Future Perspectives of Genomic Technologies in Clinical Microbiology. Antibiotics (Basel) 2023; 12:1580. [PMID: 37998782 PMCID: PMC10668849 DOI: 10.3390/antibiotics12111580] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 10/16/2023] [Accepted: 10/25/2023] [Indexed: 11/25/2023] Open
Abstract
Recent advancements in sequencing technology and data analytics have led to a transformative era in pathogen detection and typing. These developments not only expedite the process, but also render it more cost-effective. Genomic analyses of infectious diseases are swiftly becoming the standard for pathogen analysis and control. Additionally, national surveillance systems can derive substantial benefits from genomic data, as they offer profound insights into pathogen epidemiology and the emergence of antimicrobial-resistant strains. Antimicrobial resistance (AMR) is a pressing global public health issue. While clinical laboratories have traditionally relied on culture-based antimicrobial susceptibility testing, the integration of genomic data into AMR analysis holds immense promise. Genomic-based AMR data can furnish swift, consistent, and highly accurate predictions of resistance phenotypes for specific strains or populations, all while contributing invaluable insights for surveillance. Moreover, genome sequencing assumes a pivotal role in the investigation of hospital outbreaks. It aids in the identification of infection sources, unveils genetic connections among isolates, and informs strategies for infection control. The One Health initiative, with its focus on the intricate interconnectedness of humans, animals, and the environment, seeks to develop comprehensive approaches for disease surveillance, control, and prevention. When integrated with epidemiological data from surveillance systems, genomic data can forecast the expansion of bacterial populations and species transmissions. Consequently, this provides profound insights into the evolution and genetic relationships of AMR in pathogens, hosts, and the environment.
Collapse
Affiliation(s)
- Irene Bianconi
- Laboratory of Microbiology and Virology, Provincial Hospital of Bolzano (SABES-ASDAA), Lehrkrankenhaus der Paracelsus Medizinischen Privatuniversitätvia Amba Alagi 5, 39100 Bolzano, Italy; (R.A.); (E.P.)
| | | | | |
Collapse
|
22
|
Monteith W, Pascoe B, Mourkas E, Clark J, Hakim M, Hitchings MD, McCarthy N, Yahara K, Asakura H, Sheppard SK. Contrasting genes conferring short- and long-term biofilm adaptation in Listeria. Microb Genom 2023; 9:001114. [PMID: 37850975 PMCID: PMC10634452 DOI: 10.1099/mgen.0.001114] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 09/28/2023] [Indexed: 10/19/2023] Open
Abstract
Listeria monocytogenes is an opportunistic food-borne bacterium that is capable of infecting humans with high rates of hospitalization and mortality. Natural populations are genotypically and phenotypically variable, with some lineages being responsible for most human infections. The success of L. monocytogenes is linked to its capacity to persist on food and in the environment. Biofilms are an important feature that allow these bacteria to persist and infect humans, so understanding the genetic basis of biofilm formation is key to understanding transmission. We sought to investigate the biofilm-forming ability of L. monocytogenes by identifying genetic variation that underlies biofilm formation in natural populations using genome-wide association studies (GWAS). Changes in gene expression of specific strains during biofilm formation were then investigated using RNA sequencing (RNA-seq). Genetic variation associated with enhanced biofilm formation was identified in 273 genes by GWAS and differential expression in 220 genes by RNA-seq. Statistical analyses show that the number of overlapping genes flagged by either type of experiment is less than expected by random sampling. This novel finding is consistent with an evolutionary scenario where rapid adaptation is driven by variation in gene expression of pioneer genes, and this is followed by slower adaptation driven by nucleotide changes within the core genome.
Collapse
Affiliation(s)
- William Monteith
- Department of Biology, University of Oxford, Oxford, UK
- Department of Biology, University of Bath, Claverton Down, Bath, UK
| | - Ben Pascoe
- Department of Biology, University of Oxford, Oxford, UK
- Big Data Institute, University of Oxford, Oxford, UK
| | | | - Jack Clark
- Department of Genetics, University of Leicester, University Road, Leicester, UK
| | - Maliha Hakim
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson Way, Cambridge, UK
| | - Matthew D. Hitchings
- Swasnsea University Medical School, Swansea University, Singleton Campus, Swansea, UK
| | - Noel McCarthy
- School of Medicine, Trinity College Dublin, Dublin, Ireland
| | - Koji Yahara
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Hiroshi Asakura
- Division of Biomedical Food Research, National Institute of Health Sciences, Tonomachi 3-25-26, Kawasaki-ku, Kawasaki, Kanagawa 210-9501, Japan
| | | |
Collapse
|
23
|
Zang X, Pascoe B, Mourkas E, Kong K, Jiao X, Sheppard SK, Huang J. Evidence of potential Campylobacter jejuni zooanthroponosis in captive macaque populations. Microb Genom 2023; 9:001121. [PMID: 37877958 PMCID: PMC10634442 DOI: 10.1099/mgen.0.001121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 10/09/2023] [Indexed: 10/26/2023] Open
Abstract
Non-human primates share recent common ancestry with humans and exhibit comparable disease symptoms. Here, we explored the transmission potential of enteric bacterial pathogens in monkeys exhibiting symptoms of recurrent diarrhoea in a biomedical research facility in China. The common zoonotic bacterium Campylobacter jejuni was isolated from macaques (Macaca mulatta and Macaca fascicularis) and compared to isolates from humans and agricultural animals in Asia. Among the monkeys sampled, 5 % (44/973) tested positive for C. jejuni, 11 % (5/44) of which displayed diarrhoeal symptoms. Genomic analysis of monkey isolates, and 1254 genomes from various sources in Asia, were used to identify the most likely source of human infection. Monkey and human isolates shared high average nucleotide identity, common MLST clonal complexes and clustered together on a phylogeny. Furthermore, the profiles of putative antimicrobial resistance genes were similar between monkeys and humans. Taken together these findings suggest that housed macaques became infected with C. jejuni either directly from humans or via a common contamination source.
Collapse
Affiliation(s)
- Xiaoqi Zang
- Jiangsu Key Laboratory of Zoonosis, Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education of China, Yangzhou University, Yangzhou, PR China
- Ineos Oxford Institute for Antimicrobial Research, Department of Biology, University of Oxford, Oxford, UK
| | - Ben Pascoe
- Ineos Oxford Institute for Antimicrobial Research, Department of Biology, University of Oxford, Oxford, UK
- Centre for Genomic Pathogen Surveillance, Big Data Institute, University of Oxford, Oxford, UK
| | - Evangelos Mourkas
- Ineos Oxford Institute for Antimicrobial Research, Department of Biology, University of Oxford, Oxford, UK
| | - Ke Kong
- Jiangsu Key Laboratory of Zoonosis, Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, PR China
| | - Xinan Jiao
- Jiangsu Key Laboratory of Zoonosis, Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education of China, Yangzhou University, Yangzhou, PR China
| | - Samuel K. Sheppard
- Ineos Oxford Institute for Antimicrobial Research, Department of Biology, University of Oxford, Oxford, UK
| | - Jinlin Huang
- Jiangsu Key Laboratory of Zoonosis, Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education of China, Yangzhou University, Yangzhou, PR China
| |
Collapse
|
24
|
Chen C, Che S, Dong Z, Sui J, Tian Y, Su Y, Zhang M, Sun W, Fan J, Xie J, Xie H. A genome-wide association study reveals that epistasis underlies the pathogenicity of Pectobacterium. Microbiol Spectr 2023; 11:e0176423. [PMID: 37712699 PMCID: PMC10580964 DOI: 10.1128/spectrum.01764-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 07/28/2023] [Indexed: 09/16/2023] Open
Abstract
Pectobacterium spp. are important bacterial pathogens that cause soft rot symptoms in various crops. However, their mechanism of pathogenicity requires clarity to help control their infections. Here, genome-wide association studies (GWAS) were conducted by integrating genomic data and measurements of two phenotypes (virulence and cellulase activity) for 120 various Pectobacterium strains in order to identify the genetic basis of their pathogenicity. An artificial intelligence-based software program was developed to automatically measure lesion areas on Chinese cabbage, thereby facilitating accurate and rapid data collection for virulence phenotypes for use in GWAS analysis. The analysis discovered 428 and 158 loci significantly associated with Pectobacterium virulence (lesion area) and cellulase activity, respectively. In addition, 1,229 and 586 epistasis loci pairs were identified for the virulence and cellulase activity phenotypes, respectively. Among them, the AraC transcriptional regulator exerted epistasis effects with another three nutrient transport-related genes in pairs contributing to the virulence phenotype, and their epistatic effects were experimentally confirmed for one pair with knockout mutants of each single gene and double gene. This study consequently provides valuable insights into the genetic mechanism underlying Pectobacterium spp. pathogenicity. IMPORTANCE Plant diseases and pests are responsible for the loss of up to 40% of food crops, and annual economic losses caused by plant diseases reach more than $220 billion. Fighting against plant diseases requires an understanding of the pathogenic mechanisms of pathogens. This study adopted an advanced approach using population genomics integrated with virulence-related phenotype data to investigate the genetic basis of Pectobacterium spp., which causes serious crop losses worldwide. An automated software program based on artificial intelligence was developed to measure the virulence phenotype (lesion area), which greatly facilitated this research. The analysis predicted key genomic loci that were highly associated with virulence phenotypes, exhibited epistasis effects, and were further confirmed as critical for virulence with mutant gene deletion experiments. The present study provides new insights into the genetic determinants associated with Pectobacterium pathogenicity and provides a valuable new software resource that can be adapted to improve plant infection measurements.
Collapse
Affiliation(s)
- Changlong Chen
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Shu Che
- Department of Plant Pathology, Nanjing Agricultural University, Nanjing, China
| | - Zhou Dong
- EVision Technology (Beijing) Co. Ltd, Beijing, China
| | - Jiayi Sui
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Yu Tian
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Yanyan Su
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Meng Zhang
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Wangwang Sun
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Jiaqin Fan
- Department of Plant Pathology, Nanjing Agricultural University, Nanjing, China
| | - Jianbo Xie
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Hua Xie
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| |
Collapse
|
25
|
Guo L, Tang J, Tang M, Luo S, Zhou X. Reactive oxygen species are regulated by immune deficiency and Toll pathways in determining the host specificity of honeybee gut bacteria. Proc Natl Acad Sci U S A 2023; 120:e2219634120. [PMID: 37556501 PMCID: PMC10438842 DOI: 10.1073/pnas.2219634120] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 06/26/2023] [Indexed: 08/11/2023] Open
Abstract
Host specificity is observed in gut symbionts of diverse animal lineages. But how hosts maintain symbionts while rejecting their close relatives remains elusive. We use eusocial bees and their codiversified gut bacteria to understand host regulation driving symbiotic specificity. The cross-inoculation of bumblebee Gilliamella induced higher prostaglandin in the honeybee gut, promoting a pronounced host response through immune deficiency (IMD) and Toll pathways. Gene silencing and vitamin C treatments indicate that reactive oxygen species (ROS), not antimicrobial peptides, acts as the effector in inhibiting the non-native strain. Quantitative PCR and RNAi further reveal a regulatory function of the IMD and Toll pathways, in which Relish and dorsal-1 may regulate Dual Oxidase (Duox) for ROS production. Therefore, the honeybee maintains symbiotic specificity by creating a hostile gut environment to exotic bacteria, through differential regulation of its immune system, reflecting a co-opting of existing machinery evolved to combat pathogens.
Collapse
Affiliation(s)
- Lizhen Guo
- Department of Entomology, College of Plant Protection, China Agricultural University, Beijing100083, People’s Republic of China
- Sanya Institute of China Agricultural University, Sanya572000, People’s Republic of China
| | - Junbo Tang
- Department of Entomology, College of Plant Protection, China Agricultural University, Beijing100083, People’s Republic of China
- College of Food Science and Nutritional Engineering, China Agricultural University, Beijing100083, People’s Republic of China
| | - Min Tang
- Department of Entomology, College of Plant Protection, China Agricultural University, Beijing100083, People’s Republic of China
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou215100, People’s Republic of China
| | - Shiqi Luo
- Department of Entomology, College of Plant Protection, China Agricultural University, Beijing100083, People’s Republic of China
| | - Xin Zhou
- Department of Entomology, College of Plant Protection, China Agricultural University, Beijing100083, People’s Republic of China
- Sanya Institute of China Agricultural University, Sanya572000, People’s Republic of China
| |
Collapse
|
26
|
Mehta RS, Petit RA, Read TD, Weissman DB. Detecting patterns of accessory genome coevolution in Staphylococcus aureus using data from thousands of genomes. BMC Bioinformatics 2023; 24:243. [PMID: 37296404 PMCID: PMC10251594 DOI: 10.1186/s12859-023-05363-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 05/26/2023] [Indexed: 06/12/2023] Open
Abstract
Bacterial genomes exhibit widespread horizontal gene transfer, resulting in highly variable genome content that complicates the inference of genetic interactions. In this study, we develop a method for detecting coevolving genes from large datasets of bacterial genomes based on pairwise comparisons of closely related individuals, analogous to a pedigree study in eukaryotic populations. We apply our method to pairs of genes from the Staphylococcus aureus accessory genome of over 75,000 annotated gene families using a database of over 40,000 whole genomes. We find many pairs of genes that appear to be gained or lost in a coordinated manner, as well as pairs where the gain of one gene is associated with the loss of the other. These pairs form networks of rapidly coevolving genes, primarily consisting of genes involved in virulence, mechanisms of horizontal gene transfer, and antibiotic resistance, particularly the SCCmec complex. While we focus on gene gain and loss, our method can also detect genes that tend to acquire substitutions in tandem, or genotype-phenotype or phenotype-phenotype coevolution. Finally, we present the R package DeCoTUR that allows for the computation of our method.
Collapse
Affiliation(s)
- Rohan S Mehta
- Department of Physics, Emory University, Atlanta, GA, USA.
| | - Robert A Petit
- Division of Infectious Diseases, Department of Medicine, School of Medicine, Emory University, Atlanta, GA, USA
- Wyoming Public Health Laboratory, Cheyenne, WY, USA
| | - Timothy D Read
- Division of Infectious Diseases, Department of Medicine, School of Medicine, Emory University, Atlanta, GA, USA
- Department of Human Genetics, School of Medicine, Emory University, Atlanta, GA, USA
| | | |
Collapse
|
27
|
Papudeshi B, Rusch DB, VanInsberghe D, Lively CM, Edwards RA, Bashey F. Host Association and Spatial Proximity Shape but Do Not Constrain Population Structure in the Mutualistic Symbiont Xenorhabdus bovienii. mBio 2023:e0043423. [PMID: 37154562 DOI: 10.1128/mbio.00434-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023] Open
Abstract
To what extent are generalist species cohesive evolutionary units rather than a compilation of recently diverged lineages? We examine this question in the context of host specificity and geographic structure in the insect pathogen and nematode mutualist Xenorhabdus bovienii. This bacterial species partners with multiple nematode species across two clades in the genus Steinernema. We sequenced the genomes of 42 X. bovienii strains isolated from four different nematode species and three field sites within a 240-km2 region and compared them to globally available reference genomes. We hypothesized that X. bovienii would comprise several host-specific lineages, such that bacterial and nematode phylogenies would be largely congruent. Alternatively, we hypothesized that spatial proximity might be a dominant signal, as increasing geographic distance might lower shared selective pressures and opportunities for gene flow. We found partial support for both hypotheses. Isolates clustered largely by nematode host species but did not strictly match the nematode phylogeny, indicating that shifts in symbiont associations across nematode species and clades have occurred. Furthermore, both genetic similarity and gene flow decreased with geographic distance across nematode species, suggesting differentiation and constraints on gene flow across both factors, although no absolute barriers to gene flow were observed across the regional isolates. Several genes associated with biotic interactions were found to be undergoing selective sweeps within this regional population. The interactions included several insect toxins and genes implicated in microbial competition. Thus, gene flow maintains cohesiveness across host associations in this symbiont and may facilitate adaptive responses to a multipartite selective environment. IMPORTANCE Microbial populations and species are notoriously hard to delineate. We used a population genomics approach to examine the population structure and the spatial scale of gene flow in Xenorhabdus bovienii, an intriguing species that is both a specialized mutualistic symbiont of nematodes and a broadly virulent insect pathogen. We found a strong signature of nematode host association, as well as evidence for gene flow connecting isolates associated with different nematode host species and collected from distinct study sites. Furthermore, we saw signatures of selective sweeps for genes involved with nematode host associations, insect pathogenicity, and microbial competition. Thus, X. bovienii exemplifies the growing consensus that recombination not only maintains cohesion but can also allow the spread of niche-beneficial alleles.
Collapse
Affiliation(s)
- Bhavya Papudeshi
- Flinders Accelerator for Microbiome Exploration, Flinders University, Adelaide, Australia
- National Centre for Genome Analysis Support, Pervasive Institute of Technology, Indiana University, Bloomington, Indiana, USA
| | - Douglas B Rusch
- Center for Genomics and Bioinformatics, Indiana University, Bloomington, Indiana, USA
| | | | - Curtis M Lively
- Department of Biology, Indiana University, Bloomington, Indiana, USA
| | - Robert A Edwards
- Flinders Accelerator for Microbiome Exploration, Flinders University, Adelaide, Australia
| | - Farrah Bashey
- Department of Biology, Indiana University, Bloomington, Indiana, USA
| |
Collapse
|
28
|
Tiwari SK, van der Putten BCL, Fuchs TM, Vinh TN, Bootsma M, Oldenkamp R, La Ragione R, Matamoros S, Hoa NT, Berens C, Leng J, Álvarez J, Ferrandis-Vila M, Ritchie JM, Fruth A, Schwarz S, Domínguez L, Ugarte-Ruiz M, Bethe A, Huber C, Johanns V, Stamm I, Wieler LH, Ewers C, Fivian-Hughes A, Schmidt H, Menge C, Semmler T, Schultsz C. Genome-wide association reveals host-specific genomic traits in Escherichia coli. BMC Biol 2023; 21:76. [PMID: 37038177 PMCID: PMC10088187 DOI: 10.1186/s12915-023-01562-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 03/10/2023] [Indexed: 04/12/2023] Open
Abstract
BACKGROUND Escherichia coli is an opportunistic pathogen which colonizes various host species. However, to what extent genetic lineages of E. coli are adapted or restricted to specific hosts and the genomic determinants of such adaptation or restriction is poorly understood. RESULTS We randomly sampled E. coli isolates from four countries (Germany, UK, Spain, and Vietnam), obtained from five host species (human, pig, cattle, chicken, and wild boar) over 16 years, from both healthy and diseased hosts, to construct a collection of 1198 whole-genome sequenced E. coli isolates. We identified associations between specific E. coli lineages and the host from which they were isolated. A genome-wide association study (GWAS) identified several E. coli genes that were associated with human, cattle, or chicken hosts, whereas no genes associated with the pig host could be found. In silico characterization of nine contiguous genes (collectively designated as nan-9) associated with the human host indicated that these genes are involved in the metabolism of sialic acids (Sia). In contrast, the previously described sialic acid regulon known as sialoregulon (i.e. nanRATEK-yhcH, nanXY, and nanCMS) was not associated with any host species. In vitro growth experiments with a Δnan-9 E. coli mutant strain, using the sialic acids 5-N-acetylneuraminic acid (Neu5Ac) and N-glycolylneuraminic acid (Neu5Gc) as sole carbon source, showed impaired growth behaviour compared to the wild-type. CONCLUSIONS This study provides an extensive analysis of genetic determinants which may contribute to host specificity in E. coli. Our findings should inform risk analysis and epidemiological monitoring of (antimicrobial resistant) E. coli.
Collapse
Affiliation(s)
- Sumeet K Tiwari
- Robert Koch Institute, Genome Sequencing and Genomic Epidemiology, Berlin, Germany
- Quadram Institute Bioscience, Gut Microbes and Health Institute Strategic Program, Norwich Research Park, Norwich, UK
| | - Boas C L van der Putten
- Department of Global Health, Amsterdam Institute for Global Health and Development, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands
- Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands
| | - Thilo M Fuchs
- Friedrich-Loeffler-Institut, Institute of Molecular Pathogenesis, Jena, Germany
| | - Trung N Vinh
- Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
- Faculty of Veterinary Medicine, College of Agriculture, Can Tho University, Can Tho, Vietnam
| | | | - Rik Oldenkamp
- Department of Global Health, Amsterdam Institute for Global Health and Development, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands
- Amsterdam Institute for Life and Environment, Faculty of Science, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
| | - Roberto La Ragione
- Department of Comparative Biomedical Sciences, School of Veterinary Medicine, University of Surrey, Guildford, UK
- Department of Microbial Sciences, School of Biosciences, University of Surrey, Guildford, UK
| | - Sebastien Matamoros
- Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands
| | - Ngo T Hoa
- Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
- Tropical medicine and global health, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
- Microbiology- Parasitology Unit, Biomedical Research Center and Microbiology Department, Pham Ngoc Thach University of Medicine, Ho Chi Minh City, Vietnam
| | - Christian Berens
- Friedrich-Loeffler-Institut, Institute of Molecular Pathogenesis, Jena, Germany
| | - Joy Leng
- Department of Comparative Biomedical Sciences, School of Veterinary Medicine, University of Surrey, Guildford, UK
| | - Julio Álvarez
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
- Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Spain
| | | | - Jenny M Ritchie
- Department of Microbial Sciences, School of Biosciences, University of Surrey, Guildford, UK
| | - Angelika Fruth
- Robert Koch Institute, Enteropathogenic Bacteria and Legionella, Wernigerode, Germany
| | - Stefan Schwarz
- Institute of Microbiology and Epizootics, Freie Universität Berlin, Berlin, Germany
- Veterinary Centre for Resistance Research (TZR), Freie Universität Berlin, Berlin, Germany
| | - Lucas Domínguez
- Tropical medicine and global health, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
- Microbiology- Parasitology Unit, Biomedical Research Center and Microbiology Department, Pham Ngoc Thach University of Medicine, Ho Chi Minh City, Vietnam
| | - María Ugarte-Ruiz
- VISAVET Health Surveillance Centre, Complutense University of Madrid, Madrid, Spain
| | - Astrid Bethe
- Institute of Microbiology and Epizootics, Freie Universität Berlin, Berlin, Germany
- Veterinary Centre for Resistance Research (TZR), Freie Universität Berlin, Berlin, Germany
| | - Charlotte Huber
- Institute of Microbiology and Epizootics, Freie Universität Berlin, Berlin, Germany
| | - Vanessa Johanns
- Robert Koch Institute, Advanced Light and Electron Microscopy, Berlin, Germany
| | - Ivonne Stamm
- Vet Med Labor GmbH, Division of IDEXX Laboratories, Kornwestheim, Germany
| | | | - Christa Ewers
- Institute of Hygiene and Infectious Diseases of Animals, Giessen, Germany
| | - Amanda Fivian-Hughes
- Department of Microbial Sciences, School of Biosciences, University of Surrey, Guildford, UK
| | - Herbert Schmidt
- Institute of Food Science and Biotechnology, Department of Food Microbiology and Hygiene, University of Hohenheim, Stuttgart, Germany
| | - Christian Menge
- Friedrich-Loeffler-Institut, Institute of Molecular Pathogenesis, Jena, Germany
| | - Torsten Semmler
- Robert Koch Institute, Genome Sequencing and Genomic Epidemiology, Berlin, Germany.
| | - Constance Schultsz
- Department of Global Health, Amsterdam Institute for Global Health and Development, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands.
- Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands.
| |
Collapse
|
29
|
Ruperao P, Gandham P, Odeny DA, Mayes S, Selvanayagam S, Thirunavukkarasu N, Das RR, Srikanda M, Gandhi H, Habyarimana E, Manyasa E, Nebie B, Deshpande SP, Rathore A. Exploring the sorghum race level diversity utilizing 272 sorghum accessions genomic resources. FRONTIERS IN PLANT SCIENCE 2023; 14:1143512. [PMID: 37008459 PMCID: PMC10063887 DOI: 10.3389/fpls.2023.1143512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 02/22/2023] [Indexed: 06/19/2023]
Abstract
Due to evolutionary divergence, sorghum race populations exhibit significant genetic and morphological variation. A k-mer-based sorghum race sequence comparison identified the conserved k-mers of all 272 accessions from sorghum and the race-specific genetic signatures identified the gene variability in 10,321 genes (PAVs). To understand sorghum race structure, diversity and domestication, a deep learning-based variant calling approach was employed in a set of genotypic data derived from a diverse panel of 272 sorghum accessions. The data resulted in 1.7 million high-quality genome-wide SNPs and identified selective signature (both positive and negative) regions through a genome-wide scan with different (iHS and XP-EHH) statistical methods. We discovered 2,370 genes associated with selection signatures including 179 selective sweep regions distributed over 10 chromosomes. Co-localization of these regions undergoing selective pressure with previously reported QTLs and genes revealed that the signatures of selection could be related to the domestication of important agronomic traits such as biomass and plant height. The developed k-mer signatures will be useful in the future to identify the sorghum race and for trait and SNP markers for assisting in plant breeding programs.
Collapse
Affiliation(s)
- Pradeep Ruperao
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Prasad Gandham
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, LA, United States
| | - Damaris A. Odeny
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Sean Mayes
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | | | - Nepolean Thirunavukkarasu
- Genomics and Molecular Breeding Lab, Indian Council of Agricultural Research (ICAR) - Indian Institute of Millets Research, Hyderabad, India
| | - Roma R. Das
- International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India
| | - Manasa Srikanda
- Department of Statistics, Osmania University, Hyderabad, India
| | - Harish Gandhi
- International Maize and Wheat Improvement Center (CIMMYT), Nairobi, Kenya
| | - Ephrem Habyarimana
- International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India
| | - Eric Manyasa
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Nairobi, Kenya
| | - Baloua Nebie
- International Maize and Wheat Improvement Center (CIMMYT), Dakar, Senegal
| | | | - Abhishek Rathore
- Excellence in Breeding, International Maize and Wheat Improvement Center (CIMMYT), Hyderabad, India
| |
Collapse
|
30
|
Newberry EA, Minsavage GV, Holland A, Jones JB, Potnis N. Genome-Wide Association to Study the Host-Specificity Determinants of Xanthomonas perforans. PHYTOPATHOLOGY 2023; 113:400-412. [PMID: 36318253 DOI: 10.1094/phyto-08-22-0294-r] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Xanthomonas perforans and X. euvesicatoria are the causal agents of bacterial spot disease of tomato and pepper, endemic to the Southeastern United States. Although very closely related, the two bacterial species differ in host specificity, where X. perforans is the dominant pathogen of tomato and X. euvesicatoria that of pepper. This is in part due to the activity of avirulence proteins that are secreted by X. perforans strains and elicit effector-triggered immunity in pepper leaves, thereby restricting pathogen growth. In recent years, the emergence of several pepper-pathogenic X. perforans lineages has revealed variability within the bacterial species to multiply and cause disease in pepper, even in the absence of avirulence gene activity. Here, we investigated the basal evolutionary processes underlying the host range of this species using multiple genome-wide association analyses. Surprisingly, we identified two novel gene candidates that were significantly associated with pepper-pathogenic X. perforans and X. euvesicatoria. Both candidates were predicted to be involved in the transport/acquisition of nutrients common to the plant cell wall or apoplast and included a TonB-dependent receptor, which was disrupted through independent mutations within the X. perforans lineage. The other included a symporter of protons/glutamate, gltP, enriched with pepper-associated mutations near the promoter and start codon of the gene. Functional analysis of these candidates revealed that only the TonB-dependent receptor had a minor effect on the symptom development and growth of X. perforans in pepper leaves, indicating that pathogenicity to this host might have evolved independently within the bacterial species and is likely a complex, multigenic trait.
Collapse
Affiliation(s)
- Eric A Newberry
- Department of Entomology and Plant Pathology, Auburn University, AL 36849
| | | | - Auston Holland
- Department of Entomology and Plant Pathology, Auburn University, AL 36849
| | - Jeffrey B Jones
- Department of Plant Pathology, University of Florida, FL 32611
| | - Neha Potnis
- Department of Entomology and Plant Pathology, Auburn University, AL 36849
| |
Collapse
|
31
|
Myers BK, Shin GY, Agarwal G, Stice SP, Gitaitis RD, Kvitko BH, Dutta B. Genome-wide association and dissociation studies in Pantoea ananatis reveal potential virulence factors affecting Allium porrum and Allium fistulosum × Allium cepa hybrid. Front Microbiol 2023; 13:1094155. [PMID: 36817114 PMCID: PMC9933511 DOI: 10.3389/fmicb.2022.1094155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 12/30/2022] [Indexed: 02/05/2023] Open
Abstract
Pantoea ananatis is a member of a Pantoea species complex that causes center rot of bulb onions (A. cepa) and also infects other Allium crops like leeks (Allium porrum), chives (Allium schoenoprasum), bunching onion or Welsh onion (Allium fistulosum), and garlic (Allium sativum). This pathogen relies on a chromosomal phosphonate biosynthetic gene cluster (HiVir) and a plasmid-borne thiosulfinate tolerance cluster (alt) for onion pathogenicity and virulence, respectively. However, pathogenicity and virulence factors associated with other Allium species remain unknown. We used phenotype-dependent genome-wide association (GWAS) and phenotype-independent gene-pair coincidence (GPC) analyses on a panel of diverse 92 P. ananatis strains, which were inoculated on A. porrum and A. fistulosum × A. cepa under greenhouse conditions. Phenotypic assays showed that, in general, these strains were more aggressive on A. fistulosum × A. cepa as opposed to A. porrum. Of the 92 strains, only six showed highly aggressive foliar lesions on A. porrum compared to A. fistulosum × A. cepa. Conversely, nine strains showed highly aggressive foliar lesions on A. fistulosum × A. cepa compared to A. porrum. These results indicate that there are underlying genetic components in P. ananatis that may drive pathogenicity in these two Allium spp. Based on GWAS for foliar pathogenicity, 835 genes were associated with P. ananatis' pathogenicity on A. fistulosum × A. cepa whereas 243 genes were associated with bacterial pathogenicity on A. porrum. The Hivir as well as the alt gene clusters were identified among these genes. Besides the 'HiVir' and the alt gene clusters that are known to contribute to pathogenicity and virulence from previous studies, genes annotated with functions related to stress responses, a potential toxin-antitoxin system, flagellar-motility, quorum sensing, and a previously described phosphonoglycan biosynthesis (pgb) cluster were identified. The GPC analysis resulted in the identification of 165 individual genes sorted into 39 significant gene-pair association components and 255 genes sorted into 50 significant gene-pair dissociation components. Within the coincident gene clusters, several genes that occurred on the GWAS outputs were associated with each other but dissociated with genes that did not appear in their respective GWAS output. To focus on candidate genes that could explain the difference in virulence between hosts, a comparative genomics analysis was performed on five P. ananatis strains that were differentially pathogenic on A. porrum or A. fistulosum × A. cepa. Here, we found a putative type III secretion system, and several other genes that occurred on both GWAS outputs of both Allium hosts. Further, we also demonstrated utilizing mutational analysis that the pepM gene in the HiVir cluster is important than the pepM gene in the pgb cluster for P. ananatis pathogenicity in A. fistulosum × A. cepa and A. porrum. Overall, our results support that P. ananatis may utilize a common set of genes or gene clusters to induce symptoms on A. fistulosum × A. cepa foliar tissue as well as A. cepa but implicates additional genes for infection on A. porrum.
Collapse
Affiliation(s)
- Brendon K. Myers
- Department of Plant Pathology, The University of Georgia, Tifton, GA, United States
| | - Gi Yoon Shin
- Department of Plant Pathology, The University of Georgia, Athens, GA, United States
| | - Gaurav Agarwal
- Department of Plant Pathology, The University of Georgia, Tifton, GA, United States
| | - Shaun P. Stice
- Department of Plant Pathology, The University of Georgia, Athens, GA, United States
| | - Ronald D. Gitaitis
- Department of Plant Pathology, The University of Georgia, Tifton, GA, United States
| | - Brian H. Kvitko
- Department of Plant Pathology, The University of Georgia, Athens, GA, United States
| | - Bhabesh Dutta
- Department of Plant Pathology, The University of Georgia, Tifton, GA, United States,*Correspondence: Bhabesh Dutta, ✉
| |
Collapse
|
32
|
Strachan CR, Yu XA, Neubauer V, Mueller AJ, Wagner M, Zebeli Q, Selberherr E, Polz MF. Differential carbon utilization enables co-existence of recently speciated Campylobacteraceae in the cow rumen epithelial microbiome. Nat Microbiol 2023; 8:309-320. [PMID: 36635570 PMCID: PMC9894753 DOI: 10.1038/s41564-022-01300-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 12/05/2022] [Indexed: 01/14/2023]
Abstract
The activities of different microbes in the cow rumen have been shown to modulate the host's ability to utilize plant biomass, while the host-rumen interface has received little attention. As datasets collected worldwide have pointed to Campylobacteraceae as particularly abundant members of the rumen epithelial microbiome, we targeted this group in a subset of seven cows with meta- and isolate genome analysis. We show that the dominant Campylobacteraceae lineage has recently speciated into two populations that were structured by genome-wide selective sweeps followed by population-specific gene import and recombination. These processes led to differences in gene expression and enzyme domain composition that correspond to the ability to utilize acetate, the main carbon source for the host, at the cost of inhibition by propionate. This trade-off in competitive ability further manifests itself in differential dynamics of the two populations in vivo. By exploring population-level adaptations that otherwise remain cryptic in culture-independent analyses, our results highlight how recent evolutionary dynamics can shape key functional roles in the rumen microbiome.
Collapse
Affiliation(s)
- Cameron R Strachan
- Institute of Food Safety, Food Technology and Veterinary Public Health, Department for Farm Animals and Public Health, University of Veterinary Medicine Vienna, Vienna, Austria
- Austrian Competence Centre for Feed and Food Quality, Safety and Innovation FFoQSI GmbH, Tulln, Austria
| | - Xiaoqian A Yu
- Division of Microbial Ecology, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
| | - Viktoria Neubauer
- Institute of Food Safety, Food Technology and Veterinary Public Health, Department for Farm Animals and Public Health, University of Veterinary Medicine Vienna, Vienna, Austria
- Austrian Competence Centre for Feed and Food Quality, Safety and Innovation FFoQSI GmbH, Tulln, Austria
| | - Anna J Mueller
- Division of Microbial Ecology, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
- University of Vienna, Doctoral School in Microbiology and Environmental Science, Vienna, Austria
| | - Martin Wagner
- Institute of Food Safety, Food Technology and Veterinary Public Health, Department for Farm Animals and Public Health, University of Veterinary Medicine Vienna, Vienna, Austria
- Austrian Competence Centre for Feed and Food Quality, Safety and Innovation FFoQSI GmbH, Tulln, Austria
| | - Qendrim Zebeli
- Institute of Animal Nutrition and Functional Plant Compounds, Department for Farm Animals and Public Health, University of Veterinary Medicine Vienna, Vienna, Austria
- Christian Doppler Laboratory for Innovative Gut Health Concepts of Livestock, Vienna, Austria
| | - Evelyne Selberherr
- Institute of Food Safety, Food Technology and Veterinary Public Health, Department for Farm Animals and Public Health, University of Veterinary Medicine Vienna, Vienna, Austria.
| | - Martin F Polz
- Division of Microbial Ecology, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria.
| |
Collapse
|
33
|
Deblais L, Jang H, Kauffman M, Gangiredla J, Sawyer M, Basa S, Poelstra JW, Babu US, Harrison LM, Hiett KL, Balan KV, Rajashekara G. Whole genome characterization of thermophilic Campylobacter species isolated from dairy manure in small specialty crop farms of Northeast Ohio. Front Microbiol 2023; 14:1074548. [PMID: 37025625 PMCID: PMC10071015 DOI: 10.3389/fmicb.2023.1074548] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 01/30/2023] [Indexed: 04/08/2023] Open
Abstract
Introduction With more public interest in consuming locally grown produce, small specialty crop farms (SSCF) are a viable and growing segment of the food production chain in the United States. Methods The goal of this study was to investigate the genomic diversity of Campylobacter isolated from dairy manure (n = 69) collected from 10 SSCF in Northeast Ohio between 2018 and 2020. Results A total of 56 C. jejuni and 13 C. coli isolates were sequenced. Multi-locus sequence typing (MLST) identified 22 sequence types (STs), with ST-922 (18%) and ST-61 (13%) predominant in C. jejuni and ST-829 (62%) and ST-1068 (38%) predominant in C. coli. Interestingly, isolates with similar genomic and gene contents were detected within and between SSCF over time, suggesting that Campylobacter could be transmitted between farms and may persist in a given SSCF over time. Virulence-associated genes (n = 35) involved in the uptake and utilization of potassium and organic compounds (succinate, gluconate, oxoglutarate, and malate) were detected only in the C. jejuni isolates, while 45 genes associated with increased resistance to environmental stresses (capsule production, cell envelope integrity, and iron uptake) were detected only in the C. coli isolates. Campylobacter coli isolates were also sub-divided into two distinct clusters based on the presence of unique prophages (n = 21) or IncQ conjugative plasmid/type-IV secretion system genes (n = 15). Campylobacter coli isolates harbored genes associated with resistance to streptomycin (aadE-Cc; 54%) and quinolone (gyrA-T86I; 77%), while C. jejuni had resistance genes for kanamycin (aph3'-IIIa; 20%). Both species harbored resistance genes associated with β-lactam (especially, blaOXA-193; up to 100%) and tetracycline (tetO; up to 59%). Discussion/Conclusion Our study demonstrated that Campylobacter genome plasticity associated with conjugative transfer might provide resistance to certain antimicrobials and viral infections via the acquisition of protein-encoding genes involved in mechanisms such as ribosomal protection and capsule modification.
Collapse
Affiliation(s)
- Loic Deblais
- Department of Animal Sciences, Center for Food Animal Health, The Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, OH, United States
| | - Hyein Jang
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Mike Kauffman
- Department of Animal Sciences, Center for Food Animal Health, The Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, OH, United States
| | - Jayanthi Gangiredla
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Marianne Sawyer
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Saritha Basa
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Jelmer W. Poelstra
- Molecular and Cellular Imaging Center, The Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, OH, United States
| | - Uma S. Babu
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Lisa M. Harrison
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Kelli L. Hiett
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Kannan V. Balan
- Center for Food Safety and Applied Nutrition (CFSAN), Office of Applied Research and Safety Assessment (OARSA), U.S. Food and Drug Administration, Laurel, MD, United States
| | - Gireesh Rajashekara
- Department of Animal Sciences, Center for Food Animal Health, The Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, OH, United States
- *Correspondence: Gireesh Rajashekara,
| |
Collapse
|
34
|
dessouky YE, Elsayed SW, Abdelsalam NA, Saif NA, Álvarez-Ordóñez A, Elhadidy M. Genomic insights into zoonotic transmission and antimicrobial resistance in Campylobacter jejuni from farm to fork: a one health perspective. Gut Pathog 2022; 14:44. [PMID: 36471447 PMCID: PMC9721040 DOI: 10.1186/s13099-022-00517-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 11/08/2022] [Indexed: 12/09/2022] Open
Abstract
BACKGROUND Campylobacteriosis represents a global public health threat with various socio-economic impacts. Among different Campylobacter species, Campylobacter jejuni (C. jejuni) is considered to be the foremost Campylobacter species responsible for most of gastrointestinal-related infections. Although these species are reported to primarily inhabit birds, its high genetic and phenotypic diversity allowed their adaptation to other animal reservoirs and to the environment that may impact on human infection. MAIN BODY A stringent and consistent surveillance program based on high resolution subtyping is crucial. Recently, different epidemiological investigations have implemented high-throughput sequencing technologies and analytical pipelines for higher resolution subtyping, accurate source attribution, and detection of antimicrobial resistance determinants among these species. In this review, we aim to present a comprehensive overview on the epidemiology, clinical presentation, antibiotic resistance, and transmission dynamics of Campylobacter, with specific focus on C. jejuni. This review also summarizes recent attempts of applying whole-genome sequencing (WGS) coupled with bioinformatic algorithms to identify and provide deeper insights into evolutionary and epidemiological dynamics of C. jejuni precisely along the farm-to-fork continuum. CONCLUSION WGS is a valuable addition to traditional surveillance methods for Campylobacter. It enables accurate typing of this pathogen and allows tracking of its transmission sources. It is also advantageous for in silico characterization of antibiotic resistance and virulence determinants, and hence implementation of control measures for containment of infection.
Collapse
Affiliation(s)
- Yara El dessouky
- grid.440881.10000 0004 0576 5483Biomedical Sciences Program, University of Science and Technology, Zewail City of Science and Technology, Giza, Egypt ,grid.440881.10000 0004 0576 5483Center for Genomics, Helmy Institute for Medical Sciences, Zewail City of Science and Technology, Giza, Egypt
| | - Salma W. Elsayed
- grid.440881.10000 0004 0576 5483Biomedical Sciences Program, University of Science and Technology, Zewail City of Science and Technology, Giza, Egypt ,grid.440881.10000 0004 0576 5483Center for Genomics, Helmy Institute for Medical Sciences, Zewail City of Science and Technology, Giza, Egypt ,grid.7269.a0000 0004 0621 1570Department of Microbiology and Immunology, Faculty of Pharmacy, Ain Shams University, Cairo, Egypt
| | - Nehal Adel Abdelsalam
- grid.440881.10000 0004 0576 5483Biomedical Sciences Program, University of Science and Technology, Zewail City of Science and Technology, Giza, Egypt ,grid.440881.10000 0004 0576 5483Center for Genomics, Helmy Institute for Medical Sciences, Zewail City of Science and Technology, Giza, Egypt ,grid.7776.10000 0004 0639 9286Department of Microbiology and Immunology, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Nehal A. Saif
- grid.440881.10000 0004 0576 5483Biomedical Sciences Program, University of Science and Technology, Zewail City of Science and Technology, Giza, Egypt ,grid.440881.10000 0004 0576 5483Center for Genomics, Helmy Institute for Medical Sciences, Zewail City of Science and Technology, Giza, Egypt
| | - Avelino Álvarez-Ordóñez
- grid.4807.b0000 0001 2187 3167Department of Food Hygiene and Technology and Institute of Food Science and Technology, Universidad de León, León, Spain
| | - Mohamed Elhadidy
- grid.440881.10000 0004 0576 5483Biomedical Sciences Program, University of Science and Technology, Zewail City of Science and Technology, Giza, Egypt ,grid.440881.10000 0004 0576 5483Center for Genomics, Helmy Institute for Medical Sciences, Zewail City of Science and Technology, Giza, Egypt ,grid.10251.370000000103426662Department of Bacteriology, Mycology and Immunology, Faculty of Veterinary Medicine, Mansoura University, Mansoura, Egypt
| |
Collapse
|
35
|
Morales-Laverde L, Trobos M, Echeverz M, Solano C, Lasa I. Functional analysis of intergenic regulatory regions of genes encoding surface adhesins in Staphylococcus aureus isolates from periprosthetic joint infections. Biofilm 2022; 4:100093. [PMID: 36408060 PMCID: PMC9667196 DOI: 10.1016/j.bioflm.2022.100093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 10/31/2022] [Accepted: 11/07/2022] [Indexed: 11/09/2022] Open
Abstract
Staphylococcus aureus is a leading cause of prosthetic joint infections (PJI). Surface adhesins play an important role in the primary attachment to plasma proteins that coat the surface of prosthetic devices after implantation. Previous efforts to identify a genetic component of the bacterium that confers an enhanced capacity to cause PJI have focused on gene content, kmers, or single-nucleotide polymorphisms (SNPs) in coding sequences. Here, using a collection of S. aureus strains isolated from PJI and wounds, we investigated whether genetic variations in the regulatory region of genes encoding surface adhesins lead to differences in their expression levels and modulate the capacity of S. aureus to colonize implanted prosthetic devices. The data revealed that S. aureus isolates from the same clonal complex (CC) contain a specific pattern of SNPs in the regulatory region of genes encoding surface adhesins. As a consequence, each clonal lineage shows a specific profile of surface proteins expression. Co-infection experiments with representative isolates of the most prevalent CCs demonstrated that some lineages have a higher capacity to colonize implanted catheters in a murine infection model, which correlated with a greater ability to form a biofilm on coated surfaces with plasma proteins. Together, results indicate that differences in the expression level of surface adhesins may modulate the propensity of S. aureus strains to cause PJI. Given the high conservation of surface proteins among staphylococci, our work lays the framework for investigating how diversification at intergenic regulatory regions affects the capacity of S. aureus to colonize the surface of medical implants.
Collapse
|
36
|
Pei N, Sun W, He J, Li Y, Chen X, Liang T, Kristiansen K, Liu W, Li J. Genome-wide association study of Klebsiella pneumoniae identifies variations linked to carbapenems resistance. Front Microbiol 2022; 13:997769. [PMID: 36386631 PMCID: PMC9664935 DOI: 10.3389/fmicb.2022.997769] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 10/10/2022] [Indexed: 01/25/2023] Open
Abstract
Klebsiella pneumoniae (KP) is one of the microorganisms that can acquire carbapenem-resistance (CR), and few antimicrobial therapy options exist for infections caused by Carbapenem-Resistant KP (CRKP). In recent years, with the increase of carbapenem resistance rates, treating CRKP has become a serious public health threat in clinical practice. We have collected 2,035 clinical KP isolates from a tertiary hospital in China. Whole genome sequencing data coupled with their binary antimicrobial susceptibility testing data were obtained to conduct the genome-wide association study using a bayesian-based method, including single nucleotide polymorphisms (SNPs) and genes. We identified 28 and 37 potential maker genes associated with imipenem and meropenem resistance, respectively. Among which 19 of them were selected in both drugs by genome-wide association study (GWAS), 11 genes among them were simultaneously validated in independent datasets. These genes were likely related to biofilm formation, efflux pump, and DNA repairing. Moreover, we identified 13 significant CR related SNPs in imipenem or meropenem, with one SNP located in the non-coding region and validated in the independent datasets. Our study indicates complex mechanisms of carbapenems resistance and further investigation of CRKP-related factors are warranted to better understand their contributions to carbapenems resistance. These identified biomarkers may provide targets for future drug interventions or treatments.
Collapse
Affiliation(s)
- Na Pei
- BGI-Shenzhen, Shenzhen, China,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | | - Jingxuan He
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Yanming Li
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Xia Chen
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Tianzhu Liang
- BGI-Shenzhen, Shenzhen, China,Shenzhen Key Laboratory of Unknown Pathogen Identification, Shenzhen, China
| | - Karsten Kristiansen
- BGI-Shenzhen, Shenzhen, China,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Wenen Liu
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China,*Correspondence: Junhua Li, ; Wenen Liu,
| | - Junhua Li
- BGI-Shenzhen, Shenzhen, China,Shenzhen Key Laboratory of Unknown Pathogen Identification, Shenzhen, China,*Correspondence: Junhua Li, ; Wenen Liu,
| |
Collapse
|
37
|
McInerney JO. Prokaryotic Pangenomes Act as Evolving Ecosystems. Mol Biol Evol 2022; 40:6775222. [PMID: 36288801 PMCID: PMC9851318 DOI: 10.1093/molbev/msac232] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 10/11/2022] [Accepted: 10/20/2022] [Indexed: 01/24/2023] Open
Abstract
Understanding adaptation to the local environment is a central tenet and a major focus of evolutionary biology. But this is only part of the adaptionist story. In addition to the external environment, one of the main drivers of genome composition is genetic background. In this perspective, I argue that there is a growing body of evidence that intra-genomic selective pressures play a significant part in the composition of prokaryotic genomes and play a significant role in the origin, maintenance and structuring of prokaryotic pangenomes.
Collapse
|
38
|
Didelot X, Parkhill J. A scalable analytical approach from bacterial genomes to epidemiology. Philos Trans R Soc Lond B Biol Sci 2022; 377:20210246. [PMID: 35989600 PMCID: PMC9393561 DOI: 10.1098/rstb.2021.0246] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 02/17/2022] [Indexed: 12/21/2022] Open
Abstract
Recent years have seen a remarkable increase in the practicality of sequencing whole genomes from large numbers of bacterial isolates. The availability of this data has huge potential to deliver new insights into the evolution and epidemiology of bacterial pathogens, but the scalability of the analytical methodology has been lagging behind that of the sequencing technology. Here we present a step-by-step approach for such large-scale genomic epidemiology analyses, from bacterial genomes to epidemiological interpretations. A central component of this approach is the dated phylogeny, which is a phylogenetic tree with branch lengths measured in units of time. The construction of dated phylogenies from bacterial genomic data needs to account for the disruptive effect of recombination on phylogenetic relationships, and we describe how this can be achieved. Dated phylogenies can then be used to perform fine-scale or large-scale epidemiological analyses, depending on the proportion of cases for which genomes are available. A key feature of this approach is computational scalability and in particular the ability to process hundreds or thousands of genomes within a matter of hours. This is a clear advantage of the step-by-step approach described here. We discuss other advantages and disadvantages of the approach, as well as potential improvements and avenues for future research. This article is part of a discussion meeting issue 'Genomic population structures of microbial pathogens'.
Collapse
Affiliation(s)
- Xavier Didelot
- School of Life Sciences and Department of Statistics, University of Warwick, Coventry CV4 7AL, UK
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge CB3 0ES, UK
| |
Collapse
|
39
|
Cardenas-Alvarez MX, Restrepo-Montoya D, Bergholz TM. Genome-Wide Association Study of Listeria monocytogenes Isolates Causing Three Different Clinical Outcomes. Microorganisms 2022; 10:1934. [PMID: 36296210 PMCID: PMC9610272 DOI: 10.3390/microorganisms10101934] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/16/2022] [Accepted: 09/24/2022] [Indexed: 12/05/2022] Open
Abstract
Heterogeneity in virulence potential of L. monocytogenes subgroups have been associated with genetic elements that could provide advantages in certain environments to invade, multiply, and survive within a host. The presence of gene mutations has been found to be related to attenuated phenotypes, while the presence of groups of genes, such as pathogenicity islands (PI), has been associated with hypervirulent or stress-resistant clones. We evaluated 232 whole genome sequences from invasive listeriosis cases in human and ruminants from the US and Europe to identify genomic elements associated with strains causing three clinical outcomes: central nervous system (CNS) infections, maternal-neonatal (MN) infections, and systemic infections (SI). Phylogenetic relationships and virulence-associated genes were evaluated, and a gene-based and single nucleotide polymorphism (SNP)-based genome-wide association study (GWAS) were conducted in order to identify loci associated with the different clinical outcomes. The orthologous results indicated that genes of phage phiX174, transfer RNAs, and type I restriction-modification (RM) system genes along with SNPs in loci involved in environmental adaptation such as rpoB and a phosphotransferase system (PTS) were associated with one or more clinical outcomes. Detection of phenotype-specific candidate loci represents an approach that could narrow the group of genetic elements to be evaluated in future studies.
Collapse
Affiliation(s)
| | | | - Teresa M. Bergholz
- Department of Food Science and Human Nutrition, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
40
|
Almeida RND, Greenberg M, Bundalovic-Torma C, Martel A, Wang PW, Middleton MA, Chatterton S, Desveaux D, Guttman DS. Predictive modeling of Pseudomonas syringae virulence on bean using gradient boosted decision trees. PLoS Pathog 2022; 18:e1010716. [PMID: 35877772 PMCID: PMC9352200 DOI: 10.1371/journal.ppat.1010716] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Revised: 08/04/2022] [Accepted: 06/30/2022] [Indexed: 11/18/2022] Open
Abstract
Pseudomonas syringae is a genetically diverse bacterial species complex responsible for numerous agronomically important crop diseases. Individual P. syringae isolates are assigned pathovar designations based on their host of isolation and the associated disease symptoms, and these pathovar designations are often assumed to reflect host specificity although this assumption has rarely been rigorously tested. Here we developed a rapid seed infection assay to measure the virulence of 121 diverse P. syringae isolates on common bean (Phaseolus vulgaris). This collection includes P. syringae phylogroup 2 (PG2) bean isolates (pathovar syringae) that cause bacterial spot disease and P. syringae phylogroup 3 (PG3) bean isolates (pathovar phaseolicola) that cause the more serious halo blight disease. We found that bean isolates in general were significantly more virulent on bean than non-bean isolates and observed no significant virulence difference between the PG2 and PG3 bean isolates. However, when we compared virulence within PGs we found that PG3 bean isolates were significantly more virulent than PG3 non-bean isolates, while there was no significant difference in virulence between PG2 bean and non-bean isolates. These results indicate that PG3 strains have a higher level of host specificity than PG2 strains. We then used gradient boosting machine learning to predict each strain’s virulence on bean based on whole genome k-mers, type III secreted effector k-mers, and the presence/absence of type III effectors and phytotoxins. Our model performed best using whole genome data and was able to predict virulence with high accuracy (mean absolute error = 0.05). Finally, we functionally validated the model by predicting virulence for 16 strains and found that 15 (94%) had virulence levels within the bounds of estimated predictions. This study strengthens the hypothesis that P. syringae PG2 strains have evolved a different lifestyle than other P. syringae strains as reflected in their lower level of host specificity. It also acts as a proof-of-principle to demonstrate the power of machine learning for predicting host specific adaptation. Pseudomonas syringae is a genetically diverse Gammaproteobacterial species complex responsible for numerous agronomically important crop diseases. Strains in the P. syringae species complex are frequently categorized into pathovars depending on pathogenic characteristics such as host of isolation and disease symptoms. Common bean pathogens from P. syringae are known to cause two major diseases: (1) pathovar phaseolicola strains from phylogroup 3 cause halo blight disease, characterized by large necrotic lesions surrounded by a chlorotic zone or halo of yellow tissue; and (2) pathovar syringae strains from phylogroup 2 causes bacterial spot disease, characterized by brown leaf spots. While halo blight can cause serious crop losses, bacterial spot disease is generally of minor agronomic concern. Recently, statistical genetic and machine learning approaches have been applied to genomic data to identify genes underlying traits of interest or predict the outcome of host-microbe interactions. Here, we apply machine learning to P. syringae genomic data to predict virulence on bean. We first characterized the virulence of P. syringae isolates on common bean using a seed infection assay and then applied machine learning to the genomic data from the same strains to generate a predictive model for virulence on bean. We found that machine learning models built with k-mers from either full genome data or virulence factors could predict bean virulence with high accuracy. We also confirmed prior work showing that phylogroup 3 halo blight pathogens display a stronger degree of phylogenetic clustering and host specificity compared to phylogroup 2 brown spot pathogens. This works serves as a proof-of-principle for the power of machine learning for predicting host specificity and may find utility in agricultural diagnostic microbiology.
Collapse
Affiliation(s)
- Renan N. D. Almeida
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
| | - Michael Greenberg
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
| | | | - Alexandre Martel
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
| | - Pauline W. Wang
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
- Centre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Canada
| | - Maggie A. Middleton
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
- Centre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Canada
| | - Syama Chatterton
- Agriculture and Agri-Food Canada, Lethbridge Research and Development Centre, Lethbridge, Canada
| | - Darrell Desveaux
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
| | - David S. Guttman
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
- Centre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Canada
- * E-mail:
| |
Collapse
|
41
|
Liu W, Li W, Zheng H, Kwok LY, Sun Z. Genomics divergence of Lactococcus lactis subsp. lactis isolated from naturally fermented dairy products. Food Res Int 2022; 155:111108. [DOI: 10.1016/j.foodres.2022.111108] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 03/04/2022] [Accepted: 03/06/2022] [Indexed: 12/13/2022]
|
42
|
The Retrospective on Atypical Brucella Species Leads to Novel Definitions. Microorganisms 2022; 10:microorganisms10040813. [PMID: 35456863 PMCID: PMC9025488 DOI: 10.3390/microorganisms10040813] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/11/2022] [Accepted: 04/12/2022] [Indexed: 02/01/2023] Open
Abstract
The genus Brucella currently comprises twelve species of facultative intracellular bacteria with variable zoonotic potential. Six of them have been considered as classical, causing brucellosis in terrestrial mammalian hosts, with two species originated from marine mammals. In the past fifteen years, field research as well as improved pathogen detection and typing have allowed the identification of four new species, namely Brucella microti, Brucella inopinata, Brucella papionis, Brucella vulpis, and of numerous strains, isolated from a wide range of hosts, including for the first time cold-blooded animals. While their genome sequences are still highly similar to those of classical strains, some of them are characterized by atypical phenotypes such as higher growth rate, increased resistance to acid stress, motility, and lethality in the murine infection model. In our review, we provide an overview of state-of-the-art knowledge about these novel Brucella sp., with emphasis on their phylogenetic positions in the genus, their metabolic characteristics, acid stress resistance mechanisms, and their behavior in well-established in cellulo and in vivo infection models. Comparison of phylogenetic classification and phenotypical properties between classical and novel Brucella species and strains finally lead us to propose a more adapted terminology, distinguishing between core and non-core, and typical versus atypical brucellae, respectively.
Collapse
|
43
|
Chaguza C, Ebruke C, Senghore M, Lo SW, Tientcheu PE, Gladstone RA, Tonkin-Hill G, Cornick JE, Yang M, Worwui A, McGee L, Breiman RF, Klugman KP, Kadioglu A, Everett DB, Mackenzie G, Croucher NJ, Roca A, Kwambana-Adams BA, Antonio M, Bentley SD. Comparative Genomics of Disease and Carriage Serotype 1 Pneumococci. Genome Biol Evol 2022; 14:evac052. [PMID: 35439297 PMCID: PMC9048925 DOI: 10.1093/gbe/evac052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/12/2022] [Indexed: 11/14/2022] Open
Abstract
The isolation of Streptococcus pneumoniae serotypes in systemic tissues of patients with invasive disease versus the nasopharynx of healthy individuals with asymptomatic carriage varies widely. Some serotypes are hyper-invasive, particularly serotype 1, but the underlying genetics remain poorly understood due to the rarity of carriage isolates, reducing the power of comparison with invasive isolates. Here, we use a well-controlled genome-wide association study to search for genetic variation associated with invasiveness of serotype 1 pneumococci from a serotype 1 endemic setting in Africa. We found no consensus evidence that certain genomic variation is overrepresented among isolates from patients with invasive disease than asymptomatic carriage. Overall, the genomic variation explained negligible phenotypic variability, suggesting a minimal effect on the disease status. Furthermore, changes in lineage distribution were seen with lineages replacing each other over time, highlighting the importance of continued pathogen surveillance. Our findings suggest that the hyper-invasiveness is an intrinsic property of the serotype 1 strains, not specific for a "disease-associated" subpopulation disproportionately harboring unique genomic variation.
Collapse
Affiliation(s)
- Chrispin Chaguza
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Darwin College, University of Cambridge, Silver Street, Cambridge, UK
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Chinelo Ebruke
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
| | - Madikay Senghore
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- Department of Epidemiology, Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Stephanie W. Lo
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Peggy-Estelle Tientcheu
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
| | - Rebecca A. Gladstone
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Department of Biostatistics, University of Oslo, Oslo, Norway
| | - Gerry Tonkin-Hill
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Jennifer E. Cornick
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- Malawi-Liverpool-Wellcome Trust Clinical Research Programme, Blantyre, Malawi
| | - Marie Yang
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Archibald Worwui
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
| | - Lesley McGee
- Respiratory Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Robert F. Breiman
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Keith P. Klugman
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Aras Kadioglu
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Dean B. Everett
- College of Medicine and Health Sciences, Khalifa University, Abu Dhabi, UAE
| | - Grant Mackenzie
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- Murdoch Children’s Research Institute, Parkville, Melbourne, VIC, Australia
- London School of Hygiene & Tropical Medicine, London, UK
| | - Nicholas J. Croucher
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, UK
| | - Anna Roca
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- London School of Hygiene & Tropical Medicine, London, UK
| | - Brenda A. Kwambana-Adams
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- NIHR Global Health Research Unit on Mucosal Pathogens, Division of Infection and Immunity, University College London, London, UK
| | - Martin Antonio
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- London School of Hygiene & Tropical Medicine, London, UK
- Warwick Medical School, University of Warwick, Coventry, UK
| | - Stephen D. Bentley
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| |
Collapse
|
44
|
Genome-Wide Association Study Reveals Host Factors Affecting Conjugation in Escherichia coli. Microorganisms 2022; 10:microorganisms10030608. [PMID: 35336183 PMCID: PMC8954029 DOI: 10.3390/microorganisms10030608] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 03/08/2022] [Accepted: 03/10/2022] [Indexed: 02/04/2023] Open
Abstract
The emergence and dissemination of antibiotic resistance threaten the treatment of common bacterial infections. Resistance genes are often encoded on conjugative elements, which can be horizontally transferred to diverse bacteria. In order to delay conjugative transfer of resistance genes, more information is needed on the genetic determinants promoting conjugation. Here, we focus on which bacterial host factors in the donor assist transfer of conjugative plasmids. We introduced the broad-host-range plasmid pKJK10 into a diverse collection of 113 Escherichia coli strains and measured by flow cytometry how effectively each strain transfers its plasmid to a fixed E. coli recipient. Differences in conjugation efficiency of up to 2.7 and 3.8 orders of magnitude were observed after mating for 24 h and 48 h, respectively. These differences were linked to the underlying donor strain genetic variants in genome-wide association studies, thereby identifying candidate genes involved in conjugation. We confirmed the role of fliF, fliK, kefB and ucpA in the donor ability of conjugative elements by validating defects in the conjugation efficiency of the corresponding lab strain single-gene deletion mutants. Based on the known cellular functions of these genes, we suggest that the motility and the energy supply, the intracellular pH or salinity of the donor affect the efficiency of plasmid transfer. Overall, this work advances the search for targets for the development of conjugation inhibitors, which can be administered alongside antibiotics to more effectively treat bacterial infections.
Collapse
|
45
|
Bundalovic-Torma C, Desveaux D, Guttman DS. RecPD: A Recombination-aware measure of phylogenetic diversity. PLoS Comput Biol 2022; 18:e1009899. [PMID: 35192600 PMCID: PMC8896707 DOI: 10.1371/journal.pcbi.1009899] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 03/04/2022] [Accepted: 02/07/2022] [Indexed: 12/22/2022] Open
Abstract
A critical step in studying biological features (e.g., genetic variants, gene families, metabolic capabilities, or taxa) is assessing their diversity and distribution among a sample of individuals. Accurate assessments of these patterns are essential for linking features to traits or outcomes of interest and understanding their functional impact. Consequently, it is of crucial importance that the measures employed for quantifying feature diversity can perform robustly under any evolutionary scenario. However, the standard measures used for quantifying and comparing the distribution of features, such as prevalence, phylogenetic diversity, and related approaches, either do not take into consideration evolutionary history, or assume strictly vertical patterns of inheritance. Consequently, these approaches cannot accurately assess diversity for features that have undergone recombination or horizontal transfer. To address this issue, we have devised RecPD, a novel recombination-aware phylogenetic-diversity statistic for measuring the distribution and diversity of features under all evolutionary scenarios. RecPD utilizes ancestral-state reconstruction to map the presence / absence of features onto ancestral nodes in a species tree, and then identifies potential recombination events in the evolutionary history of the feature. We also derive several related measures from RecPD that can be used to assess and quantify evolutionary dynamics and correlation of feature evolutionary histories. We used simulation studies to show that RecPD reliably reconstructs feature evolutionary histories under diverse recombination and loss scenarios. We then applied RecPD in two diverse real-world scenarios including a preliminary study type III effector protein families secreted by the plant pathogenic bacterium Pseudomonas syringae and growth phenotypes of the Pseudomonas genus and demonstrate that prevalence is an inadequate measure that obscures the potential impact of recombination. We believe RecPD will have broad utility for revealing and quantifying complex evolutionary processes for features at any biological level. Phylogenetic diversity is an important concept utilized in evolutionary ecology which has extensive applications in population genetics to help us understand how evolutionary processes have distributed genetic variation among individuals of a species, and how this impacts phenotypic diversification over time. However, existing approaches for studying phylogenetic diversity largely assume that the genetic features follow vertical inheritance, which is frequently violated in the case of microbial genomes due to horizontal transfer. To address this shortcoming, we present RecPD, a recombination-aware phylogenetic diversity measure, which incorporates ancestral state reconstruction to quantify the phylogenetic diversity of genetic features mapped onto a species phylogeny. Through simulation experiments we show that RecPD robustly reconstructs the evolutionary histories of features evolving under various scenarios of recombination and loss. When applied to a real-world example of type III secreted effector protein families from the plant pathogenic bacterium Pseudomonas syringae, RecPD reveals that horizontal transfer has played an important role in shaping the phylogenetic distributions of a substantial proportion of families across the P. syringae species complex. Furthermore, we demonstrate that the traditional measures of feature prevalence are unsuitable as a measure for comparing feature diversity. We also provide a R package implementation of RecPD for public use: https://github.com/cedatorma/recpd.
Collapse
Affiliation(s)
| | - Darrell Desveaux
- Department of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
- Centre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Ontario, Canada
| | - David S. Guttman
- Department of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
- Centre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| |
Collapse
|
46
|
Abstract
MOTIVATION Genome-wide association studies (GWAS), aiming to find genetic variants associated with a trait, have widely been used on bacteria to identify genetic determinants of drug resistance or hypervirulence. Recent bacterial GWAS methods usually rely on k-mers, whose presence in a genome can denote variants ranging from single-nucleotide polymorphisms to mobile genetic elements. This approach does not require a reference genome, making it easier to account for accessory genes. However, a same gene can exist in slightly different versions across different strains, leading to diluted effects. RESULTS Here, we overcome this issue by testing covariates built from closed connected subgraphs (CCSs) of the de Bruijn graph defined over genomic k-mers. These covariates capture polymorphic genes as a single entity, improving k-mer-based GWAS both in terms of power and interpretability. However, a method naively testing all possible subgraphs would be powerless due to multiple testing corrections, and the mere exploration of these subgraphs would quickly become computationally intractable. The concept of testable hypothesis has successfully been used to address both problems in similar contexts. We leverage this concept to test all CCSs by proposing a novel enumeration scheme for these objects which fully exploits the pruning opportunity offered by testability, resulting in drastic improvements in computational efficiency. Our method integrates with existing visual tools to facilitate interpretation. AVAILABILITY AND IMPLEMENTATION We provide an implementation of our method, as well as code to reproduce all results at https://github.com/HectorRDB/Caldera_ISMB. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Leandro Lima
- European Bioinformatics Institute, Cambridge CB10 1SD, UK
| | | | - Arnaud Mary
- Univ. Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR 5558, Villeurbanne 69100, France
| | - Sandrine Dudoit
- Division of Biostatistics, Department of Statistics, University of California, Berkeley, CA 94704, USA
| | - Laurent Jacob
- To whom correspondence should be addressed. E-mail: or
| |
Collapse
|
47
|
Tuan VP, Yahara K, Dung HDQ, Binh TT, Huu Tung P, Tri TD, Thuan NPM, Khien VV, Trang TTH, Phuc BH, Tshibangu-Kabamba E, Matsumoto T, Akada J, Suzuki R, Okimoto T, Kodama M, Murakami K, Yano H, Fukuyo M, Takahashi N, Kato M, Nishiumi S, Azuma T, Ogura Y, Hayashi T, Toyoda A, Kobayashi I, Yamaoka Y. Genome-wide association study of gastric cancer- and duodenal ulcer-derived Helicobacter pylori strains reveals discriminatory genetic variations and novel oncoprotein candidates. Microb Genom 2021; 7. [PMID: 34846284 PMCID: PMC8743543 DOI: 10.1099/mgen.0.000680] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Genome-wide association studies (GWASs) can reveal genetic variations associated with a phenotype in the absence of any hypothesis of candidate genes. The problem of false-positive sites linked with the responsible site might be bypassed in bacteria with a high homologous recombination rate, such as Helicobacter pylori, which causes gastric cancer. We conducted a small-sample GWAS (125 gastric cancer cases and 115 controls) followed by prediction of gastric cancer and control (duodenal ulcer) H. pylori strains. We identified 11 single nucleotide polymorphisms (eight amino acid changes) and three DNA motifs that, combined, allowed effective disease discrimination. They were often informative of the underlying molecular mechanisms, such as electric charge alteration at the ligand-binding pocket, alteration in subunit interaction, and mode-switching of DNA methylation. We also identified three novel virulence factors/oncoprotein candidates. These results provide both defined targets for further informatic and experimental analyses to gain insights into gastric cancer pathogenesis and a basis for identifying a set of biomarkers for distinguishing these H. pylori-related diseases.
Collapse
Affiliation(s)
- Vo Phuoc Tuan
- Department of Endoscopy, Cho Ray Hospital, Ho Chi Minh, Vietnam
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Oita, Japan
| | - Koji Yahara
- Antimicrobial Resistance ResearchCenter, National Institute of Infectious Diseases, Tokyo, Japan
- *Correspondence: Koji Yahara,
| | | | - Tran Thanh Binh
- Department of Endoscopy, Cho Ray Hospital, Ho Chi Minh, Vietnam
| | - Pham Huu Tung
- Department of Endoscopy, Cho Ray Hospital, Ho Chi Minh, Vietnam
| | - Tran Dinh Tri
- Department of Endoscopy, Cho Ray Hospital, Ho Chi Minh, Vietnam
| | | | - Vu Van Khien
- Department of GI Endoscopy, 108 Central Hospital, Hanoi, Vietnam
| | | | - Bui Hoang Phuc
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Oita, Japan
- Department of Microbiology, Cho Ray Hospital, Ho Chi Minh, Vietnam
| | | | - Takashi Matsumoto
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Oita, Japan
| | - Junko Akada
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Oita, Japan
| | - Rumiko Suzuki
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Oita, Japan
| | - Tadayoshi Okimoto
- Department of Gastroenterology, Oita University Faculty of Medicine, Yufu, Oita, Japan
| | - Masaaki Kodama
- Department of Gastroenterology, Oita University Faculty of Medicine, Yufu, Oita, Japan
| | - Kazunari Murakami
- Department of Gastroenterology, Oita University Faculty of Medicine, Yufu, Oita, Japan
| | - Hirokazu Yano
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Tokyo, Japan
- Graduate School of Life Sciences, Tohoku University, Sendai, Japan
| | - Masaki Fukuyo
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Tokyo, Japan
- Department of Molecular Oncology, Chiba University, Chiba, Japan
| | - Noriko Takahashi
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Tokyo, Japan
- Department of Infectious Diseases, Kyorin University School of Medicine, Mitaka City, Tokyo, Japan
| | - Mototsugu Kato
- Division of Endoscopy, Hokkaido University Hospital, Sapporo, Hokkaido, Japan
- Department of Gastroenterology, National Hospital Organization Hakodate Hospital, Hakodate, Hokkaido, Japan
| | - Shin Nishiumi
- Department of Gastroenterology, Graduate School of Medicine, Kobe University, Chuou-ku, Kobe, Hyogo, Japan
- Department of Omics Medicine, Hyogo College of Medicine, Hyogo, Japan
| | - Takashi Azuma
- Department of Gastroenterology, Graduate School of Medicine, Kobe University, Chuou-ku, Kobe, Hyogo, Japan
| | - Yoshitoshi Ogura
- Department of Bacteriology, Faculty of Medical Sciences, Kyushu University, Fukuoka, Japan
- Division of Microbiology, Department of Infectious Medicine, Kurume University School of Medicine, Kurume, Fukuoka, Japan
| | - Tetsuya Hayashi
- Department of Bacteriology, Faculty of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Atsushi Toyoda
- Advanced GenomicsCenter, National Institute of Genetics, Shizuoka, Japan
| | - Ichizo Kobayashi
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Tokyo, Japan
- Department of Infectious Diseases, Kyorin University School of Medicine, Mitaka City, Tokyo, Japan
- Research Center for Micro-Nano Technology, Hosei University, Tokyo, Japan
- *Correspondence: Ichizo Kobayashi, ;
| | - Yoshio Yamaoka
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Oita, Japan
- Department of Medicine, gastroenterology section, Baylor College of Medicine, Houston TX, USA
- *Correspondence: Yoshio Yamaoka,
| |
Collapse
|
48
|
Yang W, Lin YC, Johnson W, Dai N, Vaisvila R, Weigele P, Lee YJ, Corrêa IR, Schildkraut I, Ettwiller L. A Genome-Phenome Association study in native microbiomes identifies a mechanism for cytosine modification in DNA and RNA. eLife 2021; 10:70021. [PMID: 34747693 PMCID: PMC8670742 DOI: 10.7554/elife.70021] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 11/05/2021] [Indexed: 11/13/2022] Open
Abstract
Shotgun metagenomic sequencing is a powerful approach to study microbiomes in an unbiased manner and of increasing relevance for identifying novel enzymatic functions. However, the potential of metagenomics to relate from microbiome composition to function has thus far been underutilized. Here, we introduce the Metagenomics Genome-Phenome Association (MetaGPA) study framework, which allows linking genetic information in metagenomes with a dedicated functional phenotype. We applied MetaGPA to identify enzymes associated with cytosine modifications in environmental samples. From the 2365 genes that met our significance criteria, we confirm known pathways for cytosine modifications and proposed novel cytosine-modifying mechanisms. Specifically, we characterized and identified a novel nucleic acid-modifying enzyme, 5-hydroxymethylcytosine carbamoyltransferase, that catalyzes the formation of a previously unknown cytosine modification, 5-carbamoyloxymethylcytosine, in DNA and RNA. Our work introduces MetaGPA as a novel and versatile tool for advancing functional metagenomics. Many industrial processes, such as starch processing and oil refinement, use chemicals that cause harm to the environment. These can often be switched to more sustainable biological processes that are powered by proteins called enzymes. Enzymes are micro-factories that speed up biochemical reactions in most living things. Communities of microorganisms (also known as microbiomes) are an amazing but often untapped resource for discovering enzymes that can be harnessed for industrial purposes. To gain a better picture of the microbes present within a population, researchers often extract and sequence the genetic material of all microorganisms in an environmental sample, also known as the metagenome. While current methods for analyzing the metagenome are good at identifying new species, they often provide limited information about the microorganism’s functional role within the community. This makes it difficult to find new enzymes that may be useful for industry. Here, Yang, Lin et al. have developed a new technique called Metagenomics Genome-Phenome Association, or MetaGPA for short. The method works in a similar way to genome-wide association studies (GWAS) which are used to identify genes involved in human disease. However, instead of disease associated genes in humans, MetaGPA finds microbial genes that are associated with a biological process useful for biotechnology. Like GWAS, the new approach created by Yang, Lin et al. compares two groups: the first contains microorganisms that carry out a specific process, and the second contains all organisms in the microbiome. The metagenome of each group is extracted and a computational pipeline is then applied to identify genes, including those coding for enzymes, that are found more often in the group performing the desired task. To test the technique, Yang, Lin et al. used MetGPA to find new enzymes involved in DNA modification. Microbiome samples were collected from coastal water and sewage, and the computational pipeline was applied to discover genes that are associated with this process. Further analysis revealed that one of the identified genes codes for an enzyme that introduces a previously unknown change to DNA. MetaGPA could be applied to other processes and microbiomes, and, if successful, may help researchers to identify more diverse enzymes than is currently available. This could scale up the discovery of new enzymes that can be used to power industrial reactions.
Collapse
Affiliation(s)
- Weiwei Yang
- Research department, New England Biolabs Inc, Ipswich, United States
| | - Yu-Cheng Lin
- Research department, New England Biolabs Inc, Ipswich, United States
| | - William Johnson
- Research department, New England Biolabs Inc, Ipswich, United States
| | - Nan Dai
- RNA Biology, New England Biolabs Inc, Ipswich, United States
| | | | - Peter Weigele
- Research department, New England Biolabs Inc, Ipswich, United States
| | - Yan-Jiun Lee
- Research department, New England Biolabs Inc, Ipswich, United States
| | - Ivan R Corrêa
- RNA Biology, New England Biolabs Inc, Ipswich, United States
| | - Ira Schildkraut
- Research department, New England Biolabs Inc, Ipswich, United States
| | | |
Collapse
|
49
|
Young BC, Wu CH, Charlesworth J, Earle S, Price JR, Gordon NC, Cole K, Dunn L, Liu E, Oakley S, Godwin H, Fung R, Miller R, Knox K, Votintseva A, Quan TP, Tilley R, Scarborough M, Crook DW, Peto TE, Walker AS, Llewelyn MJ, Wilson DJ. Antimicrobial resistance determinants are associated with Staphylococcus aureus bacteraemia and adaptation to the healthcare environment: a bacterial genome-wide association study. Microb Genom 2021; 7:000700. [PMID: 34812717 PMCID: PMC8743558 DOI: 10.1099/mgen.0.000700] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 09/30/2021] [Indexed: 12/30/2022] Open
Abstract
Staphylococcus aureus is a major bacterial pathogen in humans, and a dominant cause of severe bloodstream infections. Globally, antimicrobial resistance (AMR) in S. aureus remains challenging. While human risk factors for infection have been defined, contradictory evidence exists for the role of bacterial genomic variation in S. aureus disease. To investigate the contribution of bacterial lineage and genomic variation to the development of bloodstream infection, we undertook a genome-wide association study comparing bacteria from 1017 individuals with bacteraemia to 984 adults with asymptomatic S. aureus nasal carriage. Within 984 carriage isolates, we also compared healthcare-associated (HA) carriage with community-associated (CA) carriage. All major global lineages were represented in both bacteraemia and carriage, with no evidence for different infection rates. However, kmers tagging trimethoprim resistance-conferring mutation F99Y in dfrB were significantly associated with bacteraemia-vs-carriage (P=10-8.9-10-9.3). Pooling variation within genes, bacteraemia-vs-carriage was associated with the presence of mecA (HMP=10-5.3) as well as the presence of SCCmec (HMP=10-4.4). Among S. aureus carriers, no lineages were associated with HA-vs-CA carriage. However, we found a novel signal of HA-vs-CA carriage in the foldase protein prsA, where kmers representing conserved sequence allele were associated with CA carriage (P=10-7.1-10-19.4), while in gyrA, a ciprofloxacin resistance-conferring mutation, L84S, was associated with HA carriage (P=10-7.2). In an extensive study of S. aureus bacteraemia and nasal carriage in the UK, we found strong evidence that all S. aureus lineages are equally capable of causing bloodstream infection, and of being carried in the healthcare environment. Genomic variation in the foldase protein prsA is a novel genomic marker of healthcare origin in S. aureus but was not associated with bacteraemia. AMR determinants were associated with both bacteraemia and healthcare-associated carriage, suggesting that AMR increases the propensity not only to survive in healthcare environments, but also to cause invasive disease.
Collapse
Affiliation(s)
- Bernadette C. Young
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Chieh-Hsi Wu
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Jane Charlesworth
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Sarah Earle
- Big Data Institute, Nuffield Department of Population Health, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Old Road Campus, Oxford, OX3 7LF, UK
| | - James R. Price
- Department of Infectious Diseases and Microbiology, Royal Sussex County Hospital, Brighton BN2 5BE, UK
- Department of Global Health and Infection, Brighton and Sussex Medical School, University of Sussex, Falmer BN1 9PS, UK
| | - N. Claire Gordon
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Kevin Cole
- Department of Infectious Diseases and Microbiology, Royal Sussex County Hospital, Brighton BN2 5BE, UK
- Department of Global Health and Infection, Brighton and Sussex Medical School, University of Sussex, Falmer BN1 9PS, UK
| | - Laura Dunn
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Elian Liu
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Sarah Oakley
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Heather Godwin
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Rowena Fung
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Ruth Miller
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Kyle Knox
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
| | - Antonina Votintseva
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - T. Phuong Quan
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
- National Institute for Health Research, Oxford Biomedical Research Centre, Oxford, UK
- NIHR Health Protection Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
| | - Robert Tilley
- Department of Microbiology, University Hospitals Plymouth NHS Trust, Derriford Hospital, Plymouth PL6 8DH, UK
| | - Matthew Scarborough
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Derrick W. Crook
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
- National Institute for Health Research, Oxford Biomedical Research Centre, Oxford, UK
- NIHR Health Protection Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
| | - Timothy E. Peto
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
- Microbiology and Infectious Diseases Department, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
- National Institute for Health Research, Oxford Biomedical Research Centre, Oxford, UK
- NIHR Health Protection Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
| | - A. Sarah Walker
- Nuffield Department of Medicine, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
- National Institute for Health Research, Oxford Biomedical Research Centre, Oxford, UK
- NIHR Health Protection Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
| | - Martin J. Llewelyn
- Department of Infectious Diseases and Microbiology, Royal Sussex County Hospital, Brighton BN2 5BE, UK
- Department of Global Health and Infection, Brighton and Sussex Medical School, University of Sussex, Falmer BN1 9PS, UK
| | - Daniel J. Wilson
- Big Data Institute, Nuffield Department of Population Health, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Old Road Campus, Oxford, OX3 7LF, UK
| |
Collapse
|
50
|
Arning N, Sheppard SK, Bayliss S, Clifton DA, Wilson DJ. Machine learning to predict the source of campylobacteriosis using whole genome data. PLoS Genet 2021; 17:e1009436. [PMID: 34662334 PMCID: PMC8553134 DOI: 10.1371/journal.pgen.1009436] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2021] [Revised: 10/28/2021] [Accepted: 08/26/2021] [Indexed: 11/18/2022] Open
Abstract
Campylobacteriosis is among the world's most common foodborne illnesses, caused predominantly by the bacterium Campylobacter jejuni. Effective interventions require determination of the infection source which is challenging as transmission occurs via multiple sources such as contaminated meat, poultry, and drinking water. Strain variation has allowed source tracking based upon allelic variation in multi-locus sequence typing (MLST) genes allowing isolates from infected individuals to be attributed to specific animal or environmental reservoirs. However, the accuracy of probabilistic attribution models has been limited by the ability to differentiate isolates based upon just 7 MLST genes. Here, we broaden the input data spectrum to include core genome MLST (cgMLST) and whole genome sequences (WGS), and implement multiple machine learning algorithms, allowing more accurate source attribution. We increase attribution accuracy from 64% using the standard iSource population genetic approach to 71% for MLST, 85% for cgMLST and 78% for kmerized WGS data using the classifier we named aiSource. To gain insight beyond the source model prediction, we use Bayesian inference to analyse the relative affinity of C. jejuni strains to infect humans and identified potential differences, in source-human transmission ability among clonally related isolates in the most common disease causing lineage (ST-21 clonal complex). Providing generalizable computationally efficient methods, based upon machine learning and population genetics, we provide a scalable approach to global disease surveillance that can continuously incorporate novel samples for source attribution and identify fine-scale variation in transmission potential.
Collapse
Affiliation(s)
- Nicolas Arning
- Big Data institute, Nuffield Department of Population Health, University of Oxford, Li Ka Shing Centre for Health Information and Discovery, Old Road Campus, Oxford, United Kingdom
- * E-mail:
| | - Samuel K. Sheppard
- The Milner Centre of Evolution, Department of Biology & Biochemistry, University of Bath, Claverton Down, Bath, United Kingdom
| | - Sion Bayliss
- The Milner Centre of Evolution, Department of Biology & Biochemistry, University of Bath, Claverton Down, Bath, United Kingdom
| | - David A. Clifton
- Department of Engineering Science, University of Oxford, Oxford, UK; Oxford-Suzhou Centre for Advanced Research, Suzhou, China
| | - Daniel J. Wilson
- Big Data institute, Nuffield Department of Population Health, University of Oxford, Li Ka Shing Centre for Health Information and Discovery, Old Road Campus, Oxford, United Kingdom
| |
Collapse
|