1
|
Tantoso E, Eisenhaber B, Kirsch M, Shitov V, Zhao Z, Eisenhaber F. To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131. BMC Biol 2022; 20:146. [PMID: 35710371 PMCID: PMC9205054 DOI: 10.1186/s12915-022-01347-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 06/08/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome analysis has been attempted several times, there is no systematic functional characterization of the E. coli subgroups according to the gene profile. RESULTS Systematically scanning for optimal parametrization, we have built the E. coli pangenome from 1324 complete genomes. The pangenome size is estimated to be ~25,000 gene families (GFs). Whereas the core genome diminishes as more genomes are added, the softcore genome (≥95% of strains) is stable with ~3000 GFs regardless of the total number of genomes. Apparently, the softcore genome (with a 92% or 95% generation threshold) can define the genome of a bacterial species listing the critically relevant, evolutionarily most conserved or important classes of GFs. Unsupervised clustering of common E. coli sequence types using the presence/absence GF matrix reveals distinct characteristics of E. coli phylogroups B1, B2, and E. We highlight the bi-lineage nature of B1, the variation of the secretion and of the iron acquisition systems in ST11 (E), and the incorporation of a highly conserved prophage into the genome of ST131 (B2). The tail structure of the prophage is evolutionarily related to R2-pyocin (a tailocin) from Pseudomonas aeruginosa PAO1. We hypothesize that this molecular machinery is highly likely to play an important role in protecting its own colonies; thus, contributing towards the rapid rise of pandemic E. coli ST131. CONCLUSIONS This study has explored the optimized pangenome development in E. coli. We provide complete GF lists and the pangenome matrix as supplementary data for further studies. We identified biological characteristics of different E. coli subtypes, specifically for phylogroups B1, B2, and E. We found an operon-like genome region coding for a tailocin specific for ST131 strains. The latter is a potential killer weapon providing pandemic E. coli ST131 with an advantage in inter-bacterial competition and, suggestively, explains their dominance as human pathogen among E. coli strains.
Collapse
Affiliation(s)
- Erwin Tantoso
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.,Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore
| | - Birgit Eisenhaber
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.,Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore
| | - Miles Kirsch
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore.,Present address: Northeastern University, Boston, USA
| | - Vladimir Shitov
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore
| | - Zhiya Zhao
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore.,Present address: The University of Cambridge, Cambridge, UK
| | - Frank Eisenhaber
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore. .,Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore. .,School of Biological Sciences (SBS), Nanyang Technological University (NTU), 60 Nanyang Drive, 637551, Singapore, Republic of Singapore.
| |
Collapse
|
2
|
Ecology and Function of the Transmissible Locus of Stress Tolerance in Escherichia coli and Plant-Associated Enterobacteriaceae. mSystems 2021; 6:e0037821. [PMID: 34402641 PMCID: PMC8407380 DOI: 10.1128/msystems.00378-21] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The transmissible locus of stress tolerance (tLST) is a genomic island which confers resistance to heat and chlorine. In this study, we determined that the tLST is frequent in genomes of those Enterobacteriaceae that occur in association with plants as well as the intestines of humans and animals and are relevant as nosocomial pathogens, e.g., Klebsiella and Cronobacter species. The tLST is more frequent in environmental and clinical isolates of Klebsiella pneumoniae than in animal isolates, and heat and chlorine resistance of tLST-positive strains of K. pneumoniae matched the resistance of tLST-positive strains of Escherichia coli. The function of 13 tLST genes was determined by assessing the heat and chlorine resistance of E. coli MG1655 mutants. The deletion of sHsp20, clpKGI, sHspGI, pscA, pscB, and hdeDGI reduced both heat and chlorine resistance; deletion of kefB reduced only chlorine resistance. Genes coding for heat shock proteins sHsp20, clpKGI, and sHspGI decreased the oxidation of cytoplasmic proteins, while kefB decreased the oxidation of membrane lipids. The fitness cost of the tLST for E. coli MG1655 was assessed by pairwise competition experiments with isogenic tLST-positive or tLST-negative strains. The tLST imposes a fitness cost that is compensated for by frequent and lethal challenges with chlorine. All core genes need to be present to maintain the ecological advantage relative to the fitness cost. Taken together, core tLST genes are necessary to provide protection for E. coli against heat and chlorine stress, and the selective pressure for the tLST maintains core genes. IMPORTANCE The transmissible locus of stress tolerance (tLST) is a genomic island comprising 10 core genes that occurs in diverse Enterobacteriaceae and confers resistance to heat and chlorine. Experimentation described in the manuscript describes the physiological function of the core genes by characterization of the resistance of 13 single-knockout (KO) mutants and by characterization of protein and membrane oxidation in these strains after chlorine challenge. Results identify tLST resistance as a genomic island that is specific for those Enterobacteriaceae that occur in plant-associated habitats as well in the intestines of vertebrates. In addition, the ecological function of the genomic island was characterized by large-scale genomic analysis and competition experiments of wild-type and mutant strains. Results suggest that tLST-mediated resistance to chlorine may contribute to the persistence of nosocomial pathogens in hospitals.
Collapse
|