1
|
Choudhury ST, Piper KR, Montoya-Giraldo M, Ikhimiukor OO, Dettman JR, Kassen R, Andam CP. Heterogeneity in recombination rates and accessory gene co-occurrence distinguish Pseudomonas aeruginosa phylogroups. mSystems 2025; 10:e0030125. [PMID: 40304385 PMCID: PMC12090758 DOI: 10.1128/msystems.00301-25] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2025] [Accepted: 04/07/2025] [Indexed: 05/02/2025] Open
Abstract
Pseudomonas aeruginosa (class Gammaproteobacteria) is a ubiquitous, ecologically widespread, and metabolically versatile species. It is also an opportunistic pathogen that causes a variety of chronic and acute infections in humans. Its ability to thrive in diverse environments and exhibit a wide range of phenotypes lies in part on its large gene pool, but the processes that govern inter-strain genomic variation remain unclear. Here, we aim to characterize the recombination features and accessory genome structure of P. aeruginosa using 840 globally distributed genome sequences. The species can be subdivided into five phylogenetic sequence clusters (corresponding to known phylogroups), two of which are most prominent. Notable epidemic clones are found in the two phylogroups: ST17, ST111, ST146, ST274, and ST395 in phylogroup 1, and ST235 and ST253 in phylogroup 2. The two phylogroups differ in the frequency and characteristics of homologous recombination in their core genomes, including the specific genes that most frequently recombine and the impact of recombination on sequence diversity. Each phylogroup's accessory genome is characterized by a unique gene pool, co-occurrence networks of shared genes, and anti-phage defense systems. Different pools of antimicrobial resistance and virulence genes exist in the two phylogroups and display dissimilar patterns of co-occurrence. Altogether, our results indicate that each phylogroup displays distinct histories and patterns of acquiring exogenous DNA, which may contribute in part to their predominance in the global population. Our study has important implications for understanding the genome dynamics, within-species heterogeneity, and clinically relevant traits of P. aeruginosa. IMPORTANCE The consummate opportunist Pseudomonas aeruginosa inhabits many nosocomial and non-clinical environments, posing a major health burden worldwide. Our study reveals phylogroup-specific differences in recombination features and co-occurrence networks of accessory genes within the species. This genomic variation partly explains its remarkable ability to exhibit diverse ecological and phenotypic traits, and thus contribute to circumventing clinical and public health intervention strategies to contain it. Our results may help inform efforts to control and prevent P. aeruginosa diseases, including managing transmission, therapeutic efforts, and pathogen circulation in non-clinical environmental reservoirs.
Collapse
Affiliation(s)
- Samara T. Choudhury
- Department of Biological Sciences, University at Albany, State University of New York, Albany, New York, USA
| | - Kathryn R. Piper
- Department of Biological Sciences, University at Albany, State University of New York, Albany, New York, USA
| | - Manuela Montoya-Giraldo
- Department of Biological Sciences, University at Albany, State University of New York, Albany, New York, USA
| | - Odion O. Ikhimiukor
- Department of Biological Sciences, University at Albany, State University of New York, Albany, New York, USA
| | - Jeremy R. Dettman
- Ottawa Research and Development Centre, Agriculture & Agri-Food Canada, Ottawa, Ontario, Canada
| | - Rees Kassen
- Department of Biology, McGill University, Montreal, Québec, Canada
| | - Cheryl P. Andam
- Department of Biological Sciences, University at Albany, State University of New York, Albany, New York, USA
| |
Collapse
|
2
|
Salamzade R, Kalan LR. Context matters: assessing the impacts of genomic background and ecology on microbial biosynthetic gene cluster evolution. mSystems 2025; 10:e0153824. [PMID: 39992097 PMCID: PMC11915812 DOI: 10.1128/msystems.01538-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2025] Open
Abstract
Encoded within many microbial genomes, biosynthetic gene clusters (BGCs) underlie the synthesis of various secondary metabolites that often mediate ecologically important functions. Several studies and bioinformatics methods developed over the past decade have advanced our understanding of both microbial pangenomes and BGC evolution. In this minireview, we first highlight challenges in broad evolutionary analysis of BGCs, including delineation of BGC boundaries and clustering of BGCs across genomes. We further summarize key findings from microbial comparative genomics studies on BGC conservation across taxa and habitats and discuss the potential fitness effects of BGCs in different settings. Afterward, recent research showing the importance of genomic context on the production of secondary metabolites and the evolution of BGCs is highlighted. These studies draw parallels to recent, broader, investigations on gene-to-gene associations within microbial pangenomes. Finally, we describe mechanisms by which microbial pangenomes and BGCs evolve, ranging from the acquisition or origination of entire BGCs to micro-evolutionary trends of individual biosynthetic genes. An outlook on how expansions in the biosynthetic capabilities of some taxa might support theories that open pangenomes are the result of adaptive evolution is also discussed. We conclude with remarks about how future work leveraging longitudinal metagenomics across diverse ecosystems is likely to significantly improve our understanding on the evolution of microbial genomes and BGCs.
Collapse
Affiliation(s)
- Rauf Salamzade
- Department of Medical Microbiology and Immunology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Microbiology Doctoral Training Program, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Lindsay R. Kalan
- Department of Medical Microbiology and Immunology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA
- M.G. DeGroote Institute for Infectious Disease Research, David Braley Center for Antibiotic Discovery, McMaster University, Hamilton, Ontario, Canada
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada
| |
Collapse
|
3
|
Gamblin J, Lambert A, Blanquart F. Persistent, Private, and Mobile Genes: A Model for Gene Dynamics in Evolving Pangenomes. Mol Biol Evol 2025; 42:msaf001. [PMID: 39812022 PMCID: PMC11781223 DOI: 10.1093/molbev/msaf001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 11/22/2024] [Accepted: 12/17/2024] [Indexed: 01/16/2025] Open
Abstract
The pangenome of a species is the set of all genes carried by at least one member of the species. In bacteria, pangenomes can be much larger than the set of genes carried by a single organism. Many questions remain unanswered regarding the evolutionary forces shaping the patterns of the presence/absence of genes in pangenomes of a given species. We introduce a new model for bacterial pangenome evolution along a species phylogeny that explicitly describes the timing of appearance of each gene in the species and accounts for three generic types of gene evolutionary dynamics: persistent genes that are present in the ancestral genome, private genes that are specific to a given clade, and mobile genes that are imported once into the gene pool and then undergo frequent horizontal gene transfers. We call this model the Persistent-Private-Mobile (PPM) model. We develop an algorithm fitting the PPM model and apply it to a dataset of 902 Salmonella enterica genomes. We show that the best fitting model is able to reproduce the global pattern of some multivariate statistics like the gene frequency spectrum and the parsimony vs. frequency plot. Moreover, the gene classification induced by the PPM model allows us to study the position of accessory genes on the chromosome depending on their category, as well as the gene functions that are most present in each category. This work paves the way for a mechanistic understanding of pangenome evolution, and the PPM model developed here could be used for dynamics-aware gene classification.
Collapse
Affiliation(s)
- Jasmine Gamblin
- Center for Interdisciplinary Research in Biology (CIRB), College de France, CNRS, INSERM, Université PSL, Paris, France
| | - Amaury Lambert
- Center for Interdisciplinary Research in Biology (CIRB), College de France, CNRS, INSERM, Université PSL, Paris, France
- Institut de Biologie de l’ENS (IBENS), École Normale Supérieure (ENS), CNRS, INSERM, Université PSL, Paris, France
| | - François Blanquart
- Center for Interdisciplinary Research in Biology (CIRB), College de France, CNRS, INSERM, Université PSL, Paris, France
| |
Collapse
|
4
|
Eren AM, Banfield JF. Modern microbiology: Embracing complexity through integration across scales. Cell 2024; 187:5151-5170. [PMID: 39303684 PMCID: PMC11450119 DOI: 10.1016/j.cell.2024.08.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Revised: 08/14/2024] [Accepted: 08/14/2024] [Indexed: 09/22/2024]
Abstract
Microbes were the only form of life on Earth for most of its history, and they still account for the vast majority of life's diversity. They convert rocks to soil, produce much of the oxygen we breathe, remediate our sewage, and sustain agriculture. Microbes are vital to planetary health as they maintain biogeochemical cycles that produce and consume major greenhouse gases and support large food webs. Modern microbiologists analyze nucleic acids, proteins, and metabolites; leverage sophisticated genetic tools, software, and bioinformatic algorithms; and process and integrate complex and heterogeneous datasets so that microbial systems may be harnessed to address contemporary challenges in health, the environment, and basic science. Here, we consider an inevitably incomplete list of emergent themes in our discipline and highlight those that we recognize as the archetypes of its modern era that aim to address the most pressing problems of the 21st century.
Collapse
Affiliation(s)
- A Murat Eren
- Helmholtz Institute for Functional Marine Biodiversity, 26129 Oldenburg, Germany; Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, Bremerhaven, Germany; Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Oldenburg, Germany; Marine Biological Laboratory, Woods Hole, MA, USA; Max Planck Institute for Marine Microbiology, Bremen, Germany.
| | - Jillian F Banfield
- Department of Earth and Planetary Sciences, University of California, Berkeley, Berkeley, CA, USA; Earth and Environmental Sciences, Lawrence Berkeley National Laboratory, Berkeley, CA, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA; Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia; Department of Environmental Science Policy, and Management, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
5
|
Dewar AE, Hao C, Belcher LJ, Ghoul M, West SA. Bacterial lifestyle shapes pangenomes. Proc Natl Acad Sci U S A 2024; 121:e2320170121. [PMID: 38743630 PMCID: PMC11126918 DOI: 10.1073/pnas.2320170121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 04/06/2024] [Indexed: 05/16/2024] Open
Abstract
Pangenomes vary across bacteria. Some species have fluid pangenomes, with a high proportion of genes varying between individual genomes. Other species have less fluid pangenomes, with different genomes tending to contain the same genes. Two main hypotheses have been suggested to explain this variation: differences in species' bacterial lifestyle and effective population size. However, previous studies have not been able to test between these hypotheses because the different features of lifestyle and effective population size are highly correlated with each other, and phylogenetically conserved, making it hard to disentangle their relative importance. We used phylogeny-based analyses, across 126 bacterial species, to tease apart the causal role of different factors. We found that pangenome fluidity was lower in i) host-associated compared with free-living species and ii) host-associated species that are obligately dependent on a host, live inside cells, and are more pathogenic and less motile. In contrast, we found no support for the competing hypothesis that larger effective population sizes lead to more fluid pangenomes. Effective population size appears to correlate with pangenome variation because it is also driven by bacterial lifestyle, rather than because of a causal relationship.
Collapse
Affiliation(s)
- Anna E. Dewar
- Department of Biology, University of Oxford, OxfordOX1 3SZ, United Kingdom
| | - Chunhui Hao
- Department of Biology, University of Oxford, OxfordOX1 3SZ, United Kingdom
| | | | - Melanie Ghoul
- Department of Biology, University of Oxford, OxfordOX1 3SZ, United Kingdom
| | - Stuart A. West
- Department of Biology, University of Oxford, OxfordOX1 3SZ, United Kingdom
| |
Collapse
|
6
|
Deery J, Carmody M, Flavin R, Tomanek M, O'Keeffe M, McGlacken GP, Reen FJ. Comparative genomics reveals distinct diversification patterns among LysR-type transcriptional regulators in the ESKAPE pathogen Pseudomonas aeruginosa. Microb Genom 2024; 10:001205. [PMID: 38421269 PMCID: PMC10926688 DOI: 10.1099/mgen.0.001205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 02/06/2024] [Indexed: 03/02/2024] Open
Abstract
Pseudomonas aeruginosa, a harmful nosocomial pathogen associated with cystic fibrosis and burn wounds, encodes for a large number of LysR-type transcriptional regulator proteins. To understand how and why LTTR proteins evolved with such frequency and to establish whether any relationships exist within the distribution we set out to identify the patterns underpinning LTTR distribution in P. aeruginosa and to uncover cluster-based relationships within the pangenome. Comparative genomic studies revealed that in the JGI IMG database alone ~86 000 LTTRs are present across the sequenced genomes (n=699). They are widely distributed across the species, with core LTTRs present in >93 % of the genomes and accessory LTTRs present in <7 %. Analysis showed that subsets of core LTTRs can be classified as either variable (typically specific to P. aeruginosa) or conserved (and found to be distributed in other Pseudomonas species). Extending the analysis to the more extensive Pseudomonas database, PA14 rooted analysis confirmed the diversification patterns and revealed PqsR, the receptor for the Pseudomonas quinolone signal (PQS) and 2-heptyl-4-quinolone (HHQ) quorum-sensing signals, to be amongst the most variable in the dataset. Successful complementation of the PAO1 pqsR - mutant using representative variant pqsR sequences suggests a degree of structural promiscuity within the most variable of LTTRs, several of which play a prominent role in signalling and communication. These findings provide a new insight into the diversification of LTTR proteins within the P. aeruginosa species and suggests a functional significance to the cluster, conservation and distribution patterns identified.
Collapse
Affiliation(s)
- Jamie Deery
- School of Microbiology, University College Cork, Cork, Ireland
| | - Muireann Carmody
- School of Microbiology, University College Cork, Cork, Ireland
- School of Chemistry, University College Cork, Cork, Ireland
| | - Rhiannon Flavin
- School of Microbiology, University College Cork, Cork, Ireland
| | - Malwina Tomanek
- School of Microbiology, University College Cork, Cork, Ireland
| | - Maria O'Keeffe
- School of Microbiology, University College Cork, Cork, Ireland
| | - Gerard P. McGlacken
- School of Chemistry, University College Cork, Cork, Ireland
- Synthesis and Solid State Pharmaceutical Centre, University College Cork, Cork, Ireland
| | - F. Jerry Reen
- School of Microbiology, University College Cork, Cork, Ireland
- Synthesis and Solid State Pharmaceutical Centre, University College Cork, Cork, Ireland
| |
Collapse
|
7
|
Beavan A, Domingo-Sananes MR, McInerney JO. Contingency, repeatability, and predictability in the evolution of a prokaryotic pangenome. Proc Natl Acad Sci U S A 2024; 121:e2304934120. [PMID: 38147560 PMCID: PMC10769857 DOI: 10.1073/pnas.2304934120] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 11/05/2023] [Indexed: 12/28/2023] Open
Abstract
Pangenomes exhibit remarkable variability in many prokaryotic species, much of which is maintained through the processes of horizontal gene transfer and gene loss. Repeated acquisitions of near-identical homologs can easily be observed across pangenomes, leading to the question of whether these parallel events potentiate similar evolutionary trajectories, or whether the remarkably different genetic backgrounds of the recipients mean that postacquisition evolutionary trajectories end up being quite different. In this study, we present a machine learning method that predicts the presence or absence of genes in the Escherichia coli pangenome based on complex patterns of the presence or absence of other accessory genes within a genome. Our analysis leverages the repeated transfer of genes through the E. coli pangenome to observe patterns of repeated evolution following similar events. We find that the presence or absence of a substantial set of genes is highly predictable from other genes alone, indicating that selection potentiates and maintains gene-gene co-occurrence and avoidance relationships deterministically over long-term bacterial evolution and is robust to differences in host evolutionary history. We propose that at least part of the pangenome can be understood as a set of genes with relationships that govern their likely cohabitants, analogous to an ecosystem's set of interacting organisms. Our findings indicate that intragenomic gene fitness effects may be key drivers of prokaryotic evolution, influencing the repeated emergence of complex gene-gene relationships across the pangenome.
Collapse
Affiliation(s)
- Alan Beavan
- School of Life Sciences, The University of Nottingham, NottinghamNG7 2UH, United Kingdom
| | - Maria Rosa Domingo-Sananes
- School of Life Sciences, The University of Nottingham, NottinghamNG7 2UH, United Kingdom
- School of Science and Technology, Nottingham Trent University, NottinghamNG1 4FQ, United Kingdom
| | - James O. McInerney
- School of Life Sciences, The University of Nottingham, NottinghamNG7 2UH, United Kingdom
| |
Collapse
|
8
|
Hall JPJ. Microbial Musings: Spring 2023. Microbiology (Reading) 2023; 169. [PMID: 37561018 DOI: 10.1099/mic.0.001379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/11/2023] Open
Affiliation(s)
- James P. J. Hall
- Department of Evolution, Ecology and Behaviour, Institute for Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| |
Collapse
|
9
|
Abondio P, Cilli E, Luiselli D. Human Pangenomics: Promises and Challenges of a Distributed Genomic Reference. Life (Basel) 2023; 13:1360. [PMID: 37374141 DOI: 10.3390/life13061360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/02/2023] [Accepted: 06/08/2023] [Indexed: 06/29/2023] Open
Abstract
A pangenome is a collection of the common and unique genomes that are present in a given species. It combines the genetic information of all the genomes sampled, resulting in a large and diverse range of genetic material. Pangenomic analysis offers several advantages compared to traditional genomic research. For example, a pangenome is not bound by the physical constraints of a single genome, so it can capture more genetic variability. Thanks to the introduction of the concept of pangenome, it is possible to use exceedingly detailed sequence data to study the evolutionary history of two different species, or how populations within a species differ genetically. In the wake of the Human Pangenome Project, this review aims at discussing the advantages of the pangenome around human genetic variation, which are then framed around how pangenomic data can inform population genetics, phylogenetics, and public health policy by providing insights into the genetic basis of diseases or determining personalized treatments, targeting the specific genetic profile of an individual. Moreover, technical limitations, ethical concerns, and legal considerations are discussed.
Collapse
Affiliation(s)
- Paolo Abondio
- Laboratory of Ancient DNA, Department of Cultural Heritage, University of Bologna, Via degli Ariani 1, 48121 Ravenna, Italy
| | - Elisabetta Cilli
- Laboratory of Ancient DNA, Department of Cultural Heritage, University of Bologna, Via degli Ariani 1, 48121 Ravenna, Italy
| | - Donata Luiselli
- Laboratory of Ancient DNA, Department of Cultural Heritage, University of Bologna, Via degli Ariani 1, 48121 Ravenna, Italy
| |
Collapse
|