1
|
Gamblin J, Lambert A, Blanquart F. Persistent, Private, and Mobile Genes: A Model for Gene Dynamics in Evolving Pangenomes. Mol Biol Evol 2025; 42:msaf001. [PMID: 39812022 PMCID: PMC11781223 DOI: 10.1093/molbev/msaf001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 11/22/2024] [Accepted: 12/17/2024] [Indexed: 01/16/2025] Open
Abstract
The pangenome of a species is the set of all genes carried by at least one member of the species. In bacteria, pangenomes can be much larger than the set of genes carried by a single organism. Many questions remain unanswered regarding the evolutionary forces shaping the patterns of the presence/absence of genes in pangenomes of a given species. We introduce a new model for bacterial pangenome evolution along a species phylogeny that explicitly describes the timing of appearance of each gene in the species and accounts for three generic types of gene evolutionary dynamics: persistent genes that are present in the ancestral genome, private genes that are specific to a given clade, and mobile genes that are imported once into the gene pool and then undergo frequent horizontal gene transfers. We call this model the Persistent-Private-Mobile (PPM) model. We develop an algorithm fitting the PPM model and apply it to a dataset of 902 Salmonella enterica genomes. We show that the best fitting model is able to reproduce the global pattern of some multivariate statistics like the gene frequency spectrum and the parsimony vs. frequency plot. Moreover, the gene classification induced by the PPM model allows us to study the position of accessory genes on the chromosome depending on their category, as well as the gene functions that are most present in each category. This work paves the way for a mechanistic understanding of pangenome evolution, and the PPM model developed here could be used for dynamics-aware gene classification.
Collapse
Affiliation(s)
- Jasmine Gamblin
- Center for Interdisciplinary Research in Biology (CIRB), College de France, CNRS, INSERM, Université PSL, Paris, France
| | - Amaury Lambert
- Center for Interdisciplinary Research in Biology (CIRB), College de France, CNRS, INSERM, Université PSL, Paris, France
- Institut de Biologie de l’ENS (IBENS), École Normale Supérieure (ENS), CNRS, INSERM, Université PSL, Paris, France
| | - François Blanquart
- Center for Interdisciplinary Research in Biology (CIRB), College de France, CNRS, INSERM, Université PSL, Paris, France
| |
Collapse
|
2
|
Ramos B, Cunha MV. Genomic epidemiology of Staphylococcus aureus from the Iberian Peninsula highlights the expansion of livestock associated-CC398 towards wildlife. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 933:173027. [PMID: 38729368 DOI: 10.1016/j.scitotenv.2024.173027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 05/04/2024] [Accepted: 05/04/2024] [Indexed: 05/12/2024]
Abstract
Staphylococcus aureus is a versatile pathobiont, exhibiting a broad host range, including humans, other mammals, and avian species. Host specificity determinants, virulence, and antimicrobial resistance genes are often shared by strains circulating at the animal-human interface. While transmission dynamics studies have shown strain exchange between humans and livestock, knowledge of the source, genetic diversification, and transmission drivers of S. aureus in wildlife lag behind. In this work, we explore a wide array of S. aureus genomes from different sources in the Iberian Peninsula to understand population structure, gene content and niche adaptation at the human-livestock-wildlife nexus. Through Bayesian inference, we address the hypothesis that S. aureus strains in wildlife originate from humanized landscapes, either from contact with humans or through interactions with livestock. Phylogenetic reconstruction applied to whole genome sequence data was completed with a dataset of 450 isolates featuring multiple clones from the 1990-2022 period and a subset of CC398 strains representing the 2008-2022 period. Phylodynamic signatures of S. aureus from the Iberian Peninsula suggest widespread circulation of most clones among humans before jumping to other hosts. The number of transitions of CC398 strains within each host category (human, livestock, wildlife) was high (88.26 %), while the posterior probability of transitions from livestock to wildlife was remarkably high (0.99). Microbial genome-wide association analysis did not evidence genome rearrangements nor biomarkers suggesting S. aureus niche adaptation to wildlife, thus supporting recent spill overs. Altogether, our findings indicate that S. aureus isolates collected in the past years from wildlife most likely represent multiple introduction events from livestock. The clonal origin of CC398 and its potential to disseminate and evolve through different animal host species are highlighted, calling for management practices at the livestock-wildlife axis to improve biosecurity and thus restrict S. aureus transmission and niche expansion along gradients of human influence.
Collapse
Affiliation(s)
- Beatriz Ramos
- Centre for Ecology, Evolution and Environmental Changes (cE3c) & CHANGE - Global Change and Sustainability Institute, Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal; Biosystems and Integrative Sciences Institute (BioISI), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal
| | - Mónica V Cunha
- Centre for Ecology, Evolution and Environmental Changes (cE3c) & CHANGE - Global Change and Sustainability Institute, Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal; Biosystems and Integrative Sciences Institute (BioISI), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal.
| |
Collapse
|
3
|
van Hal SJ, Jensen SO, Tong SYC, Bentley S, Holden MT. Unravelling the complex interplay between antibiotic consumption and adaptive changes in methicillin-resistant Staphylococcus aureus. J Antimicrob Chemother 2024; 79:891-896. [PMID: 38412336 DOI: 10.1093/jac/dkae048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/29/2024] [Indexed: 02/29/2024] Open
Abstract
OBJECTIVES This study aims to elucidate the genomic dynamics driving the emergence of antimicrobial resistance (AMR), with a specific focus on the interplay between AMR and antimicrobial usage. METHODS We conducted a comprehensive analysis using a ST239 methicillin-resistant Staphylococcus aureus (MRSA) dataset over a continuous 12-year period from a single hospital. Genomic analyses were performed tracking the changes in MRSA populations, particularly the emergence of reduced vancomycin susceptibility, and assessing the impact of glycopeptide use on these emergence events. RESULTS Our findings reveal a significant correlation between hospital glycopeptide usage and the selection of MRSA strains with reduced vancomycin susceptibility. Genomic analyses provided insights into the molecular mechanisms driving resistance emergence, including the slowing of the molecular clock rate in response to heightened antimicrobial consumption. CONCLUSIONS In conclusion, this study the highlights the complex dynamics between AMR and antimicrobial use at the hospital level. The observed correlation between antimicrobial consumption and the development of less susceptible MRSA strains underscores the importance of antimicrobial stewardship programmes and the establishment of optimal consumption thresholds for mitigating AMR effectively.
Collapse
Affiliation(s)
- Sebastiaan J van Hal
- Department of Microbiology and Infectious Diseases, Royal Prince Alfred Hospital, Sydney, Australia
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Sydney, Australia
- Antimicrobial Resistance and Mobile Elements Group, Ingham Institute for Applied Medical Research, Sydney, NSW, Australia
| | - Slade O Jensen
- Antimicrobial Resistance and Mobile Elements Group, Ingham Institute for Applied Medical Research, Sydney, NSW, Australia
- Microbiology and Infectious Diseases, School of Medicine, Western Sydney University, Sydney, NSW, Australia
| | - Stephen Y C Tong
- Victorian Infectious Diseases Service, The Royal Melbourne Hospital, at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
- Department of Infectious Diseases, The University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Stephen Bentley
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Matthew T Holden
- School of Medicine, University of St Andrews, St Andrews, Fife KY16 9TF, UK
| |
Collapse
|
4
|
Grandchamp A, Czuppon P, Bornberg-Bauer E. Quantification and modeling of turnover dynamics of de novo transcripts in Drosophila melanogaster. Nucleic Acids Res 2024; 52:274-287. [PMID: 38000384 PMCID: PMC10783523 DOI: 10.1093/nar/gkad1079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/13/2023] [Accepted: 10/28/2023] [Indexed: 11/26/2023] Open
Abstract
Most of the transcribed eukaryotic genomes are composed of non-coding transcripts. Among these transcripts, some are newly transcribed when compared to outgroups and are referred to as de novo transcripts. De novo transcripts have been shown to play a major role in genomic innovations. However, little is known about the rates at which de novo transcripts are gained and lost in individuals of the same species. Here, we address this gap and estimate the de novo transcript turnover rate with an evolutionary model. We use DNA long reads and RNA short reads from seven geographically remote samples of inbred individuals of Drosophila melanogaster to detect de novo transcripts that are gained on a short evolutionary time scale. Overall, each sampled individual contains around 2500 unspliced de novo transcripts, with most of them being sample specific. We estimate that around 0.15 transcripts are gained per year, and that each gained transcript is lost at a rate around 5× 10-5 per year. This high turnover of transcripts suggests frequent exploration of new genomic sequences within species. These rate estimates are essential to comprehend the process and timescale of de novo gene birth.
Collapse
Affiliation(s)
- Anna Grandchamp
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Peter Czuppon
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Department of Protein Evolution, Max Planck Institute for Biology, Tübingen, Germany
| |
Collapse
|
5
|
van Hal SJ, Whiley DM, Le T, Ray S, Kundu RL, Kerr E, Lahra MM. Rapid expansion of Neisseria gonorrhoeae ST7827 clone in Australia, with variable ceftriaxone phenotype unexplained by genotype. J Antimicrob Chemother 2023; 78:2203-2208. [PMID: 37452731 DOI: 10.1093/jac/dkad221] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 07/01/2023] [Indexed: 07/18/2023] Open
Abstract
BACKGROUND Neisseria gonorrhoeae is identified as a priority pathogen due to its capacity to rapidly develop antimicrobial resistance (AMR). Following the easing of SARS-CoV-2 pandemic travel restrictions across international borders in the state of New South Wales (NSW), Australia, a surge of gonococcal isolates with raised ceftriaxone MIC values were detected. METHODS All N. gonorrhoeae isolates (n = 150) with increased ceftriaxone MIC values in NSW between 1 January 2021 and July 2022 from males and females from all sites were sequenced. RESULTS A new emergence and rapid expansion of an N. gonorrhoeae ST7827 clone was documented within NSW, Australia and provides further evidence of the ability of N. gonorrhoeae to undergo sufficient genomic changes and re-emerge as a geographically restricted subclone. Mapping AMR determinants to MIC results did not reveal any genomic pattern that correlated with MIC values. CONCLUSIONS The rapid dissemination and establishment of this clone at the population level is a new and concerning demonstration of the agility of this pathogen, and underscores concerns about similar incursions and establishment of MDR clones. Moreover, it is notable that in this context the AMR genotype-phenotype correlates remain unclear, which requires further investigation to enable better understanding of genomic aspects of AMR in N. gonorrhoeae.
Collapse
Affiliation(s)
- S J van Hal
- Department of Infectious Diseases and Microbiology, NSW Health Pathology, Royal Prince Alfred Hospital, Sydney, NSW 2050, Australia
- Central Clinical School, University of Sydney, Sydney, NSW 2006, Australia
| | - D M Whiley
- UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia
- Pathology Queensland Central Laboratory, Queensland Health, Brisbane, Queensland, Australia
| | - T Le
- Department of Infectious Diseases and Microbiology, NSW Health Pathology, Royal Prince Alfred Hospital, Sydney, NSW 2050, Australia
| | - S Ray
- World Health Organization Collaborating Centre for STI and AMR, New South Wales Health Pathology Microbiology, The Prince of Wales Hospital, Randwick, New South Wales, Australia
| | - R L Kundu
- World Health Organization Collaborating Centre for STI and AMR, New South Wales Health Pathology Microbiology, The Prince of Wales Hospital, Randwick, New South Wales, Australia
| | - E Kerr
- Communicable Diseases Branch, Health Protection NSW, NSW Health, Sydney, Australia
| | - M M Lahra
- World Health Organization Collaborating Centre for STI and AMR, New South Wales Health Pathology Microbiology, The Prince of Wales Hospital, Randwick, New South Wales, Australia
- Faculty of Medicine, The University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
6
|
Carroll LM, Piacenza N, Cheng RA, Wiedmann M, Guldimann C. A multidrug-resistant Salmonella enterica Typhimurium DT104 complex lineage circulating among humans and cattle in the USA lost the ability to produce pertussis-like toxin ArtAB. Microb Genom 2023; 9:mgen001050. [PMID: 37402177 PMCID: PMC10438809 DOI: 10.1099/mgen.0.001050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 05/23/2023] [Indexed: 07/06/2023] Open
Abstract
Salmonella enterica subsp. enterica serotype Typhimurium definitive type 104 (DT104) can infect both humans and animals and is often multidrug-resistant (MDR). Previous studies have indicated that, unlike most S . Typhimurium, the overwhelming majority of DT104 strains produce pertussis-like toxin ArtAB via prophage-encoded genes artAB . However, DT104 that lack artAB have been described on occasion. Here, we identify an MDR DT104 complex lineage circulating among humans and cattle in the USA, which lacks artAB (i.e. the ‘U.S. artAB -negative major clade’; n =42 genomes). Unlike most other bovine- and human-associated DT104 complex strains from the USA (n =230 total genomes), which harbour artAB on prophage Gifsy-1 (n =177), members of the U.S. artAB -negative major clade lack Gifsy-1, as well as anti-inflammatory effector gogB . The U.S. artAB -negative major clade encompasses human- and cattle-associated strains isolated from ≥11 USA states over a 20-year period. The clade was predicted to have lost artAB , Gifsy-1 and gogB circa 1985–1987 (95 % highest posterior density interval 1979.0–1992.1). When compared to DT104 genomes from other regions of the world (n =752 total genomes), several additional, sporadic artAB , Gifsy-1 and/or gogB loss events among clades encompassing five or fewer genomes were observed. Using phenotypic assays that simulate conditions encountered during human and/or bovine digestion, members of the U.S. artAB -negative major clade did not differ from closely related Gifsy-1/artAB /gogB -harbouring U.S. DT104 complex strains (ANOVA raw P >0.05); thus, future research is needed to elucidate the roles that artAB , gogB and Gifsy-1 play in DT104 virulence in humans and animals.
Collapse
Affiliation(s)
- Laura M. Carroll
- Department of Clinical Microbiology, SciLifeLab, Umeå University, Umeå, Sweden
- Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå University, Umeå, Sweden
- Umeå Centre for Microbial Research, Umeå University, Umeå, Sweden
- Integrated Science Lab, Umeå University, Umeå, Sweden
| | - Nicolo Piacenza
- Chair for Food Safety and Analytics, Ludwig-Maximillians-University Munich, Munich, Germany
| | - Rachel A. Cheng
- Department of Food Science and Technology, Virginia Tech, Blacksburg, VA, USA
| | - Martin Wiedmann
- Department of Food Science, Cornell University, Ithaca, NY, USA
| | - Claudia Guldimann
- Chair for Food Safety and Analytics, Ludwig-Maximillians-University Munich, Munich, Germany
| |
Collapse
|
7
|
Tonkin-Hill G, Gladstone RA, Pöntinen AK, Arredondo-Alonso S, Bentley SD, Corander J. Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe. Genome Res 2023; 33:129-140. [PMID: 36669850 PMCID: PMC9977150 DOI: 10.1101/gr.277340.122] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 12/14/2022] [Indexed: 01/21/2023]
Abstract
Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on generalized linear regression that is robust to population structure, sampling bias, and errors in the predicted presence/absence of genes. We show using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analyzing several diverse bacterial genome data sets representing major human pathogens.
Collapse
Affiliation(s)
- Gerry Tonkin-Hill
- Department of Biostatistics, University of Oslo, 0372 Blindern, Norway;,Parasites and Microbes, Wellcome Sanger Institute, Cambridge CB10 1RQ, United Kingdom
| | | | - Anna K. Pöntinen
- Department of Biostatistics, University of Oslo, 0372 Blindern, Norway
| | - Sergio Arredondo-Alonso
- Department of Biostatistics, University of Oslo, 0372 Blindern, Norway;,Parasites and Microbes, Wellcome Sanger Institute, Cambridge CB10 1RQ, United Kingdom
| | - Stephen D. Bentley
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge CB10 1RQ, United Kingdom
| | - Jukka Corander
- Department of Biostatistics, University of Oslo, 0372 Blindern, Norway;,Parasites and Microbes, Wellcome Sanger Institute, Cambridge CB10 1RQ, United Kingdom;,Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, 00014 Helsinki, Finland
| |
Collapse
|
8
|
Evolution of the connectivity and indispensability of a transferable gene: the simplicity hypothesis. BMC Ecol Evol 2022; 22:140. [PMID: 36451084 PMCID: PMC9710062 DOI: 10.1186/s12862-022-02091-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 10/26/2022] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND The number of interactions between a transferable gene or its protein product and genes or gene products native to its microbial host is referred to as connectivity. Such interactions impact the tendency of the gene to be retained by evolution following horizontal gene transfer (HGT) into a microbial population. The complexity hypothesis posits that the protein product of a transferable gene with lower connectivity is more likely to function in a way that is beneficial to a new microbial host compared to the protein product of a transferable gene with higher connectivity. A gene with lower connectivity is consequently more likely to be fixed in any microbial population it enters by HGT. The more recently proposed simplicity hypothesis posits that the connectivity of a transferable gene might increase over time within any single microbial population due to gene-host coevolution, but that differential rates of colonization of microbial populations by HGT in accordance with differences in connectivity might act to counter this and even reduce connectivity over time, comprising an evolutionary trade-off. RESULTS We present a theoretical model that can be used to predict the conditions under which gene-host coevolution might increase or decrease the connectivity of a transferable gene over time. We show that the opportunity to enter new microbial populations by HGT can cause the connectivity of a transferable gene to evolve toward lower values, particularly in an environment that is unstable with respect to the function of the gene's protein product. We also show that a lack of such opportunity in a stable environment can cause the connectivity of a transferable gene to evolve toward higher values. CONCLUSION Our theoretical model suggests that the connectivity of a transferable gene can change over time toward higher values corresponding to a more sessile state of lower transferability or lower values corresponding to a more itinerant state of higher transferability, depending on the ecological milieu in which the gene exists. We note, however, that a better understanding of gene-host coevolutionary dynamics in natural microbial systems is required before any further conclusions about the veracity of the simplicity hypothesis can be drawn.
Collapse
|
9
|
Strains Associated with Two 2020 Welder Anthrax Cases in the United States Belong to Separate Lineages within Bacillus cereus sensu lato. Pathogens 2022; 11:pathogens11080856. [PMID: 36014977 PMCID: PMC9413466 DOI: 10.3390/pathogens11080856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 07/22/2022] [Accepted: 07/27/2022] [Indexed: 12/04/2022] Open
Abstract
Anthrax-causing members of Bacillus cereus sensu lato (s.l.) pose a serious threat to public health. While most anthrax-causing strains resemble B. anthracis phenotypically, rare cases of anthrax-like illness caused by strains resembling “B. cereus” have been reported. Here, whole-genome sequencing was used to characterize three B. cereus s.l. isolates associated with two 2020 welder anthrax cases in the United States, which resembled “B. cereus” phenotypically. Comparison of the three genomes sequenced here to all publicly available, high-quality B. cereus s.l. genomes (n = 2890 total genomes) demonstrated that genomes associated with each case effectively belonged to separate species at the conventional 95% average nucleotide identity prokaryotic species threshold. Two PubMLST sequence type 78 (ST78) genomes affiliated with a case in Louisiana were most closely related to B. tropicus and possessed genes encoding the Bps exopolysaccharide capsule, as well as hemolysin BL (Hbl) and cytotoxin K (CytK). Comparatively, a ST108 genome associated with a case in Texas was most closely related to B. anthracis; however, like other anthrax-causing strains most closely related to B. anthracis, this genome did not possess Bps-, Hbl-, or CytK-encoding genes. Overall, results presented here provide insights into the evolution of anthrax-causing B. cereus s.l.
Collapse
|
10
|
Shikov AE, Malovichko YV, Nizhnikov AA, Antonets KS. Current Methods for Recombination Detection in Bacteria. Int J Mol Sci 2022; 23:ijms23116257. [PMID: 35682936 PMCID: PMC9181119 DOI: 10.3390/ijms23116257] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 05/30/2022] [Accepted: 05/30/2022] [Indexed: 02/05/2023] Open
Abstract
The role of genetic exchanges, i.e., homologous recombination (HR) and horizontal gene transfer (HGT), in bacteria cannot be overestimated for it is a pivotal mechanism leading to their evolution and adaptation, thus, tracking the signs of recombination and HGT events is importance both for fundamental and applied science. To date, dozens of bioinformatics tools for revealing recombination signals are available, however, their pros and cons as well as the spectra of solvable tasks have not yet been systematically reviewed. Moreover, there are two major groups of software. One aims to infer evidence of HR, while the other only deals with horizontal gene transfer (HGT). However, despite seemingly different goals, all the methods use similar algorithmic approaches, and the processes are interconnected in terms of genomic evolution influencing each other. In this review, we propose a classification of novel instruments for both HR and HGT detection based on the genomic consequences of recombination. In this context, we summarize available methodologies paying particular attention to the type of traceable events for which a certain program has been designed.
Collapse
Affiliation(s)
- Anton E. Shikov
- Laboratory for Proteomics of Supra-Organismal Systems, All-Russia Research Institute for Agricultural Microbiology (ARRIAM), 196608 St. Petersburg, Russia; (A.E.S.); (Y.V.M.); (A.A.N.)
- Faculty of Biology, St. Petersburg State University (SPbSU), 199034 St. Petersburg, Russia
| | - Yury V. Malovichko
- Laboratory for Proteomics of Supra-Organismal Systems, All-Russia Research Institute for Agricultural Microbiology (ARRIAM), 196608 St. Petersburg, Russia; (A.E.S.); (Y.V.M.); (A.A.N.)
- Faculty of Biology, St. Petersburg State University (SPbSU), 199034 St. Petersburg, Russia
| | - Anton A. Nizhnikov
- Laboratory for Proteomics of Supra-Organismal Systems, All-Russia Research Institute for Agricultural Microbiology (ARRIAM), 196608 St. Petersburg, Russia; (A.E.S.); (Y.V.M.); (A.A.N.)
- Faculty of Biology, St. Petersburg State University (SPbSU), 199034 St. Petersburg, Russia
| | - Kirill S. Antonets
- Laboratory for Proteomics of Supra-Organismal Systems, All-Russia Research Institute for Agricultural Microbiology (ARRIAM), 196608 St. Petersburg, Russia; (A.E.S.); (Y.V.M.); (A.A.N.)
- Faculty of Biology, St. Petersburg State University (SPbSU), 199034 St. Petersburg, Russia
- Correspondence:
| |
Collapse
|
11
|
Fukunaga T, Iwasaki W. Mirage: estimation of ancestral gene-copy numbers by considering different evolutionary patterns among gene families. BIOINFORMATICS ADVANCES 2021; 1:vbab014. [PMID: 36700099 PMCID: PMC9710636 DOI: 10.1093/bioadv/vbab014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 07/22/2021] [Accepted: 07/28/2021] [Indexed: 01/28/2023]
Abstract
Motivation Reconstruction of gene copy number evolution is an essential approach for understanding how complex biological systems have been organized. Although various models have been proposed for gene copy number evolution, existing evolutionary models have not appropriately addressed the fact that different gene families can have very different gene gain/loss rates. Results In this study, we developed Mirage (MIxtuRe model for Ancestral Genome Estimation), which allows different gene families to have flexible gene gain/loss rates. Mirage can use three models for formulating heterogeneous evolution among gene families: the discretized Γ model, probability distribution-free model and pattern mixture (PM) model. Simulation analysis showed that Mirage can accurately estimate heterogeneous gene gain/loss rates and reconstruct gene-content evolutionary history. Application to empirical datasets demonstrated that the PM model fits genome data from various taxonomic groups better than the other heterogeneous models. Using Mirage, we revealed that metabolic function-related gene families displayed frequent gene gains and losses in all taxa investigated. Availability and implementation The source code of Mirage is freely available at https://github.com/fukunagatsu/Mirage. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Tokyo 1690051, Japan,Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo, Tokyo 1130032, Japan,To whom correspondence should be addressed. or
| | - Wataru Iwasaki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 2770882, Japan,Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo 1130032, Japan,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 2770882, Japan,Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba 2770882, Japan,Institute for Quantitative Biosciences, The University of Tokyo, Tokyo 1130032, Japan,Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Tokyo 1130032, Japan,To whom correspondence should be addressed. or
| |
Collapse
|
12
|
DeSalle R, Riley M. Should Networks Supplant Tree Building? Microorganisms 2020; 8:E1179. [PMID: 32756444 PMCID: PMC7466111 DOI: 10.3390/microorganisms8081179] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Revised: 07/21/2020] [Accepted: 07/29/2020] [Indexed: 12/15/2022] Open
Abstract
Recent studies suggested that network methods should supplant tree building as the basis of genealogical analysis. This proposition is based upon two arguments. First is the observation that bacterial and archaeal lineages experience processes oppositional to bifurcation and hence the representation of the evolutionary process in a tree like structure is illogical. Second is the argument tree building approaches are circular-you ask for a tree and you get one, which pins a verificationist label on tree building that, if correct, should be the end of phylogenetic analysis as we currently know it. In this review, we examine these questions and suggest that rumors of the death of the bacterial tree of life are exaggerated at best.
Collapse
Affiliation(s)
- Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA;
| | - Margaret Riley
- Department of Biology, University of Massachusetts Amherst, 116 North Pleasant Street, Amherst, MA 01003, USA
| |
Collapse
|
13
|
Abe T, Akazawa Y, Toyoda A, Niki H, Baba T. Batch-Learning Self-Organizing Map Identifies Horizontal Gene Transfer Candidates and Their Origins in Entire Genomes. Front Microbiol 2020; 11:1486. [PMID: 32719664 PMCID: PMC7350273 DOI: 10.3389/fmicb.2020.01486] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Accepted: 06/08/2020] [Indexed: 02/05/2023] Open
Abstract
Horizontal gene transfer (HGT) has been widely suggested to play a critical role in the environmental adaptation of microbes; however, the number and origin of the genes in microbial genomes obtained through HGT remain unknown as the frequency of detected HGT events is generally underestimated, particularly in the absence of information on donor sequences. As an alternative to phylogeny-based methods that rely on sequence alignments, we have developed an alignment-free clustering method on the basis of an unsupervised neural network “Batch-Learning Self-Organizing Map (BLSOM)” in which sequence fragments are clustered based solely on oligonucleotide similarity without taxonomical information, to detect HGT candidates and their origin in entire genomes. By mapping the microbial genomic sequences on large-scale BLSOMs constructed with nearly all prokaryotic genomes, HGT candidates can be identified, and their origin assigned comprehensively, even for microbial genomes that exhibit high novelty. By focusing on two types of Alphaproteobacteria, specifically psychrotolerant Sphingomonas strains from an Antarctic lake, we detected HGT candidates using BLSOM and found higher proportions of HGT candidates from organisms belonging to Betaproteobacteria in the genomes of these two Antarctic strains compared with those of continental strains. Further, an origin difference was noted in the HGT candidates found in the two Antarctic strains. Although their origins were highly diversified, gene functions related to the cell wall or membrane biogenesis were shared among the HGT candidates. Moreover, analyses of amino acid frequency suggested that housekeeping genes and some HGT candidates of the Antarctic strains exhibited different characteristics to other continental strains. Lys, Ser, Thr, and Val were the amino acids found to be increased in the Antarctic strains, whereas Ala, Arg, Glu, and Leu were decreased. Our findings strongly suggest a low-temperature adaptation process for microbes that may have arisen convergently as an independent evolutionary strategy in each Antarctic strain. Hence, BLSOM analysis could serve as a powerful tool in not only detecting HGT candidates and their origins in entire genomes, but also in providing novel perspectives into the environmental adaptations of microbes.
Collapse
Affiliation(s)
- Takashi Abe
- Department of Information Engineering, Faculty of Engineering, Niigata University, Niigata, Japan
| | - Yu Akazawa
- Department of Information Engineering, Faculty of Engineering, Niigata University, Niigata, Japan
| | - Atsushi Toyoda
- Comparative Genomics Laboratory, National Institute of Genetics, Mishima, Japan.,Advanced Genomics Center, National Institute of Genetics, Mishima, Japan
| | - Hironori Niki
- Microbial Physiology Laboratory, National Institute of Genetics, Mishima, Japan
| | - Tomoya Baba
- Advanced Genomics Center, National Institute of Genetics, Mishima, Japan.,Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Tokyo, Japan
| |
Collapse
|
14
|
Tonkin-Hill G, MacAlasdair N, Ruis C, Weimann A, Horesh G, Lees JA, Gladstone RA, Lo S, Beaudoin C, Floto RA, Frost SDW, Corander J, Bentley SD, Parkhill J. Producing polished prokaryotic pangenomes with the Panaroo pipeline. Genome Biol 2020; 21:180. [PMID: 32698896 PMCID: PMC7376924 DOI: 10.1186/s13059-020-02090-4] [Citation(s) in RCA: 537] [Impact Index Per Article: 107.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 07/02/2020] [Indexed: 02/03/2023] Open
Abstract
Population-level comparisons of prokaryotic genomes must take into account the substantial differences in gene content resulting from horizontal gene transfer, gene duplication and gene loss. However, the automated annotation of prokaryotic genomes is imperfect, and errors due to fragmented assemblies, contamination, diverse gene families and mis-assemblies accumulate over the population, leading to profound consequences when analysing the set of all genes found in a species. Here, we introduce Panaroo, a graph-based pangenome clustering tool that is able to account for many of the sources of error introduced during the annotation of prokaryotic genome assemblies. Panaroo is available at https://github.com/gtonkinhill/panaroo .
Collapse
Affiliation(s)
- Gerry Tonkin-Hill
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK. .,Department of Biostatistics, University of Oslo, Blindern, 0317, Norway.
| | - Neil MacAlasdair
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK.,Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - Christopher Ruis
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK.,Molecular Immunity Unit, Department of Medicine, University of Cambridge, Cambridge, UK.,Medical Research Council (MRC)-Laboratory of Molecular Biology, Cambridge, UK
| | - Aaron Weimann
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK.,Molecular Immunity Unit, Department of Medicine, University of Cambridge, Cambridge, UK.,Medical Research Council (MRC)-Laboratory of Molecular Biology, Cambridge, UK.,European Bioinformatics Institute, Cambridge, UK
| | - Gal Horesh
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
| | - John A Lees
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, Imperial College London, London, W2 1PG, UK
| | | | - Stephanie Lo
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
| | | | - R Andres Floto
- Molecular Immunity Unit, Department of Medicine, University of Cambridge, Cambridge, UK.,Cambridge Centre for Lung Infection, Royal Papworth Hospital, Cambridge, CB23 3RE, UK
| | - Simon D W Frost
- Microsoft Research, Redmond, 98052, WA, USA.,London School of Hygiene & Tropical Medicine, London, UK
| | - Jukka Corander
- Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK.,Department of Biostatistics, University of Oslo, Blindern, 0317, Norway.,Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, 00014, Finland
| | | | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| |
Collapse
|
15
|
Rycroft T, Hamilton K, Haas CN, Linkov I. A quantitative risk assessment method for synthetic biology products in the environment. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 696:133940. [PMID: 31446290 DOI: 10.1016/j.scitotenv.2019.133940] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Revised: 08/13/2019] [Accepted: 08/14/2019] [Indexed: 06/10/2023]
Abstract
The need to prevent possible adverse environmental health impacts resulting from synthetic biology (SynBio) products is widely acknowledged in both the SynBio risk literature and the global regulatory community. To-date, however, discussions of potential risks of SynBio products have been largely speculative, and the limited attempts to characterize the risks of SynBio products have been non-uniform and entirely qualitative. As the SynBio discipline continues to accelerate and bring forth novel, highly-engineered life forms, a standardized risk assessment framework will become critical for ensuring that the environmental risks of these products are characterized in a consistent, reliable, and objective manner that incorporates all SynBio-unique risk factors. In their current forms, established risk assessment frameworks - including those that address traditional genetically modified organisms - fall short of the features required of this standard framework. To address this gap, we propose the Quantitative Risk Assessment Method for Synthetic Biology Products (QRA-SynBio) - an incremental build on established risk assessment methodologies that supplements traditional paradigms with the SynBio risk factors that are currently absent, and necessitates quantitative analysis for more transparent and objective risk characterizations. We demonstrate through a hypothetical case study that the proposed framework facilitates defensible quantification of the environmental risks of SynBio products in both foreseeable and hypothetical use scenarios. Additionally, we show how the quantitative nature of the proposed method can promote increased experimental investigation into the true likelihood of hazard and exposure parameters and highlight the most sensitive parameters where uncertainty should be reduced, ultimately leading to more targeted SynBio risk research and yielding more precise characterizations of risk.
Collapse
Affiliation(s)
- Taylor Rycroft
- Environmental Laboratory, U.S. Army Engineer Research and Development Center, Concord, MA, USA.
| | - Kerry Hamilton
- School for Sustainable Engineering and the Built Environment & The Biodesign Institute Center for Environmental Health Engineering, Arizona State University, Tempe, AZ, USA
| | - Charles N Haas
- Department of Civil, Architectural and Environmental Engineering, Drexel University, Philadelphia, PA, USA
| | - Igor Linkov
- Environmental Laboratory, U.S. Army Engineer Research and Development Center, Concord, MA, USA
| |
Collapse
|
16
|
Zeng Q, Liao C, Terhune J, Wang L. Impacts of florfenicol on the microbiota landscape and resistome as revealed by metagenomic analysis. MICROBIOME 2019; 7:155. [PMID: 31818316 PMCID: PMC6902485 DOI: 10.1186/s40168-019-0773-8] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 12/02/2019] [Indexed: 05/26/2023]
Abstract
BACKGROUND Drug-resistant fish pathogens can cause significant economic loss to fish farmers. Since 2012, florfenicol has become an approved drug for treating both septicemia and columnaris diseases in freshwater fish. Due to the limited drug options available for aquaculture, the impact of the therapeutical florfenicol treatment on the microbiota landscape as well as the resistome present in the aquaculture farm environment needs to be evaluated. RESULTS Time-series metagenomic analyses were conducted to the aquatic microbiota present in the tank-based catfish production systems, in which catfish received standard therapeutic 10-day florfenicol treatment following the federal veterinary regulations. Results showed that the florfenicol treatment shifted the structure of the microbiota and reduced the biodiversity of it by acting as a strong stressor. Planctomycetes, Chloroflexi, and 13 other phyla were susceptible to the florfenicol treatment and their abundance was inhibited by the treatment. In contrast, the abundance of several bacteria belonging to the Proteobacteria, Bacteroidetes, Actinobacteria, and Verrucomicrobia phyla increased. These bacteria with increased abundance either harbor florfenicol-resistant genes (FRGs) or had beneficial mutations. The florfenicol treatment promoted the proliferation of florfenicol-resistant genes. The copy number of phenicol-specific resistance genes as well as multiple classes of antibiotic-resistant genes (ARGs) exhibited strong correlations across different genetic exchange communities (p < 0.05), indicating the horizontal transfer of florfenicol-resistant genes among these bacterial species or genera. Florfenicol treatment also induced mutation-driven resistance. Significant changes in single-nucleotide polymorphism (SNP) allele frequencies were observed in membrane transporters, genes involved in recombination, and in genes with primary functions of a resistance phenotype. CONCLUSIONS The therapeutical level of florfenicol treatment significantly altered the microbiome and resistome present in catfish tanks. Both intra-population and inter-population horizontal ARG transfer was observed, with the intra-population transfer being more common. The oxazolidinone/phenicol-resistant gene optrA was the most prevalent transferred ARG. In addition to horizontal gene transfer, bacteria could also acquire florfenicol resistance by regulating the innate efflux systems via mutations. The observations made by this study are of great importance for guiding the strategic use of florfenicol, thus preventing the formation, persistence, and spreading of florfenicol-resistant bacteria and resistance genes in aquaculture.
Collapse
Affiliation(s)
- Qifan Zeng
- Department of Animal Sciences, Auburn University, Auburn, AL, 36830, USA
- Ministry of Education Key Laboratory of Marine Genetics and Breeding, College of Marine Science, Ocean University of China, Qingdao, 266003, Shandong, China
| | - Chao Liao
- Department of Animal Sciences, Auburn University, Auburn, AL, 36830, USA
- Department of Food Science and Technology, University of California Davis, Davis, CA, 95616, USA
| | - Jeffery Terhune
- Department of Fisheries and Allied Aquacultures, 203 Swingle Hall, Auburn University, Auburn, AL, 36849, USA
| | - Luxin Wang
- Department of Animal Sciences, Auburn University, Auburn, AL, 36830, USA.
- Department of Food Science and Technology, University of California Davis, Davis, CA, 95616, USA.
| |
Collapse
|
17
|
Ding W, Baumdicker F, Neher RA. panX: pan-genome analysis and exploration. Nucleic Acids Res 2019; 46:e5. [PMID: 29077859 PMCID: PMC5758898 DOI: 10.1093/nar/gkx977] [Citation(s) in RCA: 167] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 10/10/2017] [Indexed: 11/24/2022] Open
Abstract
Horizontal transfer, gene loss, and duplication result in dynamic bacterial genomes shaped by a complex mixture of different modes of evolution. Closely related strains can differ in the presence or absence of many genes, and the total number of distinct genes found in a set of related isolates—the pan-genome—is often many times larger than the genome of individual isolates. We have developed a pipeline that efficiently identifies orthologous gene clusters in the pan-genome. This pipeline is coupled to a powerful yet easy-to-use web-based visualization for interactive exploration of the pan-genome. The visualization consists of connected components that allow rapid filtering and searching of genes and inspection of their evolutionary history. For each gene cluster, panX displays an alignment, a phylogenetic tree, maps mutations within that cluster to the branches of the tree and infers gain and loss of genes on the core-genome phylogeny. PanX is available at pangenome.de. Custom pan-genomes can be visualized either using a web server or by serving panX locally as a browser-based application.
Collapse
Affiliation(s)
- Wei Ding
- Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Franz Baumdicker
- Mathematisches Institut, Albert-Ludwigs University of Freiburg, 79104 Freiburg, Germany
| | - Richard A Neher
- Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany.,Biozentrum and SIB Swiss Institute of Bioinformatics, University of Basel, 4056 Basel, Switzerland
| |
Collapse
|
18
|
Pett W, Adamski M, Adamska M, Francis WR, Eitel M, Pisani D, Wörheide G. The Role of Homology and Orthology in the Phylogenomic Analysis of Metazoan Gene Content. Mol Biol Evol 2019; 36:643-649. [PMID: 30690573 DOI: 10.1093/molbev/msz013] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Resolving the relationships of animals (Metazoa) is crucial to our understanding of the origin of key traits such as muscles, guts, and nerves. However, a broadly accepted metazoan consensus phylogeny has yet to emerge. In part, this is because the genomes of deeply diverging and fast-evolving lineages may undergo significant gene turnover, reducing the number of orthologs shared with related phyla. This can limit the usefulness of traditional phylogenetic methods that rely on alignments of orthologous sequences. Phylogenetic analysis of gene content has the potential to circumvent this orthology requirement, with binary presence/absence of homologous gene families representing a source of phylogenetically informative characters. Applying binary substitution models to the gene content of 26 complete animal genomes, we demonstrate that patterns of gene conservation differ markedly depending on whether gene families are defined by orthology or homology, that is, whether paralogs are excluded or included. We conclude that the placement of some deeply diverging lineages may exceed the limit of resolution afforded by the current methods based on comparisons of orthologous protein sequences, and novel approaches are required to fully capture the evolutionary signal from genes within genomes.
Collapse
Affiliation(s)
- Walker Pett
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA
| | - Marcin Adamski
- Computational Biology and Bioinformatics Unit, Research School of Biology, The Australian National University, Canberra, Australia
| | - Maja Adamska
- Computational Biology and Bioinformatics Unit, Research School of Biology, The Australian National University, Canberra, Australia
| | - Warren R Francis
- Department of Earth & Environmental Sciences & GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Michael Eitel
- Department of Earth & Environmental Sciences & GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Davide Pisani
- School of Earth Sciences, University of Bristol, Bristol, United Kingdom.,School of Biological Sciences, University of Bristol, Bristol, United Kingdom
| | - Gert Wörheide
- Department of Earth & Environmental Sciences & GeoBio-Center, Ludwig-Maximilians-Universität München, Munich, Germany.,SNSB-Bayerische Staatssammlung für Paläontologie und Geologie, München, Germany
| |
Collapse
|
19
|
Harish A. What is an archaeon and are the Archaea really unique? PeerJ 2018; 6:e5770. [PMID: 30357005 PMCID: PMC6196074 DOI: 10.7717/peerj.5770] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 09/05/2018] [Indexed: 12/05/2022] Open
Abstract
The recognition of the group Archaea as a major branch of the tree of life (ToL) prompted a new view of the evolution of biodiversity. The genomic representation of archaeal biodiversity has since significantly increased. In addition, advances in phylogenetic modeling of multi-locus datasets have resolved many recalcitrant branches of the ToL. Despite the technical advances and an expanded taxonomic representation, two important aspects of the origins and evolution of the Archaea remain controversial, even as we celebrate the 40th anniversary of the monumental discovery. These issues concern (i) the uniqueness (monophyly) of the Archaea, and (ii) the evolutionary relationships of the Archaea to the Bacteria and the Eukarya; both of these are relevant to the deep structure of the ToL. To explore the causes for this persistent ambiguity, I examine multiple datasets and different phylogenetic approaches that support contradicting conclusions. I find that the uncertainty is primarily due to a scarcity of information in standard datasets-universal core-genes datasets-to reliably resolve the conflicts. These conflicts can be resolved efficiently by comparing patterns of variation in the distribution of functional genomic signatures, which are less diffused unlike patterns of primary sequence variation. Relatively lower heterogeneity in distribution patterns minimizes uncertainties and supports statistically robust phylogenetic inferences, especially of the earliest divergences of life. This case study further highlights the limitations of primary sequence data in resolving difficult phylogenetic problems, and raises questions about evolutionary inferences drawn from the analyses of sequence alignments of a small set of core genes. In particular, the findings of this study corroborate the growing consensus that reversible substitution mutations may not be optimal phylogenetic markers for resolving early divergences in the ToL, nor for determining the polarity of evolutionary transitions across the ToL.
Collapse
Affiliation(s)
- Ajith Harish
- Department of Cell and Molecular Biology, Program in Molecular Biology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
20
|
Harish A, Kurland CG. Akaryotes and Eukaryotes are independent descendants of a universal common ancestor. Biochimie 2017; 138:168-183. [PMID: 28461155 DOI: 10.1016/j.biochi.2017.04.013] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 04/25/2017] [Indexed: 11/29/2022]
Abstract
We reconstructed a global tree of life (ToL) with non-reversible and non-stationary models of genome evolution that root trees intrinsically. We implemented Bayesian model selection tests and compared the statistical support for four conflicting ToL hypotheses. We show that reconstructions obtained with a Bayesian implementation (Klopfstein et al., 2015) are consistent with reconstructions obtained with an empirical Sankoff parsimony (ESP) implementation (Harish et al., 2013). Both are based on the genome contents of coding sequences for protein domains (superfamilies) from hundreds of genomes. Thus, we conclude that the independent descent of Eukaryotes and Akaryotes (archaea and bacteria) from the universal common ancestor (UCA) is the most probable as well as the most parsimonious hypothesis for the evolutionary origins of extant genomes. Reconstructions of ancestral proteomes by both Bayesian and ESP methods suggest that at least 70% of unique domain-superfamilies known in extant species were present in the UCA. In addition, identification of a vast majority (96%) of the mitochondrial superfamilies in the UCA proteome precludes a symbiotic hypothesis for the origin of eukaryotes. Accordingly, neither the archaeal origin of eukaryotes nor the bacterial origin of mitochondria is supported by the data. The proteomic complexity of the UCA suggests that the evolution of cellular phenotypes in the two primordial lineages, Akaryotes and Eukaryotes, was driven largely by duplication of common superfamilies as well as by loss of unique superfamilies. Finally, innovation of novel superfamilies has played a surprisingly small role in the evolution of Akaryotes and only a marginal role in the evolution of Eukaryotes.
Collapse
Affiliation(s)
- Ajith Harish
- Department of Cell and Molecular Biology, Structural and Molecular Biology Program, Uppsala University, Uppsala, Sweden.
| | - Charles G Kurland
- Department of Biology, Microbial Ecology Program, Lund University, Lund, Sweden.
| |
Collapse
|