1
Beavan A, Domingo-Sananes MR, McInerney JO. Contingency, repeatability, and predictability in the evolution of a prokaryotic pangenome. Proc Natl Acad Sci U S A 2024; 121:e2304934120. [PMID: 38147560 PMCID: PMC10769857 DOI: 10.1073/pnas.2304934120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/27/2023] [Accepted: 11/05/2023] [Indexed: 12/28/2023] Open
Abstract
Pangenomes exhibit remarkable variability in many prokaryotic species, much of which is maintained through the processes of horizontal gene transfer and gene loss. Repeated acquisitions of near-identical homologs can easily be observed across pangenomes, leading to the question of whether these parallel events potentiate similar evolutionary trajectories, or whether the remarkably different genetic backgrounds of the recipients mean that postacquisition evolutionary trajectories end up being quite different. In this study, we present a machine learning method that predicts the presence or absence of genes in the Escherichia coli pangenome based on complex patterns of the presence or absence of other accessory genes within a genome. Our analysis leverages the repeated transfer of genes through the E. coli pangenome to observe patterns of repeated evolution following similar events. We find that the presence or absence of a substantial set of genes is highly predictable from other genes alone, indicating that selection potentiates and maintains gene-gene co-occurrence and avoidance relationships deterministically over long-term bacterial evolution and is robust to differences in host evolutionary history. We propose that at least part of the pangenome can be understood as a set of genes with relationships that govern their likely cohabitants, analogous to an ecosystem's set of interacting organisms. Our findings indicate that intragenomic gene fitness effects may be key drivers of prokaryotic evolution, influencing the repeated emergence of complex gene-gene relationships across the pangenome.
Affiliation(s)
- Alan Beavan
  - School of Life Sciences, The University of Nottingham, Nottingham NG7 2UH, United Kingdom
- Maria Rosa Domingo-Sananes
  - School of Life Sciences, The University of Nottingham, Nottingham NG7 2UH, United Kingdom
  - School of Science and Technology, Nottingham Trent University, Nottingham NG1 4FQ, United Kingdom
- James O. McInerney
  - School of Life Sciences, The University of Nottingham, Nottingham NG7 2UH, United Kingdom
2
Cummins EA, Hall RJ, Connor C, McInerney JO, McNally A. Distinct evolutionary trajectories in the Escherichia coli pangenome occur within sequence types. Microb Genom 2022; 8:mgen000903. [PMID: 36748558 PMCID: PMC9836092 DOI: 10.1099/mgen.0.000903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/24/2022] Open
Abstract
The Escherichia coli species contains a diverse set of sequence types and there remain important questions regarding differences in genetic content within this population that need to be addressed. Pangenomes are useful vehicles for studying gene content within sequence types. Here, we analyse 21 E. coli sequence type pangenomes using comparative pangenomics to identify variance in both pangenome structure and content. We present functional breakdowns of sequence type core genomes and identify sequence types that are enriched in metabolism, transcription and cell membrane biogenesis genes. We also uncover metabolism genes that have variable core classification, depending on which allele is present. Our comparative pangenomics approach allows for detailed exploration of sequence type pangenomes within the context of the species. We show that ongoing gene gain and loss in the E. coli pangenome is sequence type-specific, which may be a consequence of distinct sequence type-specific evolutionary drivers.
Affiliation(s)
- Elizabeth A. Cummins
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
- Rebecca J. Hall
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
- Chris Connor
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
  - Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne 3000, Australia
- James O. McInerney
  - School of Life Sciences, University of Nottingham, Nottingham, NG7 2UH, UK
- Alan McNally
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
  - Correspondence: Alan McNally
3
Abstract
Understanding adaptation to the local environment is a central tenet and a major focus of evolutionary biology. But this is only part of the adaptionist story. In addition to the external environment, one of the main drivers of genome composition is genetic background. In this perspective, I argue that there is a growing body of evidence that intra-genomic selective pressures play a significant part in the composition of prokaryotic genomes and play a significant role in the origin, maintenance and structuring of prokaryotic pangenomes.
4
Affiliation(s)
- Alan J S Beavan
  - School of Life Sciences, University of Nottingham, Nottingham, UK.
5
Cummins EA, Hall RJ, McInerney JO, McNally A. Prokaryote pangenomes are dynamic entities. Curr Opin Microbiol 2022; 66:73-78. [PMID: 35104691 DOI: 10.1016/j.mib.2022.01.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 11/25/2021] [Revised: 01/07/2022] [Accepted: 01/11/2022] [Indexed: 11/24/2022]
Abstract
Prokaryote pangenomes are influenced heavily by environmental factors and the opportunity for gene gain and loss events. As the field of pangenome analysis has expanded, so has the need to fully understand the complexity of how eco-evolutionary dynamics shape pangenomes. Here, we describe current models of pangenome evolution and discuss their suitability and accuracy. We suggest that pangenomes are dynamic entities under constant flux, highlighting the influence of two-way interactions between pangenome and environment. New classifications of core and accessory genes are also considered, underscoring the need for continuous evaluation of nomenclature in a fast-moving field. We conclude that future models of pangenome evolution should incorporate eco-evolutionary dynamics to fully encompass their dynamic, changeable nature.
Affiliation(s)
- Elizabeth A Cummins
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
- Rebecca J Hall
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK.
- James O McInerney
  - School of Life Sciences, University of Nottingham, Nottingham, NG7 2UH, UK
- Alan McNally
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK.
6
Hall RJ, Whelan FJ, Cummins EA, Connor C, McNally A, McInerney JO. Gene-gene relationships in an Escherichia coli accessory genome are linked to function and mobility. Microb Genom 2021; 7. [PMID: 34499026 PMCID: PMC8715431 DOI: 10.1099/mgen.0.000650] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 12/31/2022] Open
Abstract
The pangenome contains all genes encoded by a species, with the core genome present in all strains and the accessory genome in only a subset. Coincident gene relationships are expected within the accessory genome, where the presence or absence of one gene is influenced by the presence or absence of another. Here, we analysed the accessory genome of an Escherichia coli pangenome consisting of 400 genomes from 20 sequence types to identify genes that display significant co-occurrence or avoidance patterns with one another. We present a complex network of genes that are either found together or that avoid one another more often than would be expected by chance, and show that these relationships vary by lineage. We demonstrate that genes co-occur by function, and that several highly connected gene relationships are linked to mobile genetic elements. We find that genes are more likely to co-occur with, rather than avoid, another gene in the accessory genome. This work furthers our understanding of the dynamic nature of prokaryote pangenomes and implicates both function and mobility as drivers of gene relationships.
Affiliation(s)
- Rebecca J Hall
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
  - School of Life Sciences, University of Nottingham, Nottingham, NG7 2UH, UK
- Fiona J Whelan
  - School of Life Sciences, University of Nottingham, Nottingham, NG7 2UH, UK
- Elizabeth A Cummins
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
  - School of Life Sciences, University of Nottingham, Nottingham, NG7 2UH, UK
- Christopher Connor
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
- Alan McNally
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
- James O McInerney
  - School of Life Sciences, University of Nottingham, Nottingham, NG7 2UH, UK
7
Abstract
A pangenome is the complete set of genes (core and accessory) present in a phylogenetic clade. We hypothesize that a pangenome's accessory gene content is structured and maintained by selection. To test this hypothesis, we interrogated the genomes of 40 Pseudomonas species for statistically significant coincident (i.e., co-occurring/avoiding) gene patterns. We found that 86.7% of common accessory genes are involved in ≥1 coincident relationship. Further, genes that co-occur and/or avoid each other (but are not vertically inherited) are more likely to share functional categories, are more likely to be simultaneously transcribed, and are more likely to produce interacting proteins, than would be expected by chance. These results are not due to coincident genes being adjacent to one another on the chromosome. Together, these findings suggest that the accessory genome is structured into sets of genes that function together within a given strain. Given the similarity of the Pseudomonas pangenome with open pangenomes of other prokaryotic species, we speculate that these results are generalizable.
Affiliation(s)
- Fiona J Whelan
  - School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
- Rebecca J Hall
  - School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
  - Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
- James O McInerney
  - School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
8
Domingo-Sananes MR, McInerney JO. Mechanisms That Shape Microbial Pangenomes. Trends Microbiol 2021; 29:493-503. [PMID: 33423895 DOI: 10.1016/j.tim.2020.12.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 10/12/2020] [Revised: 12/09/2020] [Accepted: 12/10/2020] [Indexed: 01/02/2023]
Abstract
Analyses of multiple whole-genome sequences from the same species have revealed that differences in gene content can be substantial, particularly in prokaryotes. Such variation has led to the recognition of pangenomes, the complete set of genes present in a species - consisting of core genes, present in all individuals, and accessory genes whose presence is variable. Questions now arise about how pangenomes originate and evolve. We describe how gene content variation can arise as a result of the combination of several processes, including random drift, selection, gain/loss balance, and the influence of ecological and epistatic interactions. We believe that identifying the contributions of these processes to pangenomes will need novel theoretical approaches and empirical data.
Affiliation(s)
- Maria Rosa Domingo-Sananes
  - School of Life Sciences, University of Nottingham, Nottingham, UK
  - School of Science and Technology, Nottingham Trent University, Nottingham, UK.
9
Hall RJ, Whelan FJ, McInerney JO, Ou Y, Domingo-Sananes MR. Horizontal Gene Transfer as a Source of Conflict and Cooperation in Prokaryotes. Front Microbiol 2020; 11:1569. [PMID: 32849327 PMCID: PMC7396663 DOI: 10.3389/fmicb.2020.01569] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Received: 02/19/2020] [Accepted: 06/17/2020] [Indexed: 02/01/2023] Open
Abstract
Horizontal gene transfer (HGT) is one of the most important processes in prokaryote evolution. The sharing of DNA can spread neutral or beneficial genes, as well as genetic parasites across populations and communities, creating a large proportion of the variability acted on by natural selection. Here, we highlight the role of HGT in enhancing the opportunities for conflict and cooperation within and between prokaryote genomes. We discuss how horizontally acquired genes can cooperate or conflict both with each other and with a recipient genome, resulting in signature patterns of gene co-occurrence, avoidance, and dependence. We then describe how interactions involving horizontally transferred genes may influence cooperation and conflict at higher levels (populations, communities, and symbioses). Finally, we consider the benefits and drawbacks of HGT for prokaryotes and its fundamental role in understanding conflict and cooperation from the gene-gene to the microbiome level.
Affiliation(s)
- Rebecca J Hall
  - School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
- Fiona J Whelan
  - School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
- James O McInerney
  - School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, United Kingdom
- Yaqing Ou
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, United Kingdom
10
McCartney AM, Hyland EM, Cormican P, Moran RJ, Webb AE, Lee KD, Hernandez-Rodriguez J, Prado-Martinez J, Creevey CJ, Aspden JL, McInerney JO, Marques-Bonet T, O'Connell MJ. Gene Fusions Derived by Transcriptional Readthrough are Driven by Segmental Duplication in Human. Genome Biol Evol 2020; 11:2678-2690. [PMID: 31400206 PMCID: PMC6764479 DOI: 10.1093/gbe/evz163] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Accepted: 07/17/2019] [Indexed: 12/14/2022] Open
Abstract
Gene fusion occurs when two or more individual genes with independent open reading frames become juxtaposed under the same open reading frame, creating a new fused gene. A small number of gene fusions described in detail have been associated with novel functions, for example, the hominid-specific PIPSL gene, TNFSF12, and the TWE-PRIL gene family. We use Sequence Similarity Networks and species level comparisons of great ape genomes to identify 45 new genes that have emerged by transcriptional readthrough, that is, transcription-derived gene fusion. For 35 of these putative gene fusions, we have been able to assess available RNAseq data to determine whether there are reads that map to each breakpoint. A total of 29 of the putative gene fusions had annotated transcripts (9/29 of which are human-specific). We carried out RT-qPCR in a range of human tissues (placenta, lung, liver, brain, and testes) and found that 23 of the putative gene fusion events were expressed in at least one tissue. Examining the available ribosome foot-printing data, we find evidence for translation of three of the fused genes in human. Finally, we find enrichment for transcription-derived gene fusions in regions of known segmental duplication in human. Together, our results implicate chromosomal structural variation brought about by segmental duplication with the emergence of novel transcripts and translated protein products.
Affiliation(s)
- Ann M McCartney
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
  - Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
- Edel M Hyland
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
  - Institute for Global Food Security, Queens University Belfast, United Kingdom
- Paul Cormican
  - Teagasc Animal and Bioscience Research Department, Animal & Grassland Research and Innovation Centre, Teagasc, Grange, Dunsany, County Meath, Ireland
- Raymond J Moran
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
  - Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
- Andrew E Webb
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
- Kate D Lee
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
  - School of Biological Sciences, University of Auckland, New Zealand
  - School of Fundamental Sciences, Massey University, New Zealand
- Javier Prado-Martinez
  - Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
  - Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, United Kingdom
- Christopher J Creevey
  - Institute for Global Food Security, Queens University Belfast, United Kingdom
  - Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, United Kingdom
- Julie L Aspden
  - School of Molecular and Cellular Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
- James O McInerney
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, M13 9PL, United Kingdom
  - School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, NG7 2RD, United Kingdom
- Tomas Marques-Bonet
  - Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
  - Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain
  - NAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
  - Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallés, Barcelona, Spain
- Mary J O'Connell
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
  - Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
  - School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, NG7 2RD, United Kingdom
11
Ou Y, McInerney JO. Eukaryote Genes Are More Likely than Prokaryote Genes to Be Composites. Genes (Basel) 2019; 10:genes10090648. [PMID: 31466252 PMCID: PMC6769587 DOI: 10.3390/genes10090648] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 07/29/2019] [Revised: 08/18/2019] [Accepted: 08/23/2019] [Indexed: 12/27/2022] Open
Abstract
The formation of new genes by combining parts of existing genes is an important evolutionary process. Remodelled genes, which we call composites, have been investigated in many species; however, their distribution across all of life is still unknown. We set out to examine the extent to which genomes from cells and mobile genetic elements contain composite genes. We identify composite genes as those that show partial homology to at least two unrelated component genes. In order to identify composite and component genes, we constructed sequence similarity networks (SSNs) of more than one million genes from all three domains of life, as well as viruses and plasmids. We identified non-transitive triplets of nodes in this network and explored the homology relationships in these triplets to see if the middle nodes were indeed composite genes. In total, we identified 221,043 (18.57%) composite genes, which were distributed across all genomic and functional categories. In particular, the presence of composite genes is statistically more likely in eukaryotes than prokaryotes.
Affiliation(s)
- Yaqing Ou
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester M13 9PL, UK.
- James O McInerney
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester M13 9PL, UK.
  - School of Life Sciences, University of Nottingham, Nottingham NG7 2UH, UK.
12
Abstract
Extensive microbial gene flows affect how we understand virology, microbiology, medical sciences, genetic modification, and evolutionary biology. Phylogenies only provide a narrow view of these gene flows: plasmids and viruses, lacking core genes, cannot be attached to cellular life on phylogenetic trees. Yet viruses and plasmids have a major impact on cellular evolution, affecting both the gene content and the dynamics of microbial communities. Using bipartite graphs that connect up to 149,000 clusters of homologous genes with 8,217 related and unrelated genomes, we can in particular show patterns of gene sharing that do not map neatly with the organismal phylogeny. Homologous genes are recycled by lateral gene transfer, and multiple copies of homologous genes are carried by otherwise completely unrelated (and possibly nested) genomes, that is, viruses, plasmids and prokaryotes. When a homologous gene is present on at least one plasmid or virus and at least one chromosome, a process of "gene externalization," affected by a postprocessed selected functional bias, takes place, especially in Bacteria. Bipartite graphs give us a view of vertical and horizontal gene flow beyond classic taxonomy on a single very large, analytically tractable, graph that goes beyond the cellular Web of Life.
Affiliation(s)
- Eduardo Corel
  - Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
- Raphaël Méheust
  - Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
- Andrew K Watson
  - Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
- James O McInerney
  - Chair in Evolutionary Biology, The University of Manchester, United Kingdom
- Philippe Lopez
  - Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
- Eric Bapteste
  - Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
14
Abstract
BACKGROUND: Eukaryotes evolved from the symbiotic association of at least two prokaryotic partners, and a good deal is known about the timings, mechanisms, and dynamics of these evolutionary steps. Recently, it was shown that a new class of nuclear genes, symbiogenetic genes (S-genes), was formed concomitant with endosymbiosis and the subsequent evolution of eukaryotic photosynthetic lineages. Understanding their origins and contributions to eukaryogenesis would provide insights into the ways in which cellular complexity has evolved. RESULTS: Here, we show that chimeric nuclear genes (S-genes), built from prokaryotic domains, are critical for explaining the leap forward in cellular complexity achieved during eukaryogenesis. A total of 282 S-gene families contributed solutions to many of the challenges faced by early eukaryotes, including enhancing the informational machinery, processing spliceosomal introns, tackling genotoxicity within the cell, and ensuring functional protein interactions in a larger, more compartmentalized cell. For hundreds of S-genes, we confirmed the origins of their components (bacterial, archaeal, or generally prokaryotic) by maximum likelihood phylogenies. Remarkably, Bacteria contributed nine-fold more S-genes than Archaea, including a two-fold greater contribution to informational functions. Therefore, there is an additional, large bacterial contribution to the evolution of eukaryotes, implying that fundamental eukaryotic properties do not strictly follow the traditional informational/operational divide for archaeal/bacterial contributions to eukaryogenesis. CONCLUSION: This study demonstrates the extent and process through which prokaryotic fragments from bacterial and archaeal genes inherited during eukaryogenesis underlie the creation of novel chimeric genes with important functions.
Affiliation(s)
- Raphaël Méheust
  - Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France
- Debashish Bhattacharya
  - Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08901, USA
- Jananan S Pathmanathan
  - Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France
- James O McInerney
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, M13 9PL, Manchester, UK
- Philippe Lopez
  - Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France
- Eric Bapteste
  - Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France.
15
Abstract
Biological public goods are broadly shared within an ecosystem and readily available. They appear to be widespread and may have played important roles in the history of life on Earth. Of particular importance to events in the early history of life are the roles of public goods in the merging of genomes, protein domains and even cells. We suggest that public goods facilitated the origin of the eukaryotic cell, a classic major evolutionary transition. The recognition of genomic public goods challenges advocates of a direct graph view of phylogeny, and those who deny that any useful phylogenetic signal persists in modern genomes. Ecological spillovers generate public goods that provide new ecological opportunities. This article is part of the themed issue 'Reconceptualizing the origins of life'.
Affiliation(s)
- James O McInerney
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PL, UK
- Douglas H Erwin
  - Department of Paleobiology, MRC-121, Smithsonian Institution, Washington, DC, USA
17
Bruns H, Crüsemann M, Letzel AC, Alanjary M, McInerney JO, Jensen PR, Schulz S, Moore BS, Ziemert N. Function-related replacement of bacterial siderophore pathways. ISME J 2017; 12:320-329. [PMID: 28809850 PMCID: PMC5776446 DOI: 10.1038/ismej.2017.137] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Received: 03/10/2017] [Revised: 06/13/2017] [Accepted: 07/14/2017] [Indexed: 01/16/2023]
Abstract
Bacterial genomes are rife with orphan biosynthetic gene clusters (BGCs) associated with secondary metabolism of unrealized natural product molecules. Often up to a tenth of the genome is predicted to code for the biosynthesis of diverse metabolites with mostly unknown structures and functions. This phenomenal diversity of BGCs, coupled with their high rates of horizontal transfer, raises questions about whether they are really active and beneficial, whether they are neutral and confer no advantage, or whether they are carried in genomes because they are parasitic or addictive. We previously reported that Salinispora bacteria broadly use the desferrioxamine family of siderophores for iron acquisition. Herein we describe a new and unrelated group of peptidic siderophores called salinichelins from a restricted number of Salinispora strains in which the desferrioxamine biosynthesis genes have been lost. We have reconstructed the evolutionary history of these two different siderophore families and show that the acquisition and retention of the new salinichelin siderophores co-occurs with the loss of the more ancient desferrioxamine pathway. This identical event occurred at least three times independently during the evolution of the genus. We surmise that certain BGCs may be extraneous because of their functional redundancy and demonstrate that the relative evolutionary pace of natural pathway replacement shows high selective pressure against retention of functionally superfluous gene clusters.
Affiliation(s)
- Hilke Bruns
  - Institute of Organic Chemistry, Technische Universität Braunschweig, Braunschweig, Germany
  - Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Max Crüsemann
  - Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Anne-Catrin Letzel
  - Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Mohammad Alanjary
  - German Center for Infection Biology (DZIF), Interfaculty Institute for Microbiology and Infection Medicine Tübingen (IMIT), University of Tübingen, Tübingen, Germany
- James O McInerney
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine, Health and Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK
- Paul R Jensen
  - Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Stefan Schulz
  - Institute of Organic Chemistry, Technische Universität Braunschweig, Braunschweig, Germany
- Bradley S Moore
  - Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
  - Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Nadine Ziemert
  - German Center for Infection Biology (DZIF), Interfaculty Institute for Microbiology and Infection Medicine Tübingen (IMIT), University of Tübingen, Tübingen, Germany
|
18
|
McInerney JO. Horizontal gene transfer is less frequent in eukaryotes than prokaryotes but can be important (retrospective on DOI 10.1002/bies.201300095). Bioessays 2017; 39. [PMID: 28418075 DOI: 10.1002/bies.201700002] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Indexed: 11/07/2022]
Affiliation(s)
- James O McInerney
  - Faculty of Biology, Division of Evolution and Genomic Sciences, School of Biological Sciences, Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
|
19
|
Affiliation(s)
- James O McInerney
  - Division of Evolution and Genomic Sciences, School of Biological Sciences, and Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PL, UK
- Mary J O'Connell
  - Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, UK
|
20
|
McInerney JO. Society for Molecular Biology and Evolution, Council and Business Meetings, 2016, Gold Coast Australia. Mol Biol Evol 2017; 34:243-244. [PMID: 28039386 DOI: 10.1093/molbev/msw276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/13/2022] Open
|
21
|
McNally A, Oren Y, Kelly D, Pascoe B, Dunn S, Sreecharan T, Vehkala M, Välimäki N, Prentice MB, Ashour A, Avram O, Pupko T, Dobrindt U, Literak I, Guenther S, Schaufler K, Wieler LH, Zhiyong Z, Sheppard SK, McInerney JO, Corander J. Combined Analysis of Variation in Core, Accessory and Regulatory Genome Regions Provides a Super-Resolution View into the Evolution of Bacterial Populations. PLoS Genet 2016; 12:e1006280. [PMID: 27618184 PMCID: PMC5019451 DOI: 10.1371/journal.pgen.1006280] [Citation(s) in RCA: 125] [Impact Index Per Article: 15.6] [Received: 04/15/2016] [Accepted: 08/04/2016] [Indexed: 02/05/2023] Open
Abstract
The use of whole-genome phylogenetic analysis has revolutionized our understanding of the evolution and spread of many important bacterial pathogens due to the high resolution view it provides. However, the majority of such analyses do not consider the potential role of accessory genes when inferring evolutionary trajectories. Moreover, the recently discovered importance of the switching of gene regulatory elements suggests that an exhaustive analysis, combining information from core and accessory genes with regulatory elements could provide unparalleled detail of the evolution of a bacterial population. Here we demonstrate this principle by applying it to a worldwide multi-host sample of the important pathogenic E. coli lineage ST131. Our approach reveals the existence of multiple circulating subtypes of the major drug–resistant clade of ST131 and provides the first ever population level evidence of core genome substitutions in gene regulatory regions associated with the acquisition and maintenance of different accessory genome elements. We present an approach to evolutionary analysis of bacterial pathogens combining core genome, accessory genome, and gene regulatory region analyses. This enables unparalleled resolution of the evolution of a multi-drug resistant pandemic pathogen that would remain invisible to a core genome phylogenetic analysis alone. In particular, our combined analysis approach identifies population-level evidence for compensatory mutations offsetting the costs of resistance plasmid maintenance as a key event in the emergence of dominant MDR lineages of E. coli.
Affiliation(s)
- Alan McNally
  - Pathogen Research Group, Nottingham Trent University, Nottingham, United Kingdom
  - Institute of Microbiology and Infection, University of Birmingham, Birmingham, United Kingdom
- Yaara Oren
  - Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Darren Kelly
  - Department of Biology, National University Ireland, Maynooth, Ireland
- Ben Pascoe
  - College of Medicine, University of Swansea, Swansea, United Kingdom
- Steven Dunn
  - Pathogen Research Group, Nottingham Trent University, Nottingham, United Kingdom
- Tristan Sreecharan
  - Pathogen Research Group, Nottingham Trent University, Nottingham, United Kingdom
- Minna Vehkala
  - Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Niko Välimäki
  - Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Michael B. Prentice
  - Departments of Pathology and Microbiology, University College Cork, Cork, Ireland
- Amgad Ashour
  - Departments of Pathology and Microbiology, University College Cork, Cork, Ireland
- Oren Avram
  - Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Tal Pupko
  - Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Ulrich Dobrindt
  - Institute of Hygiene, Universitat Muenster, Muenster, Germany
- Ivan Literak
  - Department of Biology and Wildlife Diseases, Faculty of Veterinary Hygiene and Ecology, and CEITEC VFU, University of Veterinary and Pharmaceutical Sciences, Brno, Czech Republic
- Sebastian Guenther
  - Centre for Infection Medicine, Institute of Microbiology and Epizootics, Freie Universitat, Berlin, Germany
- Katharina Schaufler
  - Centre for Infection Medicine, Institute of Microbiology and Epizootics, Freie Universitat, Berlin, Germany
- Lothar H. Wieler
  - Centre for Infection Medicine, Institute of Microbiology and Epizootics, Freie Universitat, Berlin, Germany
  - Robert Koch Institute, Berlin, Germany
- Zong Zhiyong
  - Centre for Infectious Diseases, West China Hospital of Sichuan University, Chengdu, China
- James O. McInerney
  - Department of Biology, National University Ireland, Maynooth, Ireland
  - Faculty of Life Sciences, The University of Manchester, Manchester, United Kingdom
- Jukka Corander
  - Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
  - Department of Biostatistics, University of Oslo, Oslo, Norway
|
22
|
|
23
|
Akanni WA, Siu-Ting K, Creevey CJ, McInerney JO, Wilkinson M, Foster PG, Pisani D. Horizontal gene flow from Eubacteria to Archaebacteria and what it means for our understanding of eukaryogenesis. Philos Trans R Soc Lond B Biol Sci 2016; 370:20140337. [PMID: 26323767 DOI: 10.1098/rstb.2014.0337] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Indexed: 01/29/2023] Open
Abstract
The origin of the eukaryotic cell is considered one of the major evolutionary transitions in the history of life. Current evidence strongly supports a scenario of eukaryotic origin in which two prokaryotes, an archaebacterial host and an α-proteobacterium (the free-living ancestor of the mitochondrion), entered a stable symbiotic relationship. The establishment of this relationship was associated with a process of chimerization, whereby a large number of genes from the α-proteobacterial symbiont were transferred to the host nucleus. A general framework allowing the conceptualization of eukaryogenesis from a genomic perspective has long been lacking. Recent studies suggest that the origins of several archaebacterial phyla were coincident with massive imports of eubacterial genes. Although this does not indicate that these phyla originated through the same process that led to the origin of Eukaryota, it suggests that Archaebacteria might have had a general propensity to integrate into their genomes large amounts of eubacterial DNA. We suggest that this propensity provides a framework in which eukaryogenesis can be understood and studied in the light of archaebacterial ecology. We applied a recently developed supertree method to a genomic dataset composed of 392 eubacterial and 51 archaebacterial genera to test whether large numbers of genes flowing from Eubacteria are indeed coincident with the origin of major archaebacterial clades. In addition, we identified two potential large-scale transfers of uncertain directionality at the base of the archaebacterial tree. Our results are consistent with previous findings and seem to indicate that eubacterial gene imports (particularly from δ-Proteobacteria, Clostridia and Actinobacteria) were an important factor in archaebacterial history. 
Archaebacteria seem to have long relied on Eubacteria as a source of genetic diversity, and while the precise mechanism that allowed these imports is unknown, we suggest that our results support the view that processes comparable to those through which eukaryotes emerged might have been common in archaebacterial history.
Affiliation(s)
- Wasiu A Akanni
  - School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, Bristol BS8 1TG, UK
  - Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
  - Department of Life Science, The Natural History Museum, London SW7 5BD, UK
- Karen Siu-Ting
  - School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, Bristol BS8 1TG, UK
  - Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
  - Department of Life Science, The Natural History Museum, London SW7 5BD, UK
  - Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, Ceredigion SY23 3FG, UK
- Christopher J Creevey
  - Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, Ceredigion SY23 3FG, UK
- James O McInerney
  - Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
  - Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK
- Mark Wilkinson
  - Department of Life Science, The Natural History Museum, London SW7 5BD, UK
- Peter G Foster
  - Department of Life Science, The Natural History Museum, London SW7 5BD, UK
- Davide Pisani
  - School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, Bristol BS8 1TG, UK
|
24
|
Fondi M, Karkman A, Tamminen MV, Bosi E, Virta M, Fani R, Alm E, McInerney JO. "Every Gene Is Everywhere but the Environment Selects": Global Geolocalization of Gene Sharing in Environmental Samples through Network Analysis. Genome Biol Evol 2016; 8:1388-400. [PMID: 27190206 PMCID: PMC4898794 DOI: 10.1093/gbe/evw077] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Indexed: 01/07/2023] Open
Abstract
The spatial distribution of microbes on our planet is famously formulated in the Baas Becking hypothesis as “everything is everywhere but the environment selects.” While this hypothesis does not strictly rule out patterns caused by geographical effects on ecology and historical founder effects, it does propose that the remarkable dispersal potential of microbes leads to distributions generally shaped by environmental factors rather than geographical distance. By constructing sequence similarity networks from uncultured environmental samples, we show that microbial gene pool distributions are not influenced nearly as much by geography as ecology, thus extending the Baas Becking hypothesis from whole organisms to microbial genes. We find that gene pools are shaped by their broad ecological niche (such as sea water, fresh water, host, and airborne). We find that freshwater habitats act as a gene exchange bridge between otherwise disconnected habitats. Finally, certain antibiotic resistance genes deviate from the general trend of habitat specificity by exhibiting a high degree of cross-habitat mobility. The strong cross-habitat mobility of antibiotic resistance genes is a cause for concern and provides a paradigmatic example of the rate by which genes colonize new habitats when new selective forces emerge.
Affiliation(s)
- Marco Fondi
  - Laboratory of Microbial and Molecular Evolution, Department of Biology, University of Florence, Italy
  - Computational Biology Group, University of Florence, Italy
- Antti Karkman
  - Department of Food and Environmental Sciences, University of Helsinki, Finland
- Manu V Tamminen
  - Department of Environmental Systems Science, ETH Zürich, Switzerland
  - Department of Aquatic Ecology, Eawag, Switzerland
- Emanuele Bosi
  - Laboratory of Microbial and Molecular Evolution, Department of Biology, University of Florence, Italy
  - Computational Biology Group, University of Florence, Italy
- Marko Virta
  - Department of Food and Environmental Sciences, University of Helsinki, Finland
- Renato Fani
  - Laboratory of Microbial and Molecular Evolution, Department of Biology, University of Florence, Italy
  - Computational Biology Group, University of Florence, Italy
- Eric Alm
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology
- James O McInerney
  - Department of Biology, National University of Ireland Maynooth, County Kildare, Ireland
  - Computational Evolutionary Biology, Faculty of Life Sciences, The University of Manchester, United Kingdom
|
25
|
McInerney JO. Society for Molecular Biology and Evolution, Council and Business Meetings, 2015, Vienna Austria. Mol Biol Evol 2015. [PMCID: PMC4693982 DOI: 10.1093/molbev/msv275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/14/2022] Open
Affiliation(s)
- James O. McInerney
  - University of Manchester, Manchester, United Kingdom
  - Corresponding author
|
26
|
Ku C, Nelson-Sathi S, Roettger M, Sousa FL, Lockhart PJ, Bryant D, Hazkani-Covo E, McInerney JO, Landan G, Martin WF. Endosymbiotic origin and differential loss of eukaryotic genes. Nature 2015; 524:427-32. [PMID: 26287458 DOI: 10.1038/nature14963] [Citation(s) in RCA: 188] [Impact Index Per Article: 20.9] [Received: 04/22/2015] [Accepted: 07/20/2015] [Indexed: 01/11/2023]
Abstract
Chloroplasts arose from cyanobacteria, mitochondria arose from proteobacteria. Both organelles have conserved their prokaryotic biochemistry, but their genomes are reduced, and most organelle proteins are encoded in the nucleus. Endosymbiotic theory posits that bacterial genes in eukaryotic genomes entered the eukaryotic lineage via organelle ancestors. It predicts episodic influx of prokaryotic genes into the eukaryotic lineage, with acquisition corresponding to endosymbiotic events. Eukaryotic genome sequences, however, increasingly implicate lateral gene transfer, both from prokaryotes to eukaryotes and among eukaryotes, as a source of gene content variation in eukaryotic genomes, which predicts continuous, lineage-specific acquisition of prokaryotic genes in divergent eukaryotic groups. Here we discriminate between these two alternatives by clustering and phylogenetic analysis of eukaryotic gene families having prokaryotic homologues. Our results indicate (1) that gene transfer from bacteria to eukaryotes is episodic, as revealed by gene distributions, and coincides with major evolutionary transitions at the origin of chloroplasts and mitochondria; (2) that gene inheritance in eukaryotes is vertical, as revealed by extensive topological comparison, sparse gene distributions stemming from differential loss; and (3) that continuous, lineage-specific lateral gene transfer, although it sometimes occurs, does not contribute to long-term gene content evolution in eukaryotic genomes.
Affiliation(s)
- Chuan Ku
  - Institute of Molecular Evolution, Heinrich-Heine University, 40225 Düsseldorf, Germany
- Shijulal Nelson-Sathi
  - Institute of Molecular Evolution, Heinrich-Heine University, 40225 Düsseldorf, Germany
- Mayo Roettger
  - Institute of Molecular Evolution, Heinrich-Heine University, 40225 Düsseldorf, Germany
- Filipa L Sousa
  - Institute of Molecular Evolution, Heinrich-Heine University, 40225 Düsseldorf, Germany
- Peter J Lockhart
  - Institute of Fundamental Sciences, Massey University, Palmerston North 4474, New Zealand
- David Bryant
  - Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
- Einat Hazkani-Covo
  - Department of Natural and Life Sciences, The Open University of Israel, Ra'anana 43107, Israel
- James O McInerney
  - Department of Biology, National University of Ireland, Maynooth, County Kildare, Ireland
  - Michael Smith Building, The University of Manchester, Oxford Rd, Manchester M13 9PL, UK
- Giddy Landan
  - Genomic Microbiology Group, Institute of Microbiology, Christian-Albrechts-University of Kiel, 24118 Kiel, Germany
- William F Martin
  - Institute of Molecular Evolution, Heinrich-Heine University, 40225 Düsseldorf, Germany
  - Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, 2780-157 Oeiras, Portugal
|
27
|
Affiliation(s)
- James O McInerney
  - Department of Biology, National University of Ireland, Maynooth, County Kildare, Ireland
- Mary J O'Connell
  - School of Biotechnology, Dublin City University, Glasnevin, Dublin 9, Ireland
|
28
|
Affiliation(s)
- Tal Dagan
  - Institute of Microbiology, Christian-Albrechts-University of Kiel, Germany
- Eric Bapteste
  - UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, France
- James O McInerney
  - Department of Biology, National University of Ireland Maynooth, Ireland
- William F Martin
  - Institute of Molecular Evolution, Heinrich-Heine University Düsseldorf, Germany
|
29
|
McInerney JO, O'Connell MJ, Pisani D. The hybrid nature of the Eukaryota and a consilient view of life on Earth. Nat Rev Microbiol 2014; 12:449-55. [DOI: 10.1038/nrmicro3271] [Citation(s) in RCA: 96] [Impact Index Per Article: 9.6] [Indexed: 02/07/2023]
|
30
|
Liu S, Lorenzen ED, Fumagalli M, Li B, Harris K, Xiong Z, Zhou L, Korneliussen TS, Somel M, Babbitt C, Wray G, Li J, He W, Wang Z, Fu W, Xiang X, Morgan CC, Doherty A, O'Connell MJ, McInerney JO, Born EW, Dalén L, Dietz R, Orlando L, Sonne C, Zhang G, Nielsen R, Willerslev E, Wang J. Population genomics reveal recent speciation and rapid evolutionary adaptation in polar bears. Cell 2014; 157:785-94. [PMID: 24813606 PMCID: PMC4089990 DOI: 10.1016/j.cell.2014.03.054] [Citation(s) in RCA: 227] [Impact Index Per Article: 22.7] [Received: 09/06/2013] [Revised: 12/20/2013] [Accepted: 03/04/2014] [Indexed: 12/22/2022]
Abstract
Polar bears are uniquely adapted to life in the High Arctic and have undergone drastic physiological changes in response to Arctic climates and a hyper-lipid diet of primarily marine mammal prey. We analyzed 89 complete genomes of polar bear and brown bear using population genomic modeling and show that the species diverged only 479-343 thousand years BP. We find that genes on the polar bear lineage have been under stronger positive selection than in brown bears; nine of the top 16 genes under strong positive selection are associated with cardiomyopathy and vascular disease, implying important reorganization of the cardiovascular system. One of the genes showing the strongest evidence of selection, APOB, encodes the primary lipoprotein component of low-density lipoprotein (LDL); functional mutations in APOB may explain how polar bears are able to cope with life-long elevated LDL levels that are associated with high risk of heart disease in humans.
Affiliation(s)
- Shiping Liu
  - BGI-Shenzhen, Shenzhen 518083, China
  - School of Bioscience and Biotechnology, South China University of Technology, Guangzhou 510641, China
- Eline D Lorenzen
  - Department of Integrative Biology, 3060 Valley Life Sciences Building, University of California, Berkeley, CA 94720, USA
  - Centre for GeoGenetics, Natural History Museum, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen K, Denmark
- Matteo Fumagalli
  - Department of Integrative Biology, 3060 Valley Life Sciences Building, University of California, Berkeley, CA 94720, USA
- Bo Li
  - BGI-Shenzhen, Shenzhen 518083, China
- Kelley Harris
  - Department of Mathematics, 970 Evans Hall, University of California, Berkeley, CA 94720, USA
- Long Zhou
  - BGI-Shenzhen, Shenzhen 518083, China
- Thorfinn Sand Korneliussen
  - Centre for GeoGenetics, Natural History Museum, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen K, Denmark
- Mehmet Somel
  - Department of Integrative Biology, 3060 Valley Life Sciences Building, University of California, Berkeley, CA 94720, USA
- Courtney Babbitt
  - Department of Biology, 124 Science Drive, Duke Box # 90338, Duke University, Durham, NC 27708, USA
  - Institute for Genome Sciences & Policy, 101 Science Drive, DUMC Box 3382, Duke University, Durham, NC 27708, USA
- Greg Wray
  - Department of Biology, 124 Science Drive, Duke Box # 90338, Duke University, Durham, NC 27708, USA
  - Institute for Genome Sciences & Policy, 101 Science Drive, DUMC Box 3382, Duke University, Durham, NC 27708, USA
- Weiming He
  - BGI-Shenzhen, Shenzhen 518083, China
  - School of Bioscience and Biotechnology, South China University of Technology, Guangzhou 510641, China
- Zhuo Wang
  - BGI-Shenzhen, Shenzhen 518083, China
- Xueyan Xiang
  - BGI-Shenzhen, Shenzhen 518083, China
  - College of Life Sciences, Sichuan University, Chengdu 610064, China
- Claire C Morgan
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Glasnevin, Dublin 9, Ireland
- Aoife Doherty
  - Bioinformatics and Molecular Evolution Unit, Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
- Mary J O'Connell
  - Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Glasnevin, Dublin 9, Ireland
- James O McInerney
  - Bioinformatics and Molecular Evolution Unit, Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
- Erik W Born
  - Greenland Institute of Natural Resources, c/o Government of Greenland Representation in Denmark, Strandgade 91, 3. Floor, PO Box 2151, 1016 Copenhagen K, Denmark
- Love Dalén
  - Department of Bioinformatics and Genetics, Swedish Museum of Natural History, PO Box 50007, 10405, Stockholm, Sweden
- Rune Dietz
  - Department of Bioscience, Arctic Research Centre, Aarhus University, Frederiksborgvej 399, PO Box 358, 4000 Roskilde, Denmark
- Ludovic Orlando
  - Centre for GeoGenetics, Natural History Museum, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen K, Denmark
- Christian Sonne
  - Department of Bioscience, Arctic Research Centre, Aarhus University, Frederiksborgvej 399, PO Box 358, 4000 Roskilde, Denmark
- Guojie Zhang
  - BGI-Shenzhen, Shenzhen 518083, China
  - Centre for Social Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
- Rasmus Nielsen
  - BGI-Shenzhen, Shenzhen 518083, China
  - Department of Integrative Biology, 3060 Valley Life Sciences Building, University of California, Berkeley, CA 94720, USA
  - Department of Statistics, 367 Evans Hall, University of California, Berkeley, CA 94720, USA
  - Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen Ø, Denmark
- Eske Willerslev
  - Centre for GeoGenetics, Natural History Museum, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen K, Denmark
- Jun Wang
  - BGI-Shenzhen, Shenzhen 518083, China
  - Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen Ø, Denmark
  - Princess Al Jawhara Center of Excellence in the Research of Hereditary Disorders, King Abdulaziz University, Jeddah 21589, Saudi Arabia
  - Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau 999078, China
  - Department of Medicine, University of Hong Kong, Sassoon Road, Pokfulam, Hong Kong
|
31
|
Haggerty LS, Jachiet PA, Hanage WP, Fitzpatrick DA, Lopez P, O'Connell MJ, Pisani D, Wilkinson M, Bapteste E, McInerney JO. A pluralistic account of homology: adapting the models to the data. Mol Biol Evol 2013; 31:501-16. [PMID: 24273322 PMCID: PMC3935183 DOI: 10.1093/molbev/mst228] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Indexed: 12/13/2022] Open
Abstract
Defining homologous genes is important in many evolutionary studies but raises obvious issues. Some of these issues are conceptual and stem from our assumptions of how a gene evolves, others are practical, and depend on the algorithmic decisions implemented in existing software. Therefore, to make progress in the study of homology, both ontological and epistemological questions must be considered. In particular, defining homologous genes cannot be solely addressed under the classic assumptions of strong tree thinking, according to which genes evolve in a strictly tree-like fashion of vertical descent and divergence and the problems of homology detection are primarily methodological. Gene homology could also be considered under a different perspective where genes evolve as “public goods,” subjected to various introgressive processes. In this latter case, defining homologous genes becomes a matter of designing models suited to the actual complexity of the data and how such complexity arises, rather than trying to fit genetic data to some a priori tree-like evolutionary model, a practice that inevitably results in the loss of much information. Here we show how important aspects of the problems raised by homology detection methods can be overcome when even more fundamental roots of these problems are addressed by analyzing public goods thinking evolutionary processes through which genes have frequently originated. This kind of thinking acknowledges distinct types of homologs, characterized by distinct patterns, in phylogenetic and nonphylogenetic unrooted or multirooted networks. In addition, we define “family resemblances” to include genes that are related through intermediate relatives, thereby placing notions of homology in the broader context of evolutionary relationships. 
We conclude by presenting some payoffs of adopting such a pluralistic account of homology and family relationship, which expands the scope of evolutionary analyses beyond the traditional, yet relatively narrow focus allowed by a strong tree-thinking view on gene evolution.
Affiliation(s)
- Leanne S Haggerty
  - Bioinformatics and Molecular Evolution Unit, Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland
|
32
|
Bogumil D, Alvarez-Ponce D, Landan G, McInerney JO, Dagan T. Integration of two ancestral chaperone systems into one: the evolution of eukaryotic molecular chaperones in light of eukaryogenesis. Mol Biol Evol 2013; 31:410-8. [PMID: 24188869 PMCID: PMC3907059 DOI: 10.1093/molbev/mst212] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Indexed: 01/13/2023] Open
Abstract
Eukaryotic genomes are mosaics of genes acquired from their prokaryotic ancestors, the eubacterial endosymbiont that gave rise to the mitochondrion and its archaebacterial host. Genomic footprints of the prokaryotic merger at the origin of eukaryotes are still discernable in eukaryotic genomes, where gene expression and function correlate with their prokaryotic ancestry. Molecular chaperones are essential in all domains of life as they assist the functional folding of their substrate proteins and protect the cell against the cytotoxic effects of protein misfolding. Eubacteria and archaebacteria code for slightly different chaperones, comprising distinct protein folding pathways. Here we study the evolution of the eukaryotic protein folding pathways following the endosymbiosis event. A phylogenetic analysis of all 64 chaperones encoded in the Saccharomyces cerevisiae genome revealed 25 chaperones of eubacterial ancestry, 11 of archaebacterial ancestry, 10 of ambiguous prokaryotic ancestry, and 18 that may represent eukaryotic innovations. Several chaperone families (e.g., Hsp90 and Prefoldin) trace their ancestry to only one prokaryote group, while others, such as Hsp40 and Hsp70, are of mixed ancestry, with members contributed from both prokaryotic ancestors. Analysis of the yeast chaperone–substrate interaction network revealed no preference for interaction between chaperones and substrates of the same origin. Our results suggest that the archaebacterial and eubacterial protein folding pathways have been reorganized and integrated into the present eukaryotic pathway. The highly integrated chaperone system of yeast is a manifestation of the central role of chaperone-mediated folding in maintaining cellular fitness. Most likely, both archaebacterial and eubacterial chaperone systems were essential at the very early stages of eukaryogenesis, and the retention of both may have offered new opportunities for expanding the scope of chaperone-mediated folding.
Affiliation(s)
- David Bogumil
- Institute of Microbiology, Christian-Albrechts-University of Kiel, Kiel, Germany
|
33
|
McInerney JO. More than tree dimensions: inter-lineage evolution's ecological importance. Trends Ecol Evol 2013; 28:624-5. [PMID: 24035465 DOI: 10.1016/j.tree.2013.09.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 08/27/2013] [Revised: 09/01/2013] [Accepted: 09/03/2013] [Indexed: 10/26/2022]
Abstract
Horizontal transfer of genes has sometimes been viewed as a nuisance for the work of understanding the evolutionary history of lineages. Recent work has shown that clever analysis of inter-lineage gene transfer is productive and has tremendous explanatory power, in particular, for niche adaptation. These studies alter our perception of what are the fundamental units of evolution and selection.
Affiliation(s)
- James O McInerney
- Department of Biology, National University of Ireland Maynooth, Co. Kildare, Ireland.
|
34
|
Doherty A, McInerney JO. Translational selection frequently overcomes genetic drift in shaping synonymous codon usage patterns in vertebrates. Mol Biol Evol 2013; 30:2263-7. [PMID: 23883522 DOI: 10.1093/molbev/mst128] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Indexed: 01/15/2023] Open
Abstract
Synonymous codon usage patterns are shaped by a balance between mutation, drift, and natural selection. To date, detection of translational selection in vertebrates has proven to be a challenging task, obscured by small long-term effective population sizes in larger animals and the existence of isochores in some species. The consensus is that, in such species, natural selection is either completely ineffective at overcoming mutational pressures and genetic drift or perhaps is effective but so weak that it is not detectable. The aim of this research is to understand the interplay between mutation, selection, and genetic drift in vertebrates. We observe that although variation in mutational bias is undoubtedly the dominant force influencing codon usage, translational selection acts as a weak additional factor influencing synonymous codon usage. These observations indicate that translational selection is a widespread phenomenon in vertebrates and is not limited to a few species.
Affiliation(s)
- Aoife Doherty
- Bioinformatics and Molecular Evolution Unit, Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland
|
35
|
Abstract
Heterogeneity among life traits in mammals has resulted in considerable phylogenetic conflict, particularly concerning the position of the placental root. Layered upon this are gene- and lineage-specific variation in amino acid substitution rates and compositional biases. Life trait variations that may impact upon mutational rates are longevity, metabolic rate, body size, and germ line generation time. Over the past 12 years, three main conflicting hypotheses have emerged for the placement of the placental root. These hypotheses place the Atlantogenata (common ancestor of Xenarthra plus Afrotheria), the Afrotheria, or the Xenarthra as the sister group to all other placental mammals. Model adequacy is critical for accurate tree reconstruction and by failing to account for these compositional and character exchange heterogeneities across the tree and data set, previous studies have not provided a strongly supported hypothesis for the placental root. For the first time, models that accommodate both tree and data set heterogeneity have been applied to mammal data. Here, we show the impact of accurate model assignment and the importance of data sets in accommodating model parameters while maintaining the power to reject competing hypotheses. Through these sophisticated methods, we demonstrate the importance of model adequacy, data set power and provide strong support for the Atlantogenata over other competing hypotheses for the position of the placental root.
Affiliation(s)
- Claire C Morgan
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Glasnevin, Dublin, Ireland
|
36
|
Abstract
BACKGROUND Every year the human population encounters epidemic outbreaks of influenza, and history reveals recurring pandemics that have had devastating consequences. The current work focuses on the development of a robust algorithm for detecting influenza strains that have a composite genomic architecture. These influenza subtypes can be generated through a reassortment process, whereby a virus can inherit gene segments from two different types of influenza particles during replication. Reassortant strains are often not immediately recognised by the adaptive immune system of the hosts and hence may be the source of pandemic outbreaks. Owing to their importance in public health and their infectious ability, it is essential to identify reassortant influenza strains in order to understand the evolution of this virus and describe reassortment pathways that may be biased towards particular viral segments. Phylogenetic methods have been used traditionally to identify reassortant viruses. In many studies up to now, the assumption has been that if two phylogenetic trees differ, it is because reassortment has caused them to be different. While phylogenetic incongruence may be caused by real differences in evolutionary history, it can also be the result of phylogenetic error. Therefore, we wish to develop a method for distinguishing between topological inconsistency that is due to confounding effects and topological inconsistency that is due to reassortment. RESULTS The current work describes the implementation of two approaches for robustly identifying reassortment events. The algorithms rest on the idea of significance of difference between phylogenetic trees or phylogenetic tree sets, and subtree pruning and regrafting operations, which mimic the effect of reassortment on tree topologies. The first method is based on a maximum likelihood (ML) framework (MLreassort) and the second implements a Bayesian approach (Breassort) for reassortment detection. We focus on reassortment events that are found by both methods. We test both methods on a simulated dataset and on a small collection of real viral data isolated in Hong Kong in 1999. CONCLUSIONS The nature of segmented viral genomes presents many challenges with respect to disease. The algorithms developed here can effectively identify reassortment events in small viral datasets and can be applied not only to influenza but also to other segmented viruses. Owing to computational demands of comparing tree topologies, further development in this area is necessary to allow their application to larger datasets.
Affiliation(s)
- Victoria Svinti
- Department of Biology, National University of Ireland at Maynooth, Maynooth, Co Kildare, Ireland
- Current address: Department of Microbiology & Immunology, Life Sciences Centre, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
- James A Cotton
- Department of Biology, National University of Ireland at Maynooth, Maynooth, Co Kildare, Ireland
- Current address: Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- James O McInerney
- Department of Biology, National University of Ireland at Maynooth, Maynooth, Co Kildare, Ireland
|
37
|
Doherty A, Alvarez-Ponce D, McInerney JO. Increased genome sampling reveals a dynamic relationship between gene duplicability and the structure of the primate protein-protein interaction network. Mol Biol Evol 2012; 29:3563-73. [PMID: 22723304 DOI: 10.1093/molbev/mss165] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Indexed: 01/06/2023] Open
Abstract
Although gene duplications occur at a high rate, only a small fraction of these are retained. The position of a gene's encoded product in the protein-protein interaction network has recently emerged as a determining factor of gene duplicability. However, the direction of the relationship between network centrality and duplicability is not universal: In Escherichia coli, yeast, fly, and worm, duplicated genes more often act at the periphery of the network, whereas in humans, such genes tend to occupy the most central positions. Herein, we have inferred duplication events that took place in the different branches of the primate phylogeny. In agreement with previous observations, we found that duplications generally affected the most central network genes, which is presumably the process that has most influenced the trend in humans. However, the opposite trend--that is, duplication being more common in genes whose encoded products are peripheral in the network--is observed for three recent branches, including, quite counterintuitively, the external branch leading to humans. This indicates a shift in the relationship between centrality and duplicability during primate evolution. Furthermore, we found that genes encoding interacting proteins exhibit phylogenetic tree topologies that are more similar than expected for random pairs and that genes duplicated in a given branch of the phylogeny tend to interact with those that duplicated in the same lineage. These results indicate that duplication of a gene increases the likelihood of duplication of its interacting partners. Our observations indicate that the structure of the primate protein-protein interaction network affects gene duplicability in previously unrecognized ways.
Affiliation(s)
- Aoife Doherty
- Department of Biology, National University of Ireland Maynooth, Maynooth, County Kildare, Ireland
|
38
|
McInerney JO, Pisani D, Bapteste E, O'Connell MJ. The Public Goods Hypothesis for the evolution of life on Earth. Biol Direct 2011; 6:41. [PMID: 21861918 PMCID: PMC3179745 DOI: 10.1186/1745-6150-6-41] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Received: 04/17/2011] [Accepted: 08/23/2011] [Indexed: 02/01/2023] Open
Abstract
It is becoming increasingly difficult to reconcile the observed extent of horizontal gene transfers with the central metaphor of a great tree uniting all evolving entities on the planet. In this manuscript we describe the Public Goods Hypothesis and show that it is appropriate in order to describe biological evolution on the planet. According to this hypothesis, nucleotide sequences (genes, promoters, exons, etc.) are simply seen as goods, passed from organism to organism through both vertical and horizontal transfer. Public goods sequences are defined by having the properties of being largely non-excludable (no organism can be effectively prevented from accessing these sequences) and non-rival (while such a sequence is being used by one organism it is also available for use by another organism). The universal nature of genetic systems ensures that such non-excludable sequences exist and non-excludability explains why we see a myriad of genes in different combinations in sequenced genomes. There are three features of the public goods hypothesis. Firstly, segments of DNA are seen as public goods, available for all organisms to integrate into their genomes. Secondly, we expect the evolution of mechanisms for DNA sharing and of defense mechanisms against DNA intrusion in genomes. Thirdly, we expect that we do not see a global tree-like pattern. Instead, we expect local tree-like patterns to emerge from the combination of a commonage of genes and vertical inheritance of genomes by cell division. Indeed, while genes are theoretically public goods, in reality, some genes are excludable, particularly, though not only, when they have variant genetic codes or behave as coalition or club goods, available for all organisms of a coalition to integrate into their genomes, and non-rival within the club. We view the Tree of Life hypothesis as a regionalized instance of the Public Goods hypothesis, just like classical mechanics and Euclidean geometry are seen as regionalized instances of quantum mechanics and Riemannian geometry respectively. We argue for this change using an axiomatic approach that shows that the Public Goods hypothesis is a better accommodation of the observed data than the Tree of Life hypothesis.
Affiliation(s)
- James O McInerney
- Molecular Evolution and Bioinformatics Unit, Department of Biology, National University of Ireland Maynooth, County Kildare, Ireland.
|
39
|
McInerney JO, Martin WF, Koonin EV, Allen JF, Galperin MY, Lane N, Archibald JM, Embley TM. Planctomycetes and eukaryotes: a case of analogy not homology. Bioessays 2011; 33:810-7. [PMID: 21858844 DOI: 10.1002/bies.201100045] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.8] [Received: 04/04/2011] [Revised: 07/13/2011] [Accepted: 07/15/2011] [Indexed: 11/11/2022]
Abstract
Planctomycetes, Verrucomicrobia and Chlamydia are prokaryotic phyla, sometimes grouped together as the PVC superphylum of eubacteria. Some PVC species possess interesting attributes, in particular, internal membranes that superficially resemble eukaryotic endomembranes. Some biologists now claim that PVC bacteria are nucleus-bearing prokaryotes and are considered evolutionary intermediates in the transition from prokaryote to eukaryote. PVC prokaryotes do not possess a nucleus and are not intermediates in the prokaryote-to-eukaryote transition. Here we summarise the evidence that shows why all of the PVC traits that are currently cited as evidence for aspiring eukaryoticity are either analogous (the result of convergent evolution), not homologous, to eukaryotic traits; or else they are the result of horizontal gene transfers.
Affiliation(s)
- James O McInerney
- Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland.
|
40
|
Cummins CA, McInerney JO. A Method for Inferring the Rate of Evolution of Homologous Characters that Can Potentially Improve Phylogenetic Inference, Resolve Deep Divergence and Correct Systematic Biases. Syst Biol 2011; 60:833-44. [DOI: 10.1093/sysbio/syr064] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.7] [Indexed: 11/13/2022] Open
Affiliation(s)
- Carla A. Cummins
- Molecular Evolution and Bioinformatics Unit, Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
- James O. McInerney
- Molecular Evolution and Bioinformatics Unit, Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland
|
41
|
Alvarez-Ponce D, McInerney JO. The human genome retains relics of its prokaryotic ancestry: human genes of archaebacterial and eubacterial origin exhibit remarkable differences. Genome Biol Evol 2011; 3:782-90. [PMID: 21795752 PMCID: PMC3163467 DOI: 10.1093/gbe/evr073] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Indexed: 01/28/2023] Open
Abstract
Eukaryotes are generally thought to stem from a fusion event involving an archaebacterium and a eubacterium. As a result of this event, contemporaneous eukaryotic genomes are chimeras of genes inherited from both endosymbiotic partners. These two coexisting gene repertoires have been shown to differ in a number of ways in yeast. Here we combine genomic and functional data in order to determine if and how human genes that have been inherited from both prokaryotic ancestors remain distinguishable. We show that, despite being fewer in number, human genes of archaebacterial origin are more highly and broadly expressed across tissues, are more likely to have lethal mouse orthologs, tend to be involved in informational processes, are more selectively constrained, and encode shorter and more central proteins in the protein–protein interaction network than eubacterium-like genes. Furthermore, consistent with endosymbiotic theory, we show that proteins tend to interact with those encoded by genes of the same ancestry. Most interestingly from a human health perspective, archaebacterial genes are less likely to be involved in heritable human disease. Taken together, these results show that more than 2 billion years after eukaryogenesis, the human genome retains at least two somewhat distinct communities of genes.
Affiliation(s)
- David Alvarez-Ponce
- Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland
|
42
|
Affiliation(s)
- Mark A Ragan
- Institute for Molecular Bioscience, and ARC Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, Australia.
|
43
|
Abstract
Horizontal gene transfer (HGT) plays a significant role in microbial evolution. It can accelerate the adaptation of an organism, it can generate new metabolic pathways and it can completely remodel an organism's genome. We examine 27 closely related genomes from the YESS group of gamma proteobacteria and a variety of four-taxon datasets from a diverse range of prokaryotes in order to explore the kinds of effects HGT has had on these organisms.
Affiliation(s)
- Leanne S Haggerty
- Department of Biology, The National University of Ireland, Maynooth, County Kildare, Ireland
|
44
|
Hatadani LM, McInerney JO, de Medeiros HF, Junqueira ACM, de Azeredo-Espin AM, Klaczko LB. Molecular phylogeny of the Drosophila tripunctata and closely related species groups (Diptera: Drosophilidae). Mol Phylogenet Evol 2009; 51:595-600. [PMID: 19285146 DOI: 10.1016/j.ympev.2009.02.022] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Received: 07/17/2008] [Revised: 02/27/2009] [Accepted: 02/28/2009] [Indexed: 10/21/2022]
Abstract
We suggest a new phylogenetic hypothesis for the tripunctata radiation based on sequences of mitochondrial genes. Phylogenetic trees were reconstructed by parsimony, maximum likelihood and Bayesian methods. We performed tests for hypotheses of monophyly for taxonomic groups and other specific hypotheses. Results reject the monophyly for the tripunctata group whereas monophyly is not rejected for the tripunctata radiation and other specific groups within the radiation. Although most of the basal nodes were unresolved we were able to identify four clusters within the tripunctata radiation. These results suggest the collection of additional data before a proper taxonomic revision could be proposed.
Affiliation(s)
- Luciane Mendes Hatadani
- Departamento de Genética e Evolução, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
|
45
|
Abstract
Background A large number of theories have been advanced to explain why genes involved in the same biochemical processes are often co-located in genomes. Most of these theories have been dismissed because empirical data do not match the expectations of the models. In this work we test the hypothesis that cluster formation is most likely due to a selective pressure to gradually co-localise protein products and that operon formation is not an inevitable conclusion of the process. Results We have selected an exemplar well-characterised biochemical pathway, the phenylacetate degradation pathway, and we show that its complex history is only compatible with a model where a selective advantage accrues from moving genes closer together. This selective pressure is likely to be reasonably weak and only twice in our dataset of 102 genomes do we see independent formation of a complete cluster containing all the catabolic genes in the pathway. Additionally, de novo clustering of genes clearly occurs repeatedly, even though recombination should result in the random dispersal of such genes in their respective genomes. Interspecies gene transfer has frequently replaced in situ copies of genes resulting in clusters that have similar content but very different evolutionary histories. Conclusion Our model for cluster formation in prokaryotes, therefore, consists of a two-stage selection process. The first stage is selection to move genes closer together, either because of macromolecular crowding, chromatin relaxation or transcriptional regulation pressure. This proximity opportunity sets up a separate selection for co-transcription.
Affiliation(s)
- Fergal J Martin
- Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland.
|
46
|
Abstract
Supertree methods combine multiple phylogenetic trees to produce the overall best "supertree." They can be used to combine phylogenetic information from datasets only partially overlapping and from disparate sources (like molecular and morphological data), or to break down problems thought to be computationally intractable. Some of the longest standing phylogenetic conundrums are now being brought to light using supertree approaches. We describe the most widely used supertree methods implemented in the software program "clann" and provide a step by step tutorial for investigating phylogenetic information and reconstructing the best supertree. Clann is freely available for Windows, Mac and Unix/Linux operating systems under the GNU public licence at (http://bioinf.nuim.ie/software/clann).
|
47
|
McCann A, Cotton JA, McInerney JO. The tree of genomes: an empirical comparison of genome-phylogeny reconstruction methods. BMC Evol Biol 2008; 8:312. [PMID: 19014489 PMCID: PMC2592249 DOI: 10.1186/1471-2148-8-312] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Received: 04/07/2008] [Accepted: 11/12/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In the past decade or more, the emphasis for reconstructing species phylogenies has moved from the analysis of a single gene to the analysis of multiple genes and even completed genomes. The simplest method of scaling up is to use familiar analysis methods on a larger scale and this is the most popular approach. However, duplications and losses of genes along with horizontal gene transfer (HGT) can lead to a situation where there is only an indirect relationship between gene and genome phylogenies. In this study we examine five widely-used approaches and their variants to see if indeed they are more-or-less saying the same thing. In particular, we focus on Conditioned Reconstruction as it is a method that is designed to work well even if HGT is present. RESULTS We confirm a previous suggestion that this method has a systematic bias. We show that no two methods produce the same results and most current methods of inferring genome phylogenies produce results that are significantly different to other methods. CONCLUSION We conclude that genome phylogenies need to be interpreted differently, depending on the method used to construct them.
Affiliation(s)
- Angela McCann
- Bioinformatics Laboratory, Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland.
|
48
|
Affiliation(s)
- James O McInerney
- Department of Biology, National University of Ireland Maynooth, Maynooth, County Kildare, Ireland.
|
49
|
Abstract
With the number of fully sequenced genomes increasing steadily, there is greater interest in performing large-scale phylogenomic analyses from large numbers of individual gene families. Maximum likelihood (ML) has been shown repeatedly to be one of the most accurate methods for phylogenetic construction. Recently, there have been a number of algorithmic improvements in maximum-likelihood-based tree search methods. However, it can still take a long time to analyse the evolutionary history of many gene families using a single computer. Distributed computing refers to a method of combining the computing power of multiple computers in order to perform some larger overall calculation. In this article, we present the first high-throughput implementation of a distributed phylogenetics platform, MultiPhyl, capable of using the idle computational resources of many heterogeneous non-dedicated machines to form a phylogenetics supercomputer. MultiPhyl allows a user to upload hundreds or thousands of amino acid or nucleotide alignments simultaneously and perform computationally intensive tasks such as model selection, tree searching and bootstrapping of each of the alignments using many desktop machines. The program implements a set of 88 amino acid models and 56 nucleotide maximum likelihood models and a variety of statistical methods for choosing between alternative models. A MultiPhyl webserver is available for public use at: http://www.cs.nuim.ie/distributed/multiphyl.php.
Affiliation(s)
- Thomas M Keane
- Pathogen Sequencing Unit, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Hinxton, UK.
|
50
|
Abstract
Eukaryotes are traditionally considered to be one of the three natural divisions of the tree of life and the sister group of the Archaebacteria. However, eukaryotic genomes are replete with genes of eubacterial ancestry, and more than 20 mutually incompatible hypotheses have been proposed to account for eukaryote origins. Here we test the predictions of these hypotheses using a novel supertree-based phylogenetic signal-stripping method, and recover supertrees of life based on phylogenies for up to 5,741 single gene families distributed across 185 genomes. Using our signal-stripping method, we show that there are three distinct phylogenetic signals in eukaryotic genomes. In order of strength, these link eukaryotes with the Cyanobacteria, the Proteobacteria, and the Thermoplasmatales, an archaebacterial (euryarchaeotes) group. These signals correspond to distinct symbiotic partners involved in eukaryote evolution: plastids, mitochondria, and the elusive host lineage. According to our whole-genome data, eukaryotes are hardly the sister group of the Archaebacteria, because up to 83% of eukaryotic genes with a prokaryotic homolog have eubacterial, not archaebacterial, origins. The results reject all but two of the current hypotheses for the origin of eukaryotes: those assuming a sulfur-dependent or hydrogen-dependent syntrophy for the origin of mitochondria.
Affiliation(s)
- Davide Pisani
- Department of Biology, The National University of Ireland, Maynooth, County Kildare, Ireland
|