51
|
Koonin EV. The Turbulent Network Dynamics of Microbial Evolution and the Statistical Tree of Life. J Mol Evol 2015; 80:244-50. [PMID: 25894542 PMCID: PMC4472940 DOI: 10.1007/s00239-015-9679-7] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Accepted: 04/08/2015] [Indexed: 11/05/2024]
Abstract
The wide spread and high rate of gene exchange and loss in the prokaryotic world translate into “network genomics”. The rates of gene gain and loss are comparable with the rate of point mutations but are substantially greater than the duplication rate. Thus, evolution of prokaryotes is primarily shaped by gene gain and loss. These processes are essential to prevent mutational meltdown of microbial populations by stopping Muller’s ratchet and appear to trigger emergence of major novel clades by opening up new ecological niches. At least some bacteria and archaea seem to have evolved dedicated devices for gene transfer. Despite the dominance of gene gain and loss, evolution of genes is intrinsically tree-like. The significant coherence between the topologies of numerous gene trees, particularly those for (nearly) universal genes, is compatible with the concept of a statistical tree of life, which forms the framework for reconstruction of the evolutionary processes in the prokaryotic world.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA,
| |
Collapse
|
52
|
|
53
|
McInerney J, Cummins C, Haggerty L. Goods-thinking vs. tree-thinking: Finding a place for mobile genetic elements. Mob Genet Elements 2014; 1:304-308. [PMID: 22545244 PMCID: PMC3337142 DOI: 10.4161/mge.19153] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
While it has become increasingly clear that the Tree of Life hypothesis has limitations in its ability to describe the evolution of all evolving entities on the planet, there has been a marked reluctance to move away from the tree-based language. Ironically, while modifying the idea of the Tree of Life to the extent that it is only very distantly related to its original descriptions, there has been a very careful attempt to retain the language of tree-thinking. The recent movement away from a tree-thinking language toward a goods-thinking language and perspective is a significant improvement. In this commentary, we describe how goods-thinking can provide better descriptions of evolution, can integrate evolution with environment more closely and can offer an equal place for Mobile Genetic Elements and chromosomal elements in discussions of evolutionary history.
Collapse
Affiliation(s)
- James McInerney
- Bioinformatics and Molecular Evolution Unit; Department of Biology; National University of Ireland Maynooth, Co.; Kildare, Ireland
| | | | | |
Collapse
|
54
|
Lobkovsky AE, Wolf YI, Koonin EV. Estimation of prokaryotic supergenome size and composition from gene frequency distributions. BMC Genomics 2014; 15 Suppl 6:S14. [PMID: 25572821 PMCID: PMC4240607 DOI: 10.1186/1471-2164-15-s6-s14] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Because prokaryotic genomes experience a rapid flux of genes, selection may act at a higher level than an individual genome. We explore a quantitative model of the distributed genome whereby groups of genomes evolve by acquiring genes from a fixed reservoir which we denote as supergenome. Previous attempts to understand the nature of the supergenome treated genomes as random, independent collections of genes and assumed that the supergenome consists of a small number of homogeneous sub-reservoirs. Here we explore the consequences of relaxing both assumptions. RESULTS We surveyed several methods for estimating the size and composition of the supergenome. The methods assumed that genomes were either random, independent samples of the supergenome or that they evolved from a common ancestor along a known tree via stochastic sampling from the reservoir. The reservoir was assumed to be either a collection of homogeneous sub-reservoirs or alternatively composed of genes with Gamma distributed gain probabilities. Empirical gene frequencies were used to either compute the likelihood of the data directly or first to reconstruct the history of gene gains and then compute the likelihood of the reconstructed numbers of gains. CONCLUSIONS Supergenome size estimates using the empirical gene frequencies directly are not robust with respect to the choice of the model. By contrast, using the gene frequencies and the phylogenetic tree to reconstruct multiple gene gains produces reliable estimates of the supergenome size and indicates that a homogeneous supergenome is more consistent with the data than a supergenome with Gamma distributed gain probabilities.
Collapse
|
55
|
Origins of major archaeal clades correspond to gene acquisitions from bacteria. Nature 2014; 517:77-80. [PMID: 25317564 PMCID: PMC4285555 DOI: 10.1038/nature13805] [Citation(s) in RCA: 170] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2014] [Accepted: 08/28/2014] [Indexed: 01/28/2023]
Abstract
The mechanisms that underlie the origin of major prokaryotic groups are poorly understood. In principle, the origin of both species and higher taxa among prokaryotes should entail similar mechanisms — ecological interactions with the environment paired with natural genetic variation involving lineage-specific gene innovations and lineage-specific gene acquisitions1,2,3,4. To investigate the origin of higher taxa in archaea, we have determined gene distributions and gene phylogenies for the 267,568 protein coding genes of 134 sequenced archaeal genomes in the context of their homologs from 1,847 reference bacterial genomes. Archaea-specific gene families define 13 traditionally recognized archaeal higher taxa in our sample. Here we report that the origins of these 13 groups unexpectedly correspond to 2,264 group-specific gene acquisitions from bacteria. Interdomain gene transfer is highly asymmetric, transfers from bacteria to archaea are more than 5-fold more frequent than vice versa. Gene transfers identified at major evolutionary transitions among prokaryotes specifically implicate gene acquisitions for metabolic functions from bacteria as key innovations in the origin of higher archaeal taxa.
Collapse
|
56
|
Inferring phylogenies of evolving sequences without multiple sequence alignment. Sci Rep 2014; 4:6504. [PMID: 25266120 PMCID: PMC4179140 DOI: 10.1038/srep06504] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Accepted: 09/10/2014] [Indexed: 12/25/2022] Open
Abstract
Alignment-free methods, in which shared properties of sub-sequences (e.g. identity or match length) are extracted and used to compute a distance matrix, have recently been explored for phylogenetic inference. However, the scalability and robustness of these methods to key evolutionary processes remain to be investigated. Here, using simulated sequence sets of various sizes in both nucleotides and amino acids, we systematically assess the accuracy of phylogenetic inference using an alignment-free approach, based on D2 statistics, under different evolutionary scenarios. We find that compared to a multiple sequence alignment approach, D2 methods are more robust against among-site rate heterogeneity, compositional biases, genetic rearrangements and insertions/deletions, but are more sensitive to recent sequence divergence and sequence truncation. Across diverse empirical datasets, the alignment-free methods perform well for sequences sharing low divergence, at greater computation speed. Our findings provide strong evidence for the scalability and the potential use of alignment-free methods in large-scale phylogenomics.
Collapse
|
57
|
Puigbò P, Lobkovsky AE, Kristensen DM, Wolf YI, Koonin EV. Genomes in turmoil: quantification of genome dynamics in prokaryote supergenomes. BMC Biol 2014; 12:66. [PMID: 25141959 PMCID: PMC4166000 DOI: 10.1186/s12915-014-0066-4] [Citation(s) in RCA: 125] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 07/31/2014] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Genomes of bacteria and archaea (collectively, prokaryotes) appear to exist in incessant flux, expanding via horizontal gene transfer and gene duplication, and contracting via gene loss. However, the actual rates of genome dynamics and relative contributions of different types of event across the diversity of prokaryotes are largely unknown, as are the sizes of microbial supergenomes, i.e. pools of genes that are accessible to the given microbial species. RESULTS We performed a comprehensive analysis of the genome dynamics in 35 groups (34 bacterial and one archaeal) of closely related microbial genomes using a phylogenetic birth-and-death maximum likelihood model to quantify the rates of gene family gain and loss, as well as expansion and reduction. The results show that loss of gene families dominates the evolution of prokaryotes, occurring at approximately three times the rate of gain. The rates of gene family expansion and reduction are typically seven and twenty times less than the gain and loss rates, respectively. Thus, the prevailing mode of evolution in bacteria and archaea is genome contraction, which is partially compensated by the gain of new gene families via horizontal gene transfer. However, the rates of gene family gain, loss, expansion and reduction vary within wide ranges, with the most stable genomes showing rates about 25 times lower than the most dynamic genomes. For many groups, the supergenome estimated from the fraction of repetitive gene family gains includes about tenfold more gene families than the typical genome in the group although some groups appear to have vast, 'open' supergenomes. CONCLUSIONS Reconstruction of evolution for groups of closely related bacteria and archaea reveals an extremely rapid and highly variable flux of genes in evolving microbial genomes, demonstrates that extensive gene loss and horizontal gene transfer leading to innovation are the two dominant evolutionary processes, and yields robust estimates of the supergenome size.
Collapse
|
58
|
Sojo V, Pomiankowski A, Lane N. A bioenergetic basis for membrane divergence in archaea and bacteria. PLoS Biol 2014; 12:e1001926. [PMID: 25116890 PMCID: PMC4130499 DOI: 10.1371/journal.pbio.1001926] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2014] [Accepted: 07/02/2014] [Indexed: 01/03/2023] Open
Abstract
Membrane bioenergetics are universal, yet the phospholipid membranes of archaea and bacteria-the deepest branches in the tree of life-are fundamentally different. This deep divergence in membrane chemistry is reflected in other stark differences between the two domains, including ion pumping and DNA replication. We resolve this paradox by considering the energy requirements of the last universal common ancestor (LUCA). We develop a mathematical model based on the premise that LUCA depended on natural proton gradients. Our analysis shows that such gradients can power carbon and energy metabolism, but only in leaky cells with a proton permeability equivalent to fatty acid vesicles. Membranes with lower permeability (equivalent to modern phospholipids) collapse free-energy availability, precluding exploitation of natural gradients. Pumping protons across leaky membranes offers no advantage, even when permeability is decreased 1,000-fold. We hypothesize that a sodium-proton antiporter (SPAP) provided the first step towards modern membranes. SPAP increases the free energy available from natural proton gradients by ∼60%, enabling survival in 50-fold lower gradients, thereby facilitating ecological spread and divergence. Critically, SPAP also provides a steadily amplifying advantage to proton pumping as membrane permeability falls, for the first time favoring the evolution of ion-tight phospholipid membranes. The phospholipids of archaea and bacteria incorporate different stereoisomers of glycerol phosphate. We conclude that the enzymes involved took these alternatives by chance in independent populations that had already evolved distinct ion pumps. Our model offers a quantitatively robust explanation for why membrane bioenergetics are universal, yet ion pumps and phospholipid membranes arose later and independently in separate populations. Our findings elucidate the paradox that archaea and bacteria share DNA transcription, ribosomal translation, and ATP synthase, yet differ in equally fundamental traits that depend on the membrane, including DNA replication.
Collapse
Affiliation(s)
- Víctor Sojo
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
- CoMPLEX, University College London, London, United Kingdom
| | - Andrew Pomiankowski
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
- CoMPLEX, University College London, London, United Kingdom
| | - Nick Lane
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
- CoMPLEX, University College London, London, United Kingdom
| |
Collapse
|
59
|
Ramulu HG, Groussin M, Talla E, Planel R, Daubin V, Brochier-Armanet C. Ribosomal proteins: toward a next generation standard for prokaryotic systematics? Mol Phylogenet Evol 2014; 75:103-17. [PMID: 24583288 DOI: 10.1016/j.ympev.2014.02.013] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Revised: 01/23/2014] [Accepted: 02/17/2014] [Indexed: 10/25/2022]
Abstract
The seminal work of Carl Woese and co-workers has contributed to promote the RNA component of the small subunit of the ribosome (SSU rRNA) as a "gold standard" of modern prokaryotic taxonomy and systematics, and an essential tool to explore microbial diversity. Yet, this marker has a limited resolving power, especially at deep phylogenetic depth and can lead to strongly biased trees. The ever-larger number of available complete genomes now calls for a novel standard dataset of robust protein markers that may complement SSU rRNA. In this respect, concatenation of ribosomal proteins (r-proteins) is being growingly used to reconstruct large-scale prokaryotic phylogenies, but their suitability for systematic and/or taxonomic purposes has not been specifically addressed. Using Proteobacteria as a case study, we show that amino acid and nucleic acid r-protein sequences contain a reliable phylogenetic signal at a wide range of taxonomic depths, which has not been totally blurred by mutational saturation or horizontal gene transfer. The use of accurate evolutionary models and reconstruction methods allows overcoming most tree reconstruction artefacts resulting from compositional biases and/or fast evolutionary rates. The inferred phylogenies allow clarifying the relationships among most proteobacterial orders and families, along with the position of several unclassified lineages, suggesting some possible revisions of the current classification. In addition, we investigate the root of the Proteobacteria by considering the time-variation of nucleic acid composition of r-protein sequences and the information carried by horizontal gene transfers, two approaches that do not require the use of an outgroup and limit tree reconstruction artefacts. Altogether, our analyses indicate that r-proteins may represent a promising standard for prokaryotic taxonomy and systematics.
Collapse
Affiliation(s)
- Hemalatha Golaconda Ramulu
- Aix-Marseille Université, CNRS, UMR 7283, Laboratoire de Chimie Bactérienne, IMM, 31 chemin Joseph Aiguier, F-13402 Marseille, France
| | - Mathieu Groussin
- Université de Lyon, Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, 43 boulevard du 11 novembre 1918, F-69622 Villeurbanne, France
| | - Emmanuel Talla
- Aix-Marseille Université, CNRS, UMR 7283, Laboratoire de Chimie Bactérienne, IMM, 31 chemin Joseph Aiguier, F-13402 Marseille, France
| | - Remi Planel
- Aix-Marseille Université, CNRS, UMR 7283, Laboratoire de Chimie Bactérienne, IMM, 31 chemin Joseph Aiguier, F-13402 Marseille, France
| | - Vincent Daubin
- Université de Lyon, Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, 43 boulevard du 11 novembre 1918, F-69622 Villeurbanne, France
| | - Céline Brochier-Armanet
- Université de Lyon, Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, 43 boulevard du 11 novembre 1918, F-69622 Villeurbanne, France.
| |
Collapse
|
60
|
List JM, Nelson-Sathi S, Geisler H, Martin W. Networks of lexical borrowing and lateral gene transfer in language and genome evolution. Bioessays 2014; 36:141-50. [PMID: 24375688 PMCID: PMC3910147 DOI: 10.1002/bies.201300096] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Like biological species, languages change over time. As noted by Darwin, there are many parallels between language evolution and biological evolution. Insights into these parallels have also undergone change in the past 150 years. Just like genes, words change over time, and language evolution can be likened to genome evolution accordingly, but what kind of evolution? There are fundamental differences between eukaryotic and prokaryotic evolution. In the former, natural variation entails the gradual accumulation of minor mutations in alleles. In the latter, lateral gene transfer is an integral mechanism of natural variation. The study of language evolution using biological methods has attracted much interest of late, most approaches focusing on language tree construction. These approaches may underestimate the important role that borrowing plays in language evolution. Network approaches that were originally designed to study lateral gene transfer may provide more realistic insights into the complexities of language evolution.
Collapse
Affiliation(s)
- Johann-Mattis List
- Research Center Deutscher Sprachatlas, Philipps-University MarburgMarburg, Germany
| | - Shijulal Nelson-Sathi
- Institute of Molecular Evolution, Heinrich-Heine University DüsseldorfDüsseldorf, Germany
| | - Hans Geisler
- Institute of Romance Languages and Literature, Heinrich-Heine University DüsseldorfDüsseldorf, Germany
| | - William Martin
- Institute of Molecular Evolution, Heinrich-Heine University DüsseldorfDüsseldorf, Germany
| |
Collapse
|
61
|
Abstract
In a series of conceptual articles published around the millennium, Carl Woese emphasized that evolution of cells is the central problem of evolutionary biology, that the three-domain ribosomal tree of life is an essential framework for reconstructing cellular evolution, and that the evolutionary dynamics of functionally distinct cellular systems are fundamentally different, with the information processing systems “crystallizing” earlier than operational systems. The advances of evolutionary genomics over the last decade vindicate major aspects of Woese’s vision. Despite the observations of pervasive horizontal gene transfer among bacteria and archaea, the ribosomal tree of life comes across as a central statistical trend in the “forest” of phylogenetic trees of individual genes, and hence, an appropriate scaffold for evolutionary reconstruction. The evolutionary stability of information processing systems, primarily translation, becomes ever more striking with the accumulation of comparative genomic data indicating that nearly allof the few universal genes encode translation system components. Woese’s view on the fundamental distinctions between the three domains of cellular life also withstand the test of comparative genomics, although his non-acceptance of symbiogenetic scenarios for the origin of eukaryotes might not. Above all, Woese’s key prediction that understanding evolution of microbes will be the core of the new evolutionary biology appears to be materializing.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institute of Health, Bethesda, MD 20894
| |
Collapse
|
62
|
Bolshoy A, Salih B, Cohen I, Tatarinova T. Ranking of Prokaryotic Genomes Based on Maximization of Sortedness of Gene Lengths. ACTA ACUST UNITED AC 2014; 5. [PMID: 26146586 DOI: 10.4172/2153-0602.1000151] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
How variations of gene lengths (some genes become longer than their predecessors, while other genes become shorter and the sizes of these factions are randomly different from organism to organism) depend on organismal evolution and adaptation is still an open question. We propose to rank the genomes according to lengths of their genes, and then find association between the genome rank and variousproperties, such as growth temperature, nucleotide composition, and pathogenicity. This approach reveals evolutionary driving factors. The main purpose of this study is to test effectiveness and robustness of several ranking methods. The selected method of evaluation is measuring of overall sortedness of the data. We have demonstrated that all considered methods give consistent results and Bubble Sort and Simulated Annealing achieve the highest sortedness. Also, Bubble Sort is considerably faster than the Simulated Annealing method.
Collapse
Affiliation(s)
- A Bolshoy
- Department of Evolutionary and Environmental Biology and Institute of Evolution, University of Haifa, Israel
| | - B Salih
- Department of Evolutionary and Environmental Biology and Institute of Evolution, University of Haifa, Israel ; Department of Computer Science, University of Haifa, Israel, Israel
| | - I Cohen
- Department of Evolutionary and Environmental Biology and Institute of Evolution, University of Haifa, Israel ; The Tauber Bioinformatics Research Center at University of Haifa
| | - T Tatarinova
- Children's Hospital Los Angeles, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
63
|
Gupta RS. Identification of Conserved Indels that are Useful for Classification and Evolutionary Studies. J Microbiol Methods 2014. [DOI: 10.1016/bs.mim.2014.05.003] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
64
|
Gilbert C, Cordaux R. Horizontal transfer and evolution of prokaryote transposable elements in eukaryotes. Genome Biol Evol 2013; 5:822-32. [PMID: 23563966 PMCID: PMC3673617 DOI: 10.1093/gbe/evt057] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Horizontal transfer (HT) of transposable elements (TEs) plays a key role in prokaryotic evolution, and mounting evidence suggests that it has also had an important impact on eukaryotic evolution. Although many prokaryote-to-prokaryote and eukaryote-to-eukaryote HTs of TEs have been characterized, only few cases have been reported between prokaryotes and eukaryotes. Here, we carried out a comprehensive search for all major groups of prokaryotic insertion sequences (ISs) in 430 eukaryote genomes. We uncovered a total of 80 sequences, all deriving from the IS607 family, integrated in the genomes of 14 eukaryote species belonging to four distinct phyla (Amoebozoa, Ascomycetes, Basidiomycetes, and Stramenopiles). Given that eukaryote IS607-like sequences are most closely related to cyanobacterial IS607 and that their phylogeny is incongruent with that of their hosts, we conclude that the presence of IS607-like sequences in eukaryotic genomes is the result of several HT events. Selection analyses further suggest that our ability to detect these prokaryote TEs today in eukaryotes is because HT of these sequences occurred recently and/or some IS607 elements were domesticated after HT, giving rise to new eukaryote genes. Supporting the recent age of some of these HTs, we uncovered intact full-length, potentially active IS607 copies in the amoeba Acanthamoeba castellani. Overall, our study shows that prokaryote-to-eukaryote HT of TEs occurred at relatively low frequency during recent eukaryote evolution and it sets IS607 as the most widespread TE (being present in prokaryotes, eukaryotes, and viruses).
Collapse
Affiliation(s)
- Clément Gilbert
- Université de Poitiers, UMR CNRS 7267 Ecologie et Biologie des Interactions, Equipe Ecologie Evolution Symbiose, Poitiers, France.
| | | |
Collapse
|
65
|
Shifman A, Ninyo N, Gophna U, Snir S. Phylo SI: a new genome-wide approach for prokaryotic phylogeny. Nucleic Acids Res 2013; 42:2391-404. [PMID: 24243847 PMCID: PMC3936750 DOI: 10.1093/nar/gkt1138] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The evolutionary history of all life forms is usually represented as a vertical tree-like process. In prokaryotes, however, the vertical signal is partly obscured by the massive influence of horizontal gene transfer (HGT). The HGT creates widespread discordance between evolutionary histories of different genes as genomes become mosaics of gene histories. Thus, the Tree of Life (TOL) has been questioned as an appropriate representation of the evolution of prokaryotes. Nevertheless a common hypothesis is that prokaryotic evolution is primarily tree-like, and a routine effort is made to place new isolates in their appropriate location in the TOL. Moreover, it appears desirable to exploit non–tree-like evolutionary processes for the task of microbial classification. In this work, we present a novel technique that builds on the straightforward observation that gene order conservation (‘synteny’) decreases in time as a result of gene mobility. This is particularly true in prokaryotes, mainly due to HGT. Using a ‘synteny index’ (SI) that measures the average synteny between a pair of genomes, we developed the phylogenetic reconstruction tool ‘Phylo SI’. Phylo SI offers several attractive properties such as easy bootstrapping, high sensitivity in cases where phylogenetic signal is weak and computational efficiency. Phylo SI was tested both on simulated data and on two bacterial data sets and compared with two well-established phylogenetic methods. Phylo SI is particularly efficient on short evolutionary distances where synteny footprints remain detectable, whereas the nucleotide substitution signal is too weak for reliable sequence-based phylogenetic reconstruction. The method is publicly available at http://research.haifa.ac.il/ssagi/software/PhyloSI.zip.
Collapse
Affiliation(s)
- Anton Shifman
- Department of Evolutionary & Environmental Biology, University of Haifa, Haifa 31905 Israel, Department of Molecular Microbiology and Biotechnology Tel Aviv University, Tel Aviv 69978, Israel and National Evolutionary Synthesis Center, 2024 W. Main Street A200, Durham, NC 27705, USA
| | | | | | | |
Collapse
|
66
|
Skippington E, Ragan MA. Phylogeny rather than ecology or lifestyle biases the construction of Escherichia coli-Shigella genetic exchange communities. Open Biol 2013; 2:120112. [PMID: 23091700 PMCID: PMC3472396 DOI: 10.1098/rsob.120112] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Accepted: 08/20/2012] [Indexed: 11/12/2022] Open
Abstract
Genetic material can be transmitted not only vertically from parent to offspring, but also laterally (horizontally) from one bacterial lineage to another. Lateral genetic transfer is non-uniform; biases in its nature or frequency construct communities of genetic exchange. These biases have been proposed to arise from phylogenetic relatedness, shared ecology and/or common lifestyle. Here, we test these hypotheses using a graph-based abstraction of inferred genetic-exchange relationships among 27 Escherichia coli and Shigella genomes. We show that although barriers to inter-phylogenetic group lateral transfer are low, E. coli and Shigella are more likely to have exchanged genetic material with close relatives. We find little evidence of bias arising from shared environment or lifestyle. More than one-third of donor-recipient pairs in our analysis show some level of fragmentary gene transfer. Thus, within the E. coli-Shigella clade, intact genes and gene fragments have been disseminated non-uniformly and at appreciable frequency, constructing communities that transgress environmental and lifestyle boundaries.
Collapse
Affiliation(s)
- Elizabeth Skippington
- Institute for Molecular Bioscience and Australian Research Council Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, Queensland 4072, Australia
| | | |
Collapse
|
67
|
Roch S, Snir S. Recovering the treelike trend of evolution despite extensive lateral genetic transfer: a probabilistic analysis. J Comput Biol 2013; 20:93-112. [PMID: 23383996 DOI: 10.1089/cmb.2012.0234] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Lateral gene transfer (LGT) is a common mechanism of nonvertical evolution, during which genetic material is transferred between two more or less distantly related organisms. It is particularly common in bacteria where it contributes to adaptive evolution with important medical implications. In evolutionary studies, LGT has been shown to create widespread discordance between gene trees as genomes become mosaics of gene histories. In particular, the Tree of Life has been questioned as an appropriate representation of bacterial evolutionary history. Nevertheless a common hypothesis is that prokaryotic evolution is primarily treelike, but that the underlying trend is obscured by LGT. Extensive empirical work has sought to extract a common treelike signal from conflicting gene trees. Here we give a probabilistic perspective on the problem of recovering the treelike trend despite LGT. Under a model of randomly distributed LGT, we show that the species phylogeny can be reconstructed even in the presence of surprisingly many (almost linear number of) LGT events per gene tree. Our results, which are optimal up to logarithmic factors, are based on the analysis of a robust, computationally efficient reconstruction method and provides insight into the design of such methods. Finally, we show that our results have implications for the discovery of highways of gene sharing.
Collapse
Affiliation(s)
- Sebastien Roch
- Department of Mathematics and Bioinformatics Program, University of California at Los Angeles, Los Angeles, CA, USA.
| | | |
Collapse
|
68
|
EGN: a wizard for construction of gene and genome similarity networks. BMC Evol Biol 2013; 13:146. [PMID: 23841456 PMCID: PMC3727994 DOI: 10.1186/1471-2148-13-146] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 07/05/2013] [Indexed: 01/11/2023] Open
Abstract
Background Increasingly, similarity networks are being used for evolutionary analyses of molecular datasets. These networks are very useful, in particular for the analysis of gene sharing, lateral gene transfer and for the detection of distant homologs. Currently, such analyses require some computer programming skills due to the limited availability of user-friendly freely distributed software. Consequently, although appealing, the construction and analyses of these networks remain less familiar to biologists than do phylogenetic approaches. Results In order to ease the use of similarity networks in the community of evolutionary biologists, we introduce a software program, EGN, that runs under Linux or MacOSX. EGN automates the reconstruction of gene and genome networks from nucleic and proteic sequences. EGN also implements statistics describing genetic diversity in these samples, for various user-defined thresholds of similarities. In the interest of studying the complexity of evolutionary processes affecting microbial evolution, we applied EGN to a dataset of 571,044 proteic sequences from the three domains of life and from mobile elements. We observed that, in Borrelia, plasmids play a different role than in most other eubacteria. Rather than being genetic couriers involved in lateral gene transfer, Borrelia’s plasmids and their genes act as private genetic goods, that contribute to the creation of genetic diversity within their parasitic hosts. Conclusion EGN can be used for constructing, analyzing, and mining molecular datasets in evolutionary studies. The program can help increase our knowledge of the processes through which genes from distinct sources and/or from multiple genomes co-evolve in lineages of cellular organisms.
Collapse
|
69
|
Coelho MA, Gonçalves C, Sampaio JP, Gonçalves P. Extensive intra-kingdom horizontal gene transfer converging on a fungal fructose transporter gene. PLoS Genet 2013; 9:e1003587. [PMID: 23818872 PMCID: PMC3688497 DOI: 10.1371/journal.pgen.1003587] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2012] [Accepted: 05/08/2013] [Indexed: 11/19/2022] Open
Abstract
Comparative genomics revealed in the last decade a scenario of rampant horizontal gene transfer (HGT) among prokaryotes, but for fungi a clearly dominant pattern of vertical inheritance still stands, punctuated however by an increasing number of exceptions. In the present work, we studied the phylogenetic distribution and pattern of inheritance of a fungal gene encoding a fructose transporter (FSY1) with unique substrate selectivity. 109 FSY1 homologues were identified in two sub-phyla of the Ascomycota, in a survey that included 241 available fungal genomes. At least 10 independent inter-species instances of horizontal gene transfer (HGT) involving FSY1 were identified, supported by strong phylogenetic evidence and synteny analyses. The acquisition of FSY1 through HGT was sometimes suggestive of xenolog gene displacement, but several cases of pseudoparalogy were also uncovered. Moreover, evidence was found for successive HGT events, possibly including those responsible for transmission of the gene among yeast lineages. These occurrences do not seem to be driven by functional diversification of the Fsy1 proteins because Fsy1 homologues from widely distant lineages, including at least one acquired by HGT, appear to have similar biochemical properties. In summary, retracing the evolutionary path of the FSY1 gene brought to light an unparalleled number of independent HGT events involving a single fungal gene. We propose that the turbulent evolutionary history of the gene may be linked to the unique biochemical properties of the encoded transporter, whose predictable effect on fitness may be highly variable. In general, our results support the most recent views suggesting that inter-species HGT may have contributed much more substantially to shape fungal genomes than heretofore assumed.
Collapse
Affiliation(s)
- Marco A. Coelho
- Centro de Recursos Microbiológicos, Departamento de Ciências da Vida, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal
| | - Carla Gonçalves
- Centro de Recursos Microbiológicos, Departamento de Ciências da Vida, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal
| | - José Paulo Sampaio
- Centro de Recursos Microbiológicos, Departamento de Ciências da Vida, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal
| | - Paula Gonçalves
- Centro de Recursos Microbiológicos, Departamento de Ciências da Vida, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal
| |
Collapse
|
70
|
van Wolferen M, Ajon M, Driessen AJM, Albers SV. How hyperthermophiles adapt to change their lives: DNA exchange in extreme conditions. Extremophiles 2013; 17:545-63. [PMID: 23712907 DOI: 10.1007/s00792-013-0552-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Accepted: 05/12/2013] [Indexed: 01/24/2023]
Abstract
Transfer of DNA has been shown to be involved in genome evolution. In particular with respect to the adaptation of bacterial species to high temperatures, DNA transfer between the domains of bacteria and archaea seems to have played a major role. In addition, DNA exchange between similar species likely plays a role in repair of DNA via homologous recombination, a process that is crucial under DNA damaging conditions such as high temperatures. Several mechanisms for the transfer of DNA have been described in prokaryotes, emphasizing its general importance. However, until recently, not much was known about this process in prokaryotes growing in highly thermophilic environments. This review describes the different mechanisms of DNA transfer in hyperthermophiles, and how this may contribute to the survival and adaptation of hyperthermophilic archaea and bacteria to extreme environments.
Collapse
Affiliation(s)
- Marleen van Wolferen
- Molecular Biology of Archaea, Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Strasse 10, 35043 Marburg, Germany
| | | | | | | |
Collapse
|
71
|
Lasek-Nesselquist E, Gogarten JP. The effects of model choice and mitigating bias on the ribosomal tree of life. Mol Phylogenet Evol 2013; 69:17-38. [PMID: 23707703 DOI: 10.1016/j.ympev.2013.05.006] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2013] [Revised: 04/26/2013] [Accepted: 05/08/2013] [Indexed: 01/03/2023]
Abstract
Deep-level relationships within Bacteria, Archaea, and Eukarya as well as the relationships of these three domains to each other require resolution. The ribosomal machinery, universal to all cellular life, represents a protein repertoire resistant to horizontal gene transfer, which provides a largely congruent signal necessary for reconstructing a tree suitable as a backbone for life's reticulate history. Here, we generate a ribosomal tree of life from a robust taxonomic sampling of Bacteria, Archaea, and Eukarya to elucidate deep-level intra-domain and inter-domain relationships. Lack of phylogenetic information and systematic errors caused by inadequate models (that cannot account for substitution rate or compositional heterogeneities) or improper model selection compound conflicting phylogenetic signals from HGT and/or paralogy. Thus, we tested several models of varying sophistication on three different datasets, performed removal of fast-evolving or long-branched Archaea and Eukarya, and employed three different strategies to remove compositional heterogeneity to examine their effects on the topological outcome. Our results support a two-domain topology for the tree of life, where Eukarya emerges from within Archaea as sister to a Korarchaeota/Thaumarchaeota (KT) or Crenarchaeota/KT clade for all models under all or at least one of the strategies employed. Taxonomic manipulation allows single-matrix and certain mixture models to vacillate between two-domain and three-domain phylogenies. We find that models vary in their ability to resolve different areas of the tree of life, which does not necessarily correlate with model complexity. For example, both single-matrix and some mixture models recover monophyletic Crenarchaeota and Euryarchaeota archaeal phyla. In contrast, the most sophisticated model recovers a paraphyletic Euryarchaeota but detects two large clades that comprise the Bacteria, which were recovered separately but never together in the other models. Overall, models recovered consistent topologies despite dataset modifications due to the removal of compositional bias, which reflects either ineffective bias reduction or robust datasets that allow models to overcome reconstruction artifacts. We recommend a comparative approach for evolutionary models to identify model weaknesses as well as consensus relationships.
Collapse
|
72
|
Puigbò P, Wolf YI, Koonin EV. Seeing the Tree of Life behind the phylogenetic forest. BMC Biol 2013; 11:46. [PMID: 23587361 PMCID: PMC3626908 DOI: 10.1186/1741-7007-11-46] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Accepted: 04/12/2013] [Indexed: 02/08/2023] Open
Affiliation(s)
- Pere Puigbò
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
73
|
Bapteste E, Dupré J. Towards a processual microbial ontology. BIOLOGY & PHILOSOPHY 2013; 28:379-404. [PMID: 23487350 PMCID: PMC3591535 DOI: 10.1007/s10539-012-9350-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2012] [Accepted: 10/17/2012] [Indexed: 05/26/2023]
Abstract
Standard microbial evolutionary ontology is organized according to a nested hierarchy of entities at various levels of biological organization. It typically detects and defines these entities in relation to the most stable aspects of evolutionary processes, by identifying lineages evolving by a process of vertical inheritance from an ancestral entity. However, recent advances in microbiology indicate that such an ontology has important limitations. The various dynamics detected within microbiological systems reveal that a focus on the most stable entities (or features of entities) over time inevitably underestimates the extent and nature of microbial diversity. These dynamics are not the outcome of the process of vertical descent alone. Other processes, often involving causal interactions between entities from distinct levels of biological organisation, or operating at different time scales, are responsible not only for the destabilisation of pre-existing entities, but also for the emergence and stabilisation of novel entities in the microbial world. In this article we consider microbial entities as more or less stabilised functional wholes, and sketch a network-based ontology that can represent a diverse set of processes including, for example, as well as phylogenetic relations, interactions that stabilise or destabilise the interacting entities, spatial relations, ecological connections, and genetic exchanges. We use this pluralistic framework for evaluating (i) the existing ontological assumptions in evolution (e.g. whether currently recognized entities are adequate for understanding the causes of change and stabilisation in the microbial world), and (ii) for identifying hidden ontological kinds, essentially invisible from within a more limited perspective. We propose to recognize additional classes of entities that provide new insights into the structure of the microbial world, namely "processually equivalent" entities, "processually versatile" entities, and "stabilized" entities.
Collapse
Affiliation(s)
- Eric Bapteste
- />UMR CNRS 7138, Université Pierre et Marie Curie, 75005 Paris, France
| | - John Dupré
- />ESRC Centre for Genomics in Society (Egenis), University of Exeter, Exeter, UK
| |
Collapse
|
74
|
Abstract
Thanks to advances in next-generation technologies, genome sequences are now being generated at breadth (e.g. across environments) and depth (thousands of closely related strains, individuals or samples) unimaginable only a few years ago. Phylogenomics--the study of evolutionary relationships based on comparative analysis of genome-scale data--has so far been developed as industrial-scale molecular phylogenetics, proceeding in the two classical steps: multiple alignment of homologous sequences, followed by inference of a tree (or multiple trees). However, the algorithms typically employed for these steps scale poorly with number of sequences, such that for an increasing number of problems, high-quality phylogenomic analysis is (or soon will be) computationally infeasible. Moreover, next-generation data are often incomplete and error-prone, and analysis may be further complicated by genome rearrangement, gene fusion and deletion, lateral genetic transfer, and transcript variation. Here we argue that next-generation data require next-generation phylogenomics, including so-called alignment-free approaches.
Collapse
Affiliation(s)
- Cheong Xin Chan
- Institute for Molecular Bioscience, and ARC Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, QLD, 4072, Australia
| | | |
Collapse
|
75
|
Eveleigh RJ, Meehan CJ, Archibald JM, Beiko RG. Being Aquifex aeolicus: Untangling a hyperthermophile's checkered past. Genome Biol Evol 2013; 5:2478-97. [PMID: 24281050 PMCID: PMC3879981 DOI: 10.1093/gbe/evt195] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2013] [Indexed: 12/20/2022] Open
Abstract
Lateral gene transfer (LGT) is an important factor contributing to the evolution of prokaryotic genomes. The Aquificae are a hyperthermophilic bacterial group whose genes show affiliations to many other lineages, including the hyperthermophilic Thermotogae, the Proteobacteria, and the Archaea. Previous phylogenomic analyses focused on Aquifex aeolicus identified Thermotogae and Aquificae either as successive early branches or sisters in a rooted bacterial phylogeny, but many phylogenies and cellular traits have suggested a stronger affiliation with the Epsilonproteobacteria. Different scenarios for the evolution of the Aquificae yield different phylogenetic predictions. Here, we outline these scenarios and consider the fit of the available data, including three sequenced Aquificae genomes, to different sets of predictions. Evidence from phylogenetic profiles and trees suggests that the Epsilonproteobacteria have the strongest affinities with the three Aquificae analyzed. However, this pattern is shown by only a minority of encoded proteins, and the Archaea, many lineages of thermophilic bacteria, and members of genus Clostridium and class Deltaproteobacteria also show strong connections to the Aquificae. The phylogenetic affiliations of different functional subsystems showed strong biases: Most but not all genes implicated in the core translational apparatus tended to group Aquificae with Thermotogae, whereas a wide range of metabolic and cellular processes strongly supported the link between Aquificae and Epsilonproteobacteria. Depending on which sets of genes are privileged, either Thermotogae or Epsilonproteobacteria is the most plausible adjacent lineage to the Aquificae. Both scenarios require massive sharing of genes to explain the history of this enigmatic group, whose history is further complicated by specific affinities of different members of Aquificae to different partner lineages.
Collapse
Affiliation(s)
- Robert J.M. Eveleigh
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Conor J. Meehan
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| | - John M. Archibald
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Robert G. Beiko
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
76
|
Orthologous gene clusters and taxon signature genes for viruses of prokaryotes. J Bacteriol 2012; 195:941-50. [PMID: 23222723 DOI: 10.1128/jb.01801-12] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Viruses are the most abundant biological entities on earth and encompass a vast amount of genetic diversity. The recent rapid increase in the number of sequenced viral genomes has created unprecedented opportunities for gaining new insight into the structure and evolution of the virosphere. Here, we present an update of the phage orthologous groups (POGs), a collection of 4,542 clusters of orthologous genes from bacteriophages that now also includes viruses infecting archaea and encompasses more than 1,000 distinct virus genomes. Analysis of this expanded data set shows that the number of POGs keeps growing without saturation and that a substantial majority of the POGs remain specific to viruses, lacking homologues in prokaryotic cells, outside known proviruses. Thus, the great majority of virus genes apparently remains to be discovered. A complementary observation is that numerous viral genomes remain poorly, if at all, covered by POGs. The genome coverage by POGs is expected to increase as more genomes are sequenced. Taxon-specific, single-copy signature genes that are not observed in prokaryotic genomes outside detected proviruses were identified for two-thirds of the 57 taxa (those with genomes available from at least 3 distinct viruses), with half of these present in all members of the respective taxon. These signatures can be used to specifically identify the presence and quantify the abundance of viruses from particular taxa in metagenomic samples and thus gain new insights into the ecology and evolution of viruses in relation to their hosts.
Collapse
|
77
|
Snir S, Wolf YI, Koonin EV. Universal pacemaker of genome evolution. PLoS Comput Biol 2012; 8:e1002785. [PMID: 23209393 PMCID: PMC3510094 DOI: 10.1371/journal.pcbi.1002785] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 10/02/2012] [Indexed: 11/18/2022] Open
Abstract
A fundamental observation of comparative genomics is that the distribution of evolution rates across the complete sets of orthologous genes in pairs of related genomes remains virtually unchanged throughout the evolution of life, from bacteria to mammals. The most straightforward explanation for the conservation of this distribution appears to be that the relative evolution rates of all genes remain nearly constant, or in other words, that evolutionary rates of different genes are strongly correlated within each evolving genome. This correlation could be explained by a model that we denoted Universal PaceMaker (UPM) of genome evolution. The UPM model posits that the rate of evolution changes synchronously across genome-wide sets of genes in all evolving lineages. Alternatively, however, the correlation between the evolutionary rates of genes could be a simple consequence of molecular clock (MC). We sought to differentiate between the MC and UPM models by fitting thousands of phylogenetic trees for bacterial and archaeal genes to supertrees that reflect the dominant trend of vertical descent in the evolution of archaea and bacteria and that were constrained according to the two models. The goodness of fit for the UPM model was better than the fit for the MC model, with overwhelming statistical significance, although similarly to the MC, the UPM is strongly overdispersed. Thus, the results of this analysis reveal a universal, genome-wide pacemaker of evolution that could have been in operation throughout the history of life.
Collapse
Affiliation(s)
- Sagi Snir
- Department of Evolutionary and Environmental Biology and The Institute of Evolution, University of Haifa Mount Carmel, Haifa, Israel
| | | | | |
Collapse
|
78
|
Chan CX, Soares MB, Bonaldo MF, Wisecaver JH, Hackett JD, Anderson DM, Erdner DL, Bhattacharya D. ANALYSIS OF ALEXANDRIUM TAMARENSE (DINOPHYCEAE) GENES REVEALS THE COMPLEX EVOLUTIONARY HISTORY OF A MICROBIAL EUKARYOTE(). JOURNAL OF PHYCOLOGY 2012; 48:1130-1142. [PMID: 23066170 PMCID: PMC3466611 DOI: 10.1111/j.1529-8817.2012.01194.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Microbial eukaryotes may extinguish much of their nuclear phylogenetic history due to endosymbiotic/horizontal gene transfer (E/HGT). We studied E/HGT in 32,110 contigs of expressed sequence tags (ESTs) from the dinoflagellate Alexandrium tamarense (Dinophyceae) using a conservative phylogenomic approach. The vast majority of predicted proteins (86.4%) in this alga are novel or dinoflagellate-specific. We searched for putative homologs of these predicted proteins against a taxonomically broadly sampled protein database that includes all currently available data from algae and protists and reconstructed a phylogeny from each of the putative homologous protein sets. Of the 2,523 resulting phylogenies, 14-17% are potentially impacted by E/HGT involving both prokaryote and eukaryote lineages, with 2-4% showing clear evidence of reticulate evolution. The complex evolutionary histories of the remaining proteins, many of which may also have been affected by E/HGT, cannot be interpreted using our approach with currently available gene data. We present empirical evidence of reticulate genome evolution that combined with inadequate or highly complex phylogenetic signal in many proteins may impede genome-wide approaches to infer the tree of microbial eukaryotes.
Collapse
Affiliation(s)
- Cheong Xin Chan
- Department of Ecology, Evolution and Natural Resources, and Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ 08901, USA
| | - Marcelo B. Soares
- Northwestern University, Children's Memorial Research Center, Chicago, IL 60614, USA
| | - Maria F. Bonaldo
- Northwestern University, Children's Memorial Research Center, Chicago, IL 60614, USA
| | - Jennifer H. Wisecaver
- Department of Ecology and Evolutionary Biology, The University of Arizona, Tucson, AZ 85721, USA
| | - Jeremiah D. Hackett
- Department of Ecology and Evolutionary Biology, The University of Arizona, Tucson, AZ 85721, USA
| | | | - Deana L. Erdner
- Marine Science Institute, University of Texas, Port Aransas, TX 78373, USA
| | - Debashish Bhattacharya
- Department of Ecology, Evolution and Natural Resources, and Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ 08901, USA
| |
Collapse
|
79
|
Koonin EV, Wolf YI. Evolution of microbes and viruses: a paradigm shift in evolutionary biology? Front Cell Infect Microbiol 2012; 2:119. [PMID: 22993722 PMCID: PMC3440604 DOI: 10.3389/fcimb.2012.00119] [Citation(s) in RCA: 86] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2012] [Accepted: 08/27/2012] [Indexed: 01/21/2023] Open
Abstract
When Charles Darwin formulated the central principles of evolutionary biology in the Origin of Species in 1859 and the architects of the Modern Synthesis integrated these principles with population genetics almost a century later, the principal if not the sole objects of evolutionary biology were multicellular eukaryotes, primarily animals and plants. Before the advent of efficient gene sequencing, all attempts to extend evolutionary studies to bacteria have been futile. Sequencing of the rRNA genes in thousands of microbes allowed the construction of the three- domain “ribosomal Tree of Life” that was widely thought to have resolved the evolutionary relationships between the cellular life forms. However, subsequent massive sequencing of numerous, complete microbial genomes revealed novel evolutionary phenomena, the most fundamental of these being: (1) pervasive horizontal gene transfer (HGT), in large part mediated by viruses and plasmids, that shapes the genomes of archaea and bacteria and call for a radical revision (if not abandonment) of the Tree of Life concept, (2) Lamarckian-type inheritance that appears to be critical for antivirus defense and other forms of adaptation in prokaryotes, and (3) evolution of evolvability, i.e., dedicated mechanisms for evolution such as vehicles for HGT and stress-induced mutagenesis systems. In the non-cellular part of the microbial world, phylogenomics and metagenomics of viruses and related selfish genetic elements revealed enormous genetic and molecular diversity and extremely high abundance of viruses that come across as the dominant biological entities on earth. Furthermore, the perennial arms race between viruses and their hosts is one of the defining factors of evolution. Thus, microbial phylogenomics adds new dimensions to the fundamental picture of evolution even as the principle of descent with modification discovered by Darwin and the laws of population genetics remain at the core of evolutionary biology.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD, USA.
| | | |
Collapse
|
80
|
Merhej V, Raoult D. Rhizome of life, catastrophes, sequence exchanges, gene creations, and giant viruses: how microbial genomics challenges Darwin. Front Cell Infect Microbiol 2012; 2:113. [PMID: 22973559 PMCID: PMC3428605 DOI: 10.3389/fcimb.2012.00113] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Accepted: 08/06/2012] [Indexed: 11/29/2022] Open
Abstract
Darwin's theory about the evolution of species has been the object of considerable dispute. In this review, we have described seven key principles in Darwin's book The Origin of Species and tried to present how genomics challenge each of these concepts and improve our knowledge about evolution. Darwin believed that species evolution consists on a positive directional selection ensuring the “survival of the fittest.” The most developed state of the species is characterized by increasing complexity. Darwin proposed the theory of “descent with modification” according to which all species evolve from a single common ancestor through a gradual process of small modification of their vertical inheritance. Finally, the process of evolution can be depicted in the form of a tree. However, microbial genomics showed that evolution is better described as the “biological changes over time.” The mode of change is not unidirectional and does not necessarily favors advantageous mutations to increase fitness it is rather subject to random selection as a result of catastrophic stochastic processes. Complexity is not necessarily the completion of development: several complex organisms have gone extinct and many microbes including bacteria with intracellular lifestyle have streamlined highly effective genomes. Genomes evolve through large events of gene deletions, duplications, insertions, and genomes rearrangements rather than a gradual adaptative process. Genomes are dynamic and chimeric entities with gene repertoires that result from vertical and horizontal acquisitions as well as de novo gene creation. The chimeric character of microbial genomes excludes the possibility of finding a single common ancestor for all the genes recorded currently. Genomes are collections of genes with different evolutionary histories that cannot be represented by a single tree of life (TOL). A forest, a network or a rhizome of life may be more accurate to represent evolutionary relationships among species.
Collapse
Affiliation(s)
- Vicky Merhej
- URMITE, UM63, CNRS 7278, IRD 198, INSERM U1095, Aix Marseille Université Marseille, France
| | | |
Collapse
|
81
|
Forterre P. Darwin's goldmine is still open: variation and selection run the world. Front Cell Infect Microbiol 2012; 2:106. [PMID: 22919695 PMCID: PMC3417645 DOI: 10.3389/fcimb.2012.00106] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 07/21/2012] [Indexed: 01/02/2023] Open
Abstract
The scientific contribution of Darwin, still agonized in many religious circles, has now been recognized and celebrated by scientists from various disciplines. However, in recent years, several evolutionists have criticized Darwin as outdated, arguing that "Darwinism," assimilated to the "tree of life," cannot explain microbial evolution, or else was not operating in early life evolution. These critics either confuse "Darwinism" and old versions of "neo-Darwinism" or misunderstand the role of gene transfers in evolution. The core of Darwin explanation of evolution (variation/selection) remains necessary and sufficient to decipher the history of life. The enormous diversity of mechanisms underlying variations has been successfully interpreted by evolutionists in this framework and has considerably enriched the corpus of evolutionary biology without the necessity to kill the father. However, it remains for evolutionists to acknowledge interactions between cells and viruses (unknown for Darwin) as a major driving force in life evolution.
Collapse
Affiliation(s)
- Patrick Forterre
- Institut PasteurParis, France
- Institut de Génétique et Microbiologie, Université Paris-Sud, CNRS UMR8621Orsay Cedex, France
| |
Collapse
|
82
|
Bhandari V, Naushad HS, Gupta RS. Protein based molecular markers provide reliable means to understand prokaryotic phylogeny and support Darwinian mode of evolution. Front Cell Infect Microbiol 2012; 2:98. [PMID: 22919687 PMCID: PMC3417386 DOI: 10.3389/fcimb.2012.00098] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 06/27/2012] [Indexed: 11/20/2022] Open
Abstract
The analyses of genome sequences have led to the proposal that lateral gene transfers (LGTs) among prokaryotes are so widespread that they disguise the interrelationships among these organisms. This has led to questioning of whether the Darwinian model of evolution is applicable to prokaryotic organisms. In this review, we discuss the usefulness of taxon-specific molecular markers such as conserved signature indels (CSIs) and conserved signature proteins (CSPs) for understanding the evolutionary relationships among prokaryotes and to assess the influence of LGTs on prokaryotic evolution. The analyses of genomic sequences have identified large numbers of CSIs and CSPs that are unique properties of different groups of prokaryotes ranging from phylum to genus levels. The species distribution patterns of these molecular signatures strongly support a tree-like vertical inheritance of the genes containing these molecular signatures that is consistent with phylogenetic trees. Recent detailed studies in this regard on the Thermotogae and Archaea, which are reviewed here, have identified large numbers of CSIs and CSPs that are specific for the species from these two taxa and a number of their major clades. The genetic changes responsible for these CSIs (and CSPs) initially likely occurred in the common ancestors of these taxa and then vertically transferred to various descendants. Although some CSIs and CSPs in unrelated groups of prokaryotes were identified, their small numbers and random occurrence has no apparent influence on the consistent tree-like branching pattern emerging from other markers. These results provide evidence that although LGT is an important evolutionary force, it does not mask the tree-like branching pattern of prokaryotes or understanding of their evolutionary relationships. The identified CSIs and CSPs also provide novel and highly specific means for identification of different groups of microbes and for taxonomical and biochemical studies.
Collapse
Affiliation(s)
- Vaibhav Bhandari
- Department of Biochemistry and Biomedical Sciences, McMaster University Hamilton, ON, Canada
| | | | | |
Collapse
|
83
|
Lang AS, Zhaxybayeva O, Beatty JT. Gene transfer agents: phage-like elements of genetic exchange. Nat Rev Microbiol 2012; 10:472-82. [PMID: 22683880 DOI: 10.1038/nrmicro2802] [Citation(s) in RCA: 265] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Horizontal gene transfer is important in the evolution of bacterial and archaeal genomes. An interesting genetic exchange process is carried out by diverse phage-like gene transfer agents (GTAs) that are found in a wide range of prokaryotes. Although GTAs resemble phages, they lack the hallmark capabilities that define typical phages, and they package random pieces of the producing cell's genome. In this Review, we discuss the defining characteristics of the GTAs that have been identified to date, along with potential functions for these agents and the possible evolutionary forces that act on the genes involved in their production.
Collapse
Affiliation(s)
- Andrew S Lang
- Department of Biology, Memorial University, St. John's, Newfoundland and Labrador A1B 3X9, Canada.
| | | | | |
Collapse
|
84
|
Mao F, Williams D, Zhaxybayeva O, Poptsova M, Lapierre P, Gogarten JP, Xu Y. Quartet decomposition server: a platform for analyzing phylogenetic trees. BMC Bioinformatics 2012; 13:123. [PMID: 22676320 PMCID: PMC3447714 DOI: 10.1186/1471-2105-13-123] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2011] [Accepted: 06/07/2012] [Indexed: 11/11/2022] Open
Abstract
Background The frequent exchange of genetic material among prokaryotes means that extracting a majority or plurality phylogenetic signal from many gene families, and the identification of gene families that are in significant conflict with the plurality signal is a frequent task in comparative genomics, and especially in phylogenomic analyses. Decomposition of gene trees into embedded quartets (unrooted trees each with four taxa) is a convenient and statistically powerful technique to address this challenging problem. This approach was shown to be useful in several studies of completely sequenced microbial genomes. Results We present here a web server that takes a collection of gene phylogenies, decomposes them into quartets, generates a Quartet Spectrum, and draws a split network. Users are also provided with various data download options for further analyses. Each gene phylogeny is to be represented by an assessment of phylogenetic information content, such as sets of trees reconstructed from bootstrap replicates or sampled from a posterior distribution. The Quartet Decomposition server is accessible at http://quartets.uga.edu. Conclusions The Quartet Decomposition server presented here provides a convenient means to perform Quartet Decomposition analyses and will empower users to find statistically supported phylogenetic conflicts.
Collapse
Affiliation(s)
- Fenglou Mao
- Department of Biochemistry and Molecular Biology, University of Georgia, 120 Green St, Athens, GA 30622, USA
| | | | | | | | | | | | | |
Collapse
|
85
|
Braakman R, Smith E. The emergence and early evolution of biological carbon-fixation. PLoS Comput Biol 2012; 8:e1002455. [PMID: 22536150 PMCID: PMC3334880 DOI: 10.1371/journal.pcbi.1002455] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2011] [Accepted: 02/13/2012] [Indexed: 11/18/2022] Open
Abstract
The fixation of CO₂ into living matter sustains all life on Earth, and embeds the biosphere within geochemistry. The six known chemical pathways used by extant organisms for this function are recognized to have overlaps, but their evolution is incompletely understood. Here we reconstruct the complete early evolutionary history of biological carbon-fixation, relating all modern pathways to a single ancestral form. We find that innovations in carbon-fixation were the foundation for most major early divergences in the tree of life. These findings are based on a novel method that fully integrates metabolic and phylogenetic constraints. Comparing gene-profiles across the metabolic cores of deep-branching organisms and requiring that they are capable of synthesizing all their biomass components leads to the surprising conclusion that the most common form for deep-branching autotrophic carbon-fixation combines two disconnected sub-networks, each supplying carbon to distinct biomass components. One of these is a linear folate-based pathway of CO₂ reduction previously only recognized as a fixation route in the complete Wood-Ljungdahl pathway, but which more generally may exclude the final step of synthesizing acetyl-CoA. Using metabolic constraints we then reconstruct a "phylometabolic" tree with a high degree of parsimony that traces the evolution of complete carbon-fixation pathways, and has a clear structure down to the root. This tree requires few instances of lateral gene transfer or convergence, and instead suggests a simple evolutionary dynamic in which all divergences have primary environmental causes. Energy optimization and oxygen toxicity are the two strongest forces of selection. The root of this tree combines the reductive citric acid cycle and the Wood-Ljungdahl pathway into a single connected network. This linked network lacks the selective optimization of modern fixation pathways but its redundancy leads to a more robust topology, making it more plausible than any modern pathway as a primitive universal ancestral form.
Collapse
Affiliation(s)
- Rogier Braakman
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
| | | |
Collapse
|
86
|
Guzmán D, Balderrama-Subieta A, Cardona-Ortuño C, Guevara-Martínez M, Callisaya-Quispe N, Quillaguamán J. Evolutionary patterns of carbohydrate transport and metabolism in Halomonas boliviensis as derived from its genome sequence: influences on polyester production. AQUATIC BIOSYSTEMS 2012; 8:9. [PMID: 22510370 PMCID: PMC3384467 DOI: 10.1186/2046-9063-8-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2012] [Accepted: 04/17/2012] [Indexed: 05/31/2023]
Abstract
BACKGROUND Halomonas boliviensis is a halophilic bacterium that is included in the γ-Proteobacteria sub-group, and is able to assimilate different types of carbohydrates. H. boliviensis is also able to produce poly(3-hydroxybutyrate) (PHB) in high yields using glucose as the carbon precursor. Accumulation of PHB by microorganisms is induced by excess of intracellular NADH.The genome sequences and organization in microorganisms should be the result of evolution and adaptation influenced by mutation, gene duplication, horizontal gen transfer (HGT) and recombination. Furthermore, the nearly neutral theory of evolution sustains that genetic modification of DNA could be neutral or selected, albeit most mutations should be at the border between neutrality and selection, i.e. slightly deleterious base substitutions in DNA are followed by a slightly advantageous substitutions. RESULTS This article reports the genome sequence of H. boliviensis. The chromosome size of H. boliviensis was 4 119 979 bp, and contained 3 863 genes. A total of 160 genes of H. boliviensis were related to carbohydrate transport and metabolism, and were organized as: 70 genes for metabolism of carbohydrates; 47 genes for ABC transport systems and 43 genes for TRAP-type C4-dicarboxylate transport systems. Protein sequences of H. boliviensis related to carbohydrate transport and metabolism were selected from clusters of orthologous proteins (COGs). Similar proteins derived from the genome sequences of other 41 archaea and 59 bacteria were used as reference. We found that most of the 160 genes in H. boliviensis, c.a. 44%, were obtained from other bacteria by horizontal gene transfer, while 13% of the genes were acquired from haloarchaea and thermophilic archaea, only 34% of the genes evolved among Proteobacteria and the remaining genes encoded proteins that did not cluster with any of the proteins obtained from the reference strains. Furthermore, the diversity of the enzymes derived from these genes led to polymorphism in glycolysis and gluconeogenesis. We found further that an optimum ratio of glucose and sucrose in the culture medium of H. boliviensis favored cell growth and PHB production. CONCLUSIONS Results obtained in this article depict that most genetic modifications and enzyme polymorphism in the genome of H. boliviensis were mainly influenced by HGT rather than nearly neutral mutations. Molecular adaptation and evolution experienced by H. boliviensis were also a response to environmental conditions such as the type and amount of carbohydrates in its ecological niche. Consequently, the genome evolution of H. boliviensis showed to be strongly influenced by the type of microorganisms, genetic interaction among microbial species and its environment. Such trend should also be experienced by other prokaryotes. A system for PHB production by H. boliviensis that takes into account the evolutionary adaptation of this bacterium to the assimilation of combinations of carbohydrates suggests the feasibility of a bioprocess economically viable and environmentally friendly.
Collapse
Affiliation(s)
- Daniel Guzmán
- Centro de Biotecnología, Facultad de Ciencias y Tecnología, Universidad Mayor de San Simón, Cochabamba, Bolivia
- Department of Biotechnology, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden
| | - Andrea Balderrama-Subieta
- Centro de Biotecnología, Facultad de Ciencias y Tecnología, Universidad Mayor de San Simón, Cochabamba, Bolivia
| | - Carla Cardona-Ortuño
- Centro de Biotecnología, Facultad de Ciencias y Tecnología, Universidad Mayor de San Simón, Cochabamba, Bolivia
| | - Mónica Guevara-Martínez
- Centro de Biotecnología, Facultad de Ciencias y Tecnología, Universidad Mayor de San Simón, Cochabamba, Bolivia
| | - Nataly Callisaya-Quispe
- Centro de Biotecnología, Facultad de Ciencias y Tecnología, Universidad Mayor de San Simón, Cochabamba, Bolivia
| | - Jorge Quillaguamán
- Centro de Biotecnología, Facultad de Ciencias y Tecnología, Universidad Mayor de San Simón, Cochabamba, Bolivia
| |
Collapse
|
87
|
Ramulu HG, Raoult D, Pontarotti P. The rhizome of life: what about metazoa? Front Cell Infect Microbiol 2012; 2:50. [PMID: 22919641 PMCID: PMC3417402 DOI: 10.3389/fcimb.2012.00050] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2012] [Accepted: 03/23/2012] [Indexed: 02/03/2023] Open
Abstract
The increase in huge number of genomic sequences in recent years has contributed to various genetic events such as horizontal gene transfer (HGT), gene duplication and hybridization of species. Among them HGT has played an important role in the genome evolution and was believed to occur only in Bacterial and Archaeal genomes. As a result, genomes were found to be chimeric and the evolution of life was represented in different forms such as forests, networks and species evolution was described more like a rhizome, rather than a tree. However, in the last few years, HGT has also been evidenced in other group such as metazoa (for example in root-knot nematodes, bdelloid rotifers and mammals). In addition to HGT, other genetic events such as transfer by retrotransposons and hybridization between more closely related lineages are also well established. Therefore, in the light of such genetic events, whether the evolution of metazoa exists in the form of a tree, network or rhizome is highly questionable and needs to be determined. In the current review, we will focus on the role of HGT, retrotransposons and hybridization in the metazoan evolution.
Collapse
Affiliation(s)
- Hemalatha G. Ramulu
- LATP UMR-CNRS 7353, Evolution Biologique et Modélisation, Aix-Marseille UniversitéeMarseille, France
- URMITE CNRS-IRD UMR6236-198Marseille, France
| | | | - Pierre Pontarotti
- LATP UMR-CNRS 7353, Evolution Biologique et Modélisation, Aix-Marseille UniversitéeMarseille, France
| |
Collapse
|
88
|
Merhej V, Notredame C, Royer-Carenzi M, Pontarotti P, Raoult D. The rhizome of life: the sympatric Rickettsia felis paradigm demonstrates the random transfer of DNA sequences. Mol Biol Evol 2012; 28:3213-23. [PMID: 22024628 DOI: 10.1093/molbev/msr239] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The intracellular flea symbiont, Rickettsia felis, may meet other organisms intracellularly such as R. typhi. We used a single-gene phylogenetic approach of the 1375 R. felis genes to look for horizontal transfers that occurred as a result of the bacterial promiscuity with other organisms. Our results showed that besides genes that are linked to the Spotted Fever Group, 165 genes have a different history and are linked to other Rickettsia such as R. bellii (107 genes), R. typhi (15 genes), or to other bacteria such as Legionella sp. and Francisella sp. or to eukaryotes. Among these genes, we identified 73 individual genes and 34 spatial clusters containing 2-4 adjacent genes, a total of 79 genes, with evidence of en bloc transfer. We described 13 chimeric genes resulting from gene recombination with sympatric R. typhi. The transferred DNA sequences present different sizes and functions, suggesting that the horizontal transfer in R. felis is random and neutral within its specific host. Our study shows that the strict intracellular bacteria R. felis exhibits a mosaic genome. We therefore developed a new representation for the evolutionary history of R. felis showing its different putative ancestors in the form of a rhizome.
Collapse
Affiliation(s)
- Vicky Merhej
- Unité de Recherche en Maladies Infectieuses et Tropicales Emergentes, CNRS-IRD UMR6236-198, Université de la Méditerranée, Faculté de Médecine, Marseille, France
| | | | | | | | | |
Collapse
|
89
|
Rosenfeld JA, DeSalle R. E value cutoff and eukaryotic genome content phylogenetics. Mol Phylogenet Evol 2012; 63:342-50. [PMID: 22306824 DOI: 10.1016/j.ympev.2012.01.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2011] [Revised: 01/02/2012] [Accepted: 01/03/2012] [Indexed: 10/14/2022]
Abstract
Genome content analysis has been used as a source of phylogenetic information in large prokaryotic tree of life studies. Recently the sequencing of many eukaryotic genomes has allowed for the similar use of genome content analysis for these organisms too. In this communication we examine the utility of genome content analysis for recovering phylogenetic patterns in several eukaryotic groups. By constructing multiple matrices using different e value cutoffs we examine the dynamics of altering the e value cutoff on five eukaryotic genome data sets. Our analysis indicates that the e value cutoff that is used as a criterion in the construction of the genome content matrix is a critical factor in both the accuracy and information content of the analysis. Strikingly, genome content by itself is not a reliable or accurate source of characters for phylogenetic analysis of the taxa in the five data sets we analyzed. We discuss two problems--small genome attraction and genome duplications as being involved in the rather poor performance of genome content data in recovering eukaryotic phylogeny.
Collapse
Affiliation(s)
- Jeffrey A Rosenfeld
- IST/High Performance and Research Computing, University of Medicine and Dentistry of New Jersey, Newark, NJ 07103, United States.
| | | |
Collapse
|
90
|
Egel R. Primal eukaryogenesis: on the communal nature of precellular States, ancestral to modern life. Life (Basel) 2012; 2:170-212. [PMID: 25382122 PMCID: PMC4187143 DOI: 10.3390/life2010170] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Revised: 12/29/2011] [Accepted: 01/11/2012] [Indexed: 02/08/2023] Open
Abstract
This problem-oriented, exploratory and hypothesis-driven discourse toward the unknown combines several basic tenets: (i) a photo-active metal sulfide scenario of primal biogenesis in the porespace of shallow sedimentary flats, in contrast to hot deep-sea hydrothermal vent conditions; (ii) an inherently complex communal system at the common root of present life forms; (iii) a high degree of internal compartmentalization at this communal root, progressively resembling coenocytic (syncytial) super-cells; (iv) a direct connection from such communal super-cells to proto-eukaryotic macro-cell organization; and (v) multiple rounds of micro-cellular escape with streamlined reductive evolution-leading to the major prokaryotic cell lines, as well as to megaviruses and other viral lineages. Hopefully, such nontraditional concepts and approaches will contribute to coherent and plausible views about the origins and early life on Earth. In particular, the coevolutionary emergence from a communal system at the common root can most naturally explain the vast discrepancy in subcellular organization between modern eukaryotes on the one hand and both archaea and bacteria on the other.
Collapse
Affiliation(s)
- Richard Egel
- Department of Biology, University of Copenhagen Biocenter, Ole Maaløes Vej 5, DK-2200 Copenhagen, Denmark.
| |
Collapse
|
91
|
Puigbò P, Wolf YI, Koonin EV. Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life. Methods Mol Biol 2012; 856:53-79. [PMID: 22399455 PMCID: PMC3842619 DOI: 10.1007/978-1-61779-585-5_3] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."
Collapse
Affiliation(s)
- Pere Puigbò
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health. Bethesda, Maryland 20894. USA
| | - Yuri I. Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health. Bethesda, Maryland 20894. USA
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health. Bethesda, Maryland 20894. USA
| |
Collapse
|
92
|
Affiliation(s)
- Olga Zhaxybayeva
- Department of Biology, West Virginia University, PO Box 6057, Morgantown, WV 26506, USA
| | | |
Collapse
|
93
|
Andam CP, Gogarten JP. Biased gene transfer and its implications for the concept of lineage. Biol Direct 2011; 6:47. [PMID: 21943000 PMCID: PMC3191353 DOI: 10.1186/1745-6150-6-47] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2011] [Accepted: 09/23/2011] [Indexed: 11/10/2022] Open
Abstract
Background In the presence of horizontal gene transfer (HGT), the concepts of lineage and genealogy in the microbial world become more ambiguous because chimeric genomes trace their ancestry from a myriad of sources, both living and extinct. Results We present the evolutionary histories of three aminoacyl-tRNA synthetases (aaRS) to illustrate that the concept of organismal lineage in the prokaryotic world is defined by both vertical inheritance and reticulations due to HGT. The acquisition of a novel gene from a distantly related taxon can be considered as a shared derived character that demarcates a group of organisms, as in the case of the spirochaete Phenylalanyl-tRNA synthetase (PheRS). On the other hand, when organisms transfer genetic material with their close kin, the similarity and therefore relatedness observed among them is essentially shaped by gene transfer. Studying the distribution patterns of divergent genes with identical functions, referred to as homeoalleles, can reveal preferences for transfer partners. We describe the very ancient origin and the distribution of the archaeal homeoalleles for Threonyl-tRNA synthetases (ThrRS) and Seryl-tRNA synthetases (SerRS). Conclusions Patterns created through biased HGT can be undistinguishable from those created through shared organismal ancestry. A re-evaluation of the definition of lineage is necessary to reflect genetic relatedness due to both HGT and vertical inheritance. In most instances, HGT bias will maintain and strengthen similarity within groups. Only in cases where HGT bias is due to other factors, such as shared ecological niche, do patterns emerge from gene phylogenies that are in conflict with those reflecting shared organismal ancestry. Reviewers This article was reviewed by W. Ford Doolittle, François-Joseph Lapointe, and Frederic Bouchard.
Collapse
Affiliation(s)
- Cheryl P Andam
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA.
| | | |
Collapse
|
94
|
Williams D, Fournier GP, Lapierre P, Swithers KS, Green AG, Andam CP, Gogarten JP. A rooted net of life. Biol Direct 2011; 6:45. [PMID: 21936906 PMCID: PMC3189188 DOI: 10.1186/1745-6150-6-45] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 09/21/2011] [Indexed: 01/29/2023] Open
Abstract
Abstract Phylogenetic reconstruction using DNA and protein sequences has allowed the reconstruction of evolutionary histories encompassing all life. We present and discuss a means to incorporate much of this rich narrative into a single model that acknowledges the discrete evolutionary units that constitute the organism. Briefly, this Rooted Net of Life genome phylogeny is constructed around an initial, well resolved and rooted tree scaffold inferred from a supermatrix of combined ribosomal genes. Extant sampled ribosomes form the leaves of the tree scaffold. These leaves, but not necessarily the deeper parts of the scaffold, can be considered to represent a genome or pan-genome, and to be associated with members of other gene families within that sequenced (pan)genome. Unrooted phylogenies of gene families containing four or more members are reconstructed and superimposed over the scaffold. Initially, reticulations are formed where incongruities between topologies exist. Given sufficient evidence, edges may then be differentiated as those representing vertical lines of inheritance within lineages and those representing horizontal genetic transfers or endosymbioses between lineages. Reviewers W. Ford Doolittle, Eric Bapteste and Robert Beiko.
Collapse
Affiliation(s)
- David Williams
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA.
| | | | | | | | | | | | | |
Collapse
|
95
|
Puigbò P, Mekhedov S, Wolf YI, Koonin EV. A comprehensive census of horizontal gene transfers from prokaryotes to unikonts. Genome Biol 2011. [DOI: 10.1186/1465-6906-12-s1-p20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
96
|
Kristensen DM, Wolf YI, Mushegian AR, Koonin EV. Computational methods for Gene Orthology inference. Brief Bioinform 2011; 12:379-91. [PMID: 21690100 PMCID: PMC3178053 DOI: 10.1093/bib/bbr030] [Citation(s) in RCA: 162] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2011] [Revised: 05/05/2011] [Indexed: 12/14/2022] Open
Abstract
Accurate inference of orthologous genes is a pre-requisite for most comparative genomics studies, and is also important for functional annotation of new genomes. Identification of orthologous gene sets typically involves phylogenetic tree analysis, heuristic algorithms based on sequence conservation, synteny analysis, or some combination of these approaches. The most direct tree-based methods typically rely on the comparison of an individual gene tree with a species tree. Once the two trees are accurately constructed, orthologs are straightforwardly identified by the definition of orthology as those homologs that are related by speciation, rather than gene duplication, at their most recent point of origin. Although ideal for the purpose of orthology identification in principle, phylogenetic trees are computationally expensive to construct for large numbers of genes and genomes, and they often contain errors, especially at large evolutionary distances. Moreover, in many organisms, in particular prokaryotes and viruses, evolution does not appear to have followed a simple 'tree-like' mode, which makes conventional tree reconciliation inapplicable. Other, heuristic methods identify probable orthologs as the closest homologous pairs or groups of genes in a set of organisms. These approaches are faster and easier to automate than tree-based methods, with efficient implementations provided by graph-theoretical algorithms enabling comparisons of thousands of genomes. Comparisons of these two approaches show that, despite conceptual differences, they produce similar sets of orthologs, especially at short evolutionary distances. Synteny also can aid in identification of orthologs. Often, tree-based, sequence similarity- and synteny-based approaches can be combined into flexible hybrid methods.
Collapse
Affiliation(s)
- David M Kristensen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
97
|
Andam CP, Fournier GP, Gogarten JP. Multilevel populations and the evolution of antibiotic resistance through horizontal gene transfer. FEMS Microbiol Rev 2011; 35:756-67. [DOI: 10.1111/j.1574-6976.2011.00274.x] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
98
|
McInerney JO, Pisani D, Bapteste E, O'Connell MJ. The Public Goods Hypothesis for the evolution of life on Earth. Biol Direct 2011; 6:41. [PMID: 21861918 PMCID: PMC3179745 DOI: 10.1186/1745-6150-6-41] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2011] [Accepted: 08/23/2011] [Indexed: 02/01/2023] Open
Abstract
It is becoming increasingly difficult to reconcile the observed extent of horizontal gene transfers with the central metaphor of a great tree uniting all evolving entities on the planet. In this manuscript we describe the Public Goods Hypothesis and show that it is appropriate in order to describe biological evolution on the planet. According to this hypothesis, nucleotide sequences (genes, promoters, exons, etc.) are simply seen as goods, passed from organism to organism through both vertical and horizontal transfer. Public goods sequences are defined by having the properties of being largely non-excludable (no organism can be effectively prevented from accessing these sequences) and non-rival (while such a sequence is being used by one organism it is also available for use by another organism). The universal nature of genetic systems ensures that such non-excludable sequences exist and non-excludability explains why we see a myriad of genes in different combinations in sequenced genomes. There are three features of the public goods hypothesis. Firstly, segments of DNA are seen as public goods, available for all organisms to integrate into their genomes. Secondly, we expect the evolution of mechanisms for DNA sharing and of defense mechanisms against DNA intrusion in genomes. Thirdly, we expect that we do not see a global tree-like pattern. Instead, we expect local tree-like patterns to emerge from the combination of a commonage of genes and vertical inheritance of genomes by cell division. Indeed, while genes are theoretically public goods, in reality, some genes are excludable, particularly, though not only, when they have variant genetic codes or behave as coalition or club goods, available for all organisms of a coalition to integrate into their genomes, and non-rival within the club. We view the Tree of Life hypothesis as a regionalized instance of the Public Goods hypothesis, just like classical mechanics and euclidean geometry are seen as regionalized instances of quantum mechanics and Riemannian geometry respectively. We argue for this change using an axiomatic approach that shows that the Public Goods hypothesis is a better accommodation of the observed data than the Tree of Life hypothesis.
Collapse
Affiliation(s)
- James O McInerney
- Molecular Evolution and Bioinformatics Unit, Department of Biology, National University of Ireland Maynooth, County Kildare, Ireland.
| | | | | | | |
Collapse
|
99
|
Popa O, Dagan T. Trends and barriers to lateral gene transfer in prokaryotes. Curr Opin Microbiol 2011; 14:615-23. [PMID: 21856213 DOI: 10.1016/j.mib.2011.07.027] [Citation(s) in RCA: 159] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2011] [Revised: 07/19/2011] [Accepted: 07/25/2011] [Indexed: 11/19/2022]
Abstract
Gene acquisition by lateral gene transfer (LGT) is an important mechanism for natural variation among prokaryotes. Laboratory experiments show that protein-coding genes can be laterally transferred extremely fast among microbial cells, inherited to most of their descendants, and adapt to a new regulatory regime within a short time. Recent advance in the phylogenetic analysis of microbial genomes using networks approach reveals a substantial impact of LGT during microbial genome evolution. Phylogenomic networks of LGT among prokaryotes reconstructed from completely sequenced genomes uncover barriers to LGT in multiple levels. Here we discuss the kinds of barriers to gene acquisition in nature including physical barriers for gene transfer between cells, genomic barriers for the integration of acquired DNA, and functional barriers for the acquisition of new genes.
Collapse
Affiliation(s)
- Ovidiu Popa
- Institute of Molecular Evolution, Heinrich-Heine University of Düsseldorf, Universitätstr. 1 40225, Düsseldorf, Germany
| | | |
Collapse
|
100
|
Beauregard-Racine J, Bicep C, Schliep K, Lopez P, Lapointe FJ, Bapteste E. Of woods and webs: possible alternatives to the tree of life for studying genomic fluidity in E. coli. Biol Direct 2011; 6:39; discussion 39. [PMID: 21774799 PMCID: PMC3160433 DOI: 10.1186/1745-6150-6-39] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Accepted: 07/20/2011] [Indexed: 12/26/2022] Open
Abstract
Background We introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli. This case study illustrates how additional analyses could offer fast heuristic alternatives to standard tree of life (TOL) approaches. Results We use gene networks to identify genes with atypical modes of evolution, and genome networks to characterize the evolution of genetic partnerships between E. coli and mobile genetic elements. We develop a novel polychromatic quartet method to capture patterns of recombination within E. coli, to update the clanistic toolkit, and to search for the impact of lateral gene transfer and of pathogenicity on gene evolution in two large forests of trees bearing E. coli. We unravel high rates of lateral gene transfer involving E. coli (about 40% of the trees under study), and show that both core genes and shell genes of E. coli are affected by non-tree-like evolutionary processes. We show that pathogenic lifestyle impacted the structure of 30% of the gene trees, and that pathogenic strains are more likely to transfer genes with one another than with non-pathogenic strains. In addition, we propose five groups of genes as candidate mobile modules of pathogenicity. We also present strong evidence for recent lateral gene transfer between E. coli and mobile genetic elements. Conclusions Depending on which evolutionary questions biologists want to address (i.e. the identification of modules, genetic partnerships, recombination, lateral gene transfer, or genes with atypical evolutionary modes, etc.), forest-based and network-based methods are preferable to the reconstruction of a single tree, because they provide insights and produce hypotheses about the dynamics of genome evolution, rather than the relative branching order of species and lineages. Such a methodological pluralism - the use of woods and webs - is to be encouraged to analyse the evolutionary processes at play in microbial evolution. This manuscript was reviewed by: Ford Doolittle, Tal Pupko, Richard Burian, James McInerney, Didier Raoult, and Yan Boucher
Collapse
|