1
|
Russo CAM, Eyre-Walker A, Katz LA, Gaut BS. Forty Years of Inferential Methods in the Journals of the Society for Molecular Biology and Evolution. Mol Biol Evol 2024; 41:msad264. [PMID: 38197288 PMCID: PMC10763999 DOI: 10.1093/molbev/msad264] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 11/27/2023] [Indexed: 01/11/2024] Open
Abstract
We are launching a series to celebrate the 40th anniversary of the first issue of Molecular Biology and Evolution. In 2024, we will publish virtual issues containing selected papers published in the Society for Molecular Biology and Evolution journals, Molecular Biology and Evolution and Genome Biology and Evolution. Each virtual issue will be accompanied by a perspective that highlights the historic and contemporary contributions of our journals to a specific topic in molecular evolution. This perspective, the first in the series, presents an account of the broad array of methods that have been published in the Society for Molecular Biology and Evolution journals, including methods to infer phylogenies, to test hypotheses in a phylogenetic framework, and to infer population genetic processes. We also mention many of the software implementations that make methods tractable for empiricists. In short, the Society for Molecular Biology and Evolution community has much to celebrate after four decades of publishing high-quality science including numerous important inferential methods.
Collapse
Affiliation(s)
- Claudia A M Russo
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | | | - Laura A Katz
- Department of Biological Sciences, Smith College, Northampton, MA, USA
| | - Brandon S Gaut
- School of Biological Sciences, University of California, Irvine, CA, USA
| |
Collapse
|
2
|
Dang CC, Minh BQ, McShea H, Masel J, James JE, Vinh LS, Lanfear R. OUP accepted manuscript. Syst Biol 2022; 71:1110-1123. [PMID: 35139203 PMCID: PMC9366462 DOI: 10.1093/sysbio/syac007] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 01/30/2022] [Indexed: 11/12/2022] Open
Affiliation(s)
- Cuong Cao Dang
- Faculty of Information Technology, University of Engineering and Technology, Vietnam National University, 144 Xuan Thuy, Cau Giay, Hanoi 10000, Vietnam
| | - Bui Quang Minh
- Computational Phylogenomics Lab, School of Computing, Australian National University, Canberra, Australian Capital Territory 2601, Australia
| | - Hanon McShea
- Department of Earth System Science, School of Earth, Energy, and Environmental Sciences, Stanford University, Palo Alto, CA 94305, USA
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Jennifer Eleanor James
- Department of Ecology and Genetics, Plant Ecology and Evolution, Evolutionary Biology Center, Uppsala University, Uppsala, SE-752 36, Sweden
| | - Le Sy Vinh
- Correspondence to be sent to: Faculty of Information Technology, University of Engineering and Technology, Vietnam National University, Hanoi, 144 Xuan Thuy, Cau Giay, Hanoi 10000, Vietnam; E-mail: Cuong Cao Dang and Bui Quang Minh contributed equally to the work.
| | - Robert Lanfear
- Division of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
| |
Collapse
|
3
|
Naser-Khdour S, Minh BQ, Lanfear R. Assessing Confidence in Root Placement on Phylogenies: An Empirical Study Using Non-Reversible Models for Mammals. Syst Biol 2021; 71:959-972. [PMID: 34387349 PMCID: PMC9260635 DOI: 10.1093/sysbio/syab067] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 08/03/2021] [Accepted: 08/11/2021] [Indexed: 11/14/2022] Open
Abstract
Using time-reversible Markov models is a very common practice in phylogenetic analysis,
because although we expect many of their assumptions to be violated by empirical data,
they provide high computational efficiency. However, these models lack the ability to
infer the root placement of the estimated phylogeny. In order to compensate for the
inability of these models to root the tree, many researchers use external information such
as using outgroup taxa or additional assumptions such as molecular clocks. In this study,
we investigate the utility of nonreversible models to root empirical phylogenies and
introduce a new bootstrap measure, the rootstrap, which provides
information on the statistical support for any given root position. [Bootstrap;
nonreversible models; phylogenetic inference; root estimation.]
Collapse
Affiliation(s)
- Suha Naser-Khdour
- Department of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Bui Quang Minh
- Department of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia.,Research School of Computer Science, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Robert Lanfear
- Department of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
| |
Collapse
|
4
|
Caetano-Anollés G, Nasir A, Kim KM, Caetano-Anollés D. Rooting Phylogenies and the Tree of Life While Minimizing Ad Hoc and Auxiliary Assumptions. Evol Bioinform Online 2018; 14:1176934318805101. [PMID: 30364468 PMCID: PMC6196624 DOI: 10.1177/1176934318805101] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Accepted: 09/05/2018] [Indexed: 12/25/2022] Open
Abstract
Phylogenetic methods unearth evolutionary history when supported by three starting points of reason: (1) the continuity axiom begs the existence of a "model" of evolutionary change, (2) the singularity axiom defines the historical ground plan (phylogeny) in which biological entities (taxa) evolve, and (3) the memory axiom demands identification of biological attributes (characters) with historical information. Axiom consequences are interlinked, making the retrodiction enterprise an endeavor of reciprocal fulfillment. In particular, establishing direction of evolutionary change (character polarization) roots phylogenies and enables testing the existence of historical memory (homology). Unfortunately, rooting phylogenies, especially the "tree of life," generally follow narratives instead of integrating empirical and theoretical knowledge of retrodictive exploration. This stems mostly from a focus on molecular sequence analysis and uncertainties about rooting methods. Here, we review available rooting criteria, highlighting the need to minimize both ad hoc and auxiliary assumptions, especially argumentative ad hocness. We show that while the outgroup comparison method has been widely adopted, the generality criterion of nesting and additive phylogenetic change embodied in Weston rule offers the most powerful rooting approach. We also propose a change of focus, from phylogenies that describe the evolution of biological systems to those that describe the evolution of parts of those systems. This weakens violation of character independence, helps formalize the generality criterion of rooting, and provides new ways to study the problem of evolution.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Arshan Nasir
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan
| | - Kyung Mo Kim
- Division of Polar Life Sciences, Korea Polar Research Institute, Incheon, Republic of Korea
| | - Derek Caetano-Anollés
- Department of Evolutionary Genetics, Max-Planck-Institut für Evolutionsbiologie, Plön, Germany
| |
Collapse
|
5
|
Levasseur A, Merhej V, Baptiste E, Sharma V, Pontarotti P, Raoult D. The Rhizome of Lokiarchaeota Illustrates the Mosaicity of Archaeal Genomes. Genome Biol Evol 2018; 9:2635-2639. [PMID: 29048529 PMCID: PMC5737619 DOI: 10.1093/gbe/evx208] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2017] [Indexed: 12/23/2022] Open
Abstract
Genome remodeling and exchange of sequences are widespread in the prokaryotic world and mosaic genomes challenge the classification of prokaryotes, which cannot be properly achieved in terms of a single gene or group of genes. Here, we studied individually the gene collection of the archaic microorganism Lokiarchaeum sp., suggested as an archaeal host close to the emergence of the eukaryotes. The network or rhizome of all Lokiarchaeum sp. genes revealed that the genomic repertoire is mainly composed of genes from archaeal (∼36%) and bacterial origin (∼28%), distantly followed by components of eukaryotic origin (∼2%). Thirty-three percent of genes were unique to this species (ORFans). The mosaicity of archaea was also supported by studying Methanomassiliicoccus luminyensis, an archaea from the gut, in which 67% of the genomic repertoire arised from archaea and 22% from bacteria. Our results illustrate the intricate evolutionary relationships of the archaeal genome repertoire and highlight the rhizome-like processes of evolution in archaea, their mosaicity, and chimeric origin composed of different domains of life, questioning the reality of a tree of life.
Collapse
Affiliation(s)
- Anthony Levasseur
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE), UM63, CNRS 7278, IRD 198, INSERM 1095, AP-HM, IHU - Méditerranée Infection, Faculté de Médecine, Aix-Marseille Université, France.,Institut Universitaire de France (IUF), Paris, France
| | - Vicky Merhej
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE), UM63, CNRS 7278, IRD 198, INSERM 1095, AP-HM, IHU - Méditerranée Infection, Faculté de Médecine, Aix-Marseille Université, France
| | - Emeline Baptiste
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE), UM63, CNRS 7278, IRD 198, INSERM 1095, AP-HM, IHU - Méditerranée Infection, Faculté de Médecine, Aix-Marseille Université, France
| | - Vikas Sharma
- CNRS, Centrale Marseille, I2M, UMR7373, FR 4213 - FR Eccorev 3098, Équipe EBM, Aix-Marseille Université, France
| | - Pierre Pontarotti
- CNRS, Centrale Marseille, I2M, UMR7373, FR 4213 - FR Eccorev 3098, Équipe EBM, Aix-Marseille Université, France
| | - Didier Raoult
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE), UM63, CNRS 7278, IRD 198, INSERM 1095, AP-HM, IHU - Méditerranée Infection, Faculté de Médecine, Aix-Marseille Université, France
| |
Collapse
|
6
|
Tocheva EI, Ortega DR, Jensen GJ. Sporulation, bacterial cell envelopes and the origin of life. Nat Rev Microbiol 2016; 14:535-542. [PMID: 28232669 DOI: 10.1038/nrmicro.2016.85] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Electron cryotomography (ECT) enables the 3D reconstruction of intact cells in a near-native state. Images produced by ECT have led to the proposal that an ancient sporulation-like event gave rise to the second membrane in diderm bacteria. Tomograms of sporulating monoderm and diderm bacterial cells show how sporulation can lead to the generation of diderm cells. Tomograms of Gram-negative and Gram-positive cell walls and purified sacculi suggest that they are more closely related than previously thought and support the hypothesis that they share a common origin. Mapping the distribution of cell envelope architectures onto a recent phylogenetic tree of life indicates that the diderm cell plan, and therefore the sporulation-like event that gave rise to it, must be very ancient. One explanation for this model is that during the cataclysmic transitions of the early Earth, cellular evolution may have gone through a bottleneck in which only spores survived, which implies that the last bacterial common ancestor was a spore.
Collapse
Affiliation(s)
- Elitza I Tocheva
- Department of Stomatology and the Department of Biochemistry and Molecular Medicine, Université de Montréal, P. O. Box 6128 Station Centre-Ville, Montreal, Québec H3C 3J7, Canada
| | - Davi R Ortega
- Department of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, California 91125, USA
| | - Grant J Jensen
- Howard Hughes Medical Institute, Department of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, California 91125, USA
| |
Collapse
|
7
|
Gupta RS. Impact of genomics on the understanding of microbial evolution and classification: the importance of Darwin's views on classification. FEMS Microbiol Rev 2016; 40:520-53. [PMID: 27279642 DOI: 10.1093/femsre/fuw011] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2016] [Indexed: 12/24/2022] Open
Abstract
Analyses of genome sequences, by some approaches, suggest that the widespread occurrence of horizontal gene transfers (HGTs) in prokaryotes disguises their evolutionary relationships and have led to questioning of the Darwinian model of evolution for prokaryotes. These inferences are critically examined in the light of comparative genome analysis, characteristic synapomorphies, phylogenetic trees and Darwin's views on examining evolutionary relationships. Genome sequences are enabling discovery of numerous molecular markers (synapomorphies) such as conserved signature indels (CSIs) and conserved signature proteins (CSPs), which are distinctive characteristics of different prokaryotic taxa. Based on these molecular markers, exhibiting high degree of specificity and predictive ability, numerous prokaryotic taxa of different ranks, currently identified based on the 16S rRNA gene trees, can now be reliably demarcated in molecular terms. Within all studied groups, multiple CSIs and CSPs have been identified for successive nested clades providing reliable information regarding their hierarchical relationships and these inferences are not affected by HGTs. These results strongly support Darwin's views on evolution and classification and supplement the current phylogenetic framework based on 16S rRNA in important respects. The identified molecular markers provide important means for developing novel diagnostics, therapeutics and for functional studies providing important insights regarding prokaryotic taxa.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
8
|
Gao B, Gupta RS. Phylogenetic framework and molecular signatures for the main clades of the phylum Actinobacteria. Microbiol Mol Biol Rev 2012; 76:66-112. [PMID: 22390973 PMCID: PMC3294427 DOI: 10.1128/mmbr.05011-11] [Citation(s) in RCA: 167] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria.
Collapse
Affiliation(s)
- Beile Gao
- Department of Biochemistry and Biomedical Science, McMaster University, Hamilton, Ontario, Canada
| | | |
Collapse
|
9
|
Microbial systematics in the post-genomics era. Antonie van Leeuwenhoek 2011; 101:45-54. [DOI: 10.1007/s10482-011-9663-1] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 10/15/2011] [Indexed: 10/16/2022]
|
10
|
Rooting phylogenies using gene duplications: An empirical example from the bees (Apoidea). Mol Phylogenet Evol 2011; 60:295-304. [DOI: 10.1016/j.ympev.2011.05.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2010] [Revised: 04/26/2011] [Accepted: 05/03/2011] [Indexed: 12/23/2022]
|
11
|
Origin of diderm (Gram-negative) bacteria: antibiotic selection pressure rather than endosymbiosis likely led to the evolution of bacterial cells with two membranes. Antonie van Leeuwenhoek 2011; 100:171-82. [PMID: 21717204 PMCID: PMC3133647 DOI: 10.1007/s10482-011-9616-8] [Citation(s) in RCA: 96] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2011] [Accepted: 06/20/2011] [Indexed: 01/01/2023]
Abstract
The prokaryotic organisms can be divided into two main groups depending upon whether their cell envelopes contain one membrane (monoderms) or two membranes (diderms). It is important to understand how these and other variations that are observed in the cell envelopes of prokaryotic organisms have originated. In 2009, James Lake proposed that cells with two membranes (primarily Gram-negative bacteria) originated from an ancient endosymbiotic event involving an Actinobacteria and a Clostridia (Lake 2009). However, this Perspective argues that this proposal is based on a number of incorrect assumptions and the data presented in support of this model are also of questionable nature. Thus, there is no reliable evidence to support the endosymbiotic origin of double membrane bacteria. In contrast, many observations suggest that antibiotic selection pressure was an important selective force in prokaryotic evolution and that it likely played a central role in the evolution of diderm (Gram-negative) bacteria. Some bacterial phyla, such as Deinococcus-Thermus, which lack lipopolysaccharide (LPS) and yet contain some characteristics of the diderm bacteria, are postulated as evolutionary intermediates (simple diderms) in the transition between the monoderm bacterial taxa and the bacterial groups that have the archetypal LPS-containing outer cell membrane found in Gram-negative bacteria. It is possible to distinguish the two stages in the evolution of diderm-LPS cells (viz. monoderm bacteria → simple diderms lacking LPS → LPS containing archetypal diderm bacteria) by means of conserved inserts in the Hsp70 and Hsp60 proteins. The insert in the Hsp60 protein also distinguishes the traditional Gram-negative diderm bacterial phyla from atypical taxa of diderm bacteria (viz. Negativicutes, Fusobacteria, Synergistetes and Elusimicrobia). The Gram-negative bacterial phyla with an LPS-diderm cell envelope, as defined by the presence of the Hsp60 insert, are indicated to form a monophyletic clade and no loss of the outer membrane from any species from this group seems to have occurred. This argues against the origin of monoderm prokaryotes from diderm bacteria by loss of outer membrane.
Collapse
|
12
|
McDonald GD, Storrie-Lombardi MC. Biochemical constraints in a protobiotic earth devoid of basic amino acids: the "BAA(-) world". ASTROBIOLOGY 2010; 10:989-1000. [PMID: 21162678 DOI: 10.1089/ast.2010.0484] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
It has been hypothesized in this journal and elsewhere, based on surveys of published data from prebiotic synthesis experiments and carbonaceous meteorite analyses, that basic amino acids such as lysine and arginine were not abundant on prebiotic Earth. If the basic amino acids were incorporated only rarely into the first peptides formed in that environment, it is important to understand what protobiotic chemistry is possible in their absence. As an initial test of the hypothesis that basic amino acid negative [BAA(-)] proteins could have performed at least a subset of protobiotic chemistry, the current work reports on a survey of 13 archaeal and 13 bacterial genomes that has identified 61 modern gene sequences coding for known or putative proteins not containing arginine or lysine. Eleven of the sequences found code for proteins whose functions are well known and important in the biochemistry of modern microbial life: lysine biosynthesis protein LysW, arginine cluster proteins, copper ion binding proteins, bacterial flagellar proteins, and PE or PPE family proteins. These data indicate that the lack of basic amino acids does not prevent peptides or proteins from serving useful structural and biochemical functions. However, as would be predicted from fundamental physicochemical principles, we see no fossil evidence of prebiotic BAA(-) peptide sequences capable of interacting directly with nucleic acids.
Collapse
Affiliation(s)
- Gene D McDonald
- Department of Chemistry and Biochemistry, University of Texas at Austin, Austin, Texas 78712, USA.
| | | |
Collapse
|
13
|
Molecular signatures for the Crenarchaeota and the Thaumarchaeota. Antonie van Leeuwenhoek 2010; 99:133-57. [PMID: 20711675 DOI: 10.1007/s10482-010-9488-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Accepted: 07/26/2010] [Indexed: 10/19/2022]
Abstract
Crenarchaeotes found in mesophilic marine environments were recently placed into a new phylum of Archaea called the Thaumarchaeota. However, very few molecular characteristics of this new phylum are currently known which can be used to distinguish them from the Crenarchaeota. In addition, their relationships to deep-branching archaeal lineages are unclear. We report here detailed analyses of protein sequences from Crenarchaeota and Thaumarchaeota that have identified many conserved signature indels (CSIs) and signature proteins (SPs) (i.e., proteins for which all significant blast hits are from these groups) that are specific for these archaeal groups. Of the identified signatures 6 CSIs and 13 SPs are specific for the Crenarchaeota phylum; 6 CSIs and >250 SPs are uniquely found in various Thaumarchaeota (viz. Cenarchaeum symbiosum, Nitrosopumilus maritimus and a number of uncultured marine crenarchaeotes) and 3 CSIs and ~10 SPs are found in both Thaumarchaeota and Crenarchaeota species. Some of the molecular signatures are also present in Korarchaeum cryptofilum, which forms the independent phylum Korarchaeota. Although some of these molecular signatures suggest a distant shared ancestry between Thaumarchaeota and Crenarchaeota, our identification of large numbers of Thaumarchaeota-specific proteins and their deep branching between the Crenarchaeota and Euryarchaeota phyla in phylogenetic trees shows that they are distinct from both Crenarchaeota and Euryarchaeota in both genetic and phylogenetic terms. These observations support the placement of marine mesophilic archaea into the separate phylum Thaumarchaeota. Additionally, many CSIs and SPs have been found that are specific for different orders within Crenarchaeota (viz. Sulfolobales-3 CSIs and 169 SPs, Thermoproteales-5 CSIs and 25 SPs, Desulfurococcales-4 SPs, and Sulfolobales and Desulfurococcales-2 CSIs and 18 SPs). The signatures described here provide novel means for distinguishing the Crenarchaeota and the Thaumarchaeota and for the classification of related and novel species in different environments. Functional studies on these signature proteins could lead to discovery of novel biochemical properties that are unique to these groups of archaea.
Collapse
|
14
|
Dagan T, Roettger M, Bryant D, Martin W. Genome networks root the tree of life between prokaryotic domains. Genome Biol Evol 2010; 2:379-92. [PMID: 20624742 PMCID: PMC2997548 DOI: 10.1093/gbe/evq025] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Eukaryotes arose from prokaryotes, hence the root in the tree of life resides among the prokaryotic domains. The position of the root is still debated, although pinpointing it would aid our understanding of the early evolution of life. Because prokaryote evolution was long viewed as a tree-like process of lineage bifurcations, efforts to identify the most ancient microbial lineage split have traditionally focused on positioning a root on a phylogenetic tree constructed from one or several genes. Such studies have delivered widely conflicting results on the position of the root, this being mainly due to methodological problems inherent to deep gene phylogeny and the workings of lateral gene transfer among prokaryotes over evolutionary time. Here, we report the position of the root determined with whole genome data using network-based procedures that take into account both gene presence or absence and the level of sequence similarity among all individual gene families that are shared across genomes. On the basis of 562,321 protein-coding gene families distributed across 191 genomes, we find that the deepest divide in the prokaryotic world is interdomain, that is, separating the archaebacteria from the eubacteria. This result resonates with some older views but conflicts with the results of most studies over the last decade that have addressed the issue. In particular, several studies have suggested that the molecular distinctness of archaebacteria is not evidence for their antiquity relative to eubacteria but instead stems from some kind of inherently elevated rate of archaebacterial sequence change. Here, we specifically test for such a rate elevation across all prokaryotic lineages through the analysis of all possible quartets among eight genes duplicated in all prokaryotes, hence the last common ancestor thereof. The results show that neither the archaebacteria as a group nor the eubacteria as a group harbor evidence for elevated evolutionary rates in the sampled genes, either in the recent evolutionary past or in their common ancestor. The interdomain prokaryotic position of the root is thus not attributable to lineage-specific rate variation.
Collapse
Affiliation(s)
- Tal Dagan
- Institute of Botany III, Heinrich-Heine University of Düsseldorf, Düsseldorf, Germany.
| | | | | | | |
Collapse
|
15
|
Valas RE, Bourne PE. Save the tree of life or get lost in the woods. Biol Direct 2010; 5:44. [PMID: 20594329 PMCID: PMC2910001 DOI: 10.1186/1745-6150-5-44] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2010] [Accepted: 07/01/2010] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND The wealth of prokaryotic genomic data available has revealed that the histories of many genes are inconsistent, leading some to question the value of the tree of life hypothesis. It has been argued that a tree-like representation requires suppressing too much information, and that a more pluralistic approach is necessary for understanding prokaryotic evolution. We argue that trees may still be a useful representation for evolutionary histories in light of new data. RESULTS Genomic data alone can be highly misleading when trying to resolve the tree of life. We present evidence from protein abundance data sets that genomic conservation greatly underestimates functional conservation. Function follows more of a tree-like structure than genetic material, even in the presence of horizontal transfer. We argue that the tree of cells must be incorporated into any new synthesis in order to place horizontal transfers into their proper selective context. We also discuss the role data sources other than primary sequence can play in resolving the tree of cells. CONCLUSIONS The tree of life is alive, but not well. Construction of the tree of cells has been viewed as the end goal of the study of evolution, where in reality we need to consider it more of a starting point. We propose a duality where we must consider variation of genetic material in terms of networks and selection of cellular function in terms of trees. Otherwise one gets lost in the woods of neutral evolution.
Collapse
Affiliation(s)
- Ruben E Valas
- Bioinformatics Program, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Philip E Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| |
Collapse
|
16
|
|
17
|
Cavalier-Smith T. Origin of the cell nucleus, mitosis and sex: roles of intracellular coevolution. Biol Direct 2010; 5:7. [PMID: 20132544 PMCID: PMC2837639 DOI: 10.1186/1745-6150-5-7] [Citation(s) in RCA: 139] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2009] [Accepted: 02/04/2010] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND The transition from prokaryotes to eukaryotes was the most radical change in cell organisation since life began, with the largest ever burst of gene duplication and novelty. According to the coevolutionary theory of eukaryote origins, the fundamental innovations were the concerted origins of the endomembrane system and cytoskeleton, subsequently recruited to form the cell nucleus and coevolving mitotic apparatus, with numerous genetic eukaryotic novelties inevitable consequences of this compartmentation and novel DNA segregation mechanism. Physical and mutational mechanisms of origin of the nucleus are seldom considered beyond the long-standing assumption that it involved wrapping pre-existing endomembranes around chromatin. Discussions on the origin of sex typically overlook its association with protozoan entry into dormant walled cysts and the likely simultaneous coevolutionary, not sequential, origin of mitosis and meiosis. RESULTS I elucidate nuclear and mitotic coevolution, explaining the origins of dicer and small centromeric RNAs for positionally controlling centromeric heterochromatin, and how 27 major features of the cell nucleus evolved in four logical stages, making both mechanisms and selective advantages explicit: two initial stages (origin of 30 nm chromatin fibres, enabling DNA compaction; and firmer attachment of endomembranes to heterochromatin) protected DNA and nascent RNA from shearing by novel molecular motors mediating vesicle transport, division, and cytoplasmic motility. Then octagonal nuclear pore complexes (NPCs) arguably evolved from COPII coated vesicle proteins trapped in clumps by Ran GTPase-mediated cisternal fusion that generated the fenestrated nuclear envelope, preventing lethal complete cisternal fusion, and allowing passive protein and RNA exchange. Finally, plugging NPC lumens by an FG-nucleoporin meshwork and adopting karyopherins for nucleocytoplasmic exchange conferred compartmentation advantages. These successive changes took place in naked growing cells, probably as indirect consequences of the origin of phagotrophy. The first eukaryote had 1-2 cilia and also walled resting cysts; I outline how encystation may have promoted the origin of meiotic sex. I also explain why many alternative ideas are inadequate. CONCLUSION Nuclear pore complexes are evolutionary chimaeras of endomembrane- and mitosis-related chromatin-associated proteins. The keys to understanding eukaryogenesis are a proper phylogenetic context and understanding organelle coevolution: how innovations in one cell component caused repercussions on others.
Collapse
|
18
|
Lake JA, Skophammer RG, Herbold CW, Servin JA. Genome beginnings: rooting the tree of life. Philos Trans R Soc Lond B Biol Sci 2009; 364:2177-85. [PMID: 19571238 DOI: 10.1098/rstb.2009.0035] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
A rooted tree of life provides a framework to answer central questions about the evolution of life. Here we review progress on rooting the tree of life and introduce a new root of life obtained through the analysis of indels, insertions and deletions, found within paralogous gene sets. Through the analysis of indels in eight paralogous gene sets, the root is localized to the branch between the clade consisting of the Actinobacteria and the double-membrane (Gram-negative) prokaryotes and one consisting of the archaebacteria and the firmicutes. This root provides a new perspective on the habitats of early life, including the evolution of methanogenesis, membranes and hyperthermophily, and the speciation of major prokaryotic taxa. Our analyses exclude methanogenesis as a primitive metabolism, in contrast to previous findings. They parsimoniously imply that the ether archaebacterial lipids are not primitive and that the cenancestral prokaryotic population consisted of organisms enclosed by a single, ester-linked lipid membrane, covered by a peptidoglycan layer. These results explain the similarities previously noted by others between the lipid synthesis pathways in eubacteria and archaebacteria. The new root also implies that the last common ancestor was not hyperthermophilic, although moderate thermophily cannot be excluded.
Collapse
Affiliation(s)
- James A Lake
- Department of Molecular, Cellular and Developmental Biology, University of California, Los Angeles, CA 90095, USA.
| | | | | | | |
Collapse
|
19
|
Abstract
Endosymbioses have dramatically altered eukaryotic life, but are thought to have negligibly affected prokaryotic evolution. Here, by analysing the flows of protein families, I present evidence that the double-membrane, gram-negative prokaryotes were formed as the result of a symbiosis between an ancient actinobacterium and an ancient clostridium. The resulting taxon has been extraordinarily successful, and has profoundly altered the evolution of life by providing endosymbionts necessary for the emergence of eukaryotes and by generating Earth's oxygen atmosphere. Their double-membrane architecture and the observed genome flows into them suggest a common evolutionary mechanism for their origin: an endosymbiosis between a clostridium and actinobacterium.
Collapse
Affiliation(s)
- James A Lake
- Department of Molecular, Cellular and Developmental Biology, University of California, Los Angeles, California 90095, USA.
| |
Collapse
|
20
|
Valas RE, Bourne PE. Structural analysis of polarizing indels: an emerging consensus on the root of the tree of life. Biol Direct 2009; 4:30. [PMID: 19706177 PMCID: PMC3224940 DOI: 10.1186/1745-6150-4-30] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2009] [Accepted: 08/25/2009] [Indexed: 11/18/2022] Open
Abstract
Background The root of the tree of life has been a holy grail ever since Darwin first used the tree as a metaphor for evolution. New methods seek to narrow down the location of the root by excluding it from branches of the tree of life. This is done by finding traits that must be derived, and excluding the root from the taxa those traits cover. However the two most comprehensive attempts at this strategy, performed by Cavalier-Smith and Lake et al., have excluded each other's rootings. Results The indel polarizations of Lake et al. rely on high quality alignments between paralogs that diverged before the last universal common ancestor (LUCA). Therefore, sequence alignment artifacts may skew their conclusions. We have reviewed their data using protein structure information where available. Several of the conclusions are quite different when viewed in the light of structure which is conserved over longer evolutionary time scales than sequence. We argue there is no polarization that excludes the root from all Gram-negatives, and that polarizations robustly exclude the root from the Archaea. Conclusion We conclude that there is no contradiction between the polarization datasets. The combination of these datasets excludes the root from every possible position except near the Chloroflexi. Reviewers This article was reviewed by Greg Fournier (nominated by J. Peter Gogarten), Purificación López-García, and Eugene Koonin.
Collapse
Affiliation(s)
- Ruben E Valas
- Bioinformatics Program, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA.
| | | |
Collapse
|
21
|
Singh B, Gupta RS. Conserved inserts in the Hsp60 (GroEL) and Hsp70 (DnaK) proteins are essential for cellular growth. Mol Genet Genomics 2009; 281:361-73. [PMID: 19127371 DOI: 10.1007/s00438-008-0417-3] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2008] [Accepted: 12/22/2008] [Indexed: 11/25/2022]
Abstract
The Hsp60 and Hsp70 chaperones contain a number of conserved inserts that are restricted to particular phyla of bacteria. A one aa insert in the E. coli GroEL and a 21-23 insert in the DnaK proteins are specific for most Gram-negative bacteria. Two other inserts in DnaK are limited to certain groups of proteobacteria. The requirement of these inserts for cellular growth was examined by carrying out complementation studies with temperature-sensitive (T(s)) mutants of E. coli groEL or dnaK. Our results demonstrate that deletion or most changes in these inserts completely abolished the complementation ability of the mutant proteins. Studies with GroEL and DnaK from some other species that either lacked or contained these inserts also indicated that these inserts are essential for growth of E. coli. The DnaK from some bacteria contains a two aa insert that is not found in E. coli. Introduction of this insert into the E. coli DnaK also led to its inactivation, indicating that these inserts are specific for different groups. We postulate that these conserved inserts that are localized in loop regions on protein surfaces, are involved in some ancillary functions that are essential for the groups of bacteria where they are found.
Collapse
Affiliation(s)
- Bhag Singh
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, L8N 3Z5, Canada
| | | |
Collapse
|
22
|
Renner SS, Grimm GW, Schneeweiss GM, Stuessy TF, Ricklefs RE. Rooting and dating maples (Acer) with an uncorrelated-rates molecular clock: implications for north American/Asian disjunctions. Syst Biol 2008; 57:795-808. [PMID: 18853365 DOI: 10.1080/10635150802422282] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
Simulations suggest that molecular clock analyses can correctly identify the root of a tree even when the clock assumption is severely violated. Clock-based rooting of phylogenies may be particularly useful when outgroup rooting is problematic. Here, we explore relaxed-clock rooting in the Acer/Dipteronia clade of Sapindaceae, which comprises genera of highly uneven species richness and problematic mutual monophyly. Using an approach that does not presuppose rate autocorrelation between ancestral and descendant branches and hence does not require a rooted a priori topology, we analyzed data from up to seven chloroplast loci for some 50 ingroup species. For comparison, we used midpoint and outgroup rooting and dating methods that rely on rooted input trees, namely penalized likelihood, a Bayesian autocorrelated-rates model, and a strict clock. The chloroplast sequences used here reject a single global substitution rate, and the assumption of autocorrelated rates was also rejected. The root was placed between Acer and Dipteronia by all three rooting methods, albeit with low statistical support. Analyses of Acer diversification with a lineage-through-time plot and different survival models, although sensitive to missing data, suggest a gradual decrease in the average diversification rate. The nine North American species of Acer diverged from their nearest relatives at widely different times: eastern American Acer diverged in the Oligocene and Late Miocene; western American species in the Late Eocene and Mid Miocene; and the Acer core clade, including A. saccharum, dates to the Miocene. Recent diversification in North America is strikingly rare compared to diversification in eastern Asia.
Collapse
|
23
|
Lake JA, Servin JA, Herbold CW, Skophammer RG. Evidence for a New Root of the Tree of Life. Syst Biol 2008; 57:835-43. [DOI: 10.1080/10635150802555933] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Affiliation(s)
- James A. Lake
- Department of Molecular, Cellular, and Developmental Biology, University of California Los Angeles, California 90095, USA; E-mail: (J.A.L.)
- Molecular Biology Institute, University of California Los Angeles, California 90095, USA
- Department of Human Genetics, University of California Los Angeles, California 90095, USA
- UCLA Astrobiology Institute, University of California Los Angeles, California 90095, USA
| | - Jacqueline A. Servin
- Molecular Biology Institute, University of California Los Angeles, California 90095, USA
- UCLA Astrobiology Institute, University of California Los Angeles, California 90095, USA
| | - Craig W. Herbold
- Molecular Biology Institute, University of California Los Angeles, California 90095, USA
- UCLA Astrobiology Institute, University of California Los Angeles, California 90095, USA
| | - Ryan G. Skophammer
- Department of Molecular, Cellular, and Developmental Biology, University of California Los Angeles, California 90095, USA; E-mail: (J.A.L.)
- UCLA Astrobiology Institute, University of California Los Angeles, California 90095, USA
| |
Collapse
|
24
|
Cavalier-Smith T. Predation and eukaryote cell origins: a coevolutionary perspective. Int J Biochem Cell Biol 2008; 41:307-22. [PMID: 18935970 DOI: 10.1016/j.biocel.2008.10.002] [Citation(s) in RCA: 112] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2008] [Revised: 10/06/2008] [Accepted: 10/08/2008] [Indexed: 12/23/2022]
Abstract
Cells are of only two kinds: bacteria, with DNA segregated by surface membrane motors, dating back approximately 3.5Gy; and eukaryotes, which evolved from bacteria, possibly as recently as 800-850My ago. The last common ancestor of eukaryotes was a sexual phagotrophic protozoan with mitochondria, one or two centrioles and cilia. Conversion of bacteria (=prokaryotes) into a eukaryote involved approximately 60 major innovations. Numerous contradictory ideas about eukaryogenesis fail to explain fundamental features of eukaryotic cell biology or conflict with phylogeny. Data are best explained by the intracellular coevolutionary theory, with three basic tenets: (1) the eukaryotic cytoskeleton and endomembrane system originated through cooperatively enabling the evolution of phagotrophy; (2) phagocytosis internalised DNA-membrane attachments, unavoidably disrupting bacterial division; recovery entailed the evolution of the nucleus and mitotic cycle; (3) the symbiogenetic origin of mitochondria immediately followed the perfection of phagotrophy and intracellular digestion, contributing greater energy efficiency and group II introns as precursors of spliceosomal introns. Eukaryotes plus their archaebacterial sisters form the clade neomura, which evolved from a radically modified derivative of an actinobacterial posibacterium that had replaced the ancestral eubacterial murein peptidoglycan by N-linked glycoproteins, radically modified its DNA-handling enzymes, and evolved cotranslational protein secretion, but not the isoprenoid-ether lipids of archaebacteria. I focus on this phylogenetic background and on explaining how in response to novel phagotrophic selective pressures and ensuing genome internalisation this prekaryote evolved efficient digestion of prey proteins by retrotranslocation and 26S proteasomes, then internal digestion by phagocytosis, lysosomes, and peroxisomes, and eukaryotic vesicle trafficking and intracellular compartmentation.
Collapse
Affiliation(s)
- T Cavalier-Smith
- Department of Zoology, University of Oxford, South Parks Road, Oxford, OX1 3PS, UK.
| |
Collapse
|
25
|
Omland KE, Cook LG, Crisp MD. Tree thinking for all biology: the problem with reading phylogenies as ladders of progress. Bioessays 2008; 30:854-67. [PMID: 18693264 DOI: 10.1002/bies.20794] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Phylogenies are increasingly prominent across all of biology, especially as DNA sequencing makes more and more trees available. However, their utility is compromised by widespread misconceptions about what phylogenies can tell us, and improved "tree thinking" is crucial. The most-serious problem comes from reading trees as ladders from "left to right"--many biologists assume that species-poor lineages that appear "early branching" or "basal" are ancestral--we call this the "primitive lineage fallacy". This mistake causes misleading inferences about changes in individual characteristics and leads to misrepresentation of the evolutionary process. The problem can be rectified by considering that modern phylogenies of present-day species and genes show relationships among evolutionary cousins. Emphasizing that these are extant entities in the 21(st) century will help correct inferences about ancestral characteristics, and will enable us to leave behind 19(th) century notions about the ladder of progress driving evolution.
Collapse
Affiliation(s)
- Kevin E Omland
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore MD 21250, USA.
| | | | | |
Collapse
|
26
|
A unique combination of genetic systems for the synthesis of trehalose in Rubrobacter xylanophilus: properties of a rare actinobacterial TreT. J Bacteriol 2008; 190:7939-46. [PMID: 18835983 DOI: 10.1128/jb.01055-08] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Trehalose is the primary organic solute in Rubrobacter xylanophilus under all conditions tested, including those for optimal growth. We detected genes of four different pathways for trehalose synthesis in the genome of this organism, namely, the trehalose-6-phosphate synthase (Tps)/trehalose-6-phosphate phosphatase (Tpp), TreS, TreY/TreZ, and TreT pathways. Moreover, R. xylanophilus is the only known member of the phylum Actinobacteria to harbor TreT. The Tps sequence is typically bacterial, but the Tpp sequence is closely related to eukaryotic counterparts. Both the Tps/Tpp and the TreT pathways were active in vivo, while the TreS and the TreY/TreZ pathways were not active under the growth conditions tested and appear not to contribute to the levels of trehalose observed. The genes from the active pathways were functionally expressed in Escherichia coli, and Tps was found to be highly specific for GDP-glucose, a rare feature among these enzymes. The trehalose-6-phosphate formed was specifically dephosphorylated to trehalose by Tpp. The recombinant TreT synthesized trehalose from different nucleoside diphosphate-glucose donors and glucose, but the activity in R. xylanophilus cell extracts was specific for ADP-glucose. The TreT could also catalyze trehalose hydrolysis in the presence of ADP, but with a very high K(m). Here, we functionally characterize two systems for the synthesis of trehalose in R. xylanophilus, a representative of an ancient lineage of the actinobacteria, and discuss a possible scenario for the exceptional occurrence of treT in this extremophilic bacterium.
Collapse
|
27
|
Evolutionary persistence of the molybdopyranopterin-containing sulfite oxidase protein fold. Microbiol Mol Biol Rev 2008; 72:228-48, table of contents. [PMID: 18535145 DOI: 10.1128/mmbr.00041-07] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
SUMMARY The importance of molybdoenzymes is exemplified both by the debilitating and fatal human diseases caused by their deficiency and by their persistence throughout evolution. Here, we show that the protein fold of the molybdopyranopterin-containing domain of sulfite oxidase (the SUOX fold) can be found in all three domains of life. Analyses of sequence data and protein structure comparisons (secondary structure matching) show that the SUOX fold is found in enzymes that have quite distinct macromolecular architectures comprising one or more domains and sometimes subsidiary subunits. These are summarized as follows: (i) animal SUOXs that contain an N-terminal cytochrome b(5) domain and an SUOX fold fused to a C-terminal dimerization domain; (ii) plant SUOX that contains an SUOX fold fused to a C-terminal dimerization domain; (iii) the YedY protein from Escherichia coli, which comprises only the SUOX fold; (iv) the sulfite dehydrogenase from Starkeya novella that contains the SUOX fold, a dimerization domain, and an additional c-type cytochrome subunit; and (v) the plant-type nitrate reductases, exemplified by that of Pichia angusta, that contain an N-terminal SUOX fold, a dimerization domain, a cytochrome b(5) domain, and a C-terminal NADH binding flavin adenine dinucleotide-containing domain. We used the primary sequences of the proteins containing an SUOX fold to mine 559 sequences of related proteins. A phylogeny of a nonredundant subset of these sequences was generated, and the resultant clades were categorized by sequence motif analyses in the context of the available protein structures. Based on the motif analyses, cladistics, and domain conservations, we are able to postulate a plausible pathway of SUOX fold enzyme evolution.
Collapse
|
28
|
Abstract
The increasing recognition that symbioses have greatly altered evolution through genome fusions is creating a need for algorithms that can reliably detect and reconstruct fusions. Here, we generalize the bootstrappers gambit algorithm (a quartet method) in order to permit it to analyze both bifurcations and fusions under a single mathematical model, and thereby detect past genomic branchings and endosymbioses. This new method, 3-dimensional parsimony, can be applied to aligned sequences, such as gene, indel, or other genomic presence/absence sequences. It also provides a statistical measure of support for each possible graph. The usefulness of this method is demonstrated by applying it to the ring of life.
Collapse
Affiliation(s)
- James A Lake
- Department of Molecular, Cellular, and Developmental Biology, University of California, Los Angeles, USA
| |
Collapse
|
29
|
Poole AM, Willerslev E. Can identification of a fourth domain of life be made from sequence data alone, and could it be done on Mars? ASTROBIOLOGY 2007; 7:801-814. [PMID: 17963478 DOI: 10.1089/ast.2006.0094] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
A central question in astrobiology is whether life exists elsewhere in the universe. If so, is it related to Earth life? Technologies exist that enable identification of DNA- or RNA-based microbial life directly from environmental samples here on Earth. Such technologies could, in principle, be applied to the search for life elsewhere; indeed, efforts are underway to initiate such a search. However, surveying for nucleic acid-based life on other planets, if attempted, must be carried out with caution, owing to the risk of contamination by Earth-based life. Here we argue that the null hypothesis must be that any DNA discovered and sequenced from samples taken elsewhere in the universe are Earth-based contaminants. Experience from studies of low-biomass ancient DNA demonstrates that some results, by their very nature, will not enable complete rejection of the null hypothesis. In terms of eliminating contamination as an explanation of the data, there may be value in identification of sequences that lie outside the known diversity of the three domains of life. We therefore have examined whether a fourth domain could be readily identified from environmental DNA sequence data alone. We concluded that, even on Earth, this would be far from trivial, and we illustrate this point by way of examples drawn from the literature. Overall, our conclusions do not bode well for planned PCR-based surveys for life on Mars, and we argue that other independent biosignatures will be essential in corroborating any claims for the presence of life based on nucleic acid sequences.
Collapse
Affiliation(s)
- Anthony M Poole
- Department of Molecular Biology and Functional Genomics, Stockholm University, Stockholm, Sweden.
| | | |
Collapse
|
30
|
Becerra A, Delaye L, Lazcano A, Orgel LE. Protein disulfide oxidoreductases and the evolution of thermophily: was the last common ancestor a heat-loving microbe? J Mol Evol 2007; 65:296-303. [PMID: 17726569 DOI: 10.1007/s00239-007-9005-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2007] [Accepted: 06/04/2007] [Indexed: 12/13/2022]
Abstract
Protein disulfide oxidoreductases (PDOs) are redox enzymes that catalyze dithiol-disulfide exchange reactions. Their sequences and structure reveal the presence of two thioredoxin fold units, each of which is endowed with a catalytic site CXXC motif. PDOs are the outcome of an ancient gene duplication event. They have been described in a number of thermophilic and hyperthermophilic species, where they play a critical role in the structural stabilization of intracellular proteins. PDOs are homologous to both the N-terminal domain of the bacterial alkyl hydroperoxide reductase (AhpF) and to the eukaryotic protein disulfide isomerase (PDI). Phylogenetic analysis of PDOs suggests that they first evolved in the crenarchaeota, spreading from them into the Bacteria via the euryarchaeota. These results imply that the last common ancestor (LCA) of all extant living beings lacked a PDO and argue, albeit weakly, against a thermophilic LCA.
Collapse
Affiliation(s)
- Arturo Becerra
- Facultad de Ciencias, UNAM, Apdo Postal 70-407, Cd Universitaria, 04510, Mexico, DF, Mexico
| | | | | | | |
Collapse
|
31
|
Poptsova MS, Gogarten JP. BranchClust: a phylogenetic algorithm for selecting gene families. BMC Bioinformatics 2007; 8:120. [PMID: 17425803 PMCID: PMC1853112 DOI: 10.1186/1471-2105-8-120] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2006] [Accepted: 04/10/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Automated methods for assembling families of orthologous genes include those based on sequence similarity scores and those based on phylogenetic approaches. The first are easy to automate but usually they do not distinguish between paralogs and orthologs or have restriction on the number of taxa. Phylogenetic methods often are based on reconciliation of a gene tree with a known rooted species tree; a limitation of this approach, especially in case of prokaryotes, is that the species tree is often unknown, and that from the analyses of single gene families the branching order between related organisms frequently is unresolved. RESULTS Here we describe an algorithm for the automated selection of orthologous genes that recognizes orthologous genes from different species in a phylogenetic tree for any number of taxa. The algorithm is capable of distinguishing complete (containing all taxa) and incomplete (not containing all taxa) families and recognizes in- and outparalogs. The BranchClust algorithm is implemented in Perl with the use of the BioPerl module for parsing trees and is freely available at http://bioinformatics.org/branchclust. CONCLUSION BranchClust outperforms the Reciprocal Best Blast hit method in selecting more sets of putatively orthologous genes. In the test cases examined, the correctness of the selected families and of the identified in- and outparalogs was confirmed by inspection of the pertinent phylogenetic trees.
Collapse
Affiliation(s)
- Maria S Poptsova
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA
| | - J Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA
| |
Collapse
|