1
|
Aggarwal S, Huang E, Do H, Makthal N, Li Y, Bapteste E, Lopez P, Bernard C, Kumaraswami M. The leaderless communication peptide (LCP) class of quorum-sensing peptides is broadly distributed among Firmicutes. Nat Commun 2023; 14:5947. [PMID: 37741855 PMCID: PMC10518010 DOI: 10.1038/s41467-023-41719-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 09/14/2023] [Indexed: 09/25/2023] Open
Abstract
The human pathogen Streptococcus pyogenes secretes a short peptide (leaderless communication peptide, LCP) that mediates intercellular communication and controls bacterial virulence through interaction with its receptor, RopB. Here, we show that LCP and RopB homologues are present in other Firmicutes. We experimentally validate that LCPs with distinct peptide communication codes act as bacterial intercellular signals and regulate gene expression in Streptococcus salivarius, Streptococcus porcinus, Enterococcus malodoratus and Limosilactobacillus reuteri. Our results indicate that LCPs are more widespread than previously thought, and their characterization may uncover new signaling mechanisms and roles in coordinating diverse bacterial traits.
Collapse
Affiliation(s)
- Shifu Aggarwal
- Center for Molecular and Translational Human Infectious Diseases Research, Houston Methodist Research Institute, Houston, TX, 77030, USA
- Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, 77030, USA
| | - Elaine Huang
- Center for Molecular and Translational Human Infectious Diseases Research, Houston Methodist Research Institute, Houston, TX, 77030, USA
- Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, 77030, USA
| | - Hackwon Do
- Center for Molecular and Translational Human Infectious Diseases Research, Houston Methodist Research Institute, Houston, TX, 77030, USA
- Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, 77030, USA
- Research Unit of Cryogenic Novel Material, Korea Polar Research Institute, Incheon, 21990, South Korea
| | - Nishanth Makthal
- Center for Molecular and Translational Human Infectious Diseases Research, Houston Methodist Research Institute, Houston, TX, 77030, USA
- Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, 77030, USA
| | - Yanyan Li
- Communication Molecules and Adaptation of Microorganisms (MCAM), CNRS, Muséum National d'Histoire Naturelle, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution et Biodiversité (ISYEB), Sorbonne Université, CNRS, Muséum National d'Histoire Naturelle, Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution et Biodiversité (ISYEB), Sorbonne Université, CNRS, Muséum National d'Histoire Naturelle, Paris, France
| | - Charles Bernard
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
| | - Muthiah Kumaraswami
- Center for Molecular and Translational Human Infectious Diseases Research, Houston Methodist Research Institute, Houston, TX, 77030, USA.
- Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, 77030, USA.
| |
Collapse
|
2
|
Bapteste E, Huneman P, Keller L, Teulière J, Lopez P, Teeling EC, Lindner AB, Baudisch A, Ludington WB, Franceschi C. Expanding evolutionary theories of ageing to better account for symbioses and interactions throughout the Web of Life. Ageing Res Rev 2023; 89:101982. [PMID: 37321383 PMCID: PMC10771319 DOI: 10.1016/j.arr.2023.101982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/26/2023] [Accepted: 06/11/2023] [Indexed: 06/17/2023]
Abstract
How, when, and why organisms age are fascinating issues that can only be fully addressed by adopting an evolutionary perspective. Consistently, the main evolutionary theories of ageing, namely the Mutation Accumulation theory, the Antagonistic Pleiotropy theory, and the Disposable Soma theory, have formulated stimulating hypotheses that structure current debates on both the proximal and ultimate causes of organismal ageing. However, all these theories leave a common area of biology relatively under-explored. The Mutation Accumulation theory and the Antagonistic Pleiotropy theory were developed under the traditional framework of population genetics, and therefore are logically centred on the ageing of individuals within a population. The Disposable Soma theory, based on principles of optimising physiology, mainly explains ageing within a species. Consequently, current leading evolutionary theories of ageing do not explicitly model the countless interspecific and ecological interactions, such as symbioses and host-microbiomes associations, increasingly recognized to shape organismal evolution across the Web of Life. Moreover, the development of network modelling supporting a deeper understanding on the molecular interactions associated with ageing within and between organisms is also bringing forward new questions regarding how and why molecular pathways associated with ageing evolved. Here, we take an evolutionary perspective to examine the effects of organismal interactions on ageing across different levels of biological organisation, and consider the impact of surrounding and nested systems on organismal ageing. We also apply this perspective to suggest open issues with potential to expand the standard evolutionary theories of ageing.
Collapse
Affiliation(s)
- Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France.
| | - Philippe Huneman
- Institut d'Histoire et de Philosophie des Sciences et des Techniques (CNRS/ Université Paris I Sorbonne), Paris, France
| | - Laurent Keller
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland
| | - Jérôme Teulière
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Emma C Teeling
- School of Biology and Environmental Science, University College Dublin, Ireland
| | - Ariel B Lindner
- Université de Paris, INSERM U1284, Center for Research and Interdisciplinarity (CRI), Paris, France
| | - Annette Baudisch
- Interdisciplinary Centre on Population Dynamics, University of Southern Denmark, 5230 Odense M, Denmark
| | - William B Ludington
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD 21218, USA; Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Claudio Franceschi
- Department of Medical and Surgical Sciences, Alma Mater Studiorum University of Bologna, 40138 Bologna, Italy; Department of Applied Mathematics and Laboratory of Systems Medicine of Aging, Lobachevsky University, Nizhny Novgorod 603950, Russia
| |
Collapse
|
3
|
Bernard C, Li Y, Lopez P, Bapteste E. Large-scale identification of known and novel RRNPP quorum sensing systems by RRNPP_detector captures novel features of bacterial, plasmidic and viral co-evolution. Mol Biol Evol 2023; 40:7079777. [PMID: 36929912 PMCID: PMC10075063 DOI: 10.1093/molbev/msad062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 03/18/2023] Open
Abstract
Gram-positive Firmicutes bacteria and their mobile genetic elements (plasmids, bacteriophages) encode peptide-based quorum sensing systems (QSSs) that orchestrate behavioral transitions as a function of population densities. In their simplest form, termed "RRNPP", these QSSs are composed of two adjacent genes: a communication propeptide and its cognate intracellular receptor. RRNPP QSSs notably regulate social/competitive behaviors such as virulence or biofilm formation in bacteria, conjugation in plasmids or lysogeny in temperate bacteriophages. However, the genetic diversity and the prevalence of these communication systems, together with the breadth of behaviors they control, remain largely under-appreciated. To better assess the impact of density-dependency on microbial community dynamics and evolution, we developed the RRNPP_detector software, which predicts known and novel RRNPP QSSs in chromosomes, plasmids and bacteriophages of Firmicutes. Applying RRNPP_detector against available complete genomes of viruses and Firmicutes, we identified a rich repertoire of RRNPP QSSs from 11 already-known subfamilies and 21 novel high-confidence candidate subfamilies distributed across a vast diversity of taxa. The analysis of high-confidence RRNPP subfamilies notably revealed 14 subfamilies shared between chromosomes/plasmids/phages, 181 plasmids and 82 phages encoding multiple communication systems, phage-encoded QSSs predicted to dynamically modulate bacterial behaviors, and 196 candidate biosynthetic gene clusters under density-dependent regulation. Overall, our work enhances the field of quorum sensing research and reveals novel insights into the co-evolution of gram-positive bacteria and their mobile genetic elements.
Collapse
Affiliation(s)
- Charles Bernard
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, Campus Jussieu, Bâtiment A, 4eme et. Pièce 429, 75005 Paris, France.,Unité Molécules de Communication et Adaptation des Micro-organismes (MCAM), CNRS, Museum National d'Histoire Naturelle, CP 54, 57 rue Cuvier, 75005 Paris, France.,Department of Computational Biology, University of Lausanne, Genopode - CH-1015 Lausanne, Switzerland
| | - Yanyan Li
- Unité Molécules de Communication et Adaptation des Micro-organismes (MCAM), CNRS, Museum National d'Histoire Naturelle, CP 54, 57 rue Cuvier, 75005 Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, Campus Jussieu, Bâtiment A, 4eme et. Pièce 429, 75005 Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, Campus Jussieu, Bâtiment A, 4eme et. Pièce 429, 75005 Paris, France
| |
Collapse
|
4
|
Teulière J, Bernard C, Bonnefous H, Martens J, Lopez P, Bapteste E. Interactomics: Dozens of Viruses, Co-evolving With Humans, Including the Influenza A Virus, may Actively Distort Human Aging. Mol Biol Evol 2023; 40:6989791. [PMID: 36649176 PMCID: PMC9897028 DOI: 10.1093/molbev/msad012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 12/07/2022] [Accepted: 01/09/2023] [Indexed: 01/18/2023] Open
Abstract
Some viruses (e.g., human immunodeficiency virus 1 and severe acute respiratory syndrome coronavirus 2) have been experimentally proposed to accelerate features of human aging and of cellular senescence. These observations, along with evolutionary considerations on viral fitness, raised the more general puzzling hypothesis that, beyond documented sources in human genetics, aging in our species may also depend on virally encoded interactions distorting our aging to the benefits of diverse viruses. Accordingly, we designed systematic network-based analyses of the human and viral protein interactomes, which unraveled dozens of viruses encoding proteins experimentally demonstrated to interact with proteins from pathways associated with human aging, including cellular senescence. We further corroborated our predictions that specific viruses interfere with human aging using published experimental evidence and transcriptomic data; identifying influenza A virus (subtype H1N1) as a major candidate age distorter, notably through manipulation of cellular senescence. By providing original evidence that viruses may convergently contribute to the evolution of numerous age-associated pathways through co-evolution, our network-based and bipartite network-based methodologies support an ecosystemic study of aging, also searching for genetic causes of aging outside a focal aging species. Our findings, predicting age distorters and targets for anti-aging therapies among human viruses, could have fundamental and practical implications for evolutionary biology, aging study, virology, medicine, and demography.
Collapse
Affiliation(s)
| | | | - Hugo Bonnefous
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Johannes Martens
- Sciences, Normes, Démocratie (SND), Sorbonne Université, CNRS, Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | | |
Collapse
|
5
|
Teulière J, Bernard C, Corel E, Lapointe FJ, Martens J, Lopez P, Bapteste E. Network analyses unveil ageing-associated pathways evolutionarily conserved from fungi to animals. GeroScience 2022; 45:1059-1080. [PMID: 36508078 PMCID: PMC9886728 DOI: 10.1007/s11357-022-00704-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 12/14/2022] Open
Abstract
The genetic roots of the diverse paces and shapes of ageing and of the large variations in longevity observed across the tree of life are poorly understood. Indeed, pathways associated with ageing/longevity are incompletely known, both in terms of their constitutive genes/proteins and of their molecular interactions. Moreover, there is limited overlap between the genes constituting these pathways across mammals. Yet, dedicated comparative analyses might still unravel evolutionarily conserved, important pathways associated with longevity or ageing. Here, we used an original strategy with a double evolutionary and systemic focus to analyse protein interactions associated with ageing or longevity during the evolution of five species of Opisthokonta. We ranked these proteins and interactions based on their evolutionary conservation and centrality in past and present protein-protein interaction (PPI) networks, providing a big systemic picture of the evolution of ageing and longevity pathways that identified which pathways emerged in which Opisthokonta lineages, were conserved, and/or central. We confirmed that longevity/ageing-associated proteins (LAPs), be they pro- or anti-longevity, are highly central in extant PPI, consistently with the antagonistic pleiotropy theory of ageing, and identified key antagonistic regulators of ageing/longevity, 52 of which with homologues in humans. While some highly central LAPs were evolutionarily conserved for over a billion years, we report a clear transition in the functionally important components of ageing/longevity within bilaterians. We also predicted 487 novel evolutionarily conserved LAPs in humans, 54% of which are more central than mTOR, and 138 of which are druggable, defining new potential targets for anti-ageing treatments in humans.
Collapse
Affiliation(s)
- Jérôme Teulière
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université Des Antilles, Paris, France
| | - Charles Bernard
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université Des Antilles, Paris, France
| | - Eduardo Corel
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université Des Antilles, Paris, France
| | - François-Joseph Lapointe
- grid.14848.310000 0001 2292 3357Département de Sciences Biologiques, Complexe Des Sciences, Université de Montréal, Montréal, QC Canada
| | - Johannes Martens
- Sciences, Normes, Démocratie (SND), Sorbonne Université, CNRS, 75005 Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université Des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université Des Antilles, Paris, France.
| |
Collapse
|
6
|
Abstract
All genomes include gene families with very limited taxonomic distributions that potentially represent new genes and innovations in protein-coding sequence, raising questions on the origins of such genes. Some of these genes are hypothesized to have formed de novo, from noncoding sequences, and recent work has begun to elucidate the processes by which de novo gene formation can occur. A special case of de novo gene formation, overprinting, describes the origin of new genes from noncoding alternative reading frames of existing open reading frames (ORFs). We argue that additionally, out-of-frame gene fission/fusion events of alternative reading frames of ORFs and out-of-frame lateral gene transfers could contribute to the origin of new gene families. To demonstrate this, we developed an original pattern-search in sequence similarity networks, enhancing the use of these graphs, commonly used to detect in-frame remodeled genes. We applied this approach to gene families in 524 complete genomes of Escherichia coli. We identified 767 gene families whose evolutionary history likely included at least one out-of-frame remodeling event. These genes with out-of-frame components represent ∼2.5% of all genes in the E. coli pangenome, suggesting that alternative reading frames of existing ORFs can contribute to a significant proportion of de novo genes in bacteria.
Collapse
Affiliation(s)
- Andrew K Watson
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, Paris, 75005, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, Paris, 75005, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, Paris, 75005, France
| |
Collapse
|
7
|
Teulière J, Bernard C, Bapteste E. Interspecific interactions that affect ageing: Age-distorters manipulate host ageing to their own evolutionary benefits. Ageing Res Rev 2021; 70:101375. [PMID: 34082078 DOI: 10.1016/j.arr.2021.101375] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 05/22/2021] [Accepted: 05/26/2021] [Indexed: 02/07/2023]
Abstract
Genetic causes for ageing are traditionally investigated within a species. Yet, the lifecycles of many organisms intersect. Additional evolutionary and genetic causes of ageing, external to a focal species/organism, may thus be overlooked. Here, we introduce the phrase and concept of age-distorters and its evidence. Age-distorters carry ageing interfering genes, used to manipulate the biological age of other entities upon which the reproduction of age-distorters relies, e.g. age-distorters bias the reproduction/maintenance trade-offs of cells/organisms for their own evolutionary interests. Candidate age-distorters include viruses, parasites and symbionts, operating through specific, genetically encoded interferences resulting from co-evolution and arms race between manipulative non-kins and manipulable species. This interference results in organismal ageing when age-distorters prompt manipulated organisms to favor their reproduction at the expense of their maintenance, turning these hosts into expanded disposable soma. By relying on reproduction/maintenance trade-offs affecting disposable entities, which are left ageing to the reproductive benefit of other physically connected lineages with conflicting evolutionary interests, the concept of age-distorters expands the logic of the Disposable Soma theory beyond species with fixed germen/soma distinctions. Moreover, acknowledging age-distorters as external sources of mutation accumulation and antagonistic pleiotropic genes expands the scope of the mutation accumulation and of the antagonistic pleiotropy theories.
Collapse
Affiliation(s)
- Jérôme Teulière
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Charles Bernard
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France.
| |
Collapse
|
8
|
Watson AK, Lopez P, Bapteste E. Retracing lineage history: time to emphasize genetic turnover. Trends Microbiol 2021; 29:957-958. [PMID: 34452802 DOI: 10.1016/j.tim.2021.08.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 07/07/2021] [Accepted: 08/02/2021] [Indexed: 10/20/2022]
Affiliation(s)
- Andrew K Watson
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, 75005 Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, 75005 Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, 75005 Paris, France.
| |
Collapse
|
9
|
Arroyo AS, Iannes R, Bapteste E, Ruiz-Trillo I. Corrigendum to: Gene Similarity Networks Unveil a Potential Novel Unicellular Group Closely Related to Animals from the Tara Oceans Expedition. Genome Biol Evol 2021; 13:6356204. [PMID: 34418058 PMCID: PMC8380070 DOI: 10.1093/gbe/evab140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
10
|
Lannes R, Cavaud L, Lopez P, Bapteste E. Marine Ultrasmall Prokaryotes Likely Affect the Cycling of Carbon, Methane, Nitrogen, and Sulfur. Genome Biol Evol 2020; 13:6039174. [PMID: 33325996 PMCID: PMC7851587 DOI: 10.1093/gbe/evaa261] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/11/2020] [Indexed: 12/23/2022] Open
Abstract
Recently, we uncovered the genetic components from six carbon fixation autotrophic pathways in cleaned ultrasmall size fractions from marine samples (<0.22 µm) gathered worldwide by the Tara Oceans Expedition. This first finding suggested that prokaryotic nanoorganisms, phylogenetically distantly related to the known CPR and DPANN groups, could collectively impact carbon cycling and carbon fixation across the world's ocean. To extend our mining of the functional and taxonomic microbial dark matter from the ultrasmall size fraction from the Tara Oceans Expedition, we investigated the distribution of 28 metabolic pathways associated with the cycling of carbon, methane, nitrogen, and sulfur. For all of these pathways, we report the existence not only of novel metabolic homologs in the ultrasmall size fraction of the oceanic microbiome, associated with nanoorganisms belonging to the CPR and DPANN lineages, but also of metabolic homologs exclusively found in marine host taxa belonging to other (still unassigned) microbial lineages. Therefore, we conclude that marine nanoorganisms contribute to a greater diversity of key biogeochemical cycles than currently appreciated. In particular, we suggest that oceanic nanoorganisms may be involved in a metabolic loop around Acetyl-CoA, have an underappreciated genetic potential to degrade methane, contribute to sustaining redox-reactions by producing Coenzyme F420, and affect sulfur cycling, notably as they harbor a complete suite of homologs of enzymes of the SOX system.
Collapse
Affiliation(s)
- Romain Lannes
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Louise Cavaud
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| |
Collapse
|
11
|
Bapteste E, Papale F. Modeling the evolution of interconnected processes: It is the song and the singers: Tracking units of selection with interaction networks. Bioessays 2020; 43:e2000077. [PMID: 33165956 DOI: 10.1002/bies.202000077] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 09/18/2020] [Accepted: 09/24/2020] [Indexed: 01/04/2023]
Abstract
Recently, Doolittle and Inkpen formulated a thought provoking theory, asserting that evolution by natural selection was responsible for the sideways evolution of two radically different kinds of selective units (also called Domains). The former entities, termed singers, correspond to the usual objects studied by evolutionary biologists (gene, genomes, individuals, species, etc.), whereas the later, termed songs, correspond to re-produced biological and ecosystemic functions, processes, information, and memes. Singers perform songs through selected patterns of interactions, meaning that a wealth of critical phenomena might receive novel evolutionary explanations. However, this theory did not provide an empirical approach to study evolution in such a broadened context. Here, we show that analyzing songs and singers, using patterns of interaction networks as a common ontology for both, offers a novel, actionable, inclusive and mathematical way to analyze not only the re-production but also the evolution and fitness of biological and ecosystemic interconnected processes.
Collapse
Affiliation(s)
- Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, 7, quai Saint Bernard, Bâtiment A 4ème étage, pièce 427, Paris, 75005, France
| | - François Papale
- Departement of Philosophy, University of Montreal, 2910 Édouard-Montpetit blvd, Montréal, QC, H3C 3J7, Canada
| |
Collapse
|
12
|
Teulière J, Bernard G, Bapteste E. The Distribution of Genes Associated With Regulated Cell Death Is Decoupled From the Mitochondrial Phenotypes Within Unicellular Eukaryotic Hosts. Front Cell Dev Biol 2020; 8:536389. [PMID: 33072737 PMCID: PMC7539657 DOI: 10.3389/fcell.2020.536389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 09/07/2020] [Indexed: 11/13/2022] Open
Abstract
Genetically regulated cell death (RCD) occurs in all domains of life. In eukaryotes, the evolutionary origin of the mitochondrion and of certain forms of RCD, in particular apoptosis, are thought to coincide, suggesting a central general role for mitochondria in cellular suicide. We tested this mitochondrial centrality hypothesis across a dataset of 67 species of protists, presenting 5 classes of mitochondrial phenotypes, including functional mitochondria, metabolically diversified mitochondria, functionally reduced mitochondria (Mitochondrion Related Organelle or MRO) and even complete absence of mitochondria. We investigated the distribution of genes associated with various forms of RCD. No homologs for described mammalian regulators of regulated necrosis could be identified in our set of 67 unicellular taxa. Protists with MRO and the secondarily a mitochondriate Monocercomonoides exilis display heterogeneous reductions of apoptosis gene sets with respect to typical mitochondriate protists. Remarkably, despite the total lack of mitochondria in M. exilis, apoptosis-associated genes could still be identified. These same species of protists with MRO and M. exilis harbored non-reduced autophagic cell death gene sets. Moreover, transiently multicellular protist taxa appeared enriched in apoptotic and autophagy associated genes compared to free-living protists. This analysis suggests that genes associated with apoptosis in animals and the presence of the mitochondria are significant yet non-essential biological components for RCD in protists. More generally, our results support the hypothesis of a selection for RCD, including both apoptosis and autophagy, as a developmental mechanism linked to multicellularity.
Collapse
Affiliation(s)
- Jérôme Teulière
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Guillaume Bernard
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| |
Collapse
|
13
|
Bernard C, Li Y, Lopez P, Bapteste E. Beyond arbitrium: identification of a second communication system in Bacillus phage phi3T that may regulate host defense mechanisms. ISME J 2020; 15:545-549. [PMID: 33028977 PMCID: PMC8027211 DOI: 10.1038/s41396-020-00795-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 09/17/2020] [Accepted: 09/24/2020] [Indexed: 12/20/2022]
Abstract
The evolutionary stability of temperate bacteriophages at low abundance of susceptible bacterial hosts lies in the trade-off between the maximization of phage replication, performed by the host-destructive lytic cycle, and the protection of the phage-host collective, enacted by lysogeny. Upon Bacillus infection, Bacillus phages phi3T rely on the “arbitrium” quorum sensing (QS) system to communicate on their population density in order to orchestrate the lysis-to-lysogeny transition. At high phage densities, where there may be limited host cells to infect, lysogeny is induced to preserve chances of phage survival. Here, we report the presence of an additional, host-derived QS system in the phi3T genome, making it the first known virus with two communication systems. Specifically, this additional system, coined “Rapφ-Phrφ”, is predicted to downregulate host defense mechanisms during the viral infection, but only upon stress or high abundance of Bacillus cells and at low density of population of the phi3T phages. Post-lysogenization, Rapφ-Phrφ is also predicted to provide the lysogenized bacteria with an immediate fitness advantage: delaying the costly production of public goods while nonetheless benefiting from the public goods produced by other non-lysogenized Bacillus bacteria. The discovered “Rapφ-Phrφ” QS system hence provides novel mechanistic insights into how phage communication systems could contribute to the phage-host evolutionary stability.
Collapse
Affiliation(s)
- Charles Bernard
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Muséum National d'Histoire Naturelle, Campus Jussieu, Bâtiment A, 4eme et. Pièce 429, 75005, Paris, France. .,Unité Molécules de Communication et Adaptation des Micro-organismes (MCAM), CNRS, Muséum National d'Histoire Naturelle, CP 54, 57 rue Cuvier, 75005, Paris, France.
| | - Yanyan Li
- Unité Molécules de Communication et Adaptation des Micro-organismes (MCAM), CNRS, Muséum National d'Histoire Naturelle, CP 54, 57 rue Cuvier, 75005, Paris, France
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Muséum National d'Histoire Naturelle, Campus Jussieu, Bâtiment A, 4eme et. Pièce 429, 75005, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Muséum National d'Histoire Naturelle, Campus Jussieu, Bâtiment A, 4eme et. Pièce 429, 75005, Paris, France
| |
Collapse
|
14
|
Arroyo AS, Lannes R, Bapteste E, Ruiz-Trillo I. Gene Similarity Networks Unveil a Potential Novel Unicellular Group Closely Related to Animals from the Tara Oceans Expedition. Genome Biol Evol 2020; 12:1664-1678. [PMID: 32533833 PMCID: PMC7533066 DOI: 10.1093/gbe/evaa117] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2020] [Indexed: 12/21/2022] Open
Abstract
The Holozoa clade comprises animals and several unicellular lineages (choanoflagellates, filastereans, and teretosporeans). Understanding their full diversity is essential to address the origins of animals and other evolutionary questions. However, they are poorly known. To provide more insights into the real diversity of holozoans and check for undiscovered diversity, we here analyzed 18S rDNA metabarcoding data from the global Tara Oceans expedition. To overcome the low phylogenetic information contained in the metabarcoding data set (composed of sequences from the short V9 region of the gene), we used similarity networks by combining two data sets: unknown environmental sequences from Tara Oceans and known reference sequences from GenBank. We then calculated network metrics to compare environmental sequences with reference sequences. These metrics reflected the divergence between both types of sequences and provided an effective way to search for evolutionary relevant diversity, further validated by phylogenetic placements. Our results showed that the percentage of unicellular holozoan diversity remains hidden. We found novelties in several lineages, especially in Acanthoecida choanoflagellates. We also identified a potential new holozoan group that could not be assigned to any of the described extant clades. Data on geographical distribution showed that, although ubiquitous, each unicellular holozoan lineage exhibits a different distribution pattern. We also identified a positive association between new animal hosts and the ichthyosporean symbiont Creolimax fragrantissima, as well as for other holozoans previously reported as free-living. Overall, our analyses provide a fresh perspective into the diversity and ecology of unicellular holozoans, highlighting the amount of undescribed diversity.
Collapse
Affiliation(s)
- Alicia S Arroyo
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Romain Lannes
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d’Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Iñaki Ruiz-Trillo
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
- Departament de Genètica, Microbiologia I Estadística, Institut de Recerca de la Biodiversitat, Universitat de Barcelona, Spain
- ICREA, Barcelona, Spain
| |
Collapse
|
15
|
Watson A, Habib M, Bapteste E. Phylosystemics: Merging Phylogenomics, Systems Biology, and Ecology to Study Evolution. Trends Microbiol 2020; 28:176-190. [DOI: 10.1016/j.tim.2019.10.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 10/21/2019] [Accepted: 10/22/2019] [Indexed: 11/28/2022]
|
16
|
Lord E, Pathmanathan JS, Corel E, Makarenkov V, Lopez P, Bouchard F, Bhattacharya D, Antoine PO, Le Guyader H, Lapointe FJ, Bapteste E. Introducing Trait Networks to Elucidate the Fluidity of Organismal Evolution Using Palaeontological Data. Genome Biol Evol 2019; 11:2653-2665. [PMID: 31504500 PMCID: PMC6761957 DOI: 10.1093/gbe/evz182] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/03/2019] [Indexed: 11/25/2022] Open
Abstract
Explaining the evolution of animals requires ecological, developmental, paleontological, and phylogenetic considerations because organismal traits are affected by complex evolutionary processes. Modeling a plurality of processes, operating at distinct time-scales on potentially interdependent traits, can benefit from approaches that are complementary treatments to phylogenetics. Here, we developed an inclusive network approach, implemented in the command line software ComponentGrapher, and analyzed trait co-occurrence of rhinocerotoid mammals. We identified stable, unstable, and pivotal traits, as well as traits contributing to complexes, that may follow to a common developmental regulation, that point to an early implementation of the postcranial Bauplan among rhinocerotoids. Strikingly, most identified traits are highly dissociable, used repeatedly in distinct combinations and in different taxa, which usually do not form clades. Therefore, the genes encoding these traits are likely recruited into novel gene regulation networks during the course of evolution. Our evo-systemic framework, generalizable to other evolved organizations, supports a pluralistic modeling of organismal evolution, including trees and networks.
Collapse
Affiliation(s)
- Etienne Lord
- Département d'informatique, Université du Québec à Montréal, Montréal, Québec, Canada
- Département de sciences biologiques, Université de Montréal, Montréal, Québec, Canada
| | - Jananan S Pathmanathan
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eduardo Corel
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Vladimir Makarenkov
- Département d'informatique, Université du Québec à Montréal, Montréal, Québec, Canada
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Frédéric Bouchard
- Département de Philosophie, Université de Montreal, Montréal, Quebec, Canada
| | | | - Pierre-Olivier Antoine
- Institut des Sciences de l'Evolution, cc64, Université de Montpellier, CNRS, Université des Antilles, IRD, EPHE, Montpellier, France
| | - Hervé Le Guyader
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - François-Joseph Lapointe
- Département de sciences biologiques, Université de Montréal, Montréal, Québec, Canada
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| |
Collapse
|
17
|
Abstract
Autotrophic carbon fixation is a crucial process for sustaining life on Earth. To date, six pathways, the Calvin–Benson–Bassham cycle, the reductive tricarboxylic acid cycle, the 3-hydroxypropionate bi-cycle, the Wood–Ljungdahl pathway, the dicarboxylate/4-hydroxybutyrate cycle, and the 4-hydroxybutyrate cycle, have been described. Nano-organisms such as members of the Candidate Phyla Radiation (CPR) bacterial superphylum and the Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota, Nanohalorchaeota (DPANN) archaeal superphylum could deeply impact carbon cycling and carbon fixation in ways that are still to be determined. CPR and DPANN are ubiquitous in the environment but understudied; their gene contents are not exhaustively described; and their metabolisms are not yet fully understood. Here, the completeness of each of the above pathways was quantified and tested for the presence of all key enzymes in nano-organisms from across the World Ocean. The novel marine ultrasmall prokaryotes were demonstrated to collectively harbor the genes required for carbon fixation, in particular the “energetically efficient” dicarboxylate/4-hydroxybutyrate pathway and the 4-hydroxybutyrate pathway. This contrasted with the known carbon metabolic pathways associated with CPR members in aquifers, where they are described as degraders (Castelle CJ, et al. 2015. Genomic expansion of domain archaea highlights roles for organisms from new phyla in anaerobic carbon cycling. Curr Biol. 25(6):690–701; Castelle CJ, et al. 2018. Biosynthetic capacity, metabolic variety and unusual biology in the CPR and DPANN radiations. Nat Rev Microbiol. 16(10):629–645; Anantharaman K, et al. 2016. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. Nat Commun. 7:13219.). Our findings suggest that nano-organisms have a broader contribution to carbon fixation and cycling than currently assumed. Furthermore, CPR and DPANN superphyla are possibly not the only nanosized prokaryotes; therefore, the discovery of new autotrophic marine nano-organisms by future single cell genomics is anticipated.
Collapse
Affiliation(s)
- Romain Lannes
- Sorbonne Université, Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Karen Olsson-Francis
- School of Environment, Earth and Ecosystems, The Open University, Milton Keynes, United Kingdom
| | - Philippe Lopez
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| | - Eric Bapteste
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Sorbonne Université, CNRS, Museum National d'Histoire Naturelle, EPHE, Université des Antilles, Paris, France
| |
Collapse
|
18
|
Abstract
Extensive microbial gene flows affect how we understand virology, microbiology, medical sciences, genetic modification, and evolutionary biology. Phylogenies only provide a narrow view of these gene flows: plasmids and viruses, lacking core genes, cannot be attached to cellular life on phylogenetic trees. Yet viruses and plasmids have a major impact on cellular evolution, affecting both the gene content and the dynamics of microbial communities. Using bipartite graphs that connect up to 149,000 clusters of homologous genes with 8,217 related and unrelated genomes, we can in particular show patterns of gene sharing that do not map neatly with the organismal phylogeny. Homologous genes are recycled by lateral gene transfer, and multiple copies of homologous genes are carried by otherwise completely unrelated (and possibly nested) genomes, that is, viruses, plasmids and prokaryotes. When a homologous gene is present on at least one plasmid or virus and at least one chromosome, a process of "gene externalization," affected by a postprocessed selected functional bias, takes place, especially in Bacteria. Bipartite graphs give us a view of vertical and horizontal gene flow beyond classic taxonomy on a single very large, analytically tractable, graph that goes beyond the cellular Web of Life.
Collapse
Affiliation(s)
- Eduardo Corel
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - Raphaël Méheust
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - Andrew K Watson
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - James O McInerney
- Chair in Evolutionary Biology, The University of Manchester, United Kingdom
| | - Philippe Lopez
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - Eric Bapteste
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| |
Collapse
|
19
|
Abstract
Genes evolve by point mutations, but also by shuffling, fusion, and fission of genetic fragments. Therefore, similarity between two sequences can be due to common ancestry producing homology, and/or partial sharing of component fragments. Disentangling these processes is especially challenging in large molecular data sets, because of computational time. In this article, we present CompositeSearch, a memory-efficient, fast, and scalable method to detect composite gene families in large data sets (typically in the range of several million sequences). CompositeSearch generalizes the use of similarity networks to detect composite and component gene families with a greater recall, accuracy, and precision than recent programs (FusedTriplets and MosaicFinder). Moreover, CompositeSearch provides user-friendly quality descriptions regarding the distribution and primary sequence conservation of these gene families allowing critical biological analyses of these data.
Collapse
Affiliation(s)
| | - Philippe Lopez
- Institut de Biologie Paris-Seine (IBPS), UPMC Université Paris 06, Sorbonne Universités, Paris, France
| | | | - Eric Bapteste
- Institut de Biologie Paris-Seine (IBPS), UPMC Université Paris 06, Sorbonne Universités, Paris, France
| |
Collapse
|
20
|
Watson AK, Lannes R, Pathmanathan JS, Méheust R, Karkar S, Colson P, Corel E, Lopez P, Bapteste E. The Methodology Behind Network Thinking: Graphs to Analyze Microbial Complexity and Evolution. Methods Mol Biol 2019; 1910:271-308. [PMID: 31278668 DOI: 10.1007/978-1-4939-9074-0_9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In the post genomic era, large and complex molecular datasets from genome and metagenome sequencing projects expand the limits of what is possible for bioinformatic analyses. Network-based methods are increasingly used to complement phylogenetic analysis in studies in molecular evolution, including comparative genomics, classification, and ecological studies. Using network methods, the vertical and horizontal relationships between all genes or genomes, whether they are from cellular chromosomes or mobile genetic elements, can be explored in a single expandable graph. In recent years, development of new methods for the construction and analysis of networks has helped to broaden the availability of these approaches from programmers to a diversity of users. This chapter introduces the different kinds of networks based on sequence similarity that are already available to tackle a wide range of biological questions, including sequence similarity networks, gene-sharing networks and bipartite graphs, and a guide for their construction and analyses.
Collapse
Affiliation(s)
- Andrew K Watson
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
| | - Romain Lannes
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
| | - Jananan S Pathmanathan
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
| | - Raphaël Méheust
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
| | - Slim Karkar
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
- Department of Ecology, Evolution, and Natural Resources, School of Environmental and Biological Sciences, Rutgers, The State University of NJ, New Brunswick, NJ, USA
| | - Philippe Colson
- Fondation Institut Hospitalo-Universitaire Méditerranée Infection, Pôle des Maladies Infectieuses et Tropicales Clinique et Biologique, Fédération de Bactériologie-Hygiène-Virologie, Centre Hospitalo-Universitaire Tione, Assistance Publique-Hôpitaux de Marseille, Marseille, France
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE) UM63, CNRS 7278, IRD 198, INSERM U1095, Aix-Marseille University, Marseille, France
| | - Eduardo Corel
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
| | - Philippe Lopez
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France
| | - Eric Bapteste
- Sorbonne Universités, Institut de Biologie Paris-Seine, UPMC Université Paris 6, Paris, France.
| |
Collapse
|
21
|
Bernard G, Pathmanathan JS, Lannes R, Lopez P, Bapteste E. Microbial Dark Matter Investigations: How Microbial Studies Transform Biological Knowledge and Empirically Sketch a Logic of Scientific Discovery. Genome Biol Evol 2018; 10:707-715. [PMID: 29420719 PMCID: PMC5830969 DOI: 10.1093/gbe/evy031] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/05/2018] [Indexed: 02/07/2023] Open
Abstract
Microbes are the oldest and most widespread, phylogenetically and metabolically diverse life forms on Earth. However, they have been discovered only 334 years ago, and their diversity started to become seriously investigated even later. For these reasons, microbial studies that unveil novel microbial lineages and processes affecting or involving microbes deeply (and repeatedly) transform knowledge in biology. Considering the quantitative prevalence of taxonomically and functionally unassigned sequences in environmental genomics data sets, and that of uncultured microbes on the planet, we propose that unraveling the microbial dark matter should be identified as a central priority for biologists. Based on former empirical findings of microbial studies, we sketch a logic of discovery with the potential to further highlight the microbial unknowns.
Collapse
Affiliation(s)
- Guillaume Bernard
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), France
| | - Jananan S Pathmanathan
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), France
| | - Romain Lannes
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), France
| | - Philippe Lopez
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), France
| | - Eric Bapteste
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), France
| |
Collapse
|
22
|
Corel E, Pathmanathan JS, Watson AK, Karkar S, Lopez P, Bapteste E. MultiTwin: A Software Suite to Analyze Evolution at Multiple Levels of Organization Using Multipartite Graphs. Genome Biol Evol 2018; 10:2777-2784. [PMID: 30247672 PMCID: PMC6199892 DOI: 10.1093/gbe/evy209] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2018] [Indexed: 01/08/2023] Open
Abstract
The inclusion of introgressive processes in evolutionary studies induces a less constrained view of evolution. Network-based methods (like large-scale similarity networks) allow to include in comparative genomics all extrachromosomic carriers (like viruses, the most abundant biological entities on the planet) with their cellular hosts. The integration of several levels of biological organization (genes, genomes, communities, environments) enables more comprehensive analyses of gene sharing and improved sequence-based classifications. However, the algorithmic tools for the analysis of such networks are usually restricted to people with high programming skills. We present an integrated suite of software tools named MultiTwin, aimed at the construction, structuring, and analysis of multipartite graphs for evolutionary biology. Typically, this kind of graph is useful for the comparative analysis of the gene content of genomes in microbial communities from the environment and for exploring patterns of gene sharing, for example between distantly related cellular genomes, pangenomes, or between cellular genomes and their mobile genetic elements. We illustrate the use of this tool with an application of the bipartite approach (using gene family-genome graphs) for the analysis of pathogenicity traits in prokaryotes.
Collapse
Affiliation(s)
- Eduardo Corel
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Sorbonne Université, Paris, France
| | - Jananan S Pathmanathan
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Sorbonne Université, Paris, France
| | - Andrew K Watson
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Sorbonne Université, Paris, France
| | - Slim Karkar
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Sorbonne Université, Paris, France
| | - Philippe Lopez
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Sorbonne Université, Paris, France
| | - Eric Bapteste
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Sorbonne Université, Paris, France
| |
Collapse
|
23
|
Méheust R, Watson AK, Lapointe FJ, Papke RT, Lopez P, Bapteste E. Hundreds of novel composite genes and chimeric genes with bacterial origins contributed to haloarchaeal evolution. Genome Biol 2018; 19:75. [PMID: 29880023 PMCID: PMC5992828 DOI: 10.1186/s13059-018-1454-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 05/16/2018] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Haloarchaea, a major group of archaea, are able to metabolize sugars and to live in oxygenated salty environments. Their physiology and lifestyle strongly contrast with that of their archaeal ancestors. Amino acid optimizations, which lowered the isoelectric point of haloarchaeal proteins, and abundant lateral gene transfers from bacteria have been invoked to explain this deep evolutionary transition. We use network analyses to show that the evolution of novel genes exclusive to Haloarchaea also contributed to the evolution of this group. RESULTS We report the creation of 320 novel composite genes, both early in the evolution of Haloarchaea during haloarchaeal genesis and later in diverged haloarchaeal groups. One hundred and twenty-six of these novel composite genes derived from genetic material from bacterial genomes. These latter genes, largely involved in metabolic functions but also in oxygenic lifestyle, constitute a different gene pool from the laterally acquired bacterial genes formerly identified. These novel composite genes were likely advantageous for their hosts, since they show significant residence times in haloarchaeal genomes-consistent with a long phylogenetic history involving vertical descent and lateral gene transfer-and encode proteins with optimized isoelectric points. CONCLUSIONS Overall, our work encourages a systematic search for composite genes across all archaeal major groups, in order to better understand the origins of novel prokaryotic genes, and in order to test to what extent archaea might have adjusted their lifestyles by incorporating and recycling laterally acquired bacterial genetic fragments into new archaeal genes.
Collapse
Affiliation(s)
- Raphaël Méheust
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 7138 Evolution Paris Seine, 75005, Paris, France
| | - Andrew K Watson
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 7138 Evolution Paris Seine, 75005, Paris, France
| | | | - R Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269, USA
| | - Philippe Lopez
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 7138 Evolution Paris Seine, 75005, Paris, France
| | - Eric Bapteste
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 7138 Evolution Paris Seine, 75005, Paris, France.
| |
Collapse
|
24
|
Abstract
The classic Darwinian theory and the Synthetic evolutionary theory and their linear models, while invaluable to study the origins and evolution of species, are not primarily designed to model the evolution of organisations, typically that of ecosystems, nor that of processes. How could evolutionary theory better explain the evolution of biological complexity and diversity? Inclusive network-based analyses of dynamic systems could retrace interactions between (related or unrelated) components. This theoretical shift from a Tree of Life to a Dynamic Interaction Network of Life, which is supported by diverse molecular, cellular, microbiological, organismal, ecological and evolutionary studies, would further unify evolutionary biology.
Collapse
Affiliation(s)
- Eric Bapteste
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), F-75005 Paris, France
- CNRS, UMR7138, Institut de Biologie Paris-Seine, F-75005 Paris, France
| | - Philippe Huneman
- Institut d’Histoire et de Philosophie des Sciences et des Techniques (CNRS / Paris I Sorbonne), F-75006 Paris, France
| |
Collapse
|
25
|
Abstract
BACKGROUND Eukaryotes evolved from the symbiotic association of at least two prokaryotic partners, and a good deal is known about the timings, mechanisms, and dynamics of these evolutionary steps. Recently, it was shown that a new class of nuclear genes, symbiogenetic genes (S-genes), was formed concomitant with endosymbiosis and the subsequent evolution of eukaryotic photosynthetic lineages. Understanding their origins and contributions to eukaryogenesis would provide insights into the ways in which cellular complexity has evolved. RESULTS Here, we show that chimeric nuclear genes (S-genes), built from prokaryotic domains, are critical for explaining the leap forward in cellular complexity achieved during eukaryogenesis. A total of 282 S-gene families contributed solutions to many of the challenges faced by early eukaryotes, including enhancing the informational machinery, processing spliceosomal introns, tackling genotoxicity within the cell, and ensuring functional protein interactions in a larger, more compartmentalized cell. For hundreds of S-genes, we confirmed the origins of their components (bacterial, archaeal, or generally prokaryotic) by maximum likelihood phylogenies. Remarkably, Bacteria contributed nine-fold more S-genes than Archaea, including a two-fold greater contribution to informational functions. Therefore, there is an additional, large bacterial contribution to the evolution of eukaryotes, implying that fundamental eukaryotic properties do not strictly follow the traditional informational/operational divide for archaeal/bacterial contributions to eukaryogenesis. CONCLUSION This study demonstrates the extent and process through which prokaryotic fragments from bacterial and archaeal genes inherited during eukaryogenesis underly the creation of novel chimeric genes with important functions.
Collapse
Affiliation(s)
- Raphaël Méheust
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France
| | - Debashish Bhattacharya
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08901, USA
| | - Jananan S Pathmanathan
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France
| | - James O McInerney
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, M13 9PL, Manchester, UK
| | - Philippe Lopez
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France
| | - Eric Bapteste
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Evolution Paris Seine - Institut de Biologie Paris Seine (EPS - IBPS), 75005, Paris, France.
| |
Collapse
|
26
|
Jaffe AL, Corel E, Pathmanathan JS, Lopez P, Bapteste E. Bipartite graph analyses reveal interdomain LGT involving ultrasmall prokaryotes and their divergent, membrane-related proteins. Environ Microbiol 2016; 18:5072-5081. [PMID: 27485833 DOI: 10.1111/1462-2920.13477] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 07/27/2016] [Indexed: 12/24/2022]
Abstract
Based on their small size and genomic properties, ultrasmall prokaryotic groups like the Candidate Phyla Radiation have been proposed as possible symbionts dependent on other bacteria or archaea. In this study, we use a bipartite graph analysis to examine patterns of sequence similarity between draft and complete genomes from ultrasmall bacteria and other complete prokaryotic genomes, assessing whether the former group might engage in significant gene transfer (or even endosymbioses) with other community members. Our results provide preliminary evidence for many lateral gene transfers with other prokaryotes, including members of the archaea, and report the presence of divergent, membrane-associated proteins among these ultrasmall taxa. In particular, these divergent genes were found in TM6 relatives of the intracellular parasite Babela massiliensis.
Collapse
Affiliation(s)
- Alexander L Jaffe
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 Quai St. Bernard 75005, Paris, France
| | - Eduardo Corel
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 Quai St. Bernard 75005, Paris, France
| | - Jananan Sylvestre Pathmanathan
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 Quai St. Bernard 75005, Paris, France
| | - Philippe Lopez
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 Quai St. Bernard 75005, Paris, France
| | - Eric Bapteste
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 Quai St. Bernard 75005, Paris, France
| |
Collapse
|
27
|
List JM, Pathmanathan JS, Lopez P, Bapteste E. Unity and disunity in evolutionary sciences: process-based analogies open common research avenues for biology and linguistics. Biol Direct 2016; 11:39. [PMID: 27544206 PMCID: PMC4992195 DOI: 10.1186/s13062-016-0145-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2016] [Accepted: 08/06/2016] [Indexed: 11/13/2022] Open
Abstract
Background For a long time biologists and linguists have been noticing surprising similarities between the evolution of life forms and languages. Most of the proposed analogies have been rejected. Some, however, have persisted, and some even turned out to be fruitful, inspiring the transfer of methods and models between biology and linguistics up to today. Most proposed analogies were based on a comparison of the research objects rather than the processes that shaped their evolution. Focusing on process-based analogies, however, has the advantage of minimizing the risk of overstating similarities, while at the same time reflecting the common strategy to use processes to explain the evolution of complexity in both fields. Results We compared important evolutionary processes in biology and linguistics and identified processes specific to only one of the two disciplines as well as processes which seem to be analogous, potentially reflecting core evolutionary processes. These new process-based analogies support novel methodological transfer, expanding the application range of biological methods to the field of historical linguistics. We illustrate this by showing (i) how methods dealing with incomplete lineage sorting offer an introgression-free framework to analyze highly mosaic word distributions across languages; (ii) how sequence similarity networks can be used to identify composite and borrowed words across different languages; (iii) how research on partial homology can inspire new methods and models in both fields; and (iv) how constructive neutral evolution provides an original framework for analyzing convergent evolution in languages resulting from common descent (Sapir’s drift). Conclusions Apart from new analogies between evolutionary processes, we also identified processes which are specific to either biology or linguistics. This shows that general evolution cannot be studied from within one discipline alone. In order to get a full picture of evolution, biologists and linguists need to complement their studies, trying to identify cross-disciplinary and discipline-specific evolutionary processes. The fact that we found many process-based analogies favoring transfer from biology to linguistics further shows that certain biological methods and models have a broader scope than previously recognized. This opens fruitful paths for collaboration between the two disciplines. Reviewers This article was reviewed by W. Ford Doolittle and Eugene V. Koonin. Electronic supplementary material The online version of this article (doi:10.1186/s13062-016-0145-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Johann-Mattis List
- CRLAO/EHESS, 2 rue de Lille, Paris, 75007, France. .,Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 quai St Bernard, Paris, 75005, France.
| | - Jananan Sylvestre Pathmanathan
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 quai St Bernard, Paris, 75005, France
| | - Philippe Lopez
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 quai St Bernard, Paris, 75005, France
| | - Eric Bapteste
- Equipe AIRE, UMR 7138, Laboratoire Evolution Paris-Seine, Université Pierre et Marie Curie, 7 quai St Bernard, Paris, 75005, France
| |
Collapse
|
28
|
Forster D, Bittner L, Karkar S, Dunthorn M, Romac S, Audic S, Lopez P, Stoeck T, Bapteste E. Testing ecological theories with sequence similarity networks: marine ciliates exhibit similar geographic dispersal patterns as multicellular organisms. BMC Biol 2015; 13:16. [PMID: 25762112 PMCID: PMC4381497 DOI: 10.1186/s12915-015-0125-5] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2014] [Accepted: 01/28/2015] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND High-throughput sequencing technologies are lifting major limitations to molecular-based ecological studies of eukaryotic microbial diversity, but analyses of the resulting millions of short sequences remain a major bottleneck for these approaches. Here, we introduce the analytical and statistical framework of sequence similarity networks, increasingly used in evolutionary studies and graph theory, into the field of ecology to analyze novel pyrosequenced V4 small subunit rDNA (SSU-rDNA) sequence data sets in the context of previous studies, including SSU-rDNA Sanger sequence data from cultured ciliates and from previous environmental diversity inventories. RESULTS Our broadly applicable protocol quantified the progress in the description of genetic diversity of ciliates by environmental SSU-rDNA surveys, detected a fundamental historical bias in the tendency to recover already known groups in these surveys, and revealed substantial amounts of hidden microbial diversity. Moreover, network measures demonstrated that ciliates are not globally dispersed, but are structured by habitat and geographical location at intermediate geographical scale, as observed for bacteria, plants, and animals. CONCLUSIONS Currently available 'universal' primers used for local in-depth sequencing surveys provide little hope to exhaust the significantly higher ciliate (and most likely microbial) diversity than previously thought. Network analyses such as presented in this study offer a promising way to guide the design of novel primers and to further explore this vast and structured microbial diversity.
Collapse
Affiliation(s)
- Dominik Forster
- Department of Ecology, University of Kaiserslautern, Erwin-Schrödinger-Straße 14, Kaiserslautern, D-67633, Germany.
| | - Lucie Bittner
- Department of Ecology, University of Kaiserslautern, Erwin-Schrödinger-Straße 14, Kaiserslautern, D-67633, Germany.
- CNRS, FR3631, Institut de Biologie Paris-Seine, Paris, F-75005, France.
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris-Seine (IBPS), Paris, F-75005, France.
| | - Slim Karkar
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris-Seine (IBPS), Paris, F-75005, France.
- CNRS, UMR7138, Institut de Biologie Paris-Seine, Paris, F-75005, France.
| | - Micah Dunthorn
- Department of Ecology, University of Kaiserslautern, Erwin-Schrödinger-Straße 14, Kaiserslautern, D-67633, Germany.
| | - Sarah Romac
- CNRS, UMR 7144, Station Biologique de Roscoff, Place Georges Teissier, Roscoff, F-29680, France.
- Sorbonne Universités, UPMC Univ Paris 06, UMR 7144, Station Biologique de Roscoff, Place Georges Teissier, Roscoff, F-29680, France.
| | - Stéphane Audic
- CNRS, UMR 7144, Station Biologique de Roscoff, Place Georges Teissier, Roscoff, F-29680, France.
- Sorbonne Universités, UPMC Univ Paris 06, UMR 7144, Station Biologique de Roscoff, Place Georges Teissier, Roscoff, F-29680, France.
| | - Philippe Lopez
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris-Seine (IBPS), Paris, F-75005, France.
- CNRS, UMR7138, Institut de Biologie Paris-Seine, Paris, F-75005, France.
| | - Thorsten Stoeck
- Department of Ecology, University of Kaiserslautern, Erwin-Schrödinger-Straße 14, Kaiserslautern, D-67633, Germany.
| | - Eric Bapteste
- Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris-Seine (IBPS), Paris, F-75005, France.
- CNRS, UMR7138, Institut de Biologie Paris-Seine, Paris, F-75005, France.
| |
Collapse
|
29
|
Méheust R, Lopez P, Bapteste E. Metabolic bacterial genes and the construction of high-level composite lineages of life. Trends Ecol Evol 2015; 30:127-9. [PMID: 25601290 PMCID: PMC4359277 DOI: 10.1016/j.tree.2015.01.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2014] [Revised: 01/02/2015] [Accepted: 01/06/2015] [Indexed: 11/17/2022]
Abstract
Understanding how major organismal lineages originated is fundamental for understanding processes by which life evolved. Major evolutionary transitions, like eukaryogenesis, merging genetic material from distantly related organisms, are rare events, hence difficult ones to explain causally. If most archaeal lineages emerged after massive acquisitions of bacterial genes, a rule however arises: metabolic bacterial genes contributed to all major evolutionary transitions.
Collapse
Affiliation(s)
- Raphaël Méheust
- UMR7138 Evolution Paris-Seine, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, 9 quai saint Bernard, 75005 Paris, France
| | - Philippe Lopez
- UMR7138 Evolution Paris-Seine, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, 9 quai saint Bernard, 75005 Paris, France
| | - Eric Bapteste
- UMR7138 Evolution Paris-Seine, Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, 9 quai saint Bernard, 75005 Paris, France.
| |
Collapse
|
30
|
Abstract
Although heterokaryons have been reported in nature, multicellular organisms are generally assumed genetically homogeneous. Here, we investigate the case of arbuscular mycorrhizal fungi (AMF) that form symbiosis with plant roots. The growth advantages they confer to their hosts are of great potential benefit to sustainable agricultural practices. However, measuring genetic diversity for these coenocytes is a major challenge: Within the same cytoplasm, AMF contain thousands of nuclei and show extremely high levels of genetic variation for some loci. The extent and physical location of polymorphism within and between AMF genomes is unclear. We used two complementary strategies to estimate genetic diversity in AMF, investigating polymorphism both on a genome scale and in putative single copy loci. First, we used data from whole-genome pyrosequencing of four AMF isolates to describe genetic diversity, based on a conservative network-based clustering approach. AMF isolates showed marked differences in genome-wide diversity patterns in comparison to a panel of control fungal genomes. This clustering approach further allowed us to provide conservative estimates of Rhizophagus spp. genomes sizes. Second, we designed new putative single copy genomic markers, which we investigated by massive parallel amplicon sequencing for two Rhizophagus irregularis and one Rhizophagus sp. isolates. Most loci showed high polymorphism, with up to 103 alleles per marker. This polymorphism could be distributed within or between nuclei. However, we argue that the Rhizophagus isolates under study might be heterokaryotic, at least for the putative single copy markers we studied. Considering that genetic information is the main resource for identification of AMF, we suggest that special attention is warranted for the study of these ecologically important organisms.
Collapse
Affiliation(s)
- Eva Boon
- Département de Sciences Biologiques, Institut de Recherche en Biologie Végétale, Université de Montréal, Quebec, Canada
| | - Sébastien Halary
- Département de Sciences Biologiques, Institut de Recherche en Biologie Végétale, Université de Montréal, Quebec, Canada
| | - Eric Bapteste
- CNRS, UMR7138, Institut de Biologie Paris-Seine, Paris, France Sorbonne Universités, UPMC Univ Paris 06, Institut de Biologie Paris-Seine (IBPS), Paris, France
| | - Mohamed Hijri
- Département de Sciences Biologiques, Institut de Recherche en Biologie Végétale, Université de Montréal, Quebec, Canada
| |
Collapse
|
31
|
Cheng S, Karkar S, Bapteste E, Yee N, Falkowski P, Bhattacharya D. Sequence similarity network reveals the imprints of major diversification events in the evolution of microbial life. Front Ecol Evol 2014. [DOI: 10.3389/fevo.2014.00072] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
|
32
|
Affiliation(s)
- Tal Dagan
- Institute of Microbiology, Christian-Albrechts-University of Kiel, Germany
| | - Eric Bapteste
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, France
| | - James O McInerney
- Department of Biology, National University of Ireland Maynooth, Ireland
| | - William F Martin
- Institute of Molecular Evolution, Heinrich-Heine University Düsseldorf, Germany
| |
Collapse
|
33
|
Jachiet PA, Colson P, Lopez P, Bapteste E. Extensive gene remodeling in the viral world: new evidence for nongradual evolution in the mobilome network. Genome Biol Evol 2014; 6:2195-205. [PMID: 25104113 PMCID: PMC4202312 DOI: 10.1093/gbe/evu168] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Complex nongradual evolutionary processes such as gene remodeling are difficult to model, to visualize, and to investigate systematically. Despite these challenges, the creation of composite (or mosaic) genes by combination of genetic segments from unrelated gene families was established as an important adaptive phenomena in eukaryotic genomes. In contrast, almost no general studies have been conducted to quantify composite genes in viruses. Although viral genome mosaicism has been well-described, the extent of gene mosaicism and its rules of emergence remain largely unexplored. Applying methods from graph theory to inclusive similarity networks, and using data from more than 3,000 complete viral genomes, we provide the first demonstration that composite genes in viruses are 1) functionally biased, 2) involved in key aspects of the arm race between cells and viruses, and 3) can be classified into two distinct types of composite genes in all viral classes. Beyond the quantification of the widespread recombination of genes among different viruses of the same class, we also report a striking sharing of genetic information between viruses of different classes and with different nucleic acid types. This latter discovery provides novel evidence for the existence of a large and complex mobilome network, which appears partly bound by the sharing of genetic information and by the formation of composite genes between mobile entities with different genetic material. Considering that there are around 10E31 viruses on the planet, gene remodeling appears as a hugely significant way of generating and moving novel sequences between different kinds of organisms on Earth.
Collapse
Affiliation(s)
- Pierre-Alain Jachiet
- UMR CNRS 7138 Evolution Paris Seine, IBPS, Université Pierre et Marie Curie, Paris, France
| | - Philippe Colson
- URMITE UMR CNRS 6236 IRD 198, Facultés de Médecine et de Pharmacie, Université de la Méditerranée, Marseille, France Pôle des Maladies Infectieuses et Tropicales Clinique et Biologique, Fédération de Bactériologie-Hygiène-Virologie, Centre Hospitalo-Universitaire Timone, Marseille, France
| | - Philippe Lopez
- UMR CNRS 7138 Evolution Paris Seine, IBPS, Université Pierre et Marie Curie, Paris, France
| | - Eric Bapteste
- UMR CNRS 7138 Evolution Paris Seine, IBPS, Université Pierre et Marie Curie, Paris, France
| |
Collapse
|
34
|
Bapteste E. The origins of microbial adaptations: how introgressive descent, egalitarian evolutionary transitions and expanded kin selection shape the network of life. Front Microbiol 2014; 5:83. [PMID: 24624128 PMCID: PMC3940903 DOI: 10.3389/fmicb.2014.00083] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 02/16/2014] [Indexed: 11/20/2022] Open
Affiliation(s)
- Eric Bapteste
- UPMC, Institut de Biologie Paris Seine, UMR7138 'Evolution Paris Seine' Paris, France ; CNRS, Institut de Biologie Paris Seine, UMR7138 'Evolution Paris Seine' Paris, France
| |
Collapse
|
35
|
Haggerty LS, Jachiet PA, Hanage WP, Fitzpatrick DA, Lopez P, O'Connell MJ, Pisani D, Wilkinson M, Bapteste E, McInerney JO. A pluralistic account of homology: adapting the models to the data. Mol Biol Evol 2013; 31:501-16. [PMID: 24273322 PMCID: PMC3935183 DOI: 10.1093/molbev/mst228] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Defining homologous genes is important in many evolutionary studies but raises obvious issues. Some of these issues are conceptual and stem from our assumptions of how a gene evolves, others are practical, and depend on the algorithmic decisions implemented in existing software. Therefore, to make progress in the study of homology, both ontological and epistemological questions must be considered. In particular, defining homologous genes cannot be solely addressed under the classic assumptions of strong tree thinking, according to which genes evolve in a strictly tree-like fashion of vertical descent and divergence and the problems of homology detection are primarily methodological. Gene homology could also be considered under a different perspective where genes evolve as “public goods,” subjected to various introgressive processes. In this latter case, defining homologous genes becomes a matter of designing models suited to the actual complexity of the data and how such complexity arises, rather than trying to fit genetic data to some a priori tree-like evolutionary model, a practice that inevitably results in the loss of much information. Here we show how important aspects of the problems raised by homology detection methods can be overcome when even more fundamental roots of these problems are addressed by analyzing public goods thinking evolutionary processes through which genes have frequently originated. This kind of thinking acknowledges distinct types of homologs, characterized by distinct patterns, in phylogenetic and nonphylogenetic unrooted or multirooted networks. In addition, we define “family resemblances” to include genes that are related through intermediate relatives, thereby placing notions of homology in the broader context of evolutionary relationships. We conclude by presenting some payoffs of adopting such a pluralistic account of homology and family relationship, which expands the scope of evolutionary analyses beyond the traditional, yet relatively narrow focus allowed by a strong tree-thinking view on gene evolution.
Collapse
Affiliation(s)
- Leanne S Haggerty
- Bioinformatics and Molecular Evolution Unit, Department of Biology, National University of Ireland Maynooth, Maynooth, Co. Kildare, Ireland
| | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Misner I, Bicep C, Lopez P, Halary S, Bapteste E, Lane CE. Sequence comparative analysis using networks: software for evaluating de novo transcript assembly from next-generation sequencing. Mol Biol Evol 2013; 30:1975-86. [PMID: 23666209 DOI: 10.1093/molbev/mst087] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
DNA sequencing technology is becoming more accessible to a variety of researchers as costs continue to decline. As researchers begin to sequence novel transcriptomes, most of these data sets lack a reference genome and will have to rely on de novo assemblers. Making comparisons across assemblies can be difficult: each program has its strengths and weaknesses, and no tool exists to comparatively evaluate these data sets. We developed software in R, called Sequence Comparative Analysis using Networks (SCAN), to perform statistical comparisons between distinct assemblies. SCAN uses a reference data set to identify the most accurate de novo assembly and the "good" transcripts in the user's data. We tested SCAN on three publicly available transcriptomes, each assembled using three assembly programs. Moreover, we sequenced the transcriptome of the oomycete Achlya hypogyna and compared de novo assemblies from Velvet, ABySS, and the CLC Genomics Workbench assembly algorithms. One thousand one hundred twenty-eight of the CLC transcripts were statistically similar to the reference, compared with 49 of the Velvet transcripts and 937 of the ABySS transcripts. SCAN's strength is providing statistical support for transcript assemblies in a biological context. However, SCAN is designed to compare distinct node sets in networks, therefore it can also easily be extended to perform statistical comparisons on any network graph regardless of what the nodes represent.
Collapse
Affiliation(s)
- Ian Misner
- Department of Biological Sciences, University of Rhode Island, RI, USA
| | | | | | | | | | | |
Collapse
|
37
|
Abstract
Standard microbial evolutionary ontology is organized according to a nested hierarchy of entities at various levels of biological organization. It typically detects and defines these entities in relation to the most stable aspects of evolutionary processes, by identifying lineages evolving by a process of vertical inheritance from an ancestral entity. However, recent advances in microbiology indicate that such an ontology has important limitations. The various dynamics detected within microbiological systems reveal that a focus on the most stable entities (or features of entities) over time inevitably underestimates the extent and nature of microbial diversity. These dynamics are not the outcome of the process of vertical descent alone. Other processes, often involving causal interactions between entities from distinct levels of biological organisation, or operating at different time scales, are responsible not only for the destabilisation of pre-existing entities, but also for the emergence and stabilisation of novel entities in the microbial world. In this article we consider microbial entities as more or less stabilised functional wholes, and sketch a network-based ontology that can represent a diverse set of processes including, for example, as well as phylogenetic relations, interactions that stabilise or destabilise the interacting entities, spatial relations, ecological connections, and genetic exchanges. We use this pluralistic framework for evaluating (i) the existing ontological assumptions in evolution (e.g. whether currently recognized entities are adequate for understanding the causes of change and stabilisation in the microbial world), and (ii) for identifying hidden ontological kinds, essentially invisible from within a more limited perspective. We propose to recognize additional classes of entities that provide new insights into the structure of the microbial world, namely "processually equivalent" entities, "processually versatile" entities, and "stabilized" entities.
Collapse
Affiliation(s)
- Eric Bapteste
- />UMR CNRS 7138, Université Pierre et Marie Curie, 75005 Paris, France
| | - John Dupré
- />ESRC Centre for Genomics in Society (Egenis), University of Exeter, Exeter, UK
| |
Collapse
|
38
|
Bhattacharya D, Price DC, Bicep C, Bapteste E, Sarwade M, Rajah VD, Yoon HS. Identification of a Marine Cyanophage in a Protist Single-cell Metagenome Assembly. J Phycol 2013; 49:207-212. [PMID: 27008402 DOI: 10.1111/jpy.12028] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2012] [Accepted: 08/27/2012] [Indexed: 06/05/2023]
Abstract
Analysis of microbial biodiversity is hampered by a lack of reference genomes from most bacteria, viruses, and algae. This necessitates either the cultivation of a restricted number of species for standard sequencing projects or the analysis of highly complex environmental DNA metagenome data. Single-cell genomics (SCG) offers a solution to this problem by constraining the studied DNA sample to an individual cell and its associated symbionts, prey, and pathogens. We used SCG to study marine heterotrophic amoebae related to Paulinella ovalis (A. Wulff) P.W. Johnson, P.E. Hargraves & J.M. Sieburth (Rhizaria). The genus Paulinella is best known for its photosynthetic members such as P. chromatophora Lauterborn that is the only case of plastid primary endosymbiosis known outside of algae and plants. Here, we studied the phagotrophic sister taxa of P. chromatophora that are related to P. ovalis and found one SCG assembly to contain α-cyanobacterial DNA. These cyanobacterial contigs are presumably derived from prey. We also uncovered an associated cyanophage lineage (provisionally named phage PoL_MC2). Phylogenomic analysis of the fragmented genome assembly suggested a minimum genome size of 200 Kbp for phage PoL_MC2 that encodes 179 proteins and is most closely related to Synechococcus phage S-SM2. For this phage, gene network analysis demonstrates a highly modular genome structure typical of other cyanophages. Our work demonstrates that SCG is a powerful approach for discovering algal and protist biodiversity and for elucidating biotic interactions in natural samples.
Collapse
Affiliation(s)
- Debashish Bhattacharya
- Department of Ecology, Evolution and Natural Resources and Institute of Marine and Coastal Science, Rutgers University, New Brunswick, New Jersey, 08901, USA
| | - Dana C Price
- Department of Ecology, Evolution and Natural Resources and Institute of Marine and Coastal Science, Rutgers University, New Brunswick, New Jersey, 08901, USA
| | - Cedric Bicep
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique 7138, Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, 75005, France
| | - Eric Bapteste
- Unité Mixte de Recherche, Centre National de la Recherche Scientifique 7138, Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, 75005, France
| | - Mihir Sarwade
- Department of Ecology, Evolution Natural Resources, Rutgers University, New Brunswick, New Jersey, 08901, USA
| | - Veeran D Rajah
- Department of Ecology, Evolution Natural Resources, Rutgers University, New Brunswick, New Jersey, 08901, USA
| | - Hwan Su Yoon
- Department of Biological Sciences, Sungkyunkwan University, Suwon, 440-746, Korea
| |
Collapse
|
39
|
Jachiet PA, Pogorelcnik R, Berry A, Lopez P, Bapteste E. MosaicFinder: identification of fused gene families in sequence similarity networks. ACTA ACUST UNITED AC 2013; 29:837-44. [PMID: 23365410 DOI: 10.1093/bioinformatics/btt049] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
MOTIVATION Gene fusion is an important evolutionary process. It can yield valuable information to infer the interactions and functions of proteins. Fused genes have been identified as non-transitive patterns of similarity in triplets of genes. To be computationally tractable, this approach usually imposes an a priori distinction between a dataset in which fused genes are searched for, and a dataset that may have provided genetic material for fusion. This reduces the 'genetic space' in which fusion can be discovered, as only a subset of triplets of genes is investigated. Moreover, this approach may have a high-false-positive rate, and it does not identify gene families descending from a common fusion event. RESULTS We represent similarities between sequences as a network. This leads to an efficient formulation of previous methods of fused gene identification, which we implemented in the Python program FusedTriplets. Furthermore, we propose a new characterization of families of fused genes, as clique minimal separators of the sequence similarity network. This well-studied graph topology provides a robust and fast method of detection, well suited for automatic analyses of big datasets. We implemented this method in the C++ program MosaicFinder, which additionally uses local alignments to discard false-positive candidates and indicates potential fusion points. The grouping into families will help distinguish sequencing or prediction errors from real biological fusions, and it will yield additional insight into the function and history of fused genes. AVAILABILITY FusedTriplets and MosaicFinder are published under the GPL license and are freely available with their source code at this address: http://sourceforge.net/projects/mosaicfinder. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pierre-Alain Jachiet
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, 75005 Paris, France
| | | | | | | | | |
Collapse
|
40
|
Bapteste E, Bicep C, Lopez P. Evolution of genetic diversity using networks: the human gut microbiome as a case study. Clin Microbiol Infect 2012; 18 Suppl 4:40-3. [DOI: 10.1111/j.1469-0691.2012.03856.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
41
|
Abstract
Ever since Darwin, the familiar genealogical pattern known as the Tree of Life (TOL) has been prominent in evolutionary thinking and has dominated not only systematics, but also the analysis of the units of evolution. However, recent findings indicate that the evolution of DNA, especially in prokaryotes and such DNA vehicles as viruses and plasmids, does not follow a unique tree-like pattern. Because evolutionary patterns track a greater range of processes than those captured in genealogies, genealogical patterns are in fact only a subset of a broader set of evolutionary patterns. This fact suggests that evolutionists who focus exclusively on genealogical patterns are blocked from providing a significant range of genuine evolutionary explanations. Consequently, we highlight challenges to tree-based approaches, and point the way toward more appropriate methods to study evolution (although we do not present them in technical detail). We argue that there is significant benefit in adopting wider range of models, evolutionary representations, and evolutionary explanations, based on an analysis of the full range of evolutionary processes. We introduce an ecosystem orientation into evolutionary thinking that highlights the importance of "type 1 coalitions" (functionally related units with genetic exchanges, aka "friends with genetic benefits"), "type 2 coalitions" (functionally related units without genetic exchanges), "communal interactions," and "emergent evolutionary properties." On this basis, we seek to promote the study of (especially prokaryotic) evolution with dynamic evolutionary networks, which are less constrained than the TOL, and to provide new ways to analyze an expanded range of evolutionary units (genetic modules, recombined genes, plasmids, phages and prokaryotic genomes, pangenomes, microbial communities) and evolutionary processes. Finally, we discuss some of the conceptual and practical questions raised by such network-based representation.
Collapse
|
42
|
McInerney JO, Pisani D, Bapteste E, O'Connell MJ. The Public Goods Hypothesis for the evolution of life on Earth. Biol Direct 2011; 6:41. [PMID: 21861918 PMCID: PMC3179745 DOI: 10.1186/1745-6150-6-41] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2011] [Accepted: 08/23/2011] [Indexed: 02/01/2023] Open
Abstract
It is becoming increasingly difficult to reconcile the observed extent of horizontal gene transfers with the central metaphor of a great tree uniting all evolving entities on the planet. In this manuscript we describe the Public Goods Hypothesis and show that it is appropriate in order to describe biological evolution on the planet. According to this hypothesis, nucleotide sequences (genes, promoters, exons, etc.) are simply seen as goods, passed from organism to organism through both vertical and horizontal transfer. Public goods sequences are defined by having the properties of being largely non-excludable (no organism can be effectively prevented from accessing these sequences) and non-rival (while such a sequence is being used by one organism it is also available for use by another organism). The universal nature of genetic systems ensures that such non-excludable sequences exist and non-excludability explains why we see a myriad of genes in different combinations in sequenced genomes. There are three features of the public goods hypothesis. Firstly, segments of DNA are seen as public goods, available for all organisms to integrate into their genomes. Secondly, we expect the evolution of mechanisms for DNA sharing and of defense mechanisms against DNA intrusion in genomes. Thirdly, we expect that we do not see a global tree-like pattern. Instead, we expect local tree-like patterns to emerge from the combination of a commonage of genes and vertical inheritance of genomes by cell division. Indeed, while genes are theoretically public goods, in reality, some genes are excludable, particularly, though not only, when they have variant genetic codes or behave as coalition or club goods, available for all organisms of a coalition to integrate into their genomes, and non-rival within the club. We view the Tree of Life hypothesis as a regionalized instance of the Public Goods hypothesis, just like classical mechanics and euclidean geometry are seen as regionalized instances of quantum mechanics and Riemannian geometry respectively. We argue for this change using an axiomatic approach that shows that the Public Goods hypothesis is a better accommodation of the observed data than the Tree of Life hypothesis.
Collapse
Affiliation(s)
- James O McInerney
- Molecular Evolution and Bioinformatics Unit, Department of Biology, National University of Ireland Maynooth, County Kildare, Ireland.
| | | | | | | |
Collapse
|
43
|
Beauregard-Racine J, Bicep C, Schliep K, Lopez P, Lapointe FJ, Bapteste E. Of woods and webs: possible alternatives to the tree of life for studying genomic fluidity in E. coli. Biol Direct 2011; 6:39; discussion 39. [PMID: 21774799 PMCID: PMC3160433 DOI: 10.1186/1745-6150-6-39] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Accepted: 07/20/2011] [Indexed: 12/26/2022] Open
Abstract
Background We introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli. This case study illustrates how additional analyses could offer fast heuristic alternatives to standard tree of life (TOL) approaches. Results We use gene networks to identify genes with atypical modes of evolution, and genome networks to characterize the evolution of genetic partnerships between E. coli and mobile genetic elements. We develop a novel polychromatic quartet method to capture patterns of recombination within E. coli, to update the clanistic toolkit, and to search for the impact of lateral gene transfer and of pathogenicity on gene evolution in two large forests of trees bearing E. coli. We unravel high rates of lateral gene transfer involving E. coli (about 40% of the trees under study), and show that both core genes and shell genes of E. coli are affected by non-tree-like evolutionary processes. We show that pathogenic lifestyle impacted the structure of 30% of the gene trees, and that pathogenic strains are more likely to transfer genes with one another than with non-pathogenic strains. In addition, we propose five groups of genes as candidate mobile modules of pathogenicity. We also present strong evidence for recent lateral gene transfer between E. coli and mobile genetic elements. Conclusions Depending on which evolutionary questions biologists want to address (i.e. the identification of modules, genetic partnerships, recombination, lateral gene transfer, or genes with atypical evolutionary modes, etc.), forest-based and network-based methods are preferable to the reconstruction of a single tree, because they provide insights and produce hypotheses about the dynamics of genome evolution, rather than the relative branching order of species and lineages. Such a methodological pluralism - the use of woods and webs - is to be encouraged to analyse the evolutionary processes at play in microbial evolution. This manuscript was reviewed by: Ford Doolittle, Tal Pupko, Richard Burian, James McInerney, Didier Raoult, and Yan Boucher
Collapse
|
44
|
Abstract
Congruence is a broadly applied notion in evolutionary biology used to justify multigene phylogeny or phylogenomics, as well as in studies of coevolution, lateral gene transfer, and as evidence for common descent. Existing methods for identifying incongruence or heterogeneity using character data were designed for data sets that are both small and expected to be rarely incongruent. At the same time, methods that assess incongruence using comparison of trees test a null hypothesis of uncorrelated tree structures, which may be inappropriate for phylogenomic studies. As such, they are ill-suited for the growing number of available genome sequences, most of which are from prokaryotes and viruses, either for phylogenomic analysis or for studies of the evolutionary forces and events that have shaped these genomes. Specifically, many existing methods scale poorly with large numbers of genes, cannot accommodate high levels of incongruence, and do not adequately model patterns of missing taxa for different markers. We propose the development of novel incongruence assessment methods suitable for the analysis of the molecular evolution of the vast majority of life and support the investigation of homogeneity of evolutionary process in cases where markers do not share identical tree structures.
Collapse
Affiliation(s)
- Jessica W Leigh
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
| | | | | | | |
Collapse
|
45
|
Leigh JW, Schliep K, Lopez P, Bapteste E. Let Them Fall Where They May: Congruence Analysis in Massive Phylogenetically Messy Data Sets. Mol Biol Evol 2011; 28:2773-85. [DOI: 10.1093/molbev/msr110] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
|
46
|
Abstract
Phylogenomic studies produce increasingly large phylogenetic forests of trees with patchy taxonomical sampling. Typically, prokaryotic data generate thousands of gene trees of all sizes that are difficult, if not impossible, to root. Their topologies do not match the genealogy of lineages, as they are influenced not only by duplication, losses, and vertical descent but also by lateral gene transfer (LGT) and recombination. Because this complexity in part reflects the diversity of evolutionary processes, the study of phylogenetic forests is thus a great opportunity to improve our understanding of prokaryotic evolution. Here, we show how the rich evolutionary content of such novel phylogenetic objects can be exploited through the development of new approaches designed specifically for extracting the multiple evolutionary signals present in the forest of life, that is, by slicing up trees into remarkable bits and pieces: clans, slices, and clips. We harvested a forest of 6,901 unrooted gene trees comprising up to 100 prokaryotic genomes (41 archaea and 59 bacteria) to search for evolutionary events that a species tree would not account for. We identified 1) trees and partitions of trees that reflected the lifestyle of organisms rather than their taxonomy, 2) candidate lifestyle-specific genetic modules, used by distinct unrelated organisms to adapt to the same environment, 3) gene families, nonrandomly distributed in the functional space, that were frequently exchanged between archaea and bacteria, sometimes without major changes in their sequences. Finally, 4) we reconstructed polarized networks of genetic partnerships between archaea and bacteria to describe some of the rules affecting LGT between these two Domains.
Collapse
Affiliation(s)
- Klaus Schliep
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Muséum National d'Histoire Naturelle, Paris, France
| | | | | | | |
Collapse
|
47
|
Bittner L, Halary S, Payri C, Cruaud C, de Reviers B, Lopez P, Bapteste E. Some considerations for analyzing biodiversity using integrative metagenomics and gene networks. Biol Direct 2010; 5:47. [PMID: 20673351 PMCID: PMC2921367 DOI: 10.1186/1745-6150-5-47] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Accepted: 07/30/2010] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND Improving knowledge of biodiversity will benefit conservation biology, enhance bioremediation studies, and could lead to new medical treatments. However there is no standard approach to estimate and to compare the diversity of different environments, or to study its past, and possibly, future evolution. PRESENTATION OF THE HYPOTHESIS We argue that there are two conditions for significant progress in the identification and quantification of biodiversity. First, integrative metagenomic studies - aiming at the simultaneous examination (or even better at the integration) of observations about the elements, functions and evolutionary processes captured by the massive sequencing of multiple markers - should be preferred over DNA barcoding projects and over metagenomic projects based on a single marker. Second, such metagenomic data should be studied with novel inclusive network-based approaches, designed to draw inferences both on the many units and on the many processes present in the environments. TESTING THE HYPOTHESIS We reached these conclusions through a comparison of the theoretical foundations of two molecular approaches seeking to assess biodiversity: metagenomics (mostly used on prokaryotes and protists) and DNA barcoding (mostly used on multicellular eukaryotes), and by pragmatic considerations of the issues caused by the 'species problem' in biodiversity studies. IMPLICATIONS OF THE HYPOTHESIS Evolutionary gene networks reduce the risk of producing biodiversity estimates with limited explanatory power, biased either by unequal rates of LGT, or difficult to interpret due to (practical) problems caused by type I and type II grey zones. Moreover, these networks would easily accommodate additional (meta)transcriptomic and (meta)proteomic data.
Collapse
Affiliation(s)
- Lucie Bittner
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Muséum National d'Histoire Naturelle, Paris, France
| | - Sébastien Halary
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, France
| | - Claude Payri
- UR227, IRD-BPA5, Nouméa, Nouvelle-Calédonie, France
| | | | - Bruno de Reviers
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Muséum National d'Histoire Naturelle, Paris, France
| | - Philippe Lopez
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, France
| | - Eric Bapteste
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, Paris, France
| |
Collapse
|
48
|
Bapteste E, O'Malley MA, Beiko RG, Ereshefsky M, Gogarten JP, Franklin-Hall L, Lapointe FJ, Dupré J, Dagan T, Boucher Y, Martin W. Prokaryotic evolution and the tree of life are two different things. Biol Direct 2009; 4:34. [PMID: 19788731 PMCID: PMC2761302 DOI: 10.1186/1745-6150-4-34] [Citation(s) in RCA: 128] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2009] [Accepted: 09/29/2009] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND The concept of a tree of life is prevalent in the evolutionary literature. It stems from attempting to obtain a grand unified natural system that reflects a recurrent process of species and lineage splittings for all forms of life. Traditionally, the discipline of systematics operates in a similar hierarchy of bifurcating (sometimes multifurcating) categories. The assumption of a universal tree of life hinges upon the process of evolution being tree-like throughout all forms of life and all of biological time. In multicellular eukaryotes, the molecular mechanisms and species-level population genetics of variation do indeed mainly cause a tree-like structure over time. In prokaryotes, they do not. Prokaryotic evolution and the tree of life are two different things, and we need to treat them as such, rather than extrapolating from macroscopic life to prokaryotes. In the following we will consider this circumstance from philosophical, scientific, and epistemological perspectives, surmising that phylogeny opted for a single model as a holdover from the Modern Synthesis of evolution. RESULTS It was far easier to envision and defend the concept of a universal tree of life before we had data from genomes. But the belief that prokaryotes are related by such a tree has now become stronger than the data to support it. The monistic concept of a single universal tree of life appears, in the face of genome data, increasingly obsolete. This traditional model to describe evolution is no longer the most scientifically productive position to hold, because of the plurality of evolutionary patterns and mechanisms involved. Forcing a single bifurcating scheme onto prokaryotic evolution disregards the non-tree-like nature of natural variation among prokaryotes and accounts for only a minority of observations from genomes. CONCLUSION Prokaryotic evolution and the tree of life are two different things. Hence we will briefly set out alternative models to the tree of life to study their evolution. Ultimately, the plurality of evolutionary patterns and mechanisms involved, such as the discontinuity of the process of evolution across the prokaryote-eukaryote divide, summons forth a pluralistic approach to studying evolution. REVIEWERS This article was reviewed by Ford Doolittle, John Logsdon and Nicolas Galtier.
Collapse
|
49
|
|
50
|
Abstract
We describe the reasons why the newly recognized process of horizontal gene transfer (HGT) forces evolutionists who study classification and microbiology to go beyond the classical Darwinian framework. We recall the importance of processes in philosophical definitions of species and for taxonomical purposes in general. More precisely, we present a brief description of a possible transition from a thinking inspired by essentialism to eliminative pluralism in the species debate and we insist on a major philosophical lesson: that processes matter and that, consequently, HGT cannot be overlooked in microbial classification. We then expand the conclusions of eliminative pluralism to microbial classification, namely (i) that species are not real and (ii) that overlapping taxonomies are equally legitimate when they are based on real natural processes. We introduce alternatives to the traditional species concept and describe what we call evolutionary units. Two types of units can be described: coherent and composite. The former are sets of co-evolving genes, pathways, or organisms, which share the same phylogenetic origin, while the latter comprise genes, pathways, or organisms with component parts from multiple phylogenetic origins. These evolutionary units are either "mostly flexible" or "mostly rigid" in their genetic composition and we discuss how this dissimilarity could profoundly affect our systematics practice. In this chapter, we illustrate how much there is to learn from the reconstruction of the complex evolutionary histories of all evolutionary units - large or small - by giving up the notion of species for recombining microbes, and suggest replacing a unique nested hierarchy of life with a comprehensive database including overlapping taxonomical groups.
Collapse
|