1
|
Feng Y, Arsenault D, Louyakis AS, Altman-Price N, Gophna U, Papke RT, Gogarten JP. Using the pan-genomic framework for the discovery of genomic islands in the haloarchaeon Halorubrum ezzemoulense. mBio 2024; 15:e0040824. [PMID: 38619241 PMCID: PMC11078007 DOI: 10.1128/mbio.00408-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 03/19/2024] [Indexed: 04/16/2024] Open
Abstract
In this study, we use pan-genomics to characterize the genomic variability of the widely dispersed halophilic archaeal species Halorubrum ezzemoulense (Hez). We include a multi-regional sampling of newly sequenced, high-quality draft genomes. The pan-genome graph of the species reveals 50 genomic islands that represent rare accessory genetic capabilities available to members. Most notably, we observe rearrangements that have led to the insertion/recombination/replacement of mutually exclusive genomic islands in equivalent genome positions ("homeocassettes"). These conflicting islands encode for similar functions, but homologs from islands located between the same core genes exhibit high divergence on the amino acid level, while the neighboring core genes are nearly identical. Both islands of a homeocassette often coexist in the same geographic location, suggesting that either island may be beyond the reach of selective sweeps and that these loci of divergence between Hez members are maintained and persist long term. This implies that subsections of the population have different niche preferences and rare metabolic capabilities. After an evaluation of the gene content in the homeocassettes, we speculate that these islands may play a role in the speciation, niche adaptability, and group selection dynamics in Hez. Though homeocassettes are first described in this study, similar replacements and divergence of genes on genomic islands have been previously reported in other Haloarchaea and distantly related Archaea, suggesting that homeocassettes may be a feature in a wide range of organisms outside of Hez.IMPORTANCEThis study catalogs the rare genes discovered in strains of the species Halorubrum ezzemoulense (Hez), an obligate halophilic archaeon, through the perspective of its pan-genome. These rare genes are often found to be arranged on islands that confer metabolic and transport functions and contain genes that have eluded previous studies. The discovery of divergent, but homologous islands occupying equivalent genome positions ("homeocassettes") in different genomes, reveals significant new information on genome evolution in Hez. Homeocassette pairs encode for similar functions, but their dissimilarity and distribution imply high rates of recombination, different specializations, and niche preferences in Hez. The coexistence of both islands of a homeocassette pair in multiple environments demonstrates that both islands are beyond the reach of selective sweeps and that these genome content differences between strains persist long term. The switch between islands through recombination under different environmental conditions may lead to a greater range of niche adaptability in Hez.
Collapse
Affiliation(s)
- Yutian Feng
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Danielle Arsenault
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Artemis S. Louyakis
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Neta Altman-Price
- The Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Avinoam Adam Department of Natural Sciences, The Open University of Israel, Raanana, Israel
| | - Uri Gophna
- The Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - R. Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
2
|
Dmitrijeva M, Tackmann J, Matias Rodrigues JF, Huerta-Cepas J, Coelho LP, von Mering C. A global survey of prokaryotic genomes reveals the eco-evolutionary pressures driving horizontal gene transfer. Nat Ecol Evol 2024; 8:986-998. [PMID: 38443606 DOI: 10.1038/s41559-024-02357-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 02/05/2024] [Indexed: 03/07/2024]
Abstract
Horizontal gene transfer, the exchange of genetic material through means other than reproduction, is a fundamental force in prokaryotic genome evolution. Genomic persistence of horizontally transferred genes has been shown to be influenced by both ecological and evolutionary factors. However, there is limited availability of ecological information about species other than the habitats from which they were isolated, which has prevented a deeper exploration of ecological contributions to horizontal gene transfer. Here we focus on transfers detected through comparison of individual gene trees to the species tree, assessing the distribution of gene-exchanging prokaryotes across over a million environmental sequencing samples. By analysing detected horizontal gene transfer events, we show distinct functional profiles for recent versus old events. Although most genes transferred are part of the accessory genome, genes transferred earlier in evolution tend to be more ubiquitous within present-day species. We find that co-occurring, interacting and high-abundance species tend to exchange more genes. Finally, we show that host-associated specialist species are most likely to exchange genes with other host-associated specialist species, whereas species found across different habitats have similar gene exchange rates irrespective of their preferred habitat. Our study covers an unprecedented scale of integrated horizontal gene transfer and environmental information, highlighting broad eco-evolutionary trends.
Collapse
Affiliation(s)
- Marija Dmitrijeva
- Department of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zürich, Zurich, Switzerland
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zürich, Zurich, Switzerland
| | - Janko Tackmann
- Department of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zürich, Zurich, Switzerland
| | | | - Jaime Huerta-Cepas
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA-CSIC), Campus de Montegancedo-UPM, Madrid, Spain
| | - Luis Pedro Coelho
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
- Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology, Translational Research Institute, Woolloongabba, Queensland, Australia.
| | - Christian von Mering
- Department of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zürich, Zurich, Switzerland.
| |
Collapse
|
3
|
Ray M, Manu S, Rastogi G, Umapathy G. Cyanobacterial Genomes from a Brackish Coastal Lagoon Reveal Potential for Novel Biogeochemical Functions and Their Evolution. J Mol Evol 2024; 92:121-137. [PMID: 38489069 DOI: 10.1007/s00239-024-10159-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Accepted: 01/24/2024] [Indexed: 03/17/2024]
Abstract
Cyanobacteria are recognised for their pivotal roles in aquatic ecosystems, serving as primary producers and major agents in diazotrophic processes. Currently, the primary focus of cyanobacterial research lies in gaining a more detailed understanding of these well-established ecosystem functions. However, their involvement and impact on other crucial biogeochemical cycles remain understudied. This knowledge gap is partially attributed to the challenges associated with culturing cyanobacteria in controlled laboratory conditions and the limited understanding of their specific growth requirements. This can be circumvented partially by the culture-independent methods which can shed light on the genomic potential of cyanobacterial species and answer more profound questions about the evolution of other key biogeochemical functions. In this study, we assembled 83 cyanobacterial genomes from metagenomic data generated from environmental DNA extracted from a brackish water lagoon (Chilika Lake, India). We taxonomically classified these metagenome-assembled genomes (MAGs) and found that about 92.77% of them are novel genomes at the species level. We then annotated these cyanobacterial MAGs for all the encoded functions using KEGG Orthology. Interestingly, we found two previously unreported functions in Cyanobacteria, namely, DNRA (Dissimilatory Nitrate Reduction to Ammonium) and DMSP (Dimethylsulfoniopropionate) synthesis in multiple MAGs using nirBD and dsyB genes as markers. We validated their presence in several publicly available cyanobacterial isolate genomes. Further, we identified incongruities between the evolutionary patterns of species and the marker genes and elucidated the underlying reasons for these discrepancies. This study expands our overall comprehension of the contribution of cyanobacteria to the biogeochemical cycling in coastal brackish ecosystems.
Collapse
Affiliation(s)
- Manisha Ray
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad, 500007, India
| | - Shivakumara Manu
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad, 500007, India
| | - Gurdeep Rastogi
- Wetland Research and Training Centre, Chilika Development Authority, Balugaon, Odisha, 752030, India
| | - Govindhaswamy Umapathy
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad, 500007, India.
| |
Collapse
|
4
|
Liao BY, Weng MP, Chang TY, Chang AYF, Ching YH, Wu CH. Degeneration of the Olfactory System in a Murid Rodent that Evolved Diurnalism. Mol Biol Evol 2024; 41:msae037. [PMID: 38376543 PMCID: PMC10906987 DOI: 10.1093/molbev/msae037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/03/2024] [Accepted: 02/13/2024] [Indexed: 02/21/2024] Open
Abstract
In mammalian research, it has been debated what can initiate an evolutionary tradeoff between different senses, and the phenomenon of sensory tradeoff in rodents, the most abundant mammalian clade, is not evident. The Nile rat (Arvicanthis niloticus), a murid rodent, recently adapted to a diurnal niche through an evolutionary acquisition of daylight vision with enhanced visual acuity. As such, this model provides an opportunity for a cross-species investigation where comparative morphological and multi-omic analyses of the Nile rat are made with its closely related nocturnal species, e.g. the mouse (Mus musculus) and the rat (Rattus norvegicus). Thus, morphological examinations were performed, and evolutionary reductions in relative sizes of turbinal bone surfaces, the cribriform plate, and the olfactory bulb were discovered in Nile rats. Subsequently, we compared multiple murid genomes, and profiled olfactory epithelium transcriptomes of mice and Nile rats at various ages with RNA sequencing. The results further demonstrate that, in comparison with mouse olfactory receptor (OR) genes, Nile rat OR genes have experienced less frequent gain, more frequent loss, and more frequent expression reduction during their evolution. Furthermore, functional degeneration of coding sequences in the Nile rat lineage was found in OR genes, yet not in other genes. Taken together, these results suggest that acquisition of improved vision in the Nile rat has been accompanied by degeneration of both olfaction-related anatomical structures and OR gene repertoires, consistent with the hypothesis of an olfaction-vision tradeoff initiated by the switch from a nocturnal to a diurnal lifestyle in mammals.
Collapse
Affiliation(s)
- Ben-Yang Liao
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Meng-Pin Weng
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Ting-Yan Chang
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Andrew Ying-Fei Chang
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Yung-Hao Ching
- Department of Molecular Biology and Human Genetics, Tzu Chi University, Taiwan, Republic of China
| | - Chia-Hwa Wu
- Laboratory Animal Center, National Health Research Institutes, Taiwan, Republic of China
| |
Collapse
|
5
|
Rühlemann MC, Bang C, Gogarten JF, Hermes BM, Groussin M, Waschina S, Poyet M, Ulrich M, Akoua-Koffi C, Deschner T, Muyembe-Tamfum JJ, Robbins MM, Surbeck M, Wittig RM, Zuberbühler K, Baines JF, Leendertz FH, Franke A. Functional host-specific adaptation of the intestinal microbiome in hominids. Nat Commun 2024; 15:326. [PMID: 38182626 PMCID: PMC10770139 DOI: 10.1038/s41467-023-44636-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 12/20/2023] [Indexed: 01/07/2024] Open
Abstract
Fine-scale knowledge of the changes in composition and function of the human gut microbiome compared that of our closest relatives is critical for understanding the evolutionary processes underlying its developmental trajectory. To infer taxonomic and functional changes in the gut microbiome across hominids at different timescales, we perform high-resolution metagenomic-based analyzes of the fecal microbiome from over two hundred samples including diverse human populations, as well as wild-living chimpanzees, bonobos, and gorillas. We find human-associated taxa depleted within non-human apes and patterns of host-specific gut microbiota, suggesting the widespread acquisition of novel microbial clades along the evolutionary divergence of hosts. In contrast, we reveal multiple lines of evidence for a pervasive loss of diversity in human populations in correlation with a high Human Development Index, including evolutionarily conserved clades. Similarly, patterns of co-phylogeny between microbes and hosts are found to be disrupted in humans. Together with identifying individual microbial taxa and functional adaptations that correlate to host phylogeny, these findings offer insights into specific candidates playing a role in the diverging trajectories of the gut microbiome of hominids. We find that repeated horizontal gene transfer and gene loss, as well as the adaptation to transient microaerobic conditions appear to have played a role in the evolution of the human gut microbiome.
Collapse
Affiliation(s)
- M C Rühlemann
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany.
- Institute for Medical Microbiology and Hospital Epidemiology, Hannover Medical School, Hannover, Germany.
| | - C Bang
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - J F Gogarten
- Applied Zoology and Nature Conservation, University of Greifswald, Greifswald, Germany
- Helmholtz Institute for One Health, Helmholtz-Centre for Infection Research (HZI), Greifswald, Germany
- Epidemiology of Highly Pathogenic Microorganisms, Robert Koch Institute, Berlin, Germany
- Viral Evolution, Robert Koch Institute, Berlin, Germany
| | - B M Hermes
- Evolutionary Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Institute of Experimental Medicine, Kiel University, Kiel, Germany
| | - M Groussin
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - S Waschina
- Nutriinformatics Research Group, Institute for Human Nutrition and Food Science, Kiel University, Kiel, Germany
| | - M Poyet
- Institute of Experimental Medicine, Kiel University, Kiel, Germany
| | - M Ulrich
- Helmholtz Institute for One Health, Helmholtz-Centre for Infection Research (HZI), Greifswald, Germany
- Epidemiology of Highly Pathogenic Microorganisms, Robert Koch Institute, Berlin, Germany
| | - C Akoua-Koffi
- Training and Research Unit for in Medical Sciences, Alassane Ouattara University / University Teaching Hospital of Bouaké, Bouaké, Côte d'Ivoire
| | - T Deschner
- Comparative BioCognition, Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany
| | - J J Muyembe-Tamfum
- National Institute for Biomedical Research, National Laboratory of Public Health, Kinshasa, Democratic Republic of the Congo
| | - M M Robbins
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - M Surbeck
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - R M Wittig
- Institute of Cognitive Sciences, CNRS UMR5229 University Lyon 1, Bron Cedex, France
- Taï Chimpanzee Project, CSRS, Abidjan, Côte d'Ivoire
| | - K Zuberbühler
- Institute of Biology, University of Neuchatel, Neuchatel, Switzerland
- School of Psychology & Neuroscience, University of St Andrews, St Andrews, Scotland, UK
| | - J F Baines
- Evolutionary Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Institute of Experimental Medicine, Kiel University, Kiel, Germany
| | - F H Leendertz
- Helmholtz Institute for One Health, Helmholtz-Centre for Infection Research (HZI), Greifswald, Germany
- Epidemiology of Highly Pathogenic Microorganisms, Robert Koch Institute, Berlin, Germany
| | - A Franke
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany.
| |
Collapse
|
6
|
Cribbie EP, Doerr D, Chauve C. AGO, a Framework for the Reconstruction of Ancestral Syntenies and Gene Orders. Methods Mol Biol 2024; 2802:247-265. [PMID: 38819563 DOI: 10.1007/978-1-0716-3838-5_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Reconstructing ancestral gene orders from the genome data of extant species is an important problem in comparative and evolutionary genomics. In a phylogenomics setting that accounts for gene family evolution through gene duplication and gene loss, the reconstruction of ancestral gene orders involves several steps, including multiple sequence alignment, the inference of reconciled gene trees, and the inference of ancestral syntenies and gene adjacencies. For each of the steps of such a process, several methods can be used and implemented using a growing corpus of, often parameterized, tools; in practice, interfacing such tools into an ancestral gene order reconstruction pipeline is far from trivial. This chapter introduces AGO, a Python-based framework aimed at creating ancestral gene order reconstruction pipelines allowing to interface and parameterize different bioinformatics tools. The authors illustrate the features of AGO by reconstructing ancestral gene orders for the X chromosome of three ancestral Anopheles species using three different pipelines. AGO is freely available at https://github.com/cchauve/AGO-pipeline .
Collapse
Affiliation(s)
- Evan P Cribbie
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| | - Daniel Doerr
- Department for Endocrinology and Diabetology, Medical Faculty and University Hospital Düsseldorf, German Diabetes Center (DDZ), Leibniz Institute for Diabetes Research, and Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Cedric Chauve
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
7
|
Sinaimeri B, Urbini L, Sagot MF, Matias C. Cophylogeny Reconstruction Allowing for Multiple Associations Through Approximate Bayesian Computation. Syst Biol 2023; 72:1370-1386. [PMID: 37703307 DOI: 10.1093/sysbio/syad058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 09/05/2023] [Accepted: 09/11/2023] [Indexed: 09/15/2023] Open
Abstract
Phylogenetic tree reconciliation is extensively employed for the examination of coevolution between host and symbiont species. An important concern is the requirement for dependable cost values when selecting event-based parsimonious reconciliation. Although certain approaches deduce event probabilities unique to each pair of host and symbiont trees, which can subsequently be converted into cost values, a significant limitation lies in their inability to model the invasion of diverse host species by the same symbiont species (termed as a spread event), which is believed to occur in symbiotic relationships. Invasions lead to the observation of multiple associations between symbionts and their hosts (indicating that a symbiont is no longer exclusive to a single host), which are incompatible with the existing methods of coevolution. Here, we present a method called AmoCoala (an enhanced version of the tool Coala) that provides a more realistic estimation of cophylogeny event probabilities for a given pair of host and symbiont trees, even in the presence of spread events. We expand the classical 4-event coevolutionary model to include 2 additional outcomes, vertical and horizontal spreads, that lead to multiple associations. In the initial step, we estimate the probabilities of spread events using heuristic frequencies. Subsequently, in the second step, we employ an approximate Bayesian computation approach to infer the probabilities of the remaining 4 classical events (cospeciation, duplication, host switch, and loss) based on these values. By incorporating spread events, our reconciliation model enables a more accurate consideration of multiple associations. This improvement enhances the precision of estimated cost sets, paving the way to a more reliable reconciliation of host and symbiont trees. To validate our method, we conducted experiments on synthetic datasets and demonstrated its efficacy using real-world examples. Our results showcase that AmoCoala produces biologically plausible reconciliation scenarios, further emphasizing its effectiveness.
Collapse
Affiliation(s)
- Blerina Sinaimeri
- Libera Università Internazionale degli Studi Sociali Guido Carli, Rome, Department of Business and Management, Viale Romania, 32 - 00197, Rome, Italy
| | - Laura Urbini
- ERABLE team, Inria - Institut national de recherche en informatique et en automatique, Lyon, 56 Bd Niels Bohr, 69100 Villeurbanne, France
- Université de Lyon, F-69000, Lyon, France
- Laboratoire de Biométrie et Biologie Evolutive (LBBE), Centre National de la Recherche Scientifique (CNRS), UMR5558, 43 Boulevard du 11 Novembre 1918, 69622 Villurbanne Cedex, France
| | - Marie-France Sagot
- ERABLE team, Inria - Institut national de recherche en informatique et en automatique, Lyon, 56 Bd Niels Bohr, 69100 Villeurbanne, France
- Université de Lyon, F-69000, Lyon, France
- Laboratoire de Biométrie et Biologie Evolutive (LBBE), Centre National de la Recherche Scientifique (CNRS), UMR5558, 43 Boulevard du 11 Novembre 1918, 69622 Villurbanne Cedex, France
| | - Catherine Matias
- Sorbonne Université, Paris, France
- Université de Paris Cité, Paris, France
- Centre National de la Recherche Scientifique, Laboratoire de Probabilités, Statistique et Modélisation, Paris, France
| |
Collapse
|
8
|
Katriel G, Mahanaymi U, Brezner S, Kezel N, Koutschan C, Zeilberger D, Steel M, Snir S. Gene Transfer-Based Phylogenetics: Analytical Expressions and Additivity via Birth-Death Theory. Syst Biol 2023; 72:1403-1417. [PMID: 37862116 DOI: 10.1093/sysbio/syad060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 09/01/2023] [Accepted: 10/05/2023] [Indexed: 10/22/2023] Open
Abstract
The genomic era has opened up vast opportunities in molecular systematics, one of which is deciphering the evolutionary history in fine detail. Under this mass of data, analyzing the point mutations of standard markers is often too crude and slow for fine-scale phylogenetics. Nevertheless, genome dynamics (GD) events provide alternative, often richer information. The synteny index (SI) between a pair of genomes combines gene order and gene content information, allowing the comparison of genomes of unequal gene content, together with order considerations of their common genes. Recently, genome dynamics has been modeled as a continuous-time Markov process, and gene distance in the genome as a birth-death-immigration process. Nevertheless, due to complexities arising in this setting, no precise and provably consistent estimators could be derived, resulting in heuristic solutions. Here, we extend this modeling approach by using techniques from birth-death theory to derive explicit expressions of the system's probabilistic dynamics in the form of rational functions of the model parameters. This, in turn, allows us to infer analytically accurate distances between organisms based on their SI. Subsequently, we establish additivity of this estimated evolutionary distance (a desirable property yielding phylogenetic consistency). Applying the new measure in simulation studies shows that it provides accurate results in realistic settings and even under model extensions such as gene gain/loss or over a tree structure. In the real-data realm, we applied the new formulation to unique data structure that we constructed-the ordered orthology DB-based on a new version of the EggNOG database, to construct a tree with more than 4.5K taxa. To the best of our knowledge, this is the largest gene-order-based tree constructed and it overcomes shortcomings found in previous approaches. Constructing a GD-based tree allows to confirm and contrast findings based on other phylogenetic approaches, as we show.
Collapse
Affiliation(s)
- Guy Katriel
- Department of Mathematics, Braude College of Engineering, Karmiel, Israel
| | - Udi Mahanaymi
- Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel
| | - Shelly Brezner
- Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel
| | - Noor Kezel
- Department of Mathematics, University of Haifa, Haifa, Israel
| | | | - Doron Zeilberger
- Department of Mathematics, Rutgers University, New Brunwick, NJ, USA
| | - Mike Steel
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Sagi Snir
- Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel
| |
Collapse
|
9
|
Perez-Lamarque B, Morlon H. Comparing different computational approaches for detecting long-term vertical transmission in host-associated microbiota. Mol Ecol 2023; 32:6671-6685. [PMID: 36065594 DOI: 10.1111/mec.16681] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 11/30/2022]
Abstract
Long-term vertical transmissions of gut bacteria are thought to be frequent and functionally important in mammals. Several phylogenetic-based approaches have been proposed to detect, among species-rich microbiota, the bacteria that have been vertically transmitted during a host clade radiation. Applied to mammal microbiota, these methods have sometimes led to conflicting results; in addition, how they cope with the slow evolution of markers typically used to characterize bacterial microbiota remains unclear. Here, we use simulations to test the statistical performances of two widely-used global-fit approaches (ParaFit and PACo) and two event-based approaches (ALE and HOME). We find that these approaches have different strengths and weaknesses depending on the amount of variation in the bacterial DNA sequences and are therefore complementary. In particular, we show that ALE performs better when there is a lot of variation in the bacterial DNA sequences, whereas HOME performs better when there is not. Global-fit approaches (ParaFit and PACo) have higher type I error rates (false positives) but have the advantage to be very fast to run. We apply these methods to the gut microbiota of primates and our results suggest that only a small fraction of their gut bacteria is vertically transmitted.
Collapse
Affiliation(s)
- Benoît Perez-Lamarque
- Institut de Biologie de l'École Normale Supérieure (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Institut de Systématique, Évolution, Biodiversité (ISYEB), Muséum National d'histoire Naturelle, CNRS, Sorbonne Université, EPHE, UA, Paris, France
| | - Hélène Morlon
- Institut de Biologie de l'École Normale Supérieure (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| |
Collapse
|
10
|
Aubin E, Llauro C, Garrigue J, Mirouze M, Panaud O, El Baidouri M. Genome-wide analysis of horizontal transfer in non-model wild species from a natural ecosystem reveals new insights into genetic exchange in plants. PLoS Genet 2023; 19:e1010964. [PMID: 37856455 PMCID: PMC10586619 DOI: 10.1371/journal.pgen.1010964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 09/11/2023] [Indexed: 10/21/2023] Open
Abstract
Horizontal transfer (HT) refers to the exchange of genetic material between divergent species by mechanisms other than reproduction. In recent years, several studies have demonstrated HTs in eukaryotes, particularly in the context of parasitic relationships and in model species. However, very little is known about HT in natural ecosystems, especially those involving non-parasitic wild species, and the nature of the ecological relationships that promote these HTs. In this work, we conducted a pilot study investigating HTs by sequencing the genomes of 17 wild non-model species from a natural ecosystem, the Massane forest, located in southern France. To this end, we developed a new computational pipeline called INTERCHANGE that is able to characterize HTs at the whole genome level without prior annotation and directly in the raw sequencing reads. Using this pipeline, we identified 12 HT events, half of which occurred between lianas and trees. We found that mainly low copy number LTR-retrotransposons from the Copia superfamily were transferred between these wild plant species, especially those of the Ivana and Ale lineages. This study revealed a possible new route for HTs between non-parasitic plants and provides new insights into the genomic characteristics of horizontally transferred DNA in plant genomes.
Collapse
Affiliation(s)
- Emilie Aubin
- Laboratoire Génome et Développement des Plantes, Perpignan, Université de Perpignan Via Domitia, Perpignan, France
| | - Christel Llauro
- Laboratoire Génome et Développement des Plantes, Perpignan, Université de Perpignan Via Domitia, Perpignan, France
- Laboratoire Génome et Développement des Plantes, Centre National de la Recherche Scientifique, Perpignan, France
| | - Joseph Garrigue
- Réserve Naturelle Nationale de la forêt de la Massane, France
| | - Marie Mirouze
- Laboratoire Génome et Développement des Plantes, Perpignan, Université de Perpignan Via Domitia, Perpignan, France
- Diversité, Adaptation, Développement des Plantes, Institut de Recherche pour le Développement, Université de Montpellier, Montpellier, France
| | - Olivier Panaud
- Laboratoire Génome et Développement des Plantes, Perpignan, Université de Perpignan Via Domitia, Perpignan, France
- Institut Universitaire de France, Paris, France
| | - Moaine El Baidouri
- Laboratoire Génome et Développement des Plantes, Perpignan, Université de Perpignan Via Domitia, Perpignan, France
- Laboratoire Génome et Développement des Plantes, Centre National de la Recherche Scientifique, Perpignan, France
| |
Collapse
|
11
|
Ma J, Zhao H, Mo S, Li J, Ma X, Tang Y, Li H, Liu Z. Acquisition of Type I methyltransferase via horizontal gene transfer increases the drug resistance of Aeromonas veronii. Microb Genom 2023; 9:001107. [PMID: 37754275 PMCID: PMC10569733 DOI: 10.1099/mgen.0.001107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Accepted: 09/14/2023] [Indexed: 09/28/2023] Open
Abstract
Aeromonas veronii is an opportunistic pathogen that affects both fish and mammals, including humans, leading to bacteraemia, sepsis, meningitis and even death. The increasing virulence and drug resistance of A. veronii are of significant concern and pose a severe risk to public safety. The Type I restriction-modification (RM) system, which functions as a bacterial defence mechanism, can influence gene expression through DNA methylation. However, little research has been conducted to explore its origin, evolutionary path, and relationship to virulence and drug resistance in A. veronii. In this study, we analysed the pan-genome of 233 A. veronii strains, and the results indicated that it was 'open', meaning that A. veronii has acquired additional genes from other species. This suggested that A. veronii had the potential to adapt and evolve rapidly, which might have contributed to its drug resistance. One Type I methyltransferase (MTase) and two complete Type I RM systems were identified, namely AveC4I, AveC4II and AveC4III in A. veronii strain C4, respectively. Notably, AveC4I was exclusive to A. veronii C4. Phylogenetic analysis revealed that AveC4I was derived from horizontal gene transfer from Thiocystis violascens and exchanged genes with the human pathogen Comamonas kerstersii. Single molecule real-time sequencing was applied to identify the motif methylated by AveC4I, which was unique and not recognized by any reported MTases in the REBASE database. We also annotated the functions and pathways of the genes containing the motif, revealing that AveC4I may control drug resistance in A. veronii C4. Our findings provide new insight on the mechanisms underlying drug resistance in pathogenic bacteria. By identifying the specific genes and pathways affected by AveC4I, this study may aid in the development of new therapeutic approaches to combat A. veronii infections.
Collapse
Affiliation(s)
- Jiayue Ma
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Honghao Zhao
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Shuangyi Mo
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Juanjuan Li
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Xiang Ma
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Yanqiong Tang
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Hong Li
- School of Life Sciences, Hainan University, Haikou, PR China
| | - Zhu Liu
- School of Life Sciences, Hainan University, Haikou, PR China
| |
Collapse
|
12
|
Liu N, Gonzalez TA, Fischer J, Hong C, Johnson M, Mawhorter R, Mugnatto F, Soh R, Somji S, Wirth JS, Libeskind-Hadas R, Bush EC. xenoGI 3: using the DTLOR model to reconstruct the evolution of gene families in clades of microbes. BMC Bioinformatics 2023; 24:295. [PMID: 37480009 PMCID: PMC10362622 DOI: 10.1186/s12859-023-05410-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Accepted: 07/11/2023] [Indexed: 07/23/2023] Open
Abstract
To understand genome evolution in a group of microbes, we need to know the timing of events such as duplications, deletions and horizontal transfers. A common approach is to perform a gene-tree / species-tree reconciliation. While a number of software packages perform this type of analysis, none are geared toward a complete reconstruction for all families in an entire clade. Here we describe an update to the xenoGI software package which allows users to perform such an analysis using the newly developed DTLOR (duplication-transfer-loss-origin-rearrangement) reconciliation model starting from genome sequences as input.
Collapse
Affiliation(s)
- Nuo Liu
- Department of Biology, Harvey Mudd College, Claremont, CA, USA
| | | | - Jacob Fischer
- Department of Computer Science, Harvey Mudd College, Claremont, CA, USA
| | - Chan Hong
- Department of Biology, Harvey Mudd College, Claremont, CA, USA
| | | | - Ross Mawhorter
- Department of Computer Science and Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Rachael Soh
- Department of Biology, Harvey Mudd College, Claremont, CA, USA
| | - Shifa Somji
- Department of Computer Science, Harvey Mudd College, Claremont, CA, USA
| | - Joseph S Wirth
- Department of Biology, Harvey Mudd College, Claremont, CA, USA
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Ran Libeskind-Hadas
- Department of Integrated Sciences, Claremont McKenna College, Claremont, CA, USA
| | - Eliot C Bush
- Department of Biology, Harvey Mudd College, Claremont, CA, USA.
| |
Collapse
|
13
|
Kuppa Baskaran DK, Umale S, Zhou Z, Raman K, Anantharaman K. Metagenome-based metabolic modelling predicts unique microbial interactions in deep-sea hydrothermal plume microbiomes. ISME COMMUNICATIONS 2023; 3:42. [PMID: 37120693 PMCID: PMC10148797 DOI: 10.1038/s43705-023-00242-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 03/20/2023] [Accepted: 04/12/2023] [Indexed: 05/01/2023]
Abstract
Deep-sea hydrothermal vents are abundant on the ocean floor and play important roles in ocean biogeochemistry. In vent ecosystems such as hydrothermal plumes, microorganisms rely on reduced chemicals and gases in hydrothermal fluids to fuel primary production and form diverse and complex microbial communities. However, microbial interactions that drive these complex microbiomes remain poorly understood. Here, we use microbiomes from the Guaymas Basin hydrothermal system in the Pacific Ocean to shed more light on the key species in these communities and their interactions. We built metabolic models from metagenomically assembled genomes (MAGs) and infer possible metabolic exchanges and horizontal gene transfer (HGT) events within the community. We highlight possible archaea-archaea and archaea-bacteria interactions and their contributions to the robustness of the community. Cellobiose, D-Mannose 1-phosphate, O2, CO2, and H2S were among the most exchanged metabolites. These interactions enhanced the metabolic capabilities of the community by exchange of metabolites that cannot be produced by any other community member. Archaea from the DPANN group stood out as key microbes, benefiting significantly as acceptors in the community. Overall, our study provides key insights into the microbial interactions that drive community structure and organisation in complex hydrothermal plume microbiomes.
Collapse
Affiliation(s)
- Dinesh Kumar Kuppa Baskaran
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India
- Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), IIT Madras, Chennai, India
| | - Shreyansh Umale
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India
- Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India
| | - Zhichao Zhou
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| | - Karthik Raman
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India.
- Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India.
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), IIT Madras, Chennai, India.
| | | |
Collapse
|
14
|
Mei R, Kaneko M, Imachi H, Nobu MK. The origin and evolution of methanogenesis and Archaea are intertwined. PNAS NEXUS 2023; 2:pgad023. [PMID: 36874274 PMCID: PMC9982363 DOI: 10.1093/pnasnexus/pgad023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 01/20/2023] [Indexed: 02/01/2023]
Abstract
Methanogenesis has been widely accepted as an ancient metabolism, but the precise evolutionary trajectory remains hotly debated. Disparate theories exist regarding its emergence time, ancestral form, and relationship with homologous metabolisms. Here, we report the phylogenies of anabolism-involved proteins responsible for cofactor biosynthesis, providing new evidence for the antiquity of methanogenesis. Revisiting the phylogenies of key catabolism-involved proteins further suggests that the last Archaea common ancestor (LACA) was capable of versatile H2-, CO2-, and methanol-utilizing methanogenesis. Based on phylogenetic analyses of the methyl/alkyl-S-CoM reductase family, we propose that, in contrast to current paradigms, substrate-specific functions emerged through parallel evolution traced back to a nonspecific ancestor, which likely originated from protein-free reactions as predicted from autocatalytic experiments using cofactor F430. After LACA, inheritance/loss/innovation centered around methanogenic lithoautotrophy coincided with ancient lifestyle divergence, which is clearly reflected by genomically predicted physiologies of extant archaea. Thus, methanogenesis is not only a hallmark metabolism of Archaea, but the key to resolve the enigmatic lifestyle that ancestral archaea took and the transition that led to physiologies prominent today.
Collapse
Affiliation(s)
- Ran Mei
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba 305-8566, Japan
| | - Masanori Kaneko
- Institute for Geo-Resources and Environment, Geological Survey of Japan, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba 305-8567, Japan
| | - Hiroyuki Imachi
- Institute for Extra-cutting-edge Science and Technology Avant-garde Research (X-star), Japan Agency for Marine-Earth Science and Technology (JAMSTEC), 2-15 Natsushima-cho, Yokosuka, Kanagawa 237-0061, Japan
| | - Masaru K Nobu
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba 305-8566, Japan.,Institute for Extra-cutting-edge Science and Technology Avant-garde Research (X-star), Japan Agency for Marine-Earth Science and Technology (JAMSTEC), 2-15 Natsushima-cho, Yokosuka, Kanagawa 237-0061, Japan
| |
Collapse
|
15
|
Zaman S, Sledzieski S, Berger B, Wu YC, Bansal MS. virDTL: Viral Recombination Analysis Through Phylogenetic Reconciliation and Its Application to Sarbecoviruses and SARS-CoV-2. J Comput Biol 2023; 30:3-20. [PMID: 36125448 PMCID: PMC10081712 DOI: 10.1089/cmb.2021.0507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
An accurate understanding of the evolutionary history of rapidly-evolving viruses like SARS-CoV-2, responsible for the COVID-19 pandemic, is crucial to tracking and preventing the spread of emerging pathogens. However, viruses undergo frequent recombination, which makes it difficult to trace their evolutionary history using traditional phylogenetic methods. In this study, we present a phylogenetic workflow, virDTL, for analyzing viral evolution in the presence of recombination. Our approach leverages reconciliation methods developed for inferring horizontal gene transfer in prokaryotes and, compared to existing tools, is uniquely able to identify ancestral recombinations while accounting for several sources of inference uncertainty, including in the construction of a strain tree, estimation and rooting of gene family trees, and reconciliation itself. We apply this workflow to the Sarbecovirus subgenus and demonstrate how a principled analysis of predicted recombination gives insight into the evolution of SARS-CoV-2. In addition to providing confirming evidence for the horseshoe bat as its zoonotic origin, we identify several ancestral recombination events that merit further study.
Collapse
Affiliation(s)
- Sumaira Zaman
- Department of Computer Science and Engineering, University of Connecticut, Storrs, Connecticut, USA
| | - Samuel Sledzieski
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Bonnie Berger
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.,Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Yi-Chieh Wu
- Department of Computer Science, Harvey Mudd College, Claremont, California, USA
| | - Mukul S Bansal
- Department of Computer Science and Engineering, University of Connecticut, Storrs, Connecticut, USA.,The Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
16
|
Choi Y, Ahn S, Park M, Lee S, Cho S, Kim H. HGTree v2.0: a comprehensive database update for horizontal gene transfer (HGT) events detected by the tree-reconciliation method. Nucleic Acids Res 2022; 51:D1010-D1018. [PMID: 36350646 PMCID: PMC9825516 DOI: 10.1093/nar/gkac929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 09/30/2022] [Accepted: 11/01/2022] [Indexed: 11/11/2022] Open
Abstract
HGTree is a database that provides horizontal gene transfer (HGT) event information on 2472 prokaryote genomes using the tree-reconciliation method. HGTree was constructed in 2015, and a large number of prokaryotic genomes have been additionally published since then. To cope with the rapid rise of prokaryotic genome data, we present HGTree v2.0 (http://hgtree2.snu.ac.kr), a newly updated version of our HGT database with much more extensive data, including a total of 20 536 completely sequenced non-redundant prokaryotic genomes, and more reliable HGT information results curated with various steps. As a result, HGTree v2.0 has a set of expanded data results of 6 361 199 putative horizontally transferred genes integrated with additional functional information such as the KEGG pathway, virulence factors and antimicrobial resistance. Furthermore, various visualization tools in the HGTree v2.0 database website provide intuitive biological insights, allowing the users to investigate their genomes of interest.
Collapse
Affiliation(s)
| | | | - Myeongkyu Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Republic of Korea
| | | | - Seoae Cho
- eGnome Inc., Seoul 05836, Republic of Korea
| | - Heebal Kim
- To whom correspondence should be addressed. Tel: +82 2 880 4803; Fax: +82 2 883 8812;
| |
Collapse
|
17
|
Grouzdev D, Gaisin V, Lunina O, Krutkina M, Krasnova E, Voronov D, Baslerov R, Sigalevich P, Savvichev A, Gorlenko V. Microbial communities of stratified aquatic ecosystems of Kandalaksha Bay (White Sea) shed light on the evolutionary history of green and brown morphotypes of Chlorobiota. FEMS Microbiol Ecol 2022; 98:6693937. [PMID: 36073352 DOI: 10.1093/femsec/fiac103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 08/30/2022] [Accepted: 09/01/2022] [Indexed: 12/14/2022] Open
Abstract
Anoxygenic photoautotrophic metabolism of green sulfur bacteria of the family Chlorobiaceae played a significant role in establishing the Earth's biosphere. Two known major ecological forms of these phototrophs differ in their pigment composition and, therefore, in color: the green and brown forms. The latter form often occurs in low-light environments and is specialized to harvest blue light, which can penetrate to the greatest depth in the water column. In the present work, metagenomic sequencing was used to investigate the natural population of brown Chl. phaeovibrioides ZM in a marine stratified Zeleny Mys lagoon in the Kandalaksha Bay (the White Sea) to supplement the previously obtained genomes of brown Chlorobiaceae. The genomes of brown and green Chlorobiaceae were investigated using comparative genome analysis and phylogenetic and reconciliation analysis to reconstruct the evolution of these ecological forms. Our results support the suggestion that the last common ancestor of Chlorobiaceae belonged to the brown form, i.e. it was adapted to the conditions of low illumination. However, despite the vertical inheritance of these characteristics, among modern Chlorobiaceae populations, the genes responsible for synthesizing the pigments of the brown form are subject to active horizontal transfer.
Collapse
Affiliation(s)
- Denis Grouzdev
- SciBear OU, 10115 Tallinn, Estonia.,School of Marine and Atmospheric Sciences, Stony Brook University, 11794, Stony Brook, USA
| | - Vasil Gaisin
- Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russia.,Current affiliation: Institute of Molecular Biology & Biophysics, Eidgenössische Technische Hochschule Zürich, Zurich, Switzerland
| | - Olga Lunina
- Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russia
| | | | - Elena Krasnova
- Pertsov White Sea Biological Station, 184042, Republic Karelia, Russia
| | - Dmitry Voronov
- Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 127051, Moscow, Russia
| | - Roman Baslerov
- Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russia
| | - Pavel Sigalevich
- Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russia
| | - Alexander Savvichev
- Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russia
| | - Vladimir Gorlenko
- Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russia
| |
Collapse
|
18
|
Uzun M, Koziaeva V, Dziuba M, Leão P, Krutkina M, Grouzdev D. Detection of interphylum transfers of the magnetosome gene cluster in magnetotactic bacteria. Front Microbiol 2022; 13:945734. [PMID: 35979495 PMCID: PMC9376291 DOI: 10.3389/fmicb.2022.945734] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 06/27/2022] [Indexed: 01/01/2023] Open
Abstract
Magnetosome synthesis in magnetotactic bacteria (MTB) is regarded as a very ancient evolutionary process that dates back to deep-branching phyla. Magnetotactic bacteria belonging to one of such phyla, Nitrospirota, contain the classical genes for the magnetosome synthesis (e.g., mam, mms) and man genes, which were considered to be specific for this group. However, the recent discovery of man genes in MTB from the Thermodesulfobacteriota phylum has raised several questions about the inheritance of these genes in MTB. In this work, three new man genes containing MTB genomes affiliated with Nitrospirota and Thermodesulfobacteriota, were obtained. By applying reconciliation with these and the previously published MTB genomes, we demonstrate that the last common ancestor of all Nitrospirota was most likely not magnetotactic as assumed previously. Instead, our findings suggest that the genes for magnetosome synthesis were transmitted to the phylum Nitrospirota by horizontal gene transfer (HGT), which is the first case of the interphylum transfer of magnetosome genes detected to date. Furthermore, we provide evidence for the HGT of magnetosome genes from the Magnetobacteriaceae to the Dissulfurispiraceae family within Nitrospirota. Thus, our results imply a more significant role of HGT in the MTB evolution than deemed before and challenge the hypothesis of the ancient origin of magnetosome synthesis.
Collapse
Affiliation(s)
- Maria Uzun
- Skryabin Institute of Bioengineering, Research Center of Biotechnology of the Russian Academy of Sciences, Moscow, Russia
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Veronika Koziaeva
- Skryabin Institute of Bioengineering, Research Center of Biotechnology of the Russian Academy of Sciences, Moscow, Russia
| | - Marina Dziuba
- Skryabin Institute of Bioengineering, Research Center of Biotechnology of the Russian Academy of Sciences, Moscow, Russia
- Department of Microbiology, University of Bayreuth, Bayreuth, Germany
| | - Pedro Leão
- Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
- Department of Marine Science, The University of Texas at Austin, Austin, TX, United States
| | | | - Denis Grouzdev
- SciBear OU, Tallinn, Estonia
- *Correspondence: Denis Grouzdev,
| |
Collapse
|
19
|
Tree Reconciliation Methods for Host-Symbiont Cophylogenetic Analyses. Life (Basel) 2022; 12:life12030443. [PMID: 35330194 PMCID: PMC8951107 DOI: 10.3390/life12030443] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 03/05/2022] [Accepted: 03/10/2022] [Indexed: 12/16/2022] Open
Abstract
Phylogenetic reconciliation is a fundamental method in the study of pairs of coevolving species. This paper provides an overview of the underlying theory of reconciliation in the context of host-symbiont cophylogenetics, identifying some of the major challenges to users of these methods, such as selecting event costs and selecting representative reconciliations. Next, recent advances to address these challenges are discussed followed by a discussion of several established and recent software tools.
Collapse
|
20
|
Abstract
The burial of organic carbon, which prevents its remineralization via oxygen-consuming processes, is considered one of the causes of Earth’s oxygenation. Yet, higher levels of oxygen are thought to inhibit burial. Here we propose a resolution of this conundrum, wherein Earth’s initial oxygenation is favored by oxidative metabolisms generating partially oxidized organic matter (POOM), increasing burial via interaction with minerals in sediments. First, we introduce the POOM hypothesis via a mathematical argument. Second, we reconstruct the evolutionary history of one key enzyme family, flavin-dependent Baeyer–Villiger monooxygenases, that generates POOM, and show the temporal consistency of its diversification with the Proterozoic and Phanerozoic atmospheric oxygenation. Finally, we propose that the expansion of oxidative metabolisms instigated a positive feedback, which was amplified by the chemical changes to minerals on Earth’s surface. Collectively, these results suggest that Earth’s oxygenation is an autocatalytic transition induced by a combination of biological innovations and geological changes. How Earth’s atmosphere became oxygenated remains enigmatic. Here the authors use mathematical and phylogenetic analyses to find that Earth’s oxygenation is induced by the interactions of microbial oxidative metabolites with sediment minerals.
Collapse
|
21
|
Harris BJ, Sheridan PO, Davín AA, Gubry-Rangin C, Szöllősi GJ, Williams TA. Rooting Species Trees Using Gene Tree-Species Tree Reconciliation. Methods Mol Biol 2022; 2569:189-211. [PMID: 36083449 DOI: 10.1007/978-1-0716-2691-7_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Interpreting phylogenetic trees requires a root, which provides the direction of evolution and polarizes ancestor-descendant relationships. But inferring the root using genetic data is difficult, particularly in cases where the closest available outgroup is only distantly related, which are common for microbes. In this chapter, we present a workflow for estimating rooted species trees and the evolutionary history of the gene families that evolve within them using probabilistic gene tree-species tree reconciliation. We illustrate the pipeline using a small dataset of prokaryotic genomes, for which the example scripts can be run using modest computer resources. We describe the rooting method used in this work in the context or other rooting strategies and discuss some of the limitations and opportunities presented by probabilistic gene tree-species tree reconciliation methods.
Collapse
Affiliation(s)
- Brogan J Harris
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Paul O Sheridan
- School of Biological Sciences, University of Bristol, Bristol, UK
- School of Biological Sciences, University of Aberdeen, Aberdeen, UK
| | - Adrián A Davín
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | | | - Gergely J Szöllősi
- Dept. of Biological Physics, Eötvös Loránd University, Budapest, Hungary
- MTA-ELTE "Lendület" Evolutionary Genomics Research Group, Budapest, Hungary
- Institute of Evolution, Centre for Ecological Research, Budapest, Hungary
| | - Tom A Williams
- School of Biological Sciences, University of Bristol, Bristol, UK.
| |
Collapse
|
22
|
Bansal MS. Deciphering Microbial Gene Family Evolution Using Duplication-Transfer-Loss Reconciliation and RANGER-DTL. Methods Mol Biol 2022; 2569:233-252. [PMID: 36083451 DOI: 10.1007/978-1-0716-2691-7_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Phylogenetic reconciliation has emerged as a principled, highly effective technique for investigating the origin, spread, and evolutionary history of microbial gene families. Proper application of phylogenetic reconciliation requires a clear understanding of potential pitfalls and sources of error, and knowledge of the most effective reconciliation-based tools and protocols to use to maximize accuracy. In this book chapter, we provide a brief overview of Duplication-Transfer-Loss (DTL) reconciliation, the standard reconciliation model used to study microbial gene families and provide a step-by-step computational protocol to maximize the accuracy of DTL reconciliation and minimize false-positive evolutionary inferences.
Collapse
Affiliation(s)
- Mukul S Bansal
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, USA.
| |
Collapse
|
23
|
Sharda M, Badrinarayanan A, Seshasayee ASN. Evolutionary and Comparative Analysis of Bacterial Nonhomologous End Joining Repair. Genome Biol Evol 2021; 12:2450-2466. [PMID: 33078828 PMCID: PMC7719229 DOI: 10.1093/gbe/evaa223] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/14/2020] [Indexed: 01/04/2023] Open
Abstract
DNA double-strand breaks (DSBs) are a threat to genome stability. In all domains of life, DSBs are faithfully fixed via homologous recombination. Recombination requires the presence of an uncut copy of duplex DNA which is used as a template for repair. Alternatively, in the absence of a template, cells utilize error-prone nonhomologous end joining (NHEJ). Although ubiquitously found in eukaryotes, NHEJ is not universally present in bacteria. It is unclear as to why many prokaryotes lack this pathway. Toward understanding what could have led to the current distribution of bacterial NHEJ, we carried out comparative genomics and phylogenetic analysis across ∼6,000 genomes. Our results show that this pathway is sporadically distributed across the phylogeny. Ancestral reconstruction further suggests that NHEJ was absent in the eubacterial ancestor and can be acquired via specific routes. Integrating NHEJ occurrence data for archaea, we also find evidence for extensive horizontal exchange of NHEJ genes between the two kingdoms as well as across bacterial clades. The pattern of occurrence in bacteria is consistent with correlated evolution of NHEJ with key genome characteristics of genome size and growth rate; NHEJ presence is associated with large genome sizes and/or slow growth rates, with the former being the dominant correlate. Given the central role these traits play in determining the ability to carry out recombination, it is possible that the evolutionary history of bacterial NHEJ may have been shaped by requirement for efficient DSB repair.
Collapse
Affiliation(s)
- Mohak Sharda
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, Karnataka, India.,School of Life Science, The University of Trans-Disciplinary Health Sciences & Technology (TDU), Bangalore, Karnataka, India
| | - Anjana Badrinarayanan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, Karnataka, India
| | - Aswin Sai Narain Seshasayee
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, Karnataka, India
| |
Collapse
|
24
|
Lin D, Lan L, Zheng T, Shi P, Xu J, Li J. Comparative Genomics Reveals Recent Adaptive Evolution in Himalayan Giant Honeybee Apis laboriosa. Genome Biol Evol 2021; 13:6380142. [PMID: 34599331 PMCID: PMC8536543 DOI: 10.1093/gbe/evab227] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2021] [Indexed: 01/07/2023] Open
Abstract
The Himalayan giant honeybee, Apis laboriosa, is the largest individual honeybee with major ecological and economic importance in high-latitude environments. However, our understanding of its environmental adaptations is circumscribed by the paucity of genomic data for this species. Here, we provide a draft genome of wild A. laboriosa, along with a comparison to its closely related species, Apis dorsata. The draft genome of A. laboriosa based on the de novo assembly is 226.1 Mbp in length with a scaffold N50 size of 3.34 Mbp, a GC content of 32.2%, a repeat content of 6.86%, and a gene family number of 8,404. Comparative genomics analysis revealed that the genes in A. laboriosa genome have undergone stronger positive selection (2.5 times more genes) and more recent duplication/loss events (6.1 times more events) than those in the A. dorsata genome. Our study implies the potential molecular mechanisms underlying the high-altitude adaptation of A. laboriosa and will catalyze future comparative studies to understand the environmental adaptation of modern honeybees.
Collapse
Affiliation(s)
- Dan Lin
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Lan Lan
- College of Life Sciences, Chongqing Normal University, Chongqing, China
| | - Tingting Zheng
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Peng Shi
- College of Life Sciences, Chongqing Normal University, Chongqing, China
| | - Jinshan Xu
- College of Life Sciences, Chongqing Normal University, Chongqing, China
| | - Jun Li
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China.,School of Data Science, City University of Hong Kong, Hong Kong, China
| |
Collapse
|
25
|
Kloub L, Gosselin S, Fullmer M, Graf J, Gogarten JP, Bansal MS. Systematic Detection of Large-Scale Multigene Horizontal Transfer in Prokaryotes. Mol Biol Evol 2021; 38:2639-2659. [PMID: 33565580 PMCID: PMC8136488 DOI: 10.1093/molbev/msab043] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Horizontal gene transfer (HGT) is central to prokaryotic evolution. However, little is known about the “scale” of individual HGT events. In this work, we introduce the first computational framework to help answer the following fundamental question: How often does more than one gene get horizontally transferred in a single HGT event? Our method, called HoMer, uses phylogenetic reconciliation to infer single-gene HGT events across a given set of species/strains, employs several techniques to account for inference error and uncertainty, combines that information with gene order information from extant genomes, and uses statistical analysis to identify candidate horizontal multigene transfers (HMGTs) in both extant and ancestral species/strains. HoMer is highly scalable and can be easily used to infer HMGTs across hundreds of genomes. We apply HoMer to a genome-scale data set of over 22,000 gene families from 103 Aeromonas genomes and identify a large number of plausible HMGTs of various scales at both small and large phylogenetic distances. Analysis of these HMGTs reveals interesting relationships between gene function, phylogenetic distance, and frequency of multigene transfer. Among other insights, we find that 1) the observed relative frequency of HMGT increases as divergence between genomes increases, 2) HMGTs often have conserved gene functions, and 3) rare genes are frequently acquired through HMGT. We also analyze in detail HMGTs involving the zonula occludens toxin and type III secretion systems. By enabling the systematic inference of HMGTs on a large scale, HoMer will facilitate a more accurate and more complete understanding of HGT and microbial evolution.
Collapse
Affiliation(s)
- Lina Kloub
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA
| | - Sean Gosselin
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Matthew Fullmer
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA.,Bioinformatics Institute, School of Biological Sciences, The University of Auckland, Auckland, New Zealand
| | - Joerg Graf
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA.,The Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA.,The Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| | - Mukul S Bansal
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA.,The Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
26
|
Improved Duplication-Transfer-Loss Reconciliation with Extinct and Unsampled Lineages. ALGORITHMS 2021. [DOI: 10.3390/a14080231] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Duplication-Transfer-Loss (DTL) reconciliation is a widely used computational technique for understanding gene family evolution and inferring horizontal gene transfer (transfer for short) in microbes. However, most existing models and implementations of DTL reconciliation cannot account for the effect of unsampled or extinct species lineages on the evolution of gene families, likely affecting their accuracy. Accounting for the presence and possible impact of any unsampled species lineages, including those that are extinct, is especially important for inferring and studying horizontal transfer since many genes in the species lineages represented in the reconciliation analysis are likely to have been acquired through horizontal transfer from unsampled lineages. While models of DTL reconciliation that account for transfer from unsampled lineages have already been proposed, they use a relatively simple framework for transfer from unsampled lineages and cannot explicitly infer the location on the species tree of each unsampled or extinct lineage associated with an identified transfer event. Furthermore, there does not yet exist any systematic studies to assess the impact of accounting for unsampled lineages on the accuracy of DTL reconciliation. In this work, we address these deficiencies by (i) introducing an extended DTL reconciliation model, called the DTLx reconciliation model, that accounts for unsampled and extinct species lineages in a new, more functional manner compared to existing models, (ii) showing that optimal reconciliations under the new DTLx reconciliation model can be computed just as efficiently as under the fastest DTL reconciliation model, (iii) providing an efficient algorithm for sampling optimal DTLx reconciliations uniformly at random, (iv) performing the first systematic simulation study to assess the impact of accounting for unsampled lineages on the accuracy of DTL reconciliation, and (v) comparing the accuracies of inferring transfers from unsampled lineages under our new model and the only other previously proposed parsimony-based model for this problem.
Collapse
|
27
|
Liu J, Mawhorter R, Liu N, Santichaivekin S, Bush E, Libeskind-Hadas R. Maximum parsimony reconciliation in the DTLOR model. BMC Bioinformatics 2021; 22:394. [PMID: 34348661 PMCID: PMC8340394 DOI: 10.1186/s12859-021-04290-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2021] [Accepted: 07/06/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Analyses of microbial evolution often use reconciliation methods. However, the standard duplication-transfer-loss (DTL) model does not account for the fact that species trees are often not fully sampled and thus, from the perspective of reconciliation, a gene family may enter the species tree from the outside. Moreover, within the genome, genes are often rearranged, causing them to move to new syntenic regions. RESULTS We extend the DTL model to account for two events that commonly arise in the evolution of microbes: origin of a gene from outside the sampled species tree and rearrangement of gene syntenic regions. We describe an efficient algorithm for maximum parsimony reconciliation in this new DTLOR model and then show how it can be extended to account for non-binary gene trees to handle uncertainty in gene tree topologies. Finally, we describe preliminary experimental results from the integration of our algorithm into the existing xenoGI tool for reconstructing the histories of genomic islands in closely related bacteria. CONCLUSIONS Reconciliation in the DTLOR model can offer new insights into the evolution of microbes that is not currently possible under the DTL model.
Collapse
Affiliation(s)
- Jingyi Liu
- Department of Computer Science, Harvey Mudd College, Claremont, CA, USA
| | - Ross Mawhorter
- Department of Computer Science and Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Nuo Liu
- Department of Computer Science, Harvey Mudd College, Claremont, CA, USA.,Department of Biology, Harvey Mudd College, Claremont, CA, USA
| | | | - Eliot Bush
- Department of Biology, Harvey Mudd College, Claremont, CA, USA
| | | |
Collapse
|
28
|
Williams TA, Schrempf D, Szöllősi GJ, Cox CJ, Foster PG, Embley TM. Inferring the deep past from molecular data. Genome Biol Evol 2021; 13:6192802. [PMID: 33772552 PMCID: PMC8175050 DOI: 10.1093/gbe/evab067] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2021] [Indexed: 12/17/2022] Open
Abstract
There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods.
Collapse
Affiliation(s)
- Tom A Williams
- School of Biological Sciences, University of Bristol, Bristol BS8 1TQ, United Kingdom
| | - Dominik Schrempf
- Dept. of Biological Physics, Eötvös Loránd University, 1117 Budapest, Hungary
| | - Gergely J Szöllősi
- Dept. of Biological Physics, Eötvös Loránd University, 1117 Budapest, Hungary.,MTA-ELTE "Lendület" Evolutionary Genomics Research Group, 1117 Budapest, Hungary.,Institute of Evolution, Centre for Ecological Research, 1121 Budapest, Hungary
| | - Cymon J Cox
- Centro de Ciências do Mar, Universidade do Algarve, Gambelas, 8005-319 Faro, Portugal
| | - Peter G Foster
- Department of Life Sciences, Natural History Museum, London SW7 5BD, United Kingdom
| | - T Martin Embley
- Biosciences Institute, Centre for Bacterial Cell Biology, Newcastle University, Newcastle upon Tyne NE2 4AX, United Kingdom
| |
Collapse
|
29
|
Comte N, Morel B, Hasić D, Guéguen L, Boussau B, Daubin V, Penel S, Scornavacca C, Gouy M, Stamatakis A, Tannier E, Parsons DP. Treerecs: an integrated phylogenetic tool, from sequences to reconciliations. Bioinformatics 2021; 36:4822-4824. [PMID: 33085745 DOI: 10.1093/bioinformatics/btaa615] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Revised: 06/22/2020] [Accepted: 07/09/2020] [Indexed: 11/15/2022] Open
Abstract
MOTIVATION Gene and species tree reconciliation methods are used to interpret gene trees, root them and correct uncertainties that are due to scarcity of signal in multiple sequence alignments. So far, reconciliation tools have not been integrated in standard phylogenetic software and they either lack performance on certain functions, or usability for biologists. RESULTS We present Treerecs, a phylogenetic software based on duplication-loss reconciliation. Treerecs is simple to install and to use. It is fast and versatile, has a graphic output, and can be used along with methods for phylogenetic inference on multiple alignments like PLL and Seaview. AVAILABILITY AND IMPLEMENTATION Treerecs is open-source. Its source code (C++, AGPLv3) and manuals are available from https://project.inria.fr/treerecs/.
Collapse
Affiliation(s)
- Nicolas Comte
- Inria Grenoble Rhône-Alpes, 38334 Montbonnot, France
| | - Benoit Morel
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Damir Hasić
- Department of Mathematics, University of Sarajevo, Sarajevo, Bosnia and Herzegovina
| | - Laurent Guéguen
- Université de Lyon, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR5558, F-69622 Villeurbanne, France
| | - Bastien Boussau
- Université de Lyon, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR5558, F-69622 Villeurbanne, France
| | - Vincent Daubin
- Université de Lyon, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR5558, F-69622 Villeurbanne, France
| | - Simon Penel
- Université de Lyon, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR5558, F-69622 Villeurbanne, France
| | - Celine Scornavacca
- ISEM, CNRS, Université de Montpellier, IRD, EPHE, Montpellier 34000, France
| | - Manolo Gouy
- Université de Lyon, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR5558, F-69622 Villeurbanne, France
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.,Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Eric Tannier
- Inria Grenoble Rhône-Alpes, 38334 Montbonnot, France.,Université de Lyon, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR5558, F-69622 Villeurbanne, France
| | | |
Collapse
|
30
|
Orlando M, Pucciarelli S, Lotti M. Endolysins from Antarctic Pseudomonas Display Lysozyme Activity at Low Temperature. Mar Drugs 2020; 18:E579. [PMID: 33233712 PMCID: PMC7699920 DOI: 10.3390/md18110579] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 11/17/2020] [Accepted: 11/18/2020] [Indexed: 12/18/2022] Open
Abstract
Organisms specialized to thrive in cold environments (so-called psychrophiles) produce enzymes with the remarkable ability to catalyze chemical reactions at low temperature. Cold activity relies on adaptive changes in the proteins' sequence and structural organization that result in high conformational flexibility. As a consequence of flexibility, several such enzymes are inherently heat sensitive. Cold-active enzymes are of interest for application in a number of bioprocesses, where cold activity coupled with easy thermal inactivation can be of advantage. We describe the biochemical and functional properties of two glycosyl hydrolases (named LYS177 and LYS188) of family 19 (GH19), identified in the genome of an Antarctic marine Pseudomonas. Molecular evolutionary analysis placed them in a group of characterized GH19 endolysins active on lysozyme substrates, such as peptidoglycan. Enzyme activity peaks at about 25-35 °C and 40% residual activity is retained at 5 °C. LYS177 and LYS188 are thermolabile, with Tm of 52 and 45 °C and half-lives of 48 and 12 h at 37 °C, respectively. Bioinformatics analyses suggest that low heat stability may be associated to temperature-driven increases in local flexibility occurring mainly in a specific region of the polypeptide that is predicted to contain hot spots for aggregation.
Collapse
Affiliation(s)
- Marco Orlando
- Department of Biotechnology and Biosciences, State University of Milano Bicocca, 20126 Milano, Italy;
| | - Sandra Pucciarelli
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032 Camerino, Italy;
| | - Marina Lotti
- Department of Biotechnology and Biosciences, State University of Milano Bicocca, 20126 Milano, Italy;
| |
Collapse
|
31
|
Li X, Cheng J, Liu X, Guo X, Liu Y, Fan W, Lu L, Ma Y, Liu T, Tao S, Jiang H. Origin and Evolution of Fusidane-Type Antibiotics Biosynthetic Pathway through Multiple Horizontal Gene Transfers. Genome Biol Evol 2020; 12:1830-1840. [PMID: 32915993 PMCID: PMC7750971 DOI: 10.1093/gbe/evaa163] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/27/2020] [Indexed: 01/08/2023] Open
Abstract
Fusidane-type antibiotics represented by fusidic acid, helvolic acid, and cephalosporin P1 have very similar core structures, but they are produced by fungi belonging to different taxonomic groups. The origin and evolution of fusidane-type antibiotics biosynthetic gene clusters (BGCs) in different antibiotics producing strains remained an enigma. In this study, we investigated the distribution and evolution of the fusidane BGCs in 1,284 fungal genomes. We identified 12 helvolic acid BGCs, 4 fusidic acid BGCs, and 1 cephalosporin P1 BGC in Pezizomycotina fungi. Phylogenetic analyses indicated six horizontal gene transfer (HGT) events in the evolutionary trajectory of the BGCs, including 1) three transfers across Eurotiomycetes and Sordariomycetes classes; 2) one transfer between genera under Sordariomycetes class; and 3) two transfers within Aspergillus genus under Eurotiomycetes classes. Finally, we proposed that the ancestor of fusidane BGCs would be originated from the Zoopagomycota by ancient HGT events according to the phylogenetic trees of key enzymes in fusidane BGCs (OSC and P450 genes). Our results extensively clarify the evolutionary trajectory of fusidane BGCs by HGT among distantly related fungi and provide new insights into the evolutionary mechanisms of metabolic pathways in fungi.
Collapse
Affiliation(s)
- Xiangchen Li
- College of Life Science, Northwest A&F University, Yangling, Shaanxi, China
| | - Jian Cheng
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Xiaonan Liu
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Xiaoxian Guo
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Yuqian Liu
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
| | - Wenjing Fan
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Lina Lu
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Yanhe Ma
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Tao Liu
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Shiheng Tao
- College of Life Science, Northwest A&F University, Yangling, Shaanxi, China
| | - Huifeng Jiang
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| |
Collapse
|
32
|
Piatkowski BT, Imwattana K, Tripp EA, Weston DJ, Healey A, Schmutz J, Shaw AJ. Phylogenomics reveals convergent evolution of red-violet coloration in land plants and the origins of the anthocyanin biosynthetic pathway. Mol Phylogenet Evol 2020; 151:106904. [PMID: 32645485 DOI: 10.1016/j.ympev.2020.106904] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 06/23/2020] [Accepted: 07/01/2020] [Indexed: 11/17/2022]
Abstract
The flavonoids, one of the largest classes of plant secondary metabolites, are found in lineages that span the land plant phylogeny and play important roles in stress responses and as pigments. Perhaps the most well-studied flavonoids are the anthocyanins that have human health benefits and help plants attract pollinators, regulate hormone production, and confer resistance to abiotic and biotic stresses. The canonical biochemical pathway responsible for the production of these pigments is well-characterized for flowering plants yet its conservation across deep divergences in land plants remains debated and poorly understood. Many early land plants such as mosses, liverworts, and ferns produce flavonoid pigments, but their biosynthetic origins and homologies to the anthocyanin pathway remain uncertain. We conducted phylogenetic analyses using full genome sequences representing nearly all major green plant lineages to reconstruct the evolutionary history of the anthocyanin biosynthetic pathway then test the hypothesis that genes in this pathway are present in early land plants. We found that the entire pathway was not intact until the most recent common ancestor of seed plants and that orthologs of many downstream enzymes are absent from seedless plants including mosses, liverworts, and ferns. Our results also highlight the utility of phylogenetic inference, as compared to pairwise sequence similarity, in orthology assessment within large gene families that have complex duplication-loss histories. We suggest that the production of red-violet flavonoid pigments widespread in seedless plants, including the 3-deoxyanthocyanins, requires the activity of novel, as-yet discovered enzymes, and represents convergent evolution of red-violet coloration across land plants.
Collapse
Affiliation(s)
- Bryan T Piatkowski
- Department of Biology, Duke University, Durham, NC 27708, United States.
| | - Karn Imwattana
- Department of Biology, Duke University, Durham, NC 27708, United States
| | - Erin A Tripp
- Department of Ecology and Evolutionary Biology, University of Colorado Boulder, Boulder, CO 80309, United States
| | - David J Weston
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, United States; Climate Change Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN 37831, United States
| | - Adam Healey
- HudsonAlpha Institute of Biotechnology, Huntsville, AL 35806, United States
| | - Jeremy Schmutz
- HudsonAlpha Institute of Biotechnology, Huntsville, AL 35806, United States; Department of Energy Joint Genome Institute, Berkeley, CA 94720, United States
| | - A Jonathan Shaw
- Department of Biology, Duke University, Durham, NC 27708, United States
| |
Collapse
|
33
|
Mangiagalli M, Lapi M, Maione S, Orlando M, Brocca S, Pesce A, Barbiroli A, Camilloni C, Pucciarelli S, Lotti M, Nardini M. The co-existence of cold activity and thermal stability in an Antarctic GH42 β-galactosidase relies on its hexameric quaternary arrangement. FEBS J 2020; 288:546-565. [PMID: 32363751 DOI: 10.1111/febs.15354] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 04/25/2020] [Accepted: 04/29/2020] [Indexed: 11/28/2022]
Abstract
To survive in cold environments, psychrophilic organisms produce enzymes endowed with high specific activity at low temperature. The structure of these enzymes is usually flexible and mostly thermolabile. In this work, we investigate the structural basis of cold adaptation of a GH42 β-galactosidase from the psychrophilic Marinomonas ef1. This enzyme couples cold activity with astonishing robustness for a psychrophilic protein, for it retains 23% of its highest activity at 5 °C and it is stable for several days at 37 °C and even 50 °C. Phylogenetic analyses indicate a close relationship with thermophilic β-galactosidases, suggesting that the present-day enzyme evolved from a thermostable scaffold modeled by environmental selective pressure. The crystallographic structure reveals the overall similarity with GH42 enzymes, along with a hexameric arrangement (dimer of trimers) not found in psychrophilic, mesophilic, and thermophilic homologues. In the quaternary structure, protomers form a large central cavity, whose accessibility to the substrate is promoted by the dynamic behavior of surface loops, even at low temperature. A peculiar cooperative behavior of the enzyme is likely related to the increase of the internal cavity permeability triggered by heating. Overall, our results highlight a novel strategy of enzyme cold adaptation, based on the oligomerization state of the enzyme, which effectively challenges the paradigm of cold activity coupled with intrinsic thermolability. DATABASE: Structural data are available in the Protein Data Bank database under the accession number 6Y2K.
Collapse
Affiliation(s)
- Marco Mangiagalli
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Italy
| | - Michela Lapi
- Department of Biosciences, University of Milano, Italy
| | - Serena Maione
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Italy
| | - Marco Orlando
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Italy
| | - Stefania Brocca
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Italy
| | | | - Alberto Barbiroli
- Department of Food, Environmental and Nutritional Sciences, University of Milano, Italy
| | | | - Sandra Pucciarelli
- School of Biosciences and Veterinary Medicine, University of Camerino, Italy
| | - Marina Lotti
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Italy
| | - Marco Nardini
- Department of Biosciences, University of Milano, Italy
| |
Collapse
|
34
|
Wade T, Rangel LT, Kundu S, Fournier GP, Bansal MS. Assessing the accuracy of phylogenetic rooting methods on prokaryotic gene families. PLoS One 2020; 15:e0232950. [PMID: 32413061 PMCID: PMC7228096 DOI: 10.1371/journal.pone.0232950] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Accepted: 04/24/2020] [Indexed: 12/18/2022] Open
Abstract
Almost all standard phylogenetic methods for reconstructing gene trees result in unrooted trees; yet, many of the most useful applications of gene trees require that the gene trees be correctly rooted. As a result, several computational methods have been developed for inferring the root of unrooted gene trees. However, the accuracy of such methods has never been systematically evaluated on prokaryotic gene families, where horizontal gene transfer is often one of the dominant evolutionary events driving gene family evolution. In this work, we address this gap by conducting a thorough comparative evaluation of five different rooting methods using large collections of both simulated and empirical prokaryotic gene trees. Our simulation study is based on 6000 true and reconstructed gene trees on 100 species and characterizes the rooting accuracy of the four methods under 36 different evolutionary conditions and 3 levels of gene tree reconstruction error. The empirical study is based on a large, carefully designed data set of 3098 gene trees from 504 bacterial species (406 Alphaproteobacteria and 98 Cyanobacteria) and reveals insights that supplement those gleaned from the simulation study. Overall, this work provides several valuable insights into the accuracy of the considered methods that will help inform the choice of rooting methods to use when studying microbial gene family evolution. Among other findings, this study identifies parsimonious Duplication-Transfer-Loss (DTL) rooting and Minimal Ancestor Deviation (MAD) rooting as two of the most accurate gene tree rooting methods for prokaryotes and specifies the evolutionary conditions under which these methods are most accurate, demonstrates that DTL rooting is highly sensitive to high evolutionary rates and gene tree error, and that rooting methods based on branch-lengths are generally robust to gene tree reconstruction error.
Collapse
Affiliation(s)
- Taylor Wade
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, United States of America
| | - L. Thiberio Rangel
- Department of Earth, Atmospheric & Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States of America
| | - Soumya Kundu
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, United States of America
| | - Gregory P. Fournier
- Department of Earth, Atmospheric & Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States of America
| | - Mukul S. Bansal
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, United States of America
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, United States of America
| |
Collapse
|
35
|
Sevillya G, Doerr D, Lerner Y, Stoye J, Steel M, Snir S. Horizontal Gene Transfer Phylogenetics: A Random Walk Approach. Mol Biol Evol 2020; 37:1470-1479. [PMID: 31845962 DOI: 10.1093/molbev/msz302] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The dramatic decrease in time and cost for generating genetic sequence data has opened up vast opportunities in molecular systematics, one of which is the ability to decipher the evolutionary history of strains of a species. Under this fine systematic resolution, the standard markers are too crude to provide a phylogenetic signal. Nevertheless, among prokaryotes, genome dynamics in the form of horizontal gene transfer (HGT) between organisms and gene loss seem to provide far richer information by affecting both gene order and gene content. The "synteny index" (SI) between a pair of genomes combines these latter two factors, allowing comparison of genomes with unequal gene content, together with order considerations of their common genes. Although this approach is useful for classifying close relatives, no rigorous statistical modeling for it has been suggested. Such modeling is valuable, as it allows observed measures to be transformed into estimates of time periods during evolution, yielding the "additivity" of the measure. To the best of our knowledge, there is no other additivity proof for other gene order/content measures under HGT. Here, we provide a first statistical model and analysis for the SI measure. We model the "gene neighborhood" as a "birth-death-immigration" process affected by the HGT activity over the genome, and analytically relate the HGT rate and time to the expected SI. This model is asymptotic and thus provides accurate results, assuming infinite size genomes. Therefore, we also developed a heuristic model following an "exponential decay" function, accounting for biologically realistic values, which performed well in simulations. Applying this model to 1,133 prokaryotes partitioned to 39 clusters by the rank of genus yields that the average number of genome dynamics events per gene in the phylogenetic depth of genus is around half with significant variability between genera. This result extends and confirms similar results obtained for individual genera in different manners.
Collapse
Affiliation(s)
- Gur Sevillya
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| | - Daniel Doerr
- Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Yael Lerner
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| | - Jens Stoye
- Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Mike Steel
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Sagi Snir
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| |
Collapse
|
36
|
A Tale of Two Families: Whole Genome and Segmental Duplications Underlie Glutamine Synthetase and Phosphoenolpyruvate Carboxylase Diversity in Narrow-Leafed Lupin ( Lupinus angustifolius L.). Int J Mol Sci 2020; 21:ijms21072580. [PMID: 32276381 PMCID: PMC7177731 DOI: 10.3390/ijms21072580] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 04/04/2020] [Accepted: 04/06/2020] [Indexed: 01/04/2023] Open
Abstract
Narrow-leafed lupin (Lupinus angustifolius L.) has recently been supplied with advanced genomic resources and, as such, has become a well-known model for molecular evolutionary studies within the legume family—a group of plants able to fix nitrogen from the atmosphere. The phylogenetic position of lupins in Papilionoideae and their evolutionary distance to other higher plants facilitates the use of this model species to improve our knowledge on genes involved in nitrogen assimilation and primary metabolism, providing novel contributions to our understanding of the evolutionary history of legumes. In this study, we present a complex characterization of two narrow-leafed lupin gene families—glutamine synthetase (GS) and phosphoenolpyruvate carboxylase (PEPC). We combine a comparative analysis of gene structures and a synteny-based approach with phylogenetic reconstruction and reconciliation of the gene family and species history in order to examine events underlying the extant diversity of both families. Employing the available evidence, we show the impact of duplications on the initial complement of the analyzed gene families within the genistoid clade and posit that the function of duplicates has been largely retained. In terms of a broader perspective, our results concerning GS and PEPC gene families corroborate earlier findings pointing to key whole genome duplication/triplication event(s) affecting the genistoid lineage.
Collapse
|
37
|
Counting and sampling gene family evolutionary histories in the duplication-loss and duplication-loss-transfer models. J Math Biol 2020; 80:1353-1388. [PMID: 32060618 PMCID: PMC7052048 DOI: 10.1007/s00285-019-01465-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Revised: 11/18/2019] [Indexed: 10/28/2022]
Abstract
Given a set of species whose evolution is represented by a species tree, a gene family is a group of genes having evolved from a single ancestral gene. A gene family evolves along the branches of a species tree through various mechanisms, including-but not limited to-speciation ([Formula: see text]), gene duplication ([Formula: see text]), gene loss ([Formula: see text]), and horizontal gene transfer ([Formula: see text]). The reconstruction of a gene tree representing the evolution of a gene family constrained by a species tree is an important problem in phylogenomics. However, unlike in the multispecies coalescent evolutionary model that considers only speciation and incomplete lineage sorting events, very little is known about the search space for gene family histories accounting for gene duplication, gene loss and horizontal gene transfer (the [Formula: see text]-model). In this work, we introduce the notion of evolutionary histories defined as a binary ordered rooted tree describing the evolution of a gene family, constrained by a species tree in the [Formula: see text]-model. We provide formal grammars describing the set of all evolutionary histories that are compatible with a given species tree, whether it is ranked or unranked. These grammars allow us, using either analytic combinatorics or dynamic programming, to efficiently compute the number of histories of a given size, and also to generate random histories of a given size under the uniform distribution. We apply these tools to obtain exact asymptotics for the number of gene family histories for two species trees, the rooted caterpillar and complete binary tree, as well as estimates of the range of the exponential growth factor of the number of histories for random species trees of size up to 25. Our results show that including horizontal gene transfers induce a dramatic increase of the number of evolutionary histories. We also show that, within ranked species trees, the number of evolutionary histories in the [Formula: see text]-model is almost independent of the species tree topology. These results establish firm foundations for the development of ensemble methods for the prediction of reconciliations.
Collapse
|
38
|
Duchemin W, Gence G, Arigon Chifolleau AM, Arvestad L, Bansal MS, Berry V, Boussau B, Chevenet F, Comte N, Davín AA, Dessimoz C, Dylus D, Hasic D, Mallo D, Planel R, Posada D, Scornavacca C, Szöllosi G, Zhang L, Tannier É, Daubin V. RecPhyloXML: a format for reconciled gene trees. Bioinformatics 2019; 34:3646-3652. [PMID: 29762653 PMCID: PMC6198865 DOI: 10.1093/bioinformatics/bty389] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 05/09/2018] [Indexed: 12/21/2022] Open
Abstract
Motivation A reconciliation is an annotation of the nodes of a gene tree with evolutionary events—for example, speciation, gene duplication, transfer, loss, etc.—along with a mapping onto a species tree. Many algorithms and software produce or use reconciliations but often using different reconciliation formats, regarding the type of events considered or whether the species tree is dated or not. This complicates the comparison and communication between different programs. Results Here, we gather a consortium of software developers in gene tree species tree reconciliation to propose and endorse a format that aims to promote an integrative—albeit flexible—specification of phylogenetic reconciliations. This format, named recPhyloXML, is accompanied by several tools such as a reconciled tree visualizer and conversion utilities. Availability and implementation http://phylariane.univ-lyon1.fr/recphyloxml/.
Collapse
Affiliation(s)
- Wandrille Duchemin
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622 Villeurbanne, France.,INRIA Grenoble Rhône-Alpes, F-38334 Montbonnot, France.,MTA-ELTE Lendület Evolutionary Genomics Research Group, Budapest, Hungary.,Department of Biological Physics, Eötvös Loránd University, Budapest, Hungary
| | - Guillaume Gence
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622 Villeurbanne, France
| | - Anne-Muriel Arigon Chifolleau
- LIRMM, Université de Montpellier, CNRS, Montpellier, France.,Institut de Biologie Computationnelle (IBC), Montpellier, France
| | - Lars Arvestad
- Department of Mathematics, Stockholm University, Stockholm, Sweden.,Swedish e-Science Research Centre (SeRC), Stockholm, Sweden
| | - Mukul S Bansal
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT, USA.,Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| | - Vincent Berry
- LIRMM, Université de Montpellier, CNRS, Montpellier, France.,Institut de Biologie Computationnelle (IBC), Montpellier, France.,ISEM, CNRS, Université de Montpellier, IRD, EPHE, Montpellier, France
| | - Bastien Boussau
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622 Villeurbanne, France
| | - François Chevenet
- LIRMM, Université de Montpellier, CNRS, Montpellier, France.,MIVEGEC, CNRS 5290, IRD 224, Université de Montpellier, Montpellier, France
| | - Nicolas Comte
- INRIA Grenoble Rhône-Alpes, F-38334 Montbonnot, France
| | - Adrián A Davín
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622 Villeurbanne, France.,MTA-ELTE Lendület Evolutionary Genomics Research Group, Budapest, Hungary.,Department of Biological Physics, Eötvös Loránd University, Budapest, Hungary
| | - Christophe Dessimoz
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Genetics, Evolution and Environment, University College London, London, UK.,Department of Computer Science, University College London, London, UK.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - David Dylus
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Damir Hasic
- Department of Mathematics, Faculty of Science, University of Sarajevo, Sarajevo, Bosnia and Herzegovina
| | - Diego Mallo
- Virginia G. Piper Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Rémi Planel
- Laboratoire d'Analyse Bio-informatique en Génomique et Métabolisme CNRS-UMR 8030, Commissariat à l'Énergie Atomique (CEA), Institut de Génomique, Genoscope, Evry, France
| | - David Posada
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, Spain
| | - Celine Scornavacca
- Institut de Biologie Computationnelle (IBC), Montpellier, France.,ISEM, CNRS, Université de Montpellier, IRD, EPHE, Montpellier, France
| | - Gergely Szöllosi
- MTA-ELTE Lendület Evolutionary Genomics Research Group, Budapest, Hungary.,Department of Biological Physics, Eötvös Loránd University, Budapest, Hungary
| | - Louxin Zhang
- Department of Mathematics, National University of Singapore, Singapore, Singapore
| | - Éric Tannier
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622 Villeurbanne, France.,INRIA Grenoble Rhône-Alpes, F-38334 Montbonnot, France
| | - Vincent Daubin
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622 Villeurbanne, France
| |
Collapse
|
39
|
Douglas GM, Langille MGI. Current and Promising Approaches to Identify Horizontal Gene Transfer Events in Metagenomes. Genome Biol Evol 2019; 11:2750-2766. [PMID: 31504488 PMCID: PMC6777429 DOI: 10.1093/gbe/evz184] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/19/2019] [Indexed: 12/16/2022] Open
Abstract
High-throughput shotgun metagenomics sequencing has enabled the profiling of myriad natural communities. These data are commonly used to identify gene families and pathways that were potentially gained or lost in an environment and which may be involved in microbial adaptation. Despite the widespread interest in these events, there are no established best practices for identifying gene gain and loss in metagenomics data. Horizontal gene transfer (HGT) represents several mechanisms of gene gain that are especially of interest in clinical microbiology due to the rapid spread of antibiotic resistance genes in natural communities. Several additional mechanisms of gene gain and loss, including gene duplication, gene loss-of-function events, and de novo gene birth are also important to consider in the context of metagenomes but have been less studied. This review is largely focused on detecting HGT in prokaryotic metagenomes, but methods for detecting these other mechanisms are first discussed. For this article to be self-contained, we provide a general background on HGT and the different possible signatures of this process. Lastly, we discuss how improved assembly of genomes from metagenomes would be the most straight-forward approach for improving the inference of gene gain and loss events. Several recent technological advances could help improve metagenome assemblies: long-read sequencing, determining the physical proximity of contigs, optical mapping of short sequences along chromosomes, and single-cell metagenomics. The benefits and limitations of these advances are discussed and open questions in this area are highlighted.
Collapse
Affiliation(s)
- Gavin M Douglas
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Morgan G I Langille
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
40
|
Kundu S, Bansal MS. SaGePhy: an improved phylogenetic simulation framework for gene and subgene evolution. Bioinformatics 2019; 35:3496-3498. [PMID: 30715213 DOI: 10.1093/bioinformatics/btz081] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 01/21/2019] [Accepted: 01/31/2019] [Indexed: 11/14/2022] Open
Abstract
SUMMARY SaGePhy is a software package for improved phylogenetic simulation of gene and subgene evolution. SaGePhy can be used to generate species trees, gene trees and subgene or (protein) domain trees using a probabilistic birth-death process that allows for gene and subgene duplication, horizontal gene and subgene transfer and gene and subgene loss. SaGePhy implements a range of important features not found in other phylogenetic simulation frameworks/software. These include (i) simulation of subgene or domain level evolution inside one or more gene trees, (ii) simultaneous simulation of both additive and replacing horizontal gene/subgene transfers and (iii) probabilistic sampling of species tree and gene tree nodes, respectively, for gene- and domain-family birth. SaGePhy is open-source, platform independent and written in Java and Python. AVAILABILITY AND IMPLEMENTATION Executables, source code (open-source under the revised BSD license) and a detailed manual are freely available from http://compbio.engr.uconn.edu/software/sagephy/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Soumya Kundu
- Department of Computer Science & Engineering, Storrs, CT, USA
| | - Mukul S Bansal
- Department of Computer Science & Engineering, Storrs, CT, USA.,The Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
41
|
Horizontal Gene Transfer as an Indispensable Driver for Evolution of Neocallimastigomycota into a Distinct Gut-Dwelling Fungal Lineage. Appl Environ Microbiol 2019; 85:AEM.00988-19. [PMID: 31126947 DOI: 10.1128/aem.00988-19] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 05/19/2019] [Indexed: 01/01/2023] Open
Abstract
Survival and growth of the anaerobic gut fungi (AGF; Neocallimastigomycota) in the herbivorous gut necessitate the possession of multiple abilities absent in other fungal lineages. We hypothesized that horizontal gene transfer (HGT) was instrumental in forging the evolution of AGF into a phylogenetically distinct gut-dwelling fungal lineage. The patterns of HGT were evaluated in the transcriptomes of 27 AGF strains, 22 of which were isolated and sequenced in this study, and 4 AGF genomes broadly covering the breadth of AGF diversity. We identified 277 distinct incidents of HGT in AGF transcriptomes, with subsequent gene duplication resulting in an HGT frequency of 2 to 3.5% in AGF genomes. The majority of HGT events were AGF specific (91.7%) and wide (70.8%), indicating their occurrence at early stages of AGF evolution. The acquired genes allowed AGF to expand their substrate utilization range, provided new venues for electron disposal, augmented their biosynthetic capabilities, and facilitated their adaptation to anaerobiosis. The majority of donors were anaerobic fermentative bacteria prevalent in the herbivorous gut. This study strongly indicates that HGT indispensably forged the evolution of AGF as a distinct fungal phylum and provides a unique example of the role of HGT in shaping the evolution of a high-rank taxonomic eukaryotic lineage.IMPORTANCE The anaerobic gut fungi (AGF) represent a distinct basal phylum lineage (Neocallimastigomycota) commonly encountered in the rumen and alimentary tracts of herbivores. Survival and growth of anaerobic gut fungi in these anaerobic, eutrophic, and prokaryote-dominated habitats necessitates the acquisition of several traits absent in other fungal lineages. We assess here the role of horizontal gene transfer as a relatively fast mechanism for trait acquisition by the Neocallimastigomycota postsequestration in the herbivorous gut. Analysis of 27 transcriptomes that represent the broad diversity of Neocallimastigomycota identified 277 distinct HGT events, with subsequent gene duplication resulting in an HGT frequency of 2 to 3.5% in AGF genomes. These HGT events have allowed AGF to survive in the herbivorous gut by expanding their substrate utilization range, augmenting their biosynthetic pathway, providing new routes for electron disposal by expanding fermentative capacities, and facilitating their adaptation to anaerobiosis. HGT in the AGF is also shown to be mainly a cross-kingdom affair, with the majority of donors belonging to the bacteria. This study represents a unique example of the role of HGT in shaping the evolution of a high-rank taxonomic eukaryotic lineage.
Collapse
|
42
|
The Patchy Distribution of Restriction⁻Modification System Genes and the Conservation of Orphan Methyltransferases in Halobacteria. Genes (Basel) 2019; 10:genes10030233. [PMID: 30893937 PMCID: PMC6471742 DOI: 10.3390/genes10030233] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Revised: 03/13/2019] [Accepted: 03/14/2019] [Indexed: 12/26/2022] Open
Abstract
Restriction⁻modification (RM) systems in bacteria are implicated in multiple biological roles ranging from defense against parasitic genetic elements, to selfish addiction cassettes, and barriers to gene transfer and lineage homogenization. In bacteria, DNA-methylation without cognate restriction also plays important roles in DNA replication, mismatch repair, protein expression, and in biasing DNA uptake. Little is known about archaeal RM systems and DNA methylation. To elucidate further understanding for the role of RM systems and DNA methylation in Archaea, we undertook a survey of the presence of RM system genes and related genes, including orphan DNA methylases, in the halophilic archaeal class Halobacteria. Our results reveal that some orphan DNA methyltransferase genes were highly conserved among lineages indicating an important functional constraint, whereas RM systems demonstrated patchy patterns of presence and absence. This irregular distribution is due to frequent horizontal gene transfer and gene loss, a finding suggesting that the evolution and life cycle of RM systems may be best described as that of a selfish genetic element. A putative target motif (CTAG) of one of the orphan methylases was underrepresented in all of the analyzed genomes, whereas another motif (GATC) was overrepresented in most of the haloarchaeal genomes, particularly in those that encoded the cognate orphan methylase.
Collapse
|