1
|
Sengupta S, Azad RK. Leveraging comparative genomics to uncover alien genes in bacterial genomes. Microb Genom 2023; 9:mgen000939. [PMID: 36748570 PMCID: PMC9973850 DOI: 10.1099/mgen.0.000939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
A significant challenge in bacterial genomics is to catalogue genes acquired through the evolutionary process of horizontal gene transfer (HGT). Both comparative genomics and sequence composition-based methods have often been invoked to quantify horizontally acquired genes in bacterial genomes. Comparative genomics methods rely on completely sequenced genomes and therefore the confidence in their predictions increases as the databases become more enriched in completely sequenced genomes. Recent developments including in microbial genome sequencing call for reassessment of alien genes based on information-rich resources currently available. We revisited the comparative genomics approach and developed a new algorithm for alien gene detection. Our algorithm compared favourably with the existing comparative genomics-based methods and is capable of detecting both recent and ancient transfers. It can be used as a standalone tool or in concert with other complementary algorithms for comprehensively cataloguing alien genes in bacterial genomes.
Collapse
Affiliation(s)
- Soham Sengupta
- Department of Biological Sciences and BioDiscovery Institute, University of North Texas, Denton, Texas, 76203, USA
| | - Rajeev K Azad
- Department of Biological Sciences and BioDiscovery Institute, University of North Texas, Denton, Texas, 76203, USA.,Department of Mathematics, University of North Texas, Denton, Texas, 76203, USA
| |
Collapse
|
2
|
Choi Y, Ahn S, Park M, Lee S, Cho S, Kim H. HGTree v2.0: a comprehensive database update for horizontal gene transfer (HGT) events detected by the tree-reconciliation method. Nucleic Acids Res 2022; 51:D1010-D1018. [PMID: 36350646 PMCID: PMC9825516 DOI: 10.1093/nar/gkac929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 09/30/2022] [Accepted: 11/01/2022] [Indexed: 11/11/2022] Open
Abstract
HGTree is a database that provides horizontal gene transfer (HGT) event information on 2472 prokaryote genomes using the tree-reconciliation method. HGTree was constructed in 2015, and a large number of prokaryotic genomes have been additionally published since then. To cope with the rapid rise of prokaryotic genome data, we present HGTree v2.0 (http://hgtree2.snu.ac.kr), a newly updated version of our HGT database with much more extensive data, including a total of 20 536 completely sequenced non-redundant prokaryotic genomes, and more reliable HGT information results curated with various steps. As a result, HGTree v2.0 has a set of expanded data results of 6 361 199 putative horizontally transferred genes integrated with additional functional information such as the KEGG pathway, virulence factors and antimicrobial resistance. Furthermore, various visualization tools in the HGTree v2.0 database website provide intuitive biological insights, allowing the users to investigate their genomes of interest.
Collapse
Affiliation(s)
| | | | - Myeongkyu Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Republic of Korea
| | | | - Seoae Cho
- eGnome Inc., Seoul 05836, Republic of Korea
| | - Heebal Kim
- To whom correspondence should be addressed. Tel: +82 2 880 4803; Fax: +82 2 883 8812;
| |
Collapse
|
3
|
Sengupta S, Azad RK. Reconstructing horizontal gene flow network to understand prokaryotic evolution. Open Biol 2022; 12:220169. [PMID: 36446404 PMCID: PMC9708380 DOI: 10.1098/rsob.220169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Horizontal gene transfer (HGT) is a major source of phenotypic innovation and a mechanism of niche adaptation in prokaryotes. Quantification of HGT is critical to decipher its myriad roles in microbial evolution and adaptation. Advances in genome sequencing and bioinformatics have augmented our ability to understand the microbial world, particularly the direct or indirect influence of HGT on diverse life forms. Methods for detecting HGT can be classified into phylogenetic-based and parametric or composition-based approaches. Here, we exploited the complementary strengths of both the approaches to construct a high confidence horizontal gene flow network. Our network is unique in its ability to detect the transfer of native genes of a genome to genomes from other taxa, thus establishing donor and recipient organisms (taxa), rather than through a post hoc analysis as is the practice with several other approaches. The scale-free horizontal gene flow network presented here provides new insights into modes of transfer for the exchange of genetic information and also illuminates differential gene flow across phyla.
Collapse
Affiliation(s)
- Soham Sengupta
- Department of Biological Sciences and BioDiscovery Institute, University of North Texas, Denton, TX 76203, USA
| | - Rajeev K. Azad
- Department of Biological Sciences and BioDiscovery Institute, University of North Texas, Denton, TX 76203, USA,Department of Mathematics, University of North Texas, Denton, TX 76203, USA
| |
Collapse
|
4
|
Sengupta S, Azad RK. Reconstructing horizontal gene flow network to understand prokaryotic evolution. Open Biol 2022. [PMID: 36446404 DOI: 10.6084/m9.figshare.c.6307519] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Horizontal gene transfer (HGT) is a major source of phenotypic innovation and a mechanism of niche adaptation in prokaryotes. Quantification of HGT is critical to decipher its myriad roles in microbial evolution and adaptation. Advances in genome sequencing and bioinformatics have augmented our ability to understand the microbial world, particularly the direct or indirect influence of HGT on diverse life forms. Methods for detecting HGT can be classified into phylogenetic-based and parametric or composition-based approaches. Here, we exploited the complementary strengths of both the approaches to construct a high confidence horizontal gene flow network. Our network is unique in its ability to detect the transfer of native genes of a genome to genomes from other taxa, thus establishing donor and recipient organisms (taxa), rather than through a post hoc analysis as is the practice with several other approaches. The scale-free horizontal gene flow network presented here provides new insights into modes of transfer for the exchange of genetic information and also illuminates differential gene flow across phyla.
Collapse
Affiliation(s)
- Soham Sengupta
- Department of Biological Sciences and BioDiscovery Institute, University of North Texas, Denton, TX 76203, USA
| | - Rajeev K Azad
- Department of Biological Sciences and BioDiscovery Institute, University of North Texas, Denton, TX 76203, USA.,Department of Mathematics, University of North Texas, Denton, TX 76203, USA
| |
Collapse
|
5
|
Demirkiran O, Almaliti J, Leão T, Navarro G, Byrum T, Valeriote FA, Gerwick L, Gerwick WH. Portobelamides A and B and Caciqueamide, Cytotoxic Peptidic Natural Products from a Caldora sp. Marine Cyanobacterium. JOURNAL OF NATURAL PRODUCTS 2021; 84:2081-2093. [PMID: 34269583 DOI: 10.1021/acs.jnatprod.0c01383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Three new compounds, portobelamides A and B (1 and 2), 3-amino-2-methyl-7-octynoic acid (AMOYA) and hydroxyisovaleric acid (Hiva) containing cyclic depsipeptides, and one long chain lipopeptide caciqueamide (3), were isolated from a field-collection of a Caldora sp. marine cyanobacterium obtained from Panama as part of the Panama International Cooperative Biodiversity Group Program. Their planar structures were elucidated through analysis of 2D NMR and MS data, especially high resolution (HR) MS2/MS3 fragmentation methods. The absolute configurations of compounds 1 and 2 were deduced by traditional hydrolysis, derivative formation, and chromatographic analyses compared with standards. Portobelamide A (1) showed good cytotoxicity against H-460 human lung cancer cells (33% survival at 0.9 μM).
Collapse
Affiliation(s)
- Ozlem Demirkiran
- Department of Pharmacognosy, Faculty of Pharmacy, Trakya University, Edirne 22030, Turkey
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
| | - Jehad Almaliti
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
| | - Tiago Leão
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Gabriel Navarro
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
| | - Tara Byrum
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
| | - Frederick A Valeriote
- Henry Ford Health System, Department of Internal Medicine, Josephine Ford Cancer Center, Detroit, Michigan 48202, United States
| | - Lena Gerwick
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
| | - William H Gerwick
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92037, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|
6
|
Widespread divergent transcription from bacterial and archaeal promoters is a consequence of DNA-sequence symmetry. Nat Microbiol 2021; 6:746-756. [PMID: 33958766 PMCID: PMC7612053 DOI: 10.1038/s41564-021-00898-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 03/25/2021] [Indexed: 02/03/2023]
Abstract
Transcription initiates at promoters, DNA regions recognized by a DNA-dependent RNA polymerase. We previously identified horizontally acquired Escherichia coli promoters from which the direction of transcription was unclear. In the present study, we show that more than half of these promoters are bidirectional and drive divergent transcription. Using genome-scale approaches, we demonstrate that 19% of all transcription start sites detected in E. coli are associated with a bidirectional promoter. Bidirectional promoters are similarly common in diverse bacteria and archaea, and have inherent symmetry: specific bases required for transcription initiation are reciprocally co-located on opposite DNA strands. Bidirectional promoters enable co-regulation of divergent genes and are enriched in both intergenic and horizontally acquired regions. Divergent transcription is conserved among bacteria, archaea and eukaryotes, but the underlying mechanisms for bidirectionality are different.
Collapse
|
7
|
A Multi-Omics Characterization of the Natural Product Potential of Tropical Filamentous Marine Cyanobacteria. Mar Drugs 2021; 19:md19010020. [PMID: 33418911 PMCID: PMC7825088 DOI: 10.3390/md19010020] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 12/31/2020] [Accepted: 01/02/2021] [Indexed: 12/29/2022] Open
Abstract
Microbial natural products are important for the understanding of microbial interactions, chemical defense and communication, and have also served as an inspirational source for numerous pharmaceutical drugs. Tropical marine cyanobacteria have been highlighted as a great source of new natural products, however, few reports have appeared wherein a multi-omics approach has been used to study their natural products potential (i.e., reports are often focused on an individual natural product and its biosynthesis). This study focuses on describing the natural product genetic potential as well as the expressed natural product molecules in benthic tropical cyanobacteria. We collected from several sites around the world and sequenced the genomes of 24 tropical filamentous marine cyanobacteria. The informatics program antiSMASH was used to annotate the major classes of gene clusters. BiG-SCAPE phylum-wide analysis revealed the most promising strains for natural product discovery among these cyanobacteria. LCMS/MS-based metabolomics highlighted the most abundant molecules and molecular classes among 10 of these marine cyanobacterial samples. We observed that despite many genes encoding for peptidic natural products, peptides were not as abundant as lipids and lipopeptides in the chemical extracts. Our results highlight a number of highly interesting biosynthetic gene clusters for genome mining among these cyanobacterial samples.
Collapse
|
8
|
Bertelli C, Tilley KE, Brinkman FSL. Microbial genomic island discovery, visualization and analysis. Brief Bioinform 2020; 20:1685-1698. [PMID: 29868902 PMCID: PMC6917214 DOI: 10.1093/bib/bby042] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Revised: 04/30/2018] [Indexed: 12/27/2022] Open
Abstract
Horizontal gene transfer (also called lateral gene transfer) is a major mechanism for microbial genome evolution, enabling rapid adaptation and survival in specific niches. Genomic islands (GIs), commonly defined as clusters of bacterial or archaeal genes of probable horizontal origin, are of particular medical, environmental and/or industrial interest, as they disproportionately encode virulence factors and some antimicrobial resistance genes and may harbor entire metabolic pathways that confer a specific adaptation (solvent resistance, symbiosis properties, etc). As large-scale analyses of microbial genomes increases, such as for genomic epidemiology investigations of infectious disease outbreaks in public health, there is increased appreciation of the need to accurately predict and track GIs. Over the past decade, numerous computational tools have been developed to tackle the challenges inherent in accurate GI prediction. We review here the main types of GI prediction methods and discuss their advantages and limitations for a routine analysis of microbial genomes in this era of rapid whole-genome sequencing. An assessment is provided of 20 GI prediction software methods that use sequence-composition bias to identify the GIs, using a reference GI data set from 104 genomes obtained using an independent comparative genomics approach. Finally, we present guidelines to assist researchers in effectively identifying these key genomic regions.
Collapse
Affiliation(s)
- Claire Bertelli
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Keith E Tilley
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Fiona S L Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
9
|
Dotto BR, Carvalho EL, da Silva AF, Dezordi FZ, Pinto PM, Campos TDL, Rezende AM, Wallau GDL. HTT-DB: new features and updates. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4791205. [PMID: 29315358 PMCID: PMC7206651 DOI: 10.1093/database/bax102] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 12/12/2017] [Indexed: 11/25/2022]
Abstract
Horizontal Transfer (HT) of genetic material between species is a common phenomenon among Bacteria and Archaea species and several databases are available for information retrieval and data mining. However, little attention has been given to this phenomenon among eukaryotic species mainly due to the lower proportion of these events. In the last years, a vertiginous amount of new HT events involving eukaryotic species was reported in the literature, highlighting the need of a common repository to keep the scientific community up to date and describe overall trends. Recently, we published the first HT database focused on HT of transposable elements among eukaryotes: the Horizontal Transposon Transfer DataBase: Database URL: (http://lpa.saogabriel.unipampa.edu.br: 8080/httdatabase/). Here, we present new features and updates of this unique database: (i) its expansion to include virus-host exchange of genetic material, which we called Horizontal Virus Transfer (HVT) and (ii) the availability of a web server for HT detection, where we implemented the online version of vertical and horizontal inheritance consistence analysis (VHICA), an R package developed for HT detection. These improvements will help researchers to navigate through known HVT cases, take data-informed decision and export figures based on keywords searches. Moreover, the availability of the VHICA as an online tool will make this software easily reachable even for researchers with no or little computation knowledge as well as foster our capability to detect new HT events in a wide variety of taxa. Database URL: http://lpa.saogabriel.unipampa.edu.br:8080/httdatabase/
Collapse
Affiliation(s)
- Bruno Reis Dotto
- Campus São Gabriel, Universidade Federal do Pampa, Avenida Antonio Trilha, 1847, São Gabriel, Rio Grande do Sul
| | - Evelise Leis Carvalho
- Pos Graduation in Biological Sciences, Universidade Federal do Pampa, Avenida Antonio Trilha, 1847, São Gabriel, Rio Grande do Sul
| | - Alexandre Freitas da Silva
- Pos Graduation in Biosciences and Health Biotechnology, Aggeu Magalhães Institute (IAM), Avenida Professor Moraes Rego, s/n, Recife, Pernambuco
| | - Filipe Zimmer Dezordi
- Campus São Gabriel, Universidade Federal do Pampa, Avenida Antonio Trilha, 1847, São Gabriel, Rio Grande do Sul.,Pos Graduation in Biosciences and Health Biotechnology, Aggeu Magalhães Institute (IAM), Avenida Professor Moraes Rego, s/n, Recife, Pernambuco
| | - Paulo Marcos Pinto
- Campus São Gabriel, Universidade Federal do Pampa, Avenida Antonio Trilha, 1847, São Gabriel, Rio Grande do Sul.,Pos Graduation in Biological Sciences, Universidade Federal do Pampa, Avenida Antonio Trilha, 1847, São Gabriel, Rio Grande do Sul
| | - Tulio de Lima Campos
- Bioinformatic Core, Aggeu Magalhães Institute (IAM), Avenida Professor Moraes Rego, s/n, Recife, Pernambuco
| | - Antonio Mauro Rezende
- Department of Microbiology, Aggeu Magalhães Institute (IAM), Avenida Professor Moraes Rego, s/n, Recife, Pernambuco
| | - Gabriel da Luz Wallau
- Department of Entomology, Aggeu Magalhães Institute (IAM), Avenida Professor Moraes Rego, s/n, Recife, Pernambuco
| |
Collapse
|
10
|
Jamil HM. Optimizing Phylogenetic Queries for Performance. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1692-1705. [PMID: 28858810 DOI: 10.1109/tcbb.2017.2743706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
The vast majority of phylogenetic databases do not support declarative querying using which their contents can be flexibly and conveniently accessed and the template based query interfaces they support do not allow arbitrary speculative queries. They therefore also do not support query optimization leveraging unique phylogeny properties. While a small number of graph query languages such as XQuery, Cypher, and GraphQL exist for computer savvy users, most are too general and complex to be useful for biologists, and too inefficient for large phylogeny querying. In this paper, we discuss a recently introduced visual query language, called PhyQL, that leverages phylogeny specific properties to support essential and powerful constructs for a large class of phylogentic queries. We develop a range of pruning aids, and propose a substantial set of query optimization strategies using these aids suitable for large phylogeny querying. A hybrid optimization technique that exploits a set of indices and "graphlet" partitioning is discussed. A "fail soonest" strategy is used to avoid hopeless processing and is shown to produce dividends. Possible novel optimization techniques yet to be explored are also discussed.
Collapse
|
11
|
Paquola ACM, Asif H, Pereira CADB, Feltes BC, Bonatto D, Lima WC, Menck CFM. Horizontal Gene Transfer Building Prokaryote Genomes: Genes Related to Exchange Between Cell and Environment are Frequently Transferred. J Mol Evol 2018; 86:190-203. [DOI: 10.1007/s00239-018-9836-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Accepted: 03/15/2018] [Indexed: 10/17/2022]
|
12
|
Fuchsman CA, Collins RE, Rocap G, Brazelton WJ. Effect of the environment on horizontal gene transfer between bacteria and archaea. PeerJ 2017; 5:e3865. [PMID: 28975058 PMCID: PMC5624296 DOI: 10.7717/peerj.3865] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 09/08/2017] [Indexed: 01/24/2023] Open
Abstract
Background Horizontal gene transfer, the transfer and incorporation of genetic material between different species of organisms, has an important but poorly quantified role in the adaptation of microbes to their environment. Previous work has shown that genome size and the number of horizontally transferred genes are strongly correlated. Here we consider how genome size confuses the quantification of horizontal gene transfer because the number of genes an organism accumulates over time depends on its evolutionary history and ecological context (e.g., the nutrient regime for which it is adapted). Results We investigated horizontal gene transfer between archaea and bacteria by first counting reciprocal BLAST hits among 448 bacterial and 57 archaeal genomes to find shared genes. Then we used the DarkHorse algorithm, a probability-based, lineage-weighted method (Podell & Gaasterland, 2007), to identify potential horizontally transferred genes among these shared genes. By removing the effect of genome size in the bacteria, we have identified bacteria with unusually large numbers of shared genes with archaea for their genome size. Interestingly, archaea and bacteria that live in anaerobic and/or high temperature conditions are more likely to share unusually large numbers of genes. However, high salt was not found to significantly affect the numbers of shared genes. Numbers of shared (genome size-corrected, reciprocal BLAST hits) and transferred genes (identified by DarkHorse) were strongly correlated. Thus archaea and bacteria that live in anaerobic and/or high temperature conditions are more likely to share horizontally transferred genes. These horizontally transferred genes are over-represented by genes involved in energy conversion as well as the transport and metabolism of inorganic ions and amino acids. Conclusions Anaerobic and thermophilic bacteria share unusually large numbers of genes with archaea. This is mainly due to horizontal gene transfer of genes from the archaea to the bacteria. In general, these transfers are from archaea that live in similar oxygen and temperature conditions as the bacteria that receive the genes. Potential hotspots of horizontal gene transfer between archaea and bacteria include hot springs, marine sediments, and oil wells. Cold spots for horizontal transfer included dilute, aerobic, mesophilic environments such as marine and freshwater surface waters.
Collapse
Affiliation(s)
- Clara A Fuchsman
- School of Oceanography, University of Washington, Seattle, WA, United States of America
| | - Roy Eric Collins
- School of Oceanography, University of Washington, Seattle, WA, United States of America.,College of Fisheries and Ocean Sciences, University of Alaska-Fairbanks, Fairbanks, AK, United States of America
| | - Gabrielle Rocap
- School of Oceanography, University of Washington, Seattle, WA, United States of America
| | - William J Brazelton
- School of Oceanography, University of Washington, Seattle, WA, United States of America.,Department of Biology, University of Utah, Salt Lake City, UT, United States of America
| |
Collapse
|
13
|
Jeong H, Nasir A. A Preliminary List of Horizontally Transferred Genes in Prokaryotes Determined by Tree Reconstruction and Reconciliation. Front Genet 2017; 8:112. [PMID: 28894459 PMCID: PMC5581361 DOI: 10.3389/fgene.2017.00112] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2017] [Accepted: 08/14/2017] [Indexed: 12/03/2022] Open
Abstract
Genome-wide global detection of genes involved in horizontal gene transfer (HGT) remains an active area of research in medical microbiology and evolutionary genomics. Utilizing the explicit evolutionary method of comparing topologies of a total of 154,805 orthologous gene trees against corresponding 16S rRNA “reference” trees, we previously detected a total of 660,894 candidate HGT events in 2,472 completely-sequenced prokaryotic genomes. Here, we report an HGT-index for each individual gene-reference tree pair reconciliation, representing the total number of detected HGT events on the gene tree divided by the total number of genomes (taxa) member of that tree. HGT-index is thus a simple measure indicating the sensitivity of prokaryotic genes to participate (or not participate) in HGT. Our preliminary list provides HGT-indices for a total of 69,365 genes (detected in >10 and <50% available prokaryotic genomes) that are involved in a wide range of biological processes such as metabolism, information, and bacterial response to environment. Identification of horizontally-derived genes is important to combat antibiotic resistance and is a step forward toward reconstructions of improved phylogenies describing the history of life. Our effort is thus expected to benefit ongoing research in the fields of clinical microbiology and evolutionary biology.
Collapse
Affiliation(s)
- Hyeonsoo Jeong
- Department of Animal Sciences, University of Illinois at Urbana-ChampaignUrbana, IL, United States
| | - Arshan Nasir
- Department of Biosciences, COMSATS Institute of Information TechnologyIslamabad, Pakistan.,Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-ChampaignUrbana, IL, United States
| |
Collapse
|
14
|
Srivastava SS, Jamkhindikar AA, Raman R, Jobby MK, Chadalawada S, Sankaranarayanan R, Sharma Y. A Transition Metal-Binding, Trimeric βγ-Crystallin from Methane-Producing Thermophilic Archaea, Methanosaeta thermophila. Biochemistry 2017; 56:1299-1310. [DOI: 10.1021/acs.biochem.6b00985] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
| | | | - Rajeev Raman
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad 500 007, India
| | - Maroor K. Jobby
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad 500 007, India
| | - Swathi Chadalawada
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad 500 007, India
| | - Rajan Sankaranarayanan
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad 500 007, India
| | - Yogendra Sharma
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad 500 007, India
| |
Collapse
|
15
|
Jamil HM. A Visual Interface for Querying Heterogeneous Phylogenetic Databases. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:131-144. [PMID: 26812733 DOI: 10.1109/tcbb.2016.2520943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Despite the recent growth in the number of phylogenetic databases, access to these wealth of resources remain largely tool or form-based interface driven. It is our thesis that the flexibility afforded by declarative query languages may offer the opportunity to access these repositories in a better way, and to use such a language to pose truly powerful queries in unprecedented ways. In this paper, we propose a substantially enhanced closed visual query language, called PhyQL, that can be used to query phylogenetic databases represented in a canonical form. The canonical representation presented helps capture most phylogenetic tree formats in a convenient way, and is used as the storage model for our PhyloBase database for which PhyQL serves as the query language. We have implemented a visual interface for the end users to pose PhyQL queries using visual icons, and drag and drop operations defined over them. Once a query is posed, the interface translates the visual query into a Datalog query for execution over the canonical database. Responses are returned as hyperlinks to phylogenies that can be viewed in several formats using the tree viewers supported by PhyloBase. Results cached in PhyQL buffer allows secondary querying on the computed results making it a truly powerful querying architecture.
Collapse
|
16
|
Jeong H, Sung S, Kwon T, Seo M, Caetano-Anollés K, Choi SH, Cho S, Nasir A, Kim H. HGTree: database of horizontally transferred genes determined by tree reconciliation. Nucleic Acids Res 2015; 44:D610-9. [PMID: 26578597 PMCID: PMC4702880 DOI: 10.1093/nar/gkv1245] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 11/01/2015] [Indexed: 01/13/2023] Open
Abstract
The HGTree database provides putative genome-wide horizontal gene transfer (HGT) information for 2472 completely sequenced prokaryotic genomes. This task is accomplished by reconstructing approximate maximum likelihood phylogenetic trees for each orthologous gene and corresponding 16S rRNA reference species sets and then reconciling the two trees under parsimony framework. The tree reconciliation method is generally considered to be a reliable way to detect HGT events but its practical use has remained limited because the method is computationally intensive and conceptually challenging. In this regard, HGTree (http://hgtree.snu.ac.kr) represents a useful addition to the biological community and enables quick and easy retrieval of information for HGT-acquired genes to better understand microbial taxonomy and evolution. The database is freely available and can be easily scaled and updated to keep pace with the rapid rise in genomic information.
Collapse
Affiliation(s)
- Hyeonsoo Jeong
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, 151-741, Republic of Korea Department of Animal Sciences, University of Illinois, Urbana, IL 61801, USA
| | - Samsun Sung
- C&K genomics, Main Bldg. #514, SNU Research Park, Seoul 151-919, Republic of Korea
| | - Taehyung Kwon
- Department of Agricultural Biotechnology, Seoul National University, Seoul 151-742, Republic of Korea
| | - Minseok Seo
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, 151-741, Republic of Korea
| | | | - Sang Ho Choi
- National Research Laboratory of Molecular Microbiology and Toxicology, Department of Agricultural Biotechnology, Center for Food Safety and Toxicology, Seoul National University, Seoul 151-921, Republic of Korea
| | - Seoae Cho
- C&K genomics, Main Bldg. #514, SNU Research Park, Seoul 151-919, Republic of Korea
| | - Arshan Nasir
- Department of Biosciences, COMSATS Institute of Information Technology, Park Road, Chak Shahzad, Islamabad 45550, Pakistan
| | - Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, 151-741, Republic of Korea Department of Agricultural Biotechnology, Seoul National University, Seoul 151-742, Republic of Korea
| |
Collapse
|
17
|
Grazziotin AL, Vidal NM, Venancio TM. Uncovering major genomic features of essential genes in Bacteria and a methanogenic Archaea. FEBS J 2015; 282:3395-3411. [PMID: 26084810 DOI: 10.1111/febs.13350] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 06/02/2015] [Accepted: 06/15/2015] [Indexed: 12/19/2022]
Abstract
Identification of essential genes is critical to understanding the physiology of a species, proposing novel drug targets and uncovering minimal gene sets required for life. Although essential gene sets of several organisms have been determined using large-scale mutagenesis techniques, systematic studies addressing their conservation, genomic context and functions remain scant. Here we integrate 17 essential gene sets from genome-wide in vitro screenings and three gene collections required for growth in vivo, encompassing 15 Bacteria and one Archaea. We refine and generalize important theories proposed using Escherichia coli. Essential genes are typically monogenic and more conserved than nonessential genes. Genes required in vivo are less conserved than those essential in vitro, suggesting that more divergent strategies are deployed when the organism is stressed by the host immune system and unstable nutrient availability. We identified essential analogous pathways that would probably be missed by orthology-based essentiality prediction strategies. For example, Streptococcus sanguinis carries horizontally transferred isoprenoid biosynthesis genes that are widespread in Archaea. Genes specifically essential in Mycobacterium tuberculosis and Burkholderia pseudomallei are reported as potential drug targets. Moreover, essential genes are not only preferentially located in operons, but also occupy the first position therein, supporting the influence of their regulatory regions in driving transcription of whole operons. Finally, these important genomic features are shared between Bacteria and at least one Archaea, suggesting that high order properties of gene essentiality and genome architecture were probably present in the last universal common ancestor or evolved independently in the prokaryotic domains.
Collapse
Affiliation(s)
- Ana Laura Grazziotin
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil.,National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Newton Medeiros Vidal
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil.,National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Thiago Motta Venancio
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil
| |
Collapse
|
18
|
Álvarez-Canales G, Arellano-Álvarez G, González-Domenech CM, de la Cruz F, Moya A, Delaye L. Identification of Xenologs and Their Characteristic Low Expression Levels in the Cyanobacterium Synechococcus elongatus. J Mol Evol 2015; 80:292-304. [PMID: 26040248 DOI: 10.1007/s00239-015-9684-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2014] [Accepted: 05/28/2015] [Indexed: 02/07/2023]
Abstract
Horizontal gene transfer (HGT) is a central process in prokaryotic evolution. Once a gene is introduced into a genome by HGT, its contribution to the fitness of the recipient cell depends in part on its expression level. Here we show that in Synechococcus elongatus PCC 7942, xenologs derived from non-cyanobacterial sources exhibited lower expression levels than native genes in the genome. In accord with our observation, xenolog codon adaptation indexes also displayed relatively low expression values. These results are in agreement with previous reports that suggested the relative neutrality of most xenologs. However, we also demonstrated that some of the xenologs detected participated in cellular functions, including iron starvation acclimation and nitrate reduction, which corroborate the role of HGT in bacterial adaptation. For example, the expression levels of some of the xenologs detected are known to increase under iron-limiting conditions. We interpreted the overall pattern as an indication that there is a selection pressure against high expression levels of xenologs. However, when a xenolog protein product confers a selective advantage, natural selection can further modulate its expression level to meet the requirements of the recipient cell. In addition, we show that ORFans did not exhibit significantly lower expression levels than native genes in the genome, which suggested an origin other than xenology.
Collapse
Affiliation(s)
- Gilberto Álvarez-Canales
- Departamento de Ingeniería Genética, CINVESTAV-Irapuato, Km. 9.6 Libramiento Norte, Carretera Irapuato-León, 36821, Irapuato, Guanajuato, Mexico
| | | | | | | | | | | |
Collapse
|
19
|
Zhu Q, Kosoy M, Dittmar K. HGTector: an automated method facilitating genome-wide discovery of putative horizontal gene transfers. BMC Genomics 2014; 15:717. [PMID: 25159222 PMCID: PMC4155097 DOI: 10.1186/1471-2164-15-717] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2014] [Accepted: 08/20/2014] [Indexed: 11/23/2022] Open
Abstract
Background First pass methods based on BLAST match are commonly used as an initial step to separate the different phylogenetic histories of genes in microbial genomes, and target putative horizontal gene transfer (HGT) events. This will continue to be necessary given the rapid growth of genomic data and the technical difficulties in conducting large-scale explicit phylogenetic analyses. However, these methods often produce misleading results due to their inability to resolve indirect phylogenetic links and their vulnerability to stochastic events. Results A new computational method of rapid, exhaustive and genome-wide detection of HGT was developed, featuring the systematic analysis of BLAST hit distribution patterns in the context of a priori defined hierarchical evolutionary categories. Genes that fall beyond a series of statistically determined thresholds are identified as not adhering to the typical vertical history of the organisms in question, but instead having a putative horizontal origin. Tests on simulated genomic data suggest that this approach effectively targets atypically distributed genes that are highly likely to be HGT-derived, and exhibits robust performance compared to conventional BLAST-based approaches. This method was further tested on real genomic datasets, including Rickettsia genomes, and was compared to previous studies. Results show consistency with currently employed categories of HGT prediction methods. In-depth analysis of both simulated and real genomic data suggests that the method is notably insensitive to stochastic events such as gene loss, rate variation and database error, which are common challenges to the current methodology. An automated pipeline was created to implement this approach and was made publicly available at: https://github.com/DittmarLab/HGTector. The program is versatile, easily deployed, has a low requirement for computational resources. Conclusions HGTector is an effective tool for initial or standalone large-scale discovery of candidate HGT-derived genes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-717) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Qiyun Zhu
- Department of Biological Sciences, University at Buffalo, State University of New York, 109 Cooke Hall, Buffalo, NY 14260, USA.
| | | | | |
Collapse
|
20
|
Grilli J, Romano M, Bassetti F, Cosentino Lagomarsino M. Cross-species gene-family fluctuations reveal the dynamics of horizontal transfers. Nucleic Acids Res 2014; 42:6850-60. [PMID: 24829449 PMCID: PMC4066789 DOI: 10.1093/nar/gku378] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Prokaryotes vary their protein repertoire mainly through horizontal transfer and gene loss. To elucidate the links between these processes and the cross-species gene-family statistics, we perform a large-scale data analysis of the cross-species variability of gene-family abundance (the number of members of the family found on a given genome). We find that abundance fluctuations are related to the rate of horizontal transfers. This is rationalized by a minimal theoretical model, which predicts this link. The families that are not captured by the model show abundance profiles that are markedly peaked around a mean value, possibly because of specific abundance selection. Based on these results, we define an abundance variability index that captures a family's evolutionary behavior (and thus some of its relevant functional properties) purely based on its cross-species abundance fluctuations. Analysis and model, combined, show a quantitative link between cross-species family abundance statistics and horizontal transfer dynamics, which can be used to analyze genome ‘flux’. Groups of families with different values of the abundance variability index correspond to genome sub-parts having different plasticity in terms of the level of horizontal exchange allowed by natural selection.
Collapse
Affiliation(s)
- Jacopo Grilli
- Dipartimento di Fisica e Astronomia "G. Galilei", Università di Padova, Via Marzolo 8, I-35131 Padova, Italy
| | - Mariacristina Romano
- Dipartimento di Fisica, Università degli Studi di Milano, via Celoria, 16, 20133 Milano, Italy
| | - Federico Bassetti
- Università di Pavia, Dipartimento di Matematica, via Ferrata 1, 27100 Pavia, Italy
| | - Marco Cosentino Lagomarsino
- CNRS, UMR 7238, Paris, France Sorbonne Universités, UPMC Université Paris 06, UMR 7238 Computational and Quantitative Biology, Genomic Physics Group, 15 rue de l'École de Médecine, Paris, France
| |
Collapse
|
21
|
Kodavali PK, Dudkiewicz M, Pikuła S, Pawłowski K. Bioinformatics analysis of bacterial annexins--putative ancestral relatives of eukaryotic annexins. PLoS One 2014; 9:e85428. [PMID: 24454864 PMCID: PMC3894181 DOI: 10.1371/journal.pone.0085428] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Accepted: 12/03/2013] [Indexed: 11/19/2022] Open
Abstract
Annexins are Ca(2+)-binding, membrane-interacting proteins, widespread among eukaryotes, consisting usually of four structurally similar repeated domains. It is accepted that vertebrate annexins derive from a double genome duplication event. It has been postulated that a single domain annexin, if found, might represent a molecule related to the hypothetical ancestral annexin. The recent discovery of a single-domain annexin in a bacterium, Cytophaga hutchinsonii, apparently confirmed this hypothesis. Here, we present a more complex picture. Using remote sequence similarity detection tools, a survey of bacterial genomes was performed in search of annexin-like proteins. In total, we identified about thirty annexin homologues, including single-domain and multi-domain annexins, in seventeen bacterial species. The thorough search yielded, besides the known annexin homologue from C. hutchinsonii, homologues from the Bacteroidetes/Chlorobi phylum, from Gemmatimonadetes, from beta- and delta-Proteobacteria, and from Actinobacteria. The sequences of bacterial annexins exhibited remote but statistically significant similarity to sequence profiles built of the eukaryotic ones. Some bacterial annexins are equipped with additional, different domains, for example those characteristic for toxins. The variation in bacterial annexin sequences, much wider than that observed in eukaryotes, and different domain architectures suggest that annexins found in bacteria may actually descend from an ancestral bacterial annexin, from which eukaryotic annexins also originate. The hypothesis of an ancient origin of bacterial annexins has to be reconciled with the fact that remarkably few bacterial strains possess annexin genes compared to the thousands of known bacterial genomes and with the patchy, anomalous phylogenetic distribution of bacterial annexins. Thus, a massive annexin gene loss in several bacterial lineages or very divergent evolution would appear a likely explanation. Alternative evolutionary scenarios, involving horizontal gene transfer between bacteria and protozoan eukaryotes, in either direction, appear much less likely. Altogether, current evidence does not allow unequivocal judgement as to the origin of bacterial annexins.
Collapse
Affiliation(s)
- Praveen Kumar Kodavali
- Department of Biochemistry, Nencki Institute of Experimental Biology, Polish Academy of Sciences, Warsaw, Poland
| | - Małgorzata Dudkiewicz
- Faculty of Agriculture and Biology, Warsaw University of Life Sciences, Warsaw, Poland
| | - Sławomir Pikuła
- Department of Biochemistry, Nencki Institute of Experimental Biology, Polish Academy of Sciences, Warsaw, Poland
| | - Krzysztof Pawłowski
- Faculty of Agriculture and Biology, Warsaw University of Life Sciences, Warsaw, Poland
| |
Collapse
|
22
|
Seasonal fluctuations in ionic concentrations drive microbial succession in a hypersaline lake community. ISME JOURNAL 2013; 8:979-90. [PMID: 24335829 DOI: 10.1038/ismej.2013.221] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2013] [Revised: 11/05/2013] [Accepted: 11/10/2013] [Indexed: 02/01/2023]
Abstract
Microbial community succession was examined over a two-year period using spatially and temporally coordinated water chemistry measurements, metagenomic sequencing, phylogenetic binning and de novo metagenomic assembly in the extreme hypersaline habitat of Lake Tyrrell, Victoria, Australia. Relative abundances of Haloquadratum-related sequences were positively correlated with co-varying concentrations of potassium, magnesium and sulfate, but not sodium, chloride or calcium ions, while relative abundances of Halorubrum, Haloarcula, Halonotius, Halobaculum and Salinibacter-related sequences correlated negatively with Haloquadratum and these same ionic factors. Nanohaloarchaea and Halorhabdus-related sequence abundances were inversely correlated with each other, but not other taxonomic groups. These data, along with predicted gene functions from nearly-complete assembled population metagenomes, suggest different ecological phenotypes for Nanohaloarchaea and Halorhabdus-related strains versus other community members. Nucleotide percent G+C compositions were consistently lower in community metagenomic reads from summer versus winter samples. The same seasonal G+C trends were observed within taxonomically binned read subsets from each of seven different genus-level archaeal groups. Relative seasonal abundances were also linked to percent G+C for assembled population genomes. Together, these data suggest that extreme ionic conditions may exert selective pressure on archaeal populations at the level of genomic nucleotide composition, thus contributing to seasonal successional processes. Despite the unavailability of cultured representatives for most of the organisms identified in this study, effective coordination of physical and biological measurements has enabled discovery and quantification of unexpected taxon-specific, environmentally mediated factors influencing microbial community structure.
Collapse
|
23
|
Yue J, Sun G, Hu X, Huang J. The scale and evolutionary significance of horizontal gene transfer in the choanoflagellate Monosiga brevicollis. BMC Genomics 2013; 14:729. [PMID: 24156600 PMCID: PMC4046809 DOI: 10.1186/1471-2164-14-729] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 10/17/2013] [Indexed: 12/29/2022] Open
Abstract
Background It is generally agreed that horizontal gene transfer (HGT) is common in phagotrophic protists. However, the overall scale of HGT and the cumulative impact of acquired genes on the evolution of these organisms remain largely unknown. Results Choanoflagellates are phagotrophs and the closest living relatives of animals. In this study, we performed phylogenomic analyses to investigate the scale of HGT and the evolutionary importance of horizontally acquired genes in the choanoflagellate Monosiga brevicollis. Our analyses identified 405 genes that are likely derived from algae and prokaryotes, accounting for approximately 4.4% of the Monosiga nuclear genome. Many of the horizontally acquired genes identified in Monosiga were probably acquired from food sources, rather than by endosymbiotic gene transfer (EGT) from obsolete endosymbionts or plastids. Of 193 genes identified in our analyses with functional information, 84 (43.5%) are involved in carbohydrate or amino acid metabolism, and 45 (23.3%) are transporters and/or involved in response to oxidative, osmotic, antibiotic, or heavy metal stresses. Some identified genes may also participate in biosynthesis of important metabolites such as vitamins C and K12, porphyrins and phospholipids. Conclusions Our results suggest that HGT is frequent in Monosiga brevicollis and might have contributed substantially to its adaptation and evolution. This finding also highlights the importance of HGT in the genome and organismal evolution of phagotrophic eukaryotes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-14-729) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | - Jinling Huang
- Department of Biology, East Carolina University, Greenville, NC 27858, USA.
| |
Collapse
|
24
|
Lenart A, Dudkiewicz M, Grynberg M, Pawłowski K. CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites. PLoS One 2013; 8:e62272. [PMID: 23671590 PMCID: PMC3650047 DOI: 10.1371/journal.pone.0062272] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 03/19/2013] [Indexed: 01/08/2023] Open
Abstract
The zinc-dependent metalloproteases with His-Glu-x-x-His (HExxH) active site motif, zincins, are a broad group of proteins involved in many metabolic and regulatory functions, and found in all forms of life. Human genome contains more than 100 genes encoding proteins with known zincin-like domains. A survey of all proteins containing the HExxH motif shows that approximately 52% of HExxH occurrences fall within known protein structural domains (as defined in the Pfam database). Domain families with majority of members possessing a conserved HExxH motif include, not surprisingly, many known and putative metalloproteases. Furthermore, several HExxH-containing protein domains thus identified can be confidently predicted to be putative peptidases of zincin fold. Thus, we predict zincin-like fold for eight uncharacterised Pfam families. Besides the domains with the HExxH motif strictly conserved, and those with sporadic occurrences, intermediate families are identified that contain some members with a conserved HExxH motif, but also many homologues with substitutions at the conserved positions. Such substitutions can be evolutionarily conserved and non-random, yet functional roles of these inactive zincins are not known. The CLCAs are a novel zincin-like protease family with many cases of substituted active sites. We show that this allegedly metazoan family has a number of bacterial and archaeal members. An extremely patchy phylogenetic distribution of CLCAs in prokaryotes and their conserved protein domain composition strongly suggests an evolutionary scenario of horizontal gene transfer (HGT) from multicellular eukaryotes to bacteria, providing an example of eukaryote-derived xenologues in bacterial genomes. Additionally, in a protein family identified here as closely homologous to CLCA, the CLCA_X (CLCA-like) family, a number of proteins is found in phages and plasmids, supporting the HGT scenario.
Collapse
Affiliation(s)
- Anna Lenart
- Department of Cellular and Molecular Neurobiology, Nencki Institute of Experimental Biology, Polish Academy of Sciences, Warsaw, Poland
| | - Małgorzata Dudkiewicz
- Faculty of Agriculture and Biology, Warsaw University of Life Sciences, Warsaw, Poland
| | - Marcin Grynberg
- Department of Genetics, Institute of Biochemistry and Biophysics, Polish Academy of Sciences,Warsaw, Poland
| | - Krzysztof Pawłowski
- Department of Cellular and Molecular Neurobiology, Nencki Institute of Experimental Biology, Polish Academy of Sciences, Warsaw, Poland
- Faculty of Agriculture and Biology, Warsaw University of Life Sciences, Warsaw, Poland
- * E-mail:
| |
Collapse
|
25
|
Liu L, Chen X, Skogerbø G, Zhang P, Chen R, He S, Huang DW. The human microbiome: A hot spot of microbial horizontal gene transfer. Genomics 2012; 100:265-70. [DOI: 10.1016/j.ygeno.2012.07.012] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2012] [Revised: 07/06/2012] [Accepted: 07/16/2012] [Indexed: 12/19/2022]
|
26
|
Baudoux AC, Hendrix RW, Lander GC, Bailly X, Podell S, Paillard C, Johnson JE, Potter CS, Carragher B, Azam F. Genomic and functional analysis of Vibrio phage SIO-2 reveals novel insights into ecology and evolution of marine siphoviruses. Environ Microbiol 2012; 14:2071-86. [PMID: 22225728 PMCID: PMC3338904 DOI: 10.1111/j.1462-2920.2011.02685.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
We report on a genomic and functional analysis of a novel marine siphovirus, the Vibrio phage SIO-2. This phage is lytic for related Vibrio species of great ecological interest including the broadly antagonistic bacterium Vibrio sp. SWAT3 as well as notable members of the Harveyi clade (V.harveyi ATTC BAA-1116 and V.campbellii ATCC 25920). Vibrio phage SIO-2 has a circularly permuted genome of 80598 bp, which displays unusual features. This genome is larger than that of most known siphoviruses and only 38 of the 116 predicted proteins had homologues in databases. Another divergence is manifest by the origin of core genes, most of which share robust similarities with unrelated viruses and bacteria spanning a wide range of phyla. These core genes are arranged in the same order as in most bacteriophages but they are unusually interspaced at two places with insertions of DNA comprising a high density of uncharacterized genes. The acquisition of these DNA inserts is associated with morphological variation of SIO-2 capsid, which assembles as a large (80 nm) shell with a novel T=12 symmetry. These atypical structural features confer on SIO-2 a remarkable stability to a variety of physical, chemical and environmental factors. Given this high level of functional and genomic novelty, SIO-2 emerges as a model of considerable interest in ecological and evolutionary studies.
Collapse
Affiliation(s)
- A-C Baudoux
- Scripps Institution of Oceanography, Marine Biology Research Division, University of California San Diego, La Jolla, CA 92093, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Zaneveld JRR, Parfrey LW, Van Treuren W, Lozupone C, Clemente JC, Knights D, Stombaugh J, Kuczynski J, Knight R. Combined phylogenetic and genomic approaches for the high-throughput study of microbial habitat adaptation. Trends Microbiol 2011; 19:472-82. [PMID: 21872475 PMCID: PMC3184378 DOI: 10.1016/j.tim.2011.07.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Revised: 07/22/2011] [Accepted: 07/25/2011] [Indexed: 01/21/2023]
Abstract
High-throughput sequencing technologies provide new opportunities to address longstanding questions about habitat adaptation in microbial organisms. How have microbes managed to adapt to such a wide range of environments, and what genomic features allow for such adaptation? We review recent large-scale studies of habitat adaptation, with emphasis on those that utilize phylogenetic techniques. On the basis of current trends, we summarize methodological challenges faced by investigators, and the tools, techniques and analytical approaches available to overcome them. Phylogenetic approaches and detailed information about each environmental sample will be crucial as the ability to collect genome sequences continues to expand.
Collapse
Affiliation(s)
- Jesse R R Zaneveld
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO 80309, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Narasingarao P, Podell S, Ugalde JA, Brochier-Armanet C, Emerson JB, Brocks JJ, Heidelberg KB, Banfield JF, Allen EE. De novo metagenomic assembly reveals abundant novel major lineage of Archaea in hypersaline microbial communities. ISME JOURNAL 2011; 6:81-93. [PMID: 21716304 DOI: 10.1038/ismej.2011.78] [Citation(s) in RCA: 227] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
This study describes reconstruction of two highly unusual archaeal genomes by de novo metagenomic assembly of multiple, deeply sequenced libraries from surface waters of Lake Tyrrell (LT), a hypersaline lake in NW Victoria, Australia. Lineage-specific probes were designed using the assembled genomes to visualize these novel archaea, which were highly abundant in the 0.1-0.8 μm size fraction of lake water samples. Gene content and inferred metabolic capabilities were highly dissimilar to all previously identified hypersaline microbial species. Distinctive characteristics included unique amino acid composition, absence of Gvp gas vesicle proteins, atypical archaeal metabolic pathways and unusually small cell size (approximately 0.6 μm diameter). Multi-locus phylogenetic analyses demonstrated that these organisms belong to a new major euryarchaeal lineage, distantly related to halophilic archaea of class Halobacteria. Consistent with these findings, we propose creation of a new archaeal class, provisionally named 'Nanohaloarchaea'. In addition to their high abundance in LT surface waters, we report the prevalence of Nanohaloarchaea in other hypersaline environments worldwide. The simultaneous discovery and genome sequencing of a novel yet ubiquitous lineage of uncultivated microorganisms demonstrates that even historically well-characterized environments can reveal unexpected diversity when analyzed by metagenomics, and advances our understanding of the ecology of hypersaline environments and the evolutionary history of the archaea.
Collapse
Affiliation(s)
- Priya Narasingarao
- Marine Biology Research Division, Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Zheng H, Wu H. Short prokaryotic DNA fragment binning using a hierarchical classifier based on linear discriminant analysis and principal component analysis. J Bioinform Comput Biol 2011; 8:995-1011. [PMID: 21121023 DOI: 10.1142/s0219720010005051] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2010] [Accepted: 07/31/2010] [Indexed: 11/18/2022]
Abstract
Metagenomics is an emerging field in which the power of genomic analysis is applied to an entire microbial community, bypassing the need to isolate and culture individual microbial species. Assembling of metagenomic DNA fragments is very much like the overlap-layout-consensus procedure for assembling isolated genomes, but is augmented by an additional binning step to differentiate scaffolds, contigs and unassembled reads into various taxonomic groups. In this paper, we employed n-mer oligonucleotide frequencies as the features and developed a hierarchical classifier (PCAHIER) for binning short (≤ 1,000 bps) metagenomic fragments. The principal component analysis was used to reduce the high dimensionality of the feature space. The hierarchical classifier consists of four layers of local classifiers that are implemented based on the linear discriminant analysis. These local classifiers are responsible for binning prokaryotic DNA fragments into superkingdoms, of the same superkingdom into phyla, of the same phylum into genera, and of the same genus into species, respectively. We evaluated the performance of the PCAHIER by using our own simulated data sets as well as the widely used simHC synthetic metagenome data set from the IMG/M system. The effectiveness of the PCAHIER was demonstrated through comparisons against a non-hierarchical classifier, and two existing binning algorithms (TETRA and Phylopythia).
Collapse
Affiliation(s)
- Hao Zheng
- School of Electrical and Computer Engineering, Georgia Institute of Technology, 210 Technology Circle, Savannah, GA 31407, USA.
| | | |
Collapse
|
30
|
Penn K, Jenkins C, Nett M, Udwary DW, Gontang EA, McGlinchey RP, Foster B, Lapidus A, Podell S, Allen EE, Moore BS, Jensen PR. Genomic islands link secondary metabolism to functional adaptation in marine Actinobacteria. ISME JOURNAL 2009; 3:1193-203. [PMID: 19474814 PMCID: PMC2749086 DOI: 10.1038/ismej.2009.58] [Citation(s) in RCA: 150] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Genomic islands have been shown to harbor functional traits that differentiate ecologically distinct populations of environmental bacteria. A comparative analysis of the complete genome sequences of the marine Actinobacteria Salinispora tropica and S. arenicola reveals that 75% of the species-specific genes are located in 21 genomic islands. These islands are enriched in genes associated with secondary metabolite biosynthesis providing evidence that secondary metabolism is linked to functional adaptation. Secondary metabolism accounts for 8.8% and 10.9% of the genes in the S. tropica and S. arenicola genomes, respectively, and represents the major functional category of annotated genes that differentiates the two species. Genomic islands harbor all 25 of the species-specific biosynthetic pathways, the majority of which occur in S. arenicola and may contribute to the cosmopolitan distribution of this species. Genome evolution is dominated by gene duplication and acquisition, which in the case of secondary metabolism provide immediate opportunities for the production of new bioactive products. Evidence that secondary metabolic pathways are exchanged horizontally, coupled with prior evidence for fixation among globally distributed populations, supports a functional role and suggests that the acquisition of natural product biosynthetic gene clusters represents a previously unrecognized force driving bacterial diversification. Species-specific differences observed in CRISPR (clustered regularly interspaced short palindromic repeat) sequences suggest that S. arenicola may possess a higher level of phage immunity, while a highly duplicated family of polymorphic membrane proteins provides evidence of a new mechanism of marine adaptation in Gram-positive bacteria.
Collapse
Affiliation(s)
- Kevin Penn
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|