1
|
Barcytė D, Jaške K, Pánek T, Yurchenko T, Ševčíková T, Eliášová A, Eliáš M. A cryptic plastid and a novel mitochondrial plasmid in Leucomyxa plasmidifera gen. and sp. nov. (Ochrophyta) push the frontiers of organellar biology. Open Biol 2024; 14:240022. [PMID: 39474867 PMCID: PMC11528492 DOI: 10.1098/rsob.240022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 05/22/2024] [Accepted: 09/02/2024] [Indexed: 11/03/2024] Open
Abstract
Complete plastid loss seems to be very rare among secondarily non-photosynthetic eukaryotes. Leukarachnion sp. PRA-24, an amoeboid colourless protist related to the photosynthetic algal class Synchromophyceae (Ochrophyta), is a candidate for such a case based on a previous investigation by transmission electron microscopy. Here, we characterize this organism in further detail and describe it as Leucomyxa plasmidifera gen. et sp. nov., additionally demonstrating it is the first known representative of a broader clade of non-photosynthetic ochrophytes. We recovered its complete plastid genome, exhibiting a reduced gene set similar to plastomes of other non-photosynthetic ochrophytes, yet being even more extreme in sequence divergence. Identification of components of the plastid protein import machinery in the L. plasmidifera transcriptome assembly corroborated that the organism possesses a cryptic plastid organelle. According to our bioinformatic reconstruction, the plastid contains a unique combination of biosynthetic pathways producing haem, a folate precursor and tocotrienols. As another twist to its organellar biology, L. plasmidifera turned out to contain an unusual long insertion in its mitogenome related to a newly discovered mitochondrial plasmid exhibiting unprecedented features in terms of its size and coding capacity. Combined, our work uncovered further striking outcomes of the evolutionary course of semiautonomous organelles in protists.
Collapse
Affiliation(s)
- Dovilė Barcytė
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
| | - Karin Jaške
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
| | - Tomáš Pánek
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Viničná 7, Prague 2,128 43, Czech Republic
| | - Tatiana Yurchenko
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
| | - Tereza Ševčíková
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
| | - Anežka Eliášová
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
| | - Marek Eliáš
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, Ostrava710 00, Czech Republic
| |
Collapse
|
2
|
Jiang C, Gu S, Pan T, Wang X, Qin W, Wang G, Gao X, Zhang J, Chen K, Warren A, Xiong J, Miao W. Dynamics and timing of diversification events of ciliated eukaryotes from a large phylogenomic perspective. Mol Phylogenet Evol 2024; 197:108110. [PMID: 38768875 DOI: 10.1016/j.ympev.2024.108110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 05/17/2024] [Accepted: 05/17/2024] [Indexed: 05/22/2024]
Abstract
Ciliophora, an exceptionally diverse lineage of unicellular eukaryotes, exhibits a remarkable range of species richness across classes in the ciliate Tree of Life. In this study, we have acquired transcriptome and genome data from 40 representative species in seven ciliate classes. Utilizing 247 genes and 105 taxa, we devised a comprehensive phylogenomic tree for Ciliophora, encompassing over 60 % of orders and constituting the most extensive dataset of ciliate species to date. We established a robust phylogenetic framework that encompasses ambiguous taxa and the major classes within the phylum. Our findings support the monophyly of each of two subphyla (Postciliodesmatophora and Intramacronucleata), along with three subclades (Protocruzia, CONTHREEP, and SAPML) nested within Intramacronucleata, and elucidate evolutionary positions among the major classes within the phylum. Drawing on the robust ciliate Tree of Life and three constraints, we estimated the radiation of Ciliophora around 1175 Ma during the middle of the Proterozoic Eon, and most of the ciliate classes diverged from their sister lineage during the latter half of this period. Additionally, based on the time-calibrated tree and species richness pattern, we investigated net diversification rates of Ciliophora and its classes. The global net diversification rate for Ciliophora was estimated at 0.004979 species/Ma. Heterogeneity in net diversification rates was evident at the class level, with faster rates observed in Oligohymenophorea and Spirotrichea than other classes within the subclades CONTHREEP and SAPML, respectively. Notably, our analysis suggests that variations in net diversification rates, rather than clade ages, appear to contribute to the differences in species richness in Ciliophora at the class level.
Collapse
Affiliation(s)
- Chuanqi Jiang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Siyu Gu
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Tingting Pan
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Xueyan Wang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Weiwei Qin
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Guangying Wang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Xinxin Gao
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Jing Zhang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Kai Chen
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Alan Warren
- Department of Life Sciences, Natural History Museum, London, UK
| | - Jie Xiong
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Chinese Academy of Sciences, Wuhan, China
| | - Wei Miao
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Chinese Academy of Sciences, Wuhan, China; Hubei Hongshan Laboratory, Wuhan, China.
| |
Collapse
|
3
|
Cedrola F, Senra MVX, Morales MJA, Fregulia P, Canesin L, Dias RJP, Solferini VN. Giants' cooperation: a draft genome of the giant ciliate Muniziella cunhai suggests its ecological role in the capybara's digestive metabolism. Microb Genom 2024; 10:001263. [PMID: 38953769 PMCID: PMC11316547 DOI: 10.1099/mgen.0.001263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 06/03/2024] [Indexed: 07/04/2024] Open
Abstract
Several hundred ciliate species live in animals' guts as a part of their microbiome. Among them, Muniziella cunhai (Trichostomatia, Pycnotrichidae), the largest described ciliate, is found exclusively associated with Hydrochoerus hydrochaeris (capybara), the largest known rodent reaching up to 90 kg. Here, we present the sequence, structural and functional annotation of this giant microeukaryote macronuclear genome and discuss its phylogenetic placement. The 85 Mb genome is highly AT rich (GC content 25.71 %) and encodes a total of 11 397 protein-coding genes, of which 2793 could have their functions predicted with automated functional assignments. Functional annotation showed that M. cunhai can digest recalcitrant structural carbohydrates, non-structural carbohydrates, and microbial cell walls, suggesting a role in diet metabolization and in microbial population control in the capybara's intestine. Moreover, the phylogenetic placement of M. cunhai provides insights on the origins of gigantism in the subclass Trichostomatia.
Collapse
Affiliation(s)
- Franciane Cedrola
- Laboratório de Diversidade Genética, Departamento de Genética, Evolução, Microbiologia e Imunologia, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil
| | | | - Millke Jasmine Arminini Morales
- Laboratório de Diversidade Genética, Departamento de Genética, Evolução, Microbiologia e Imunologia, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil
| | - Priscila Fregulia
- Laboratório de Protozoologia, Programa de Pós-Graduação em Biodiversidade e Conservação da Natureza, Instituto de Ciências Biológicas, Universidade Federal de Juiz de Fora, Juiz de Fora, Minas Gerais, Brazil
| | | | - Roberto Júnio Pedroso Dias
- Laboratório de Protozoologia, Programa de Pós-Graduação em Biodiversidade e Conservação da Natureza, Instituto de Ciências Biológicas, Universidade Federal de Juiz de Fora, Juiz de Fora, Minas Gerais, Brazil
| | - Vera Nisaka Solferini
- Laboratório de Diversidade Genética, Departamento de Genética, Evolução, Microbiologia e Imunologia, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil
- Center for Computational Engineering and Sciences, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil
| |
Collapse
|
4
|
Gao X, Chen K, Xiong J, Zou D, Yang F, Ma Y, Jiang C, Gao X, Wang G, Gu S, Zhang P, Luo S, Huang K, Bao Y, Zhang Z, Ma L, Miao W. The P10K database: a data portal for the protist 10 000 genomes project. Nucleic Acids Res 2024; 52:D747-D755. [PMID: 37930867 PMCID: PMC10767852 DOI: 10.1093/nar/gkad992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/03/2023] [Accepted: 10/17/2023] [Indexed: 11/08/2023] Open
Abstract
Protists, a highly diverse group of microscopic eukaryotic organisms distinct from fungi, animals and plants, exert crucial roles within the earth's biosphere. However, the genomes of only a small fraction of known protist species have been published and made publicly accessible. To address this constraint, the Protist 10 000 Genomes Project (P10K) was initiated, implementing a specialized pipeline for single-cell genome/transcriptome assembly, decontamination and annotation of protists. The resultant P10K database (https://ngdc.cncb.ac.cn/p10k/) serves as a comprehensive platform, collating and disseminating genome sequences and annotations from diverse protist groups. Currently, the P10K database has incorporated 2959 genomes and transcriptomes, including 1101 newly sequenced datasets by P10K and 1858 publicly available datasets. Notably, it covers 45% of the protist orders, with a significant representation (53% coverage) of ciliates, featuring nearly a thousand genomes/transcriptomes. Intriguingly, analysis of the unique codon table usage among ciliates has revealed differences compared to the NCBI taxonomy system, suggesting a need to revise the codon tables used for these species. Collectively, the P10K database serves as a valuable repository of genetic resources for protist research and aims to expand its collection by incorporating more sequenced data and advanced analysis tools to benefit protist studies worldwide.
Collapse
Affiliation(s)
- Xinxin Gao
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kai Chen
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Jie Xiong
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Chinese Academy of Sciences, Wuhan 430072, China
| | - Dong Zou
- China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Fangdian Yang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Yingke Ma
- China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Chuanqi Jiang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Xiaoxuan Gao
- Shandong University of Technology, Zibo 255000, China
| | - Guangying Wang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Siyu Gu
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peng Zhang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Shuai Luo
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Kaiyao Huang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- Key laboratory of Lake and Watershed Science for Water Security, Chinese Academy of Sciences, Nanjing 210008, China
| | - Yiming Bao
- University of Chinese Academy of Sciences, Beijing 100049, China
- China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhang Zhang
- University of Chinese Academy of Sciences, Beijing 100049, China
- China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Lina Ma
- University of Chinese Academy of Sciences, Beijing 100049, China
- China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Wei Miao
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- Key laboratory of Lake and Watershed Science for Water Security, Chinese Academy of Sciences, Nanjing 210008, China
- Hubei Hongshan Laboratory, Wuhan 430070, China
| |
Collapse
|
5
|
Pfennig A, Lomsadze A, Borodovsky M. MgCod: Gene Prediction in Phage Genomes with Multiple Genetic Codes. J Mol Biol 2023; 435:168159. [PMID: 37244571 DOI: 10.1016/j.jmb.2023.168159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 05/19/2023] [Accepted: 05/21/2023] [Indexed: 05/29/2023]
Abstract
Massive sequencing of microbiomes has led to the discovery of a large number of phage genomes with intermittent stop codon recoding. We have developed a computational tool, MgCod, that identifies genomic regions (blocks) with distinct stop codon recoding simultaneously with the prediction of protein-coding regions. When MgCod was used to scan a large volume of human metagenomic contigs hundreds of viral contigs with intermittent stop codon recoding were revealed. Many of these contigs originated from genomes of known crAssphages. Further analyses had shown that intermittent recoding was associated with subtle patterns in the organization of protein-coding genes, such as 'single-coding' and 'dual-coding'. The dual-coding genes, clustered into blocks, could be translated by two alternative codes producing nearly identical proteins. It was observed that the dual-coded blocks were enriched with the early-stage phage genes, while the late-stage genes were residing in the single-coded blocks. MgCod can identify types of stop codon recoding in novel genomic sequences in parallel with gene prediction. It is available for download from https://github.com/gatech-genemark/MgCod.
Collapse
Affiliation(s)
- Aaron Pfennig
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| | - Alexandre Lomsadze
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| | - Mark Borodovsky
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA; School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| |
Collapse
|
6
|
Nikolaeva OV, Beregova AM, Efeykin BD, Miroliubova TS, Zhuravlev AY, Ivantsov AY, Mikhailov KV, Spiridonov SE, Aleoshin VV. Expression of Hairpin-Enriched Mitochondrial DNA in Two Hairworm Species (Nematomorpha). Int J Mol Sci 2023; 24:11411. [PMID: 37511167 PMCID: PMC10380579 DOI: 10.3390/ijms241411411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Revised: 07/07/2023] [Accepted: 07/08/2023] [Indexed: 07/30/2023] Open
Abstract
Nematomorpha (hairworms) is a phylum of parasitic ecdysozoans, best known for infecting arthropods and guiding their hosts toward water, where the parasite can complete its life cycle. Over 350 species of nematomorphs have been described, yet molecular data for the group remain scarce. The few available mitochondrial genomes of nematomorphs are enriched with long inverted repeats, which are embedded in the coding sequences of their genes-a remarkably unusual feature exclusive to this phylum. Here, we obtain and annotate the repeats in the mitochondrial genome of another nematomorph species-Parachordodes pustulosus. Using genomic and transcriptomic libraries, we investigate the impact of inverted repeats on the read coverage of the mitochondrial genome. Pronounced drops in the read coverage coincide with regions containing long inverted repeats, denoting the 'blind spots' of short-fragment sequencing libraries. Phylogenetic inference with the novel data reveals multiple disagreements between the traditional system of Nematomorpha and molecular data, rendering several genera paraphyletic, including Parachordodes.
Collapse
Affiliation(s)
- Olga V. Nikolaeva
- Belozersky Institute of Physicochemical Biology, Lomonosov Moscow State University, Leninskie Gory Str., 1, Bld. 40, Moscow 119991, Russia
| | - Aleksandra M. Beregova
- Belozersky Institute of Physicochemical Biology, Lomonosov Moscow State University, Leninskie Gory Str., 1, Bld. 40, Moscow 119991, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory Str., 1, Bld. 73, Moscow 119991, Russia
| | - Boris D. Efeykin
- Severtsov Institute of Ecology and Evolution, Russian Academy of Sciences, Leninskiy Ave., 33, Moscow 119071, Russia (S.E.S.)
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Bolshoy Karetny Per., 19, Bld. 1, Moscow 127051, Russia
| | - Tatiana S. Miroliubova
- Severtsov Institute of Ecology and Evolution, Russian Academy of Sciences, Leninskiy Ave., 33, Moscow 119071, Russia (S.E.S.)
| | - Andrey Yu. Zhuravlev
- Faculty of Biology, Lomonosov Moscow State University, Leninskie Gory Str., 1, Bld. 12, Moscow 119991, Russia
| | - Andrey Yu. Ivantsov
- Borissiak Palaeontological Institute, Russian Academy of Sciences, Profsoyuznaya Str., 123, Moscow 117647, Russia
| | - Kirill V. Mikhailov
- Belozersky Institute of Physicochemical Biology, Lomonosov Moscow State University, Leninskie Gory Str., 1, Bld. 40, Moscow 119991, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Bolshoy Karetny Per., 19, Bld. 1, Moscow 127051, Russia
| | - Sergei E. Spiridonov
- Severtsov Institute of Ecology and Evolution, Russian Academy of Sciences, Leninskiy Ave., 33, Moscow 119071, Russia (S.E.S.)
| | - Vladimir V. Aleoshin
- Belozersky Institute of Physicochemical Biology, Lomonosov Moscow State University, Leninskie Gory Str., 1, Bld. 40, Moscow 119991, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Bolshoy Karetny Per., 19, Bld. 1, Moscow 127051, Russia
| |
Collapse
|
7
|
Senra MVX. In silico characterization of cysteine-stabilized αβ defensins from neglected unicellular microeukaryotes. BMC Microbiol 2023; 23:82. [PMID: 36966312 PMCID: PMC10040121 DOI: 10.1186/s12866-023-02817-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 03/09/2023] [Indexed: 03/27/2023] Open
Abstract
BACKGROUND The emergence of multi-resistant pathogens have increased dramatically in recent years, becoming a major public-health concern. Among other promising antimicrobial molecules with potential to assist in this worldwide struggle, cysteine-stabilized αβ (CS-αβ) defensins are attracting attention due their efficacy, stability, and broad spectrum against viruses, bacteria, fungi, and protists, including many known human pathogens. RESULTS Here, 23 genomes of ciliated protists were screened and two CS-αβ defensins with a likely antifungal activity were identified and characterized, using bioinformatics, from a culturable freshwater species, Laurentiella sp. (LsAMP-1 and LsAMP-2). Although any potential cellular ligand could be predicted for LsAMP-2; evidences from structural, molecular dynamics, and docking analyses suggest that LsAMP-1 may form stably associations with phosphatidylinositol 4,5-bisphosphates (PIP2), a phospholipid found on many eukaryotic cells, which could, in turn, represent an anchorage mechanism within plasma membrane of targeted cells. CONCLUSION These data stress that more biotechnology-oriented studies should be conducted on neglected protists, such ciliates, which could become valuable sources of novel bioactive molecules for therapeutic uses.
Collapse
|
8
|
Shulgina Y, Eddy SR. Codetta: predicting the genetic code from nucleotide sequence. Bioinformatics 2023; 39:6895099. [PMID: 36511586 PMCID: PMC9825746 DOI: 10.1093/bioinformatics/btac802] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 11/10/2022] [Indexed: 12/15/2022] Open
Abstract
SUMMARY Codetta is a Python program for predicting the genetic code table of an organism from nucleotide sequences. Codetta can analyze an arbitrary nucleotide sequence and needs no sequence annotation or taxonomic placement. The most likely amino acid decoding for each of the 64 codons is inferred from alignments of profile hidden Markov models of conserved proteins to the input sequence. AVAILABILITY AND IMPLEMENTATION Codetta 2.0 is implemented as a Python 3 program for MacOS and Linux and is available from http://eddylab.org/software/codetta/codetta2.tar.gz and at http://github.com/kshulgina/codetta. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yekaterina Shulgina
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| | | |
Collapse
|
9
|
Mikhailov KV, Karpov SA, Letcher PM, Lee PA, Logacheva MD, Penin AA, Nesterenko MA, Pozdnyakov IR, Potapenko EV, Sherbakov DY, Panchin YV, Aleoshin VV. Genomic analysis reveals cryptic diversity in aphelids and sheds light on the emergence of Fungi. Curr Biol 2022; 32:4607-4619.e7. [PMID: 36126656 DOI: 10.1016/j.cub.2022.08.071] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Revised: 06/26/2022] [Accepted: 08/24/2022] [Indexed: 11/28/2022]
Abstract
Over the past decade, molecular phylogenetics has reshaped our understanding of the fungal tree of life by unraveling a hitherto elusive diversity of the protistan relatives of Fungi. Aphelida constitutes one of these novel deep branches that precede the emergence of osmotrophic fungal lifestyle and hold particular significance as the pathogens of algae. Here, we obtain and analyze the genomes of aphelid species Amoeboaphelidium protococcarum and Amoeboaphelidium occidentale. Genomic data unmask the vast divergence between these species, hidden behind their morphological similarity, and reveal hybrid genomes with a complex evolutionary history in two strains of A. protococcarum. We confirm the proposed sister relationship between Aphelida and Fungi using phylogenomic analysis and chart the reduction of characteristic proteins involved in phagocytic activity in the evolution of Holomycota. Annotation of aphelid genomes demonstrates the retention of actin nucleation-promoting complexes associated with phagocytosis and amoeboid motility and also reveals a conspicuous expansion of receptor-like protein kinases, uncharacteristic of fungal lineages. We find that aphelids possess multiple carbohydrate-processing enzymes that are involved in fungal cell wall synthesis but do not display rich complements of algal cell-wall-processing enzymes, suggesting an independent origin of fungal plant-degrading capabilities. Aphelid genomes show that the emergence of Fungi from phagotrophic ancestors relied on a common cell wall synthetic machinery but required a different set of proteins for digestion and interaction with the environment.
Collapse
Affiliation(s)
- Kirill V Mikhailov
- Belozersky Institute for Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119992, Russian Federation; Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russian Federation.
| | - Sergey A Karpov
- Zoological Institute, Russian Academy of Sciences, St. Petersburg 199034, Russian Federation; Biological Faculty, St. Petersburg State University, St. Petersburg 199034, Russian Federation
| | - Peter M Letcher
- Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL 35487-0344, USA
| | - Philip A Lee
- Allegheny Science and Technology, Bridgeport, WV 26330, USA
| | - Maria D Logacheva
- Belozersky Institute for Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119992, Russian Federation; Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russian Federation; Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow 121205, Russian Federation
| | - Aleksey A Penin
- Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russian Federation
| | - Maksim A Nesterenko
- Zoological Institute, Russian Academy of Sciences, St. Petersburg 199034, Russian Federation; Biological Faculty, St. Petersburg State University, St. Petersburg 199034, Russian Federation
| | - Igor R Pozdnyakov
- Zoological Institute, Russian Academy of Sciences, St. Petersburg 199034, Russian Federation
| | - Evgenii V Potapenko
- Institute of Evolution, University of Haifa, Haifa 3498838, Israel; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa 3498838, Israel
| | - Dmitry Y Sherbakov
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk 664033, Russian Federation; Novosibirsk State University, Novosibirsk 630090, Russian Federation
| | - Yuri V Panchin
- Belozersky Institute for Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119992, Russian Federation; Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russian Federation
| | - Vladimir V Aleoshin
- Belozersky Institute for Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119992, Russian Federation; Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russian Federation
| |
Collapse
|
10
|
Andrade-Martínez JS, Camelo Valera LC, Chica Cárdenas LA, Forero-Junco L, López-Leal G, Moreno-Gallego JL, Rangel-Pineros G, Reyes A. Computational Tools for the Analysis of Uncultivated Phage Genomes. Microbiol Mol Biol Rev 2022; 86:e0000421. [PMID: 35311574 PMCID: PMC9199400 DOI: 10.1128/mmbr.00004-21] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Over a century of bacteriophage research has uncovered a plethora of fundamental aspects of their biology, ecology, and evolution. Furthermore, the introduction of community-level studies through metagenomics has revealed unprecedented insights on the impact that phages have on a range of ecological and physiological processes. It was not until the introduction of viral metagenomics that we began to grasp the astonishing breadth of genetic diversity encompassed by phage genomes. Novel phage genomes have been reported from a diverse range of biomes at an increasing rate, which has prompted the development of computational tools that support the multilevel characterization of these novel phages based solely on their genome sequences. The impact of these technologies has been so large that, together with MAGs (Metagenomic Assembled Genomes), we now have UViGs (Uncultivated Viral Genomes), which are now officially recognized by the International Committee for the Taxonomy of Viruses (ICTV), and new taxonomic groups can now be created based exclusively on genomic sequence information. Even though the available tools have immensely contributed to our knowledge of phage diversity and ecology, the ongoing surge in software programs makes it challenging to keep up with them and the purpose each one is designed for. Therefore, in this review, we describe a comprehensive set of currently available computational tools designed for the characterization of phage genome sequences, focusing on five specific analyses: (i) assembly and identification of phage and prophage sequences, (ii) phage genome annotation, (iii) phage taxonomic classification, (iv) phage-host interaction analysis, and (v) phage microdiversity.
Collapse
Affiliation(s)
- Juan Sebastián Andrade-Martínez
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - Laura Carolina Camelo Valera
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - Luis Alberto Chica Cárdenas
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - Laura Forero-Junco
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- Department of Plant and Environmental Science, University of Copenhagen, Frederiksberg, Denmark
| | - Gamaliel López-Leal
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - J. Leonardo Moreno-Gallego
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Guillermo Rangel-Pineros
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Alejandro Reyes
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| |
Collapse
|
11
|
Shulgina Y, Eddy SR. A computational screen for alternative genetic codes in over 250,000 genomes. eLife 2021; 10:71402. [PMID: 34751130 PMCID: PMC8629427 DOI: 10.7554/elife.71402] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 10/26/2021] [Indexed: 11/25/2022] Open
Abstract
The genetic code has been proposed to be a ‘frozen accident,’ but the discovery of alternative genetic codes over the past four decades has shown that it can evolve to some degree. Since most examples were found anecdotally, it is difficult to draw general conclusions about the evolutionary trajectories of codon reassignment and why some codons are affected more frequently. To fill in the diversity of genetic codes, we developed Codetta, a computational method to predict the amino acid decoding of each codon from nucleotide sequence data. We surveyed the genetic code usage of over 250,000 bacterial and archaeal genome sequences in GenBank and discovered five new reassignments of arginine codons (AGG, CGA, and CGG), representing the first sense codon changes in bacteria. In a clade of uncultivated Bacilli, the reassignment of AGG to become the dominant methionine codon likely evolved by a change in the amino acid charging of an arginine tRNA. The reassignments of CGA and/or CGG were found in genomes with low GC content, an evolutionary force that likely helped drive these codons to low frequency and enable their reassignment. All life forms rely on a ‘code’ to translate their genetic information into proteins. This code relies on limited permutations of three nucleotides – the building blocks that form DNA and other types of genetic information. Each ‘triplet’ of nucleotides – or codon – encodes a specific amino acid, the basic component of proteins. Reading the sequence of codons in the right order will let the cell know which amino acid to assemble next on a growing protein. For instance, the codon CGG – formed of the nucleotides guanine (G) and cytosine (C) – codes for the amino acid arginine. From bacteria to humans, most life forms rely on the same genetic code. Yet certain organisms have evolved to use slightly different codes, where one or several codons have an altered meaning. To better understand how alternative genetic codes have evolved, Shulgina and Eddy set out to find more organisms featuring these altered codons, creating a new software called Codetta that can analyze the genome of a microorganism and predict the genetic code it uses. Codetta was then used to sift through the genetic information of 250,000 microorganisms. This was made possible by the sequencing, in recent years, of the genomes of hundreds of thousands of bacteria and other microorganisms – including many never studied before. These analyses revealed five groups of bacteria with alternative genetic codes, all of which had changes in the codons that code for arginine. Amongst these, four had genomes with a low proportion of guanine and cytosine nucleotides. This may have made some guanine and cytosine-rich arginine codons very rare in these organisms and, therefore, easier to be reassigned to encode another amino acid. The work by Shulgina and Eddy demonstrates that Codetta is a new, useful tool that scientists can use to understand how genetic codes evolve. In addition, it can also help to ensure the accuracy of widely used protein databases, which assume which genetic code organisms use to predict protein sequences from their genomes.
Collapse
Affiliation(s)
| | - Sean R Eddy
- Molecular & Cellular Biology, Harvard University, Cambridge, United States
| |
Collapse
|
12
|
Xavier Senra MV, Fonseca AL. New tyrosinases with putative action against contaminants of emerging concern. Proteins 2021; 89:1180-1192. [PMID: 33969540 DOI: 10.1002/prot.26139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/16/2021] [Accepted: 04/30/2021] [Indexed: 11/07/2022]
Abstract
Tyrosinases (EC 1.14.18.1) are type-3 copper metalloenzymes with strong oxidative capacities and low allosteric selectivity to phenolic and non-phenolic aromatic compounds, which have been used as biosensors and biocatalysts to mitigate the impacts of environmental contaminants over aquatic ecosystems. However, the widespread use of these polyphenol oxidases is limited by elevated production costs and restricted knowledge on their spectrum of action. Here, six tyrosinase homologs were identified and characterized from the genomes of four widespread freshwater ciliates using bioinformatics. Next, we performed a virtual screening to calculate binding energies between 3D models of these homologs and ~ 1000 contaminants of emerging concern (CECs), as an indirect approach to identify likely and unlikely targets for tyrosinases. Many fine chemicals, pharmaceuticals, personal care products, illicit drugs, natural toxins, and pesticides exhibited strong binding energies to these new tyrosinases, suggesting the spectrum of targets of these enzymes might be considerably broader than previously thought. Many ciliates, including those carrying tyrosinase genes, are fast-growing unicellular microeukaryotes that can be efficiently cultured, at large scales, under in vitro conditions, suggesting these organisms should be regarded as potential low-cost sources of new environmental biotechnological molecules.
Collapse
Affiliation(s)
| | - Ana Lúcia Fonseca
- Instituto de Recursos Naturais, Universidade Federal de Itajubá, Itajubá, Minas Gerais, Brazil
| |
Collapse
|
13
|
Žihala D, Salamonová J, Eliáš M. Evolution of the genetic code in the mitochondria of Labyrinthulea (Stramenopiles). Mol Phylogenet Evol 2020; 152:106908. [PMID: 32702525 DOI: 10.1016/j.ympev.2020.106908] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 06/30/2020] [Accepted: 07/15/2020] [Indexed: 02/02/2023]
Abstract
Mitochondrial translation often exhibits departures from the standard genetic code, but the full spectrum of these changes has certainly not yet been described and the molecular mechanisms behind the changes in codon meaning are rarely studied. Here we report a detailed analysis of the mitochondrial genetic code in the stramenopile group Labyrinthulea (Labyrinthulomycetes) and their relatives. In the genus Aplanochytrium, UAG is not a termination codon but encodes tyrosine, in contrast to the unaffected meaning of the UAA codon. This change is evolutionarily independent of the reassignment of both UAG and UAA as tyrosine codons recently reported from two uncultivated labyrinthuleans (S2 and S4), which we show are not thraustochytrids as proposed before, but represent the clade LAB14 previously recognised in environmental 18S rRNA gene surveys. We provide rigorous evidence that the UUA codon in the mitochondria of all labyrinthuleans serves as a termination codon instead of encoding leucine, and propose that a sense-to-stop reassignment has also affected the AGG and AGA codons in the LAB14 clade. The distribution of the different forms of sense-to-stop and stop-to-sense reassignments correlates with specific modifications of the mitochondrial release factor mtRF2a in different subsets of labyrinthuleans, and with the unprecedented loss of mtRF1a in Aplanochytrium and perhaps also in the LAB14 clade, pointing towards a possible mechanistic basis of the code changes observed. Curiously, we show that labyrinthulean mitochondria also exhibit a sense-to-sense codon reassignment, manifested as AUA encoding methionine instead of isoleucine. Furthermore, we show that this change evolved independently in the uncultivated stramenopile lineage MAST8b, together with the reassignment of the AGR codons from arginine to serine. Altogether, our study has uncovered novel variants of the mitochondrial genetic code and previously unknown modifications of the mitochondrial translation machinery, further enriching our understanding of the rules governing the evolution of one of the central molecular process in the cell.
Collapse
Affiliation(s)
- David Žihala
- Department of Biology and Ecology & Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic
| | - Jana Salamonová
- Department of Biology and Ecology & Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic
| | - Marek Eliáš
- Department of Biology and Ecology & Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic.
| |
Collapse
|
14
|
Žihala D, Eliáš M. Evolution and Unprecedented Variants of the Mitochondrial Genetic Code in a Lineage of Green Algae. Genome Biol Evol 2020; 11:2992-3007. [PMID: 31617565 PMCID: PMC6821328 DOI: 10.1093/gbe/evz210] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2019] [Indexed: 12/15/2022] Open
Abstract
Mitochondria of diverse eukaryotes have evolved various departures from the standard genetic code, but the breadth of possible modifications and their phylogenetic distribution are known only incompletely. Furthermore, it is possible that some codon reassignments in previously sequenced mitogenomes have been missed, resulting in inaccurate protein sequences in databases. Here we show, considering the distribution of codons at conserved amino acid positions in mitogenome-encoded proteins, that mitochondria of the green algal order Sphaeropleales exhibit a diversity of codon reassignments, including previously missed ones and some that are unprecedented in any translation system examined so far, necessitating redefinition of existing translation tables and creating at least seven new ones. We resolve a previous controversy concerning the meaning the UAG codon in Hydrodictyaceae, which beyond any doubt encodes alanine. We further demonstrate that AGG, sometimes together with AGA, encodes alanine instead of arginine in diverse sphaeroplealeans. Further newly detected changes include Arg-to-Met reassignment of the AGG codon and Arg-to-Leu reassignment of the CGG codon in particular species. Analysis of tRNAs specified by sphaeroplealean mitogenomes provides direct support for and molecular underpinning of the proposed reassignments. Furthermore, we point to unique mutations in the mitochondrial release factor mtRF1a that correlate with changes in the use of termination codons in Sphaeropleales, including the two independent stop-to-sense UAG reassignments, the reintroduction of UGA in some Scenedesmaceae, and the sense-to-stop reassignment of UCA widespread in the group. Codon disappearance seems to be the main drive of the dynamic evolution of the mitochondrial genetic code in Sphaeropleales.
Collapse
Affiliation(s)
- David Žihala
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Czech Republic.,Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Czech Republic
| | - Marek Eliáš
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Czech Republic.,Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Czech Republic
| |
Collapse
|
15
|
Al-Shayeb B, Sachdeva R, Chen LX, Ward F, Munk P, Devoto A, Castelle CJ, Olm MR, Bouma-Gregson K, Amano Y, He C, Méheust R, Brooks B, Thomas A, Lavy A, Matheus-Carnevali P, Sun C, Goltsman DSA, Borton MA, Sharrar A, Jaffe AL, Nelson TC, Kantor R, Keren R, Lane KR, Farag IF, Lei S, Finstad K, Amundson R, Anantharaman K, Zhou J, Probst AJ, Power ME, Tringe SG, Li WJ, Wrighton K, Harrison S, Morowitz M, Relman DA, Doudna JA, Lehours AC, Warren L, Cate JHD, Santini JM, Banfield JF. Clades of huge phages from across Earth's ecosystems. Nature 2020; 578:425-431. [PMID: 32051592 PMCID: PMC7162821 DOI: 10.1038/s41586-020-2007-4] [Citation(s) in RCA: 285] [Impact Index Per Article: 57.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 01/02/2020] [Indexed: 12/31/2022]
Abstract
Bacteriophages typically have small genomes1 and depend on their bacterial hosts for replication2. Here we sequenced DNA from diverse ecosystems and found hundreds of phage genomes with lengths of more than 200 kilobases (kb), including a genome of 735 kb, which is-to our knowledge-the largest phage genome to be described to date. Thirty-five genomes were manually curated to completion (circular and no gaps). Expanded genetic repertoires include diverse and previously undescribed CRISPR-Cas systems, transfer RNAs (tRNAs), tRNA synthetases, tRNA-modification enzymes, translation-initiation and elongation factors, and ribosomal proteins. The CRISPR-Cas systems of phages have the capacity to silence host transcription factors and translational genes, potentially as part of a larger interaction network that intercepts translation to redirect biosynthesis to phage-encoded functions. In addition, some phages may repurpose bacterial CRISPR-Cas systems to eliminate competing phages. We phylogenetically define the major clades of huge phages from human and other animal microbiomes, as well as from oceans, lakes, sediments, soils and the built environment. We conclude that the large gene inventories of huge phages reflect a conserved biological strategy, and that the phages are distributed across a broad bacterial host range and across Earth's ecosystems.
Collapse
Affiliation(s)
- Basem Al-Shayeb
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Rohan Sachdeva
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Lin-Xing Chen
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Fred Ward
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Patrick Munk
- National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Audra Devoto
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Cindy J Castelle
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Matthew R Olm
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Keith Bouma-Gregson
- Earth and Planetary Science, University of California Berkeley, Berkeley, CA, USA
| | - Yuki Amano
- Nuclear Fuel Cycle Engineering Laboratories, Japan Atomic Energy Agency, Tokai-mura, Japan
| | - Christine He
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Raphaël Méheust
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Brandon Brooks
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Alex Thomas
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Adi Lavy
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | | | - Christine Sun
- Department of Microbiology & Immunology, Stanford University, Stanford, CA, USA
| | | | - Mikayla A Borton
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, USA
| | - Allison Sharrar
- Earth and Planetary Science, University of California Berkeley, Berkeley, CA, USA
| | - Alexander L Jaffe
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Tara C Nelson
- Department of Civil and Mineral Engineering, University of Toronto, Toronto, Ontario, Canada
| | - Rose Kantor
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Ray Keren
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Katherine R Lane
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Ibrahim F Farag
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Shufei Lei
- Earth and Planetary Science, University of California Berkeley, Berkeley, CA, USA
| | - Kari Finstad
- Environmental Science, Policy and Management, University of California Berkeley, Berkeley, CA, USA
| | - Ronald Amundson
- Environmental Science, Policy and Management, University of California Berkeley, Berkeley, CA, USA
| | - Karthik Anantharaman
- Earth and Planetary Science, University of California Berkeley, Berkeley, CA, USA
| | | | - Alexander J Probst
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Mary E Power
- Integrative Biology, University of California Berkeley, Berkeley, CA, USA
| | | | - Wen-Jun Li
- School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - Kelly Wrighton
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, USA
| | - Sue Harrison
- Centre for Bioprocess Engineering Research, University of Cape Town, Cape Town, South Africa
| | - Michael Morowitz
- Department of Surgery, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - David A Relman
- Department of Microbiology & Immunology, Stanford University, Stanford, CA, USA
| | - Jennifer A Doudna
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Anne-Catherine Lehours
- Laboratoire Microorganismes: Génome et Environnement, Université Clermont Auvergne, CNRS, Clermont-Ferrand, France
| | - Lesley Warren
- Department of Civil and Mineral Engineering, University of Toronto, Toronto, Ontario, Canada
| | - Jamie H D Cate
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Joanne M Santini
- Institute of Structural and Molecular Biology, University College London, London, UK
| | - Jillian F Banfield
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA.
- Earth and Planetary Science, University of California Berkeley, Berkeley, CA, USA.
- Environmental Science, Policy and Management, University of California Berkeley, Berkeley, CA, USA.
- School of Earth Sciences, University of Melbourne, Melbourne, Victoria, Australia.
| |
Collapse
|
16
|
Schultz DT, Eizenga JM, Corbett-Detig RB, Francis WR, Christianson LM, Haddock SH. Conserved novel ORFs in the mitochondrial genome of the ctenophore Beroe forskalii. PeerJ 2020; 8:e8356. [PMID: 32025367 PMCID: PMC6991124 DOI: 10.7717/peerj.8356] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 12/04/2019] [Indexed: 11/20/2022] Open
Abstract
To date, five ctenophore species' mitochondrial genomes have been sequenced, and each contains open reading frames (ORFs) that if translated have no identifiable orthologs. ORFs with no identifiable orthologs are called unidentified reading frames (URFs). If truly protein-coding, ctenophore mitochondrial URFs represent a little understood path in early-diverging metazoan mitochondrial evolution and metabolism. We sequenced and annotated the mitochondrial genomes of three individuals of the beroid ctenophore Beroe forskalii and found that in addition to sharing the same canonical mitochondrial genes as other ctenophores, the B. forskalii mitochondrial genome contains two URFs. These URFs are conserved among the three individuals but not found in other sequenced species. We developed computational tools called pauvre and cuttlery to determine the likelihood that URFs are protein coding. There is evidence that the two URFs are under negative selection, and a novel Bayesian hypothesis test of trinucleotide frequency shows that the URFs are more similar to known coding genes than noncoding intergenic sequence. Protein structure and function prediction of all ctenophore URFs suggests that they all code for transmembrane transport proteins. These findings, along with the presence of URFs in other sequenced ctenophore mitochondrial genomes, suggest that ctenophores may have uncharacterized transmembrane proteins present in their mitochondria.
Collapse
Affiliation(s)
- Darrin T. Schultz
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA, USA
- Monterey Bay Aquarium Research Institute, Moss Landing, CA, USA
| | - Jordan M. Eizenga
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Russell B. Corbett-Detig
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Warren R. Francis
- Department of Biology, University of Southern Denmark, Odense, Denmark
| | | | - Steven H.D. Haddock
- Monterey Bay Aquarium Research Institute, Moss Landing, CA, USA
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA
| |
Collapse
|
17
|
Mikhailov KV, Efeykin BD, Panchin AY, Knorre DA, Logacheva MD, Penin AA, Muntyan MS, Nikitin MA, Popova OV, Zanegina ON, Vyssokikh MY, Spiridonov SE, Aleoshin VV, Panchin YV. Coding palindromes in mitochondrial genes of Nematomorpha. Nucleic Acids Res 2020; 47:6858-6870. [PMID: 31194871 PMCID: PMC6649704 DOI: 10.1093/nar/gkz517] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 05/29/2019] [Accepted: 06/01/2019] [Indexed: 12/11/2022] Open
Abstract
Inverted repeats are common DNA elements, but they rarely overlap with protein-coding sequences due to the ensuing conflict with the structure and function of the encoded protein. We discovered numerous perfect inverted repeats of considerable length (up to 284 bp) embedded within the protein-coding genes in mitochondrial genomes of four Nematomorpha species. Strikingly, both arms of the inverted repeats encode conserved regions of the amino acid sequence. We confirmed enzymatic activity of the respiratory complex I encoded by inverted repeat-containing genes. The nucleotide composition of inverted repeats suggests strong selection at the amino acid level in these regions. We conclude that the inverted repeat-containing genes are transcribed and translated into functional proteins. The survey of available mitochondrial genomes reveals that several other organisms possess similar albeit shorter embedded repeats. Mitochondrial genomes of Nematomorpha demonstrate an extraordinary evolutionary compromise where protein function and stringent secondary structure elements within the coding regions are preserved simultaneously.
Collapse
Affiliation(s)
- Kirill V Mikhailov
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Boris D Efeykin
- Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation.,Severtsov Institute of Ecology and Evolution, Moscow 119071, Russian Federation
| | - Alexander Y Panchin
- Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Dmitry A Knorre
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Institute of Molecular Medicine, Sechenov First Moscow State Medical University, Moscow 119991, Russian Federation
| | - Maria D Logacheva
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russian Federation
| | - Aleksey A Penin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Maria S Muntyan
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Mikhail A Nikitin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Olga V Popova
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Olga N Zanegina
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Mikhail Y Vyssokikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Sergei E Spiridonov
- Severtsov Institute of Ecology and Evolution, Moscow 119071, Russian Federation
| | - Vladimir V Aleoshin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Yuri V Panchin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| |
Collapse
|
18
|
Noutahi E, Calderon V, Blanchette M, El-Mabrouk N, Lang BF. Rapid Genetic Code Evolution in Green Algal Mitochondrial Genomes. Mol Biol Evol 2019; 36:766-783. [PMID: 30698742 PMCID: PMC6551751 DOI: 10.1093/molbev/msz016] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Genetic code deviations involving stop codons have been previously reported in mitochondrial genomes of several green plants (Viridiplantae), most notably chlorophyte algae (Chlorophyta). However, as changes in codon recognition from one amino acid to another are more difficult to infer, such changes might have gone unnoticed in particular lineages with high evolutionary rates that are otherwise prone to codon reassignments. To gain further insight into the evolution of the mitochondrial genetic code in green plants, we have conducted an in-depth study across mtDNAs from 51 green plants (32 chlorophytes and 19 streptophytes). Besides confirming known stop-to-sense reassignments, our study documents the first cases of sense-to-sense codon reassignments in Chlorophyta mtDNAs. In several Sphaeropleales, we report the decoding of AGG codons (normally arginine) as alanine, by tRNA(CCU) of various origins that carry the recognition signature for alanine tRNA synthetase. In Chromochloris, we identify tRNA variants decoding AGG as methionine and the synonymous codon CGG as leucine. Finally, we find strong evidence supporting the decoding of AUA codons (normally isoleucine) as methionine in Pycnococcus. Our results rely on a recently developed conceptual framework (CoreTracker) that predicts codon reassignments based on the disparity between DNA sequence (codons) and the derived protein sequence. These predictions are then validated by an evaluation of tRNA phylogeny, to identify the evolution of new tRNAs via gene duplication and loss, and structural modifications that lead to the assignment of new tRNA identities and a change in the genetic code.
Collapse
Affiliation(s)
- Emmanuel Noutahi
- Département d'Informatique et de Recherche opérationnelle (DIRO), Université de Montréal, CP 6128 succursale Centre-Ville, Montreal, QC, Canada
| | - Virginie Calderon
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada
| | - Mathieu Blanchette
- School of Computer Science, McGill University, McConnell Engineering Bldg., Montréal, QC H3A 0E9, Canada
- McGill Centre for Bioinformatics, McGill University, Montréal, QC, Canada
| | - Nadia El-Mabrouk
- Département d'Informatique et de Recherche opérationnelle (DIRO), Université de Montréal, CP 6128 succursale Centre-Ville, Montreal, QC, Canada
| | - Bernd Franz Lang
- Département de Biochimie, Centre Robert Cedergren, Université de Montréal, CP 6128 succursale Centre-Ville, Montreal, QC, Canada
| |
Collapse
|
19
|
Bachvaroff TR. A precedented nuclear genetic code with all three termination codons reassigned as sense codons in the syndinean Amoebophrya sp. ex Karlodinium veneficum. PLoS One 2019; 14:e0212912. [PMID: 30818350 PMCID: PMC6394959 DOI: 10.1371/journal.pone.0212912] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 02/12/2019] [Indexed: 02/06/2023] Open
Abstract
Amoebophrya is part of an enigmatic, diverse, and ubiquitous marine alveolate lineage known almost entirely from anonymous environmental sequencing. Two cultured Amoebophrya strains grown on core dinoflagellate hosts were used for transcriptome sequencing. BLASTx using different genetic codes suggests that Amoebophyra sp. ex Karlodinium veneficum uses the three typical stop codons (UAA, UAG, and UGA) to encode amino acids. When UAA and UAG are translated as glutamine about half of the alignments have better BLASTx scores, and when UGA is translated as tryptophan one fifth have better scores. However, the sole stop codon appears to be UGA based on conserved genes, suggesting contingent translation of UGA. Neither host sequences, nor sequences from the second strain, Amoebophrya sp. ex Akashiwo sanguinea had similar results in BLASTx searches. A genome survey of Amoebophyra sp. ex K. veneficum showed no evidence for transcript editing aside from mitochondrial transcripts. The dynein heavy chain (DHC) gene family was surveyed and of 14 transcripts only two did not use UAA, UAG, or UGA in a coding context. Overall the transcriptome displayed strong bias for A or U in third codon positions, while the tRNA genome survey showed bias against codons ending in U, particularly for amino acids with two codons ending in either C or U. Together these clues suggest contingent translation mechanisms in Amoebophyra sp. ex K. veneficum and a phylogenetically distinct instance of genetic code modification.
Collapse
Affiliation(s)
- Tsvetan R. Bachvaroff
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, Maryland, United States of America
| |
Collapse
|
20
|
Megaphages infect Prevotella and variants are widespread in gut microbiomes. Nat Microbiol 2019; 4:693-700. [PMID: 30692672 PMCID: PMC6784885 DOI: 10.1038/s41564-018-0338-9] [Citation(s) in RCA: 125] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Accepted: 12/05/2018] [Indexed: 02/06/2023]
Abstract
Bacteriophages (phages) dramatically shape microbial community composition, redistribute nutrients via host lysis and drive evolution through horizontal gene transfer. Despite their importance, much remains to be learned about phages in the human microbiome. We investigated the gut microbiomes of humans from Bangladesh and Tanzania, two African baboon social groups and Danish pigs; many of these microbiomes contain phages belonging to a clade with genomes >540 kilobases in length, the largest yet reported in the human microbiome and close to the maximum size ever reported for phages. We refer to these as Lak phages. CRISPR spacer targeting indicates that Lak phages infect bacteria of the genus Prevotella. We manually curated to completion 15 distinct Lak phage genomes recovered from metagenomes. The genomes display several interesting features, including use of an alternative genetic code, large intergenic regions that are highly expressed and up to 35 putative transfer RNAs, some of which contain enigmatic introns. Different individuals have distinct phage genotypes, and shifts in variant frequencies over consecutive sampling days reflect changes in the relative abundance of phage subpopulations. Recent homologous recombination has resulted in extensive genome admixture of nine baboon Lak phage populations. We infer that Lak phages are widespread in gut communities that contain the Prevotella species, and conclude that megaphages, with fascinating and underexplored biology, may be common but largely overlooked components of human and animal gut microbiomes.
Collapse
|
21
|
Noutahi E, Calderon V, Blanchette M, Lang FB, El-Mabrouk N. CoreTracker: accurate codon reassignment prediction, applied to mitochondrial genomes. Bioinformatics 2018; 33:3331-3339. [PMID: 28655158 DOI: 10.1093/bioinformatics/btx421] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 06/23/2017] [Indexed: 11/13/2022] Open
Abstract
Motivation Codon reassignments have been reported across all domains of life. With the increasing number of sequenced genomes, the development of systematic approaches for genetic code detection is essential for accurate downstream analyses. Three automated prediction tools exist so far: FACIL, GenDecoder and Bagheera; the last two respectively restricted to metazoan mitochondrial genomes and CUG reassignments in yeast nuclear genomes. These tools can only analyze a single genome at a time and are often not followed by a validation procedure, resulting in a high rate of false positives. Results We present CoreTracker, a new algorithm for the inference of sense-to-sense codon reassignments. CoreTracker identifies potential codon reassignments in a set of related genomes, then uses statistical evaluations and a random forest classifier to predict those that are the most likely to be correct. Predicted reassignments are then validated through a phylogeny-aware step that evaluates the impact of the new genetic code on the protein alignment. Handling simultaneously a set of genomes in a phylogenetic framework, allows tracing back the evolution of each reassignment, which provides information on its underlying mechanism. Applied to metazoan and yeast genomes, CoreTracker significantly outperforms existing methods on both precision and sensitivity. Availability and implementation CoreTracker is written in Python and available at https://github.com/UdeM-LBIT/CoreTracker. Contact mabrouk@iro.umontreal.ca. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Emmanuel Noutahi
- Département d'Informatique et de Recherche Opérationnelle (DIRO), Université de Montréal, Montréal, QC CP 6128, Canada
| | - Virginie Calderon
- Département d'Informatique et de Recherche Opérationnelle (DIRO), Université de Montréal, Montréal, QC CP 6128, Canada
| | - Mathieu Blanchette
- School of Computer Science, McGill University, McConnell Engineering Bldg., Montréal, QC H3A 0E9, Canada
| | - Franz B Lang
- Département de Biochimie, Centre Robert Cedergren, Université de Montréal, Montréal, QC CP 6128, Canada
| | - Nadia El-Mabrouk
- Département d'Informatique et de Recherche Opérationnelle (DIRO), Université de Montréal, Montréal, QC CP 6128, Canada
| |
Collapse
|
22
|
Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals. PLoS One 2016; 11:e0165072. [PMID: 27755612 PMCID: PMC5068742 DOI: 10.1371/journal.pone.0165072] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Accepted: 10/05/2016] [Indexed: 11/19/2022] Open
Abstract
Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha-an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia.
Collapse
|
23
|
Swart EC, Serra V, Petroni G, Nowacki M. Genetic Codes with No Dedicated Stop Codon: Context-Dependent Translation Termination. Cell 2016; 166:691-702. [PMID: 27426948 PMCID: PMC4967479 DOI: 10.1016/j.cell.2016.06.020] [Citation(s) in RCA: 128] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Revised: 04/19/2016] [Accepted: 06/06/2016] [Indexed: 01/13/2023]
Abstract
The prevailing view of the nuclear genetic code is that it is largely frozen and unambiguous. Flexibility in the nuclear genetic code has been demonstrated in ciliates that reassign standard stop codons to amino acids, resulting in seven variant genetic codes, including three previously undescribed ones reported here. Surprisingly, in two of these species, we find efficient translation of all 64 codons as standard amino acids and recognition of either one or all three stop codons. How, therefore, does the translation machinery interpret a “stop” codon? We provide evidence, based on ribosomal profiling and “stop” codon depletion shortly before coding sequence ends, that mRNA 3′ ends may contribute to distinguishing stop from sense in a context-dependent manner. We further propose that such context-dependent termination/readthrough suppression near transcript ends enables genetic code evolution. Alternative nuclear genetic codes continue to be discovered in ciliates Genetic codes with stops and all their codons encoding standard amino acids exist Transcript ends may distinguish stop codons as such in ambiguous genetic codes The ability to resolve genetic code ambiguity may enable genetic code evolution
Collapse
Affiliation(s)
| | - Valentina Serra
- Department of Biology, University of Pisa, Pisa 56126, Italy
| | - Giulio Petroni
- Department of Biology, University of Pisa, Pisa 56126, Italy
| | - Mariusz Nowacki
- Institute of Cell Biology, University of Bern, 3012 Bern, Switzerland.
| |
Collapse
|
24
|
Phylogenomic analysis of Candidatus 'Izimaplasma' species: free-living representatives from a Tenericutes clade found in methane seeps. ISME JOURNAL 2016; 10:2679-2692. [PMID: 27058507 DOI: 10.1038/ismej.2016.55] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2016] [Revised: 02/29/2016] [Accepted: 03/04/2016] [Indexed: 11/09/2022]
Abstract
Tenericutes are a unique class of bacteria that lack a cell wall and are typically parasites or commensals of eukaryotic hosts. Environmental 16S rDNA surveys have identified a number of tenericute clades in diverse environments, introducing the possibility that these Tenericutes may represent non-host-associated, free-living microorganisms. Metagenomic sequencing of deep-sea methane seep sediments resulted in the assembly of two genomes from a Tenericutes-affiliated clade currently known as 'NB1-n' (SILVA taxonomy) or 'RF3' (Greengenes taxonomy). Metabolic reconstruction revealed that, like cultured members of the Mollicutes, these 'NB1-n' representatives lack a tricarboxylic acid cycle and instead use anaerobic fermentation of simple sugars for substrate level phosphorylation. Notably, the genomes also contained a number of unique metabolic features including hydrogenases and a simplified electron transport chain containing an RNF complex, cytochrome bd oxidase and complex I. On the basis of the metabolic potential predicted from the annotated genomes, we devised an anaerobic enrichment media that stimulated the growth of these Tenericutes at 10 °C, resulting in a mixed culture where these organisms represented ~60% of the total cells by targeted fluorescence in situ hybridization (FISH). Visual identification by FISH confirmed these organisms were not directly associated with Eukaryotes and electron cryomicroscopy of cells in the enrichment culture confirmed an ultrastructure consistent with the defining phenotypic property of Tenericutes, with a single membrane and no cell wall. On the basis of their unique gene content, phylogenetic placement and ultrastructure, we propose these organisms represent a novel class within the Tenericutes, and suggest the names Candidatus 'Izimaplasma sp. HR1' and Candidatus 'Izimaplasma sp. HR2' for the two genome representatives.
Collapse
|
25
|
Genome-based microbial ecology of anammox granules in a full-scale wastewater treatment system. Nat Commun 2016; 7:11172. [PMID: 27029554 PMCID: PMC4821891 DOI: 10.1038/ncomms11172] [Citation(s) in RCA: 277] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Accepted: 02/26/2016] [Indexed: 02/04/2023] Open
Abstract
Partial-nitritation anammox (PNA) is a novel wastewater treatment procedure for energy-efficient ammonium removal. Here we use genome-resolved metagenomics to build a genome-based ecological model of the microbial community in a full-scale PNA reactor. Sludge from the bioreactor examined here is used to seed reactors in wastewater treatment plants around the world; however, the role of most of its microbial community in ammonium removal remains unknown. Our analysis yielded 23 near-complete draft genomes that together represent the majority of the microbial community. We assign these genomes to distinct anaerobic and aerobic microbial communities. In the aerobic community, nitrifying organisms and heterotrophs predominate. In the anaerobic community, widespread potential for partial denitrification suggests a nitrite loop increases treatment efficiency. Of our genomes, 19 have no previously cultivated or sequenced close relatives and six belong to bacterial phyla without any cultivated members, including the most complete Omnitrophica (formerly OP3) genome to date. ANaerobic AMMonium OXidation (ANAMMOX) combined with partial nitritation has been adopted for removal of ammonium from wastewater. Here, Speth et al. describe the bacterial metagenome of a partial-nitritation/anammox (PNA) reactor, and provide 23 draft genomes, 19 of which were previously uncharacterized/sequenced/cultivated.
Collapse
|
26
|
Hanke A, Hamann E, Sharma R, Geelhoed JS, Hargesheimer T, Kraft B, Meyer V, Lenk S, Osmers H, Wu R, Makinwa K, Hettich RL, Banfield JF, Tegetmeyer HE, Strous M. Recoding of the stop codon UGA to glycine by a BD1-5/SN-2 bacterium and niche partitioning between Alpha- and Gammaproteobacteria in a tidal sediment microbial community naturally selected in a laboratory chemostat. Front Microbiol 2014; 5:231. [PMID: 24904545 PMCID: PMC4032931 DOI: 10.3389/fmicb.2014.00231] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2014] [Accepted: 04/30/2014] [Indexed: 11/13/2022] Open
Abstract
Sandy coastal sediments are global hotspots for microbial mineralization of organic matter and denitrification. These sediments are characterized by advective porewater flow, tidal cycling and an active and complex microbial community. Metagenomic sequencing of microbial communities sampled from such sediments showed that potential sulfur oxidizing Gammaproteobacteria and members of the enigmatic BD1-5/SN-2 candidate phylum were abundant in situ (>10% and ~2% respectively). By mimicking the dynamic oxic/anoxic environmental conditions of the sediment in a laboratory chemostat, a simplified microbial community was selected from the more complex inoculum. Metagenomics, proteomics and fluorescence in situ hybridization showed that this simplified community contained both a potential sulfur oxidizing Gammaproteobacteria (at 24 ± 2% abundance) and a member of the BD1-5/SN-2 candidate phylum (at 7 ± 6% abundance). Despite the abundant supply of organic substrates to the chemostat, proteomic analysis suggested that the selected gammaproteobacterium grew partially autotrophically and performed hydrogen/formate oxidation. The enrichment of a member of the BD1-5/SN-2 candidate phylum enabled, for the first time, direct microscopic observation by fluorescent in situ hybridization and the experimental validation of the previously predicted translation of the stop codon UGA into glycine.
Collapse
Affiliation(s)
- Anna Hanke
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Emmo Hamann
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Ritin Sharma
- UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee Knoxville, TN, USA ; Chemical Science Division, Oak Ridge National Laboratory Oak Ridge, TN, USA
| | - Jeanine S Geelhoed
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Theresa Hargesheimer
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Beate Kraft
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Volker Meyer
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Sabine Lenk
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Harald Osmers
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany
| | - Rong Wu
- Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology Delft, Netherlands
| | - Kofi Makinwa
- Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology Delft, Netherlands
| | - Robert L Hettich
- UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee Knoxville, TN, USA ; Chemical Science Division, Oak Ridge National Laboratory Oak Ridge, TN, USA
| | - Jillian F Banfield
- Department of Earth and Planetary Science, Department of Environmental Science, Policy, and Management, University of California Berkeley, CA, USA
| | - Halina E Tegetmeyer
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany ; Center for Biotechnology, University of Bielefeld Bielefeld, Germany
| | - Marc Strous
- Microbial Fitness Group, Max Planck Institute for Marine Microbiology Bremen, Germany ; Center for Biotechnology, University of Bielefeld Bielefeld, Germany ; Department of Geoscience, University of Calgary Calgary, AB, Canada
| |
Collapse
|
27
|
Bernt M, Braband A, Middendorf M, Misof B, Rota-Stabelli O, Stadler PF. Bioinformatics methods for the comparative analysis of metazoan mitochondrial genome sequences. Mol Phylogenet Evol 2013; 69:320-7. [DOI: 10.1016/j.ympev.2012.09.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Revised: 08/31/2012] [Accepted: 09/17/2012] [Indexed: 01/25/2023]
|
28
|
Dutilh BE, Schmieder R, Nulton J, Felts B, Salamon P, Edwards RA, Mokili JL. Reference-independent comparative metagenomics using cross-assembly: crAss. ACTA ACUST UNITED AC 2012; 28:3225-31. [PMID: 23074261 PMCID: PMC3519457 DOI: 10.1093/bioinformatics/bts613] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
MOTIVATION Metagenomes are often characterized by high levels of unknown sequences. Reads derived from known microorganisms can easily be identified and analyzed using fast homology search algorithms and a suitable reference database, but the unknown sequences are often ignored in further analyses, biasing conclusions. Nevertheless, it is possible to use more data in a comparative metagenomic analysis by creating a cross-assembly of all reads, i.e. a single assembly of reads from different samples. Comparative metagenomics studies the interrelationships between metagenomes from different samples. Using an assembly algorithm is a fast and intuitive way to link (partially) homologous reads without requiring a database of reference sequences. RESULTS Here, we introduce crAss, a novel bioinformatic tool that enables fast simple analysis of cross-assembly files, yielding distances between all metagenomic sample pairs and an insightful image displaying the similarities.
Collapse
Affiliation(s)
- Bas E Dutilh
- Centre for Molecular and Biomolecular Informatics, Nijmegen Centre for Molecular Life Sciences, Radboud University Medical Centre, 6525 GA Nijmegen, The Netherlands.
| | | | | | | | | | | | | |
Collapse
|
29
|
Touw WG, Bayjanov JR, Overmars L, Backus L, Boekhorst J, Wels M, van Hijum SAFT. Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle? Brief Bioinform 2012; 14:315-26. [PMID: 22786785 PMCID: PMC3659301 DOI: 10.1093/bib/bbs034] [Citation(s) in RCA: 220] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
In the Life Sciences 'omics' data is increasingly generated by different high-throughput technologies. Often only the integration of these data allows uncovering biological insights that can be experimentally validated or mechanistically modelled, i.e. sophisticated computational approaches are required to extract the complex non-linear trends present in omics data. Classification techniques allow training a model based on variables (e.g. SNPs in genetic association studies) to separate different classes (e.g. healthy subjects versus patients). Random Forest (RF) is a versatile classification algorithm suited for the analysis of these large data sets. In the Life Sciences, RF is popular because RF classification models have a high-prediction accuracy and provide information on importance of variables for classification. For omics data, variables or conditional relations between variables are typically important for a subset of samples of the same class. For example: within a class of cancer patients certain SNP combinations may be important for a subset of patients that have a specific subtype of cancer, but not important for a different subset of patients. These conditional relationships can in principle be uncovered from the data with RF as these are implicitly taken into account by the algorithm during the creation of the classification model. This review details some of the to the best of our knowledge rarely or never used RF properties that allow maximizing the biological insights that can be extracted from complex omics data sets using RF.
Collapse
|
30
|
Duarte I, Nabuurs SB, Magno R, Huynen M. Evolution and diversification of the organellar release factor family. Mol Biol Evol 2012; 29:3497-512. [PMID: 22688947 PMCID: PMC3472500 DOI: 10.1093/molbev/mss157] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Translation termination is accomplished by proteins of the Class I release factor family (RF) that recognize stop codons and catalyze the ribosomal release of the newly synthesized peptide. Bacteria have two canonical RFs: RF1 recognizes UAA and UAG, RF2 recognizes UAA and UGA. Despite that these two release factor proteins are sufficient for de facto translation termination, the eukaryotic organellar RF protein family, which has evolved from bacterial release factors, has expanded considerably, comprising multiple subfamilies, most of which have not been functionally characterized or formally classified. Here, we integrate multiple sources of information to analyze the remarkable differentiation of the RF family among organelles. We document the origin, phylogenetic distribution and sequence structure features of the mitochondrial and plastidial release factors: mtRF1a, mtRF1, mtRF2a, mtRF2b, mtRF2c, ICT1, C12orf65, pRF1, and pRF2, and review published relevant experimental data. The canonical release factors (mtRF1a, mtRF2a, pRF1, and pRF2) and ICT1 are derived from bacterial ancestors, whereas the others have resulted from gene duplications of another release factor. These new RF family members have all lost one or more specific motifs relevant for bona fide release factor function but are mostly targeted to the same organelle as their ancestor. We also characterize the subset of canonical release factor proteins that bear nonclassical PxT/SPF tripeptide motifs and provide a molecular-model-based rationale for their retained ability to recognize stop codons. Finally, we analyze the coevolution of canonical RFs with the organellar genetic code. Although the RF presence in an organelle and its stop codon usage tend to coevolve, we find three taxa that encode an RF2 without using UGA stop codons, and one reverse scenario, where mamiellales green algae use UGA stop codons in their mitochondria without having a mitochondrial type RF2. For the latter, we put forward a “stop-codon reinvention” hypothesis that involves the retargeting of the plastid release factor to the mitochondrion.
Collapse
Affiliation(s)
- Isabel Duarte
- Centre for Molecular and Biomolecular Informatics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | | | | | | |
Collapse
|
31
|
An overlapping genetic code for frameshifted overlapping genes in Drosophila mitochondria: Antisense antitermination tRNAs UAR insert serine. J Theor Biol 2012; 298:51-76. [DOI: 10.1016/j.jtbi.2011.12.026] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2010] [Revised: 12/19/2011] [Accepted: 12/22/2011] [Indexed: 01/27/2023]
|