1
|
Jiang C, Gu S, Pan T, Wang X, Qin W, Wang G, Gao X, Zhang J, Chen K, Warren A, Xiong J, Miao W. Dynamics and timing of diversification events of ciliated eukaryotes from a large phylogenomic perspective. Mol Phylogenet Evol 2024; 197:108110. [PMID: 38768875 DOI: 10.1016/j.ympev.2024.108110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 05/17/2024] [Accepted: 05/17/2024] [Indexed: 05/22/2024]
Abstract
Ciliophora, an exceptionally diverse lineage of unicellular eukaryotes, exhibits a remarkable range of species richness across classes in the ciliate Tree of Life. In this study, we have acquired transcriptome and genome data from 40 representative species in seven ciliate classes. Utilizing 247 genes and 105 taxa, we devised a comprehensive phylogenomic tree for Ciliophora, encompassing over 60 % of orders and constituting the most extensive dataset of ciliate species to date. We established a robust phylogenetic framework that encompasses ambiguous taxa and the major classes within the phylum. Our findings support the monophyly of each of two subphyla (Postciliodesmatophora and Intramacronucleata), along with three subclades (Protocruzia, CONTHREEP, and SAPML) nested within Intramacronucleata, and elucidate evolutionary positions among the major classes within the phylum. Drawing on the robust ciliate Tree of Life and three constraints, we estimated the radiation of Ciliophora around 1175 Ma during the middle of the Proterozoic Eon, and most of the ciliate classes diverged from their sister lineage during the latter half of this period. Additionally, based on the time-calibrated tree and species richness pattern, we investigated net diversification rates of Ciliophora and its classes. The global net diversification rate for Ciliophora was estimated at 0.004979 species/Ma. Heterogeneity in net diversification rates was evident at the class level, with faster rates observed in Oligohymenophorea and Spirotrichea than other classes within the subclades CONTHREEP and SAPML, respectively. Notably, our analysis suggests that variations in net diversification rates, rather than clade ages, appear to contribute to the differences in species richness in Ciliophora at the class level.
Collapse
Affiliation(s)
- Chuanqi Jiang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Siyu Gu
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Tingting Pan
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Xueyan Wang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Weiwei Qin
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Guangying Wang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Xinxin Gao
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Jing Zhang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Kai Chen
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Alan Warren
- Department of Life Sciences, Natural History Museum, London, UK
| | - Jie Xiong
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Chinese Academy of Sciences, Wuhan, China
| | - Wei Miao
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Chinese Academy of Sciences, Wuhan, China; Hubei Hongshan Laboratory, Wuhan, China.
| |
Collapse
|
2
|
Chuang CN, Liu HC, Woo TT, Chao JL, Chen CY, Hu HT, Hsueh YP, Wang TF. Noncanonical usage of stop codons in ciliates expands proteins with structurally flexible Q-rich motifs. eLife 2024; 12:RP91405. [PMID: 38393970 PMCID: PMC10942620 DOI: 10.7554/elife.91405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2024] Open
Abstract
Serine(S)/threonine(T)-glutamine(Q) cluster domains (SCDs), polyglutamine (polyQ) tracts and polyglutamine/asparagine (polyQ/N) tracts are Q-rich motifs found in many proteins. SCDs often are intrinsically disordered regions that mediate protein phosphorylation and protein-protein interactions. PolyQ and polyQ/N tracts are structurally flexible sequences that trigger protein aggregation. We report that due to their high percentages of STQ or STQN amino acid content, four SCDs and three prion-causing Q/N-rich motifs of yeast proteins possess autonomous protein expression-enhancing activities. Since these Q-rich motifs can endow proteins with structural and functional plasticity, we suggest that they represent useful toolkits for evolutionary novelty. Comparative Gene Ontology (GO) analyses of the near-complete proteomes of 26 representative model eukaryotes reveal that Q-rich motifs prevail in proteins involved in specialized biological processes, including Saccharomyces cerevisiae RNA-mediated transposition and pseudohyphal growth, Candida albicans filamentous growth, ciliate peptidyl-glutamic acid modification and microtubule-based movement, Tetrahymena thermophila xylan catabolism and meiosis, Dictyostelium discoideum development and sexual cycles, Plasmodium falciparum infection, and the nervous systems of Drosophila melanogaster, Mus musculus and Homo sapiens. We also show that Q-rich-motif proteins are expanded massively in 10 ciliates with reassigned TAAQ and TAGQ codons. Notably, the usage frequency of CAGQ is much lower in ciliates with reassigned TAAQ and TAGQ codons than in organisms with expanded and unstable Q runs (e.g. D. melanogaster and H. sapiens), indicating that the use of noncanonical stop codons in ciliates may have coevolved with codon usage biases to avoid triplet repeat disorders mediated by CAG/GTC replication slippage.
Collapse
Affiliation(s)
| | - Hou-Cheng Liu
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
| | - Tai-Ting Woo
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
| | - Ju-Lan Chao
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
| | - Chiung-Ya Chen
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
| | - Hisao-Tang Hu
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
| | - Yi-Ping Hsueh
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
- Department of Biochemical Science and Technology, National Chiayi UniversityChiayiTaiwan
| | - Ting-Fang Wang
- Institute of Molecular Biology, Academia SinicaTaipeiTaiwan
- Department of Biochemical Science and Technology, National Chiayi UniversityChiayiTaiwan
| |
Collapse
|
3
|
Rotterová J, Pánek T, Salomaki ED, Kotyk M, Táborský P, Kolísko M, Čepička I. Single cell transcriptomics reveals UAR codon reassignment in Palmarella salina (Metopida, Armophorea) and confirms Armophorida belongs to APM clade. Mol Phylogenet Evol 2024; 191:107991. [PMID: 38092322 DOI: 10.1016/j.ympev.2023.107991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 12/04/2023] [Accepted: 12/09/2023] [Indexed: 12/17/2023]
Abstract
Anaerobes have emerged in several major lineages of ciliates, but the number of independent transitions to anaerobiosis among ciliates is unknown. The APM clade (Armophorea, Muranotrichea, Parablepharismea) represents the largest clade of obligate anaerobes among ciliates and contains free-living marine and freshwater representatives as well as gut endobionts of animals. The evolution of APM group has only recently started getting attention, and our knowledge on its phylogeny and genetics is still limited to a fraction of taxa. While ciliates portray a wide array of alternatives to the standard genetic code across numerous classes, the APM ciliates were considered to be the largest group using exclusively standard nuclear genetic code. In this study, we present a pan-ciliate phylogenomic analysis with emphasis on the APM clade, bringing the first phylogenomic analysis of the family Tropidoatractidae (Armophorea) and confirming the position of Armophorida within Armophorea. We include five newly sequenced single cell transcriptomes from marine, freshwater, and endobiotic APM ciliates - Palmarella salina, Anteclevelandella constricta, Nyctotherus sp., Caenomorpha medusula, and Thigmothrix strigosa. We report the first discovery of an alternative nuclear genetic code among APM ciliates, used by Palmarella salina (Tropidoatractidae, Armophorea), but not by its close relative, Tropidoatractus sp., and provide a comparative analysis of stop codon identity and frequency indicating the precedency to the UAG codon loss/reassignment over the UAA codon reassignment in the specific ancestor of Palmarella. Comparative genomic and proteomic studies of this group may help explain the constraints that underlie UAR stop-to-sense reassignment, the most frequent type of alternative nuclear genetic code, not only in ciliates, but eukaryotes in general.
Collapse
Affiliation(s)
- Johana Rotterová
- Department of Zoology, Faculty of Science, Charles University, Prague 128 00, Czech Republic; Department of Marine Sciences, University of Puerto Rico Mayagüez, Mayagüez, PR, USA.
| | - Tomáš Pánek
- Department of Zoology, Faculty of Science, Charles University, Prague 128 00, Czech Republic
| | - Eric D Salomaki
- Institute of Parasitology, Biology Centre Czech Academy of Sciences, České Budějovice 370 05, Czech Republic; Center for Computational Biology of Human Disease and Center for Computation and Visualization, Brown University, Providence, Rhode Island, USA
| | - Michael Kotyk
- Department of Zoology, Faculty of Science, Charles University, Prague 128 00, Czech Republic
| | - Petr Táborský
- Department of Zoology, Faculty of Science, Charles University, Prague 128 00, Czech Republic
| | - Martin Kolísko
- Institute of Parasitology, Biology Centre Czech Academy of Sciences, České Budějovice 370 05, Czech Republic
| | - Ivan Čepička
- Department of Zoology, Faculty of Science, Charles University, Prague 128 00, Czech Republic.
| |
Collapse
|
4
|
McGowan J, Kilias ES, Alacid E, Lipscombe J, Jenkins BH, Gharbi K, Kaithakottil GG, Macaulay IC, McTaggart S, Warring SD, Richards TA, Hall N, Swarbreck D. Identification of a non-canonical ciliate nuclear genetic code where UAA and UAG code for different amino acids. PLoS Genet 2023; 19:e1010913. [PMID: 37796765 PMCID: PMC10553269 DOI: 10.1371/journal.pgen.1010913] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 08/10/2023] [Indexed: 10/07/2023] Open
Abstract
The genetic code is one of the most highly conserved features across life. Only a few lineages have deviated from the "universal" genetic code. Amongst the few variants of the genetic code reported to date, the codons UAA and UAG virtually always have the same translation, suggesting that their evolution is coupled. Here, we report the genome and transcriptome sequencing of a novel uncultured ciliate, belonging to the Oligohymenophorea class, where the translation of the UAA and UAG stop codons have changed to specify different amino acids. Genomic and transcriptomic analyses revealed that UAA has been reassigned to encode lysine, while UAG has been reassigned to encode glutamic acid. We identified multiple suppressor tRNA genes with anticodons complementary to the reassigned codons. We show that the retained UGA stop codon is enriched in the 3'UTR immediately downstream of the coding region of genes, suggesting that there is functional drive to maintain tandem stop codons. Using a phylogenomics approach, we reconstructed the ciliate phylogeny and mapped genetic code changes, highlighting the remarkable number of independent genetic code changes within the Ciliophora group of protists. According to our knowledge, this is the first report of a genetic code variant where UAA and UAG encode different amino acids.
Collapse
Affiliation(s)
- Jamie McGowan
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Elisabet Alacid
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - James Lipscombe
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Karim Gharbi
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Iain C. Macaulay
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | - Seanna McTaggart
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | - Sally D. Warring
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Neil Hall
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
- School of Biological Sciences, University of East Anglia, Norwich, United Kingdom
| | - David Swarbreck
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| |
Collapse
|
5
|
Valášek LS, Kučerová M, Zeman J, Beznosková P. Cysteine tRNA acts as a stop codon readthrough-inducing tRNA in the human HEK293T cell line. RNA (NEW YORK, N.Y.) 2023; 29:1379-1387. [PMID: 37221013 PMCID: PMC10573299 DOI: 10.1261/rna.079688.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 05/12/2023] [Indexed: 05/25/2023]
Abstract
Under certain circumstances, any of the three termination codons can be read through by a near-cognate tRNA; i.e., a tRNA whose two out of three anticodon nucleotides base pair with those of the stop codon. Unless programed to synthetize C-terminally extended protein variants with expanded physiological roles, readthrough represents an undesirable translational error. On the other side of a coin, a significant number of human genetic diseases is associated with the introduction of nonsense mutations (premature termination codons [PTCs]) into coding sequences, where stopping is not desirable. Here, the tRNA's ability to induce readthrough opens up the intriguing possibility of mitigating the deleterious effects of PTCs on human health. In yeast, the UGA and UAR stop codons were described to be read through by four readthrough-inducing rti-tRNAs-tRNATrp and tRNACys, and tRNATyr and tRNAGln, respectively. The readthrough-inducing potential of tRNATrp and tRNATyr was also observed in human cell lines. Here, we investigated the readthrough-inducing potential of human tRNACys in the HEK293T cell line. The tRNACys family consists of two isoacceptors, one with ACA and the other with GCA anticodons. We selected nine representative tRNACys isodecoders (differing in primary sequence and expression level) and tested them using dual luciferase reporter assays. We found that at least two tRNACys can significantly elevate UGA readthrough when overexpressed. This indicates a mechanistically conserved nature of rti-tRNAs between yeast and human, supporting the idea that they could be used in the PTC-associated RNA therapies.
Collapse
MESH Headings
- Humans
- Codon, Terminator/genetics
- Cysteine/genetics
- Cysteine/metabolism
- HEK293 Cells
- Saccharomyces cerevisiae/genetics
- RNA, Transfer, Cys/metabolism
- RNA, Transfer, Trp/metabolism
- RNA, Transfer, Tyr
- RNA, Transfer/genetics
- RNA, Transfer/metabolism
- Anticodon
- Codon, Nonsense/genetics
- Protein Biosynthesis
Collapse
Affiliation(s)
- Leoš Shivaya Valášek
- Laboratory of Regulation of Gene Expression, Institute of Microbiology ASCR, 142 20 Prague, the Czech Republic
| | - Michaela Kučerová
- Laboratory of Regulation of Gene Expression, Institute of Microbiology ASCR, 142 20 Prague, the Czech Republic
| | - Jakub Zeman
- Laboratory of Regulation of Gene Expression, Institute of Microbiology ASCR, 142 20 Prague, the Czech Republic
| | - Petra Beznosková
- Laboratory of Regulation of Gene Expression, Institute of Microbiology ASCR, 142 20 Prague, the Czech Republic
| |
Collapse
|
6
|
Gaydukova SA, Moldovan MA, Vallesi A, Heaphy SM, Atkins JF, Gelfand MS, Baranov PV. Nontriplet feature of genetic code in Euplotes ciliates is a result of neutral evolution. Proc Natl Acad Sci U S A 2023; 120:e2221683120. [PMID: 37216548 PMCID: PMC10235951 DOI: 10.1073/pnas.2221683120] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
The triplet nature of the genetic code is considered a universal feature of known organisms. However, frequent stop codons at internal mRNA positions in Euplotes ciliates ultimately specify ribosomal frameshifting by one or two nucleotides depending on the context, thus posing a nontriplet feature of the genetic code of these organisms. Here, we sequenced transcriptomes of eight Euplotes species and assessed evolutionary patterns arising at frameshift sites. We show that frameshift sites are currently accumulating more rapidly by genetic drift than they are removed by weak selection. The time needed to reach the mutational equilibrium is several times longer than the age of Euplotes and is expected to occur after a several-fold increase in the frequency of frameshift sites. This suggests that Euplotes are at an early stage of the spread of frameshifting in expression of their genome. In addition, we find the net fitness burden of frameshift sites to be noncritical for the survival of Euplotes. Our results suggest that fundamental genome-wide changes such as a violation of the triplet character of genetic code can be introduced and maintained solely by neutral evolution.
Collapse
Affiliation(s)
- Sofya A. Gaydukova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow199911, Russia
| | - Mikhail A. Moldovan
- A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow127051, Russia
| | - Adriana Vallesi
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino62032, Italy
| | - Stephen M. Heaphy
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
| | - John F. Atkins
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
- Department of Human Genetics, University of Utah, Salt Lake City, UT84112
| | - Mikhail S. Gelfand
- A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow127051, Russia
| | - Pavel V. Baranov
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
| |
Collapse
|
7
|
Valášek LS, Lukeš J, Paris Z. Stops making sense - For the people? Clin Transl Med 2023; 13:e1270. [PMID: 37203266 PMCID: PMC10196215 DOI: 10.1002/ctm2.1270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 05/09/2023] [Indexed: 05/20/2023] Open
Affiliation(s)
| | - Julius Lukeš
- Institute of ParasitologyBiology CentreCzech Academy of SciencesČeské Budějovice (Budweis)Czech Republic
- Faculty of SciencesUniversity of South BohemiaČeské Budějovice (Budweis)Czech Republic
| | - Zdeněk Paris
- Institute of ParasitologyBiology CentreCzech Academy of SciencesČeské Budějovice (Budweis)Czech Republic
- Faculty of SciencesUniversity of South BohemiaČeské Budějovice (Budweis)Czech Republic
| |
Collapse
|
8
|
Single-Cell Genomics Reveals the Divergent Mitochondrial Genomes of Retaria (Foraminifera and Radiolaria). mBio 2023; 14:e0030223. [PMID: 36939357 PMCID: PMC10127745 DOI: 10.1128/mbio.00302-23] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2023] Open
Abstract
Mitochondria originated from an ancient bacterial endosymbiont that underwent reductive evolution by gene loss and endosymbiont gene transfer to the nuclear genome. The diversity of mitochondrial genomes published to date has revealed that gene loss and transfer processes are ongoing in many lineages. Most well-studied eukaryotic lineages are represented in mitochondrial genome databases, except for the superphylum Retaria-the lineage comprising Foraminifera and Radiolaria. Using single-cell approaches, we determined two complete mitochondrial genomes of Foraminifera and two nearly complete mitochondrial genomes of radiolarians. We report the complete coding content of an additional 14 foram species. We show that foraminiferan and radiolarian mitochondrial genomes contain a nearly fully overlapping but reduced mitochondrial gene complement compared to other sequenced rhizarians. In contrast to animals and fungi, many protists encode a diverse set of proteins on their mitochondrial genomes, including several ribosomal genes; however, some aerobic eukaryotic lineages (euglenids, myzozoans, and chlamydomonas-like algae) have reduced mitochondrial gene content and lack all ribosomal genes. Similar to these reduced outliers, we show that retarian mitochondrial genomes lack ribosomal protein and tRNA genes, contain truncated and divergent small and large rRNA genes, and contain only 14 or 15 protein-coding genes, including nad1, -3, -4, -4L, -5, and -7, cob, cox1, -2, and -3, and atp1, -6, and -9, with forams and radiolarians additionally carrying nad2 and nad6, respectively. In radiolarian mitogenomes, a noncanonical genetic code was identified in which all three stop codons encode amino acids. Collectively, these results add to our understanding of mitochondrial genome evolution and fill in one of the last major gaps in mitochondrial sequence databases. IMPORTANCE We present the reduced mitochondrial genomes of Retaria, the rhizarian lineage comprising the phyla Foraminifera and Radiolaria. By applying single-cell genomic approaches, we found that foraminiferan and radiolarian mitochondrial genomes contain an overlapping but reduced mitochondrial gene complement compared to other sequenced rhizarians. An alternative genetic code was identified in radiolarian mitogenomes in which all three stop codons encode amino acids. Collectively, these results shed light on the divergent nature of the mitochondrial genomes from an ecologically important group, warranting further questions into the biological underpinnings of gene content variability and genetic code variation between mitochondrial genomes.
Collapse
|
9
|
Pawlak K, Błażej P, Mackiewicz D, Mackiewicz P. The Influence of the Selection at the Amino Acid Level on Synonymous Codon Usage from the Viewpoint of Alternative Genetic Codes. Int J Mol Sci 2023; 24:ijms24021185. [PMID: 36674703 PMCID: PMC9866869 DOI: 10.3390/ijms24021185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/19/2022] [Accepted: 12/30/2022] [Indexed: 01/11/2023] Open
Abstract
Synonymous codon usage can be influenced by mutations and/or selection, e.g., for speed of protein translation and correct folding. However, this codon bias can also be affected by a general selection at the amino acid level due to differences in the acceptance of the loss and generation of these codons. To assess the importance of this effect, we constructed a mutation-selection model model, in which we generated almost 90,000 stationary nucleotide distributions produced by mutational processes and applied a selection based on differences in physicochemical properties of amino acids. Under these conditions, we calculated the usage of fourfold degenerated (4FD) codons and compared it with the usage characteristic of the pure mutations. We considered both the standard genetic code (SGC) and alternative genetic codes (AGCs). The analyses showed that a majority of AGCs produced a greater 4FD codon bias than the SGC. The mutations producing more thymine or adenine than guanine and cytosine increased the differences in usage. On the other hand, the mutational pressures generating a lot of cytosine or guanine with a low content of adenine and thymine decreased this bias because the nucleotide content of most 4FD codons stayed in the compositional equilibrium with these pressures. The comparison of the theoretical results with those for real protein coding sequences showed that the influence of selection at the amino acid level on the synonymous codon usage cannot be neglected. The analyses indicate that the effect of amino acid selection cannot be disregarded and that it can interfere with other selection factors influencing codon usage, especially in AT-rich genomes, in which AGCs are usually used.
Collapse
|
10
|
Dandekar T, Kunz M. How to Better Understand Signal Cascades and Measure the Encoded Information. Bioinformatics 2023. [DOI: 10.1007/978-3-662-65036-3_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023] Open
|
11
|
No stopping with a short-stem transfer RNA. Nature 2023; 613:631-632. [PMID: 36631582 DOI: 10.1038/d41586-022-04585-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
12
|
Kachale A, Pavlíková Z, Nenarokova A, Roithová A, Durante IM, Miletínová P, Záhonová K, Nenarokov S, Votýpka J, Horáková E, Ross RL, Yurchenko V, Beznosková P, Paris Z, Valášek LS, Lukeš J. Short tRNA anticodon stem and mutant eRF1 allow stop codon reassignment. Nature 2023; 613:751-758. [PMID: 36631608 DOI: 10.1038/s41586-022-05584-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 11/18/2022] [Indexed: 01/13/2023]
Abstract
Cognate tRNAs deliver specific amino acids to translating ribosomes according to the standard genetic code, and three codons with no cognate tRNAs serve as stop codons. Some protists have reassigned all stop codons as sense codons, neglecting this fundamental principle1-4. Here we analyse the in-frame stop codons in 7,259 predicted protein-coding genes of a previously undescribed trypanosomatid, Blastocrithidia nonstop. We reveal that in this species in-frame stop codons are underrepresented in genes expressed at high levels and that UAA serves as the only termination codon. Whereas new tRNAsGlu fully cognate to UAG and UAA evolved to reassign these stop codons, the UGA reassignment followed a different path through shortening the anticodon stem of tRNATrpCCA from five to four base pairs (bp). The canonical 5-bp tRNATrp recognizes UGG as dictated by the genetic code, whereas its shortened 4-bp variant incorporates tryptophan also into in-frame UGA. Mimicking this evolutionary twist by engineering both variants from B. nonstop, Trypanosoma brucei and Saccharomyces cerevisiae and expressing them in the last two species, we recorded a significantly higher readthrough for all 4-bp variants. Furthermore, a gene encoding B. nonstop release factor 1 acquired a mutation that specifically restricts UGA recognition, robustly potentiating the UGA reassignment. Virtually the same strategy has been adopted by the ciliate Condylostoma magnum. Hence, we describe a previously unknown, universal mechanism that has been exploited in unrelated eukaryotes with reassigned stop codons.
Collapse
Affiliation(s)
- Ambar Kachale
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.,Faculty of Sciences, University of South Bohemia, České Budějovice, Czech Republic
| | - Zuzana Pavlíková
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Anna Nenarokova
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.,Faculty of Sciences, University of South Bohemia, České Budějovice, Czech Republic.,School of Biological Sciences, University of Bristol, Bristol, UK
| | - Adriana Roithová
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Ignacio M Durante
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Petra Miletínová
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Kristína Záhonová
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.,Faculty of Science, Charles University, BIOCEV, Prague, Czech Republic.,Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Serafim Nenarokov
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.,Faculty of Sciences, University of South Bohemia, České Budějovice, Czech Republic
| | - Jan Votýpka
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.,Faculty of Science, Charles University, BIOCEV, Prague, Czech Republic
| | - Eva Horáková
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.,Institute of Microbiology, Czech Academy of Sciences, Třeboň, Czech Republic
| | | | - Vyacheslav Yurchenko
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Petra Beznosková
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Zdeněk Paris
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic. .,Faculty of Sciences, University of South Bohemia, České Budějovice, Czech Republic.
| | | | - Julius Lukeš
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic. .,Faculty of Sciences, University of South Bohemia, České Budějovice, Czech Republic.
| |
Collapse
|
13
|
Wang X, Dong Q, Chen G, Zhang J, Liu Y, Cai Y. Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance. BMC Genomics 2022; 23:416. [PMID: 35655139 PMCID: PMC9164415 DOI: 10.1186/s12864-022-08435-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 03/02/2022] [Indexed: 11/10/2022] Open
Abstract
Frameshift mutations have been considered of significant importance for the molecular evolution of proteins and their coding genes, while frameshift protein sequences encoded in the alternative reading frames of coding genes have been considered to be meaningless. However, functional frameshifts have been found widely existing. It was puzzling how a frameshift protein kept its structure and functionality while substantial changes occurred in its primary amino-acid sequence. This study shows that the similarities among frameshifts and wild types are higher than random similarities and are determined at different levels. Frameshift substitutions are more conservative than random substitutions in the standard genetic code (SGC). The frameshift substitutions score of SGC ranks in the top 2.0-3.5% of alternative genetic codes, showing that SGC is nearly optimal for frameshift tolerance. In many genes and certain genomes, frameshift-resistant codons and codon pairs appear more frequently than expected, suggesting that frameshift tolerance is achieved through not only the optimality of the genetic code but, more importantly, the further optimization of a specific gene or genome through the usages of codons/codon pairs, which sheds light on the role of frameshift mutations in molecular and genomic evolution.
Collapse
|
14
|
Wang Y, Yao L, Fan J, Zhao X, Zhang Q, Chen Y, Guo C. The Codon Usage Bias Analysis of Free-Living Ciliates' Macronuclear Genomes and Clustered Regularly Interspaced Short Palindromic Repeats/Cas9 Vector Construction of Stylonychia lemnae. Front Microbiol 2022; 13:785889. [PMID: 35308388 PMCID: PMC8927777 DOI: 10.3389/fmicb.2022.785889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 01/24/2022] [Indexed: 11/13/2022] Open
Abstract
Ciliates represent higher unicellular animals, and several species are also important model organisms for molecular biology research. Analyses of codon usage bias (CUB) of the macronuclear (MAC) genome in ciliates can not only promote a better understanding of the genetic mode and evolution history of these organisms but also help optimize codons to improve the gene editing efficiency of model ciliates. In this study, macronuclear genome sequences of nine free-living ciliates were analyzed with CodonW software to calculate the following indices: the guanine-cytosine content (GC); the frequency of the nucleotides U, C, A, and G at the third position of codons (U3s, C3s, A3s, G3s); the effective number of codons (ENC); the correlation between GC at the first and second positions (GC12); the frequency of the nucleotides G + C at the third position of synonymous codons (GC3s); the relative synonymous codon usage (RSCU). Parity rule 2 plot analysis, neutrality plot analysis, and correlation analysis were performed to explore the factors that influence codon preference. The results showed that the GC contents in nine ciliates' MAC genomes were lower than 50% and appeared AT-rich. The base compositions of GC12 and GC3s are markedly distinct and the codon usage pattern and evolution of ciliates are affected by genetic mutation and natural selection. According to the synonymous codon analysis, the codons of most ciliates ended with A or U and eight codons were the general optimal codons of nine ciliates. A clustered regularly interspaced short palindromic repeats/Cas9 (CRISPR/Cas9) expression vector of Stylonychia lemnae was constructed by optimizing the macronuclear genome codon and was successfully used to knock out the Adss gene. This is the first such extensive investigation of the MAC genome CUB of ciliates and the initial successful application of the CRISPR/Cas9 technique in free-living ciliates.
Collapse
Affiliation(s)
- Ying Wang
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Lin Yao
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China.,Key Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, Harbin, China
| | - Jinfeng Fan
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Xue Zhao
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Qing Zhang
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Ying Chen
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China.,School of Civil and Environmental Engineering, Harbin Institute of Technology (Shenzhen), Shenzhen, China
| | - Changhong Guo
- Key Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, Harbin, China
| |
Collapse
|
15
|
|
16
|
Shulgina Y, Eddy SR. A computational screen for alternative genetic codes in over 250,000 genomes. eLife 2021; 10:71402. [PMID: 34751130 PMCID: PMC8629427 DOI: 10.7554/elife.71402] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 10/26/2021] [Indexed: 11/25/2022] Open
Abstract
The genetic code has been proposed to be a ‘frozen accident,’ but the discovery of alternative genetic codes over the past four decades has shown that it can evolve to some degree. Since most examples were found anecdotally, it is difficult to draw general conclusions about the evolutionary trajectories of codon reassignment and why some codons are affected more frequently. To fill in the diversity of genetic codes, we developed Codetta, a computational method to predict the amino acid decoding of each codon from nucleotide sequence data. We surveyed the genetic code usage of over 250,000 bacterial and archaeal genome sequences in GenBank and discovered five new reassignments of arginine codons (AGG, CGA, and CGG), representing the first sense codon changes in bacteria. In a clade of uncultivated Bacilli, the reassignment of AGG to become the dominant methionine codon likely evolved by a change in the amino acid charging of an arginine tRNA. The reassignments of CGA and/or CGG were found in genomes with low GC content, an evolutionary force that likely helped drive these codons to low frequency and enable their reassignment. All life forms rely on a ‘code’ to translate their genetic information into proteins. This code relies on limited permutations of three nucleotides – the building blocks that form DNA and other types of genetic information. Each ‘triplet’ of nucleotides – or codon – encodes a specific amino acid, the basic component of proteins. Reading the sequence of codons in the right order will let the cell know which amino acid to assemble next on a growing protein. For instance, the codon CGG – formed of the nucleotides guanine (G) and cytosine (C) – codes for the amino acid arginine. From bacteria to humans, most life forms rely on the same genetic code. Yet certain organisms have evolved to use slightly different codes, where one or several codons have an altered meaning. To better understand how alternative genetic codes have evolved, Shulgina and Eddy set out to find more organisms featuring these altered codons, creating a new software called Codetta that can analyze the genome of a microorganism and predict the genetic code it uses. Codetta was then used to sift through the genetic information of 250,000 microorganisms. This was made possible by the sequencing, in recent years, of the genomes of hundreds of thousands of bacteria and other microorganisms – including many never studied before. These analyses revealed five groups of bacteria with alternative genetic codes, all of which had changes in the codons that code for arginine. Amongst these, four had genomes with a low proportion of guanine and cytosine nucleotides. This may have made some guanine and cytosine-rich arginine codons very rare in these organisms and, therefore, easier to be reassigned to encode another amino acid. The work by Shulgina and Eddy demonstrates that Codetta is a new, useful tool that scientists can use to understand how genetic codes evolve. In addition, it can also help to ensure the accuracy of widely used protein databases, which assume which genetic code organisms use to predict protein sequences from their genomes.
Collapse
Affiliation(s)
| | - Sean R Eddy
- Molecular & Cellular Biology, Harvard University, Cambridge, United States
| |
Collapse
|
17
|
Kim S, Yi H, Kim YT, Lee HS. Engineering Translation Components for Genetic Code Expansion. J Mol Biol 2021; 434:167302. [PMID: 34673113 DOI: 10.1016/j.jmb.2021.167302] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 09/26/2021] [Accepted: 10/05/2021] [Indexed: 12/18/2022]
Abstract
The expansion of the genetic code consisting of four bases and 20 amino acids into diverse building blocks has been an exciting topic in synthetic biology. Many biochemical components are involved in gene expression; therefore, adding a new component to the genetic code requires engineering many other components that interact with it. Genetic code expansion has advanced significantly for the last two decades with the engineering of several components involved in protein synthesis. These components include tRNA/aminoacyl-tRNA synthetase, new codons, ribosomes, and elongation factor Tu. In addition, biosynthesis and enhanced uptake of non-canonical amino acids have been attempted and have made meaningful progress. This review discusses the efforts to engineer these translation components, to improve the genetic code expansion technology.
Collapse
Affiliation(s)
- Sooin Kim
- Department of Chemistry, Sogang University, 35 Baekbeomro Mapogu, Seoul 04107, Republic of Korea
| | - Hanbin Yi
- Department of Chemistry, Sogang University, 35 Baekbeomro Mapogu, Seoul 04107, Republic of Korea
| | - Yurie T Kim
- Department of Chemistry, Sogang University, 35 Baekbeomro Mapogu, Seoul 04107, Republic of Korea
| | - Hyun Soo Lee
- Department of Chemistry, Sogang University, 35 Baekbeomro Mapogu, Seoul 04107, Republic of Korea.
| |
Collapse
|
18
|
Korostelev AA. Diversity and Similarity of Termination and Ribosome Rescue in Bacterial, Mitochondrial, and Cytoplasmic Translation. BIOCHEMISTRY (MOSCOW) 2021; 86:1107-1121. [PMID: 34565314 DOI: 10.1134/s0006297921090066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
When a ribosome encounters the stop codon of an mRNA, it terminates translation, releases the newly made protein, and is recycled to initiate translation on a new mRNA. Termination is a highly dynamic process in which release factors (RF1 and RF2 in bacteria; eRF1•eRF3•GTP in eukaryotes) coordinate peptide release with large-scale molecular rearrangements of the ribosome. Ribosomes stalled on aberrant mRNAs are rescued and recycled by diverse bacterial, mitochondrial, or cytoplasmic quality control mechanisms. These are catalyzed by rescue factors with peptidyl-tRNA hydrolase activity (bacterial ArfA•RF2 and ArfB, mitochondrial ICT1 and mtRF-R, and cytoplasmic Vms1), that are distinct from each other and from release factors. Nevertheless, recent structural studies demonstrate a remarkable similarity between translation termination and ribosome rescue mechanisms. This review describes how these pathways rely on inherent ribosome dynamics, emphasizing the active role of the ribosome in all translation steps.
Collapse
Affiliation(s)
- Andrei A Korostelev
- RNA Therapeutics Institute, Department of Biochemistry and Molecular Pharmacology, UMass Medical School, Worcester, MA, USA.
| |
Collapse
|
19
|
Mozzicafreddo M, Pucciarelli S, Swart EC, Piersanti A, Emmerich C, Migliorelli G, Ballarini P, Miceli C. The macronuclear genome of the Antarctic psychrophilic marine ciliate Euplotes focardii reveals new insights on molecular cold adaptation. Sci Rep 2021; 11:18782. [PMID: 34548559 PMCID: PMC8455672 DOI: 10.1038/s41598-021-98168-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 09/05/2021] [Indexed: 11/23/2022] Open
Abstract
The macronuclear (MAC) genomes of ciliates belonging to the genus Euplotes species are comprised of numerous small DNA molecules, nanochromosomes, each typically encoding a single gene. These genomes are responsible for all gene expression during vegetative cell growth. Here, we report the analysis of the MAC genome from the Antarctic psychrophile Euplotes focardii. Nanochromosomes containing bacterial sequences were not found, suggesting that phenomena of horizontal gene transfer did not occur recently, even though this ciliate species has a substantial associated bacterial consortium. As in other euplotid species, E. focardii MAC genes are characterized by a high frequency of translational frameshifting. Furthermore, in order to characterize differences that may be consequent to cold adaptation and defense to oxidative stress, the main constraints of the Antarctic marine microorganisms, we compared E. focardii MAC genome with those available from mesophilic Euplotes species. We focussed mainly on the comparison of tubulin, antioxidant enzymes and heat shock protein (HSP) 70 families, molecules which possess peculiar characteristic correlated with cold adaptation in E. focardii. We found that α-tubulin genes and those encoding SODs and CATs antioxidant enzymes are more numerous than in the mesophilic Euplotes species. Furthermore, the phylogenetic trees showed that these molecules are divergent in the Antarctic species. In contrast, there are fewer hsp70 genes in E. focardii compared to mesophilic Euplotes and these genes do not respond to thermal stress but only to oxidative stress. Our results suggest that molecular adaptation to cold and oxidative stress in the Antarctic environment may not only be due to particular amino acid substitutions but also due to duplication and divergence of paralogous genes.
Collapse
Affiliation(s)
- Matteo Mozzicafreddo
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032, Camerino, MC, Italy.
| | - Sandra Pucciarelli
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032, Camerino, MC, Italy
| | - Estienne C Swart
- Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Angela Piersanti
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032, Camerino, MC, Italy
| | | | - Giovanna Migliorelli
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032, Camerino, MC, Italy
| | - Patrizia Ballarini
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032, Camerino, MC, Italy
| | - Cristina Miceli
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032, Camerino, MC, Italy
| |
Collapse
|
20
|
Beznosková P, Bidou L, Namy O, Valášek LS. Increased expression of tryptophan and tyrosine tRNAs elevates stop codon readthrough of reporter systems in human cell lines. Nucleic Acids Res 2021; 49:5202-5215. [PMID: 34009360 PMCID: PMC8136774 DOI: 10.1093/nar/gkab315] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 04/13/2021] [Accepted: 04/15/2021] [Indexed: 11/25/2022] Open
Abstract
Regulation of translation via stop codon readthrough (SC-RT) expands not only tissue-specific but also viral proteomes in humans and, therefore, represents an important subject of study. Understanding this mechanism and all involved players is critical also from a point of view of prospective medical therapies of hereditary diseases caused by a premature termination codon. tRNAs were considered for a long time to be just passive players delivering amino acid residues according to the genetic code to ribosomes without any active regulatory roles. In contrast, our recent yeast work identified several endogenous tRNAs implicated in the regulation of SC-RT. Swiftly emerging studies of human tRNA-ome also advocate that tRNAs have unprecedented regulatory potential. Here, we developed a universal U6 promotor-based system expressing various human endogenous tRNA iso-decoders to study consequences of their increased dosage on SC-RT employing various reporter systems in vivo. This system combined with siRNA-mediated downregulations of selected aminoacyl-tRNA synthetases demonstrated that changing levels of human tryptophan and tyrosine tRNAs do modulate efficiency of SC-RT. Overall, our results suggest that tissue-to-tissue specific levels of selected near-cognate tRNAs may have a vital potential to fine-tune the final landscape of the human proteome, as well as that of its viral pathogens.
Collapse
Affiliation(s)
- Petra Beznosková
- Laboratory of Regulation of Gene Expression, Institute of Microbiology ASCR, Videnska 1083, 142 20 Prague, the Czech Republic
| | - Laure Bidou
- Sorbonne Universités, Paris, France.,Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Namy
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Leoš Shivaya Valášek
- Laboratory of Regulation of Gene Expression, Institute of Microbiology ASCR, Videnska 1083, 142 20 Prague, the Czech Republic
| |
Collapse
|
21
|
Bucchini F, Del Cortona A, Kreft Ł, Botzki A, Van Bel M, Vandepoele K. TRAPID 2.0: a web application for taxonomic and functional analysis of de novo transcriptomes. Nucleic Acids Res 2021; 49:e101. [PMID: 34197621 PMCID: PMC8464036 DOI: 10.1093/nar/gkab565] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 06/07/2021] [Accepted: 06/16/2021] [Indexed: 12/24/2022] Open
Abstract
Advances in high-throughput sequencing have resulted in a massive increase of RNA-Seq transcriptome data. However, the promise of rapid gene expression profiling in a specific tissue, condition, unicellular organism or microbial community comes with new computational challenges. Owing to the limited availability of well-resolved reference genomes, de novo assembled (meta)transcriptomes have emerged as popular tools for investigating the gene repertoire of previously uncharacterized organisms. Yet, despite their potential, these datasets often contain fragmented or contaminant sequences, and their analysis remains difficult. To alleviate some of these challenges, we developed TRAPID 2.0, a web application for the fast and efficient processing of assembled transcriptome data. The initial processing phase performs a global characterization of the input data, providing each transcript with several layers of annotation, comprising structural, functional, and taxonomic information. The exploratory phase enables downstream analyses from the web application. Available analyses include the assessment of gene space completeness, the functional analysis and comparison of transcript subsets, and the study of transcripts in an evolutionary context. A comparison with similar tools highlights TRAPID’s unique features. Finally, analyses performed within TRAPID 2.0 are complemented by interactive data visualizations, facilitating the extraction of new biological insights, as demonstrated with diatom community metatranscriptomes.
Collapse
Affiliation(s)
- François Bucchini
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
| | - Andrea Del Cortona
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
| | - Łukasz Kreft
- VIB Bioinformatics Core, VIB, 9052 Ghent, Belgium
| | | | - Michiel Van Bel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
| |
Collapse
|
22
|
Atkins JF, O’Connor KM, Bhatt PR, Loughran G. From Recoding to Peptides for MHC Class I Immune Display: Enriching Viral Expression, Virus Vulnerability and Virus Evasion. Viruses 2021; 13:1251. [PMID: 34199077 PMCID: PMC8310308 DOI: 10.3390/v13071251] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 06/11/2021] [Accepted: 06/19/2021] [Indexed: 01/02/2023] Open
Abstract
Many viruses, especially RNA viruses, utilize programmed ribosomal frameshifting and/or stop codon readthrough in their expression, and in the decoding of a few a UGA is dynamically redefined to specify selenocysteine. This recoding can effectively increase viral coding capacity and generate a set ratio of products with the same N-terminal domain(s) but different C-terminal domains. Recoding can also be regulatory or generate a product with the non-universal 21st directly encoded amino acid. Selection for translation speed in the expression of many viruses at the expense of fidelity creates host immune defensive opportunities. In contrast to host opportunism, certain viruses, including some persistent viruses, utilize recoding or adventitious frameshifting as part of their strategy to evade an immune response or specific drugs. Several instances of recoding in small intensively studied viruses escaped detection for many years and their identification resolved dilemmas. The fundamental importance of ribosome ratcheting is consistent with the initial strong view of invariant triplet decoding which however did not foresee the possibility of transitory anticodon:codon dissociation. Deep level dynamics and structural understanding of recoding is underway, and a high level structure relevant to the frameshifting required for expression of the SARS CoV-2 genome has just been determined.
Collapse
Affiliation(s)
- John F. Atkins
- Schools of Biochemistry and Microbiology, University College Cork, T12 XF62 Cork, Ireland; (K.M.O.); (P.R.B.); (G.L.)
| | - Kate M. O’Connor
- Schools of Biochemistry and Microbiology, University College Cork, T12 XF62 Cork, Ireland; (K.M.O.); (P.R.B.); (G.L.)
| | - Pramod R. Bhatt
- Schools of Biochemistry and Microbiology, University College Cork, T12 XF62 Cork, Ireland; (K.M.O.); (P.R.B.); (G.L.)
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, 8093 Zurich, Switzerland
| | - Gary Loughran
- Schools of Biochemistry and Microbiology, University College Cork, T12 XF62 Cork, Ireland; (K.M.O.); (P.R.B.); (G.L.)
| |
Collapse
|
23
|
Tissue-specific dynamic codon redefinition in Drosophila. Proc Natl Acad Sci U S A 2021; 118:2012793118. [PMID: 33500350 DOI: 10.1073/pnas.2012793118] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Translational stop codon readthrough occurs in organisms ranging from viruses to mammals and is especially prevalent in decoding Drosophila and viral mRNAs. Recoding of UGA, UAG, or UAA to specify an amino acid allows a proportion of the protein encoded by a single gene to be C-terminally extended. The extended product from Drosophila kelch mRNA is 160 kDa, whereas unextended Kelch protein, a subunit of a Cullin3-RING ubiquitin ligase, is 76 kDa. Previously we reported tissue-specific regulation of readthrough of the first kelch stop codon. Here, we characterize major efficiency differences in a variety of cell types. Immunoblotting revealed low levels of readthrough in malpighian tubules, ovary, and testis but abundant readthrough product in lysates of larval and adult central nervous system (CNS) tissue. Reporters of readthrough demonstrated greater than 30% readthrough in adult brains, and imaging in larval and adult brains showed that readthrough occurred in neurons but not glia. The extent of readthrough stimulatory sequences flanking the readthrough stop codon was assessed in transgenic Drosophila and in human tissue culture cells where inefficient readthrough occurs. A 99-nucleotide sequence with potential to form an mRNA stem-loop 3' of the readthrough stop codon stimulated readthrough efficiency. However, even with just six nucleotides of kelch mRNA sequence 3' of the stop codon, readthrough efficiency only dropped to 6% in adult neurons. Finally, we show that high-efficiency readthrough in the Drosophila CNS is common; for many neuronal proteins, C-terminal extended forms of individual proteins are likely relatively abundant.
Collapse
|
24
|
Biodiversity-based development and evolution: the emerging research systems in model and non-model organisms. SCIENCE CHINA-LIFE SCIENCES 2021; 64:1236-1280. [PMID: 33893979 DOI: 10.1007/s11427-020-1915-y] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/25/2020] [Accepted: 03/16/2021] [Indexed: 02/07/2023]
Abstract
Evolutionary developmental biology, or Evo-Devo for short, has become an established field that, broadly speaking, seeks to understand how changes in development drive major transitions and innovation in organismal evolution. It does so via integrating the principles and methods of many subdisciplines of biology. Although we have gained unprecedented knowledge from the studies on model organisms in the past decades, many fundamental and crucially essential processes remain a mystery. Considering the tremendous biodiversity of our planet, the current model organisms seem insufficient for us to understand the evolutionary and physiological processes of life and its adaptation to exterior environments. The currently increasing genomic data and the recently available gene-editing tools make it possible to extend our studies to non-model organisms. In this review, we review the recent work on the regulatory signaling of developmental and regeneration processes, environmental adaptation, and evolutionary mechanisms using both the existing model animals such as zebrafish and Drosophila, and the emerging nonstandard model organisms including amphioxus, ascidian, ciliates, single-celled phytoplankton, and marine nematode. In addition, the challenging questions and new directions in these systems are outlined as well.
Collapse
|
25
|
Limits to the cellular control of sequestered cryptophyte prey in the marine ciliate Mesodinium rubrum. THE ISME JOURNAL 2021; 15:1056-1072. [PMID: 33230263 PMCID: PMC8115319 DOI: 10.1038/s41396-020-00830-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 10/28/2020] [Accepted: 11/02/2020] [Indexed: 01/29/2023]
Abstract
The marine ciliate Mesodinium rubrum is famous for its ability to acquire and exploit chloroplasts and other cell organelles from some cryptophyte algal species. We sequenced genomes and transcriptomes of free-swimming Teleaulax amphioxeia, as well as well-fed and starved M. rubrum in order to understand cellular processes upon sequestration under different prey and light conditions. From its prey, the ciliate acquires the ability to photosynthesize as well as the potential to metabolize several essential compounds including lysine, glycan, and vitamins that elucidate its specific prey dependency. M. rubrum does not express photosynthesis-related genes itself, but elicits considerable transcriptional control of the acquired cryptophyte organelles. This control is limited as light-dependent transcriptional changes found in free-swimming T. amphioxeia got lost after sequestration. We found strong transcriptional rewiring of the cryptophyte nucleus upon sequestration, where 35% of the T. amphioxeia genes were significantly differentially expressed within well-fed M. rubrum. Qualitatively, 68% of all genes expressed within well-fed M. rubrum originated from T. amphioxeia. Quantitatively, these genes contributed up to 48% to the global transcriptome in well-fed M. rubrum and down to 11% in starved M. rubrum. This tertiary endosymbiosis system functions for several weeks, when deprived of prey. After this point in time, the ciliate dies if not supplied with fresh prey cells. M. rubrum represents one evolutionary way of acquiring photosystems from its algal prey, and might represent a step on the evolutionary way towards a permanent tertiary endosymbiosis.
Collapse
|
26
|
Zhang H, Wang Y, Wu X, Tang X, Wu C, Lu J. Determinants of genome-wide distribution and evolution of uORFs in eukaryotes. Nat Commun 2021; 12:1076. [PMID: 33597535 PMCID: PMC7889888 DOI: 10.1038/s41467-021-21394-y] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Accepted: 01/20/2021] [Indexed: 01/02/2023] Open
Abstract
Upstream open reading frames (uORFs) play widespread regulatory functions in modulating mRNA translation in eukaryotes, but the principles underlying the genomic distribution and evolution of uORFs remain poorly understood. Here, we analyze ~17 million putative canonical uORFs in 478 eukaryotic species that span most of the extant taxa of eukaryotes. We demonstrate how positive and purifying selection, coupled with differences in effective population size (Ne), has shaped the contents of uORFs in eukaryotes. Besides, gene expression level is important in influencing uORF occurrences across genes in a species. Our analyses suggest that most uORFs might play regulatory roles rather than encode functional peptides. We also show that the Kozak sequence context of uORFs has evolved across eukaryotic clades, and that noncanonical uORFs tend to have weaker suppressive effects than canonical uORFs in translation regulation. This study provides insights into the driving forces underlying uORF evolution in eukaryotes.
Collapse
Affiliation(s)
- Hong Zhang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Yirong Wang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- College of Biology, Hunan University, Changsha, China
| | - Xinkai Wu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Xiaolu Tang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Changcheng Wu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China.
| |
Collapse
|
27
|
Singh T, Yadav SK, Vainstein A, Kumar V. Genome recoding strategies to improve cellular properties: mechanisms and advances. ABIOTECH 2021; 2:79-95. [PMID: 34377578 PMCID: PMC7675020 DOI: 10.1007/s42994-020-00030-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 10/07/2020] [Indexed: 11/10/2022]
Abstract
The genetic code, once believed to be universal and immutable, is now known to contain many variations and is not quite universal. The basis for genome recoding strategy is genetic code variation that can be harnessed to improve cellular properties. Thus, genome recoding is a promising strategy for the enhancement of genome flexibility, allowing for novel functions that are not commonly documented in the organism in its natural environment. Here, the basic concept of genetic code and associated mechanisms for the generation of genetic codon variants, including biased codon usage, codon reassignment, and ambiguous decoding, are extensively discussed. Knowledge of the concept of natural genetic code expansion is also detailed. The generation of recoded organisms and associated mechanisms with basic targeting components, including aminoacyl-tRNA synthetase-tRNA pairs, elongation factor EF-Tu and ribosomes, are highlighted for a comprehensive understanding of this concept. The research associated with the generation of diverse recoded organisms is also discussed. The success of genome recoding in diverse multicellular organisms offers a platform for expanding protein chemistry at the biochemical level with non-canonical amino acids, genetically isolating the synthetic organisms from the natural ones, and fighting viruses, including SARS-CoV2, through the creation of attenuated viruses. In conclusion, genome recoding can offer diverse applications for improving cellular properties in the genome-recoded organisms.
Collapse
Affiliation(s)
- Tanya Singh
- Department of Botany, School of Basic Sciences, Central University of Punjab, Bathinda, 151001 India
| | | | - Alexander Vainstein
- Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Vinay Kumar
- Department of Botany, School of Basic Sciences, Central University of Punjab, Bathinda, 151001 India
| |
Collapse
|
28
|
Rzeszutek I, Maurer-Alcalá XX, Nowacki M. Programmed genome rearrangements in ciliates. Cell Mol Life Sci 2020; 77:4615-4629. [PMID: 32462406 PMCID: PMC7599177 DOI: 10.1007/s00018-020-03555-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 05/11/2020] [Accepted: 05/15/2020] [Indexed: 12/14/2022]
Abstract
Ciliates are a highly divergent group of unicellular eukaryotes with separate somatic and germline genomes found in distinct dimorphic nuclei. This characteristic feature is tightly linked to extremely laborious developmentally regulated genome rearrangements in the development of a new somatic genome/nuclei following sex. The transformation from germline to soma genome involves massive DNA elimination mediated by non-coding RNAs, chromosome fragmentation, as well as DNA amplification. In this review, we discuss the similarities and differences in the genome reorganization processes of the model ciliates Paramecium and Tetrahymena (class Oligohymenophorea), and the distantly related Euplotes, Stylonychia, and Oxytricha (class Spirotrichea).
Collapse
Affiliation(s)
- Iwona Rzeszutek
- Institute of Biology and Biotechnology, Department of Biotechnology, University of Rzeszow, Pigonia 1, 35-310, Rzeszow, Poland.
| | - Xyrus X Maurer-Alcalá
- Institute of Cell Biology, University of Bern, Baltzerstrasse 4, 3012, Bern, Switzerland
| | - Mariusz Nowacki
- Institute of Cell Biology, University of Bern, Baltzerstrasse 4, 3012, Bern, Switzerland.
| |
Collapse
|
29
|
Poly(A)-Binding Protein Regulates the Efficiency of Translation Termination. Cell Rep 2020; 33:108399. [PMID: 33207198 DOI: 10.1016/j.celrep.2020.108399] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 09/01/2020] [Accepted: 10/27/2020] [Indexed: 11/21/2022] Open
Abstract
Multiple factors influence translation termination efficiency, including nonsense codon identity and immediate context. To determine whether the relative position of a nonsense codon within an open reading frame (ORF) influences termination efficiency, we quantitate the production of prematurely terminated and/or readthrough polypeptides from 26 nonsense alleles of 3 genes expressed in yeast. The accumulation of premature termination products and the extent of readthrough for the respective premature termination codons (PTCs) manifest a marked dependence on PTC proximity to the mRNA 3' end. Premature termination products increase in relative abundance, whereas readthrough efficiencies decrease progressively across different ORFs, and readthrough efficiencies for a PTC increase in response to 3' UTR lengthening. These effects are eliminated and overall translation termination efficiency decreases considerably in cells harboring pab1 mutations. Our results support a critical role for poly(A)-binding protein in the regulation of translation termination and also suggest that inefficient termination is a trigger for nonsense-mediated mRNA decay (NMD).
Collapse
|
30
|
Smith SA, Maurer-Alcalá XX, Yan Y, Katz LA, Santoferrara LF, McManus GB. Combined Genome and Transcriptome Analyses of the Ciliate Schmidingerella arcuata (Spirotrichea) Reveal Patterns of DNA Elimination, Scrambling, and Inversion. Genome Biol Evol 2020; 12:1616-1622. [PMID: 32870974 PMCID: PMC7523726 DOI: 10.1093/gbe/evaa185] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/24/2020] [Indexed: 12/04/2022] Open
Abstract
Schmidingerella arcuata is an ecologically important tintinnid ciliate that has long served as a model species in plankton trophic ecology. We present a partial micronuclear genome and macronuclear transcriptome resource for S. arcuata, acquired using single-cell techniques, and we report on pilot analyses including functional annotation and genome architecture. Our analysis shows major fragmentation, elimination, and scrambling in the micronuclear genome of S. arcuata. This work introduces a new nonmodel genome resource for the study of ciliate ecology and genomic biology and provides a detailed functional counterpart to ecological research on S. arcuata.
Collapse
Affiliation(s)
- Susan A Smith
- Department of Marine Sciences, University of Connecticut, Groton
| | | | - Ying Yan
- Department of Biological Sciences, Smith College, Northampton, Massachusetts
| | - Laura A Katz
- Department of Biological Sciences, Smith College, Northampton, Massachusetts
| | - Luciana F Santoferrara
- Department of Marine Sciences, University of Connecticut, Groton.,Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs
| | - George B McManus
- Department of Marine Sciences, University of Connecticut, Groton
| |
Collapse
|
31
|
Abstract
Messenger RNAs (mRNAs) consist of a coding region (open reading frame (ORF)) and two untranslated regions (UTRs), 5'UTR and 3'UTR. Ribosomes travel along the coding region, translating nucleotide triplets (called codons) to a chain of amino acids. The coding region was long believed to mainly encode the amino acid content of proteins, whereas regulatory signals reside in the UTRs and in other genomic regions. However, in recent years we have learned that the ORF is expansively populated with various regulatory signals, or codes, which are related to all gene expression steps and additional intracellular aspects. In this paper, we review the current knowledge related to overlapping codes inside the coding regions, such as the influence of synonymous codon usage on translation speed (and, in turn, the effect of translation speed on protein folding), ribosomal frameshifting, mRNA stability, methylation, splicing, transcription and more. All these codes come together and overlap in the ORF sequence, ensuring production of the right protein at the right time.
Collapse
Affiliation(s)
- Shaked Bergman
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv, Israel
| | | |
Collapse
|
32
|
Žihala D, Eliáš M. Evolution and Unprecedented Variants of the Mitochondrial Genetic Code in a Lineage of Green Algae. Genome Biol Evol 2020; 11:2992-3007. [PMID: 31617565 PMCID: PMC6821328 DOI: 10.1093/gbe/evz210] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2019] [Indexed: 12/15/2022] Open
Abstract
Mitochondria of diverse eukaryotes have evolved various departures from the standard genetic code, but the breadth of possible modifications and their phylogenetic distribution are known only incompletely. Furthermore, it is possible that some codon reassignments in previously sequenced mitogenomes have been missed, resulting in inaccurate protein sequences in databases. Here we show, considering the distribution of codons at conserved amino acid positions in mitogenome-encoded proteins, that mitochondria of the green algal order Sphaeropleales exhibit a diversity of codon reassignments, including previously missed ones and some that are unprecedented in any translation system examined so far, necessitating redefinition of existing translation tables and creating at least seven new ones. We resolve a previous controversy concerning the meaning the UAG codon in Hydrodictyaceae, which beyond any doubt encodes alanine. We further demonstrate that AGG, sometimes together with AGA, encodes alanine instead of arginine in diverse sphaeroplealeans. Further newly detected changes include Arg-to-Met reassignment of the AGG codon and Arg-to-Leu reassignment of the CGG codon in particular species. Analysis of tRNAs specified by sphaeroplealean mitogenomes provides direct support for and molecular underpinning of the proposed reassignments. Furthermore, we point to unique mutations in the mitochondrial release factor mtRF1a that correlate with changes in the use of termination codons in Sphaeropleales, including the two independent stop-to-sense UAG reassignments, the reintroduction of UGA in some Scenedesmaceae, and the sense-to-stop reassignment of UCA widespread in the group. Codon disappearance seems to be the main drive of the dynamic evolution of the mitochondrial genetic code in Sphaeropleales.
Collapse
Affiliation(s)
- David Žihala
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Czech Republic.,Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Czech Republic
| | - Marek Eliáš
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Czech Republic.,Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Czech Republic
| |
Collapse
|
33
|
Cerón-Romero MA, Maurer-Alcalá XX, Grattepanche JD, Yan Y, Fonseca MM, Katz LA. PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes. Mol Biol Evol 2020; 36:1831-1842. [PMID: 31062861 PMCID: PMC6657734 DOI: 10.1093/molbev/msz103] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Estimating multiple sequence alignments (MSAs) and inferring phylogenies are essential for many aspects of comparative biology. Yet, many bioinformatics tools for such analyses have focused on specific clades, with greatest attention paid to plants, animals, and fungi. The rapid increase in high-throughput sequencing (HTS) data from diverse lineages now provides opportunities to estimate evolutionary relationships and gene family evolution across the eukaryotic tree of life. At the same time, these types of data are known to be error-prone (e.g., substitutions, contamination). To address these opportunities and challenges, we have refined a phylogenomic pipeline, now named PhyloToL, to allow easy incorporation of data from HTS studies, to automate production of both MSAs and gene trees, and to identify and remove contaminants. PhyloToL is designed for phylogenomic analyses of diverse lineages across the tree of life (i.e., at scales of >100 My). We demonstrate the power of PhyloToL by assessing stop codon usage in Ciliophora, identifying contamination in a taxon- and gene-rich database and exploring the evolutionary history of chromosomes in the kinetoplastid parasite Trypanosoma brucei, the causative agent of African sleeping sickness. Benchmarking PhyloToL’s homology assessment against that of OrthoMCL and a published paper on superfamilies of bacterial and eukaryotic organellar outer membrane pore-forming proteins demonstrates the power of our approach for determining gene family membership and inferring gene trees. PhyloToL is highly flexible and allows users to easily explore HTS data, test hypotheses about phylogeny and gene family evolution and combine outputs with third-party tools (e.g., PhyloChromoMap, iGTP).
Collapse
Affiliation(s)
- Mario A Cerón-Romero
- Department of Biological Sciences, Smith College, Northampton, MA.,Program in Organismic and Evolutionary Biology, University of Massachusetts Amherst, Amherst, MA
| | - Xyrus X Maurer-Alcalá
- Department of Biological Sciences, Smith College, Northampton, MA.,Program in Organismic and Evolutionary Biology, University of Massachusetts Amherst, Amherst, MA.,Institute of Cell Biology, University of Bern, Bern, Switzerland
| | - Jean-David Grattepanche
- Department of Biological Sciences, Smith College, Northampton, MA.,Biology Department, Temple University, Philadelphia, PA
| | - Ying Yan
- Department of Biological Sciences, Smith College, Northampton, MA
| | - Miguel M Fonseca
- CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - L A Katz
- Department of Biological Sciences, Smith College, Northampton, MA.,Program in Organismic and Evolutionary Biology, University of Massachusetts Amherst, Amherst, MA
| |
Collapse
|
34
|
Wangen JR, Green R. Stop codon context influences genome-wide stimulation of termination codon readthrough by aminoglycosides. eLife 2020; 9:52611. [PMID: 31971508 PMCID: PMC7089771 DOI: 10.7554/elife.52611] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 01/22/2020] [Indexed: 12/14/2022] Open
Abstract
Stop codon readthrough (SCR) occurs when the ribosome miscodes at a stop codon. Such readthrough events can be therapeutically desirable when a premature termination codon (PTC) is found in a critical gene. To study SCR in vivo in a genome-wide manner, we treated mammalian cells with aminoglycosides and performed ribosome profiling. We find that in addition to stimulating readthrough of PTCs, aminoglycosides stimulate readthrough of normal termination codons (NTCs) genome-wide. Stop codon identity, the nucleotide following the stop codon, and the surrounding mRNA sequence context all influence the likelihood of SCR. In comparison to NTCs, downstream stop codons in 3′UTRs are recognized less efficiently by ribosomes, suggesting that targeting of critical stop codons for readthrough may be achievable without general disruption of translation termination. Finally, we find that G418-induced miscoding alters gene expression with substantial effects on translation of histone genes, selenoprotein genes, and S-adenosylmethionine decarboxylase (AMD1). Many genes provide a set of instructions needed to build a protein, which are read by structures called ribosomes through a process called translation. The genetic information contains a short, coded instruction called a stop codon which marks the end of the protein. When a ribosome finds a stop codon it should stop building and release the protein it has made. Ribosomes do not always stop at stop codons. Certain chemicals can actually prevent ribosomes from detecting stop codons correctly, and aminoglycosides are drugs that have exactly this effect. Aminoglycosides can be used as antibiotics at low doses because they interfere with ribosomes in bacteria, but at higher doses they can also prevent ribosomes from detecting stop codons in human cells. When ribosomes do not stop at a stop codon this is called readthrough. There are different types of stop codons and some are naturally more effective at stopping ribosomes than others. Wangen and Green have now examined the effect of an aminoglycoside called G418 on ribosomes in human cells grown in the laboratory. The results showed how ribosomes interacted with genetic information and revealed that certain stop codons are more affected by G418 than others. The stop codon and other genetic sequences around it affect the likelihood of readthrough. Wangen and Green also showed that sequences that encourage translation to stop are more common in the area around stop codons. These findings highlight an evolutionary pressure driving more genes to develop strong stop codons that resist readthrough. Despite this, some are still more affected by drugs like G418 than others. Some genetic conditions, like cystic fibrosis, result from incorrect stop codons in genes. Drugs that promote readthrough specifically in these genes could be useful new treatments.
Collapse
Affiliation(s)
- Jamie R Wangen
- Department of Molecular Biology and Genetics, Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, United States
| | - Rachel Green
- Department of Molecular Biology and Genetics, Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, United States
| |
Collapse
|
35
|
Yan Y, Maurer-Alcalá XX, Knight R, Kosakovsky Pond SL, Katz LA. Single-Cell Transcriptomics Reveal a Correlation between Genome Architecture and Gene Family Evolution in Ciliates. mBio 2019; 10:e02524-19. [PMID: 31874915 PMCID: PMC6935857 DOI: 10.1128/mbio.02524-19] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 10/30/2019] [Indexed: 12/17/2022] Open
Abstract
Ciliates, a eukaryotic clade that is over 1 billion years old, are defined by division of genome function between transcriptionally inactive germline micronuclei and functional somatic macronuclei. To date, most analyses of gene family evolution have been limited to cultivable model lineages (e.g., Tetrahymena, Paramecium, Oxytricha, and Stylonychia). Here, we focus on the uncultivable Karyorelictea and its understudied sister class Heterotrichea, which represent two extremes in genome architecture. Somatic macronuclei within the Karyorelictea are described as nearly diploid, while the Heterotrichea have hyperpolyploid somatic genomes. Previous analyses indicate that genome architecture impacts ciliate gene family evolution as the most diverse and largest gene families are found in lineages with extensively processed somatic genomes (i.e., possessing thousands of gene-sized chromosomes). To further assess ciliate gene family evolution, we analyzed 43 single-cell transcriptomes from 33 ciliate species representing 10 classes. Focusing on conserved eukaryotic genes, we use estimates of transcript diversity as a proxy for the number of paralogs in gene families among four focal clades: Karyorelictea, Heterotrichea, extensive fragmenters (with gene-size somatic chromosomes), and non-extensive fragmenters (with more traditional somatic chromosomes), the latter two within the subphylum Intramacronucleata. Our results show that (i) the Karyorelictea have the lowest average transcript diversity, while Heterotrichea are highest among the four groups; (ii) proteins in Karyorelictea are under the highest functional constraints, and the patterns of selection in ciliates may reflect genome architecture; and (iii) stop codon reassignments vary among members of the Heterotrichea and Spirotrichea but are conserved in other classes.IMPORTANCE To further our understanding of genome evolution in eukaryotes, we assess the relationship between patterns of molecular evolution within gene families and variable genome structures found among ciliates. We combine single-cell transcriptomics with bioinformatic tools, focusing on understudied and uncultivable lineages selected from across the ciliate tree of life. Our analyses show that genome architecture correlates with patterns of protein evolution as lineages with more canonical somatic genomes, such as the class Karyorelictea, have more conserved patterns of molecular evolution compared to other classes. This study showcases the power of single-cell transcriptomics for investigating genome architecture and evolution in uncultivable microbial lineages and provides transcriptomic resources for further research on genome evolution.
Collapse
Affiliation(s)
- Ying Yan
- Smith College, Department of Biological Sciences, Northampton, Massachusetts, USA
| | - Xyrus X Maurer-Alcalá
- Smith College, Department of Biological Sciences, Northampton, Massachusetts, USA
- University of Massachusetts Amherst, Program in Organismic and Evolutionary Biology, Amherst, Massachusetts, USA
| | - Rob Knight
- University of California San Diego, Department of Pediatrics, San Diego, California, USA
- University of California San Diego, Department of Computer Science and Engineering, San Diego, California, USA
- University of California San Diego, Center for Microbiome Innovation, San Diego, California, USA
| | - Sergei L Kosakovsky Pond
- Temple University, Institute for Genomics and Evolutionary Medicine, Philadelphia, Pennsylvania, USA
| | - Laura A Katz
- Smith College, Department of Biological Sciences, Northampton, Massachusetts, USA
- University of Massachusetts Amherst, Program in Organismic and Evolutionary Biology, Amherst, Massachusetts, USA
| |
Collapse
|
36
|
Pan B, Chen X, Hou L, Zhang Q, Qu Z, Warren A, Miao M. Comparative Genomics Analysis of Ciliates Provides Insights on the Evolutionary History Within "Nassophorea-Synhymenia-Phyllopharyngea" Assemblage. Front Microbiol 2019; 10:2819. [PMID: 31921016 PMCID: PMC6920121 DOI: 10.3389/fmicb.2019.02819] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 11/20/2019] [Indexed: 11/13/2022] Open
Abstract
Ciliated protists (ciliates) are widely used for investigating evolution, mostly due to their successful radiation after their early evolutionary branching. In this study, we employed high-throughput sequencing technology to reveal the phylogenetic position of Synhymenia, as well as two classes Nassophorea and Phyllopharyngea, which have been a long-standing puzzle in the field of ciliate systematics and evolution. We obtained genomic and transcriptomic data from single cells of one synhymenian (Chilodontopsis depressa) and six other species of phyllopharyngeans (Chilodochona sp., Dysteria derouxi, Hartmannula sinica, Trithigmostoma cucullulus, Trochilia petrani, and Trochilia sp.). Phylogenomic analysis based on 157 orthologous genes comprising 173,835 amino acid residues revealed the affiliation of C. depressa within the class Phyllopharyngea, and the monophyly of Nassophorea, which strongly support the assignment of Synhymenia as a subclass within the class Phyllopharyngea. Comparative genomic analyses further revealed that C. depressa shares more orthologous genes with the class Nassophorea than with Phyllopharyngea, and the stop codon usage in C. depressa resembles that of Phyllopharyngea. Functional enrichment analysis demonstrated that biological pathways in C. depressa are more similar to Phyllopharyngea than Nassophorea. These results suggest that genomic and transcriptomic data can be used to provide insights into the evolutionary relationships within the "Nassophorea-Synhymenia-Phyllopharyngea" assemblage.
Collapse
Affiliation(s)
- Bo Pan
- Institute of Evolution and Marine Biodiversity, Ocean University of China, Qingdao, China.,Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Xiao Chen
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States
| | - Lina Hou
- Savaid Medical School, University of Chinese Academy of Sciences, Beijing, China
| | - Qianqian Zhang
- Yantai Institute of Coastal Zone Research, Chinese Academy of Sciences, Yantai, China
| | - Zhishuai Qu
- Institute of Evolution and Marine Biodiversity, Ocean University of China, Qingdao, China.,Ecology Group, Technical University of Kaiserslautern, Kaiserslautern, Germany
| | - Alan Warren
- Department of Life Sciences, Natural History Museum, London, United Kingdom
| | - Miao Miao
- Savaid Medical School, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
37
|
Johnson LK, Alexander H, Brown CT. Re-assembly, quality evaluation, and annotation of 678 microbial eukaryotic reference transcriptomes. Gigascience 2019; 8:5241890. [PMID: 30544207 PMCID: PMC6481552 DOI: 10.1093/gigascience/giy158] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Revised: 09/18/2018] [Accepted: 11/29/2018] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND De novo transcriptome assemblies are required prior to analyzing RNA sequencing data from a species without an existing reference genome or transcriptome. Despite the prevalence of transcriptomic studies, the effects of using different workflows, or "pipelines," on the resulting assemblies are poorly understood. Here, a pipeline was programmatically automated and used to assemble and annotate raw transcriptomic short-read data collected as part of the Marine Microbial Eukaryotic Transcriptome Sequencing Project. The resulting transcriptome assemblies were evaluated and compared against assemblies that were previously generated with a different pipeline developed by the National Center for Genome Research. RESULTS New transcriptome assemblies contained the majority of previous contigs as well as new content. On average, 7.8% of the annotated contigs in the new assemblies were novel gene names not found in the previous assemblies. Taxonomic trends were observed in the assembly metrics. Assemblies from the Dinoflagellata showed a higher number of contigs and unique k-mers than transcriptomes from other phyla, while assemblies from Ciliophora had a lower percentage of open reading frames compared to other phyla. CONCLUSIONS Given current bioinformatics approaches, there is no single "best" reference transcriptome for a particular set of raw data. As the optimum transcriptome is a moving target, improving (or not) with new tools and approaches, automated and programmable pipelines are invaluable for managing the computationally intensive tasks required for re-processing large sets of samples with revised pipelines and ensuring a common evaluation workflow is applied to all samples. Thus, re-assembling existing data with new tools using automated and programmable pipelines may yield more accurate identification of taxon-specific trends across samples in addition to novel and useful products for the community.
Collapse
Affiliation(s)
- Lisa K Johnson
- Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA.,Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, One Shields Ave, Davis, CA 95616, USA
| | - Harriet Alexander
- Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA.,Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543, USA
| | - C Titus Brown
- Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA.,Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, One Shields Ave, Davis, CA 95616, USA.,Genome Center, University of California Davis, 451 Health Sciences Dr, Davis, CA 95616, USA
| |
Collapse
|
38
|
Dexter JP, Prabakaran S, Gunawardena J. A Complex Hierarchy of Avoidance Behaviors in a Single-Cell Eukaryote. Curr Biol 2019; 29:4323-4329.e2. [PMID: 31813604 DOI: 10.1016/j.cub.2019.10.059] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 08/21/2019] [Accepted: 10/29/2019] [Indexed: 11/29/2022]
Abstract
Complex behavior is associated with animals with nervous systems, but decision-making and learning also occur in non-neural organisms [1], including singly nucleated cells [2-5] and multi-nucleate synctia [6-8]. Ciliates are single-cell eukaryotes, widely dispersed in aquatic habitats [9], with an extensive behavioral repertoire [10-13]. In 1906, Herbert Spencer Jennings [14, 15] described in the sessile ciliate Stentor roeseli a hierarchy of responses to repeated stimulation, which are among the most complex behaviors reported for a singly nucleated cell [16, 17]. These results attracted widespread interest [18, 19] and exert continuing fascination [7, 20-22] but were discredited during the behaviorist orthodoxy by claims of non-reproducibility [23]. These claims were based on experiments with the motile ciliate Stentor coeruleus. We acquired and maintained the correct organism in laboratory culture and used micromanipulation and video microscopy to confirm Jennings' observations. Despite significant individual variation, not addressed by Jennings, S. roeseli exhibits avoidance behaviors in a characteristic hierarchy of bending, ciliary alteration, contractions, and detachment, which is distinct from habituation or conditioning. Remarkably, the choice of contraction versus detachment is consistent with a fair coin toss. Such behavioral complexity may have had an evolutionary advantage in protist ecosystems, and the ciliate cortex may have provided mechanisms for implementing such behavior prior to the emergence of multicellularity. Our work resurrects Jennings' pioneering insights and adds to the list of exceptional features, including regeneration [24], genome rearrangement [25], codon reassignment [26], and cortical inheritance [27], for which the ciliate clade is renowned.
Collapse
Affiliation(s)
- Joseph P Dexter
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA; Neukom Institute for Computational Science, Dartmouth College, 27 North Main Street, Hanover, NH 03755, USA
| | - Sudhakaran Prabakaran
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Jeremy Gunawardena
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA.
| |
Collapse
|
39
|
Lasek-Nesselquist E, Johnson MD. A Phylogenomic Approach to Clarifying the Relationship of Mesodinium within the Ciliophora: A Case Study in the Complexity of Mixed-Species Transcriptome Analyses. Genome Biol Evol 2019; 11:3218-3232. [PMID: 31665294 PMCID: PMC6859813 DOI: 10.1093/gbe/evz233] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/29/2019] [Indexed: 11/25/2022] Open
Abstract
Recent high-throughput sequencing endeavors have yielded multigene/protein phylogenies that confidently resolve several inter- and intra-class relationships within the phylum Ciliophora. We leverage the massive sequencing efforts from the Marine Microbial Eukaryote Transcriptome Sequencing Project, other SRA submissions, and available genome data with our own sequencing efforts to determine the phylogenetic position of Mesodinium and to generate the most taxonomically rich phylogenomic ciliate tree to date. Regardless of the data mining strategy, the multiprotein data set, or the molecular models of evolution employed, we consistently recovered the same well-supported relationships among ciliate classes, confirming many of the higher-level relationships previously identified. Mesodinium always formed a monophyletic group with members of the Litostomatea, with mixotrophic species of Mesodinium-M. rubrum, M. major, and M. chamaeleon-being more closely related to each other than to the heterotrophic member, M. pulex. The well-supported position of Mesodinium as sister to other litostomes contrasts with previous molecular analyses including those from phylogenomic studies that exploited the same transcriptomic databases. These topological discrepancies illustrate the need for caution when mining mixed-species transcriptomes and indicate that identifying ciliate sequences among prey contamination-particularly for Mesodinium species where expression from stolen prey nuclei appears to dominate-requires thorough and iterative vetting with phylogenies that incorporate sequences from a large outgroup of prey.
Collapse
Affiliation(s)
| | - Matthew D Johnson
- Biology, Woods Hole Oceanographic Institution, Woods Hole, Massachusetts
| |
Collapse
|
40
|
Schmidt M. A metric space for semantic containment: Towards the implementation of genetic firewalls. Biosystems 2019; 185:104015. [PMID: 31408698 DOI: 10.1016/j.biosystems.2019.104015] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Revised: 08/06/2019] [Accepted: 08/08/2019] [Indexed: 12/13/2022]
Abstract
Analysing or engineering the genetic code has mainly been considered as an approach to reduce or increase the mutational robustness of the genetic code, i.e. the error tolerance in DNA mutations, or to enable the incorporation of non-canonical amino acids. The approach of "semantic containment", however, is less interested in altering the mutational tolerance of the standard code, but to create synthetic alternative genetic codes that limit or all together impede horizontal gene transfer between a natural and genomically recoded organisms (GRO). A major claim or conjecture of semantic containment is: "the farther, the safer", meaning, the less similarity there is between two codes, the less chance of a horizontal gene transfer, and the stronger the genetic firewall. So far, no metrics were available to measure and quantify the "genetic distance" between different genetic codes. Such a metric, however, is iis paramount to allow the experimental testing and evaluation of the validity of semantic biocontainment for the first time. Here, we introduce a metric space to measure exactly the distance (dissimilarity) between different genetic codes, in order to provide a framework to evaluate the relation between distance and strength of a genetic firewall. Results are presented that incorporate bespoken metrics when producing alternative genetic codes according to predefined goals, specifications and limitations. Finally, as an outlook, implications and challenges for genetic firewall(s) are discussed for dual- and multi-code systems.
Collapse
|
41
|
Wang R, Liu J, Di Giuseppe G, Liang A. UAA and UAG may Encode Amino Acid in Cathepsin B Gene of Euplotes octocarinatus. J Eukaryot Microbiol 2019; 67:144-149. [PMID: 31419839 DOI: 10.1111/jeu.12755] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Revised: 07/23/2019] [Accepted: 08/11/2019] [Indexed: 12/28/2022]
Abstract
The ciliate Euplotes deviates from the universal genetic code by translating UGA as cysteine and using UAA and UAG as the termination codon. Here, we cloned and sequenced the Cathepsin B gene of Euplotes octocarinatus (Eo-CTSB) which containing several in-frame stop codons throughout the coding sequence. We provide evidences, based on 3'-RACE method and Western blot, that the Eo-CTSB gene is actively expressed. Comparison of the derived amino acid sequence with the homologs in other eukaryotes revealed that UAA and UAG may code for glutamine in Eo-CTSB. These findings imply an evolutionary complexity of stop codon reassignment in eukaryotes.
Collapse
Affiliation(s)
- Ruanlin Wang
- Key Laboratory of Chemical Biology and Molecular Engineering of Ministry of Education, Institute of Biotechnology, Shanxi University, Taiyuan, 030006, China
| | - Jingni Liu
- Key Laboratory of Chemical Biology and Molecular Engineering of Ministry of Education, Institute of Biotechnology, Shanxi University, Taiyuan, 030006, China
| | | | - Aihua Liang
- Key Laboratory of Chemical Biology and Molecular Engineering of Ministry of Education, Institute of Biotechnology, Shanxi University, Taiyuan, 030006, China
| |
Collapse
|
42
|
Ivanov A, Shuvalova E, Egorova T, Shuvalov A, Sokolova E, Bizyaev N, Shatsky I, Terenin I, Alkalaeva E. Polyadenylate-binding protein-interacting proteins PAIP1 and PAIP2 affect translation termination. J Biol Chem 2019; 294:8630-8639. [PMID: 30992367 DOI: 10.1074/jbc.ra118.006856] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Revised: 03/29/2019] [Indexed: 12/29/2022] Open
Abstract
Polyadenylate-binding protein (PABP) stimulates translation termination via interaction of its C-terminal domain with eukaryotic polypeptide chain release factor, eRF3. Additionally, two other proteins, poly(A)-binding protein-interacting proteins 1 and 2 (PAIP1 and PAIP2), bind the same domain of PABP and regulate its translation-related activity. To study the biochemistry of eRF3 and PAIP1/2 competition for PABP binding, we quantified the effects of PAIPs on translation termination in the presence or absence of PABP. Our results demonstrated that both PAIP1 and PAIP2 prevented translation termination at the premature termination codon, by controlling PABP activity. Moreover, PAIP1 and PAIP2 inhibited the activity of free PABP on translation termination in vitro However, after binding the poly(A) tail, PABP became insensitive to suppression by PAIPs and efficiently activated translation termination in the presence of eRF3a. Additionally, we revealed that PAIP1 binds eRF3 in solution, which stabilizes the post-termination complex. These results indicated that PAIP1 and PAIP2 participate in translation termination and are important regulators of readthrough at the premature termination codon.
Collapse
Affiliation(s)
- Alexandr Ivanov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia; Faculty of Bioengineering and Bioinformatics, M. V. Lomonosov Moscow State University, Moscow 119234, Russia
| | - Ekaterina Shuvalova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Tatiana Egorova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Alexey Shuvalov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Elizaveta Sokolova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Nikita Bizyaev
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Ivan Shatsky
- Belozersky Institute of Physico-Chemical Biology, M. V. Lomonosov Moscow State University, Moscow 119234, Russia
| | - Ilya Terenin
- Belozersky Institute of Physico-Chemical Biology, M. V. Lomonosov Moscow State University, Moscow 119234, Russia; Sechenov First Moscow State Medical University, Institute of Molecular Medicine, Moscow 119146, Russia.
| | - Elena Alkalaeva
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia.
| |
Collapse
|
43
|
BłaŻej P, Wnetrzak M, Mackiewicz D, Mackiewicz P. The influence of different types of translational inaccuracies on the genetic code structure. BMC Bioinformatics 2019; 20:114. [PMID: 30841864 PMCID: PMC6404327 DOI: 10.1186/s12859-019-2661-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 01/29/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The standard genetic code is a recipe for assigning unambiguously 21 labels, i.e. amino acids and stop translation signal, to 64 codons. However, at early stages of the translational machinery development, the codons did not have to be read unambiguously and the early genetic codes could have contained some ambiguous assignments of codons to amino acids. Therefore, the goal of this work was to obtain the genetic code structures which could have evolved assuming different types of inaccuracy of the translational machinery starting from unambiguous assignments of codons to amino acids. RESULTS We developed a theoretical model assuming that the level of uncertainty of codon assignments can gradually decrease during the simulations. Since it is postulated that the standard code has evolved to be robust against point mutations and mistranslations, we developed three simulation scenarios assuming that such errors can influence one, two or three codon positions. The simulated codes were selected using the evolutionary algorithm methodology to decrease coding ambiguity and increase their robustness against mistranslation. CONCLUSIONS The results indicate that the typical codon block structure of the genetic code could have evolved to decrease the ambiguity of amino acid to codon assignments and to increase the fidelity of reading the genetic information. However, the robustness to errors was not the decisive factor that influenced the genetic code evolution because it is possible to find theoretical codes that minimize the reading errors better than the standard genetic code.
Collapse
Affiliation(s)
- Paweł BłaŻej
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Małgorzata Wnetrzak
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Dorota Mackiewicz
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Paweł Mackiewicz
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| |
Collapse
|
44
|
Many alternative and theoretical genetic codes are more robust to amino acid replacements than the standard genetic code. J Theor Biol 2019; 464:21-32. [DOI: 10.1016/j.jtbi.2018.12.030] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Revised: 12/17/2018] [Accepted: 12/19/2018] [Indexed: 02/07/2023]
|
45
|
A precedented nuclear genetic code with all three termination codons reassigned as sense codons in the syndinean Amoebophrya sp. ex Karlodinium veneficum. PLoS One 2019; 14:e0212912. [PMID: 30818350 PMCID: PMC6394959 DOI: 10.1371/journal.pone.0212912] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 02/12/2019] [Indexed: 02/06/2023] Open
Abstract
Amoebophrya is part of an enigmatic, diverse, and ubiquitous marine alveolate lineage known almost entirely from anonymous environmental sequencing. Two cultured Amoebophrya strains grown on core dinoflagellate hosts were used for transcriptome sequencing. BLASTx using different genetic codes suggests that Amoebophyra sp. ex Karlodinium veneficum uses the three typical stop codons (UAA, UAG, and UGA) to encode amino acids. When UAA and UAG are translated as glutamine about half of the alignments have better BLASTx scores, and when UGA is translated as tryptophan one fifth have better scores. However, the sole stop codon appears to be UGA based on conserved genes, suggesting contingent translation of UGA. Neither host sequences, nor sequences from the second strain, Amoebophrya sp. ex Akashiwo sanguinea had similar results in BLASTx searches. A genome survey of Amoebophyra sp. ex K. veneficum showed no evidence for transcript editing aside from mitochondrial transcripts. The dynein heavy chain (DHC) gene family was surveyed and of 14 transcripts only two did not use UAA, UAG, or UGA in a coding context. Overall the transcriptome displayed strong bias for A or U in third codon positions, while the tRNA genome survey showed bias against codons ending in U, particularly for amino acids with two codons ending in either C or U. Together these clues suggest contingent translation mechanisms in Amoebophyra sp. ex K. veneficum and a phylogenetically distinct instance of genetic code modification.
Collapse
|
46
|
Wada M, Ito K. Misdecoding of rare CGA codon by translation termination factors, eRF1/eRF3, suggests novel class of ribosome rescue pathway in S. cerevisiae. FEBS J 2019; 286:788-802. [PMID: 30471181 PMCID: PMC7379694 DOI: 10.1111/febs.14709] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Revised: 10/24/2018] [Accepted: 11/22/2018] [Indexed: 12/13/2022]
Abstract
The CGA arginine codon is a rare codon in Saccharomyces cerevisiae. Thus, full-length mature protein synthesis from reporter genes with internal CGA codon repeats are markedly reduced, and the reporters, instead, produce short-sized polypeptides via an unknown mechanism. Considering the product size and similar properties between CGA sense and UGA stop codons, we hypothesized that eukaryote polypeptide-chain release factor complex eRF1/eRF3 catalyses polypeptide release at CGA repeats. Herein, we performed a series of analyses and report that the CGA codon can be, to a certain extent, decoded as a stop codon in yeast. This also raises an intriguing possibility that translation termination factors eRF1/eRF3 rescue ribosomes stalled at CGA codons, releasing premature polypeptides, and competing with canonical tRNAICG to the CGA codon. Our results suggest an alternative ribosomal rescue pathway in eukaryotes. The present results suggest that misdecoding of low efficient codons may play a novel role in global translation regulation in S. cerevisiae.
Collapse
Affiliation(s)
- Miki Wada
- Department of Computational Biology and Medical SciencesGraduate School of Frontier SciencesThe University of TokyoKashiwa‐cityJapan
- Technical officeThe Institute of Medical ScienceThe University of TokyoMinato‐kuJapan
| | - Koichi Ito
- Department of Computational Biology and Medical SciencesGraduate School of Frontier SciencesThe University of TokyoKashiwa‐cityJapan
| |
Collapse
|
47
|
Abstract
The eukaryotic translation pathway has been studied for more than four decades, but the molecular mechanisms that regulate each stage of the pathway are not completely defined. This is in part because we have very little understanding of the kinetic framework for the assembly and disassembly of pathway intermediates. Steps of the pathway are thought to occur in the subsecond to second time frame, but most assays to monitor these events require minutes to hours to complete. Understanding translational control in sufficient detail will therefore require the development of assays that can precisely monitor the kinetics of the translation pathway in real time. Here, we describe the translation pathway from the perspective of its kinetic parameters, discuss advances that are helping us move toward the goal of a rigorous kinetic understanding, and highlight some of the challenges that remain.
Collapse
|
48
|
Su HJ, Barkman TJ, Hao W, Jones SS, Naumann J, Skippington E, Wafula EK, Hu JM, Palmer JD, dePamphilis CW. Novel genetic code and record-setting AT-richness in the highly reduced plastid genome of the holoparasitic plant Balanophora. Proc Natl Acad Sci U S A 2019; 116:934-943. [PMID: 30598433 PMCID: PMC6338844 DOI: 10.1073/pnas.1816822116] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Plastid genomes (plastomes) vary enormously in size and gene content among the many lineages of nonphotosynthetic plants, but key lineages remain unexplored. We therefore investigated plastome sequence and expression in the holoparasitic and morphologically bizarre Balanophoraceae. The two Balanophora plastomes examined are remarkable, exhibiting features rarely if ever seen before in plastomes or in any other genomes. At 15.5 kb in size and with only 19 genes, they are among the most reduced plastomes known. They have no tRNA genes for protein synthesis, a trait found in only three other plastid lineages, and thus Balanophora plastids must import all tRNAs needed for translation. Balanophora plastomes are exceptionally compact, with numerous overlapping genes, highly reduced spacers, loss of all cis-spliced introns, and shrunken protein genes. With A+T contents of 87.8% and 88.4%, the Balanophora genomes are the most AT-rich genomes known save for a single mitochondrial genome that is merely bloated with AT-rich spacer DNA. Most plastid protein genes in Balanophora consist of ≥90% AT, with several between 95% and 98% AT, resulting in the most biased codon usage in any genome described to date. A potential consequence of its radical compositional evolution is the novel genetic code used by Balanophora plastids, in which TAG has been reassigned from stop to tryptophan. Despite its many exceptional properties, the Balanophora plastome must be functional because all examined genes are transcribed, its only intron is correctly trans-spliced, and its protein genes, although highly divergent, are evolving under various degrees of selective constraint.
Collapse
Affiliation(s)
- Huei-Jiun Su
- Department of Earth and Life Sciences, University of Taipei, 100 Taipei, Taiwan
- Department of Biology, Pennsylvania State University, University Park, PA 16802
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
| | - Todd J Barkman
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI 49008
| | - Weilong Hao
- Department of Biological Sciences, Wayne State University, Detroit, MI 48202
| | - Samuel S Jones
- Graduate Program in Plant Biology, Pennsylvania State University, University Park, PA 16802
| | - Julia Naumann
- Department of Biology, Pennsylvania State University, University Park, PA 16802
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
| | | | - Eric K Wafula
- Department of Biology, Pennsylvania State University, University Park, PA 16802
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
| | - Jer-Ming Hu
- Institute of Ecology and Evolutionary Biology, National Taiwan University, 106 Taipei, Taiwan
| | - Jeffrey D Palmer
- Department of Biology, Indiana University, Bloomington, IN 47405;
| | - Claude W dePamphilis
- Department of Biology, Pennsylvania State University, University Park, PA 16802;
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
- Graduate Program in Plant Biology, Pennsylvania State University, University Park, PA 16802
| |
Collapse
|
49
|
Hines HN, Onsbring H, Ettema TJ, Esteban GF. Molecular Investigation of the Ciliate Spirostomum semivirescens, with First Transcriptome and New Geographical Records. Protist 2018; 169:875-886. [DOI: 10.1016/j.protis.2018.08.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 08/06/2018] [Accepted: 08/09/2018] [Indexed: 11/26/2022]
|
50
|
Pyrrolysine in archaea: a 22nd amino acid encoded through a genetic code expansion. Emerg Top Life Sci 2018; 2:607-618. [PMID: 33525836 DOI: 10.1042/etls20180094] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Revised: 09/24/2018] [Accepted: 09/25/2018] [Indexed: 11/17/2022]
Abstract
The 22nd amino acid discovered to be directly encoded, pyrrolysine, is specified by UAG. Until recently, pyrrolysine was only known to be present in archaea from a methanogenic lineage (Methanosarcinales), where it is important in enzymes catalysing anoxic methylamines metabolism, and a few anaerobic bacteria. Relatively new discoveries have revealed wider presence in archaea, deepened functional understanding, shown remarkable carbon source-dependent expression of expanded decoding and extended exploitation of the pyrrolysine machinery for synthetic code expansion. At the same time, other studies have shown the presence of pyrrolysine-containing archaea in the human gut and this has prompted health considerations. The article reviews our knowledge of this fascinating exception to the 'standard' genetic code.
Collapse
|