1
|
Su M, Roberts SJ, Sutherland JD. Initial Amino Acid:Codon Assignments and Strength of Codon:Anticodon Binding. J Am Chem Soc 2024; 146:12857-12863. [PMID: 38676654 PMCID: PMC11082893 DOI: 10.1021/jacs.4c03644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/29/2024]
Abstract
The ribosome brings 3'-aminoacyl-tRNA and 3'-peptidyl-tRNAs together to enable peptidyl transfer by binding them in two major ways. First, their anticodon loops are bound to mRNA, itself anchored at the ribosomal subunit interface, by contiguous anticodon:codon pairing augmented by interactions with the decoding center of the small ribosomal subunit. Second, their acceptor stems are bound by the peptidyl transferase center, which aligns the 3'-aminoacyl- and 3'-peptidyl-termini for optimal interaction of the nucleophilic amino group and electrophilic ester carbonyl group. Reasoning that intrinsic codon:anticodon binding might have been a major contributor to bringing tRNA 3'-termini into proximity at an early stage of ribosomal peptide synthesis, we wondered if primordial amino acids might have been assigned to those codons that bind the corresponding anticodon loops most tightly. By measuring the binding of anticodon stem loops to short oligonucleotides, we determined that family-box codon:anticodon pairings are typically tighter than split-box codon:anticodon pairings. Furthermore, we find that two family-box anticodon stem loops can tightly bind a pair of contiguous codons simultaneously, whereas two split-box anticodon stem loops cannot. The amino acids assigned to family boxes correspond to those accessible by what has been termed cyanosulfidic chemistry, supporting the contention that these limited amino acids might have been the first used in primordial coded peptide synthesis.
Collapse
Affiliation(s)
- Meng Su
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge
Biomedical Campus, Cambridge CB2 0QH, U.K.
| | - Samuel J. Roberts
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge
Biomedical Campus, Cambridge CB2 0QH, U.K.
| | - John D. Sutherland
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge
Biomedical Campus, Cambridge CB2 0QH, U.K.
| |
Collapse
|
2
|
Li G, Zhang L, Xue P. Codon usage divergence in Delta variants (B.1.617.2) of SARS-CoV-2. Infect Genet Evol 2022; 97:105175. [PMID: 34871776 PMCID: PMC8641433 DOI: 10.1016/j.meegid.2021.105175] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 11/27/2021] [Accepted: 12/01/2021] [Indexed: 01/17/2023]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spreads all over the world and brings great harm to humans in many countries. Many new SARS-CoV-2 variants appeared during its transmission. In the present study, the Delta variants (B.1.617.2) of SARS-CoV-2, which have appeared in many countries, were considered for analysis. In order to evaluate the evolutionary divergence of the Delta variants(B.1.617.2), the codon usage divergence in Delta variants (B.1.617.2) of SARS-CoV-2 was compared to that of the SARS-CoV-2 genomes emerged before June 2020. All Delta variants (B.1.617.2) and 350 early genomes of SARS-CoV-2 in the NCBI database were downloaded. Codon usage pattern including the basic composition, the GC ratio of the third position (GC3) and the first two positions (GC12) in codons, overall GC contents, the effective number of codons (ENC), the codon bias index (CBI), the relative synonymous codon usage (RSCU) values, etc., of all concerned important gene sequences were all calculated. Codon usage divergence of them was calculated via summing their standard deviations. The results suggested that base compositions in both Delta variants (B.1.617.2) of SARS-CoV-2 and the early SARS-CoV-2 genomes were similar to each other. However, the internal codon usage divergence for most genes in Delta variants (B.1.617.2) was significantly wider than that of SARS-CoV-2. The RSCU values were further used to explore the synonymous and non-synonymous mutations in the sequences of the Delta variants (B.1.617.2), and the results showed the synonymous mutations are more obvious than the non-synonymous in the concerned sequences. The related codon usage divergence analysis is helpful for further study on the adaptability and disease prognosis of the SARS-CoV-2 variants.
Collapse
Affiliation(s)
- Gun Li
- Laboratory for Biodiversity Science, Department of Biomedical Engineering, School of Electronic Information Engineering, Xi'An Technological University, Xi'An, China.
| | - Liang Zhang
- Laboratory for Biodiversity Science, Department of Biomedical Engineering, School of Electronic Information Engineering, Xi'An Technological University, Xi'An, China
| | - Pei Xue
- Laboratory for Biodiversity Science, Department of Biomedical Engineering, School of Electronic Information Engineering, Xi'An Technological University, Xi'An, China
| |
Collapse
|
3
|
Abstract
A key characteristic of cancer cells is their increased proliferative capacity, which requires elevated levels of protein synthesis. The process of protein synthesis involves the translation of codons within the mRNA coding sequence into a string of amino acids to form a polypeptide chain. As most amino acids are encoded by multiple codons, the nucleotide sequence of a coding region can vary dramatically without altering the polypeptide sequence of the encoded protein. Although mutations that do not alter the final amino acid sequence are often thought of as silent/synonymous, these can still have dramatic effects on protein output. Because each codon has a distinct translation elongation rate and can differentially impact mRNA stability, each codon has a different degree of 'optimality' for protein synthesis. Recent data demonstrates that the codon preference of a transcriptome matches the abundance of tRNAs within the cell and that this supply and demand between tRNAs and mRNAs varies between different cell types. The largest observed distinction is between mRNAs encoding proteins associated with proliferation or differentiation. Nevertheless, precisely how codon optimality and tRNA expression levels regulate cell fate decisions and their role in malignancy is not fully understood. This review describes the current mechanistic understanding on codon optimality, its role in malignancy and discusses the potential to target codon optimality therapeutically in the context of cancer.
Collapse
Affiliation(s)
- Sarah L Gillen
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK.
| | - Joseph A Waldron
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK
| | - Martin Bushell
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK.
- Institute of Cancer Sciences, University of Glasgow, Glasgow, UK, G61 1QH.
| |
Collapse
|
4
|
Daouda T, Dumont-Lagacé M, Feghaly A, Benslimane Y, Panes R, Courcelles M, Benhammadi M, Harrington L, Thibault P, Major F, Bengio Y, Gagnon É, Lemieux S, Perreault C. CAMAP: Artificial neural networks unveil the role of codon arrangement in modulating MHC-I peptides presentation. PLoS Comput Biol 2021; 17:e1009482. [PMID: 34679099 PMCID: PMC8577786 DOI: 10.1371/journal.pcbi.1009482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Revised: 11/09/2021] [Accepted: 09/27/2021] [Indexed: 12/02/2022] Open
Abstract
MHC-I associated peptides (MAPs) play a central role in the elimination of virus-infected and neoplastic cells by CD8 T cells. However, accurately predicting the MAP repertoire remains difficult, because only a fraction of the transcriptome generates MAPs. In this study, we investigated whether codon arrangement (usage and placement) regulates MAP biogenesis. We developed an artificial neural network called Codon Arrangement MAP Predictor (CAMAP), predicting MAP presentation solely from mRNA sequences flanking the MAP-coding codons (MCCs), while excluding the MCC per se. CAMAP predictions were significantly more accurate when using original codon sequences than shuffled codon sequences which reflect amino acid usage. Furthermore, predictions were independent of mRNA expression and MAP binding affinity to MHC-I molecules and applied to several cell types and species. Combining MAP ligand scores, transcript expression level and CAMAP scores was particularly useful to increase MAP prediction accuracy. Using an in vitro assay, we showed that varying the synonymous codons in the regions flanking the MCCs (without changing the amino acid sequence) resulted in significant modulation of MAP presentation at the cell surface. Taken together, our results demonstrate the role of codon arrangement in the regulation of MAP presentation and support integration of both translational and post-translational events in predictive algorithms to ameliorate modeling of the immunopeptidome. MHC-I associated peptides (MAPs) are small fragments of intracellular proteins presented at the surface of cells and used by the immune system to detect and eliminate cancerous or virus-infected cells. While it is theoretically possible to predict which portions of the intracellular proteins will be naturally processed by the cells to ultimately reach the surface, current methodologies have prohibitively high false discovery rates. Here we introduce an artificial neural network called Codon Arrangement MAP Predictor (CAMAP) which integrates information from mRNA-to-protein translation to other factors regulating MAP biogenesis (e.g. MAP ligand score and transcript expression levels) to improve MAP prediction accuracy. While most MAP predictive approaches focus on MAP sequences per se, CAMAP’s novelty is to analyze the MAP-flanking mRNA sequences, thereby providing completely independent information for MAP prediction. We show on several datasets that the integration of CAMAP scores with other known factors involved in MAP presentation (i.e. MAP ligand score and mRNA expression) significantly improves MAP prediction accuracy, and further validate CAMAP learned features using an in-vitro assay. These findings may have major implications for the design of vaccines against cancers and viruses, and in times of pandemics could accelerate the identification of relevant MAPs of viral origins.
Collapse
Affiliation(s)
- Tariq Daouda
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Biochemistry, Université de Montréal, Montréal, Canada
- * E-mail:
| | - Maude Dumont-Lagacé
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Medicine, Université de Montréal, Montréal, Canada
| | - Albert Feghaly
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
| | - Yahya Benslimane
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Medicine, Université de Montréal, Montréal, Canada
| | - Rébecca Panes
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Microbiology, Infectiology and Immunology, Université de Montréal, Montréal, Canada
| | - Mathieu Courcelles
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
| | - Mohamed Benhammadi
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Medicine, Université de Montréal, Montréal, Canada
| | - Lea Harrington
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Medicine, Université de Montréal, Montréal, Canada
| | - Pierre Thibault
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Chemistry, Université de Montréal, Montréal, Canada
| | - François Major
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Computer Science and Operations Research, Université de Montréal, Montréal, Canada
| | - Yoshua Bengio
- Department of Computer Science and Operations Research, Université de Montréal, Montréal, Canada
| | - Étienne Gagnon
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Microbiology, Infectiology and Immunology, Université de Montréal, Montréal, Canada
| | - Sébastien Lemieux
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Biochemistry, Université de Montréal, Montréal, Canada
- Department of Computer Science and Operations Research, Université de Montréal, Montréal, Canada
| | - Claude Perreault
- Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, Canada
- Department of Medicine, Université de Montréal, Montréal, Canada
| |
Collapse
|
5
|
Niblett D, Nelson C, Leung CS, Rexroad G, Cozy J, Zhou J, Lancaster L, Noller HF. Mutations in domain IV of elongation factor EF-G confer -1 frameshifting. RNA 2021; 27:40-53. [PMID: 33008838 PMCID: PMC7749637 DOI: 10.1261/rna.077339.120] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 09/24/2020] [Indexed: 05/25/2023]
Abstract
A recent crystal structure of a ribosome complex undergoing partial translocation in the absence of elongation factor EF-G showed disruption of codon-anticodon pairing and slippage of the reading frame by -1, directly implicating EF-G in preservation of the translational reading frame. Among mutations identified in a random screen for dominant-lethal mutations of EF-G were a cluster of six that map to the tip of domain IV, which has been shown to contact the codon-anticodon duplex in trapped translocation intermediates. In vitro synthesis of a full-length protein using these mutant EF-Gs revealed dramatically increased -1 frameshifting, providing new evidence for a role for domain IV of EF-G in maintaining the reading frame. These mutations also caused decreased rates of mRNA translocation and rotational movement of the head and body domains of the 30S ribosomal subunit during translocation. Our results are in general agreement with recent findings from Rodnina and coworkers based on in vitro translation of an oligopeptide using EF-Gs containing mutations at two positions in domain IV, who found an inverse correlation between the degree of frameshifting and rates of translocation. Four of our six mutations are substitutions at positions that interact with the translocating tRNA, in each case contacting the RNA backbone of the anticodon loop. We suggest that EF-G helps to preserve the translational reading frame by preventing uncoupled movement of the tRNA through these contacts; a further possibility is that these interactions may stabilize a conformation of the anticodon that favors base-pairing with its codon.
Collapse
Affiliation(s)
- Dustin Niblett
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Charlotte Nelson
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Calvin S Leung
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Gillian Rexroad
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Jake Cozy
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Jie Zhou
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Laura Lancaster
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Harry F Noller
- Center for Molecular Biology of RNA and Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| |
Collapse
|
6
|
Sarkar AK, Sarzynska J, Lahiri A. Ensemble Allosteric Model for the Modified Wobble Hypothesis. J Phys Chem Lett 2020; 11:6337-6343. [PMID: 32701298 DOI: 10.1021/acs.jpclett.0c00854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The residue 2-thiouridine (s2U) provides a remarkable example for the "modified wobble" hypothesis, which postulates that some post-transcriptional modifications at the wobble position of tRNAs restrict recognition of degenerate codons. Through extensive molecular dynamics simulations using our χIDRP force field parameters, we demonstrate how this modification shifts the conformational ensemble from a predominantly disordered, as in the case of an RNA pentamer (GUUUC), to a substantially ordered population in Gs2UUUC. Our simulations clearly showed that the van der Waals interaction of sulfur played a major role in driving the disorder-to-order transition. The conformational redistribution and the slowing down of the transition between the clusters within the population in the presence of s2U suggest ensemble allostery to be a key mechanism that may play a general role in the functioning of the wobble modifications of tRNAs.
Collapse
Affiliation(s)
- Aditya K Sarkar
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, 92, Acharya Prafulla Chandra Road, Kolkata 700 009, India
| | - Joanna Sarzynska
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Z. Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Ansuman Lahiri
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, 92, Acharya Prafulla Chandra Road, Kolkata 700 009, India
| |
Collapse
|
7
|
Demongeot J, Seligmann H. Deamination gradients within codons after 1<->2 position swap predict amino acid hydrophobicity and parallel β-sheet conformational preference. Biosystems 2020; 191-192:104116. [PMID: 32081715 DOI: 10.1016/j.biosystems.2020.104116] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 12/04/2019] [Accepted: 02/10/2020] [Indexed: 12/16/2022]
Abstract
Deaminations C->T and A->G are frequent mutations producing nucleotide content gradients across genomes proportional to singlestrandedness during replication/transcription. Hence, within single codons, deamination risks increase from first to third codon positions, while second codon positions are functionally most crucial. Here genetic codes are analyzed assuming that after anticodons protected codons from deaminations, first and second codon positions swapped (N2N1N3->N1N2N3), with lowest deamination risks for N2 in presumed primitive N2N1N3 codons. N2N1N3, not standard N1N2N3, codon structure minimizes deaminations inversely proportionally to cognate amino acid hydrophobicity and parallel betasheet conformational preference. For N1N2N3, deamination minimization increases with genetic code integration order of cognate amino acids: during the presumed N2N1N3->N1N2N3 codon structure transition, protein synthesis combined direct codon-amino acid interactions for late amino acids and tRNA-based translation for early amino acids. Hence N2N1N3 codons would correspond to tRNA-free translation by spontaneous codon-amino acid affinities, and tRNA-mediated translation presumably caused N2N1N3->N1N2N3 swaps. Results show that rational, not arbitrary rules link codon and amino acid structures. Some analyses detect mitochondrial RNAs and peptides in public data corresponding to systematic position swaps, suggesting occasional swapping polymerase activity.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France.
| | - Hervé Seligmann
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France; The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel.
| |
Collapse
|
8
|
Moreira MH, Barros GC, Requião RD, Rossetto S, Domitrovic T, Palhano FL. From reporters to endogenous genes: the impact of the first five codons on translation efficiency in Escherichia coli. RNA Biol 2019; 16:1806-1816. [PMID: 31470761 PMCID: PMC6844562 DOI: 10.1080/15476286.2019.1661213] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Accepted: 08/23/2019] [Indexed: 12/29/2022] Open
Abstract
Translation initiation is a critical step in the regulation of protein synthesis, and it is subjected to different control mechanisms, such as 5' UTR secondary structure and initiation codon context, that can influence the rates at which initiation and consequentially translation occur. For some genes, translation elongation also affects the rate of protein synthesis. With a GFP library containing nearly all possible combinations of nucleotides from the 3rd to the 5th codon positions in the protein coding region of the mRNA, it was previously demonstrated that some nucleotide combinations increased GFP expression up to four orders of magnitude. While it is clear that the codon region from positions 3 to 5 can influence protein expression levels of artificial constructs, its impact on endogenous proteins is still unknown. Through bioinformatics analysis, we identified the nucleotide combinations of the GFP library in Escherichia coli genes and examined the correlation between the expected levels of translation according to the GFP data with the experimental measures of protein expression. We observed that E. coli genes were enriched with the nucleotide compositions that enhanced protein expression in the GFP library, but surprisingly, it seemed to affect the translation efficiency only marginally. Nevertheless, our data indicate that different enterobacteria present similar nucleotide composition enrichment as E. coli, suggesting an evolutionary pressure towards the conservation of short translational enhancer sequences.
Collapse
Affiliation(s)
- Mariana H. Moreira
- Programa de Biologia Estrutural, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Géssica C. Barros
- Programa de Biologia Estrutural, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rodrigo D. Requião
- Programa de Biologia Estrutural, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Silvana Rossetto
- Departamento de Ciência da Computação, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Tatiana Domitrovic
- Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Fernando L. Palhano
- Programa de Biologia Estrutural, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
9
|
Abstract
Gene prediction, also known as gene identification, gene finding, gene recognition, or gene discovery, is among one of the important problems of molecular biology and is receiving increasing attention due to the advent of large-scale genome sequencing projects. We designed an ab initio model (called ChemGenome) for gene prediction in prokaryotic genomes based on physicochemical characteristics of codons. In this chapter, we present the methodology of the latest version of this model ChemGenome2.1 (CG2.1). The first module of the protocol builds a three-dimensional vector from three calculated quantities for each codon-the double-helical trinucleotide base pairing energy, the base pair stacking energy, and an index of the propensity of a codon for protein-nucleic acid interactions. As this three-dimensional vector moves along any genome, the net orientation of the resultant vector should differ significantly for gene and non-genic regions to make a distinction feasible. The predicted putative protein-coding genes from above parameters are passed through a second module of the protocol which reduces the number of false positives by utilizing a filter based on stereochemical properties of protein sequences. The chemical properties of amino acid side chains taken into consideration are the presence of sp3 hybridized γ carbon atom, hydrogen bond donor ability, short/absence of δ carbon and linearity of the side chains/non-occurrence of bi-dentate forks with terminal hydrogen atoms in the side chain. The final prediction of the potential protein-coding genes is based on the frequency of occurrence of amino acids in the predicted protein sequences and their deviation from the frequency values of Swissprot protein sequences, both at monomer and tripeptide levels. The final screening is based on Z-score. Though CG2.1 is a gene finding tool for prokaryotes, considering the underlying similarity in the chemical and physical properties of DNA among prokaryotes and eukaryotes, we attempted to evaluate its applicability for gene finding in the lower eukaryotes. The results give a hope that the concept of gene finding based on physicochemical model of codons is a viable idea for eukaryotes as well, though, undoubtedly, improvements are needed.
Collapse
Affiliation(s)
- Akhilesh Mishra
- Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology Delhi, New Delhi, India
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India
| | - Priyanka Siwach
- Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology Delhi, New Delhi, India
- Department of Biotechnology, Chaudhary Devi Lal University, Sirsa, Haryana, India
| | - Poonam Singhal
- Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology Delhi, New Delhi, India
| | - B Jayaram
- Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology Delhi, New Delhi, India.
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India.
- Department of Chemistry, Indian Institute of Technology Delhi, New Delhi, India.
| |
Collapse
|
10
|
Hoernes TP, Faserl K, Juen MA, Kremser J, Gasser C, Fuchs E, Shi X, Siewert A, Lindner H, Kreutz C, Micura R, Joseph S, Höbartner C, Westhof E, Hüttenhofer A, Erlacher MD. Translation of non-standard codon nucleotides reveals minimal requirements for codon-anticodon interactions. Nat Commun 2018; 9:4865. [PMID: 30451861 PMCID: PMC6242847 DOI: 10.1038/s41467-018-07321-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 10/25/2018] [Indexed: 01/16/2023] Open
Abstract
The precise interplay between the mRNA codon and the tRNA anticodon is crucial for ensuring efficient and accurate translation by the ribosome. The insertion of RNA nucleobase derivatives in the mRNA allowed us to modulate the stability of the codon-anticodon interaction in the decoding site of bacterial and eukaryotic ribosomes, allowing an in-depth analysis of codon recognition. We found the hydrogen bond between the N1 of purines and the N3 of pyrimidines to be sufficient for decoding of the first two codon nucleotides, whereas adequate stacking between the RNA bases is critical at the wobble position. Inosine, found in eukaryotic mRNAs, is an important example of destabilization of the codon-anticodon interaction. Whereas single inosines are efficiently translated, multiple inosines, e.g., in the serotonin receptor 5-HT2C mRNA, inhibit translation. Thus, our results indicate that despite the robustness of the decoding process, its tolerance toward the weakening of codon-anticodon interactions is limited.
Collapse
Affiliation(s)
- Thomas Philipp Hoernes
- Division of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020, Innsbruck, Austria
| | - Klaus Faserl
- Division of Clinical Biochemistry, Biocenter, Medical University of Innsbruck, 6020, Innsbruck, Austria
| | - Michael Andreas Juen
- Institute of Organic Chemistry and Center for Molecular Biosciences (CMBI), University of Innsbruck, 6020, Innsbruck, Austria
| | - Johannes Kremser
- Institute of Organic Chemistry and Center for Molecular Biosciences (CMBI), University of Innsbruck, 6020, Innsbruck, Austria
| | - Catherina Gasser
- Institute of Organic Chemistry and Center for Molecular Biosciences (CMBI), University of Innsbruck, 6020, Innsbruck, Austria
| | - Elisabeth Fuchs
- Institute of Organic Chemistry and Center for Molecular Biosciences (CMBI), University of Innsbruck, 6020, Innsbruck, Austria
| | - Xinying Shi
- Department of Chemistry and Biochemistry, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0314, USA
| | - Aaron Siewert
- Institute of Organic Chemistry, University of Würzburg, Am Hubland, 97074, Würzburg, Germany
| | - Herbert Lindner
- Division of Clinical Biochemistry, Biocenter, Medical University of Innsbruck, 6020, Innsbruck, Austria
| | - Christoph Kreutz
- Institute of Organic Chemistry and Center for Molecular Biosciences (CMBI), University of Innsbruck, 6020, Innsbruck, Austria
| | - Ronald Micura
- Institute of Organic Chemistry and Center for Molecular Biosciences (CMBI), University of Innsbruck, 6020, Innsbruck, Austria
| | - Simpson Joseph
- Department of Chemistry and Biochemistry, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0314, USA
| | - Claudia Höbartner
- Institute of Organic Chemistry, University of Würzburg, Am Hubland, 97074, Würzburg, Germany
| | - Eric Westhof
- Architecture and Reactivity of RNA, Institute of Molecular and Cellular Biology of the CNRS UPR9002/University of Strasbourg, Strasbourg, 67084, France
| | - Alexander Hüttenhofer
- Division of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020, Innsbruck, Austria
| | - Matthias David Erlacher
- Division of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020, Innsbruck, Austria.
| |
Collapse
|
11
|
Tian L, Shen X, Murphy RW, Shen Y. The adaptation of codon usage of +ssRNA viruses to their hosts. Infect Genet Evol 2018; 63:175-179. [PMID: 29864509 PMCID: PMC7106036 DOI: 10.1016/j.meegid.2018.05.034] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Revised: 05/16/2018] [Accepted: 05/31/2018] [Indexed: 02/05/2023]
Abstract
Viruses depend on their host's cellular structure to survive. Most of them do not have tRNAs, their translation relies on hosts' tRNA pools. Over the course of evolution, viruses needed to optimally exploit cellular processes of their host. Thus, codon usage of a virus should coevolve with its host to efficiently and rapidly replicate. Some viruses can invade a broad spectrum of hosts (BSTVs), while others can invade a narrow spectrum only (NSTVs). Consequently, we test the hypothesis that similarity of codon usage preference and the degree of matching between BSTVs and their hosts will be lower than that of NSTVs, which only need to coevolve with few hosts. We compare the patterns of codon usage in 255 virus genomes to test this hypothesis. Our results show that NSTVs have a higher degree of matching to their hosts' tRNA pools than BSTVs. Further, analysis of the effective number of codons (ENC) infers that codon usage bias of NSTVs is relatively stronger than that of BSTVs. Thus, codon usage of NSTVs tends to better match their host than that of BSTVs. This supports the hypothesis that viruses adapt to the expression system of their host(s).
Collapse
Affiliation(s)
- Lin Tian
- College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China; Shantou University Medical College, Shantou 515041, China
| | - Xuejuan Shen
- College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Robert W Murphy
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, Toronto M5S 2C6, Canada
| | - Yongyi Shen
- College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China; Shantou University Medical College, Shantou 515041, China; Key Laboratory of Zoonosis Prevention and Control of Guangdong Province, Guangzhou 510642, China.
| |
Collapse
|
12
|
Abstract
DNA watermarking is a data hiding technique that aims to protect the copyright of DNA sequences and ensures the security of private genetic information. In this paper, we proposed a novel DNA watermarking technique that can be used to embed binary bits into real DNA sequences. The proposed technique mutates the codon postfix according to the embedded bit. Our method was tested for a sample set of DNA sequences and the extracted bits showed robustness against mutation. Furthermore, the proposed DNA watermarking method proved to be secured, undetectable, resistance, and preservative to biological functions.
Collapse
|
13
|
Abstract
Gene translation depends on accurate decoding of mRNA, the structural mechanism of which remains poorly understood. Ribosomes decode mRNA codons by selecting cognate aminoacyl-tRNAs delivered by elongation factor Tu (EF-Tu). Here we present high-resolution structural ensembles of ribosomes with cognate or near-cognate aminoacyl-tRNAs delivered by EF-Tu. Both cognate and near-cognate tRNA anticodons explore the aminoacyl-tRNA-binding site (A site) of an open 30S subunit, while inactive EF-Tu is separated from the 50S subunit. A transient conformation of decoding-centre nucleotide G530 stabilizes the cognate codon-anticodon helix, initiating step-wise 'latching' of the decoding centre. The resulting closure of the 30S subunit docks EF-Tu at the sarcin-ricin loop of the 50S subunit, activating EF-Tu for GTP hydrolysis and enabling accommodation of the aminoacyl-tRNA. By contrast, near-cognate complexes fail to induce the G530 latch, thus favouring open 30S pre-accommodation intermediates with inactive EF-Tu. This work reveals long-sought structural differences between the pre-accommodation of cognate and near-cognate tRNAs that elucidate the mechanism of accurate decoding.
Collapse
MESH Headings
- Anticodon/chemistry
- Anticodon/genetics
- Anticodon/ultrastructure
- Codon/chemistry
- Codon/genetics
- Codon/ultrastructure
- Cryoelectron Microscopy
- Escherichia coli/chemistry
- Escherichia coli/genetics
- Escherichia coli/ultrastructure
- GTP Phosphohydrolases/metabolism
- GTP Phosphohydrolases/ultrastructure
- Guanosine Triphosphate/metabolism
- Hydrolysis
- Models, Molecular
- Peptide Elongation Factor Tu/metabolism
- Peptide Elongation Factor Tu/ultrastructure
- Protein Biosynthesis
- Protein Domains
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 16S/metabolism
- RNA, Ribosomal, 16S/ultrastructure
- RNA, Transfer, Amino Acyl/genetics
- RNA, Transfer, Amino Acyl/metabolism
- RNA, Transfer, Amino Acyl/ultrastructure
- Ribosome Subunits/chemistry
- Ribosome Subunits/metabolism
- Ribosome Subunits/ultrastructure
- Ribosomes/chemistry
- Ribosomes/metabolism
- Ribosomes/ultrastructure
Collapse
Affiliation(s)
- Anna B. Loveland
- RNA Therapeutics Institute, Department of Biochemistry and Molecular Pharmacology. University of Massachusetts Medical School, 368 Plantation St., Worcester, MA 01605, USA
| | - Gabriel Demo
- RNA Therapeutics Institute, Department of Biochemistry and Molecular Pharmacology. University of Massachusetts Medical School, 368 Plantation St., Worcester, MA 01605, USA
| | - Nikolaus Grigorieff
- Janelia Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, VA 20147, USA
| | - Andrei A. Korostelev
- RNA Therapeutics Institute, Department of Biochemistry and Molecular Pharmacology. University of Massachusetts Medical School, 368 Plantation St., Worcester, MA 01605, USA
| |
Collapse
|
14
|
Wang SX, Xu YP, He LM. A novel HLA-E allele, HLA-E*01:01:01:07, identified in a Chinese leukemia patient. HLA 2017; 89:327-330. [PMID: 28244222 DOI: 10.1111/tan.12999] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2016] [Revised: 02/09/2017] [Accepted: 02/10/2017] [Indexed: 11/30/2022]
Abstract
A novel HLA-E allele, HLA-E*01:01:01:07, was identified in a Chinese leukemia patient.
Collapse
Affiliation(s)
- S-X Wang
- Key Laboratory of Shenzhen for Histocompatibility and Immunogenetics, Shenzhen Blood Center, Shenzhen, PR China
| | - Y-P Xu
- Key Laboratory of Shenzhen for Histocompatibility and Immunogenetics, Shenzhen Blood Center, Shenzhen, PR China
| | - Li-M He
- Key Laboratory of Shenzhen for Histocompatibility and Immunogenetics, Shenzhen Blood Center, Shenzhen, PR China
| |
Collapse
|
15
|
Bae YA. Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis. Korean J Parasitol 2017; 55:175-183. [PMID: 28506040 PMCID: PMC5450960 DOI: 10.3347/kjp.2017.55.2.175] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Revised: 04/05/2017] [Accepted: 04/06/2017] [Indexed: 11/28/2022]
Abstract
Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC12 and GC3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
Collapse
|
16
|
Záhonová K, Kostygov AY, Ševčíková T, Yurchenko V, Eliáš M. An Unprecedented Non-canonical Nuclear Genetic Code with All Three Termination Codons Reassigned as Sense Codons. Curr Biol 2016; 26:2364-9. [PMID: 27593378 DOI: 10.1016/j.cub.2016.06.064] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Revised: 06/25/2016] [Accepted: 06/28/2016] [Indexed: 12/31/2022]
Abstract
A limited number of non-canonical genetic codes have been described in eukaryotic nuclear genomes. Most involve reassignment of one or two termination codons as sense ones [1-4], but no code variant is known that would have reassigned all three termination codons. Here, we describe such a variant that we discovered in a clade of trypanosomatids comprising nominal Blastocrithidia species. In these protists, UGA has been reassigned to encode tryptophan, while UAG and UAA (UAR) have become glutamate encoding. Strikingly, UAA and, less frequently, UAG also serve as bona fide termination codons. The release factor eRF1 in Blastocrithidia contains a substitution of a conserved serine residue predicted to decrease its affinity to UGA, which explains why this triplet can be read as a sense codon. However, the molecular basis for the dual interpretation of UAR codons remains elusive. Our findings expand the limits of comprehension of one of the fundamental processes in molecular biology.
Collapse
Affiliation(s)
- Kristína Záhonová
- Life Science Research Centre, Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic; Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic
| | - Alexei Y Kostygov
- Life Science Research Centre, Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic; Zoological Institute of the Russian Academy of Sciences, Universitetskaya nab. 1, 199034 Saint Petersburg, Russia
| | - Tereza Ševčíková
- Life Science Research Centre, Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic
| | - Vyacheslav Yurchenko
- Life Science Research Centre, Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic; Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic; Institute of Parasitology, Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic
| | - Marek Eliáš
- Life Science Research Centre, Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic; Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Chittussiho 10, 710 00 Ostrava, Czech Republic.
| |
Collapse
|
17
|
Pitchai GP, Hickson ID, Streicher W, Montoya G, Mesa P. Characterization of the NTPR and BD1 interacting domains of the human PICH-BEND3 complex. Acta Crystallogr F Struct Biol Commun 2016; 72:646-51. [PMID: 27487930 PMCID: PMC4973307 DOI: 10.1107/s2053230x16010724] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 07/01/2016] [Indexed: 11/10/2022] Open
Abstract
Chromosome integrity depends on DNA structure-specific processing complexes that resolve DNA entanglement between sister chromatids. If left unresolved, these entanglements can generate either chromatin bridging or ultrafine DNA bridging in the anaphase of mitosis. These bridge structures are defined by the presence of the PICH protein, which interacts with the BEND3 protein in mitosis. To obtain structural insights into PICH-BEND3 complex formation at the atomic level, their respective NTPR and BD1 domains were cloned, overexpressed and crystallized using 1.56 M ammonium sulfate as a precipitant at pH 7.0. The protein complex readily formed large hexagonal crystals belonging to space group P6122, with unit-cell parameters a = b = 47.28, c = 431.58 Å and with one heterodimer in the asymmetric unit. A complete multiwavelength anomalous dispersion (MAD) data set extending to 2.2 Å resolution was collected from a selenomethionine-labelled crystal at the Swiss Light Source.
Collapse
Affiliation(s)
- Ganesha P. Pitchai
- Protein Structure and Function Programme, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3b, 2200 Copenhagen, Denmark
- Department of Cellular and Molecular Medicine, Center for Chromosome Stability and Center for Healthy Aging, University of Copenhagen, Blegdamsvej 3b, 2200 Copenhagen, Denmark
| | - Ian D. Hickson
- Department of Cellular and Molecular Medicine, Center for Chromosome Stability and Center for Healthy Aging, University of Copenhagen, Blegdamsvej 3b, 2200 Copenhagen, Denmark
| | - Werner Streicher
- Protein Structure and Function Programme, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3b, 2200 Copenhagen, Denmark
| | - Guillermo Montoya
- Protein Structure and Function Programme, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3b, 2200 Copenhagen, Denmark
| | - Pablo Mesa
- Protein Structure and Function Programme, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3b, 2200 Copenhagen, Denmark
| |
Collapse
|
18
|
Tang NC, Chilkoti A. Combinatorial codon scrambling enables scalable gene synthesis and amplification of repetitive proteins. Nat Mater 2016; 15:419-24. [PMID: 26726995 PMCID: PMC4809025 DOI: 10.1038/nmat4521] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 11/25/2015] [Indexed: 05/08/2023]
Abstract
Most genes are synthesized using seamless assembly methods that rely on the polymerase chain reaction (PCR). However, PCR of genes encoding repetitive proteins either fails or generates nonspecific products. Motivated by the need to efficiently generate new protein polymers through high-throughput gene synthesis, here we report a codon-scrambling algorithm that enables the PCR-based gene synthesis of repetitive proteins by exploiting the codon redundancy of amino acids and finding the least-repetitive synonymous gene sequence. We also show that the codon-scrambling problem is analogous to the well-known travelling salesman problem, and obtain an exact solution to it by using De Bruijn graphs and a modern mixed integer linear programme solver. As experimental proof of the utility of this approach, we use it to optimize the synthetic genes for 19 repetitive proteins, and show that the gene fragments are amenable to PCR-based gene assembly and recombinant expression.
Collapse
Affiliation(s)
- Nicholas C Tang
- Department of Biomedical Engineering, Duke University, Durham, North Carolina 27708, USA
| | - Ashutosh Chilkoti
- Department of Biomedical Engineering, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
19
|
Abstract
We explore the origin-of-life consequences of the view that biological systems are demarcated from inanimate matter by their possession of referential information, which is processed computationally to control choices of specific physico-chemical events. Cells are cybernetic: they use genetic information in processes of communication and control, subjecting physical events to a system of integrated governance. The genetic code is the most obvious example of how cells use information computationally, but the historical origin of the usefulness of molecular information is not well understood. Genetic coding made information useful because it imposed a modular metric on the evolutionary search and thereby offered a general solution to the problem of finding catalysts of any specificity. We use the term "quasispecies symmetry breaking" to describe the iterated process of self-organisation whereby the alphabets of distinguishable codons and amino acids increased, step by step.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PB 92019, Auckland, 1142, New Zealand,
| | | | | |
Collapse
|
20
|
José MV, Zamudio GS, Palacios-Pérez M, Bobadilla JR, de Farías ST. Symmetrical and Thermodynamic Properties of Phenotypic Graphs of Amino Acids Encoded by the Primeval RNY Code. ORIGINS LIFE EVOL B 2015; 45:77-83. [PMID: 25796391 DOI: 10.1007/s11084-015-9427-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2014] [Accepted: 01/26/2015] [Indexed: 10/23/2022]
Abstract
The 12 different types of graphs of the 8 amino acids encoded by the presumably primeval RNY code are derived. The symmetry groups of these graphs are analyzed and coincide with the corresponding values of polar requirement for each amino acid. The symmetry groups at the codon level are partially carried over as a group or subgroup at the amino acid level. Measures of centrality of the 12 graphs indicate that all amino acids were equally relevant irrespective of its chronological order of its appearance. The elimination of any amino acid would be strongly selected against and therefore the genetic code at this stage was already frozen.
Collapse
Affiliation(s)
- Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, México, D.F., 04510, Mexico,
| | | | | | | | | |
Collapse
|
21
|
Ikehara K. Protein ordered sequences are formed by random joining of amino acids in protein 0(th)-order structure, followed by evolutionary process. ORIGINS LIFE EVOL B 2014; 44:279-81. [PMID: 25585800 DOI: 10.1007/s11084-014-9384-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 11/03/2014] [Indexed: 10/24/2022]
Abstract
Only random processes should occur on the primitive Earth. In contrast, many ordered sequences are synthesized according to genetic information on the present Earth. In this communication, I have proposed an idea that protein 0(th)-order structures or specific amino acid compositions would mediate the transfer from random process to formation of ordered sequences, after formation of double-stranded genes.
Collapse
Affiliation(s)
- Kenji Ikehara
- Nara Study Center, The Open University of Japan, Nara, 630-8589, Japan,
| |
Collapse
|
22
|
Sengupta S, Aggarwal N, Bandhu AV. Two perspectives on the origin of the standard genetic code. ORIGINS LIFE EVOL B 2014; 44:287-91. [PMID: 25585805 DOI: 10.1007/s11084-014-9394-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 09/30/2014] [Indexed: 10/24/2022]
Abstract
The origin of a genetic code made it possible to create ordered sequences of amino acids. In this article we provide two perspectives on code origin by carrying out simulations of code-sequence coevolution in finite populations with the aim of examining how the standard genetic code may have evolved from more primitive code(s) encoding a small number of amino acids. We determine the efficacy of the physico-chemical hypothesis of code origin in the absence and presence of horizontal gene transfer (HGT) by allowing a diverse collection of code-sequence sets to compete with each other. We find that in the absence of horizontal gene transfer, natural selection between competing codes distinguished by differences in the degree of physico-chemical optimization is unable to explain the structure of the standard genetic code. However, for certain probabilities of the horizontal transfer events, a universal code emerges having a structure that is consistent with the standard genetic code.
Collapse
MESH Headings
- Amino Acids/chemistry
- Amino Acids/metabolism
- Amino Acyl-tRNA Synthetases/chemistry
- Amino Acyl-tRNA Synthetases/metabolism
- Codon/chemistry
- Codon/metabolism
- Evolution, Molecular
- Gene Transfer, Horizontal
- Genes
- Genetic Code
- Models, Genetic
- Origin of Life
- Probability
- Protein Biosynthesis
- RNA, Messenger/chemistry
- RNA, Messenger/metabolism
- RNA, Transfer, Amino Acid-Specific/chemistry
- RNA, Transfer, Amino Acid-Specific/metabolism
- Selection, Genetic
Collapse
Affiliation(s)
- Supratim Sengupta
- Department of Physical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, 741246, India,
| | | | | |
Collapse
|
23
|
Fang J, Zou L, Zhou X, Cheng B, Fan J. Synonymous rare arginine codons and tRNA abundance affect protein production and quality of TEV protease variant. PLoS One 2014; 9:e112254. [PMID: 25426854 PMCID: PMC4245098 DOI: 10.1371/journal.pone.0112254] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Accepted: 08/10/2014] [Indexed: 11/27/2022] Open
Abstract
It has been identified that a TEV protease (TEVp) variant, TEVp5M, displays improved solubility. Here, we constructed fifteen TEVp5M variants with one or more of six rare arginine codons in the coding sequence replaced with abundant E. coli arginine codons. These codon variants expressed in either E. coli BL21 (DE3) or Rossetta (DE3) showed different solubility and activity. Supply of rare tRNAs increased the tendency of certain codon variants to form insoluble aggregates at early induction stage, as determined by the fused S-tag. About 32% increase in soluble protein production of M5 variant with four synonymously mutated arginine codons was identified in Rossetta (DE3) cells using GFP fusion reporter, comparable to that of TEVp5M. After purification, two other codon variants from both E. coli strains exhibited less activity than TEVp5M on cleaving the native or modified recognition sequence incorporated between GST and E. coli diaminopropionate ammonialyase by enzyme-coupled assay, whereas purified M5 variant showed activity similar to the TEVp5M. Supply of rare tRNAs caused the decrease of activity of TEVp5M and M5 by about 21%. Our results revealed that engineering of highly soluble TEVp variants can be achieved by the combined mutations of amino acid residues and optimization of specific rare codons, whereas simple augment of rare tRNAs abundance resulted in partial loss of activity.
Collapse
Affiliation(s)
- Jie Fang
- Key Laboratory of Crop Biology of Anhui Province, School of Life Science, Anhui Agricultural University, Hefei, Anhui, PR China
| | - Lingling Zou
- Key Laboratory of Crop Biology of Anhui Province, School of Life Science, Anhui Agricultural University, Hefei, Anhui, PR China
| | - Xuan Zhou
- Key Laboratory of Crop Biology of Anhui Province, School of Life Science, Anhui Agricultural University, Hefei, Anhui, PR China
| | - Beijiu Cheng
- Key Laboratory of Crop Biology of Anhui Province, School of Life Science, Anhui Agricultural University, Hefei, Anhui, PR China
| | - Jun Fan
- Key Laboratory of Crop Biology of Anhui Province, School of Life Science, Anhui Agricultural University, Hefei, Anhui, PR China
- * E-mail:
| |
Collapse
|
24
|
Sharma VK, Xiong W, Glick J, Vouros P. Determination of site selectivity of different carcinogens for preferential mutational hot spots in oligonucleotide fragments by ion-pair reversed-phase nano liquid chromatography tandem mass spectrometry. Eur J Mass Spectrom (Chichester) 2014; 20:63-72. [PMID: 24881456 DOI: 10.1255/ejms.1268] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Ion-pair reversed-phase nano liquid chromatography coupled with nanospray ion trap mass spectrometry was used to investigate site selectivity of the known carcinogens N-acetoxy-2-acetylaminofluorene, N-hydroxy-4-aminobiphenyl and (+/-)-anti-benzo[a]pyrene diol epoxide with the synthetic double-strand 14-mer long oligonucleotide fragment of the p53 gene containing two mutational hot-spot codons (5'-P-ACC155 CGC156 GTC157 CGC158 GC/5'-GCG CGG ACG CGG GT). The investigation was performed using a monolithic polystyrene divinylbenzene capillary column and triethylammonium bicarbonate as an ion-pair reagent. The exact location of the carcinogen on the modified oligonucleotide backbone was determined using characteristic collision-induced dissociation fragmentation patterns obtained under negative-ion mode ionization. In all these cases, the adducted, isomeric oligonucleotides formed were chromatographically resolved and structural identification was performed without any prior deoxyribonucleic acid cleavage or hydrolysis. The knowledge of the site specificity of a carcinogen, especially at purported mutational hot spots, is of paramount importance (1) in establishing the identity of biomarkers for an early risk assessment of the formed DNA adducts, (2) developing repair mechanisms for the formed carcinogen adducted DNA, and (3) understanding the nature of the covalent bond formed and mapping the frequency of the adduction process.
Collapse
|
25
|
Banda M, Recio L, Parsons BL. ACB-PCR measurement of spontaneous and furan-induced H-ras codon 61 CAA to CTA and CAA to AAA mutation in B6C3F1 mouse liver. Environ Mol Mutagen 2013; 54:659-667. [PMID: 24038307 DOI: 10.1002/em.21808] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2013] [Revised: 06/27/2013] [Accepted: 07/02/2013] [Indexed: 06/02/2023]
Abstract
Furan is a rodent liver carcinogen, but the mode of action for furan hepatocarcinogenicity is unclear. H-ras codon 61 mutations have been detected in spontaneous liver tumors of B6C3F1 mice, and the fraction of liver tumors carrying H-ras codon 61 CAA to AAA mutation increased in furan-treated mice. Allele-specific competitive blocker PCR (ACB-PCR) has been used previously to quantify early, carcinogen-induced increases in tumor-associated mutations. The present pilot study investigated whether furan drives clonal expansion of pre-existing H-ras mutant cells in B6C3F1 mouse liver. H-ras codon 61 CAA to CTA and CAA to AAA mutations were measured in DNA isolated from liver tissue of female mice treated with 0, 1, 2, 4, or 8 mg furan/kg body weight, five days per week for three weeks, using five mice per treatment group. Spontaneous levels of mutation were low, with two of five control mice having an H-ras codon 61 CTA or AAA mutant fraction (MF) greater than 10(-5) . Several furan-treated mice had H-ras codon 61 AAA or CTA MFs greater than those measured in control mice and lower bound estimates of induced MF were calculated. However, no statistically-significant differences were observed between treatment groups. Therefore, while sustained exposure to furan is carcinogenic, at the early stage of carcinogenesis examined in this study (three weeks), there was not a significant expansion of H-ras mutant cells.
Collapse
Affiliation(s)
- Malathi Banda
- National Center for Toxicological Research, U.S. Food and Drug Administration, Division of Genetic and Molecular Toxicology, Jefferson, Arkansas
| | | | | |
Collapse
|
26
|
Abstract
The phenotypes of biological systems are to some extent robust to genotypic changes. Such robustness exists on multiple levels of biological organization. We analyzed this robustness for two categories of amino acids in proteins. Specifically, we studied the codons of amino acids that bind or do not bind small molecular ligands. We asked to what extent codon changes caused by mutation or mistranslation may affect physicochemical amino acid properties or protein folding. We found that the codons of ligand-binding amino acids are on average more robust than those of non-binding amino acids. Because mistranslation is usually more frequent than mutation, we speculate that selection for error mitigation at the translational level stands behind this phenomenon. Our observations suggest that natural selection can affect the robustness of very small units of biological organization.
Collapse
Affiliation(s)
- Tugce Bilgin
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
| | | | | |
Collapse
|
27
|
Numata T. [Molecular basis for tRNA(Ile) agmatinylation essential for AUA codon decoding]. Seikagaku 2012; 84:1004-1008. [PMID: 23350510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Affiliation(s)
- Tomoyuki Numata
- Biomedical Research Institute, National Institute of Advanced Industrial Science and Technology, Ibaraki, Japan
| |
Collapse
|
28
|
Parisien M, Yi C, Pan T. Rationalization and prediction of selective decoding of pseudouridine-modified nonsense and sense codons. RNA 2012; 18:355-367. [PMID: 22282339 PMCID: PMC3285925 DOI: 10.1261/rna.031351.111] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2011] [Accepted: 12/05/2011] [Indexed: 05/31/2023]
Abstract
A stop or nonsense codon is an in-frame triplet within a messenger RNA that signals the termination of translation. One common feature shared among all three nonsense codons (UAA, UAG, and UGA) is a uridine present at the first codon position. It has been recently shown that the conversion of this uridine into pseudouridine (Ψ) suppresses translation termination, both in vitro and in vivo. Furthermore, decoding of the pseudouridylated nonsense codons is accompanied by the incorporation of two specific amino acids in a nonsense codon-dependent fashion. Ψ differs from uridine by a single N¹H group at the C5 position; how Ψ suppresses termination and, more importantly, enables selective decoding is poorly understood. Here, we provide molecular rationales for how pseudouridylated stop codons are selectively decoded. Our analysis applies crystal structures of ribosomes in varying states of translation to consider weakened interaction of Ψ with release factor; thermodynamic and geometric considerations of the codon-anticodon base pairs to rank and to eliminate mRNA-tRNA pairs; the mechanism of fidelity check of the codon-anticodon pairing by the ribosome to evaluate noncanonical codon-anticodon base pairs and the role of water. We also consider certain tRNA modifications that interfere with the Ψ-coordinated water in the major groove of the codon-anticodon mini-helix. Our analysis of nonsense codons enables prediction of potential decoding properties for Ψ-modified sense codons, such as decoding ΨUU potentially as Cys and Tyr. Our results provide molecular rationale for the remarkable dynamics of ribosome decoding and insights on possible reprogramming of the genetic code using mRNA modifications.
Collapse
Affiliation(s)
| | - Chengqi Yi
- Department of Biochemistry and Molecular Biology
| | - Tao Pan
- Department of Biochemistry and Molecular Biology
- Institute for Biophysical Dynamics, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
29
|
Brandman R, Brandman Y, Pande VS. Sequence coevolution between RNA and protein characterized by mutual information between residue triplets. PLoS One 2012; 7:e30022. [PMID: 22279560 PMCID: PMC3261191 DOI: 10.1371/journal.pone.0030022] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Accepted: 12/09/2011] [Indexed: 11/21/2022] Open
Abstract
Coevolving residues in a multiple sequence alignment provide evolutionary clues of biophysical interactions in 3D structure. Despite a rich literature describing amino acid coevolution within or between proteins and nucleic acid coevolution within RNA, to date there has been no direct evidence of coevolution between protein and RNA. The ribosome, a structurally conserved macromolecular machine composed of over 50 interacting protein and RNA chains, provides a natural example of RNA/protein interactions that likely coevolved. We provide the first direct evidence of RNA/protein coevolution by characterizing the mutual information in residue triplets from a multiple sequence alignment of ribosomal protein L22 and neighboring 23S RNA. We define residue triplets as three positions in the multiple sequence alignment, where one position is from the 23S RNA and two positions are from the L22 protein. We show that residue triplets with high mutual information are more likely than residue doublets to be proximal in 3D space. Some high mutual information residue triplets cluster in a connected series across the L22 protein structure, similar to patterns seen in protein coevolution. We also describe RNA nucleotides for which switching from one nucleotide to another (or between purines and pyrimidines) results in a change in amino acid distribution for proximal amino acid positions. Multiple crystal structures for evolutionarily distinct ribosome species can provide structural evidence for these differences. For one residue triplet, a pyrimidine in one species is a purine in another, and RNA/protein hydrogen bonds are present in one species but not the other. The results provide the first direct evidence of RNA/protein coevolution by using higher order mutual information, suggesting that biophysical constraints on interacting RNA and protein chains are indeed a driving force in their evolution.
Collapse
Affiliation(s)
- Relly Brandman
- Chemical and Systems Biology, Stanford University, Stanford, California, United States of America
| | - Yigal Brandman
- Chemistry, Stanford University, Stanford, California, United States of America
| | - Vijay S. Pande
- Chemistry, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
30
|
Abstract
In the universal genetic code, most amino acids can be encoded by multiple trinucleotide codons, and the choice among available codons can influence position-specific translation elongation rates. By using sequence-based ribosome profiling, we obtained transcriptome-wide profiles of in vivo ribosome occupancy as a function of codon identity in Caenorhabditis elegans and human cells. Particularly striking in these profiles was a universal trend of higher ribosome occupancy for codons translated via G:U wobble base-pairing compared with synonymous codons that pair with the same tRNA family using G:C base-pairing. These data support a model in which ribosomal translocation is slowed at wobble codon positions.
Collapse
Affiliation(s)
- Michael Stadler
- Department of Genetics, Stanford University, Stanford, California 94305-5324, USA
| | - Andrew Fire
- Department of Genetics, Stanford University, Stanford, California 94305-5324, USA
- Department of Pathology, Stanford University, Stanford, California 94305-5324, USA
| |
Collapse
|
31
|
Zubko E, Kunova A, Meyer P. Sense and antisense transcripts of convergent gene pairs in Arabidopsis thaliana can share a common polyadenylation region. PLoS One 2011; 6:e16769. [PMID: 21311762 PMCID: PMC3032780 DOI: 10.1371/journal.pone.0016769] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2010] [Accepted: 12/24/2010] [Indexed: 12/03/2022] Open
Abstract
The Arabidopsis genome contains a large number of gene pairs that encode sense and antisense transcripts with overlapping 3′ regions, indicative for a potential role of natural antisense transcription in regulating sense gene expression or transcript processing. When we mapped poly(A) transcripts of three plant gene pairs with long overlapping antisense transcripts, we identified an unusual transcript composition for two of the three gene pairs. Both genes pairs encoded a class of long sense transcripts and a class of short sense transcripts that terminate within the same polyadenylation region as the antisense transcripts encoded by the opposite strand. We find that the presence of the short sense transcript was not dependent on the expression of an antisense transcript. This argues against the assumption that the common termination region for sense and antisense poly(A) transcripts is the result of antisense-specific regulation. We speculate that for some genes evolution may have especially favoured alternative polyadenylation events that shorten transcript length for gene pairs with overlapping sense/antisense transcription, if this reduces the likelihood for dsRNA formation and transcript degradation.
Collapse
MESH Headings
- Arabidopsis/genetics
- Arabidopsis/metabolism
- Codon/chemistry
- Codon/genetics
- Codon/metabolism
- Gene Expression Profiling
- Gene Expression Regulation, Plant
- Genes, Plant
- Genotype
- Oligonucleotide Array Sequence Analysis
- Plants, Genetically Modified
- Poly A/genetics
- Poly A/metabolism
- Polyadenylation/genetics
- RNA Processing, Post-Transcriptional/genetics
- RNA, Antisense/chemistry
- RNA, Antisense/genetics
- RNA, Antisense/metabolism
- RNA, Double-Stranded/metabolism
- RNA, Messenger/chemistry
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Regulatory Sequences, Ribonucleic Acid/physiology
- Transcription, Genetic
Collapse
Affiliation(s)
- Elena Zubko
- University of Leeds, Centre for Plant Sciences, Leeds, United Kingdom
| | | | | |
Collapse
|
32
|
Galimand M, Schmitt E, Panvert M, Desmolaize B, Douthwaite S, Mechulam Y, Courvalin P. Intrinsic resistance to aminoglycosides in Enterococcus faecium is conferred by the 16S rRNA m5C1404-specific methyltransferase EfmM. RNA 2011; 17:251-262. [PMID: 21159796 PMCID: PMC3022275 DOI: 10.1261/rna.2233511] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2010] [Accepted: 11/04/2010] [Indexed: 05/29/2023]
Abstract
Aminoglycosides are ribosome-targeting antibiotics and a major drug group of choice in the treatment of serious enterococcal infections. Here we show that aminoglycoside resistance in Enterococcus faecium strain CIP 54-32 is conferred by the chromosomal gene efmM, encoding the E. faecium methyltransferase, as well as by the previously characterized aac(6')-Ii that encodes a 6'-N-aminoglycoside acetyltransferase. Inactivation of efmM in E. faecium increases susceptibility to the aminoglycosides kanamycin and tobramycin, and, conversely, expression of a recombinant version of efmM in Escherichia coli confers resistance to these drugs. The EfmM protein shows significant sequence similarity to E. coli RsmF (previously called YebU), which is a 5-methylcytidine (m⁵C) methyltransferase modifying 16S rRNA nucleotide C1407. The target for EfmM is shown by mass spectrometry to be a neighboring 16S rRNA nucleotide at C1404. EfmM uses the methyl group donor S-adenosyl-L-methionine to catalyze formation of m⁵C1404 on the 30S ribosomal subunit, whereas naked 16S rRNA and the 70S ribosome are not substrates. Addition of the 5-methyl to C1404 sterically hinders aminoglycoside binding. Crystallographic structure determination of EfmM at 2.28 Å resolution reveals an N-terminal domain connected to a central methyltransferase domain that is linked by a flexible lysine-rich region to two C-terminal subdomains. Mutagenesis of the methyltransferase domain established that two cysteines at specific tertiary locations are required for catalysis. The tertiary structure of EfmM is highly similar to that of RsmF, consistent with m⁵C formation at adjacent sites on the 30S subunit, while distinctive structural features account for the enzymes' respective specificities for nucleotides C1404 and C1407.
Collapse
Affiliation(s)
- Marc Galimand
- Unité des Agents Antibactériens, Institut Pasteur, F-75724 Paris Cedex 15, France.
| | | | | | | | | | | | | |
Collapse
|
33
|
Anhlan D, Grundmann N, Makalowski W, Ludwig S, Scholtissek C. Origin of the 1918 pandemic H1N1 influenza A virus as studied by codon usage patterns and phylogenetic analysis. RNA 2011; 17:64-73. [PMID: 21068184 PMCID: PMC3004067 DOI: 10.1261/rna.2395211] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
The pandemic of 1918 was caused by an H1N1 influenza A virus, which is a negative strand RNA virus; however, little is known about the nature of its direct ancestral strains. Here we applied a broad genetic and phylogenetic analysis of a wide range of influenza virus genes, in particular the PB1 gene, to gain information about the phylogenetic relatedness of the 1918 H1N1 virus. We compared the RNA genome of the 1918 strain to many other influenza strains of different origin by several means, including relative synonymous codon usage (RSCU), effective number of codons (ENC), and phylogenetic relationship. We found that the PB1 gene of the 1918 pandemic virus had ENC values similar to the H1N1 classical swine and human viruses, but different ENC values from avian as well as H2N2 and H3N2 human viruses. Also, according to the RSCU of the PB1 gene, the 1918 virus grouped with all human isolates and "classical" swine H1N1 viruses. The phylogenetic studies of all eight RNA gene segments of influenza A viruses may indicate that the 1918 pandemic strain originated from a H1N1 swine virus, which itself might be derived from a H1N1 avian precursor, which was separated from the bulk of other avian viruses in toto a long time ago. The high stability of the RSCU pattern of the PB1 gene indicated that the integrity of RNA structure is more important for influenza virus evolution than previously thought.
Collapse
Affiliation(s)
- Darisuren Anhlan
- Institute of Molecular Virology (IMV), Centre of Molecular Biology of Inflammation (ZMBE), University of Münster, 48149 Münster, Germany
| | | | | | | | | |
Collapse
|
34
|
Letzring DP, Dean KM, Grayhack EJ. Control of translation efficiency in yeast by codon-anticodon interactions. RNA 2010; 16:2516-28. [PMID: 20971810 PMCID: PMC2995412 DOI: 10.1261/rna.2411710] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2010] [Accepted: 09/24/2010] [Indexed: 05/17/2023]
Abstract
The choice of synonymous codons used to encode a polypeptide contributes to substantial differences in translation efficiency between genes. However, both the magnitude and the mechanisms of codon-mediated effects are unknown, as neither the effects of individual codons nor the parameters that modulate codon-mediated regulation are understood, particularly in eukaryotes. To explore this problem in Saccharomyces cerevisiae, we performed the first systematic analysis of codon effects on expression. We find that the arginine codon CGA is strongly inhibitory, resulting in progressively and sharply reduced expression with increased CGA codon dosage. CGA-mediated inhibition of expression is primarily due to wobble decoding of CGA, since it is nearly completely suppressed by coexpression of an exact match anticodon-mutated tRNA(Arg(UCG)), and is associated with generation of a smaller RNA fragment, likely due to endonucleolytic cleavage at a stalled ribosome. Moreover, CGA codon pairs are more effective inhibitors of expression than individual CGA codons. These results directly implicate decoding by the ribosome and interactions at neighboring sites within the ribosome as mediators of codon-specific translation efficiency.
Collapse
Affiliation(s)
- Daniel P Letzring
- Department of Biochemistry and Biophysics, University of Rochester Medical School, Rochester, New York 14642, USA
| | | | | |
Collapse
|
35
|
Zaher HS, Green R. Kinetic basis for global loss of fidelity arising from mismatches in the P-site codon:anticodon helix. RNA 2010; 16:1980-1989. [PMID: 20724456 PMCID: PMC2941106 DOI: 10.1261/rna.2241810] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2010] [Accepted: 07/16/2010] [Indexed: 05/27/2023]
Abstract
Faithful decoding of the genetic information by the ribosome relies on kinetically driven mechanisms that promote selection of cognate substrates during elongation. Recently, we have shown that in addition to these kinetically driven mechanisms, the ribosome possesses a post peptidyl transfer quality control system that retrospectively monitors the codon-anticodon interaction in the P site, triggering substantial losses in the specificity of the A site during subsequent tRNA and RF selection when a mistake has occurred. Here, we report a detailed kinetic analysis of tRNA selection in the context of a mismatched P-site codon:anticodon interaction. We observe pleiotropic effects of a P-site mismatch on tRNA selection, such that near-cognate tRNA is processed by the ribosome almost as efficiently as cognate. In particular, after a miscoding event, near-cognate codon-anticodon complexes are stabilized on the ribosome to an extent similar to that observed for cognate ones. Moreover, the two observed forward rates of GTPase activation and accommodation are greatly accelerated (∼10-fold) for near-cognate tRNAs. Because the ensemble of effects of a mismatched P site on substrate selection were found to be different from those reported for other ribosomal perturbations and miscoding agents, we propose that the structural integrity of the mRNA-tRNA helix in the P site provides a distinct molecular switch that dictates the specificity of the A site.
Collapse
Affiliation(s)
- Hani S Zaher
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | |
Collapse
|
36
|
Ortiz-Meoz RF, Green R. Functional elucidation of a key contact between tRNA and the large ribosomal subunit rRNA during decoding. RNA 2010; 16:2002-2013. [PMID: 20739608 PMCID: PMC2941108 DOI: 10.1261/rna.2232710] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2010] [Accepted: 07/14/2010] [Indexed: 05/29/2023]
Abstract
The selection of cognate tRNAs during translation is specified by a kinetic discrimination mechanism driven by distinct structural states of the ribosome. While the biochemical steps that drive the tRNA selection process have been carefully documented, it remains unclear how recognition of matched codon:anticodon helices in the small subunit facilitate global rearrangements in the ribosome complex that efficiently promote tRNA decoding. Here we use an in vitro selection approach to isolate tRNA(Trp) miscoding variants that exhibit a globally perturbed tRNA tertiary structure. Interestingly, the most substantial distortions are positioned in the elbow region of the tRNA that closely approaches helix 69 (H69) of the large ribosomal subunit. The importance of these specific interactions to tRNA selection is underscored by our kinetic analysis of both tRNA and rRNA variants that perturb the integrity of this interaction.
Collapse
MESH Headings
- Base Sequence
- Binding Sites/genetics
- Codon/chemistry
- Codon/genetics
- Codon/metabolism
- Escherichia coli/genetics
- Escherichia coli/metabolism
- Kinetics
- Models, Molecular
- Mutation
- Nucleic Acid Conformation
- Peptide Chain Elongation, Translational
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Bacterial/metabolism
- RNA, Ribosomal/chemistry
- RNA, Ribosomal/genetics
- RNA, Ribosomal/metabolism
- RNA, Transfer/chemistry
- RNA, Transfer/genetics
- RNA, Transfer/metabolism
- RNA, Transfer, Trp/chemistry
- RNA, Transfer, Trp/genetics
- RNA, Transfer, Trp/metabolism
- Ribosome Subunits, Large, Bacterial/chemistry
- Ribosome Subunits, Large, Bacterial/genetics
- Ribosome Subunits, Large, Bacterial/metabolism
- Thermodynamics
Collapse
Affiliation(s)
- Rodrigo F Ortiz-Meoz
- Department of Molecular Biology and Genetics, Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | |
Collapse
|
37
|
Kampenusa I, Zikmanis P. Distinguishable codon usage and amino acid composition patterns among substrates of leaderless secretory pathways from proteobacteria. Appl Microbiol Biotechnol 2010; 86:285-93. [PMID: 20107986 DOI: 10.1007/s00253-009-2423-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2009] [Revised: 12/21/2009] [Accepted: 12/24/2009] [Indexed: 11/26/2022]
Abstract
The combined set of codon usage frequencies (61 sense codons) from the 111 annotated sequences of leaderless secreted type I, type III, type IV, and type VI proteins from proteobacteria were subjected to the forward and backward selection to obtain a combination of most effective predictor variables for classification/prediction purposes. The group of 24 codon frequencies displayed a strong discriminatory power with an accuracy of 100% for originally grouped and 97.3 +/- 1.6% for cross-validated (LOOCV) cases and an acceptable error rate (0.062 +/- 0.012) in k-fold (k = 6) cross-validation (KCV). The summary frequencies of synonymous codons for ten amino acids as the alternative predictor variables revealed a comparable discriminatory power (92.8 +/- 2.5% for LOOCV), however at somewhat lower levels of prediction accuracy (0.106 +/- 0.015 of KCV). A number of significant (p < 0.001) differences were found among indices of codon usage and amino acid composition depending on a definite secretion type. About 60% of secretion substrates were characterized as apparently originated from horizontal gene transfer events or putative alien genes and found to be unequally allocated in respect of groups. The proposed prediction approaches could be used to specify secretome proteins from genomic sequences as well as to assess the compatibility between bacterial secretion pathways and secretion substrates.
Collapse
Affiliation(s)
- Ināra Kampenusa
- Institute of Microbiology and Biotechnology, University of Latvia, Riga, Latvia, Kronvalda Boulevard 4, LV-1010 Riga, Latvia
| | | |
Collapse
|
38
|
Ponnala L. Detecting slow-translating regions in E.coli. Int J Bioinform Res Appl 2010; 6:522-530. [PMID: 21224208 DOI: 10.1504/ijbra.2010.03799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
We present an application of the spatial scan statistic to identifying clusters of slow-translating codons in E.coli. We use an estimate of the time taken to process each codon based on the availability of transfer RNA. We analyse our findings using experimental measurements, and provide support for a well-known hypothesis related to the process of protein production.
Collapse
Affiliation(s)
- Lalit Ponnala
- Computational Biology Service Unit, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
39
|
Lee W, Kang J, Jung C, Hoelmer K, Lee SH, Lee S. Complete mitochondrial genome of brown marmorated stink bug Halyomorpha halys (Hemiptera: Pentatomidae), and phylogenetic relationships of hemipteran suborders. Mol Cells 2009; 28:155-65. [PMID: 19756390 DOI: 10.1007/s10059-009-0125-9] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2009] [Revised: 06/21/2009] [Accepted: 08/04/2009] [Indexed: 10/20/2022] Open
Abstract
The newly sequenced complete mitochondrial genome of the brown marmorated stink bug, Halyomorpha halys (Stål) (Hemiptera: Pentatomidae), is a circular molecule of 16,518 bp with a total A+T content of 76.4% and two extensive repeat regions in A+T rich region. Nucleotide composition and codon usage of H. halys are about average when compared with values observed in 19 other published hemipteran mitochondrial genomes. Phylogenetic analyses using these 20 hemipteran mitochondrial genomes support the currently accepted hypothesis that suborders Heteroptera and Auchenorrhyncha form a monophyletic group. The mitochondrial gene arrangements of the 20 genomes=are also consistent with our results.
Collapse
Affiliation(s)
- Wonhoon Lee
- Insect Biosystematics Laboratory, Research Institute for Agricultural and Life Sciences, Department of Agricultural Biotechnology, Seoul National University, Korea
| | | | | | | | | | | |
Collapse
|
40
|
Knoch TA, Göker M, Lohner R, Abuseiris A, Grosveld FG. Fine-structured multi-scaling long-range correlations in completely sequenced genomes--features, origin, and classification. Eur Biophys J 2009; 38:757-79. [PMID: 19533117 PMCID: PMC2701493 DOI: 10.1007/s00249-009-0489-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/20/2009] [Revised: 05/05/2009] [Accepted: 05/13/2009] [Indexed: 11/26/2022]
Abstract
The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation analysis on almost the entire observable scale of 132 completely sequenced chromosomes of 0.5 × 106 to 3.0 × 107 bp from Archaea, Bacteria, Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster, and Homo sapiens. The local correlation coefficients show a species-specific multi-scaling behaviour: close to random correlations on the scale of a few base pairs, a first maximum from 40 to 3,400 bp (for Arabidopsis thaliana and Drosophila melanogaster divided in two submaxima), and often a region of one or more second maxima from 105 to 3 × 105 bp. Within this multi-scaling behaviour, an additional fine-structure is present and attributable to codon usage in all except the human sequences, where it is related to nucleosomal binding. Computer-generated random sequences assuming a block organization of genomes, the codon usage, and nucleosomal binding explain these results. Mutation by sequence reshuffling destroyed all correlations. Thus, the stability of correlations seems to be evolutionarily tightly controlled and connected to the spatial genome organization, especially on large scales. In summary, genomes show a complex sequential organization related closely to their three-dimensional organization.
Collapse
MESH Headings
- Algorithms
- Animals
- Arabidopsis/genetics
- Chromosomes/chemistry
- Chromosomes/genetics
- Chromosomes/ultrastructure
- Chromosomes, Fungal/chemistry
- Chromosomes, Fungal/genetics
- Chromosomes, Fungal/ultrastructure
- Chromosomes, Human/chemistry
- Chromosomes, Human/genetics
- Chromosomes, Human/ultrastructure
- Chromosomes, Plant/chemistry
- Chromosomes, Plant/genetics
- Chromosomes, Plant/ultrastructure
- Codon/chemistry
- Computer Simulation
- DNA/chemistry
- Drosophila melanogaster/genetics
- Genome
- Humans
- Models, Genetic
- Mutation
- Nucleosomes/chemistry
- Saccharomyces cerevisiae/genetics
- Schizosaccharomyces/genetics
- Sequence Analysis, DNA
Collapse
Affiliation(s)
- Tobias A Knoch
- Biophysical Genomics, Cell Biology and Genetics, Erasmus Medical Center, Rotterdam, The Netherlands.
| | | | | | | | | |
Collapse
|
41
|
Chen XG, Hu J, Yang X. [Analysis of correlation of local GC level in human protein coding genes]. Yi Chuan 2008; 30:1169-1174. [PMID: 18779175 DOI: 10.3724/sp.j.1005.2008.01169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
GC level is an important feature of genomic composition, which significantly improve our understanding of structure, function and evolution of genes. In this paper, the nonredundant DNA sequence of 7,992 human protein coding genes were retrieved from public database and the local GC level of different sequence regions and correlation between GC levels were analyzed.. The results showed that the GC levels of different sequence regions were strikingly nonuniform. 5' untranslated regions were of richest GC, with average GC content being 62.5%. 3'-untranslated regions were of poorest GC, with average GC content being 43.97%. GC contents of 3' flanking sequences profoundly matched the GC levels of DNA large fragments where the genes were located. Although the GC contents of open reading frames (ORFs) were higher than that of intron, 3' non-translated region and 3' flanking sequences, high correlation existed among the GC contents of the four regions. Average GC content of the third codon position (GC3) was 58.9%, higher than that of the fist and second position, and showed high correlation to GC contents of ORFs, with correlation coefficients being 0.91, besides of its significant association with GC contents of intron, 3'-untranslated region and 3' flanking sequences. Moreover, the linear regression of GC3 against GC contents of 3' flanking sequences yielded a slope of 1.25. Thus, GC3 was a sensitive indicator for GC change of local genome. As for 5' flanking sequences, 5' untranslated regions, fist and second codon position, however, their GC level exhibited weaker correlation with that of other regions. These results suggest that the third codon positions, introns, 3'-untranslated regions and 3' flanking sequences may evolve similarly while first and second codon positions, 5' flanking sequences and 5' untranslated region were expected to bear more selective stress for holding their functions.
Collapse
Affiliation(s)
- Xiang-Gui Chen
- School of Bioengineering, Xihua University, Chengdu 610039, China.
| | | | | |
Collapse
|
42
|
Anichina J, Feil S, Uggerud E, Bohme DK. Structures, fragmentation, and protonation of trideoxynucleotide CCC mono- and dianions. J Am Soc Mass Spectrom 2008; 19:987-996. [PMID: 18440240 DOI: 10.1016/j.jasms.2008.03.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2007] [Revised: 02/26/2008] [Accepted: 03/20/2008] [Indexed: 05/26/2023]
Abstract
Both quantum chemical calculations and ESI mass spectrometry are used here to explore the gas-phase structures, energies, and stabilities against collision-induced dissociation of a relatively small model DNA molecule--a trideoxynucleotide with the sequence CCC, in its singly and doubly deprotonated forms, (CCC-H)(-) and (CCC-2H)(2-), respectively. Also, the gas-phase reactivity of these two anions was measured with HBr, a potential proton donor, using an ESI/SIFT/QqQ instrument. The computational results provide insight into the gas-phase structures of the electrosprayed (CCC-2H)(2-) and (CCC-H)(-) anions and the neutral CCC, as well as the proton affinities of the di- and monoanions. The dianion (CCC-2H)(2-) was found to dissociate upon CID by charge separation via two competing channels: separation into deprotonated cytosine (C-H)(-) and (CCC-(C-H)-2H)(-), and by w(1)(-)/a(2)(-) cleavage of the backbone. The monoanion (CCC-H)(-) loses a neutral cytosine upon CID, and an H/D-exchangeable proton, presumably residing on one of the phosphate groups, is transferred to the partially liberated (C-H)(-) before dissociation. This was confirmed by MS/MS experiments with the deuterated analog. The reaction of (CCC-2H)(2-) with HBr was observed to be rapid, k=(1.4+/-0.4) x 10(-9) cm(3) molecule(-1) s(-1), and to proceed both by addition (78%) and by proton transfer (22%) while (CCC-H)(-) reacts only by HBr addition, k=(7.1+/-2.1) x 10(-10) cm(3) molecule(-1) s(-1). This is in accord with the computed proton affinities of (CCC-2H)(2-) and (CCC-H)(-) anions that bracket the known proton affinity of Br(-).
Collapse
Affiliation(s)
- Janna Anichina
- Department of Chemistry and Centre for Research in Mass Spectrometry, York University, Toronto, Ontario, Canada
| | | | | | | |
Collapse
|
43
|
Targanski I, Cherkasova V. Analysis of genomic tRNA sets from Bacteria, Archaea, and Eukarya points to anticodon-codon hydrogen bonds as a major determinant of tRNA compositional variations. RNA 2008; 14:1095-109. [PMID: 18441051 PMCID: PMC2390787 DOI: 10.1261/rna.896108] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Analysis of 100 complete sets of the cytoplasmic elongator tRNA genes from Bacteria, Archaea, and Eukarya pointed to correspondences between types of anticodon and composition of the rest of the tRNA body. The number of the hydrogen bonds formed between the complementary nucleotides in the anticodon-codon duplex appeared as a major quantitative parameter determining covariations in all three domains of life. Our analysis has supported and advanced the "extended anticodon" concept that is based on the argument that the decoding performance of the anticodon is enhanced by selection of a matching anticodon stem-loop sequence, as reported by Yarus in 1982. In addition to the anticodon stem-loop, we have found covariations between the anticodon nucleotides and the composition of the distant regions of their respective tRNAs that include dihydrouridine (D) and thymidyl (T) stem-loops. The majority of the covariable tRNA positions were found at the regions with the increased dynamic potential--such as stem-loop and stem-stem junctions. The consistent occurrences of the covariations on the multigenomic level suggest that the number and pattern of the hydrogen bonds in the anticodon-codon duplex constitute a major factor in the course of translation that is reflected in the fine-tuning of the tRNA composition and structure.
Collapse
|
44
|
Singhal P, Jayaram B, Dixit SB, Beveridge DL. Prokaryotic gene finding based on physicochemical characteristics of codons calculated from molecular dynamics simulations. Biophys J 2008; 94:4173-83. [PMID: 18326660 PMCID: PMC2480686 DOI: 10.1529/biophysj.107.116392] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2007] [Accepted: 11/29/2007] [Indexed: 01/27/2023] Open
Abstract
An ab initio model for gene prediction in prokaryotic genomes is proposed based on physicochemical characteristics of codons calculated from molecular dynamics (MD) simulations. The model requires a specification of three calculated quantities for each codon: the double-helical trinucleotide base pairing energy, the base pair stacking energy, and an index of the propensity of a codon for protein-nucleic acid interactions. The base pairing and stacking energies for each codon are obtained from recently reported MD simulations on all unique tetranucleotide steps, and the third parameter is assigned based on the conjugate rule previously proposed to account for the wobble hypothesis with respect to degeneracies in the genetic code. The third interaction propensity parameter values correlate well with ab initio MD calculated solvation energies and flexibility of codon sequences as well as codon usage in genes and amino acid composition frequencies in approximately 175,000 protein sequences in the Swissprot database. Assignment of these three parameters for each codon enables the calculation of the magnitude and orientation of a cumulative three-dimensional vector for a DNA sequence of any length in each of the six genomic reading frames. Analysis of 372 genomes comprising approximately 350,000 genes shows that the orientations of the gene and nongene vectors are well differentiated and make a clear distinction feasible between genic and nongenic sequences at a level equivalent to or better than currently available knowledge-based models trained on the basis of empirical data, presenting a strong support for the possibility of a unique and useful physicochemical characterization of DNA sequences from codons to genomes.
Collapse
Affiliation(s)
- Poonam Singhal
- Department of Chemistry and Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology, Hauz Khas, New Delhi 110016, India
| | | | | | | |
Collapse
|
45
|
Thakur A, Alam MT, Bora H, Kaur P, Sharma YD. Plasmodium vivax: sequence polymorphism and effect of natural selection at apical membrane antigen 1 (PvAMA1) among Indian population. Gene 2008; 419:35-42. [PMID: 18547744 DOI: 10.1016/j.gene.2008.04.012] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2007] [Revised: 04/10/2008] [Accepted: 04/18/2008] [Indexed: 11/17/2022]
Abstract
Present study describes the characterization of apical membrane antigen 1 (PvAMA1) polymorphisms among Indian Plasmodium vivax isolates. The partial PvAMA1 gene (covering domain I and domain II regions) sequenced from sixty-one (n=61) isolates in this study resulted into 49 haplotypes. Comparison with the previously available PvAMA1 sequences in the GenBank database revealed that 45 of these were new haplotypes that have never been reported till date. For further analyses, we also included 11 previously reported PvAMA1 sequences from India available in the database. Thus genetic diversity and effect of natural selection were analyzed both at domain I and domain II of this promising malaria vaccine candidate among 72 Indian P. vivax isolates. Non-synonymous mutations were found at 25 codons (16 at domain I and 9 at domain II) where 17 codons were dimorphic while rest of them (8 codons) were trimorphic. Thus codon polymorphisms were observed to be more at domain I as compared to domain II. Although the difference between the rate of non-synonymous (dN) and synonymous (dS) mutations was positive (dN-dS, 0.002+/-0.004SE) at domain II, it was not significantly different from each other (P=0.272), indicating tendency of stronger diversifying selection at this domain. The dN-dS difference for domain I (-0.006+/-0.009SE, P=0.268) and for entire 900 bp region (-0.002+/-0.005E, P=0.320) being negative and statistically insignificant suggests the role of both positive as well as purifying selection. Three-dimensional distributions of all polymorphic residues were mapped on a modeled PvAMA1 structure. Results suggested that almost all of the observed polymorphisms were located at one surface of the antigen. In conclusion, PvAMA1 antigen displays high diversity among Indian isolates with more diversifying selection at domain II. The result has significant value in malaria vaccine development using this antigen.
Collapse
Affiliation(s)
- Ankur Thakur
- Department of Biotechnology, All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India
| | | | | | | | | |
Collapse
|
46
|
Yom Din S, Hurvitz A, Goldberg D, Jackson K, Levavi-Sivan B, Degani G. Cloning of Russian sturgeon (Acipenser gueldenstaedtii) growth hormone and insulin-like growth factor I and their expression in male and female fish during the first period of growth. J Endocrinol Invest 2008; 31:201-10. [PMID: 18401201 DOI: 10.1007/bf03345591] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
In this study, the GH and IGF-I of the Russian sturgeon (rs), Acipenser gueldenstaedtii, were cloned and sequenced, and their mRNA gene expression determined. In addition, to improve our understanding of the GH function, the expression of this hormone was assessed in young males and females. Moreover, IGF-I expression was quantified in young males and compared to that in older ones. The nucleotide sequence of the rsGH cDNA was 980 bp long and had an open reading frame of 642 bp, beginning with the first ATG codon at position 39 and ending with the stop codon at position 683. A putative polyadenylation signal, AATAAA, was recognized 42 bp upstream of the poly (A) tail. The position of the signal- peptide cleavage site was predicted to be at position 111, yielding a signal peptide of 24 amino-acids (aa) and a mature peptide of 190 aa. When the rsGH aa sequence was compared with other species, the highest degree of identity was found to be with mammalians (66-70% identity), followed by anguilliformes and amphibia (61%) and other fish (39-47%). The level of rsGH mRNA was discovered to be similar in pituitaries of females and males of 5 age groups (1, 2, 3, 4, and 5- yr-old). In females and males, the levels did not change dramatically during the first 5 yr of growth. The partial nucleotide sequence of the rsIGF-I was 445 bp long and had an open reading frame of 396 bp, beginning with the ATG codon at position 50. The position of the signal-peptide cleavage site was predicted to be at position 187, yielding a signal peptide of 44 aa. The highest level of IGF-I mRNA expression was recorded in the kidney of adult sturgeons. The IGF-I mRNA expression levels in the intestine, pituitary gland, and liver were not significantly different. Low levels of expression were found in the brain, heart, and muscle. In most tissues, there was no significant difference between mRNA levels of one and 5-yr-old fish. In conclusion, based on the GH-sequence analysis, A. gueldenstaedtii is genetically distant from other teleosts. The expression of the GH mRNA was similar in males and females, and its level remained constant during the first 5 yr of growth. While the IGF-I mRNA expression differed amongst various tissues, the level in each tissue was similar in 1 and 5-yr-old fish.
Collapse
Affiliation(s)
- S Yom Din
- Department of Animal Sciences, Faculty of Agricultural, Food and Environmental Quality Sciences, The Hebrew University of Jerusalem, Rehovot, Israel
| | | | | | | | | | | |
Collapse
|
47
|
Khaĭrulina IS, Molotkov MV, Bulygin KN, Graĭfer DM, Ven'iaminova AG, Karpova GG. [C-terminal fragment of ribosomal protein S15 is located at the decoding site of the human ribosome]. Mol Biol (Mosk) 2008; 42:306-313. [PMID: 18610839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Protein S15 is a characteristic component of the mammalian 80S ribosome that neighbors mRNA codon at the decoding site and the downstream triplets. In this study we determined S15 protein fragments located close to mRNA positions +4 to +12 with respect to the first nucleotide of the P site codon on the human ribosome. For cross-linking to ribosomal protein S15, a set of mRNA was used that contained triplet UUU/UUC at the 5'-termini and a perfluorophenyl azide-modified uridine in position 3' of this triplet. The locations of mRNA analogues on the ribosome were governed by tRNAPhe cognate to the UUU/UUC triplet targeted to the P site. Cross-linked S15 protein was isolated from the irradiated with mild UV light complexes of 80S ribosomes with tRNAPhe and mRNA analogues with subsequent cleavage with CNBr that splits polypeptide chain after methionines. Analysis of modified oligopeptides resulted from the cleavage revealed that in all cases cross-linking site was located in C-terminal fragment 111-145 of protein S15 indicating that this fragment is involved in formation of decoding site of the eukaryotic ribosome.
Collapse
|
48
|
Molotkov MV, Graĭfer DM, Popugaeva EA, Bulygin KN, Meshchaninova MI, Ven'iaminova AG, Karpova GG. [Protein S3 in the human 80S ribosome adjoins mRNA from 3'-side of the A-site codon]. Bioorg Khim 2007; 33:431-41. [PMID: 17886434 DOI: 10.1134/s106816200704005x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The protein environment of mRNA 3' of the A-site codon (the decoding site) in the human 80S ribosome was studied using a set of oligoribonucleotide derivatives bearing a UUU triplet at the 5'-end and a perfluoroarylazide group at one of the nucleotide residues at the 3'-end of this triplet. Analogues of mRNA were phased into the ribosome using binding at the tRNAPhe P-site, which recognizes the UUU codon. Mild UV irradiation of ribosome complexes with tRNAPhe and mRNA analogues resulted in the predominant crosslinking of the analogues with the 40S subunit components, mainly with proteins and, to a lesser extent, with rRNA. Among the 40S subunit ribosomal proteins, the S3 protein was the main target for modification in all cases. In addition, minor crosslinking with the S2 protein was observed. The crosslinking with the S3 and S2 proteins occurred both in triple complexes and in the absence of tRNA. Within triple complexes, crosslinking with S15 protein was also found, its efficiency considerably falling when the modified nucleotide was moved from positions +5 to +12 relative to the first codon nucleotide in the P-site. In some cases, crosslinking with the S30 protein was observed, it was most efficient for the derivative containing a photoreactive group at the +7 adenosine residue. The results indicate that the S3 protein in the human ribosome plays a key role in the formation of the mRNA binding site 3' of the codon in the decoding site.
Collapse
|
49
|
Zamora-Romo E, Cruz-Vera LR, Vivanco-Domínguez S, Magos-Castro MA, Guarneros G. Efficient expression of gene variants that harbour AGA codons next to the initiation codon. Nucleic Acids Res 2007; 35:5966-74. [PMID: 17726048 PMCID: PMC2034473 DOI: 10.1093/nar/gkm643] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
In an effort to improve the knowledge about the rules which direct the effect of the early ORF sequences on translation efficiency, we have analyzed the effect of pairs of the six arginine codons at the second and third positions on the expression of lacZ variants. Whereas the pairs of identical AGA or AGG codons were favorable for the gene expression, identical pairs of each of the four CGN codons were very inefficient. This result was unexpected because tandems of AGA or AGG codons located in more internal gene positions provoke deficient expression whilst internally located CGU and CGC are the most abundant and efficiently translated arginine codons. The mixed combinations of AGA and each of the CGN codons usually resulted in efficient rates of lacZ expression independently of the peptidyl-tRNA propensity to dissociate from the ribosome. Thus, the variant harboring the pair of AGA codons was expressed as efficiently as the variant carrying a pair of AAA codons in the same positions, a configuration reported as one of the most common and efficient for gene expression. We explain these results assuming that the presence of adenines in these early positions enhance gene expression. As expected, specific mRNA levels correlated with the intensity of lacZ expression for each variant. However, the induction of lacZ AGA AGA gene in pth cells accumulated peptidyl-tRNAArg4 as well as a short 5′-proximal lacZ mRNA fragment suggesting ribosome stalling due to depletion of aminoacylated-tRNAArg4.
Collapse
Affiliation(s)
- Efraín Zamora-Romo
- Departamento de Genética y Biología Molecular, Centro de Investigación y de Estudios Avanzados del IPN, Apartado Postal 14-740, 07000 México D. F. and Department of Biological Sciences, Stanford University, Stanford, CA 94305-5020, USA
| | - Luis Rogelio Cruz-Vera
- Departamento de Genética y Biología Molecular, Centro de Investigación y de Estudios Avanzados del IPN, Apartado Postal 14-740, 07000 México D. F. and Department of Biological Sciences, Stanford University, Stanford, CA 94305-5020, USA
| | - Serafín Vivanco-Domínguez
- Departamento de Genética y Biología Molecular, Centro de Investigación y de Estudios Avanzados del IPN, Apartado Postal 14-740, 07000 México D. F. and Department of Biological Sciences, Stanford University, Stanford, CA 94305-5020, USA
| | - Marco Antonio Magos-Castro
- Departamento de Genética y Biología Molecular, Centro de Investigación y de Estudios Avanzados del IPN, Apartado Postal 14-740, 07000 México D. F. and Department of Biological Sciences, Stanford University, Stanford, CA 94305-5020, USA
| | - Gabriel Guarneros
- Departamento de Genética y Biología Molecular, Centro de Investigación y de Estudios Avanzados del IPN, Apartado Postal 14-740, 07000 México D. F. and Department of Biological Sciences, Stanford University, Stanford, CA 94305-5020, USA
- *To whom correspondence should be addressed. +52 55 5061 3338+52 55 5061 3392
| |
Collapse
|
50
|
Zalucki YM, Power PM, Jennings MP. Selection for efficient translation initiation biases codon usage at second amino acid position in secretory proteins. Nucleic Acids Res 2007; 35:5748-54. [PMID: 17717002 PMCID: PMC2034453 DOI: 10.1093/nar/gkm577] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The definition of a typical sec-dependent bacterial signal peptide contains a positive charge at the N-terminus, thought to be required for membrane association. In this study the amino acid distribution of all Escherichia coli secretory proteins were analysed. This revealed that there was a statistically significant bias for lysine at the second codon position (P2), consistent with a role for the positive charge in secretion. Removal of the positively charged residue P2 in two different model systems revealed that a positive charge is not required for protein export. A well-characterized feature of large amino acids like lysine at P2 is inhibition of N-terminal methionine removal by methionyl amino-peptidase (MAP). Substitution of lysine at P2 for other large or small amino acids did not affect protein export. Analysis of codon usage revealed that there was a bias for the AAA lysine codon at P2, suggesting that a non-coding function for the AAA codon may be responsible for the strong bias for lysine at P2 of secretory signal sequences. We conclude that the selection for high translation initiation efficiency maybe the selective pressure that has led to codon and consequent amino acid usage at P2 of secretory proteins.
Collapse
Affiliation(s)
- Yaramah M. Zalucki
- School of Molecular & Microbial Sciences, University of Queensland, St Lucia QLD 4072 and Molecular Infectious Diseases Group, Department of Paediatrics, University of Oxford, Oxford, United Kingdom OX3 9DU
| | - Peter M. Power
- School of Molecular & Microbial Sciences, University of Queensland, St Lucia QLD 4072 and Molecular Infectious Diseases Group, Department of Paediatrics, University of Oxford, Oxford, United Kingdom OX3 9DU
| | - Michael P. Jennings
- School of Molecular & Microbial Sciences, University of Queensland, St Lucia QLD 4072 and Molecular Infectious Diseases Group, Department of Paediatrics, University of Oxford, Oxford, United Kingdom OX3 9DU
- *To whom correspondence should be addressed. 61 7 3365487961 7 33654620
| |
Collapse
|