1
|
Barat A, Powner MW. Spontaneous Peptide Ligation Mediated by Cysteamine. JACS AU 2024; 4:1752-1757. [PMID: 38818061 PMCID: PMC11134366 DOI: 10.1021/jacsau.4c00243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 04/23/2024] [Accepted: 04/24/2024] [Indexed: 06/01/2024]
Abstract
The fundamental and universal nature of life's exploitation of peptides suggests they must have played a vital role during the onset of life, but their spontaneous chemoselective synthesis in water remains unknown. Aminonitriles (1) are widely accepted as prebiotic precursors of both amino acids and peptides, but they do not spontaneously polymerize in water to yield peptides. Here, we demonstrate that the simple prebiotically plausible aminothiol, cysteamine (5), participates in Strecker chemistry to furnish β-mercaptoethyl-α-aminonitriles (8) and β-mercaptoethyl-amino acids (16), which are predisposed to spontaneously form peptides in water. Intramolecular thiol catalyzed ligation is faster, higher-yielding, and more α-selective than previously reported prebiotic peptide ligation chemistries, enabling, for example, the highly regioselective α-ligation of Asp- and Glu-dinitriles in quantitative yields. Our findings suggest that cysteamine (5), the thiol bearing moiety of the universal thiol cofactor coenzyme A, may have played an important role in the selective chemical synthesis of prebiotic α-peptides.
Collapse
Affiliation(s)
- Abid Barat
- Department
of Chemistry, University College London, 20 Gordon Street, London WC1H 0AJ, United Kingdom
| | - Matthew W. Powner
- Department
of Chemistry, University College London, 20 Gordon Street, London WC1H 0AJ, United Kingdom
| |
Collapse
|
2
|
Thoma B, Powner MW. Selective Synthesis of Lysine Peptides and the Prebiotically Plausible Synthesis of Catalytically Active Diaminopropionic Acid Peptide Nitriles in Water. J Am Chem Soc 2023; 145:3121-3130. [PMID: 36700882 PMCID: PMC9912261 DOI: 10.1021/jacs.2c12497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Why life encodes specific proteinogenic amino acids remains an unsolved problem, but a non-enzymatic synthesis that recapitulates biology's universal strategy of stepwise N-to-C terminal peptide growth may hold the key to this selection. Lysine is an important proteinogenic amino acid that, despite its essential structural, catalytic, and functional roles in biochemistry, has widely been assumed to be a late addition to the genetic code. Here, we demonstrate that lysine thioacids undergo coupling with aminonitriles in neutral water to afford peptides in near-quantitative yield, whereas non-proteinogenic lysine homologues, ornithine, and diaminobutyric acid cannot form peptides due to rapid and quantitative cyclization that irreversibly blocks peptide synthesis. We demonstrate for the first time that ornithine lactamization provides an absolute differentiation of lysine and ornithine during (non-enzymatic) N-to-C-terminal peptide ligation. We additionally demonstrate that the shortest lysine homologue, diaminopropionic acid, undergoes effective peptide ligation. This prompted us to discover a high-yielding prebiotically plausible synthesis of the diaminopropionic acid residue, by peptide nitrile modification, through the addition of ammonia to a dehydroalanine nitrile. With this synthesis in hand, we then discovered that the low basicity of diaminopropionyl residues promotes effective, biomimetic, imine catalysis in neutral water. Our results suggest diaminopropionic acid, synthesized by peptide nitrile modification, can replace or augment lysine residues during early evolution but that lysine's electronically isolated sidechain amine likely provides an evolutionary advantage for coupling and coding as a preformed monomer in monomer-by-monomer peptide translation.
Collapse
|
3
|
Wang X, Dong Q, Chen G, Zhang J, Liu Y, Cai Y. Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance. BMC Genomics 2022; 23:416. [PMID: 35655139 PMCID: PMC9164415 DOI: 10.1186/s12864-022-08435-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 03/02/2022] [Indexed: 11/10/2022] Open
Abstract
Frameshift mutations have been considered of significant importance for the molecular evolution of proteins and their coding genes, while frameshift protein sequences encoded in the alternative reading frames of coding genes have been considered to be meaningless. However, functional frameshifts have been found widely existing. It was puzzling how a frameshift protein kept its structure and functionality while substantial changes occurred in its primary amino-acid sequence. This study shows that the similarities among frameshifts and wild types are higher than random similarities and are determined at different levels. Frameshift substitutions are more conservative than random substitutions in the standard genetic code (SGC). The frameshift substitutions score of SGC ranks in the top 2.0-3.5% of alternative genetic codes, showing that SGC is nearly optimal for frameshift tolerance. In many genes and certain genomes, frameshift-resistant codons and codon pairs appear more frequently than expected, suggesting that frameshift tolerance is achieved through not only the optimality of the genetic code but, more importantly, the further optimization of a specific gene or genome through the usages of codons/codon pairs, which sheds light on the role of frameshift mutations in molecular and genomic evolution.
Collapse
|
4
|
Factors in Protobiomonomer Selection for the Origin of the Standard Genetic Code. Acta Biotheor 2021; 69:745-767. [PMID: 34283307 DOI: 10.1007/s10441-021-09420-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 07/01/2021] [Indexed: 10/20/2022]
Abstract
Natural selection of specific protobiomonomers during abiogenic development of the prototype genetic code is hindered by the diversity of structural, spatial, and rotational isomers that have identical elemental composition and molecular mass (M), but can vary significantly in their physicochemical characteristics, such as the melting temperature Tm, the Tm:M ratio, and the solubility in water, due to different positions of atoms in the molecule. These parameters differ between cis- and trans-isomers of dicarboxylic acids, spatial monosaccharide isomers, and structural isomers of α-, β-, and γ-amino acids. The stable planar heterocyclic molecules of the major nucleobases comprise four (C, H, N, O) or three (C, H, N) elements and contain a single -C=C bond and two nitrogen atoms in each heterocycle involved in C-N and C=N bonds. They exist as isomeric resonance hybrids of single and double bonds and as a mixture of tautomer forms due to the presence of -C=O and/or -NH2 side groups. They are thermostable, insoluble in water, and exhibit solid-state stability, which is of central importance for DNA molecules as carriers of genetic information. In M-Tm diagrams, proteinogenic amino acids and the corresponding codons are distributed fairly regularly relative to the distinct clusters of purine and pyrimidine bases, reflecting the correspondence between codons and amino acids that was established in different periods of genetic code development. The body of data on the evolution of the genetic code system indicates that the elemental composition and molecular structure of protobiomonomers, and their M, Tm, photostability, and aqueous solubility determined their selection in the emergence of the standard genetic code.
Collapse
|
5
|
Gonzalez DL, Giannerini S, Rosa R. On the origin of degeneracy in the genetic code. Interface Focus 2019; 9:20190038. [PMID: 31641429 PMCID: PMC6802134 DOI: 10.1098/rsfs.2019.0038] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Accepted: 09/05/2019] [Indexed: 01/11/2023] Open
Abstract
The degeneracy of amino acid coding is one of the most crucial and enigmatic aspects of the genetic code. Different theories about the origin of the genetic code have been developed. However, to date, there is no comprehensive hypothesis on the mechanism that might have generated the degeneracy as we observe it. Here, we provide a new theory that explains the origin of the degeneracy based only on symmetry principles. The approach allows one to describe exactly the degeneracy of the early code (progenitor of the genetic code of LUCA, the last universal common ancestor) which is hypothesized to have the same degeneracy as the present vertebrate mitochondrial genetic code. The theory is based upon the tessera code, that fits as the progenitor of the early code. Moreover, we describe in detail the possible evolutionary transitions implied by our theory. The approach is supported by a unified mathematical framework that accounts for the degeneracy properties of both nuclear and mitochondrial genetic codes. Our work provides a new perspective to the understanding of the origin of the genetic code and the roles of symmetry principles in the organization of genetic information.
Collapse
Affiliation(s)
- D L Gonzalez
- CNR-IMM, UOS di Bologna, Via Gobetti 101, 40129 Bologna, Italy.,Dipartimento di Scienze Statistiche, Università di Bologna, via delle Belle Arti 41, 40126 Bologna, Italy
| | - S Giannerini
- Dipartimento di Scienze Statistiche, Università di Bologna, via delle Belle Arti 41, 40126 Bologna, Italy
| | - R Rosa
- CNR-IMM, UOS di Bologna, Via Gobetti 101, 40129 Bologna, Italy
| |
Collapse
|
6
|
Santos J, Monteagudo Á. Inclusion of the fitness sharing technique in an evolutionary algorithm to analyze the fitness landscape of the genetic code adaptability. BMC Bioinformatics 2017; 18:195. [PMID: 28347270 PMCID: PMC5369190 DOI: 10.1186/s12859-017-1608-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 03/16/2017] [Indexed: 11/26/2022] Open
Abstract
Background The canonical code, although prevailing in complex genomes, is not universal. It was shown the canonical genetic code superior robustness compared to random codes, but it is not clearly determined how it evolved towards its current form. The error minimization theory considers the minimization of point mutation adverse effect as the main selection factor in the evolution of the code. We have used simulated evolution in a computer to search for optimized codes, which helps to obtain information about the optimization level of the canonical code in its evolution. A genetic algorithm searches for efficient codes in a fitness landscape that corresponds with the adaptability of possible hypothetical genetic codes. The lower the effects of errors or mutations in the codon bases of a hypothetical code, the more efficient or optimal is that code. The inclusion of the fitness sharing technique in the evolutionary algorithm allows the extent to which the canonical genetic code is in an area corresponding to a deep local minimum to be easily determined, even in the high dimensional spaces considered. Results The analyses show that the canonical code is not in a deep local minimum and that the fitness landscape is not a multimodal fitness landscape with deep and separated peaks. Moreover, the canonical code is clearly far away from the areas of higher fitness in the landscape. Conclusions Given the non-presence of deep local minima in the landscape, although the code could evolve and different forces could shape its structure, the fitness landscape nature considered in the error minimization theory does not explain why the canonical code ended its evolution in a location which is not an area of a localized deep minimum of the huge fitness landscape.
Collapse
Affiliation(s)
- José Santos
- Department of Computer Science, University of A Coruña, Campus de Elviña s/n, A Coruña, 15071, Spain.
| | - Ángel Monteagudo
- Department of Computer Science, University of A Coruña, Campus de Elviña s/n, A Coruña, 15071, Spain
| |
Collapse
|
7
|
Prebiotic selection and assembly of proteinogenic amino acids and natural nucleotides from complex mixtures. Nat Chem 2017. [DOI: 10.1038/nchem.2703] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
|
8
|
Simões J, Bezerra AR, Moura GR, Araújo H, Gut I, Bayes M, Santos MAS. The Fungus Candida albicans Tolerates Ambiguity at Multiple Codons. Front Microbiol 2016; 7:401. [PMID: 27065968 PMCID: PMC4814463 DOI: 10.3389/fmicb.2016.00401] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 03/14/2016] [Indexed: 12/31/2022] Open
Abstract
The ascomycete Candida albicans is a normal resident of the gastrointestinal tract of humans and other warm-blooded animals. It occurs in a broad range of body sites and has high capacity to survive and proliferate in adverse environments with drastic changes in oxygen, carbon dioxide, pH, osmolarity, nutrients, and temperature. Its biology is unique due to flexible reassignment of the leucine CUG codon to serine and synthesis of statistical proteins. Under standard growth conditions, CUG sites incorporate leucine (3% of the times) and serine (97% of the times) on a proteome wide scale, but leucine incorporation fluctuates in response to environmental stressors and can be artificially increased up to 98%. In order to determine whether such flexibility also exists at other codons, we have constructed several serine tRNAs that decode various non-cognate codons. Expression of these tRNAs had minor effects on fitness, but growth of the mistranslating strains at different temperatures, in medium with different pH and nutrients composition was often enhanced relatively to the wild type (WT) strain, supporting our previous data on adaptive roles of CUG ambiguity in variable growth conditions. Parallel evolution of the recombinant strains (100 generations) followed by full genome resequencing identified various strain specific single nucleotide polymorphisms (SNP) and one SNP in the deneddylase (JAB1) gene in all strains. Since JAB1 is a subunit of the COP9 signalosome complex, which interacts with cullin (Cdc53p) to mediate degradation of a variety of cellular proteins, our data suggest that neddylation plays a key role in tolerance and adaptation to codon ambiguity in C. albicans.
Collapse
Affiliation(s)
- João Simões
- Health Sciences Program, Department of Medical Sciences, Institute of Biomedicine - iBiMED, University of Aveiro Aveiro, Portugal
| | - Ana R Bezerra
- Health Sciences Program, Department of Medical Sciences, Institute of Biomedicine - iBiMED, University of Aveiro Aveiro, Portugal
| | - Gabriela R Moura
- Health Sciences Program, Department of Medical Sciences, Institute of Biomedicine - iBiMED, University of Aveiro Aveiro, Portugal
| | - Hugo Araújo
- Health Sciences Program, Department of Medical Sciences, Institute of Biomedicine - iBiMED, University of Aveiro Aveiro, Portugal
| | - Ivo Gut
- Centro Nacional de Análises Genómico, Parc Científic Barcelona, Spain
| | - Mónica Bayes
- Centro Nacional de Análises Genómico, Parc Científic Barcelona, Spain
| | - Manuel A S Santos
- Health Sciences Program, Department of Medical Sciences, Institute of Biomedicine - iBiMED, University of Aveiro Aveiro, Portugal
| |
Collapse
|
9
|
Gardini S, Cheli S, Baroni S, Di Lascio G, Mangiavacchi G, Micheletti N, Monaco CL, Savini L, Alocci D, Mangani S, Niccolai N. On Nature's Strategy for Assigning Genetic Code Multiplicity. PLoS One 2016; 11:e0148174. [PMID: 26849571 PMCID: PMC4746209 DOI: 10.1371/journal.pone.0148174] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2015] [Accepted: 01/13/2016] [Indexed: 11/26/2022] Open
Abstract
Genetic code redundancy would yield, on the average, the assignment of three codons for each of the natural amino acids. The fact that this number is observed only for incorporating Ile and to stop RNA translation still waits for an overall explanation. Through a Structural Bioinformatics approach, the wealth of information stored in the Protein Data Bank has been used here to look for unambiguous clues to decipher the rationale of standard genetic code (SGC) in assigning from one to six different codons for amino acid translation. Leu and Arg, both protected from translational errors by six codons, offer the clearest clue by appearing as the most abundant amino acids in protein-protein and protein-nucleic acid interfaces. Other SGC hidden messages have been sought by analyzing, in a protein structure framework, the roles of over- and under-protected amino acids.
Collapse
Affiliation(s)
- Simone Gardini
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Sara Cheli
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Silvia Baroni
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Gabriele Di Lascio
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Guido Mangiavacchi
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Nicholas Micheletti
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Carmen Luigia Monaco
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Lorenzo Savini
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Davide Alocci
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Stefano Mangani
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
| | - Neri Niccolai
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Siena, Italy
- * E-mail:
| |
Collapse
|
10
|
Cheng B, Cui S. Supramolecular Chemistry and Mechanochemistry of Macromolecules: Recent Advances by Single-Molecule Force Spectroscopy. Top Curr Chem (Cham) 2015; 369:97-134. [DOI: 10.1007/128_2015_628] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
11
|
Lenstra R. Evolution of the genetic code through progressive symmetry breaking. J Theor Biol 2014; 347:95-108. [DOI: 10.1016/j.jtbi.2014.01.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2012] [Revised: 12/18/2013] [Accepted: 01/01/2014] [Indexed: 01/18/2023]
|
12
|
Rosandić M, Paar V, Glunčić M. Fundamental role of start/stop regulators in whole DNA and new trinucleotide classification. Gene 2013; 531:184-90. [PMID: 24042127 DOI: 10.1016/j.gene.2013.09.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2013] [Revised: 08/31/2013] [Accepted: 09/05/2013] [Indexed: 10/26/2022]
Abstract
The origin and logic of genetic code are two of greatest mysteries of life sciences. Analyzing DNA sequences we showed that the start/stop trinucleotides have broader importance than just marking start and stop of exons in coding DNA. On this basis, here we introduced new classification of trinucleotides and showed that all A+T rich trinucleotides consisting of three different nucleotides arise from start-ATG, stop-TGA and stop-TAG using their complement, reverse complement and reverse transformations. Due to the same transformations during generations of crossing-over they can switch from one form to the other. By direct process the start-ATG and stop-TAG can irreversibly transform into stop-TAA. By transformation into A+T rich trinucleotides and 16/32 C+G rich they can lose the start/stop function and take the role of a sense codon in reversible way. The remaining 16 C+G trinucleotides cannot directly transform into start/stop trinucleotides and thus remain a firm skeleton for structuring the C+G rich DNA. We showed that start/stops strongly enrich the A+T rich noncoding DNA through frequently extended forms. From the evolutionary viewpoint the start/stops are chief creators of prevailing A+T rich noncoding DNA, and of more stable coding DNA. We propose that start/stops have basic role as "seeds" in trinucleotide evolution of noncoding and coding sequences and lead to asymmetry between A+T and C+G rich DNA. By dynamical transformations during evolution they enabled pronounced phylogenetic broadness, keeping the regulator function.
Collapse
Affiliation(s)
- Marija Rosandić
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | | | | |
Collapse
|
13
|
Conard SE, Buckley J, Dang M, Bedwell GJ, Carter RL, Khass M, Bedwell DM. Identification of eRF1 residues that play critical and complementary roles in stop codon recognition. RNA (NEW YORK, N.Y.) 2012; 18:1210-21. [PMID: 22543865 PMCID: PMC3358643 DOI: 10.1261/rna.031997.111] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 03/15/2012] [Indexed: 05/31/2023]
Abstract
The initiation and elongation stages of translation are directed by codon-anticodon interactions. In contrast, a release factor protein mediates stop codon recognition prior to polypeptide chain release. Previous studies have identified specific regions of eukaryotic release factor one (eRF1) that are important for decoding each stop codon. The cavity model for eukaryotic stop codon recognition suggests that three binding pockets/cavities located on the surface of eRF1's domain one are key elements in stop codon recognition. Thus, the model predicts that amino acid changes in or near these cavities should influence termination in a stop codon-dependent manner. Previous studies have suggested that the TASNIKS and YCF motifs within eRF1 domain one play important roles in stop codon recognition. These motifs are highly conserved in standard code organisms that use UAA, UAG, and UGA as stop codons, but are more divergent in variant code organisms that have reassigned a subset of stop codons to sense codons. In the current study, we separately introduced TASNIKS and YCF motifs from six variant code organisms into eRF1 of Saccharomyces cerevisiae to determine their effect on stop codon recognition in vivo. We also examined the consequences of additional changes at residues located between the TASNIKS and YCF motifs. Overall, our results indicate that changes near cavities two and three frequently mediated significant effects on stop codon selectivity. In particular, changes in the YCF motif, rather than the TASNIKS motif, correlated most consistently with variant code stop codon selectivity.
Collapse
Affiliation(s)
- Sara E. Conard
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Jessica Buckley
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Mai Dang
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Gregory J. Bedwell
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Richard L. Carter
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Mohamed Khass
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - David M. Bedwell
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| |
Collapse
|
14
|
Nikolajewa S, Friedel M, Beyer A, Wilhelm T. THE NEW CLASSIFICATION SCHEME OF THE GENETIC CODE, ITS EARLY EVOLUTION, AND tRNA USAGE. J Bioinform Comput Biol 2011; 4:609-20. [PMID: 16819806 DOI: 10.1142/s0219720006001825] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2005] [Revised: 12/09/2005] [Accepted: 12/23/2005] [Indexed: 11/18/2022]
Abstract
We present a new classification scheme of the genetic code. In contrast to the standard form it clearly shows five codon symmetries: codon-anticodon, codon-reverse codon, and sense-antisense symmetry, as well as symmetries with respect to purine-pyrimidine (A versus G, U versus C) and keto-aminobase (G versus U, A versus C) exchanges. We study the number of tRNA genes of 16 archaea, 81 bacteria and 7 eucaryotes to analyze whether these symmetries are reflected in the corresponding tRNA usage patterns. Two features are especially striking: reverse stop codons do not have their own tRNAs (just one exception in human), and A** anticodons are significantly suppressed. Our classification scheme of the genetic code and the identified tRNA usage patterns support recent speculations about the early evolution of the genetic code. In particular, pre-tRNAs might have had the ability to bind their codons in two directions to the corresponding codons.
Collapse
Affiliation(s)
- Swetlana Nikolajewa
- Theoretical Systems Biology, Institute of Molecular Biotechnology Beutenbergstr, 11, Jena, D-07745, Germany
| | | | | | | |
Collapse
|
15
|
Zhang Z, Yu J. On the organizational dynamics of the genetic code. GENOMICS PROTEOMICS & BIOINFORMATICS 2011; 9:21-9. [PMID: 21641559 PMCID: PMC5054158 DOI: 10.1016/s1672-0229(11)60004-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2010] [Accepted: 10/26/2010] [Indexed: 11/23/2022]
Abstract
The organization of the canonical genetic code needs to be thoroughly illuminated. Here we reorder the four nucleotides—adenine, thymine, guanine and cytosine—according to their emergence in evolution, and apply the organizational rules to devising an algebraic representation for the canonical genetic code. Under a framework of the devised code, we quantify codon and amino acid usages from a large collection of 917 prokaryotic genome sequences, and associate the usages with its intrinsic structure and classification schemes as well as amino acid physicochemical properties. Our results show that the algebraic representation of the code is structurally equivalent to a content-centric organization of the code and that codon and amino acid usages under different classification schemes were correlated closely with GC content, implying a set of rules governing composition dynamics across a wide variety of prokaryotic genome sequences. These results also indicate that codons and amino acids are not randomly allocated in the code, where the six-fold degenerate codons and their amino acids have important balancing roles for error minimization. Therefore, the content-centric code is of great usefulness in deciphering its hitherto unknown regularities as well as the dynamics of nucleotide, codon, and amino acid compositions.
Collapse
Affiliation(s)
- Zhang Zhang
- Plant Stress Genomics Research Center, Division of Chemical and Life Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | | |
Collapse
|
16
|
Lukashenko NP. Molecular evolution of ciliates (Ciliophora) and some related groups of protozoans. RUSS J GENET+ 2009. [DOI: 10.1134/s1022795409080018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
17
|
RNA relics and origin of life. Int J Mol Sci 2009; 10:3420-3441. [PMID: 20111682 PMCID: PMC2812825 DOI: 10.3390/ijms10083420] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2009] [Revised: 07/11/2009] [Accepted: 07/28/2009] [Indexed: 11/18/2022] Open
Abstract
A number of small RNA sequences, located in different non-coding sequences and highly preserved across the tree of life, have been suggested to be molecular fossils, of ancient (and possibly primordial) origin. On the other hand, recent years have revealed the existence of ubiquitous roles for small RNA sequences in modern organisms, in functions ranging from cell regulation to antiviral activity. We propose that a single thread can be followed from the beginning of life in RNA structures selected only for stability reasons through the RNA relics and up to the current coevolution of RNA sequences; such an understanding would shed light both on the history and on the present development of the RNA machinery and interactions. After presenting the evidence (by comparing their sequences) that points toward a common thread, we discuss a scenario of genome coevolution (with emphasis on viral infectious processes) and finally propose a plan for the reevaluation of the stereochemical theory of the genetic code; we claim that it may still be relevant, and not only for understanding the origin of life, but also for a comprehensive picture of regulation in present-day cells.
Collapse
|
18
|
Baranov PV, Venin M, Provan G. Codon size reduction as the origin of the triplet genetic code. PLoS One 2009; 4:e5708. [PMID: 19479032 PMCID: PMC2682656 DOI: 10.1371/journal.pone.0005708] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2009] [Accepted: 04/22/2009] [Indexed: 11/26/2022] Open
Abstract
The genetic code appears to be optimized in its robustness to missense errors and frameshift errors. In addition, the genetic code is near-optimal in terms of its ability to carry information in addition to the sequences of encoded proteins. As evolution has no foresight, optimality of the modern genetic code suggests that it evolved from less optimal code variants. The length of codons in the genetic code is also optimal, as three is the minimal nucleotide combination that can encode the twenty standard amino acids. The apparent impossibility of transitions between codon sizes in a discontinuous manner during evolution has resulted in an unbending view that the genetic code was always triplet. Yet, recent experimental evidence on quadruplet decoding, as well as the discovery of organisms with ambiguous and dual decoding, suggest that the possibility of the evolution of triplet decoding from living systems with non-triplet decoding merits reconsideration and further exploration. To explore this possibility we designed a mathematical model of the evolution of primitive digital coding systems which can decode nucleotide sequences into protein sequences. These coding systems can evolve their nucleotide sequences via genetic events of Darwinian evolution, such as point-mutations. The replication rates of such coding systems depend on the accuracy of the generated protein sequences. Computer simulations based on our model show that decoding systems with codons of length greater than three spontaneously evolve into predominantly triplet decoding systems. Our findings suggest a plausible scenario for the evolution of the triplet genetic code in a continuous manner. This scenario suggests an explanation of how protein synthesis could be accomplished by means of long RNA-RNA interactions prior to the emergence of the complex decoding machinery, such as the ribosome, that is required for stabilization and discrimination of otherwise weak triplet codon-anticodon interactions.
Collapse
Affiliation(s)
- Pavel V Baranov
- Biochemistry Department, University College Cork, Cork, Ireland.
| | | | | |
Collapse
|
19
|
Vallabhaneni H, Fan-Minogue H, Bedwell DM, Farabaugh PJ. Connection between stop codon reassignment and frequent use of shifty stop frameshifting. RNA (NEW YORK, N.Y.) 2009; 15:889-897. [PMID: 19329535 PMCID: PMC2673066 DOI: 10.1261/rna.1508109] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2008] [Accepted: 02/05/2009] [Indexed: 05/27/2023]
Abstract
Ciliated protozoa of the genus Euplotes have undergone genetic code reassignment, redefining the termination codon UGA to encode cysteine. In addition, Euplotes spp. genes very frequently employ shifty stop frameshifting. Both of these phenomena involve noncanonical events at a termination codon, suggesting they might have a common cause. We recently demonstrated that Euplotes octocarinatus peptide release factor eRF1 ignores UGA termination codons while continuing to recognize UAA and UAG. Here we show that both the Tetrahymena thermophila and E. octocarinatus eRF1 factors allow efficient frameshifting at all three termination codons, suggesting that UGA redefinition also impaired UAA/UAG recognition. Mutations of the Euplotes factor restoring a phylogenetically conserved motif in eRF1 (TASNIKS) reduced programmed frameshifting at all three termination codons. Mutation of another conserved residue, Cys124, strongly reduces frameshifting at UGA while actually increasing frameshifting at UAA/UAG. We will discuss these results in light of recent biochemical characterization of these mutations.
Collapse
Affiliation(s)
- Haritha Vallabhaneni
- Program in Molecular and Cell Biology, Department of Biological Sciences, University of Maryland Baltimore County, Baltimore,Maryland 21250, USA
| | | | | | | |
Collapse
|
20
|
|
21
|
On the evolution of the standard genetic code: vestiges of critical scale invariance from the RNA world in current prokaryote genomes. PLoS One 2009; 4:e4340. [PMID: 19183813 PMCID: PMC2631149 DOI: 10.1371/journal.pone.0004340] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2008] [Accepted: 11/21/2008] [Indexed: 11/19/2022] Open
Abstract
Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10(-10) per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC.
Collapse
|
22
|
Fan-Minogue H, Du M, Pisarev AV, Kallmeyer AK, Salas-Marco J, Keeling KM, Thompson SR, Pestova TV, Bedwell DM. Distinct eRF3 requirements suggest alternate eRF1 conformations mediate peptide release during eukaryotic translation termination. Mol Cell 2008; 30:599-609. [PMID: 18538658 DOI: 10.1016/j.molcel.2008.03.020] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2008] [Revised: 02/25/2008] [Accepted: 03/28/2008] [Indexed: 10/22/2022]
Abstract
Organisms that use the standard genetic code recognize UAA, UAG, and UGA as stop codons, whereas variant code species frequently alter this pattern of stop codon recognition. We previously demonstrated that a hybrid eRF1 carrying the Euplotes octocarinatus domain 1 fused to Saccharomyces cerevisiae domains 2 and 3 (Eo/Sc eRF1) recognized UAA and UAG, but not UGA, as stop codons. In the current study, we identified mutations in Eo/Sc eRF1 that restore UGA recognition and define distinct roles for the TASNIKS and YxCxxxF motifs in eRF1 function. Mutations in or near the YxCxxxF motif support the cavity model for stop codon recognition by eRF1. Mutations in the TASNIKS motif eliminated the eRF3 requirement for peptide release at UAA and UAG codons, but not UGA codons. These results suggest that the TASNIKS motif and eRF3 function together to trigger eRF1 conformational changes that couple stop codon recognition and peptide release during eukaryotic translation termination.
Collapse
Affiliation(s)
- Hua Fan-Minogue
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Tan H, Wang J, Yang F, Zhao ZK. Genome-wide analysis of coding DNA and amino acid variation in Saccharomyces cerevisiae. Yeast 2008; 25:29-39. [PMID: 17914746 DOI: 10.1002/yea.1547] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The possible causes of variation on amino acid composition in the yeast Saccharomyces cerevisiae were investigated genome-wide. The results indicated that: (a) the base composition of coding DNA and amino acid composition was similar among all the chromosomes, which was in sharp contrast with the great varies of the composition of the individual's coding DNA and amino acid; (b) some amino acids (e.g. Cys and Trp) were not present in all the proteins; and (c) amino acid bias was associated with a base bias (in terms of A-, G-, C- and T-rich codons). Based on the third rule and a proposed universal trend of amino acid gain and loss in protein evolution, the changing pattern of coding DNA was predicted to be T- and C-accruing, whereas A and G were consistently reducing. All these results held the potential to reveal precisely how DNA ongoing change has a major effect on the composition of proteins.
Collapse
Affiliation(s)
- Haidong Tan
- Division of Biotechnology, Dalian Institute of Chemical Physics, CAS, Dalian 116023, People's Republic of China.
| | | | | | | |
Collapse
|
24
|
Demongeot J, Moreira A. A possible circular RNA at the origin of life. J Theor Biol 2007; 249:314-24. [PMID: 17825325 DOI: 10.1016/j.jtbi.2007.07.010] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2007] [Revised: 07/04/2007] [Accepted: 07/05/2007] [Indexed: 11/24/2022]
Abstract
The increasing volume of sequenced genomes and the recent techniques for performing in vitro molecular evolution have rekindled the interest for questions on the origin of life. Nevertheless, a gap continues to exist between the research on prebiotic chemistry and molecule generation, on one hand, and the study of molecular fossils preserved in genomes, on the other. Here we attempt to fill this gap by using some assumptions about the prebiotic scenario (including a strong stereochemical basis for the genetic code) to determine the RNA sequences more likely to appear and subsist. A set of minimal RNA rings is exhaustively determined; a subset of them is then selected through stability arguments, and a particular ring ("AL ring") is finally singled out as the most likely winner of this prebiotic game. The rings happen to have several structural and statistical properties of modern genes: a repeated AUG codon appears spontaneously (and is thus made available for becoming a start signal), the form AUG/STOP emerges, and frequency patterns resemble those of present genes. The whole set of rings was also compared to a database of tRNAs, considering the conserved positions (located in the free parts of the molecule, essentially the loops); the ring that most closely matched tRNA sequences-and matched, in fact, the consensus of tRNA at all the aligned positions-was AL, the same ring independently selected before. The unselected emergence of gene-like features through two simple selection steps and the close similarity between the finally selected ring and tRNA (including some remarkable features of the resulting alignment) suggest a possible link between the prebiotic world and the first biological molecules, which is amenable for experimental testing. Even if our scenario is partially wrong, the unlikely coincidences should provide useful hints for other efforts.
Collapse
|
25
|
Forterre P, Gribaldo S. The origin of modern terrestrial life. HFSP JOURNAL 2007; 1:156-68. [PMID: 19404443 PMCID: PMC2640990 DOI: 10.2976/1.2759103] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2007] [Accepted: 06/22/2007] [Indexed: 11/19/2022]
Abstract
The study of the origin of life covers many areas of expertise and requires the input of various scientific communities. In recent years, this research field has often been viewed as part of a broader agenda under the name of "exobiology" or "astrobiology." In this review, we have somewhat narrowed this agenda, focusing on the origin of modern terrestrial life. The adjective "modern" here means that we did not speculate on different forms of life that could have possibly appeared on our planet, but instead focus on the existing forms (cells and viruses). We try to briefly present the state of the art about alternative hypotheses discussing not only the origin of life per se, but also how life evolved to produce the modern biosphere through a succession of steps that we would like to characterize as much as possible.
Collapse
Affiliation(s)
- Patrick Forterre
- Institut Pasteur, 25 rue du Docteur Roux,
75015 Paris et Université Paris-Sud, CNRS, UMR 8621, 91405, Crsay-Cedex,
France
| | | |
Collapse
|
26
|
Russell MJ. The alkaline solution to the emergence of life: energy, entropy and early evolution. Acta Biotheor 2007; 55:133-79. [PMID: 17704896 DOI: 10.1007/s10441-007-9018-5] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2007] [Accepted: 05/07/2007] [Indexed: 11/27/2022]
Abstract
The Earth agglomerates and heats. Convection cells within the planetary interior expedite the cooling process. Volcanoes evolve steam, carbon dioxide, sulfur dioxide and pyrophosphate. An acidulous Hadean ocean condenses from the carbon dioxide atmosphere. Dusts and stratospheric sulfurous smogs absorb a proportion of the Sun's rays. The cooled ocean leaks into the stressed crust and also convects. High temperature acid springs, coupled to magmatic plumes and spreading centers, emit iron, manganese, zinc, cobalt and nickel ions to the ocean. Away from the spreading centers cooler alkaline spring waters emanate from the ocean floor. These bear hydrogen, formate, ammonia, hydrosulfide and minor methane thiol. The thermal potential begins to be dissipated but the chemical potential is dammed. The exhaling alkaline solutions are frustrated in their further attempt to mix thoroughly with their oceanic source by the spontaneous precipitation of biomorphic barriers of colloidal iron compounds and other minerals. It is here we surmise that organic molecules are synthesized, filtered, concentrated and adsorbed, while acetate and methane--separate products of the precursor to the reductive acetyl-coenzyme-A pathway-are exhaled as waste. Reactions in mineral compartments produce acetate, amino acids, and the components of nucleosides. Short peptides, condensed from the simple amino acids, sequester 'ready-made' iron sulfide clusters to form protoferredoxins, and also bind phosphates. Nucleotides are assembled from amino acids, simple phosphates carbon dioxide and ribose phosphate upon nanocrystalline mineral surfaces. The side chains of particular amino acids register to fitting nucleotide triplet clefts. Keyed in, the amino acids are polymerized, through acid-base catalysis, to alpha chains. Peptides, the tenuous outer-most filaments of the nanocrysts, continually peel away from bound RNA. The polymers are concentrated at cooler regions of the mineral compartments through thermophoresis. RNA is reproduced through a convective polymerase chain reaction operating between 40 and 100 degrees C. The coded peptides produce true ferredoxins, the ubiquitous proteins with the longest evolutionary pedigree. They take over the role of catalyst and electron transfer agent from the iron sulfides. Other iron-nickel sulfide clusters, sequestered now by cysteine residues as CO-dehydrogenase and acetyl-coenzyme-A synthase, promote further chemosynthesis and support the hatchery--the electrochemical reactor--from which they sprang. Reactions and interactions fall into step as further pathways are negotiated. This hydrothermal circuitry offers a continuous supply of material and chemical energy, as well as electricity and proticity at a potential appropriate for the onset of life in the dark, a rapidly emerging kinetic structure born to persist, evolve and generate entropy while the sun shines.
Collapse
Affiliation(s)
- Michael J Russell
- Planetary Science and Life Detection Section 3220, MS:183-601, Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, CA, 91109-8099, USA.
| |
Collapse
|
27
|
Koonin EV. The cosmological model of eternal inflation and the transition from chance to biological evolution in the history of life. Biol Direct 2007; 2:15. [PMID: 17540027 PMCID: PMC1892545 DOI: 10.1186/1745-6150-2-15] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2007] [Accepted: 05/31/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Recent developments in cosmology radically change the conception of the universe as well as the very notions of "probable" and "possible". The model of eternal inflation implies that all macroscopic histories permitted by laws of physics are repeated an infinite number of times in the infinite multiverse. In contrast to the traditional cosmological models of a single, finite universe, this worldview provides for the origin of an infinite number of complex systems by chance, even as the probability of complexity emerging in any given region of the multiverse is extremely low. This change in perspective has profound implications for the history of any phenomenon, and life on earth cannot be an exception. HYPOTHESIS Origin of life is a chicken and egg problem: for biological evolution that is governed, primarily, by natural selection, to take off, efficient systems for replication and translation are required, but even barebones cores of these systems appear to be products of extensive selection. The currently favored (partial) solution is an RNA world without proteins in which replication is catalyzed by ribozymes and which serves as the cradle for the translation system. However, the RNA world faces its own hard problems as ribozyme-catalyzed RNA replication remains a hypothesis and the selective pressures behind the origin of translation remain mysterious. Eternal inflation offers a viable alternative that is untenable in a finite universe, i.e., that a coupled system of translation and replication emerged by chance, and became the breakthrough stage from which biological evolution, centered around Darwinian selection, took off. A corollary of this hypothesis is that an RNA world, as a diverse population of replicating RNA molecules, might have never existed. In this model, the stage for Darwinian selection is set by anthropic selection of complex systems that rarely but inevitably emerge by chance in the infinite universe (multiverse). CONCLUSION The plausibility of different models for the origin of life on earth directly depends on the adopted cosmological scenario. In an infinite universe (multiverse), emergence of highly complex systems by chance is inevitable. Therefore, under this cosmology, an entity as complex as a coupled translation-replication system should be considered a viable breakthrough stage for the onset of biological evolution. REVIEWERS This article was reviewed by Eric Bapteste, David Krakauer, Sergei Maslov, and Itai Yanai.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|
28
|
Han KH, Kim JH, Kim WS, Han DM. The snpA, a temperature-sensitive suppressor of npgA1, encodes the eukaryotic translation release factor, eRF1, in Aspergillus nidulans. FEMS Microbiol Lett 2006; 251:155-60. [PMID: 16125337 DOI: 10.1016/j.femsle.2005.07.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2005] [Revised: 07/27/2005] [Accepted: 07/28/2005] [Indexed: 11/19/2022] Open
Abstract
The npgA1 mutation causes defects in the outer layer of the cell wall resulting in a colorless colony. In this study, a temperature-sensitive suppressor of npgA1 named snpA was isolated by UV mutagenesis. The suppressing mutant showed pleiotropic phenotypes in cellular structure and developmental processes when incubated at a temperature of 37 degrees C or above. At 37 degrees C, multiple germ tubes emerged from germinating conidia. Moreover, at 42 degrees C conidia germination was delayed more than 12h and hyphal growth was strongly inhibited. The suppressor allele, snpA6, is recessive and maps to the linkage group III. A gene complementing the mutation was identified employing the chromosome III-specific cosmid library. Sequencing analysis revealed that the snpA gene encodes the eukaryotic polypeptide release factor, eRF1. The snpA6 allele contains a G-A mutation resulting in SnpA(E117K), which may allow read-through of the nonsense mutation in the npgA1 allele in a similar manner to the yeast omni-potent suppressor SUP45 and SUP35.
Collapse
Affiliation(s)
- Kap-Hoon Han
- Research Center for Biomedicinal Resources, Pai Chai University, Daejeon 302-735, Republic of Korea
| | | | | | | |
Collapse
|
29
|
Wu HL, Bagby S, van den Elsen JMH. Evolution of the Genetic Triplet Code via Two Types of Doublet Codons. J Mol Evol 2005; 61:54-64. [PMID: 16059752 DOI: 10.1007/s00239-004-0224-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2004] [Accepted: 02/21/2005] [Indexed: 10/25/2022]
Abstract
Explaining the apparent non-random codon distribution and the nature and number of amino acids in the 'standard' genetic code remains a challenge, despite the various hypotheses so far proposed. In this paper we propose a simple new hypothesis for code evolution involving a progression from singlet to doublet to triplet codons with a reading mechanism that moves three bases each step. We suggest that triplet codons gradually evolved from two types of ambiguous doublet codons, those in which the first two bases of each three-base window were read ('prefix' codons) and those in which the last two bases of each window were read ('suffix' codons). This hypothesis explains multiple features of the genetic code such as the origin of the pattern of four-fold degenerate and two-fold degenerate triplet codons, the origin of its error minimising properties, and why there are only 20 amino acids.
Collapse
Affiliation(s)
- Huan-Lin Wu
- Department of Biology and Biochemistry, University of Bath, 4 South, Claverton Down, Bath BA2 7AY, UK
| | | | | |
Collapse
|
30
|
|
31
|
Zhang Y, Baranov PV, Atkins JF, Gladyshev VN. Pyrrolysine and Selenocysteine Use Dissimilar Decoding Strategies. J Biol Chem 2005; 280:20740-51. [PMID: 15788401 DOI: 10.1074/jbc.m501458200] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Selenocysteine (Sec) and pyrrolysine (Pyl) are known as the 21st and 22nd amino acids in protein. Both are encoded by codons that normally function as stop signals. Sec specification by UGA codons requires the presence of a cis-acting selenocysteine insertion sequence (SECIS) element. Similarly, it is thought that Pyl is inserted by UAG codons with the help of a putative pyrrolysine insertion sequence (PYLIS) element. Herein, we analyzed the occurrence of Pyl-utilizing organisms, Pyl-associated genes, and Pyl-containing proteins. The Pyl trait is restricted to several microbes, and only one organism has both Pyl and Sec. We found that methanogenic archaea that utilize Pyl have few genes that contain in-frame UAG codons, and many of these are followed with nearby UAA or UGA codons. In addition, unambiguous UAG stop signals could not be identified. This bias was not observed in Sec-utilizing organisms and non-Pyl-utilizing archaea, as well as with other stop codons. These observations as well as analyses of the coding potential of UAG codons, overlapping genes, and release factor sequences suggest that UAG is not a typical stop signal in Pyl-utilizing archaea. On the other hand, searches for conserved Pyl-containing proteins revealed only four protein families, including methylamine methyltransferases and transposases. Only methylamine methyltransferases matched the Pyl trait and had conserved Pyl, suggesting that this amino acid is used primarily by these enzymes. These findings are best explained by a model wherein UAG codons may have ambiguous meaning and Pyl insertion can effectively compete with translation termination for UAG codons obviating the need for a specific PYLIS structure. Thus, Sec and Pyl follow dissimilar decoding and evolutionary strategies.
Collapse
Affiliation(s)
- Yan Zhang
- Department of Biochemistry, University of Nebraska, Lincoln, Nebraska 68588-0664, USA
| | | | | | | |
Collapse
|
32
|
Abstract
Since the early days of the discovery of the genetic code nonrandom patterns have been searched for in the code in the hope of providing information about its origin and early evolution. Here we present a new classification scheme of the genetic code that is based on a binary representation of the purines and pyrimidines. This scheme reveals known patterns more clearly than the common one, for instance, the classification of strong, mixed, and weak codons as well as the ordering of codon families. Furthermore, new patterns have been found that have not been described before: Nearly all quantitative amino acid properties, such as Woese's polarity and the specific volume, show a perfect correlation to Lagerkvist's codon-anticodon binding strength. Our new scheme leads to new ideas about the evolution of the genetic code. It is hypothesized that it started with a binary doublet code and developed via a quaternary doublet code into the contemporary triplet code. Furthermore, arguments are presented against suggestions that a "simpler" code, where only the midbase was informational, was at the origin of the genetic code.
Collapse
Affiliation(s)
- Thomas Wilhelm
- Institute of Molecular Biotechnology, Beutenbergstr. 11, 07745 Jena, Germany.
| | | |
Collapse
|
33
|
Chao AT, Dierick HA, Addy TM, Bejsovec A. Mutations in eukaryotic release factors 1 and 3 act as general nonsense suppressors in Drosophila. Genetics 2004; 165:601-12. [PMID: 14573473 PMCID: PMC1462801 DOI: 10.1093/genetics/165.2.601] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
In a screen for suppressors of the Drosophila wingless(PE4) nonsense allele, we isolated mutations in the two components that form eukaryotic release factor. eRF1 and eRF3 comprise the translation termination complex that recognizes stop codons and catalyzes the release of nascent polypeptide chains from ribosomes. Mutations disrupting the Drosophila eRF1 and eRF3 show a strong maternal-effect nonsense suppression due to readthrough of stop codons and are zygotically lethal during larval stages. We tested nonsense mutations in wg and in other embryonically acting genes and found that different stop codons can be suppressed but only a subset of nonsense alleles are subject to suppression. We suspect that the context of the stop codon is significant: nonsense alleles sensitive to suppression by eRF1 and eRF3 encode stop codons that are immediately followed by a cytidine. Such suppressible alleles appear to be intrinsically weak, with a low level of readthrough that is enhanced when translation termination is disrupted. Thus the eRF1 and eRF3 mutations provide a tool for identifying nonsense alleles that are leaky. Our findings have important implications for assigning null mutant phenotypes and for selecting appropriate alleles to use in suppressor screens.
Collapse
Affiliation(s)
- Anna T Chao
- Department of Biology, Duke University, Durham, North Carolina 27708-1000, USA
| | | | | | | |
Collapse
|
34
|
Sengupta A, Ghosh I, Mallick J, Thakur AR, Datta K. Presence of a human Hyaluronan binding protein 1 (HABP1) pseudogene-like sequence in Methanosarcina barkeri suggests its linkage in evolution. DNA Cell Biol 2004; 23:301-10. [PMID: 15169609 DOI: 10.1089/104454904323090930] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The gene encoding Hyaluronan binding protein 1 (HABP1) and its homologs have been reported across eukaryotes, from yeast to human. We have reported the presence of processed pseudogenes in several human chromosomes, along with the location of the HABP1 gene on chromosome 17p12-p13. In this study, we report not only the presence of HABP1 pseudogene in other animal species, but also the presence of a homologous sequence in Methanosarcina barkeri, an ancient life form. This sequence has 44.8% homology to the human HABP1 cDNA and 45.3% homology with the HABP1 pseudogene in human chromosome 21. This sequence has a high G + C content (57%), characteristic of archaea, a family to which M. barkeri belongs. The presence of this HABP1 cDNA like fragment in M. barkeri might enable us to shed light on the evolution of the HABPl gene and whether it was present in a common ancestral organism before the lineages separated.
Collapse
Affiliation(s)
- Aniruddha Sengupta
- Biochemistry Laboratory, School of Environmental Sciences, Jawaharlal Nehru University, New Delhi, India
| | | | | | | | | |
Collapse
|
35
|
Abstract
Since discovering the pattern by which amino acids are assigned to codons within the standard genetic code, investigators have explored the idea that natural selection placed biochemically similar amino acids near to one another in coding space so as to minimize the impact of mutations and/or mistranslations. The analytical evidence to support this theory has grown in sophistication and strength over the years, and counterclaims questioning its plausibility and quantitative support have yet to transcend some significant weaknesses in their approach. These weaknesses are illustrated here by means of a simple simulation model for adaptive genetic code evolution. There remain ill explored facets of the 'error minimizing' code hypothesis, however, including the mechanism and pathway by which an adaptive pattern of codon assignments emerged, the extent to which natural selection created synonym redundancy, its role in shaping the amino acid and nucleotide languages, and even the correct interpretation of the adaptive codon assignment pattern: these represent fertile areas for future research.
Collapse
Affiliation(s)
- Stephen J Freeland
- Department of Biology, University of Maryland, Baltimore County, Catonsville, MD, USA.
| | | | | |
Collapse
|
36
|
Abstract
The primordial genetic code probably has been a drastically simplified ancestor of the canonical code that is used by contemporary cells. In order to understand how the present-day code came about we first need to explain how the language of the building plan can change without destroying the encoded information. In this work we introduce a minimal organism model that is based on biophysically reasonable descriptions of RNA and protein, namely secondary structure folding and knowledge based potentials. The evolution of a population of such organism under competition for a common resource is simulated explicitly at the level of individual replication events. Starting with very simple codes, and hence greatly reduced amino acid alphabets, we observe a diversification of the codes in most simulation runs. The driving force behind this effect is the possibility to produce fitter proteins when the repertoire of amino acids is enlarged.
Collapse
Affiliation(s)
- Günter Weberndorfer
- Institut für Theoretische Chemie und Molekulare Strukturbiologie, Universität Wien, Wien, Austria
| | | | | |
Collapse
|
37
|
Chavatte L, Kervestin S, Favre A, Jean-Jean O. Stop codon selection in eukaryotic translation termination: comparison of the discriminating potential between human and ciliate eRF1s. EMBO J 2003; 22:1644-53. [PMID: 12660170 PMCID: PMC152891 DOI: 10.1093/emboj/cdg146] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
During eukaryotic translation termination, eRF1 responds to three stop codons. However, in ciliates with variant genetic codes, only one or two codons function as a stop signal. To localize the region of ciliate eRF1 implicated in stop codon discrimination, we have constructed ciliate-human hybrid eRF1s by swapping regions of human eRF1 for the equivalent region of ciliate Euplotes eRF1. We have examined the formation of a cross-link between recombinant eRF1s and mRNA analogs containing the photoactivable 4-thiouridine (s(4)U) at the first position of stop and control sense codons. With human eRF1, this cross-link can be detected only when either stop or UGG codons are located in the ribosomal A site. Here we show that the cross-link of the Euplotes-human hybrid eRF1 is restricted to mRNAs containing UAG and UAA codons, and that the entire N-terminal domain of Euplotes eRF1 is involved in discriminating against UGA and UGG. On the basis of these results, we discuss the steps of the selection process that determine the accuracy of stop codon recognition in eukaryotes.
Collapse
Affiliation(s)
- Laurent Chavatte
- Institut Jacques Monod, UMR 7592 CNRS-Universités Paris 7-Paris 6, France
| | | | | | | |
Collapse
|
38
|
Abstract
The mechanism of translation termination has long been a puzzle. Recent crystallographic evidence suggests that the eukaryotic release factor (eRF1), the bacterial release factor (RF2) and the ribosome recycling factor (RRF) all mimic a tRNA structure, whereas biochemical and genetic evidence supports the idea of a tripeptide 'anticodon' in bacterial release factors RF1 and RF2. However, the suggested structural mimicry of RF2 is not in agreement with the tripeptide 'anticodon' hypothesis and, furthermore, recently determined structures using cryo-electron microscopy show that, when bound to the ribosome, RF2 has a conformation that is distinct from the RF2 crystal structure. In addition, hydroxyl-radical probings of RRF on the ribosome are not in agreement with the simple idea that RRF mimics tRNA in the ribosome A-site. All of this evidence seriously questions the simple concept of structural mimicry between proteins and RNA and, thus, leaves only functional mimicry of protein factors of translation to be investigated.
Collapse
Affiliation(s)
- Yoshikazu Nakamura
- Department of Basic Medical Sciences, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
| | | |
Collapse
|
39
|
Seit-Nebi A, Frolova L, Kisselev L. Conversion of omnipotent translation termination factor eRF1 into ciliate-like UGA-only unipotent eRF1. EMBO Rep 2002; 3:881-6. [PMID: 12189178 PMCID: PMC1084231 DOI: 10.1093/embo-reports/kvf178] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In eukaryotic ribosomes, termination of translation is triggered by class 1 polypeptide release factor, eRF1. In organisms with a universal code, eRF1 responds to three stop codons, whereas, in ciliates with variant codes, only one or two codon(s) remain(s) as stop signals. By mutagenesis of the Y-C-F minidomain of the N domain, we converted an omnipotent human eRF1 recognizing all three stop codons into a unipotent 'ciliate-like' UGA-only eRF1. The conserved Cys127 located in the Y-C-F minidomain plays a critical role in stop codon recognition. The UGA-only response has also been achieved by concomitant substitutions of four other amino acids located at the Y-C-F and NIKS minidomains of eRF1. We suggest that for eRF1 the stop codon decoding is of a non-linear (non-protein-anticodon) type and explores a combination of positive and negative determinants. We assume that stop codon recognition is profoundly different by eukaryotic and prokaryotic class 1 RFs.
Collapse
Affiliation(s)
- Alim Seit-Nebi
- Engelhardt Institute of Molecular Biology, Moscow, Russia
| | | | | |
Collapse
|
40
|
Ito K, Frolova L, Seit-Nebi A, Karamyshev A, Kisselev L, Nakamura Y. Omnipotent decoding potential resides in eukaryotic translation termination factor eRF1 of variant-code organisms and is modulated by the interactions of amino acid sequences within domain 1. Proc Natl Acad Sci U S A 2002; 99:8494-9. [PMID: 12084909 PMCID: PMC124286 DOI: 10.1073/pnas.142690099] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In eukaryotes, a single translational release factor, eRF1, deciphers three stop codons, although its decoding mechanism remains puzzling. In the ciliate Tetrahymena thermophila, UAA and UAG codons are reassigned to Gln codons. A yeast eRF1-domain swap containing Tetrahymena domain 1 responded only to UGA in vitro and failed to complement a defect in yeast eRF1 in vivo at 37 degrees C. This finding demonstrates that decoding specificity of eRF1 from variant code organisms resides at domain 1. However, the wild-type eRF1 hybrid fully restored the growth of eRF1-deficient yeast at 30 degrees C. Tetrahymena eRF1 contains a variant sequence, KATNIKD, at the tip of domain 1. The TASNIKD variant of hybrid eRF1 rendered the eRF1-nullified yeast viable, although in an in vitro assay, the same hybrid eRF1 responded only to UGA. Nevertheless, the yeast eRF1 bearing the KATNIKD motif instead of the TASNIKS heptapeptide present in higher eukaryotes remains omnipotent in vivo. Collectively, these data suggest that variant genetic code organisms like Tetrahymena have an intrinsic potential to decode three stop codons in vivo, and that interaction within domain 1 between the KAT tripeptide and other sequences modulates the decoding specificity of Tetrahymena eRF1.
Collapse
Affiliation(s)
- Koichi Ito
- Department of Basic Medical Sciences, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| | | | | | | | | | | |
Collapse
|
41
|
Sowerby SJ, Petersen GB, Holm NG. Primordial coding of amino acids by adsorbed purine bases. ORIGINS LIFE EVOL B 2002; 32:35-46. [PMID: 11889916 DOI: 10.1023/a:1013957812213] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Scanning tunneling microscopy and chromatography experiments exploring the potential templating properties of nucleic acid bases adsorbed to the surface of crystalline graphite, revealed that the interactions of amino acids with the bare crystal surface are significantly modulated by the prior adsorption of adenine and hypoxanthine. These bases are the coding elements of a putative purine-only genetic alphabet and the observed effects are different for each of the bases. Such mapping between bases and amino acids provides a coding mechanism. These observations demonstrate that a simple pre-RNA amino acid discrimination mechanism could have existed on the prebiotic Earth providing critical functionality for the origin of life.
Collapse
Affiliation(s)
- Stephen J Sowerby
- Department of Geology and Geochemistry, Stockholm University, Stockholm, Sweden
| | | | | |
Collapse
|
42
|
Inagaki Y, Blouin C, Doolittle WF, Roger AJ. Convergence and constraint in eukaryotic release factor 1 (eRF1) domain 1: the evolution of stop codon specificity. Nucleic Acids Res 2002; 30:532-44. [PMID: 11788716 PMCID: PMC99827 DOI: 10.1093/nar/30.2.532] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Class 1 release factor in eukaryotes (eRF1) recognizes stop codons and promotes peptide release from the ribosome. The 'molecular mimicry' hypothesis suggests that domain 1 of eRF1 is analogous to the tRNA anticodon stem-loop. Recent studies strongly support this hypothesis and several models for specific interactions between stop codons and residues in domain 1 have been proposed. In this study we have sequenced and identified novel eRF1 sequences across a wide diversity of eukaryotes and re-evaluated the codon-binding site by bioinformatic analyses of a large eRF1 dataset. Analyses of the eRF1 structure combined with estimates of evolutionary rates at amino acid sites allow us to define the residues that are under structural (i.e. those involved in intramolecular interactions) versus non-structural selective constraints. Furthermore, we have re-assessed convergent substitutions in the ciliate variant code eRF1s using maximum likelihood-based phylogenetic approaches. Our results favor the model proposed by Bertram et al. that stop codons bind to three 'cavities' on the protein surface, although we suggest that the stop codon may bind in the opposite orientation to the original model. We assess the feasibility of this alternative binding orientation with a triplet stop codon and the eRF1 domain 1 structures using molecular modeling techniques.
Collapse
Affiliation(s)
- Yuji Inagaki
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada.
| | | | | | | |
Collapse
|
43
|
Kervestin S, Frolova L, Kisselev L, Jean-Jean O. Stop codon recognition in ciliates: Euplotes release factor does not respond to reassigned UGA codon. EMBO Rep 2001; 2:680-4. [PMID: 11463747 PMCID: PMC1083993 DOI: 10.1093/embo-reports/kve156] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In eukaryotes, the polypeptide release factor 1 (eRF1) is involved in translation termination at all three stop codons. However, the mechanism for decoding stop codons remains unknown. A direct interaction of eRF1 with the stop codons has been postulated. Recent studies focus on eRF1 from ciliates in which some stop codons are reassigned to sense codons. Using an in vitro assay based on mammalian ribosomes, we show that eRF1 from the ciliate Euplotes aediculatus responds to UAA and UAG as stop codons and lacks the capacity to decipher the UGA codon, which encodes cysteine in this organism. This result strongly suggests that in ciliates with variant genetic codes eRF1 does not recognize the reassigned codons. Recent hypotheses describing stop codon discrimination by eRF1 are not fully consistent with the set of eRF1 sequences available so far and require direct experimental testing.
Collapse
Affiliation(s)
- S Kervestin
- Unité de Biochimie Cellulaire, CNRS FRE 2219, Université Pierre et Marie Curie, 9 quai Saint-Bernard, 75005 Paris, France
| | | | | | | |
Collapse
|
44
|
Füchslin RM, McCaskill JS. Evolutionary self-organization of cell-free genetic coding. Proc Natl Acad Sci U S A 2001; 98:9185-90. [PMID: 11470896 PMCID: PMC55395 DOI: 10.1073/pnas.151253198] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2000] [Accepted: 05/21/2001] [Indexed: 11/18/2022] Open
Abstract
Genetic encoding provides a generic construction scheme for biomolecular functions. This paper addresses the key problem of coevolution and exploitation of the multiple components necessary to implement a replicable genetic encoding scheme. Extending earlier results on multicomponent replication, the necessity of spatial structure for the evolutionary stabilization of the genetic coding system is established. An individual-based stochastic model of interacting molecules in three-dimensional space is presented that allows the evolution of genetic coding to be analyzed explicitly. A massively parallel configurable computer (NGEN) is used to implement the model, on the time scale of millions of generations, directly in electronic hardware. The spatial correlations between components of the genetic coding system are analyzed and found to be essential for evolutionary stability.
Collapse
Affiliation(s)
- R M Füchslin
- GMD-German National Research Center for Information Technology, Schloss Birlinghoven, D-53754 St. Augustin (Bonn), Germany
| | | |
Collapse
|
45
|
Saito H, Suga H. A ribozyme exclusively aminoacylates the 3'-hydroxyl group of the tRNA terminal adenosine. J Am Chem Soc 2001; 123:7178-9. [PMID: 11459506 DOI: 10.1021/ja015756s] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- H Saito
- Department of Chemistry, University at Buffalo State University of New York Buffalo, New York 14260-3000, USA
| | | |
Collapse
|
46
|
Lahav N, Nir S, Elitzur AC. The emergence of life on Earth. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2001; 75:75-120. [PMID: 11311715 DOI: 10.1016/s0079-6107(01)00003-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Combined top-down and bottom-up research strategies and the principle of biological continuity were employed in an attempt to reconstruct a comprehensive origin of life theory, which is an extension of the coevolution theory (Lahav and Nir, Origins of Life Evol. Biosphere (1997) 27, 377-395). The resulting theory of emergence of templated-information and functionality (ETIF) addresses the emergence of living entities from inanimate matter, and that of the central mechanisms of their further evolution. It proposes the emergence of short organic catalysts (peptides and proto-ribozymes) and feedback-loop systems, plus their template-and-sequence-directed (TSD) reactions, encompassing catalyzed replication and translation of populations of molecules organized as chemical-informational feedback loop entities, in a fluctuating (wetting-drying) environment, functioning as simplified extant molecular-biological systems. The feedback loops with their TSD systems are chemically and functionally continuous with extant living organisms and their emergence in an inanimate environment may be defined as the beginning of life. The ETIF theory considers the emergence of bio-homochirality, a primordial genetic code, information and the incorporation of primordial metabolic cycles and compartmentation into the emerging living entities. This theory helps to establish a novel measure of biological information, which focuses on its physical effects rather than on the structure of the message, and makes it possible to estimate the time needed for the transition from the inanimate state to the closure of the first feedback-loop systems. Moreover, it forms the basis for novel laboratory experiments and computer modeling, encompassing catalytic activity of short peptides and proto-RNAs and the emergence of bio-homochirality and feedback-loop systems.
Collapse
Affiliation(s)
- N Lahav
- Department of Soil and Water Sciences, The Faculty of Agriculture, The Hebrew University of Jerusalem, Rehovot 76100, Israel.
| | | | | |
Collapse
|
47
|
Inagaki Y, Doolittle WF. Class I release factors in ciliates with variant genetic codes. Nucleic Acids Res 2001; 29:921-7. [PMID: 11160924 PMCID: PMC29606 DOI: 10.1093/nar/29.4.921] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In eukaryotes with the universal genetic code a single class I release factor (eRF1) most probably recognizes all stop codons (UAA, UAG and UGA) and is essential for termination of nascent peptide synthesis. It is well established that stop codons have been reassigned to amino acid codons at least three times among ciliates. The codon specificities of ciliate eRF1s must have been modified to accommodate the variant codes. In this study we have amplified, cloned and sequenced eRF1 genes of two hypotrichous ciliates, Oxytricha trifallax (UAA and UAG for Gln) and Euplotes aediculatus (UGA for Cys). We also sequenced/identified three protist and two archaeal class I RF genes to enlarge the database of eRF1/aRF1s with the universal code. Extensive comparisons between universal code eRF1s and those of Oxytricha, Euplotes, and Tetrahymena which represent three lineages that acquired variant codes independently, provide important clues to identify stop codon-binding regions in eRF1. Domain 1 in the five ciliate eRF1s, particularly the TASNIKS heptapeptide and its adjacent region, differs significantly from domain 1 in universal code eRF1s. This observation suggests that domain 1 contains the codon recognition site, but that the mechanism of eRF1 codon recognition may be more complex than proposed by Nakamura et al. or Knight and Landweber.
Collapse
Affiliation(s)
- Y Inagaki
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada.
| | | |
Collapse
|
48
|
Abstract
BACKGROUND The nuclear genetic code has changed in several lineages of ciliates. These changes, UAR to glutamine and UGA to cysteine, imply that eukaryotic release factor 1 (eRF1), the protein that recognizes stop codons and terminates translation, changes specificity. Here we test whether changes in eRF1 drive genetic code evolution. RESULTS Database sequence analysis reveals numerous genetic code alterations in ciliates, including UGA --> tryptophan in Blepharisma americanum and the distantly related Colpoda. We sequenced eRF1 from four ciliates: B. americanum, a heterotrich that independently derived the same eRF1 specificity as Euplotes, and three spirotrichs, Stylonychia lemnae, S. mytilus, and Oxytricha trifallax, that independently derived the same genetic code as Tetrahymena (UAR --> glutamine). Distantly related ciliates with similar codes show characteristic changes in eRF1. We used a sliding window analysis to test associations between changes in specific eRF1 residues and changes in the genetic code. The regions of eRF1 that display convergent substitutions are identical to those identified in a recently reported nonsense suppression mutant screen in yeast. CONCLUSIONS Genetic code change by stop codon reassignment is surprisingly frequent in ciliates, with UGA --> tryptophan occurring twice independently. This is the first description of this code, previously found only in bacteria and mitochondria, in a eukaryotic nuclear genome. eRF1 has evolved strikingly convergently in lineages with variant genetic codes. The strong concordance with biochemical data indicates that our methodology may be generally useful for detecting molecular determinants of biochemical changes in evolution.
Collapse
Affiliation(s)
- C A Lozupone
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA
| | | | | |
Collapse
|