1
|
Ruzov AS, Ermakov AS. The non-canonical nucleotides and prebiotic evolution. Biosystems 2025; 248:105411. [PMID: 39900260 DOI: 10.1016/j.biosystems.2025.105411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Revised: 12/23/2024] [Accepted: 01/31/2025] [Indexed: 02/05/2025]
Abstract
The mystery of the origin of life has been puzzling mankind for several millenia. Starting from the second half of the 20th century, when the crucial role of nucleic acids in biological heredity became apparent, the emphasis in the field has shifted to the explanation of the origin of nucleic acids and the mechanisms of copying of macromolecules. In the 1960s, the hypothesis of the RNA World was proposed, according to which the first stages of the origin of life on Earth were associated with the appearance of self-replicating complexes based on RNA, that were akin to RNA-enzymes that catalyze critical for life chemical reactions. Currently, it has been shown that different forms of RNA include not only canonical (adenine, uracil, guanine, cytosine), but also about 170 non-canonical nucleotides. In this review, we discuss potential roles of these non-canonical nucleotides in the processes of molecular prebiotic evolution, such as the emergence of canonical RNA nucleotides and catalytic RNAs, as well as the origin of template synthesis of RNA and proteins.
Collapse
Affiliation(s)
- Alexey S Ruzov
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Sciences, 119071, Moscow, Russia
| | - Alexander S Ermakov
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Sciences, 119071, Moscow, Russia; Faculty of Biology, Lomonosov Moscow State University, 119991, Moscow, Russia.
| |
Collapse
|
2
|
Seelig B, Chen IA. Intellectual frameworks to understand complex biochemical systems at the origin of life. Nat Chem 2025; 17:11-19. [PMID: 39762573 DOI: 10.1038/s41557-024-01698-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 11/14/2024] [Indexed: 01/11/2025]
Abstract
Understanding the emergence of complex biochemical systems, such as protein translation, is a great challenge. Although synthetic approaches can provide insight into the potential early stages of life, they do not address the equally important question of why the complex systems of life would have evolved. In particular, the intricacies of the mechanisms governing the transfer of information from nucleic acid sequences to proteins make it difficult to imagine how coded protein synthesis could have emerged from a prebiotic soup. Here we discuss the use of intellectual frameworks in studying the emergence of life. We discuss how one such framework, namely the RNA world theory, has spurred research, and provide an overview of its limitations. We suggest that the emergence of coded protein synthesis could be broken into experimentally tractable problems by treating it as a molecular bricolage-a complex system integrating many different parts, each of which originally evolved for uses unrelated to its modern function-to promote a concrete understanding of its origin.
Collapse
Affiliation(s)
- Burckhard Seelig
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA.
- BioTechnology Institute, University of Minnesota, St. Paul, MN, USA.
| | - Irene A Chen
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
3
|
Di Giulio M. Theories of the origin of the genetic code: Strong corroboration for the coevolution theory. Biosystems 2024; 239:105217. [PMID: 38663520 DOI: 10.1016/j.biosystems.2024.105217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/29/2024]
Abstract
I analyzed all the theories and models of the origin of the genetic code, and over the years, I have considered the main suggestions that could explain this origin. The conclusion of this analysis is that the coevolution theory of the origin of the genetic code is the theory that best captures the majority of observations concerning the organization of the genetic code. In other words, the biosynthetic relationships between amino acids would have heavily influenced the origin of the organization of the genetic code, as supported by the coevolution theory. Instead, the presence in the genetic code of physicochemical properties of amino acids, which have also been linked to the physicochemical properties of anticodons or codons or bases by stereochemical and physicochemical theories, would simply be the result of natural selection. More explicitly, I maintain that these correlations between codons, anticodons or bases and amino acids are in fact the result not of a real correlation between amino acids and codons, for example, but are only the effect of the intervention of natural selection. Specifically, in the genetic code table we expect, for example, that the most similar codons - that is, those that differ by only one base - will have more similar physicochemical properties. Therefore, the 64 codons of the genetic code table ordered in a certain way would also represent an ordering of some of their physicochemical properties. Now, a study aimed at clarifying which physicochemical property of amino acids has influenced the allocation of amino acids in the genetic code has established that the partition energy of amino acids has played a role decisive in this. Indeed, under some conditions, the genetic code was found to be approximately 98% optimized on its columns. In this same work, it was shown that this was most likely the result of the action of natural selection. If natural selection had truly allocated the amino acids in the genetic code in such a way that similar amino acids also have similar codons - this, not through a mechanism of physicochemical interaction between, for example, codons and amino acids - then it might turn out that even different physicochemical properties of codons (or anticodons or bases) show some correlation with the physicochemical properties of amino acids, simply because the partition energy of amino acids is correlated with other physicochemical properties of amino acids. It is very likely that this would inevitably lead to a correlation between codons (or anticodons or bases) and amino acids. In other words, since the codons (anticodons or bases) are ordered in the genetic code, that is to say, some of their physicochemical properties should also be ordered by a similar order, and given that the amino acids would also appear to have been ordered in the genetic code by selection natural, then it should inevitably turn out that there is a correlation between, for example, the hydrophobicity of anticodons and that of amino acids. Instead, the intervention of natural selection in organizing the genetic code would appear to be highly compatible with the main mechanism of structuring the genetic code as supported by the coevolution theory. This would make the coevolution theory the only plausible explanation for the origin of the genetic code.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
4
|
Di Giulio M. The time of appearance of the genetic code. Biosystems 2024; 237:105159. [PMID: 38373543 DOI: 10.1016/j.biosystems.2024.105159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/13/2024] [Accepted: 02/16/2024] [Indexed: 02/21/2024]
Abstract
I support the hypothesis that the origin of the genetic code occurred simultaneously with the evolution of cellularity. That is to say, I favour the hypothesis that the origin of the genetic code is a very, very late event in the history of life on Earth. I corroborate this hypothesis with observations favouring the progenote's stage for the Last Universal Common Ancestor (LUCA), for the ancestor of bacteria and that of archaea. Indeed, these progenotic stages would imply that - at that time - the origin of the genetic code was still ongoing simply because this origin would fall within the very definition of progenote. Therefore, if the evolution of cellularity had truly been coeval with the origin of the genetic code - at least in its terminal part - then this would favour theories such as the coevolution theory of the origin of the genetic code because this theory would postulate that this origin must have occurred in extremely complex protocellular conditions and not concerning stereochemical or physicochemical interactions having to do with other stages of the origin of life. In this sense, the coevolution theory would be corroborated while the stereochemical and physicochemical theories would be damaged. Therefore, the origin of the genetic code would be linked to the origin of the cell and not to the origin of life as sometimes asserted. Therefore, I will discuss the late hypothesis of the origin of the genetic code in the context of the theories proposed to explain this origin and more generally of its implications for the early evolution of life.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
5
|
Abstract
Preexisting partial genetic codes can fuse to evolve towards the complete Standard Genetic Code (SGC). Such code fusion provides a path of 'least selection', readily generating precursor codes that resemble the SGC. Consequently, such least selections produce the SGC via minimal, thus rapid, change. Optimal code evolution therefore requires delayed wobble. Early wobble encoding slows code evolution, very specifically diminishing the most likely SGC precursors: near-complete, accurate codes which are the products of code fusions. In contrast: given delayed wobble, the SGC can emerge from a truncation selection/evolutionary radiation based on proficient fused coding.
Collapse
Affiliation(s)
- Michael Yarus
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO, USA
| |
Collapse
|
6
|
Martínez Giménez JA, Tabares Seisdedos R. A Cofactor-Based Mechanism for the Origin of the Genetic Code. ORIGINS LIFE EVOL B 2022; 52:149-163. [PMID: 36071304 DOI: 10.1007/s11084-022-09628-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 08/02/2022] [Indexed: 11/24/2022]
Abstract
The origin of the genetic code is probably the central problem of the studies on the origin of life. The key question to answer is the molecular mechanism that allows the association of the amino acids with their triplet codons. We proposed that the codon-anticodon duplex located in the acceptor stem of primitive tRNAs would facilitate the chemical reactions required to synthesize cognate amino acids from simple amino acids (glycine, valine, and aspartic acid) linked to the 3' acceptor end. In our view, various nucleotide-A-derived cofactors (with reactive chemical groups) may be attached to the codon-anticodon duplex, which allows group-transferring reactions from cofactors to simple amino acids, thereby producing the final amino acid. The nucleotide-A-derived cofactors could be incorporated into the RNA duplex (helix) by docking Adenosine (cofactor) into the minor groove via an interaction similar to the A-minor motif, forming a base triple between Adenosine and one complementary base pair of the duplex. Furthermore, we propose that this codon-anticodon duplex could initially catalyze a self-aminoacylation reaction with a simple amino acid. Therefore, the sequence of bases in the codon-anticodon duplex would determine the reactions that occurred during the formation of new amino acids for selective binding of nucleotide-A-derived cofactors.
Collapse
Affiliation(s)
| | - Rafael Tabares Seisdedos
- Departamento de Medicina, Facultad de Medicina de Valencia, (CIBERSAM; INCLIVA-UV), Universidad de Valencia, Av. Blasco Ibañez 17, 46010, Valencia, Spain.
| |
Collapse
|
7
|
Arguments against the stereochemical theory of the origin of the genetic code. Biosystems 2022; 221:104750. [PMID: 35970477 DOI: 10.1016/j.biosystems.2022.104750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 07/26/2022] [Accepted: 07/26/2022] [Indexed: 11/23/2022]
Abstract
I support the hypothesis that stereochemical theory is unnatural because it is based on artificial and not simple mechanisms as required for a good theory. Indeed, for stereochemical theory the origin of the genetic code requires, in the first place, a primary interaction, for example, between a codon and an amino acid on a proto-tRNA. But this interaction is a necessary but not sufficient condition, because the evolution of the mRNA molecule, which would really define the genetic code, is still necessary for the complete origin of the genetic code. In other words, the need for two molecules, tRNA and mRNA, to define the genetic code, with their at least partial independence would testify to an artificial mechanism typical of stereochemical theory because it would not guarantee that amino acid-codon (or -anticodon) assignments realized in the first phase of the origin of the genetic code, would necessarily be maintained also in the second phase of its completion. Furthermore, the genetic code encodes for amino acids but amino acids are not the truly functional aspect, they are only intermediaries, of their final products, proteins, which are the only true entities actually coded by genes. Therefore, it would not be immediately clear from the point of view of stereochemical theory, to say why it is the amino acids and not the proteins that are involved in the primary stereochemical interactions that would have led to the origin of the genetic code. Hence, at least some of the stereochemical theory models would be not very credible, not being able to say much about the coding of proteins by genes. Finally, I inspected the genetic code table following the logic that more closely similar amino acids should - according to stereochemical theory - be coded by highly similar codons, finding that only a few pairs of amino acids actually satisfy this logic, further discretizing the stereochemical theory.
Collapse
|
8
|
Abstract
The RNA world concept1 is one of the most fundamental pillars of the origin of life theory2–4. It predicts that life evolved from increasingly complex self-replicating RNA molecules1,2,4. The question of how this RNA world then advanced to the next stage, in which proteins became the catalysts of life and RNA reduced its function predominantly to information storage, is one of the most mysterious chicken-and-egg conundrums in evolution3–5. Here we show that non-canonical RNA bases, which are found today in transfer and ribosomal RNAs6,7, and which are considered to be relics of the RNA world8–12, are able to establish peptide synthesis directly on RNA. The discovered chemistry creates complex peptide-decorated RNA chimeric molecules, which suggests the early existence of an RNA–peptide world13 from which ribosomal peptide synthesis14 may have emerged15,16. The ability to grow peptides on RNA with the help of non-canonical vestige nucleosides offers the possibility of an early co-evolution of covalently connected RNAs and peptides13,17,18, which then could have dissociated at a higher level of sophistication to create the dualistic nucleic acid–protein world that is the hallmark of all life on Earth. Peptide synthesis can take place directly on RNA, which suggests how a nucleic acid–protein world might have originated on early Earth.
Collapse
|
9
|
Kondratyeva LG, Dyachkova MS, Galchenko AV. The Origin of Genetic Code and Translation in the Framework of Current Concepts on the Origin of Life. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:150-169. [PMID: 35508902 DOI: 10.1134/s0006297922020079] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The origin of genetic code and translation system is probably the central and most difficult problem in the investigations on the origin of life and one of the most complex problems in the evolutionary biology in general. There are multiple hypotheses on the emergence and development of existing genetic systems that propose the mechanisms for the origin and early evolution of genetic code, as well as for the emergence of replication and translation. Here, we discuss the most well-known of these hypotheses, although none of them provides a description of the early evolution of genetic systems without gaps and assumptions. The RNA world hypothesis is a currently prevailing scientific idea on the early evolution of biological and pre-biological structures, the main advantage of which is the assumption that RNAs as the first living systems were self-sufficient, i.e., capable of functioning as both catalysts and templates. However, this hypothesis has also significant limitations. In particular, no ribozymes with processive polymerase activity have been yet discovered or synthesized. Taking into account the mutual need of proteins and nucleic acids in each other in the current world, many authors propose the early evolution scenarios based on the co-evolution of these two classes of organic molecules. They postulate that the emergence of translation was necessary for the replication of nucleic acids, in contrast to the RNA world hypothesis, according to which the emergence of translation was preceded by the era of self-replicating RNAs. Although such scenarios are less parsimonious from the evolutionary point of view, since they require simultaneous emergence and evolution of two classes of organic molecules, as well as the emergence of synchronized replication and translation, their major advantage is that they explain the development of processive and much more accurate protein-dependent replication.
Collapse
Affiliation(s)
- Liya G Kondratyeva
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| | | | - Alexey V Galchenko
- Peoples' Friendship University of Russia (RUDN University), Moscow, 117198, Russia.
| |
Collapse
|
10
|
Caldararo F, Di Giulio M. The genetic code is very close to a global optimum in a model of its origin taking into account both the partition energy of amino acids and their biosynthetic relationships. Biosystems 2022; 214:104613. [DOI: 10.1016/j.biosystems.2022.104613] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Revised: 01/16/2022] [Accepted: 01/17/2022] [Indexed: 01/23/2023]
|
11
|
Kovalenko SP. On the Origin of Genetically Coded Protein Synthesis. RUSSIAN JOURNAL OF BIOORGANIC CHEMISTRY 2021. [DOI: 10.1134/s1068162021060121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
12
|
Genome Evolution from Random Ligation of RNAs of Autocatalytic Sets. Int J Mol Sci 2021; 22:ijms222413526. [PMID: 34948321 PMCID: PMC8707343 DOI: 10.3390/ijms222413526] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 12/08/2021] [Accepted: 12/15/2021] [Indexed: 11/16/2022] Open
Abstract
The evolutionary origin of the genome remains elusive. Here, I hypothesize that its first iteration, the protogenome, was a multi-ribozyme RNA. It evolved, likely within liposomes (the protocells) forming in dry-wet cycling environments, through the random fusion of ribozymes by a ligase and was amplified by a polymerase. The protogenome thereby linked, in one molecule, the information required to seed the protometabolism (a combination of RNA-based autocatalytic sets) in newly forming protocells. If this combination of autocatalytic sets was evolutionarily advantageous, the protogenome would have amplified in a population of multiplying protocells. It likely was a quasispecies with redundant information, e.g., multiple copies of one ribozyme. As such, new functionalities could evolve, including a genetic code. Once one or more components of the protometabolism were templated by the protogenome (e.g., when a ribozyme was replaced by a protein enzyme), and/or addiction modules evolved, the protometabolism became dependent on the protogenome. Along with increasing fidelity of the RNA polymerase, the protogenome could grow, e.g., by incorporating additional ribozyme domains. Finally, the protogenome could have evolved into a DNA genome with increased stability and storage capacity. I will provide suggestions for experiments to test some aspects of this hypothesis, such as evaluating the ability of ribozyme RNA polymerases to generate random ligation products and testing the catalytic activity of linked ribozyme domains.
Collapse
|
13
|
Factors in Protobiomonomer Selection for the Origin of the Standard Genetic Code. Acta Biotheor 2021; 69:745-767. [PMID: 34283307 DOI: 10.1007/s10441-021-09420-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 07/01/2021] [Indexed: 10/20/2022]
Abstract
Natural selection of specific protobiomonomers during abiogenic development of the prototype genetic code is hindered by the diversity of structural, spatial, and rotational isomers that have identical elemental composition and molecular mass (M), but can vary significantly in their physicochemical characteristics, such as the melting temperature Tm, the Tm:M ratio, and the solubility in water, due to different positions of atoms in the molecule. These parameters differ between cis- and trans-isomers of dicarboxylic acids, spatial monosaccharide isomers, and structural isomers of α-, β-, and γ-amino acids. The stable planar heterocyclic molecules of the major nucleobases comprise four (C, H, N, O) or three (C, H, N) elements and contain a single -C=C bond and two nitrogen atoms in each heterocycle involved in C-N and C=N bonds. They exist as isomeric resonance hybrids of single and double bonds and as a mixture of tautomer forms due to the presence of -C=O and/or -NH2 side groups. They are thermostable, insoluble in water, and exhibit solid-state stability, which is of central importance for DNA molecules as carriers of genetic information. In M-Tm diagrams, proteinogenic amino acids and the corresponding codons are distributed fairly regularly relative to the distinct clusters of purine and pyrimidine bases, reflecting the correspondence between codons and amino acids that was established in different periods of genetic code development. The body of data on the evolution of the genetic code system indicates that the elemental composition and molecular structure of protobiomonomers, and their M, Tm, photostability, and aqueous solubility determined their selection in the emergence of the standard genetic code.
Collapse
|
14
|
Martínez-Giménez JA, Tabares-Seisdedos R. Possible Ancestral Functions of the Genetic and RNA Operational Precodes and the Origin of the Genetic System. ORIGINS LIFE EVOL B 2021; 51:167-183. [PMID: 34097191 DOI: 10.1007/s11084-021-09610-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 05/17/2021] [Indexed: 11/24/2022]
Abstract
The origin of genetic systems is the central problem in the study of the origin of life for which various explanatory hypotheses have been presented. One model suggests that both ancestral transfer ribonucleic acid (tRNA) molecules and primitive ribosomes were originally involved in RNA replication (Campbell 1991). According to this model the early tRNA molecules catalyzed their own self-loading with a trinucleotide complementary to their anticodon triplet, while the primordial ribosome (protoribosome) catalyzed the transfer of these terminal trinucleotides from one tRNA to another tRNA harboring the growing RNA polymer at the 3´-end.Here we present the notion that the anticodon-codon-like pairs presumably located in the acceptor stem of primordial tRNAs (Rodin et al. 1996) (thus being and remaining, after the code and translation origins, the major contributor to the RNA operational code (Schimmel et al. 1993)) might have originally been used for RNA replication rather than translation; these anticodon and acceptor stem triplets would have been involved in accurately loading the 3'-end of tRNAs with a trinucleotide complementary to their anticodon triplet, thus allowing the accurate repair of tRNAs for their use by the protoribosome during RNA replication.We propose that tRNAs could have catalyzed their own trinucleotide self-loading by forming catalytic tRNA dimers which would have had polymerase activity. Therefore, the loading mechanism and its evolution may have been a basic step in the emergence of new genetic mechanisms such as genetic translation. The evolutionary implications of this proposed loading mechanism are also discussed.
Collapse
Affiliation(s)
| | - Rafael Tabares-Seisdedos
- Departamento de Medicina, Facultad de Medicina de Valencia, Universidad de Valencia, Av. Blasco Ibañez 17, 46010, Valencia, Spain.
| |
Collapse
|
15
|
Di Giulio M. The evolutionary stages of the complexity of biological catalysts mark and clarify the phases of the origin of the genetic code: A model for the origin of the reading frame with codons from proto-mRNAs with different frames. Biosystems 2021; 207:104449. [PMID: 34052366 DOI: 10.1016/j.biosystems.2021.104449] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 05/20/2021] [Accepted: 05/22/2021] [Indexed: 11/30/2022]
Abstract
I analyse the origin of the genetic code in the light of the evolution of biological catalysts. I discuss the rudimentary forms that the genetic code assumed in the presence of a catalysis performed by ions or by low molecular weight molecules, such as nucleotide coenzymes. However, it is only with the advent of a mixed polymer made of RNA and peptides - covalently linked - that the genetic code took on a clearer form. Indeed, the first true form of coding appeared. Furthermore, interacting peptidated RNAs promoted an extremely rudimentary form of protein synthesis. This stage evolved into a stage in which proto-mRNAs guided interactions among peptidated RNAs aimed at the synthesis of peptidated RNAs having an active catalytic function. Finally, the invasion of aminoacylated proto-tRNAs with specific amino acids, coming from amino acid metabolism, and recognising only three bases on these proto-mRNAs with reading frames larger than three bases, would have triggered the birth of actual mRNAs, i.e. the origin of codons. All this would have linked the metabolism of amino acids to the origin of mRNAs and therefore to the origin of the organization of the genetic code, as maintained by the coevolution theory of the genetic code.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy; Institute of Biosciences and Bioresources, National Research Council, Via P. Castellino, 111, 80131, Naples, Italy.
| |
Collapse
|
16
|
Villarreal LP, Witzany G. Social Networking of Quasi-Species Consortia drive Virolution via Persistence. AIMS Microbiol 2021; 7:138-162. [PMID: 34250372 PMCID: PMC8255905 DOI: 10.3934/microbiol.2021010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 04/25/2021] [Indexed: 12/31/2022] Open
Abstract
The emergence of cooperative quasi-species consortia (QS-C) thinking from the more accepted quasispecies equations of Manfred Eigen, provides a conceptual foundation from which concerted action of RNA agents can now be understood. As group membership becomes a basic criteria for the emergence of living systems, we also start to understand why the history and context of social RNA networks become crucial for survival and function. History and context of social RNA networks also lead to the emergence of a natural genetic code. Indeed, this QS-C thinking can also provide us with a transition point between the chemical world of RNA replicators and the living world of RNA agents that actively differentiate self from non-self and generate group identity with membership roles. Importantly the social force of a consortia to solve complex, multilevel problems also depend on using opposing and minority functions. The consortial action of social networks of RNA stem-loops subsequently lead to the evolution of cellular organisms representing a tree of life.
Collapse
|
17
|
Seligmann H. First arrived, first served: competition between codons for codon-amino acid stereochemical interactions determined early genetic code assignments. Naturwissenschaften 2020; 107:20. [PMID: 32367155 DOI: 10.1007/s00114-020-01676-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 03/10/2020] [Accepted: 04/05/2020] [Indexed: 12/12/2022]
Abstract
Stereochemical nucleotide-amino acid interactions, in the form of noncovalent nucleotide-amino acid interactions, potentially produced the genetic code's codon-amino acid assignments. Empirical estimates of single nucleotide-amino acid affinities on surfaces and in solution are used to test whether trinucleotide-amino acid affinities determined genetic code assignments pending the principle "first arrived, first served": presumed early amino acids have greater codon-amino acid affinities than ulterior ones. Here, these single nucleotide affinities are used to approximate all 64 × 20 trinucleotide-amino acid affinities. Analyses show that (1) on surfaces, genetic code codon-amino acid assignments tend to match high affinities for the amino acids that integrated earliest the genetic code (according to Wong's metabolic coevolution hypothesis between nucleotides and amino acids) and (2) in solution, the same principle holds for the anticodon-amino acid assignments. Affinity analyses match best genetic code assignments when assuming that trinucleotides competed for amino acids, rather than amino acids for trinucleotides. Codon-amino acid affinities stick better to genetic code assignments than anticodon-amino acid affinities. Presumably, two independent coding systems, on surfaces and in solution, converged, and formed the current translation system. Proto-translation on surfaces by direct codon-amino acid interactions without tRNA-like adaptors coadapted with a system emerging in solution by proto-tRNA anticodon-amino acid interactions. These systems assigned identical or similar cognates to codons on surfaces and to anticodons in solution. Results indicate that a prebiotic metabolism predated genetic code self-organization.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel. .,Faculty of Medicine, Université Grenoble Alpes, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700, La Tronche, France.
| |
Collapse
|
18
|
Kubyshkin V, Budisa N. Anticipating alien cells with alternative genetic codes: away from the alanine world! Curr Opin Biotechnol 2019; 60:242-249. [DOI: 10.1016/j.copbio.2019.05.006] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 05/07/2019] [Indexed: 12/24/2022]
|
19
|
Kubyshkin V, Budisa N. The Alanine World Model for the Development of the Amino Acid Repertoire in Protein Biosynthesis. Int J Mol Sci 2019; 20:ijms20215507. [PMID: 31694194 PMCID: PMC6862034 DOI: 10.3390/ijms20215507] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 11/01/2019] [Accepted: 11/03/2019] [Indexed: 12/13/2022] Open
Abstract
A central question in the evolution of the modern translation machinery is the origin and chemical ethology of the amino acids prescribed by the genetic code. The RNA World hypothesis postulates that templated protein synthesis has emerged in the transition from RNA to the Protein World. The sequence of these events and principles behind the acquisition of amino acids to this process remain elusive. Here we describe a model for this process by following the scheme previously proposed by Hartman and Smith, which suggests gradual expansion of the coding space as GC–GCA–GCAU genetic code. We point out a correlation of this scheme with the hierarchy of the protein folding. The model follows the sequence of steps in the process of the amino acid recruitment and fits well with the co-evolution and coenzyme handle theories. While the starting set (GC-phase) was responsible for the nucleotide biosynthesis processes, in the second phase alanine-based amino acids (GCA-phase) were recruited from the core metabolism, thereby providing a standard secondary structure, the α-helix. In the final phase (GCAU-phase), the amino acids were appended to the already existing architecture, enabling tertiary fold and membrane interactions. The whole scheme indicates strongly that the choice for the alanine core was done at the GCA-phase, while glycine and proline remained rudiments from the GC-phase. We suggest that the Protein World should rather be considered the Alanine World, as it predominantly relies on the alanine as the core chemical scaffold.
Collapse
Affiliation(s)
- Vladimir Kubyshkin
- Department of Chemistry, University of Manitoba, Dysart Rd. 144, Winnipeg, MB R3T 2N2, Canada
- Correspondence: (V.K.); or (N.B.); Tel.: +1-204-474-9321 or +49-30-314-28821 (N.B.)
| | - Nediljko Budisa
- Department of Chemistry, University of Manitoba, Dysart Rd. 144, Winnipeg, MB R3T 2N2, Canada
- Department of Chemistry, Technical University of Berlin, Müller-Breslau-Str. 10, 10623 Berlin, Germany
- Correspondence: (V.K.); or (N.B.); Tel.: +1-204-474-9321 or +49-30-314-28821 (N.B.)
| |
Collapse
|
20
|
Kunnev D, Gospodinov A. Possible Emergence of Sequence Specific RNA Aminoacylation via Peptide Intermediary to Initiate Darwinian Evolution and Code Through Origin of Life. Life (Basel) 2018; 8:E44. [PMID: 30279401 PMCID: PMC6316189 DOI: 10.3390/life8040044] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Revised: 09/30/2018] [Accepted: 09/30/2018] [Indexed: 12/12/2022] Open
Abstract
One of the most intriguing questions in biological science is how life originated on Earth. A large number of hypotheses have been proposed to explain it, each putting an emphasis on different events leading to functional translation and self-sustained system. Here, we propose a set of interactions that could have taken place in the prebiotic environment. According to our hypothesis, hybridization-induced proximity of short aminoacylated RNAs led to the synthesis of peptides of random sequence. We postulate that among these emerged a type of peptide(s) capable of stimulating the interaction between specific RNAs and specific amino acids, which we call "bridge peptide" (BP). We conclude that translation should have emerged at the same time when the standard genetic code begun to evolve due to the stabilizing effect on RNA-peptide complexes with the help of BPs. Ribosomes, ribozymes, and the enzyme-directed RNA replication could co-evolve within the same period, as logical outcome of RNA-peptide world without the need of RNA only self-sustained step.
Collapse
Affiliation(s)
- Dimiter Kunnev
- Roswell Park Cancer Institute, Department of Molecular & Cellular Biology, Buffalo, NY 14263, USA.
| | - Anastas Gospodinov
- Roumen Tsanev Institute of Molecular Biology, Bulgarian Academy of Sciences, Acad. G. Bonchev Str. 21, Sofia 1113, Bulgaria.
| |
Collapse
|
21
|
Błażej P, Wnętrzak M, Mackiewicz D, Mackiewicz P. Optimization of the standard genetic code according to three codon positions using an evolutionary algorithm. PLoS One 2018; 13:e0201715. [PMID: 30092017 PMCID: PMC6084934 DOI: 10.1371/journal.pone.0201715] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 07/21/2018] [Indexed: 12/28/2022] Open
Abstract
Many biological systems are typically examined from the point of view of adaptation to certain conditions or requirements. One such system is the standard genetic code (SGC), which generally minimizes the cost of amino acid replacements resulting from mutations or mistranslations. However, no full consensus has been reached on the factors that caused the evolution of this feature. One of the hypotheses suggests that code optimality was directly selected as an advantage to preserve information about encoded proteins. An important feature that should be considered when studying the SGC is the different roles of the three codon positions. Therefore, we investigated the robustness of this code regarding the cost of amino acid replacements resulting from substitutions in these positions separately and the sum of these costs. We applied a modified evolutionary algorithm and included four models of the genetic code assuming various restrictions on its structure. The SGC was compared both with the codes that minimize the objective function and those that maximize it. This approach allowed us to place the SGC in the global space of possible codes, which is a more appropriate and unbiased comparison than that with randomly generated codes because they are characterized by relatively uniform amino acid assignments to codons. The SGC appeared to be well optimized at the global scale, but its individual positions were not fully optimized because there were codes that were optimized for only one codon position and simultaneously outperformed the SGC at the other positions. We also found that different code structures may lead to the same optimality and that random codes can show a tendency to minimize costs under some of the genetic code models. Our results suggest that the optimality of SGC could be a by-product of other processes.
Collapse
Affiliation(s)
- Paweł Błażej
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Małgorzata Wnętrzak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
- * E-mail:
| |
Collapse
|
22
|
Di Giulio M. A discriminative test among the different theories proposed to explain the origin of the genetic code: The coevolution theory finds additional support. Biosystems 2018; 169-170:1-4. [DOI: 10.1016/j.biosystems.2018.05.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Revised: 04/26/2018] [Accepted: 05/07/2018] [Indexed: 11/29/2022]
|
23
|
Palacios-Pérez M, Andrade-Díaz F, José MV. A Proposal of the Ur-proteome. ORIGINS LIFE EVOL B 2018; 48:245-258. [PMID: 29127550 DOI: 10.1007/s11084-017-9553-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 10/24/2017] [Indexed: 11/25/2022]
Abstract
Herein we outline a plausible proteome, encoded by assuming a primeval RNY genetic code. We unveil the primeval phenotype by using only the RNA genotype; it means that we recovered the most ancestral proteome, mostly made of the 8 amino acids encoded by RNY triplets. By looking at those fragments, it is noticeable that they are positioned, not at catalytic sites, but in the cofactor binding sites. It implies that the stabilization of a molecule appeared long before its catalytic activity, and therefore the Ur-proteome comprised a set of proteins modules that corresponded to Cofactor Stabilizing Binding Sites (CSBSs), which we call the primitive bindome. With our method, we reconstructed the structures of the "first protein modules" that Sobolevsky and Trifonov (2006) found by using only RMSD. We also examine the probable cofactors that bound to them. We discuss the notion of CSBSs as the first proteins modules in progenotes in the context of several proposals about the primitive forms of life.
Collapse
Affiliation(s)
- Miryam Palacios-Pérez
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico
| | - Fernando Andrade-Díaz
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico.
| |
Collapse
|
24
|
The evolution of the genetic code: Impasses and challenges. Biosystems 2018; 164:217-225. [DOI: 10.1016/j.biosystems.2017.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 10/06/2017] [Accepted: 10/09/2017] [Indexed: 01/17/2023]
|
25
|
Triplet-Based Codon Organization Optimizes the Impact of Synonymous Mutation on Nucleic Acid Molecular Dynamics. J Mol Evol 2018; 86:91-102. [PMID: 29344693 PMCID: PMC5846835 DOI: 10.1007/s00239-018-9828-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 01/06/2018] [Indexed: 11/22/2022]
Abstract
Since the elucidation of the genetic code almost 50 years ago, many nonrandom aspects of its codon organization remain only partly resolved. Here, we investigate the recent hypothesis of ‘dual-use’ codons which proposes that in addition to allowing adjustment of codon optimization to tRNA abundance, the degeneracy in the triplet-based genetic code also multiplexes information regarding DNA’s helical shape and protein-binding dynamics while avoiding interference with other protein-level characteristics determined by amino acid properties. How such structural optimization of the code within eukaryotic chromatin could have arisen from an RNA world is a mystery, but would imply some preadaptation in an RNA context. We analyzed synonymous (protein-silent) and nonsynonymous (protein-altering) mutational impacts on molecular dynamics in 13823 identically degenerate alternative codon reorganizations, defined by codon transitions in 7680 GPU-accelerated molecular dynamic simulations of implicitly and explicitly solvated double-stranded aRNA and bDNA structures. When compared to all possible alternative codon assignments, the standard genetic code minimized the impact of synonymous mutations on the random atomic fluctuations and correlations of carbon backbone vector trajectories while facilitating the specific movements that contribute to DNA polymer flexibility. This trend was notably stronger in the context of RNA supporting the idea that dual-use codon optimization and informational multiplexing in DNA resulted from the preadaptation of the RNA duplex to resist changes to thermostability. The nonrandom and divergent molecular dynamics of synonymous mutations also imply that the triplet-based code may have resulted from adaptive functional expansion enabling a primordial doublet code to multiplex gene regulatory information via the shape and charge of the minor groove.
Collapse
|
26
|
Granold M, Hajieva P, Toşa MI, Irimie FD, Moosmann B. Modern diversification of the amino acid repertoire driven by oxygen. Proc Natl Acad Sci U S A 2018; 115:41-46. [PMID: 29259120 PMCID: PMC5776824 DOI: 10.1073/pnas.1717100115] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
All extant life employs the same 20 amino acids for protein biosynthesis. Studies on the number of amino acids necessary to produce a foldable and catalytically active polypeptide have shown that a basis set of 7-13 amino acids is sufficient to build major structural elements of modern proteins. Hence, the reasons for the evolutionary selection of the current 20 amino acids out of a much larger available pool have remained elusive. Here, we have analyzed the quantum chemistry of all proteinogenic and various prebiotic amino acids. We find that the energetic HOMO-LUMO gap, a correlate of chemical reactivity, becomes incrementally closer in modern amino acids, reaching the level of specialized redox cofactors in the late amino acids tryptophan and selenocysteine. We show that the arising prediction of a higher reactivity of the more recently added amino acids is correct as regards various free radicals, particularly oxygen-derived peroxyl radicals. Moreover, we demonstrate an immediate survival benefit conferred by the enhanced redox reactivity of the modern amino acids tyrosine and tryptophan in oxidatively stressed cells. Our data indicate that in demanding building blocks with more versatile redox chemistry, biospheric molecular oxygen triggered the selective fixation of the last amino acids in the genetic code. Thus, functional rather than structural amino acid properties were decisive during the finalization of the universal genetic code.
Collapse
Affiliation(s)
- Matthias Granold
- Evolutionary Biochemistry and Redox Medicine, Institute for Pathobiochemistry, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany
| | - Parvana Hajieva
- Cellular Adaptation Group, Institute for Pathobiochemistry, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany
| | - Monica Ioana Toşa
- Group of Biocatalysis and Biotransformations, Faculty of Chemistry and Chemical Engineering, Babeş-Bolyai University, Cluj-Napoca 400028, Romania
| | - Florin-Dan Irimie
- Group of Biocatalysis and Biotransformations, Faculty of Chemistry and Chemical Engineering, Babeş-Bolyai University, Cluj-Napoca 400028, Romania
| | - Bernd Moosmann
- Evolutionary Biochemistry and Redox Medicine, Institute for Pathobiochemistry, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany;
| |
Collapse
|
27
|
Di Giulio M. The aminoacyl-tRNA synthetases had only a marginal role in the origin of the organization of the genetic code: Evidence in favor of the coevolution theory. J Theor Biol 2017; 432:14-24. [DOI: 10.1016/j.jtbi.2017.08.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Revised: 08/01/2017] [Accepted: 08/03/2017] [Indexed: 10/19/2022]
|
28
|
Frozen Accident Pushing 50: Stereochemistry, Expansion, and Chance in the Evolution of the Genetic Code. Life (Basel) 2017; 7:life7020022. [PMID: 28545255 PMCID: PMC5492144 DOI: 10.3390/life7020022] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Revised: 05/19/2017] [Accepted: 05/20/2017] [Indexed: 12/31/2022] Open
Abstract
Nearly 50 years ago, Francis Crick propounded the frozen accident scenario for the evolution of the genetic code along with the hypothesis that the early translation system consisted primarily of RNA. Under the frozen accident perspective, the code is universal among modern life forms because any change in codon assignment would be highly deleterious. The frozen accident can be considered the default theory of code evolution because it does not imply any specific interactions between amino acids and the cognate codons or anticodons, or any particular properties of the code. The subsequent 49 years of code studies have elucidated notable features of the standard code, such as high robustness to errors, but failed to develop a compelling explanation for codon assignments. In particular, stereochemical affinity between amino acids and the cognate codons or anticodons does not seem to account for the origin and evolution of the code. Here, I expand Crick’s hypothesis on RNA-only translation system by presenting evidence that this early translation already attained high fidelity that allowed protein evolution. I outline an experimentally testable scenario for the evolution of the code that combines a distinct version of the stereochemical hypothesis, in which amino acids are recognized via unique sites in the tertiary structure of proto-tRNAs, rather than by anticodons, expansion of the code via proto-tRNA duplication, and the frozen accident.
Collapse
|
29
|
Yarus M. The Genetic Code and RNA-Amino Acid Affinities. Life (Basel) 2017; 7:life7020013. [PMID: 28333103 PMCID: PMC5492135 DOI: 10.3390/life7020013] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Revised: 03/16/2017] [Accepted: 03/17/2017] [Indexed: 11/22/2022] Open
Abstract
A significant part of the genetic code likely originated via a chemical interaction, which should be experimentally verifiable. One possible verification relates bound amino acids (or perhaps their activated congeners) and ribonucleotide sequences within cognate RNA binding sites. To introduce this interaction, I first summarize how amino acids function as targets for RNA binding. Then the experimental method for selecting relevant RNA binding sites is characterized. The selection method’s characteristics are related to the investigation of the RNA binding site model treated at the outset. Finally, real binding sites from selection and also from extant natural RNAs (for example, the Sulfobacillus guanidinium riboswitch) are connected to the genetic code, and by extension, to the evolutionary progression that produced the code. During this process, peptides may have been produced directly on an instructive amino acid binding RNA (a DRT; Direct RNA Template). Combination of observed stereochemical selectivity with adaptation and co-evolutionary refinement is logically required, and also potentially sufficient, to create the striking order conserved throughout the present coding table.
Collapse
Affiliation(s)
- Michael Yarus
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO 80309-0347, USA.
| |
Collapse
|
30
|
Some pungent arguments against the physico-chemical theories of the origin of the genetic code and corroborating the coevolution theory. J Theor Biol 2017; 414:1-4. [DOI: 10.1016/j.jtbi.2016.11.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Revised: 10/26/2016] [Accepted: 11/16/2016] [Indexed: 10/20/2022]
|
31
|
Abstract
Several theories for the origin of life have gained widespread acceptance, led by primordial soup, chemical evolution, metabolism first, and the RNA world. However, while new and existing theories often address a key step, there is less focus on a comprehensive abiogenic continuum leading to the last universal common ancestor. Herein, I present the "minimotif synthesis" hypothesis unifying select origin of life theories with new and revised steps. The hypothesis is based on first principles, on the concept of selection over long time scales, and on a stepwise progression toward complexity. The major steps are the thermodynamically-driven origination of extant molecular specificity emerging from primordial soup leading to the rise of peptide catalysts, and a cyclic feed-forward catalytic diversification of compound and peptides in the primordial soup. This is followed by degenerate, semi-partially conservative peptide replication to pass on catalytic knowledge to progeny protocells. At some point during this progression, the emergence of RNA and selection could drive the separation of catalytic and genetic functions, allowing peptides and proteins to permeate the catalytic space, and RNA to encode higher fidelity information transfer. Translation may have emerged from RNA template driven organization and successive ligation of activated amino acids as a predecessor to translation.
Collapse
Affiliation(s)
- Martin R Schiller
- Nevada Institute of Personalized Medicine and School of Life Sciences, University of Nevada, Las Vegas, Nevada, USA
| |
Collapse
|
32
|
Di Giulio M. The lack of foundation in the mechanism on which are based the physico-chemical theories for the origin of the genetic code is counterposed to the credible and natural mechanism suggested by the coevolution theory. J Theor Biol 2016; 399:134-40. [PMID: 27067244 DOI: 10.1016/j.jtbi.2016.04.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Revised: 03/29/2016] [Accepted: 04/01/2016] [Indexed: 11/25/2022]
Abstract
I analyze the mechanism on which are based the majority of theories that put to the center of the origin of the genetic code the physico-chemical properties of amino acids. As this mechanism is based on excessive mutational steps, I conclude that it could not have been operative or if operative it would not have allowed a full realization of predictions of these theories, because this mechanism contained, evidently, a high indeterminacy. I make that disapproving the four-column theory of the origin of the genetic code (Higgs, 2009) and reply to the criticism that was directed towards the coevolution theory of the origin of the genetic code. In this context, I suggest a new hypothesis that clarifies the mechanism by which the domains of codons of the precursor amino acids would have evolved, as predicted by the coevolution theory. This mechanism would have used particular elongation factors that would have constrained the evolution of all amino acids belonging to a given biosynthetic family to the progenitor pre-tRNA, that for first recognized, the first codons that evolved in a certain codon domain of a determined precursor amino acid. This happened because the elongation factors recognized two characteristics of the progenitor pre-tRNAs of precursor amino acids, which prevented the elongation factors from recognizing the pre-tRNAs belonging to biosynthetic families of different precursor amino acids. Finally, I analyze by means of Fisher's exact test, the distribution, within the genetic code, of the biosynthetic classes of amino acids and the ones of polarity values of amino acids. This analysis would seem to support the biosynthetic classes of amino acids over the ones of polarity values, as the main factor that led to the structuring of the genetic code, with the physico-chemical properties of amino acids playing only a subsidiary role in this evolution. As a whole, the full analysis brings to the conclusion that the coevolution theory of the origin of the genetic code would be a theory highly corroborated.
Collapse
Affiliation(s)
- Massimo Di Giulio
- Early Evolution of Life Laboratory, Institute of Biosciences and Bioresources, CNR, Via P. Castellino, 111, 80131 Naples, Italy.
| |
Collapse
|
33
|
Coevolution Theory of the Genetic Code at Age Forty: Pathway to Translation and Synthetic Life. Life (Basel) 2016; 6:life6010012. [PMID: 26999216 PMCID: PMC4810243 DOI: 10.3390/life6010012] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2016] [Revised: 02/26/2016] [Accepted: 03/04/2016] [Indexed: 11/17/2022] Open
Abstract
The origins of the components of genetic coding are examined in the present study. Genetic information arose from replicator induction by metabolite in accordance with the metabolic expansion law. Messenger RNA and transfer RNA stemmed from a template for binding the aminoacyl-RNA synthetase ribozymes employed to synthesize peptide prosthetic groups on RNAs in the Peptidated RNA World. Coevolution of the genetic code with amino acid biosynthesis generated tRNA paralogs that identify a last universal common ancestor (LUCA) of extant life close to Methanopyrus, which in turn points to archaeal tRNA introns as the most primitive introns and the anticodon usage of Methanopyrus as an ancient mode of wobble. The prediction of the coevolution theory of the genetic code that the code should be a mutable code has led to the isolation of optional and mandatory synthetic life forms with altered protein alphabets.
Collapse
|
34
|
Abstract
The impressive body of work on the major evolutionary transitions in the last 20 y calls for a reconstruction of the theory although a 2D account (evolution of informational systems and transitions in individuality) remains. Significant advances include the concept of fraternal and egalitarian transitions (lower-level units like and unlike, respectively). Multilevel selection, first without, then with, the collectives in focus is an important explanatory mechanism. Transitions are decomposed into phases of origin, maintenance, and transformation (i.e., further evolution) of the higher level units, which helps reduce the number of transitions in the revised list by two so that it is less top-heavy. After the transition, units show strong cooperation and very limited realized conflict. The origins of cells, the emergence of the genetic code and translation, the evolution of the eukaryotic cell, multicellularity, and the origin of human groups with language are reconsidered in some detail in the light of new data and considerations. Arguments are given why sex is not in the revised list as a separate transition. Some of the transitions can be recursive (e.g., plastids, multicellularity) or limited (transitions that share the usual features of major transitions without a massive phylogenetic impact, such as the micro- and macronuclei in ciliates). During transitions, new units of reproduction emerge, and establishment of such units requires high fidelity of reproduction (as opposed to mere replication).
Collapse
Affiliation(s)
- Eörs Szathmáry
- Center for the Conceptual Foundations of Science, Parmenides Foundation, D-82049 Munich, Germany; Department of Plant Systematics, Ecology and Theoretical Biology, Biological Institute, Eötvös University, H-1117 Budapest, Hungary; and MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, H-1117 Budapest, Hungary
| |
Collapse
|
35
|
|
36
|
RNA editing and modifications of RNAs might have favoured the evolution of the triplet genetic code from an ennuplet code. J Theor Biol 2014; 359:1-5. [DOI: 10.1016/j.jtbi.2014.05.037] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Revised: 05/21/2014] [Accepted: 05/27/2014] [Indexed: 11/24/2022]
|
37
|
Are proposed early genetic codes capable of encoding viable proteins? J Mol Evol 2014; 78:263-74. [PMID: 24826911 DOI: 10.1007/s00239-014-9622-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Accepted: 04/28/2014] [Indexed: 01/10/2023]
Abstract
Proteins are elaborate biopolymers balancing between contradicting intrinsic propensities to fold, aggregate, or remain disordered. Assessing their primary structural preferences observable without evolutionary optimization has been reinforced by the recent identification of de novo proteins that have emerged from previously non-coding sequences. In this paper we investigate structural preferences of hypothetical proteins translated from random DNA segments using the standard genetic code and three of its proposed evolutionarily predecessor models encoding 10, 6, and 4 amino acids, respectively. Our only main assumption is that the disorder, aggregation, and transmembrane helix predictions used are able to reflect the differences in the trends of the protein sets investigated. We found that the 10-residue code encodes proteins that resemble modern proteins in their predicted structural properties. All of the investigated early genetic codes give rise to proteins with enhanced disorder and diminished aggregation propensities. Our results suggest that an ancestral genetic code similar to the proposed 10-residue one is capable of encoding functionally diverse proteins but these might have existed under conditions different from today's common physiological ones. The existence of a protein functional repertoire for the investigated earlier stages which is quite distinct as it is today can be deduced from the presented results.
Collapse
|
38
|
The protein invasion: a broad review on the origin of the translational system. J Mol Evol 2013; 77:185-96. [PMID: 24145863 DOI: 10.1007/s00239-013-9592-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 10/12/2013] [Indexed: 12/25/2022]
Abstract
Translation, coded peptide synthesis, arguably exists at the heart of modern cellular life. By orchestrating an incredibly complex interaction between tRNAs, mRNAs, aaRSs, the ribosome, and numerous other small molecules, the translational system allows the interpretation of data in the form of DNA to create massively complex proteins which control and enact almost every cellular function. A natural question then, is how did this system evolve? Here we present a broad review of the existing theories of the last two decades on the origin of the translational system. We attempt to synthesize the wide variety of ideas as well as organize them into modular components, addressing the evolution of the peptide-RNA interaction, tRNA, mRNA, the ribosome, and the first proteins separately. We hope to provide both a comprehensive overview of the literature as well as a framework for future discussions and novel theories.
Collapse
|
39
|
Di Giulio M. The Origin of the Genetic Code: Matter of Metabolism or Physicochemical Determinism? J Mol Evol 2013; 77:131-3. [DOI: 10.1007/s00239-013-9593-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 10/18/2013] [Indexed: 12/27/2022]
|
40
|
Zenkin N. Hypothesis: Emergence of Translation as a Result of RNA Helicase Evolution. J Mol Evol 2012; 74:249-56. [DOI: 10.1007/s00239-012-9503-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2011] [Accepted: 04/13/2012] [Indexed: 10/28/2022]
|
41
|
de Vladar HP. Amino acid fermentation at the origin of the genetic code. Biol Direct 2012; 7:6. [PMID: 22325238 PMCID: PMC3376031 DOI: 10.1186/1745-6150-7-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2011] [Accepted: 02/10/2012] [Indexed: 01/15/2023] Open
Abstract
There is evidence that the genetic code was established prior to the existence of proteins, when metabolism was powered by ribozymes. Also, early proto-organisms had to rely on simple anaerobic bioenergetic processes. In this work I propose that amino acid fermentation powered metabolism in the RNA world, and that this was facilitated by proto-adapters, the precursors of the tRNAs. Amino acids were used as carbon sources rather than as catalytic or structural elements. In modern bacteria, amino acid fermentation is known as the Stickland reaction. This pathway involves two amino acids: the first undergoes oxidative deamination, and the second acts as an electron acceptor through reductive deamination. This redox reaction results in two keto acids that are employed to synthesise ATP via substrate-level phosphorylation. The Stickland reaction is the basic bioenergetic pathway of some bacteria of the genus Clostridium. Two other facts support Stickland fermentation in the RNA world. First, several Stickland amino acid pairs are synthesised in abiotic amino acid synthesis. This suggests that amino acids that could be used as an energy substrate were freely available. Second, anticodons that have complementary sequences often correspond to amino acids that form Stickland pairs. The main hypothesis of this paper is that pairs of complementary proto-adapters were assigned to Stickland amino acids pairs. There are signatures of this hypothesis in the genetic code. Furthermore, it is argued that the proto-adapters formed double strands that brought amino acid pairs into proximity to facilitate their mutual redox reaction, structurally constraining the anticodon pairs that are assigned to these amino acid pairs. Significance tests which randomise the code are performed to study the extent of the variability of the energetic (ATP) yield. Random assignments can lead to a substantial yield of ATP and maintain enough variability, thus selection can act and refine the assignments into a proto-code that optimises the energetic yield. Monte Carlo simulations are performed to evaluate the establishment of these simple proto-codes, based on amino acid substitutions and codon swapping. In all cases, donor amino acids are assigned to anticodons composed of U+G, and have low redundancy (1-2 codons), whereas acceptor amino acids are assigned to the the remaining codons. These bioenergetic and structural constraints allow for a metabolic role for amino acids before their co-option as catalyst cofactors. Reviewers: this article was reviewed by Prof. William Martin, Prof. Eörs Szathmáry (nominated by Dr. Gáspár Jékely) and Dr. Ádám Kun (nominated by Dr. Sandor Pongor)
Collapse
|
42
|
Small cofactors may assist protein emergence from RNA world: clues from RNA-protein complexes. PLoS One 2011; 6:e22494. [PMID: 21789260 PMCID: PMC3138788 DOI: 10.1371/journal.pone.0022494] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2011] [Accepted: 06/24/2011] [Indexed: 11/19/2022] Open
Abstract
It is now widely accepted that at an early stage in the evolution of life an RNA world arose, in which RNAs both served as the genetic material and catalyzed diverse biochemical reactions. Then, proteins have gradually replaced RNAs because of their superior catalytic properties in catalysis over time. Therefore, it is important to investigate how primitive functional proteins emerged from RNA world, which can shed light on the evolutionary pathway of life from RNA world to the modern world. In this work, we proposed that the emergence of most primitive functional proteins are assisted by the early primitive nucleotide cofactors, while only a minority are induced directly by RNAs based on the analysis of RNA-protein complexes. Furthermore, the present findings have significant implication for exploring the composition of primitive RNA, i.e., adenine base as principal building blocks.
Collapse
|
43
|
Rodin AS, Szathmáry E, Rodin SN. On origin of genetic code and tRNA before translation. Biol Direct 2011; 6:14. [PMID: 21342520 PMCID: PMC3050877 DOI: 10.1186/1745-6150-6-14] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2010] [Accepted: 02/22/2011] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. RESULTS The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. CONCLUSION Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony.
Collapse
Affiliation(s)
- Andrei S Rodin
- Human Genetics Center, School of Public Health, University of Texas, Houston, TX 77225, USA
- Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014 Budapest, Hungary
| | - Eörs Szathmáry
- Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014 Budapest, Hungary
- Parmenides Center for the Study of Thinking, Kirchplatz 1, D-82049 Munich/Pullach, Germany
- Institute of Biology, Eötvös University, 1c Pázmány Péter sétány, H-1117 Budapest, Hungary
| | - Sergei N Rodin
- Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014 Budapest, Hungary
- Department of Molecular and Cellular Biology, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA
| |
Collapse
|
44
|
Ma W. The scenario on the origin of translation in the RNA world: in principle of replication parsimony. Biol Direct 2010; 5:65. [PMID: 21110883 PMCID: PMC3002371 DOI: 10.1186/1745-6150-5-65] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2010] [Accepted: 11/27/2010] [Indexed: 01/06/2023] Open
Abstract
Background It is now believed that in the origin of life, proteins should have been "invented" in an RNA world. However, due to the complexity of a possible RNA-based proto-translation system, this evolving process seems quite complicated and the associated scenario remains very blurry. Considering that RNA can bind amino acids with specificity, it has been reasonably supposed that initial peptides might have been synthesized on "RNA templates" containing multiple amino acid binding sites. This "Direct RNA Template (DRT)" mechanism is attractive because it should be the simplest mechanism for RNA to synthesize peptides, thus very likely to have been adopted initially in the RNA world. Then, how this mechanism could develop into a proto-translation system mechanism is an interesting problem. Presentation of the hypothesis Here an explanation to this problem is shown considering the principle of "replication parsimony" --- genetic information tends to be utilized in a parsimonious way under selection pressure, due to its replication cost (e.g., in the RNA world, nucleotides and ribozymes for RNA replication). Because a DRT would be quite long even for a short peptide, its replication cost would be great. Thus the diversity and the length of functional peptides synthesized by the DRT mechanism would be seriously limited. Adaptors (proto-tRNAs) would arise to allow a DRT's complementary strand (called "C-DRT" here) to direct the synthesis of the same peptide synthesized by the DRT itself. Because the C-DRT is a necessary part in the DRT's replication, fewer turns of the DRT's replication would be needed to synthesize definite copies of the functional peptide, thus saving the replication cost. Acting through adaptors, C-DRTs could transform into much shorter templates (called "proto-mRNAs" here) and substitute the role of DRTs, thus significantly saving the replication cost. A proto-rRNA corresponding to the small subunit rRNA would then emerge to aid the binding of proto-tRNAs and proto-mRNAs, allowing the reduction of base pairs between them (ultimately resulting in the triplet anticodon/codon pair), thus further saving the replication cost. In this context, the replication cost saved would allow the appearance of more and longer functional peptides and, finally, proteins. The hypothesis could be called "DRT-RP" ("RP" for "replication parsimony"). Testing the hypothesis The scenario described here is open for experimental work at some key scenes, including the compact DRT mechanism, the development of adaptors from aa-aptamers, the synthesis of peptides by proto-tRNAs and proto-mRNAs without the participation of proto-rRNAs, etc. Interestingly, a recent computer simulation study has demonstrated the plausibility of one of the evolving processes driven by replication parsimony in the scenario. Implication of the hypothesis An RNA-based proto-translation system could arise gradually from the DRT mechanism according to the principle of "replication parsimony" --- to save the replication cost of RNA templates for functional peptides. A surprising side deduction along the logic of the hypothesis is that complex, biosynthetic amino acids might have entered the genetic code earlier than simple, prebiotic amino acids, which is opposite to the common sense. Overall, the present discussion clarifies the blurry scenario concerning the origin of translation with a major clue, which shows vividly how life could "manage" to exploit potential chemical resources in nature, eventually in an efficient way over evolution. Reviewers This article was reviewed by Eugene V. Koonin, Juergen Brosius, and Arcady Mushegian.
Collapse
Affiliation(s)
- Wentao Ma
- College of Life Sciences, Wuhan University, Wuhan 430072, PR China.
| |
Collapse
|
45
|
Stability of the genetic code and optimal parameters of amino acids. J Theor Biol 2010; 269:57-63. [PMID: 20955716 DOI: 10.1016/j.jtbi.2010.10.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2010] [Revised: 09/20/2010] [Accepted: 10/12/2010] [Indexed: 11/24/2022]
Abstract
The standard genetic code is known to be much more efficient in minimizing adverse effects of misreading errors and one-point mutations in comparison with a random code having the same structure, i.e. the same number of codons coding for each particular amino acid. We study the inverse problem, how the code structure affects the optimal physico-chemical parameters of amino acids ensuring the highest stability of the genetic code. It is shown that the choice of two or more amino acids with given properties determines unambiguously all the others. In this sense the code structure determines strictly the optimal parameters of amino acids or the corresponding scales may be derived directly from the genetic code. In the code with the structure of the standard genetic code the resulting values for hydrophobicity obtained in the scheme "leave one out" and in the scheme with fixed maximum and minimum parameters correlate significantly with the natural scale. The comparison of the optimal and natural parameters allows assessing relative impact of physico-chemical and error-minimization factors during evolution of the genetic code. As the resulting optimal scale depends on the choice of amino acids with given parameters, the technique can also be applied to testing various scenarios of the code evolution with increasing number of codified amino acids. Our results indicate the co-evolution of the genetic code and physico-chemical properties of recruited amino acids.
Collapse
|
46
|
Active centrum hypothesis: the origin of chiral homogeneity and the RNA-world. Biosystems 2010; 103:1-12. [PMID: 20851736 DOI: 10.1016/j.biosystems.2010.09.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2009] [Revised: 09/06/2010] [Accepted: 09/06/2010] [Indexed: 11/22/2022]
Abstract
I propose a hypothesis on the origin of chiral homogeneity of bio-molecules based on chiral catalysis. The first chiral active centre may have formed on the surface of complexes comprising metal ions, amino acids, other coenzymes and oligomers (short RNAs). The complexes must have been dominated by short RNAs capable of self-reproduction with ligation. Most of the first complexes may have catalysed the production of nucleotides. A basic assumption is that such complexes can be assembled from their components almost freely, in a huge variety of combinations. This assumption implies that "a few" components can constitute "a huge" number of active centre types. Moreover, an experiment is proposed to test the performance of such complexes in vitro. If the complexes were built up freely from their elements, then Darwinian evolution would operate on the assembly mechanism of complexes. For the production of complexes, first their parts had to appear by forming a proper three-dimensional structure. Three possible re-building mechanisms of the proper geometric structure of complexes are proposed. First, the integration of RNA parts of complexes was assisted presumably by a pre-intron. Second, the binding of RNA parts of a complex may give rise to a "polluted" RNA world. Third, the pairing of short RNA parts and their geometric conformation may have been supported by a pre-genetic code. Finally, an evolutionary step-by-step scenario of the origin of homochirality and a "polluted" RNA world is also introduced based on the proposed combinatorial complex chemistry. Homochirality is evolved by Darwinian selection whenever the efficiency of the reflexive autocatalysis of a dynamical combinatorial library increases with the homochirality of the active centres of reactions cascades and the homochirality of the elements of the dynamical combinatorial library. Moreover, the potential importance of phospholipid membrane is also discussed.
Collapse
|
47
|
Seaborg DM. Was Wright right? The canonical genetic code is an empirical example of an adaptive peak in nature; deviant genetic codes evolved using adaptive bridges. J Mol Evol 2010; 71:87-99. [PMID: 20711776 PMCID: PMC2924497 DOI: 10.1007/s00239-010-9373-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 07/02/2010] [Indexed: 11/30/2022]
Abstract
The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of "adaptive bridges," neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept.
Collapse
Affiliation(s)
- David M Seaborg
- Foundation for Biological Conservation and Research, 1888 Pomar Way, Walnut Creek, CA 94598-1424, USA.
| |
Collapse
|
48
|
Widmann J, Harris JK, Lozupone C, Wolfson A, Knight R. Stable tRNA-based phylogenies using only 76 nucleotides. RNA (NEW YORK, N.Y.) 2010; 16:1469-77. [PMID: 20558546 PMCID: PMC2905747 DOI: 10.1261/rna.726010] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2007] [Accepted: 04/16/2010] [Indexed: 05/29/2023]
Abstract
tRNAs are among the most ancient, highly conserved sequences on earth, but are often thought to be poor phylogenetic markers because they are short, often subject to horizontal gene transfer, and easily change specificity. Here we use an algorithm now commonly used in microbial ecology, UniFrac, to cluster 175 genomes spanning all three domains of life based on the phylogenetic relationships among their complete tRNA pools. We find that the overall pattern of similarities and differences in the tRNA pools recaptures universal phylogeny to a remarkable extent, and that the resulting tree is similar to the distribution of bootstrapped rRNA trees from the same genomes. In contrast, the trees derived from tRNAs of identical specificity or of individual isoacceptors generally produced trees of lower quality. However, some tRNA isoacceptors were very good predictors of the overall pattern of organismal evolution. These results show that UniFrac can extract meaningful biological patterns from even phylogenies with high level of statistical inaccuracy and horizontal gene transfer, and that, overall, the pattern of tRNA evolution tracks universal phylogeny and provides a background against which we can test hypotheses about the evolution of individual isoacceptors.
Collapse
Affiliation(s)
- Jeremy Widmann
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309, USA
| | | | | | | | | |
Collapse
|
49
|
Abstract
Alterations to the genetic code--codon reassignments--have occurred many times in life's history, despite the fact that genomes are coadapted to their genetic codes and therefore alterations are likely to be maladaptive. A potential mechanism for adaptive codon reassignment, which could trigger either a temporary period of codon ambiguity or a permanent genetic code change, is the reactivation of a pseudogene by a nonsense suppressor mutant transfer RNA. I examine the population genetics of each stage of this process and find that pseudogene rescue is plausible and also readily explains some features of extant variability in genetic codes.
Collapse
Affiliation(s)
- L J Johnson
- School of Biological Sciences, University of Reading, Reading, UK.
| |
Collapse
|
50
|
Han DX, Wang HY, Ji ZL, Hu AF, Zhao YF. Amino acid homochirality may be linked to the origin of phosphate-based life. J Mol Evol 2010; 70:572-82. [PMID: 20506019 DOI: 10.1007/s00239-010-9353-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2009] [Accepted: 05/06/2010] [Indexed: 10/19/2022]
Abstract
Phosphorylation has to have been one of the key events in prebiotic evolution on earth. In this article, the emergence of phosphoryl amino acid 5'-nucleosides having a P-N bond is described as a model of the origin of amino acid homochirality and Genetic Code. It is proposed that the intramolecular interaction between the nucleotide base and the amino acid side-chain influences the stability of particular amino acid 5'-nucleotides, and the interaction also selects for the chirality of amino acids. The differences between L: - and D: -conformation energies (DeltaE (conf)) are evaluated by DFT methods at the B3LYP/6-31G(d) level. Although, as expected, these DeltaE (conf) values are not large, they do give differences in energy that can distinguish the chirality of amino acids. Based on our calculations, the chiral selection of the earliest amino acids for L: -enantiomers seems to be determined by a clear stereochemical/physicochemical relationship. As later amino acids developed from the earliest amino acids, we deduce that the chirality of these late amino acids was inherited from that of the early amino acids. This idea reaches far back into evolution, and we hope that it will guide further experiments in this area.
Collapse
Affiliation(s)
- Da Xiong Han
- Department of Pharmacy, Medical College of Xiamen University, Xiamen, 361005, China.
| | | | | | | | | |
Collapse
|