1
|
Di Giulio M. Theories of the origin of the genetic code: Strong corroboration for the coevolution theory. Biosystems 2024; 239:105217. [PMID: 38663520 DOI: 10.1016/j.biosystems.2024.105217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/29/2024]
Abstract
I analyzed all the theories and models of the origin of the genetic code, and over the years, I have considered the main suggestions that could explain this origin. The conclusion of this analysis is that the coevolution theory of the origin of the genetic code is the theory that best captures the majority of observations concerning the organization of the genetic code. In other words, the biosynthetic relationships between amino acids would have heavily influenced the origin of the organization of the genetic code, as supported by the coevolution theory. Instead, the presence in the genetic code of physicochemical properties of amino acids, which have also been linked to the physicochemical properties of anticodons or codons or bases by stereochemical and physicochemical theories, would simply be the result of natural selection. More explicitly, I maintain that these correlations between codons, anticodons or bases and amino acids are in fact the result not of a real correlation between amino acids and codons, for example, but are only the effect of the intervention of natural selection. Specifically, in the genetic code table we expect, for example, that the most similar codons - that is, those that differ by only one base - will have more similar physicochemical properties. Therefore, the 64 codons of the genetic code table ordered in a certain way would also represent an ordering of some of their physicochemical properties. Now, a study aimed at clarifying which physicochemical property of amino acids has influenced the allocation of amino acids in the genetic code has established that the partition energy of amino acids has played a role decisive in this. Indeed, under some conditions, the genetic code was found to be approximately 98% optimized on its columns. In this same work, it was shown that this was most likely the result of the action of natural selection. If natural selection had truly allocated the amino acids in the genetic code in such a way that similar amino acids also have similar codons - this, not through a mechanism of physicochemical interaction between, for example, codons and amino acids - then it might turn out that even different physicochemical properties of codons (or anticodons or bases) show some correlation with the physicochemical properties of amino acids, simply because the partition energy of amino acids is correlated with other physicochemical properties of amino acids. It is very likely that this would inevitably lead to a correlation between codons (or anticodons or bases) and amino acids. In other words, since the codons (anticodons or bases) are ordered in the genetic code, that is to say, some of their physicochemical properties should also be ordered by a similar order, and given that the amino acids would also appear to have been ordered in the genetic code by selection natural, then it should inevitably turn out that there is a correlation between, for example, the hydrophobicity of anticodons and that of amino acids. Instead, the intervention of natural selection in organizing the genetic code would appear to be highly compatible with the main mechanism of structuring the genetic code as supported by the coevolution theory. This would make the coevolution theory the only plausible explanation for the origin of the genetic code.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
2
|
Di Giulio M. The time of appearance of the genetic code. Biosystems 2024; 237:105159. [PMID: 38373543 DOI: 10.1016/j.biosystems.2024.105159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/13/2024] [Accepted: 02/16/2024] [Indexed: 02/21/2024]
Abstract
I support the hypothesis that the origin of the genetic code occurred simultaneously with the evolution of cellularity. That is to say, I favour the hypothesis that the origin of the genetic code is a very, very late event in the history of life on Earth. I corroborate this hypothesis with observations favouring the progenote's stage for the Last Universal Common Ancestor (LUCA), for the ancestor of bacteria and that of archaea. Indeed, these progenotic stages would imply that - at that time - the origin of the genetic code was still ongoing simply because this origin would fall within the very definition of progenote. Therefore, if the evolution of cellularity had truly been coeval with the origin of the genetic code - at least in its terminal part - then this would favour theories such as the coevolution theory of the origin of the genetic code because this theory would postulate that this origin must have occurred in extremely complex protocellular conditions and not concerning stereochemical or physicochemical interactions having to do with other stages of the origin of life. In this sense, the coevolution theory would be corroborated while the stereochemical and physicochemical theories would be damaged. Therefore, the origin of the genetic code would be linked to the origin of the cell and not to the origin of life as sometimes asserted. Therefore, I will discuss the late hypothesis of the origin of the genetic code in the context of the theories proposed to explain this origin and more generally of its implications for the early evolution of life.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
3
|
Model of Genetic Code Structure Evolution under Various Types of Codon Reading. Int J Mol Sci 2022; 23:ijms23031690. [PMID: 35163612 PMCID: PMC8835785 DOI: 10.3390/ijms23031690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 01/23/2022] [Accepted: 01/25/2022] [Indexed: 11/28/2022] Open
Abstract
The standard genetic code (SGC) is a set of rules according to which 64 codons are assigned to 20 canonical amino acids and stop coding signal. As a consequence, the SGC is redundant because there is a greater number of codons than the number of encoded labels. This redundancy implies the existence of codons that encode the same genetic information. The size and organization of such synonymous codon blocks are important characteristics of the SGC structure whose evolution is still unclear. Therefore, we studied possible evolutionary mechanisms of the codon block structure. We conducted computer simulations assuming that coding systems at early stages of the SGC evolution were sets of ambiguous codon assignments with high entropy. We included three types of reading systems characterized by different inaccuracy and pattern of codon recognition. In contrast to the previous study, we allowed for evolution of the reading systems and their competition. The simulations performed under minimization of translational errors and reduction of coding ambiguity produced the coding system resistant to these errors. The reading system similar to that present in the SGC dominated the others very quickly. The survived system was also characterized by low entropy and possessed properties similar to that in the SGC. Our simulation show that the unambiguous SGC could emerged from a code with a lower level of ambiguity and the number of tRNAs increased during the evolution.
Collapse
|
4
|
Caldararo F, Di Giulio M. The genetic code is very close to a global optimum in a model of its origin taking into account both the partition energy of amino acids and their biosynthetic relationships. Biosystems 2022; 214:104613. [DOI: 10.1016/j.biosystems.2022.104613] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Revised: 01/16/2022] [Accepted: 01/17/2022] [Indexed: 01/23/2023]
|
5
|
Pawlak K, Wnetrzak M, Mackiewicz D, Mackiewicz P, Błażej P. Models of genetic code structure evolution with variable number of coded labels. Biosystems 2021; 210:104528. [PMID: 34492316 DOI: 10.1016/j.biosystems.2021.104528] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 08/26/2021] [Accepted: 08/27/2021] [Indexed: 10/20/2022]
Abstract
It is assumed that at the early stage of cell evolution its translation machinery was characterized by high noise, i.e. ambiguous assignment of codons to amino acids in the genetic code, which initially encoded only few amino acids. Next, during its evolution new amino acids were added to this code. Taking into account this facts, we investigated theoretical models of genetic code's structure, which evolved from a set of ambiguous codons assignments into a coding system with a low level of uncertainty. We considered three types of translational inaccuracies assuming a different number of fixed codon positions. We applied a modified version of evolutionary algorithm for finding the genetic codes that the most effectively reduced the initial uncertainty in the assignment of codons to encoded labels, i.e. amino acids and a stop translation signal. We examined codes with the number of labels from four to 22. Our results indicated that the quality of genetic code structure is strongly dependent on the number of encoded labels as well as the type of translational mechanism. The more strict assignments of codon to the labels was preferred by the codes encoding more number of labels. The results showed that a smaller degeneracy of codes evolved from a more tolerant coding with the stepwise addition of coded amino acids to the genetic code. The distribution of codon groups in the standard genetic code corresponds well to the translation model assuming two fixed codon positions, whereas the six-codon groups can be relics form previous stages of evolution when the code characterized by a greater uncertainty.
Collapse
Affiliation(s)
- Konrad Pawlak
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, Poland
| | - Małgorzata Wnetrzak
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, Poland
| | - Paweł Błażej
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, Poland.
| |
Collapse
|
6
|
Di Giulio M. LUCA as well as the ancestors of archaea, bacteria and eukaryotes were progenotes: Inference from the distribution and diversity of the reading mechanism of the AUA and AUG codons in the domains of life. Biosystems 2020; 198:104239. [PMID: 32919036 DOI: 10.1016/j.biosystems.2020.104239] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 09/01/2020] [Accepted: 09/01/2020] [Indexed: 11/25/2022]
Abstract
Here I use the rationale assuming that if of a certain trait that exerts its function in some aspect of the genetic code or, more generally, in protein synthesis, it is possible to identify the evolutionary stage of its origin then it would imply that this evolutionary moment would be characterized by a high translational noise because this trait would originate for the first time during that evolutionary stage. That is to say, if this trait had a non-marginal role in the realization of the genetic code, or in protein synthesis, then the origin of this trait would imply that, more generally, it was the genetic code itself that was still originating. But if the genetic code were still originating - at that precise evolutionary stage - then this would imply that there was a high translational noise which in turn would imply that it was in the presence of a protocell, i.e. a progenote that was by definition characterized by high translational noise. I apply this rationale to the mechanism of modification of the base 34 of the anticodon of an isoleucine tRNA that leads to the reading of AUA and AUG codons in archaea, bacteria and eukaryotes. The phylogenetic distribution of this mechanism in these phyletic lineages indicates that this mechanism originated only after the evolutionary stage of the last universal common ancestor (LUCA), namely, during the formation of cellular domains, i.e., at the stage of ancestors of these main phyletic lineages. Furthermore, given that this mechanism of modification of the base 34 of the anticodon of the isoleucine tRNA would result to emerge at a stage of the origin of the genetic code - despite in its terminal phases - then all this would imply that the ancestors of bacteria, archaea and eukaryotes were progenotes. If so, all the more so, the LUCA would also be a progenote since it preceded these ancestors temporally. A consequence of all this reasoning might be that since these three ancestors were of the progenotes that were different from each other, if at least one of them had evolved into at least two real and different cells - basically different from each other - then the number of cellular domains would not be three but it would be greater than three.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena (L'Aquila), Italy; Institute of Biosciences and Bioresources, National Research Council, Via P. Castellino, 111, 80131, Naples, Italy.
| |
Collapse
|
7
|
The phylogenetic distribution of the glutaminyl-tRNA synthetase and Glu-tRNA Gln amidotransferase in the fundamental lineages would imply that the ancestor of archaea, that of eukaryotes and LUCA were progenotes. Biosystems 2020; 196:104174. [PMID: 32535177 DOI: 10.1016/j.biosystems.2020.104174] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 05/25/2020] [Accepted: 05/25/2020] [Indexed: 12/21/2022]
Abstract
The function of the glutaminyl-tRNA synthetase and Glu-tRNAGln amidotransferase might be related to the origin of the genetic code because, for example, glutaminyl-tRNA synthetase catalyses the fundamental reaction that makes the genetic code. If the evolutionary stage of the origin of these two enzymes could be unambiguously identified, then the genetic code should still have been originating at that particular evolutionary stage because the fundamental reaction that makes the code itself was still evidently evolving. This would result in that particular evolutionary moment being attributed to the evolutionary stage of the progenote because it would have a relationship between the genotype and the phenotype not yet fully realized because the genetic code was precisely still originating. I then analyzed the distribution of the glutaminyl-tRNA synthetase and Glu-tRNAGln aminodotrasferase in the main phyletic lineages. Since in some cases the origin of these two enzymes can be related to the evolutionary stages of ancestors of archaea and eukaryotes, this would indicate these ancestors as progenotes because at that evolutionary moment the genetic code was evidently still evolving, thus realizing the definition of progenote. The conclusion that the ancestor of archaea and that of eukaryotes were progenotes would imply that even the last universal common ancestor (LUCA) was a progenote because it appeared, on the tree of life, temporally before these ancestors.
Collapse
|
8
|
Barbieri M. Evolution of the genetic code: The ambiguity-reduction theory. Biosystems 2019; 185:104024. [DOI: 10.1016/j.biosystems.2019.104024] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 08/26/2019] [Accepted: 08/26/2019] [Indexed: 10/26/2022]
|
9
|
The Second Special Issue on Code Biology - An overview. Biosystems 2019; 187:104050. [PMID: 31589914 DOI: 10.1016/j.biosystems.2019.104050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
10
|
Dragovich B, Mišić NŽ. p-Adic hierarchical properties of the genetic code. Biosystems 2019; 185:104017. [PMID: 31433999 DOI: 10.1016/j.biosystems.2019.104017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Revised: 08/10/2019] [Accepted: 08/13/2019] [Indexed: 12/11/2022]
Abstract
In this article, we consider p-adic modeling of the standard genetic code and the vertebrate mitochondrial one. To this end, we use 5-adic and 2-adic distance as a mathematical tool to describe closeness (nearness, similarity) between codons as elements of a bioinformation space. Codons which are simultaneously at the smallest 5-adic and 2-adic distance code the same (or similar) amino acid or stop signal. The set of codons is presented as an ultrametric tree as well as a fractal and p-adic network. It is shown that genetic code can be treated as sequential translation between genetic languages. This p-adic approach gives possibility to be applied to sequences of nucleotides of an arbitrary finite length. We present an overview of published and some new results on various p-adic properties of the genetic code.
Collapse
Affiliation(s)
- Branko Dragovich
- Institute of Physics, University of Belgrade, Belgrade, Serbia; Mathematical Institute, Serbian Academy of Sciences and Arts, Belgrade, Serbia.
| | - Nataša Ž Mišić
- Research and Development Institute Lola Ltd, Kneza Višeslava 70a, Belgrade, Serbia.
| |
Collapse
|
11
|
A general model on the origin of biological codes. Biosystems 2019; 181:11-19. [DOI: 10.1016/j.biosystems.2019.04.010] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 04/16/2019] [Accepted: 04/16/2019] [Indexed: 01/09/2023]
|
12
|
Optimization of the standard genetic code in terms of two mutation types: Point mutations and frameshifts. Biosystems 2019; 181:44-50. [DOI: 10.1016/j.biosystems.2019.04.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 04/27/2019] [Indexed: 02/08/2023]
|
13
|
Barbieri M. What is code biology? Biosystems 2018; 164:1-10. [DOI: 10.1016/j.biosystems.2017.10.005] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 10/04/2017] [Accepted: 10/05/2017] [Indexed: 01/29/2023]
|
14
|
The evolution of the genetic code: Impasses and challenges. Biosystems 2018; 164:217-225. [DOI: 10.1016/j.biosystems.2017.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 10/06/2017] [Accepted: 10/09/2017] [Indexed: 01/17/2023]
|
15
|
Kubyshkin V, Acevedo-Rocha CG, Budisa N. On universal coding events in protein biogenesis. Biosystems 2018; 164:16-25. [DOI: 10.1016/j.biosystems.2017.10.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Revised: 10/02/2017] [Accepted: 10/03/2017] [Indexed: 12/14/2022]
|
16
|
|
17
|
|
18
|
Wills PR. DNA as information. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2016; 374:rsta.2015.0417. [PMID: 26857666 DOI: 10.1098/rsta.2015.0417] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 12/14/2015] [Indexed: 06/05/2023]
Abstract
This article reviews contributions to this theme issue covering the topic 'DNA as information' in relation to the structure of DNA, the measure of its information content, the role and meaning of information in biology and the origin of genetic coding as a transition from uninformed to meaningful computational processes in physical systems.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PO Box 92019, Auckland 1142, New Zealand Institut für Biochemie und Molekularbiologie, Universität Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany Department of Biochemistry and Biophysics, University of North Carolina, 120 Mason Farm Road, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|