1
|
Carter CW, Tang GQ, Patra SK, Betts L, Dieckhaus H, Kuhlman B, Douglas J, Wills PR, Bouckaert R, Popovic M, Ditzler MA. WITHDRAWN: Structural Enzymology, Phylogenetics, Differentiation, and Symbolic Reflexivity at the Dawn of Biology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.12.17.628912. [PMID: 39763899 PMCID: PMC11702779 DOI: 10.1101/2024.12.17.628912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/18/2025]
Abstract
This manuscript was posted without the final consent of all authors. The authors have therefore withdrawn it. The authors do not wish this work to be cited as reference for the project. If you have any questions, please contact the corresponding author, carter@med.unc.edu .
Collapse
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Guo Qing Tang
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Sourav Kumar Patra
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Laurie Betts
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Henry Dieckhaus
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
- Lineberger Comprehensive Cancer Center, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
- Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Jordan Douglas
- Department of Physics, Auckland University, Auckland, NZ
- Department of Computer Science, Auckland University, Auckland, NZ
| | - Peter R. Wills
- Department of Physics, Auckland University, Auckland, NZ
| | - Remco Bouckaert
- Department of Computer Science, Auckland University, Auckland, NZ
| | | | | |
Collapse
|
2
|
Douglas J, Cui H, Perona JJ, Vargas‐Rodriguez O, Tyynismaa H, Carreño CA, Ling J, Ribas de Pouplana L, Yang X, Ibba M, Becker H, Fischer F, Sissler M, Carter CW, Wills PR. AARS Online: A collaborative database on the structure, function, and evolution of the aminoacyl-tRNA synthetases. IUBMB Life 2024; 76:1091-1105. [PMID: 39247978 PMCID: PMC11580382 DOI: 10.1002/iub.2911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 08/07/2024] [Indexed: 09/10/2024]
Abstract
The aminoacyl-tRNA synthetases (aaRS) are a large group of enzymes that implement the genetic code in all known biological systems. They attach amino acids to their cognate tRNAs, moonlight in various translational and non-translational activities beyond aminoacylation, and are linked to many genetic disorders. The aaRS have a subtle ontology characterized by structural and functional idiosyncrasies that vary from organism to organism, and protein to protein. Across the tree of life, the 22 coded amino acids are handled by 16 evolutionary families of Class I aaRS and 21 families of Class II aaRS. We introduce AARS Online, an interactive Wikipedia-like tool curated by an international consortium of field experts. This platform systematizes existing knowledge about the aaRS by showcasing a taxonomically diverse selection of aaRS sequences and structures. Through its graphical user interface, AARS Online facilitates a seamless exploration between protein sequence and structure, providing a friendly introduction to the material for non-experts and a useful resource for experts. Curated multiple sequence alignments can be extracted for downstream analyses. Accessible at www.aars.online, AARS Online is a free resource to delve into the world of the aaRS.
Collapse
Affiliation(s)
- Jordan Douglas
- Department of PhysicsUniversity of AucklandNew Zealand
- Centre for Computational EvolutionUniversity of AucklandNew Zealand
| | - Haissi Cui
- Department of ChemistryUniversity of TorontoCanada
| | - John J. Perona
- Department of ChemistryPortland State UniversityPortlandOregonUSA
| | - Oscar Vargas‐Rodriguez
- Department of Molecular Biology and BiophysicsUniversity of ConnecticutStorrsConnecticutUSA
| | - Henna Tyynismaa
- Stem Cells and Metabolism Research Program, Faculty of MedicineUniversity of HelsinkiFinland
| | | | - Jiqiang Ling
- Department of Cell Biology and Molecular GeneticsUniversity of MarylandCollege ParkMarylandUSA
| | - Lluís Ribas de Pouplana
- Institute for Research in BiomedicineThe Barcelona Institute of Science and TechnologyBarcelonaCataloniaSpain
- Catalan Institution for Research and Advanced StudiesBarcelonaCataloniaSpain
| | - Xiang‐Lei Yang
- Department of Molecular MedicineThe Scripps Research InstituteLa JollaCaliforniaUSA
| | - Michael Ibba
- Biological SciencesChapman UniversityOrangeCaliforniaUSA
| | - Hubert Becker
- Génétique Moléculaire, Génomique MicrobiologiqueUniversity of StrasbourgFrance
| | - Frédéric Fischer
- Génétique Moléculaire, Génomique MicrobiologiqueUniversity of StrasbourgFrance
| | - Marie Sissler
- Génétique Moléculaire, Génomique MicrobiologiqueUniversity of StrasbourgFrance
| | - Charles W. Carter
- Department of Biochemistry and BiophysicsUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| | - Peter R. Wills
- Department of PhysicsUniversity of AucklandNew Zealand
- Centre for Computational EvolutionUniversity of AucklandNew Zealand
| |
Collapse
|
3
|
Patra S, Douglas J, Wills P, Betts L, Qing T, Carter C. A genomic database furnishes minimal functional glycyl-tRNA synthetases homologous to other, designed class II urzymes. Nucleic Acids Res 2024; 52:13305-13324. [PMID: 39494520 PMCID: PMC11602164 DOI: 10.1093/nar/gkae992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 10/18/2024] [Indexed: 11/05/2024] Open
Abstract
The hypothesis that conserved core catalytic sites could represent ancestral aminoacyl-tRNA synthetases (AARS) drove the design of functional TrpRS, LeuRS, and HisRS 'urzymes'. We describe here new urzymes detected in the genomic record of the arctic fox, Vulpes lagopus. They are homologous to the α-subunit of bacterial heterotetrameric Class II glycyl-tRNA synthetase (GlyRS-B) enzymes. AlphaFold2 predicted that the N-terminal 81 amino acids would adopt a 3D structure nearly identical to our designed HisRS urzyme (HisCA1). We expressed and purified that N-terminal segment and the spliced open reading frame GlyCA1-2. Both exhibit robust single-turnover burst sizes and ATP consumption rates higher than those previously published for HisCA urzymes and comparable to those for LeuAC and TrpAC. GlyCA is more than twice as active in glycine activation by adenosine triphosphate as the full-length GlyRS-B α2 dimer. Michaelis-Menten rate constants for all three substrates reveal significant coupling between Exon2 and both substrates. GlyCA activation favors Class II amino acids that complement those favored by HisCA and LeuAC. Structural features help explain these results. These minimalist GlyRS catalysts are thus homologous to previously described urzymes. Their properties reinforce the notion that urzymes may have the requisite catalytic activities to implement a reduced, ancestral genetic coding alphabet.
Collapse
Affiliation(s)
- Sourav Kumar Patra
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jordan Douglas
- Department of Physics, The University of Auckland, Auckland 1042, New Zealand
- Centre for Computational Evolution, University of Auckland, 1010, New Zealand
| | - Peter R Wills
- Department of Physics, The University of Auckland, Auckland 1042, New Zealand
| | - Laurie Betts
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Tang Guo Qing
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
4
|
Wong TF. Triphasic Development of the Genetic Code. Chem Rev 2024; 124:9866-9872. [PMID: 39088192 PMCID: PMC11393795 DOI: 10.1021/acs.chemrev.3c00915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/02/2024]
Abstract
The genetic code contains an alphabet of genetically encoded amino acids. The ten Phase 1 amino acids, including Gly, Ala, Ser, Asp, Glu, Val, Leu, Ile, Pro and Thr, were available from the prebiotic environment, whereas the ten Phase 2 amino acids, including Phe, Tyr, Arg, His, Trp, Asn, Gln, Lys, Cys, and Met, became available only later from amino acid biosyntheses. In the archaeon Methanopyrus kandleri, the oldest organism known, the standard alphabet of 20 amino acids was "frozen" and no additional amino acid was encoded in the subsequent 3 Gyrs. Four decades ago, it was discovered that the code was frozen because all the organisms were so well adapted to the standard amino acids that oligogenic barriers, consisting of genes that are thoroughly dependent on the standard code, would cause loss of viability upon the deletion of any one amino acid from the code. Once the reason for the freezing of the code was ascertained, procedures were devised by scientists worldwide to enable the encoding of novel noncanonical amino acids (ncAAs). These encoded Phase 3 ncAAs now surpass the 20 canonical Phase 2 amino acids in the code.
Collapse
Affiliation(s)
- Tze-Fei Wong
- Division of Life Science and Applied Genomics Center, Hong Kong University of Science & Technology Hong Kong, China
| |
Collapse
|
5
|
Tang GQ, Hu H, Douglas J, Carter C. Primordial aminoacyl-tRNA synthetases preferred minihelices to full-length tRNA. Nucleic Acids Res 2024; 52:7096-7111. [PMID: 38783009 PMCID: PMC11229368 DOI: 10.1093/nar/gkae417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 04/30/2024] [Accepted: 05/10/2024] [Indexed: 05/25/2024] Open
Abstract
Aminoacyl-tRNA synthetases (AARS) and tRNAs translate the genetic code in all living cells. Little is known about how their molecular ancestors began to enforce the coding rules for the expression of their own genes. Schimmel et al. proposed in 1993 that AARS catalytic domains began by reading an 'operational' code in the acceptor stems of tRNA minihelices. We show here that the enzymology of an AARS urzyme•TΨC-minihelix cognate pair is a rich in vitro realization of that idea. The TΨC-minihelixLeu is a very poor substrate for full-length Leucyl-tRNA synthetase. It is a superior RNA substrate for the corresponding urzyme, LeuAC. LeuAC active-site mutations shift the choice of both amino acid and RNA substrates. AARS urzyme•minihelix cognate pairs are thus small, pliant models for the ancestral decoding hardware. They are thus an ideal platform for detailed experimental study of the operational RNA code.
Collapse
Affiliation(s)
- Guo Qing Tang
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Hao Hu
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jordan Douglas
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, University of Auckland, New Zealand
- Department of Computer Science, The University of Auckland, New Zealand
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
6
|
Di Giulio M. Theories of the origin of the genetic code: Strong corroboration for the coevolution theory. Biosystems 2024; 239:105217. [PMID: 38663520 DOI: 10.1016/j.biosystems.2024.105217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/29/2024]
Abstract
I analyzed all the theories and models of the origin of the genetic code, and over the years, I have considered the main suggestions that could explain this origin. The conclusion of this analysis is that the coevolution theory of the origin of the genetic code is the theory that best captures the majority of observations concerning the organization of the genetic code. In other words, the biosynthetic relationships between amino acids would have heavily influenced the origin of the organization of the genetic code, as supported by the coevolution theory. Instead, the presence in the genetic code of physicochemical properties of amino acids, which have also been linked to the physicochemical properties of anticodons or codons or bases by stereochemical and physicochemical theories, would simply be the result of natural selection. More explicitly, I maintain that these correlations between codons, anticodons or bases and amino acids are in fact the result not of a real correlation between amino acids and codons, for example, but are only the effect of the intervention of natural selection. Specifically, in the genetic code table we expect, for example, that the most similar codons - that is, those that differ by only one base - will have more similar physicochemical properties. Therefore, the 64 codons of the genetic code table ordered in a certain way would also represent an ordering of some of their physicochemical properties. Now, a study aimed at clarifying which physicochemical property of amino acids has influenced the allocation of amino acids in the genetic code has established that the partition energy of amino acids has played a role decisive in this. Indeed, under some conditions, the genetic code was found to be approximately 98% optimized on its columns. In this same work, it was shown that this was most likely the result of the action of natural selection. If natural selection had truly allocated the amino acids in the genetic code in such a way that similar amino acids also have similar codons - this, not through a mechanism of physicochemical interaction between, for example, codons and amino acids - then it might turn out that even different physicochemical properties of codons (or anticodons or bases) show some correlation with the physicochemical properties of amino acids, simply because the partition energy of amino acids is correlated with other physicochemical properties of amino acids. It is very likely that this would inevitably lead to a correlation between codons (or anticodons or bases) and amino acids. In other words, since the codons (anticodons or bases) are ordered in the genetic code, that is to say, some of their physicochemical properties should also be ordered by a similar order, and given that the amino acids would also appear to have been ordered in the genetic code by selection natural, then it should inevitably turn out that there is a correlation between, for example, the hydrophobicity of anticodons and that of amino acids. Instead, the intervention of natural selection in organizing the genetic code would appear to be highly compatible with the main mechanism of structuring the genetic code as supported by the coevolution theory. This would make the coevolution theory the only plausible explanation for the origin of the genetic code.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
7
|
Rekadwad BN, Shouche YS, Jangid K. Investigation of tRNA-based relatedness within the Planctomycetes-Verrucomicrobia-Chlamydiae (PVC) superphylum: a comparative analysis. Arch Microbiol 2023; 205:366. [PMID: 37917352 DOI: 10.1007/s00203-023-03694-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 09/23/2023] [Accepted: 09/27/2023] [Indexed: 11/04/2023]
Abstract
The PVC superphylum is a diverse group of prokaryotes that require stringent growth conditions. RNA is a fascinating molecule to find evolutionary relatedness according to the RNA World Hypothesis. We conducted tRNA gene analysis to find evolutionary relationships in the PVC phyla. The analysis of genomic data (P = 9, V = 4, C = 8) revealed that the number of tRNA genes varied from 28 to 90 in Planctomycetes and Chlamydia, respectively. Verrucomicrobia has whole genomes and the longest scaffold (3 + 1), with tRNA genes ranging from 49 to 53 in whole genomes and 4 in the longest scaffold. Most tRNAs in the E. coli genome clustered with homologs, but approximately 43% clustered with tRNAs encoding different amino acids. Planctomyces, Akkermansia, Isosphaera, and Chlamydia were similar to E. coli tRNAs. In a phylum, tRNAs coding for different amino acids clustered at a range of 8 to 10%. Further analysis of these tRNAs showed sequence similarity with Cyanobacteria, Proteobacteria, Viridiplantae, Ascomycota and Basidiomycota (Eukaryota). This indicates the possibility of horizontal gene transfer or, otherwise, a different origin of tRNA in PVC bacteria. Hence, this work proves its importance for determining evolutionary relatedness and potentially identifying bacteria using tRNA. Thus, the analysis of these tRNAs indicates that primitive RNA may have served as the genetic material of LUCA before being replaced by DNA. A quantitative analysis is required to test these possibilities that relate the evolutionary significance of tRNA to the origin of life.
Collapse
Affiliation(s)
- Bhagwan Narayan Rekadwad
- National Centre for Microbial Resource (NCMR), DBT-National Centre for Cell Science (DBT-NCCS), Saviribai Phule Pune University Campus, Ganeshkhind, Pune, 411007, Maharashtra, India.
- Microbe AI Lab, Division of Microbiology and Biotechnology, Yenepoya Research Centre, Yenepoya (Deemed to Be University), Mangalore, 575018, Karnataka, India.
| | - Yogesh S Shouche
- National Centre for Microbial Resource (NCMR), DBT-National Centre for Cell Science (DBT-NCCS), Saviribai Phule Pune University Campus, Ganeshkhind, Pune, 411007, Maharashtra, India
- Gut Microbiology Research Division, SKAN Research Trust, Bangalore, 560034, Karnataka, India
| | - Kamlesh Jangid
- Bioenergy Group, DST-Agharkar Research Institute, Gopal Ganesh Agarkar Road, Pune, 411004, Maharashtra, India
| |
Collapse
|
8
|
Štambuk N, Konjevoda P, Brčić-Kostić K, Baković J, Štambuk A. New algorithm for the analysis of nucleotide and amino acid evolutionary relationships based on Klein four-group. Biosystems 2023; 233:105030. [PMID: 37717902 DOI: 10.1016/j.biosystems.2023.105030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 09/10/2023] [Accepted: 09/10/2023] [Indexed: 09/19/2023]
Abstract
Phylogenetics is the study of ancestral relationships among biological species. Such sequence analyses are often represented as phylogenetic trees. The branching pattern of each tree and its topology reflect the evolutionary relatedness between analyzed sequences. We present a Klein four-group algorithm (K4A) for the evolutionary analysis of nucleotide and amino acid sequences. Klein four-group set of operators consists of: identity e (U), and three elements-a = transition (C), b = transversion (G) and c = transition-transversion or complementarity (A). We generated Klein four-group based distance matrices of: 1. Cayley table (CK4), 2. Table rows (K4R), 3. Table columns (K4C), and 4. Euclidean 2D distance (K4E). The performance of the matrices was tested on a dataset of RecA proteins in bacteria, eukaryotes (Rad51 homolog) and archaea (RadA homolog). RecA and its functional homologs are found in all species, and are essential for the repair and maintenance of DNA. Consequently, they represent a good model for the study of evolutionary relationship of protein and nucleotide sequences. The ancestral relationship between the sequences was correctly classified by all K4A matrices concerning general topology. All distance matrices exhibited small variations among species, and overall results of tree classification were in agreement with the general patterns obtained by standard BLOSUM and PAM substitution matrices. During the evolution of a code there is a phase of optimization of system rules, the ambiguity of a code is eliminated, and the system starts producing specific components. Klein four-group algorithm is consistent with the concept of ambiguity reduction. It also enables the use of different genetic code table variants optimized for particular transitions in evolution based on biological specificity.
Collapse
Affiliation(s)
- Nikola Štambuk
- Centre for Nuclear Magnetic Resonance, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000, Zagreb, Croatia.
| | - Paško Konjevoda
- Laboratory for Epigenomics, Division of Molecular Medicine, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000, Zagreb, Croatia.
| | - Krunoslav Brčić-Kostić
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000, Zagreb, Croatia
| | - Josip Baković
- University Hospital Dubrava, Department of Surgery, Avenija Gojka Šuška 6, HR-10000, Zagreb, Croatia
| | - Albert Štambuk
- Faculty of Kinesiology, University of Zagreb, Horvaćanski zavoj 15, HR-10000 Zagreb, Croatia
| |
Collapse
|
9
|
Tang GQ, Elder JJH, Douglas J, Carter CW. Domain acquisition by class I aminoacyl-tRNA synthetase urzymes coordinated the catalytic functions of HVGH and KMSKS motifs. Nucleic Acids Res 2023; 51:8070-8084. [PMID: 37470821 PMCID: PMC10450160 DOI: 10.1093/nar/gkad590] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 06/23/2023] [Accepted: 07/11/2023] [Indexed: 07/21/2023] Open
Abstract
Leucyl-tRNA synthetase (LeuRS) is a Class I aminoacyl-tRNA synthetase (aaRS) that synthesizes leucyl-tRNAleu for codon-directed protein synthesis. Two signature sequences, HxGH and KMSKS help stabilize transition-states for amino acid activation and tRNA aminoacylation by all Class I aaRS. Separate alanine mutants of each signature, together with the double mutant, behave in opposite ways in Pyrococcus horikoshii LeuRS and the 129-residue urzyme ancestral model generated from it (LeuAC). Free energy coupling terms, Δ(ΔG‡), for both reactions are large and favourable for LeuRS, but unfavourable for LeuAC. Single turnover assays with 32Pα-ATP show correspondingly different internal products. These results implicate domain motion in catalysis by full-length LeuRS. The distributed thermodynamic cycle of mutational changes authenticates LeuAC urzyme catalysis far more convincingly than do single point mutations. Most importantly, the evolutionary gain of function induced by acquiring the anticodon-binding (ABD) and multiple insertion modules in the catalytic domain appears to be to coordinate the catalytic function of the HxGH and KMSKS signature sequences. The implication that backbone elements of secondary structures achieve a major portion of the overall transition-state stabilization by LeuAC is also consistent with coevolution of the genetic code and metabolic pathways necessary to produce histidine and lysine sidechains.
Collapse
Affiliation(s)
- Guo Qing Tang
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jessica J H Elder
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jordan Douglas
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
- Department of Physics, The University of Auckland, New Zealand
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
10
|
Rosandić M, Paar V. The Evolution of Life Is a Road Paved with the DNA Quadruplet Symmetry and the Supersymmetry Genetic Code. Int J Mol Sci 2023; 24:12029. [PMID: 37569405 PMCID: PMC10418607 DOI: 10.3390/ijms241512029] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/19/2023] [Accepted: 07/24/2023] [Indexed: 08/13/2023] Open
Abstract
Symmetries have not been completely determined and explained from the discovery of the DNA structure in 1953 and the genetic code in 1961. We show, during 10 years of investigation and research, our discovery of the Supersymmetry Genetic Code table in the form of 2 × 8 codon boxes, quadruplet DNA symmetries, and the classification of trinucleotides/codons, all built with the same physiochemical double mirror symmetry and Watson-Crick pairing. We also show that single-stranded RNA had the complete code of life in the form of the Supersymmetry Genetic Code table simultaneously with instructions of codons' relationship as to how to develop the DNA molecule on the principle of Watson-Crick pairing. We show that the same symmetries between the genetic code and DNA quadruplet are highly conserved during the whole evolution even between phylogenetically distant organisms. In this way, decreasing disorder and entropy enabled the evolution of living beings up to sophisticated species with cognitive features. Our hypothesis that all twenty amino acids are necessary for the origin of life on the Earth, which entirely changes our view on evolution, confirms the evidence of organic natural amino acids from the extra-terrestrial asteroid Ryugu, which is nearly as old as our solar system.
Collapse
Affiliation(s)
- Marija Rosandić
- Department of Internal Medicine, University Hospital Centre Zagreb, (Ret.), 10000 Zagreb, Croatia
- Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia;
| | - Vladimir Paar
- Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia;
- Physics Department, Faculty of Science, University of Zagreb, 10000 Zagreb, Croatia
| |
Collapse
|
11
|
Thoma B, Powner MW. Selective Synthesis of Lysine Peptides and the Prebiotically Plausible Synthesis of Catalytically Active Diaminopropionic Acid Peptide Nitriles in Water. J Am Chem Soc 2023; 145:3121-3130. [PMID: 36700882 PMCID: PMC9912261 DOI: 10.1021/jacs.2c12497] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Why life encodes specific proteinogenic amino acids remains an unsolved problem, but a non-enzymatic synthesis that recapitulates biology's universal strategy of stepwise N-to-C terminal peptide growth may hold the key to this selection. Lysine is an important proteinogenic amino acid that, despite its essential structural, catalytic, and functional roles in biochemistry, has widely been assumed to be a late addition to the genetic code. Here, we demonstrate that lysine thioacids undergo coupling with aminonitriles in neutral water to afford peptides in near-quantitative yield, whereas non-proteinogenic lysine homologues, ornithine, and diaminobutyric acid cannot form peptides due to rapid and quantitative cyclization that irreversibly blocks peptide synthesis. We demonstrate for the first time that ornithine lactamization provides an absolute differentiation of lysine and ornithine during (non-enzymatic) N-to-C-terminal peptide ligation. We additionally demonstrate that the shortest lysine homologue, diaminopropionic acid, undergoes effective peptide ligation. This prompted us to discover a high-yielding prebiotically plausible synthesis of the diaminopropionic acid residue, by peptide nitrile modification, through the addition of ammonia to a dehydroalanine nitrile. With this synthesis in hand, we then discovered that the low basicity of diaminopropionyl residues promotes effective, biomimetic, imine catalysis in neutral water. Our results suggest diaminopropionic acid, synthesized by peptide nitrile modification, can replace or augment lysine residues during early evolution but that lysine's electronically isolated sidechain amine likely provides an evolutionary advantage for coupling and coding as a preformed monomer in monomer-by-monomer peptide translation.
Collapse
|
12
|
Harrison SA, Palmeira RN, Halpern A, Lane N. A biophysical basis for the emergence of the genetic code in protocells. BIOCHIMICA ET BIOPHYSICA ACTA. BIOENERGETICS 2022; 1863:148597. [PMID: 35868450 DOI: 10.1016/j.bbabio.2022.148597] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 06/27/2022] [Accepted: 07/13/2022] [Indexed: 11/17/2022]
Abstract
The origin of the genetic code is an abiding mystery in biology. Hints of a 'code within the codons' suggest biophysical interactions, but these patterns have resisted interpretation. Here, we present a new framework, grounded in the autotrophic growth of protocells from CO2 and H2. Recent work suggests that the universal core of metabolism recapitulates a thermodynamically favoured protometabolism right up to nucleotide synthesis. Considering the genetic code in relation to an extended protometabolism allows us to predict most codon assignments. We show that the first letter of the codon corresponds to the distance from CO2 fixation, with amino acids encoded by the purines (G followed by A) being closest to CO2 fixation. These associations suggest a purine-rich early metabolism with a restricted pool of amino acids. The second position of the anticodon corresponds to the hydrophobicity of the amino acid encoded. We combine multiple measures of hydrophobicity to show that this correlation holds strongly for early amino acids but is weaker for later species. Finally, we demonstrate that redundancy at the third position is not randomly distributed around the code: non-redundant amino acids can be assigned based on size, specifically length. We attribute this to additional stereochemical interactions at the anticodon. These rules imply an iterative expansion of the genetic code over time with codon assignments depending on both distance from CO2 and biophysical interactions between nucleotide sequences and amino acids. In this way the earliest RNA polymers could produce non-random peptide sequences with selectable functions in autotrophic protocells.
Collapse
Affiliation(s)
- Stuart A Harrison
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland
| | - Raquel Nunes Palmeira
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland
| | - Aaron Halpern
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland
| | - Nick Lane
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland.
| |
Collapse
|
13
|
Borah C, Ali T. Genetic code noise immunity features: Degeneracy and frameshift correction. GENE REPORTS 2022. [DOI: 10.1016/j.genrep.2022.101707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
14
|
Property based analysis: Optimality of RNY comma-free code versus circular code (X) after frameshift errors. GENE REPORTS 2022. [DOI: 10.1016/j.genrep.2022.101652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
15
|
Kondratyeva LG, Dyachkova MS, Galchenko AV. The Origin of Genetic Code and Translation in the Framework of Current Concepts on the Origin of Life. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:150-169. [PMID: 35508902 DOI: 10.1134/s0006297922020079] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The origin of genetic code and translation system is probably the central and most difficult problem in the investigations on the origin of life and one of the most complex problems in the evolutionary biology in general. There are multiple hypotheses on the emergence and development of existing genetic systems that propose the mechanisms for the origin and early evolution of genetic code, as well as for the emergence of replication and translation. Here, we discuss the most well-known of these hypotheses, although none of them provides a description of the early evolution of genetic systems without gaps and assumptions. The RNA world hypothesis is a currently prevailing scientific idea on the early evolution of biological and pre-biological structures, the main advantage of which is the assumption that RNAs as the first living systems were self-sufficient, i.e., capable of functioning as both catalysts and templates. However, this hypothesis has also significant limitations. In particular, no ribozymes with processive polymerase activity have been yet discovered or synthesized. Taking into account the mutual need of proteins and nucleic acids in each other in the current world, many authors propose the early evolution scenarios based on the co-evolution of these two classes of organic molecules. They postulate that the emergence of translation was necessary for the replication of nucleic acids, in contrast to the RNA world hypothesis, according to which the emergence of translation was preceded by the era of self-replicating RNAs. Although such scenarios are less parsimonious from the evolutionary point of view, since they require simultaneous emergence and evolution of two classes of organic molecules, as well as the emergence of synchronized replication and translation, their major advantage is that they explain the development of processive and much more accurate protein-dependent replication.
Collapse
Affiliation(s)
- Liya G Kondratyeva
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| | | | - Alexey V Galchenko
- Peoples' Friendship University of Russia (RUDN University), Moscow, 117198, Russia.
| |
Collapse
|
16
|
Formation of the Codon Degeneracy during Interdependent Development between Metabolism and Replication. Genes (Basel) 2021; 12:genes12122023. [PMID: 34946975 PMCID: PMC8701183 DOI: 10.3390/genes12122023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 11/30/2021] [Accepted: 12/03/2021] [Indexed: 11/16/2022] Open
Abstract
Nirenberg's genetic code chart shows a profound correspondence between codons and amino acids. The aim of this article is to try to explain the primordial formation of the codon degeneracy. It remains a puzzle how informative molecules arose from the supposed prebiotic random sequences. If introducing an initial driving force based on the relative stabilities of triplex base pairs, the prebiotic sequence evolution became innately nonrandom. Thus, the primordial assignment of the 64 codons to the 20 amino acids has been explained in detail according to base substitutions during the coevolution of tRNAs with aaRSs; meanwhile, the classification of aaRSs has also been explained.
Collapse
|
17
|
Fimmel E, Gumbel M, Starman M, Strüngmann L. Computational Analysis of Genetic Code Variations Optimized for the Robustness against Point Mutations with Wobble-like Effects. Life (Basel) 2021; 11:1338. [PMID: 34947869 PMCID: PMC8707135 DOI: 10.3390/life11121338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 11/26/2021] [Accepted: 11/27/2021] [Indexed: 11/17/2022] Open
Abstract
It is believed that the codon-amino acid assignments of the standard genetic code (SGC) help to minimize the negative effects caused by point mutations. All possible point mutations of the genetic code can be represented as a weighted graph with weights that correspond to the probabilities of these mutations. The robustness of a code against point mutations can be described then by means of the so-called conductance measure. This paper quantifies the wobble effect, which was investigated previously by applying the weighted graph approach, and seeks optimal weights using an evolutionary optimization algorithm to maximize the code's robustness. One result of our study is that the robustness of the genetic code is least influenced by mutations in the third position-like with the wobble effect. Moreover, the results clearly demonstrate that point mutations in the first, and even more importantly, in the second base of a codon have a very large influence on the robustness of the genetic code. These results were compared to single nucleotide variants (SNV) in coding sequences which support our findings. Additionally, it was analyzed which structure of a genetic code evolves from random code tables when the robustness is maximized. Our calculations show that the resulting code tables are very close to the standard genetic code. In conclusion, the results illustrate that the robustness against point mutations seems to be an important factor in the evolution of the standard genetic code.
Collapse
|
18
|
Factors in Protobiomonomer Selection for the Origin of the Standard Genetic Code. Acta Biotheor 2021; 69:745-767. [PMID: 34283307 DOI: 10.1007/s10441-021-09420-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 07/01/2021] [Indexed: 10/20/2022]
Abstract
Natural selection of specific protobiomonomers during abiogenic development of the prototype genetic code is hindered by the diversity of structural, spatial, and rotational isomers that have identical elemental composition and molecular mass (M), but can vary significantly in their physicochemical characteristics, such as the melting temperature Tm, the Tm:M ratio, and the solubility in water, due to different positions of atoms in the molecule. These parameters differ between cis- and trans-isomers of dicarboxylic acids, spatial monosaccharide isomers, and structural isomers of α-, β-, and γ-amino acids. The stable planar heterocyclic molecules of the major nucleobases comprise four (C, H, N, O) or three (C, H, N) elements and contain a single -C=C bond and two nitrogen atoms in each heterocycle involved in C-N and C=N bonds. They exist as isomeric resonance hybrids of single and double bonds and as a mixture of tautomer forms due to the presence of -C=O and/or -NH2 side groups. They are thermostable, insoluble in water, and exhibit solid-state stability, which is of central importance for DNA molecules as carriers of genetic information. In M-Tm diagrams, proteinogenic amino acids and the corresponding codons are distributed fairly regularly relative to the distinct clusters of purine and pyrimidine bases, reflecting the correspondence between codons and amino acids that was established in different periods of genetic code development. The body of data on the evolution of the genetic code system indicates that the elemental composition and molecular structure of protobiomonomers, and their M, Tm, photostability, and aqueous solubility determined their selection in the emergence of the standard genetic code.
Collapse
|
19
|
Subedi BP, Martin WF, Carbone V, Duin EC, Cronin B, Sauter J, Schofield LR, Sutherland-Smith AJ, Ronimus RS. Archaeal pseudomurein and bacterial murein cell wall biosynthesis share a common evolutionary ancestry. FEMS MICROBES 2021; 2:xtab012. [PMID: 37334239 PMCID: PMC10117817 DOI: 10.1093/femsmc/xtab012] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Accepted: 08/19/2021] [Indexed: 08/29/2023] Open
Abstract
Bacteria near-universally contain a cell wall sacculus of murein (peptidoglycan), the synthesis of which has been intensively studied for over 50 years. In striking contrast, archaeal species possess a variety of other cell wall types, none of them closely resembling murein. Interestingly though, one type of archaeal cell wall termed pseudomurein found in the methanogen orders Methanobacteriales and Methanopyrales is a structural analogue of murein in that it contains a glycan backbone that is cross-linked by a L-amino acid peptide. Here, we present taxonomic distribution, gene cluster and phylogenetic analyses that confirm orthologues of 13 bacterial murein biosynthesis enzymes in pseudomurein-containing methanogens, most of which are distantly related to their bacterial counterparts. We also present the first structure of an archaeal pseudomurein peptide ligase from Methanothermus fervidus DSM1088 (Mfer336) to a resolution of 2.5 Å and show that it possesses a similar overall tertiary three domain structure to bacterial MurC and MurD type murein peptide ligases. Taken together the data strongly indicate that murein and pseudomurein biosynthetic pathways share a common evolutionary history.
Collapse
Affiliation(s)
- Bishwa P Subedi
- AgResearch Ltd. Grasslands, Tennent Drive, Palmerston North 4442, New Zealand
- Massey University, Tennent Drive, Palmerston North 4442, New Zealand
| | - William F Martin
- Institute for Molecular Evolution, Heinrich-Heine University, Düsseldorf Universitätsstraße 1, D-40225, Germany
| | - Vincenzo Carbone
- AgResearch Ltd. Grasslands, Tennent Drive, Palmerston North 4442, New Zealand
| | - Eduardus C Duin
- Department of Chemistry and Biochemistry, Auburn University, Auburn, AL 36849, USA
| | - Bryan Cronin
- Department of Chemistry and Biochemistry, Auburn University, Auburn, AL 36849, USA
| | - Julia Sauter
- AgResearch Ltd. Grasslands, Tennent Drive, Palmerston North 4442, New Zealand
| | - Linley R Schofield
- AgResearch Ltd. Grasslands, Tennent Drive, Palmerston North 4442, New Zealand
| | | | - Ron S Ronimus
- AgResearch Ltd. Grasslands, Tennent Drive, Palmerston North 4442, New Zealand
| |
Collapse
|
20
|
Ying J, Ding R, Liu Y, Zhao Y. Prebiotic Chemistry in Aqueous Environment: A Review of Peptide Synthesis and Its Relationship with Genetic Code. CHINESE J CHEM 2021. [DOI: 10.1002/cjoc.202100120] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Jianxi Ying
- Institute of Drug Discovery Technology Ningbo University, No.818 Fenghua Road, Ningbo Zhejiang 315211 China
- Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences Ningbo University No.818 Fenghua Road, Ningbo Zhejiang 315211 China
| | - Ruiwen Ding
- Institute of Drug Discovery Technology Ningbo University, No.818 Fenghua Road, Ningbo Zhejiang 315211 China
- Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences Ningbo University No.818 Fenghua Road, Ningbo Zhejiang 315211 China
| | - Yan Liu
- College of Chemistry and Chemical Engineering Xiamen University, No. 422, Siming South Road Xiamen Fujian 361005 China
| | - Yufen Zhao
- Institute of Drug Discovery Technology Ningbo University, No.818 Fenghua Road, Ningbo Zhejiang 315211 China
- Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences Ningbo University No.818 Fenghua Road, Ningbo Zhejiang 315211 China
- College of Chemistry and Chemical Engineering Xiamen University, No. 422, Siming South Road Xiamen Fujian 361005 China
| |
Collapse
|
21
|
The Mutational Robustness of the Genetic Code and Codon Usage in Environmental Context: A Non-Extremophilic Preference? Life (Basel) 2021; 11:life11080773. [PMID: 34440517 PMCID: PMC8398314 DOI: 10.3390/life11080773] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 07/23/2021] [Accepted: 07/28/2021] [Indexed: 12/12/2022] Open
Abstract
The genetic code was evolved, to some extent, to minimize the effects of mutations. The effects of mutations depend on the amino acid repertoire, the structure of the genetic code and frequencies of amino acids in proteomes. The amino acid compositions of proteins and corresponding codon usages are still under selection, which allows us to ask what kind of environment the standard genetic code is adapted to. Using simple computational models and comprehensive datasets comprising genomic and environmental data from all three domains of Life, we estimate the expected severity of non-synonymous genomic mutations in proteins, measured by the change in amino acid physicochemical properties. We show that the fidelity in these physicochemical properties is expected to deteriorate with extremophilic codon usages, especially in thermophiles. These findings suggest that the genetic code performs better under non-extremophilic conditions, which not only explains the low substitution rates encountered in halophiles and thermophiles but the revealed relationship between the genetic code and habitat allows us to ponder on earlier phases in the history of Life.
Collapse
|
22
|
Phylogenetic analysis of mutational robustness based on codon usage supports that the standard genetic code does not prefer extreme environments. Sci Rep 2021; 11:10963. [PMID: 34040064 PMCID: PMC8154912 DOI: 10.1038/s41598-021-90440-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 05/10/2021] [Indexed: 02/04/2023] Open
Abstract
The mutational robustness of the genetic code is rarely discussed in the context of biological diversity, such as codon usage and related factors, often considered as independent of the actual organism's proteome. Here we put the living beings back to picture and use distortion as a metric of mutational robustness. Distortion estimates the expected severities of non-synonymous mutations measuring it by amino acid physicochemical properties and weighting for codon usage. Using the biological variance of codon frequencies, we interpret the mutational robustness of the standard genetic code with regards to their corresponding environments and genomic compositions (GC-content). Employing phylogenetic analyses, we show that coding fidelity in physicochemical properties can deteriorate with codon usages adapted to extreme environments and these putative effects are not the artefacts of phylogenetic bias. High temperature environments select for codon usages with decreased mutational robustness of hydrophobic, volumetric, and isoelectric properties. Selection at high saline concentrations also leads to reduced fidelity in polar and isoelectric patterns. These show that the genetic code performs best with mesophilic codon usages, strengthening the view that LUCA or its ancestors preferred lower temperature environments. Taxonomic implications, such as rooting the tree of life, are also discussed.
Collapse
|
23
|
Dujon B. On the origin of the genetic code: a 27-codon hypothetical precursor of an intricate 64-codon intermediate shaped the modern code. C R Biol 2021; 343:15-52. [PMID: 33988323 DOI: 10.5802/crbiol.47] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 03/03/2021] [Indexed: 11/24/2022]
Abstract
The modern genetic code reveals numerous traces of specific relationships between the early codons which, together with its internal asymmetries, suggest a sequential appearance of the nucleobases in primitive RNA molecules. Keeping the hypothesis of triplet pairings between primitive RNA molecules at the origin of the code, this work systematically examines complete codon-anticodon interaction matrices assuming distinct pairing options at each position of the triplet duplexes. Application of these principles suggests that a 27-codon precursor having a reasonable coding capacity for short peptide synthesis could have started with primitive RNA molecules able to form two distinct pairs with different free energies between a single purine and two pyrimidines (such as G with C and U). Conservation of the same pairing options at positions 1 and 2 of codons at the arrival of a second purine with distinct pairing preferences (such as A) generated a 64-codon intermediate code made of interrelated pairs or groups of codons (designated here as intricacy). The numerous traces of this hypothetical scheme that are visible in the standard and variant forms of the modern code demonstrate without ambiguity that the ancestral codon-anticodon duplexes required high energetic pairings at their central position (Watson-Crick) but tolerated less energetic pairings at the first codon position (G • U type). Combined with the sequential appearance of the nucleobases, the predicted codon intricacy allows a stepwise reconstruction of the evolution of the coding repertoire, by simple a posteriori comparison to the modern code. This reconstruction reveals a remarkable internal coherence in terms of amino acids and tRNA synthetases recruitment. The code started with a group of amino acids (Ala, Gly, Pro, Ser and Thr) that are now all activated by class II tRNA synthetases before reaching an intermediate period during which up to 14 distinct amino acids could be encoded by a full set of intricated codons. The perfect coincidence between the last 6 amino acids predicted in this reconstruction and the speculated action of the arrival of free atmospheric oxygen on proteins is spectacular, and suggests that the code has only reached its present form after the great oxidation event.
Collapse
Affiliation(s)
- Bernard Dujon
- Institut Pasteur, Dept. Genomes and Genetics, CNRS (UMR3525) and Sorbonne Université (UFR927), Paris, France
| |
Collapse
|
24
|
Abstract
Codon-dependent translation underlies genetics and phylogenetic inferences, but its origins pose two challenges. Prevailing narratives cannot account for the fact that aminoacyl-tRNA synthetases (aaRSs), which translate the genetic code, must collectively enforce the rules used to assemble themselves. Nor can they explain how specific assignments arose from rudimentary differentiation between ancestral aaRSs and corresponding transfer RNAs (tRNAs). Experimental deconstruction of the two aaRS superfamilies created new experimental tools with which to analyze the emergence of the code. Amino acid and tRNA substrate recognition are linked to phase transfer free energies of amino acids and arise largely from aaRS class-specific differences in secondary structure. Sensitivity to protein folding rules endowed ancestral aaRS-tRNA pairs with the feedback necessary to rapidly compare alternative genetic codes and coding sequences. These and other experimental data suggest that the aaRS bidirectional genetic ancestry stabilized the differentiation and interdependence required to initiate and elaborate the genetic coding table.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA;
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland 1142, New Zealand
| |
Collapse
|
25
|
Did Amino Acid Side Chain Reactivity Dictate the Composition and Timing of Aminoacyl-tRNA Synthetase Evolution? Genes (Basel) 2021; 12:genes12030409. [PMID: 33809136 PMCID: PMC8001834 DOI: 10.3390/genes12030409] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 03/05/2021] [Accepted: 03/10/2021] [Indexed: 12/16/2022] Open
Abstract
The twenty amino acids in the standard genetic code were fixed prior to the last universal common ancestor (LUCA). Factors that guided this selection included establishment of pathways for their metabolic synthesis and the concomitant fixation of substrate specificities in the emerging aminoacyl-tRNA synthetases (aaRSs). In this conceptual paper, we propose that the chemical reactivity of some amino acid side chains (e.g., lysine, cysteine, homocysteine, ornithine, homoserine, and selenocysteine) delayed or prohibited the emergence of the corresponding aaRSs and helped define the amino acids in the standard genetic code. We also consider the possibility that amino acid chemistry delayed the emergence of the glutaminyl- and asparaginyl-tRNA synthetases, neither of which are ubiquitous in extant organisms. We argue that fundamental chemical principles played critical roles in fixation of some aspects of the genetic code pre- and post-LUCA.
Collapse
|
26
|
Chirumbolo S, Vella A. Molecules, Information and the Origin of Life: What Is Next? Molecules 2021; 26:molecules26041003. [PMID: 33672848 PMCID: PMC7917628 DOI: 10.3390/molecules26041003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 02/09/2021] [Accepted: 02/10/2021] [Indexed: 12/20/2022] Open
Abstract
How life did originate and what is life, in its deepest foundation? The texture of life is known to be held by molecules and their chemical-physical laws, yet a thorough elucidation of the aforementioned questions still stands as a puzzling challenge for science. Focusing solely on molecules and their laws has indirectly consolidated, in the scientific knowledge, a mechanistic (reductionist) perspective of biology and medicine. This occurred throughout the long historical path of experimental science, affecting subsequently the onset of the many theses and speculations about the origin of life and its maintenance. Actually, defining what is life, asks for a novel epistemology, a ground on which living systems’ organization, whose origin is still questioned via chemistry, physics and even philosophy, may provide a new key to focus onto the complex nature of the human being. In this scenario, many issues, such as the role of information and water structure, have been long time neglected from the theoretical basis on the origin of life and marginalized as a kind of scenic backstage. On the contrary, applied science and technology went ahead on considering molecules as the sole leading components in the scenery. Water physics and information dynamics may have a role in living systems much more fundamental than ever expected. Can an organism be simply explained by a mechanistic view of its nature or we need “something else”? Probably, we can earn sound foundations about life by simply changing our prejudicial view about living systems simply as complex, highly ordered machines. In this manuscript we would like to reappraise many fundamental aspects of molecular and chemical biology and reading them through a new paradigm, which includes Prigogine’s dissipative structures and informational dissipation (Shannon dissipation). This would provide readers with insightful clues about how biology and chemistry may be thoroughly revised, referring to new models, such as informational dissipation. We trust they are enabled to address a straightforward contribution in elucidating what life is for science. This overview is not simply a philosophical speculation, but it would like to affect deeply our way to conceive and describe the foundations of organisms’ life, providing intriguing suggestions for readers in the field.
Collapse
Affiliation(s)
- Salvatore Chirumbolo
- Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, 37134 Verona, Italy
- Correspondence: ; Tel.: +39-0458027645
| | - Antonio Vella
- Verona-Unit of Immunology, Azienda Ospedaliera Universitaria Integrata, 37134 Verona, Italy;
| |
Collapse
|
27
|
Wills PR, Carter CW. Impedance Matching and the Choice Between Alternative Pathways for the Origin of Genetic Coding. Int J Mol Sci 2020; 21:E7392. [PMID: 33036401 PMCID: PMC7582391 DOI: 10.3390/ijms21197392] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 09/28/2020] [Accepted: 09/30/2020] [Indexed: 01/07/2023] Open
Abstract
We recently observed that errors in gene replication and translation could be seen qualitatively to behave analogously to the impedances in acoustical and electronic energy transducing systems. We develop here quantitative relationships necessary to confirm that analogy and to place it into the context of the minimization of dissipative losses of both chemical free energy and information. The formal developments include expressions for the information transferred from a template to a new polymer, Iσ; an impedance parameter, Z; and an effective alphabet size, neff; all of which have non-linear dependences on the fidelity parameter, q, and the alphabet size, n. Surfaces of these functions over the {n,q} plane reveal key new insights into the origin of coding. Our conclusion is that the emergence and evolutionary refinement of information transfer in biology follow principles previously identified to govern physical energy flows, strengthening analogies (i) between chemical self-organization and biological natural selection, and (ii) between the course of evolutionary trajectories and the most probable pathways for time-dependent transitions in physics. Matching the informational impedance of translation to the four-letter alphabet of genes uncovers a pivotal role for the redundancy of triplet codons in preserving as much intrinsic genetic information as possible, especially in early stages when the coding alphabet size was small.
Collapse
Affiliation(s)
- Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
| | - Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
28
|
Long X, Xue H, Wong JTF. Descent of Bacteria and Eukarya From an Archaeal Root of Life. Evol Bioinform Online 2020; 16:1176934320908267. [PMID: 32636606 PMCID: PMC7313328 DOI: 10.1177/1176934320908267] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 01/30/2020] [Indexed: 02/05/2023] Open
Abstract
The 3 biological domains delineated based on small subunit ribosomal RNAs (SSU rRNAs) are confronted by uncertainties regarding the relationship between Archaea and Bacteria, and the origin of Eukarya. The similarities between the paralogous valyl-tRNA and isoleucyl-tRNA synthetases in 5398 species estimated by BLASTP, which decreased from Archaea to Bacteria and further to Eukarya, were consistent with vertical gene transmission from an archaeal root of life close to Methanopyrus kandleri through a Primitive Archaea Cluster to an Ancestral Bacteria Cluster, and to Eukarya. The predominant similarities of the ribosomal proteins (rProts) of eukaryotes toward archaeal rProts relative to bacterial rProts established that an archaeal parent rather than a bacterial parent underwent genome merger with bacteria to generate eukaryotes with mitochondria. Eukaryogenesis benefited from the predominantly archaeal accelerated gene adoption (AGA) phenotype pertaining to horizontally transferred genes from other prokaryotes and expedited genome evolution via both gene-content mutations and nucleotidyl mutations. Archaeons endowed with substantial AGA activity were accordingly favored as candidate archaeal parents. Based on the top similarity bitscores displayed by their proteomes toward the eukaryotic proteomes of Giardia and Trichomonas, and high AGA activity, the Aciduliprofundum archaea were identified as leading candidates of the archaeal parent. The Asgard archaeons and a number of bacterial species were among the foremost potential contributors of eukaryotic-like proteins to Eukarya.
Collapse
Affiliation(s)
- Xi Long
- Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong, China
| | - Hong Xue
- Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong, China
| | - J Tze-Fei Wong
- Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong, China
| |
Collapse
|
29
|
On the Importance of Asymmetry in the Phenotypic Expression of the Genetic Code upon the Molecular Evolution of Proteins. Symmetry (Basel) 2020. [DOI: 10.3390/sym12060997] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The standard genetic code (SGC) is a mapping between the 64 possible arrangements of the four RNA nucleotides (C, A, U, G) into triplets or codons, where 61 codons are assigned to a specific amino acid and the other three are stop codons for terminating protein synthesis. Aminoacyl-tRNA synthetases (aaRSs) are responsible for implementing the SGC by specifically amino-acylating only its cognate transfer RNA (tRNA), thereby linking an amino acid with its corresponding anticodon triplets. tRNAs molecules bind each codon with its anticodon. To understand the meaning of symmetrical/asymmetrical properties of the SGC, we designed synthetic genetic codes with known symmetries and with the same degeneracy of the SGC. We determined their impact on the substitution rates for each amino acid under a neutral model of protein evolution. We prove that the phenotypic graphs of the SGC for codons and anticodons for all the possible arrangements of nucleotides are asymmetric and the amino acids do not form orbits. In the symmetrical synthetic codes, the amino acids are grouped according to their codonicity, this is the number of triplets encoding a given amino acid. Both the SGC and symmetrical synthetic codes exhibit a probability of occurrence of the amino acids proportional to their degeneracy. Unlike the SGC, the synthetic codes display a constant probability of occurrence of the amino acid according to their codonicity. The asymmetry of the phenotypic graphs of codons and anticodons of the SGC, has important implications on the evolutionary processes of proteins.
Collapse
|
30
|
Ruiz-Mirazo K, Shirt-Ediss B, Escribano-Cabeza M, Moreno A. The Construction of Biological 'Inter-Identity' as the Outcome of a Complex Process of Protocell Development in Prebiotic Evolution. Front Physiol 2020; 11:530. [PMID: 32547413 PMCID: PMC7269143 DOI: 10.3389/fphys.2020.00530] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2019] [Accepted: 04/29/2020] [Indexed: 11/25/2022] Open
Abstract
The concept of identity is used both (i) to distinguish a system as a particular material entity that is conserved as such in a given environment (token-identity: i.e., identity as permanence or endurance over time), and (ii) to relate a system with other members of a set (type-identity: i.e., identity as an equivalence relationship). Biological systems are characterized, in a minimal and universal sense, by a highly complex and dynamic, far-from-equilibrium organization of very diverse molecular components and transformation processes (i.e., 'genetically instructed cellular metabolisms') that maintain themselves in constant interaction with their corresponding environments, including other systems of similar nature. More precisely, all living entities depend on a deeply convoluted organization of molecules and processes (a naturalized von Neumann constructor architecture) that subsumes, in the form of current individuals (autonomous cells), a history of ecological and evolutionary interactions (across cell populations). So one can defend, on those grounds, that living beings have an identity of their own from both approximations: (i) and (ii). These transversal and trans-generational dimensions of biological phenomena, which unfold together with the actual process of biogenesis, must be carefully considered in order to understand the intricacies and metabolic robustness of the first living cells, their underlying uniformity (i.e., their common biochemical core) and the eradication of previous -or alternative- forms of complex natural phenomena. Therefore, a comprehensive approach to the origins of life requires conjugating the actual properties of the developing complex individuals (fusing and dividing protocells, at various stages) with other, population-level features, linked to their collective-evolutionary behavior, under much wider and longer-term parameters. On these lines, we will argue that life, in its most basic sense, here on Earth or anywhere else, demands crossing a high complexity threshold and that the concept of 'inter-identity' can help us realize the different aspects involved in the process. The article concludes by pointing out some of the challenges ahead if we are to integrate the corresponding explanatory frameworks, physiological and evolutionary, in the hope that a more general theory of biology is on its way.
Collapse
Affiliation(s)
- Kepa Ruiz-Mirazo
- Department of Logic and Philosophy of Science, University of the Basque Country, San Sebastian, Spain
- Biofisika Institute (CSIC, UPV-EHU), Leioa, Spain
| | - Ben Shirt-Ediss
- Interdisciplinary Computing and Complex BioSystems Group, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Miguel Escribano-Cabeza
- Department of Logic and Philosophy of Science, University of the Basque Country, San Sebastian, Spain
| | - Alvaro Moreno
- Department of Logic and Philosophy of Science, University of the Basque Country, San Sebastian, Spain
| |
Collapse
|
31
|
Takénaka A, Moras D. Correlation between equi-partition of aminoacyl-tRNA synthetases and amino-acid biosynthesis pathways. Nucleic Acids Res 2020; 48:3277-3285. [PMID: 31965182 PMCID: PMC7102985 DOI: 10.1093/nar/gkaa013] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Revised: 12/31/2019] [Accepted: 01/07/2020] [Indexed: 12/11/2022] Open
Abstract
The partition of aminoacyl-tRNA synthetases (aaRSs) into two classes of equal size and the correlated amino acid distribution is a puzzling still unexplained observation. We propose that the time scale of the amino-acid synthesis, assumed to be proportional to the number of reaction steps (NE) involved in the biosynthesis pathway, is one of the parameters that controlled the timescale of aaRSs appearance. Because all pathways are branched at fructose-6-phosphate on the metabolic pathway, this product is defined as the common origin for the NE comparison. For each amino-acid, the NE value, counted from the origin to the final product, provides a timescale for the pathways to be established. An archeological approach based on NE reveals that aaRSs of the two classes are generated in pair along this timescale. The results support the coevolution theory for the origin of the genetic code with an earlier appearance of class II aaRSs.
Collapse
Affiliation(s)
- Akio Takénaka
- Research Institute, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan.,Faculty of Pharmacy, Shenyang Pharmaceutical University, Benxi, Liaoning 117004, China
| | - Dino Moras
- Department of Integrated Structural Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC) 1 rue Laurent Fries, Illkirch 67404, France.,Centre National de Recherche Scientifique (CNRS) UMR 7104, France.,Institut National de Santé et de Recherche Médicale (INSERM) U1258, France.,Université de Strasbourg, Illkirch, France
| |
Collapse
|
32
|
Footprints of a Singular 22-Nucleotide RNA Ring at the Origin of Life. BIOLOGY 2020; 9:biology9050088. [PMID: 32344921 PMCID: PMC7285048 DOI: 10.3390/biology9050088] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 04/06/2020] [Accepted: 04/19/2020] [Indexed: 11/17/2022]
Abstract
(1) Background: Previous experimental observations and theoretical hypotheses have been providing insight into a hypothetical world where an RNA hairpin or ring may have debuted as the primary informational and functional molecule. We propose a model revisiting the architecture of RNA-peptide interactions at the origin of life through the evolutionary dynamics of RNA populations. (2) Methods: By performing a step-by-step computation of the smallest possible hairpin/ring RNA sequences compatible with building up a variety of peptides of the primitive network, we inferred the sequence of a singular docosameric RNA molecule, we call the ALPHA sequence. Then, we searched for any relics of the peptides made from ALPHA in sequences deposited in the different public databases. (3) Results: Sequence matching between ALPHA and sequences from organisms among the earliest forms of life on Earth were found at high statistical relevance. We hypothesize that the frequency of appearance of relics from ALPHA sequence in present genomes has a functional necessity. (4) Conclusions: Given the fitness of ALPHA as a supportive sequence of the framework of all existing theories, and the evolution of Archaea and giant viruses, it is anticipated that the unique properties of this singular archetypal ALPHA sequence should prove useful as a model matrix for future applications, ranging from synthetic biology to DNA computing.
Collapse
|
33
|
Błażej P, Wnetrzak M, Mackiewicz D, Mackiewicz P. Basic principles of the genetic code extension. ROYAL SOCIETY OPEN SCIENCE 2020; 7:191384. [PMID: 32257313 PMCID: PMC7062095 DOI: 10.1098/rsos.191384] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 01/09/2020] [Indexed: 05/08/2023]
Abstract
Compounds including non-canonical amino acids (ncAAs) or other artificially designed molecules can find a lot of applications in medicine, industry and biotechnology. They can be produced thanks to the modification or extension of the standard genetic code (SGC). Such peptides or proteins including the ncAAs can be constantly delivered in a stable way by organisms with the customized genetic code. Among several methods of engineering the code, using non-canonical base pairs is especially promising, because it enables generating many new codons, which can be used to encode any new amino acid. Since even one pair of new bases can extend the SGC up to 216 codons generated by a six-letter nucleotide alphabet, the extension of the SGC can be achieved in many ways. Here, we proposed a stepwise procedure of the SGC extension with one pair of non-canonical bases to minimize the consequences of point mutations. We reported relationships between codons in the framework of graph theory. All 216 codons were represented as nodes of the graph, whereas its edges were induced by all possible single nucleotide mutations occurring between codons. Therefore, every set of canonical and newly added codons induces a specific subgraph. We characterized the properties of the induced subgraphs generated by selected sets of codons. Thanks to that, we were able to describe a procedure for incremental addition of the set of meaningful codons up to the full coding system consisting of three pairs of bases. The procedure of gradual extension of the SGC makes the whole system robust to changing genetic information due to mutations and is compatible with the views assuming that codons and amino acids were added successively to the primordial SGC, which evolved minimizing harmful consequences of mutations or mistranslations of encoded proteins.
Collapse
Affiliation(s)
- Paweł Błażej
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, Poland
| | | | | | | |
Collapse
|
34
|
Auboeuf D. Physicochemical Foundations of Life that Direct Evolution: Chance and Natural Selection are not Evolutionary Driving Forces. Life (Basel) 2020; 10:life10020007. [PMID: 31973071 PMCID: PMC7175370 DOI: 10.3390/life10020007] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 01/15/2020] [Accepted: 01/16/2020] [Indexed: 12/11/2022] Open
Abstract
The current framework of evolutionary theory postulates that evolution relies on random mutations generating a diversity of phenotypes on which natural selection acts. This framework was established using a top-down approach as it originated from Darwinism, which is based on observations made of complex multicellular organisms and, then, modified to fit a DNA-centric view. In this article, it is argued that based on a bottom-up approach starting from the physicochemical properties of nucleic and amino acid polymers, we should reject the facts that (i) natural selection plays a dominant role in evolution and (ii) the probability of mutations is independent of the generated phenotype. It is shown that the adaptation of a phenotype to an environment does not correspond to organism fitness, but rather corresponds to maintaining the genome stability and integrity. In a stable environment, the phenotype maintains the stability of its originating genome and both (genome and phenotype) are reproduced identically. In an unstable environment (i.e., corresponding to variations in physicochemical parameters above a physiological range), the phenotype no longer maintains the stability of its originating genome, but instead influences its variations. Indeed, environment- and cellular-dependent physicochemical parameters define the probability of mutations in terms of frequency, nature, and location in a genome. Evolution is non-deterministic because it relies on probabilistic physicochemical rules, and evolution is driven by a bidirectional interplay between genome and phenotype in which the phenotype ensures the stability of its originating genome in a cellular and environmental physicochemical parameter-depending manner.
Collapse
Affiliation(s)
- Didier Auboeuf
- Laboratory of Biology and Modelling of the Cell, Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, 46 Allée d'Italie, Site Jacques Monod, F-69007, Lyon, France
| |
Collapse
|
35
|
Kubyshkin V, Budisa N. Anticipating alien cells with alternative genetic codes: away from the alanine world! Curr Opin Biotechnol 2019; 60:242-249. [DOI: 10.1016/j.copbio.2019.05.006] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 05/07/2019] [Indexed: 12/24/2022]
|
36
|
Carter CW, Wills PR. Experimental solutions to problems defining the origin of codon-directed protein synthesis. Biosystems 2019; 183:103979. [PMID: 31176803 PMCID: PMC6693952 DOI: 10.1016/j.biosystems.2019.103979] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 05/27/2019] [Accepted: 05/29/2019] [Indexed: 12/13/2022]
Abstract
How genetic coding differentiated biology from chemistry is a long-standing challenge in Biology, for which there have been few experimental approaches, despite a wide-ranging speculative literature. We summarize five coordinated areas-experimental characterization of functional approximations to the minimal peptides (protozymes and urzymes) necessary to activate amino acids and acylate tRNA; showing that specificities of these experimental models match those expected from the synthetase Class division; population of disjoint regions of amino acid sequence space via bidirectional coding ancestry of the two synthetase Classes; showing that the phase transfer equilibria of amino acid side chains that form a two-dimensional basis set for protein folding are embedded in patterns of bases in the tRNA acceptor stem and anticodon; and identification of molecular signatures of ancestral synthetases and tRNAs necessary to define the earliest cognate synthetase:tRNA pairs-that now compose an extensive experimentally testable paradigm for progress toward understanding the coordinated emergence of the codon table and viable mRNA coding sequences. We briefly discuss recent progress toward identifying the remaining outstanding questions-the nature of the earliest amino acid alphabets and the origin of binding discrimination via distinct amino acid sequence-independent protein secondary structures-and how these, too, might be addressed experimentally.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States
| | - Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
| |
Collapse
|
37
|
Adaptive Properties of the Genetically Encoded Amino Acid Alphabet Are Inherited from Its Subsets. Sci Rep 2019; 9:12468. [PMID: 31462646 PMCID: PMC6713743 DOI: 10.1038/s41598-019-47574-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 07/08/2019] [Indexed: 01/11/2023] Open
Abstract
Life uses a common set of 20 coded amino acids (CAAs) to construct proteins. This set was likely canonicalized during early evolution; before this, smaller amino acid sets were gradually expanded as new synthetic, proofreading and coding mechanisms became biologically available. Many possible subsets of the modern CAAs or other presently uncoded amino acids could have comprised the earlier sets. We explore the hypothesis that the CAAs were selectively fixed due to their unique adaptive chemical properties, which facilitate folding, catalysis, and solubility of proteins, and gave adaptive value to organisms able to encode them. Specifically, we studied in silico hypothetical CAA sets of 3–19 amino acids comprised of 1913 structurally diverse α-amino acids, exploring the adaptive value of their combined physicochemical properties relative to those of the modern CAA set. We find that even hypothetical sets containing modern CAA members are especially adaptive; it is difficult to find sets even among a large choice of alternatives that cover the chemical property space more amply. These results suggest that each time a CAA was discovered and embedded during evolution, it provided an adaptive value unusual among many alternatives, and each selective step may have helped bootstrap the developing set to include still more CAAs.
Collapse
|
38
|
A tRNA- and Anticodon-Centric View of the Evolution of Aminoacyl-tRNA Synthetases, tRNAomes, and the Genetic Code. Life (Basel) 2019; 9:life9020037. [PMID: 31060233 PMCID: PMC6616430 DOI: 10.3390/life9020037] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Revised: 04/24/2019] [Accepted: 05/01/2019] [Indexed: 11/25/2022] Open
Abstract
Pathways of standard genetic code evolution remain conserved and apparent, particularly upon analysis of aminoacyl-tRNA synthetase (aaRS) lineages. Despite having incompatible active site folds, class I and class II aaRS are homologs by sequence. Specifically, structural class IA aaRS enzymes derive from class IIA aaRS enzymes by in-frame extension of the protein N-terminus and by an alternate fold nucleated by the N-terminal extension. The divergence of aaRS enzymes in the class I and class II clades was analyzed using the Phyre2 protein fold recognition server. The class I aaRS radiated from the class IA enzymes, and the class II aaRS radiated from the class IIA enzymes. The radiations of aaRS enzymes bolster the coevolution theory for evolution of the amino acids, tRNAomes, the genetic code, and aaRS enzymes and support a tRNA anticodon-centric perspective. We posit that second- and third-position tRNA anticodon sequence preference (C>(U~G)>A) powerfully selected the sectoring pathway for the code. GlyRS-IIA appears to have been the primordial aaRS from which all aaRS enzymes evolved, and glycine appears to have been the primordial amino acid around which the genetic code evolved.
Collapse
|
39
|
BłaŻej P, Wnetrzak M, Mackiewicz D, Mackiewicz P. The influence of different types of translational inaccuracies on the genetic code structure. BMC Bioinformatics 2019; 20:114. [PMID: 30841864 PMCID: PMC6404327 DOI: 10.1186/s12859-019-2661-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 01/29/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The standard genetic code is a recipe for assigning unambiguously 21 labels, i.e. amino acids and stop translation signal, to 64 codons. However, at early stages of the translational machinery development, the codons did not have to be read unambiguously and the early genetic codes could have contained some ambiguous assignments of codons to amino acids. Therefore, the goal of this work was to obtain the genetic code structures which could have evolved assuming different types of inaccuracy of the translational machinery starting from unambiguous assignments of codons to amino acids. RESULTS We developed a theoretical model assuming that the level of uncertainty of codon assignments can gradually decrease during the simulations. Since it is postulated that the standard code has evolved to be robust against point mutations and mistranslations, we developed three simulation scenarios assuming that such errors can influence one, two or three codon positions. The simulated codes were selected using the evolutionary algorithm methodology to decrease coding ambiguity and increase their robustness against mistranslation. CONCLUSIONS The results indicate that the typical codon block structure of the genetic code could have evolved to decrease the ambiguity of amino acid to codon assignments and to increase the fidelity of reading the genetic information. However, the robustness to errors was not the decisive factor that influenced the genetic code evolution because it is possible to find theoretical codes that minimize the reading errors better than the standard genetic code.
Collapse
Affiliation(s)
- Paweł BłaŻej
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Małgorzata Wnetrzak
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Dorota Mackiewicz
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Paweł Mackiewicz
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| |
Collapse
|
40
|
Chatterjee S, Yadav S. The Origin of Prebiotic Information System in the Peptide/RNA World: A Simulation Model of the Evolution of Translation and the Genetic Code. Life (Basel) 2019; 9:E25. [PMID: 30832272 PMCID: PMC6463137 DOI: 10.3390/life9010025] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 01/09/2019] [Accepted: 02/25/2019] [Indexed: 12/20/2022] Open
Abstract
Information is the currency of life, but the origin of prebiotic information remains a mystery. We propose transitional pathways from the cosmic building blocks of life to the complex prebiotic organic chemistry that led to the origin of information systems. The prebiotic information system, specifically the genetic code, is segregated, linear, and digital, and it appeared before the emergence of DNA. In the peptide/RNA world, lipid membranes randomly encapsulated amino acids, RNA, and peptide molecules, which are drawn from the prebiotic soup, to initiate a molecular symbiosis inside the protocells. This endosymbiosis led to the hierarchical emergence of several requisite components of the translation machine: transfer RNAs (tRNAs), aminoacyl-tRNA synthetase (aaRS), messenger RNAs (mRNAs), ribosomes, and various enzymes. When assembled in the right order, the translation machine created proteins, a process that transferred information from mRNAs to assemble amino acids into polypeptide chains. This was the beginning of the prebiotic information age. The origin of the genetic code is enigmatic; herein, we propose an evolutionary explanation: the demand for a wide range of protein enzymes over peptides in the prebiotic reactions was the main selective pressure for the origin of information-directed protein synthesis. The molecular basis of the genetic code manifests itself in the interaction of aaRS and their cognate tRNAs. In the beginning, aminoacylated ribozymes used amino acids as a cofactor with the help of bridge peptides as a process for selection between amino acids and their cognate codons/anticodons. This process selects amino acids and RNA species for the next steps. The ribozymes would give rise to pre-tRNA and the bridge peptides to pre-aaRS. Later, variants would appear and evolution would produce different but specific aaRS-tRNA-amino acid combinations. Pre-tRNA designed and built pre-mRNA for the storage of information regarding its cognate amino acid. Each pre-mRNA strand became the storage device for the genetic information that encoded the amino acid sequences in triplet nucleotides. As information appeared in the digital languages of the codon within pre-mRNA and mRNA, and the genetic code for protein synthesis evolved, the prebiotic chemistry then became more organized and directional with the emergence of the translation and genetic code. The genetic code developed in three stages that are coincident with the refinement of the translation machines: the GNC code that was developed by the pre-tRNA/pre-aaRS /pre-mRNA machine, SNS code by the tRNA/aaRS/mRNA machine, and finally the universal genetic code by the tRNA/aaRS/mRNA/ribosome machine. We suggest the coevolution of translation machines and the genetic code. The emergence of the translation machines was the beginning of the Darwinian evolution, an interplay between information and its supporting structure. Our hypothesis provides the logical and incremental steps for the origin of the programmed protein synthesis. In order to better understand the prebiotic information system, we converted letter codons into numerical codons in the Universal Genetic Code Table. We have developed a software, called CATI (Codon-Amino Acid-Translator-Imitator), to translate randomly chosen numerical codons into corresponding amino acids and vice versa. This conversion has granted us insight into how the genetic code might have evolved in the peptide/RNA world. There is great potential in the application of numerical codons to bioinformatics, such as barcoding, DNA mining, or DNA fingerprinting. We constructed the likely biochemical pathways for the origin of translation and the genetic code using the Model-View-Controller (MVC) software framework, and the translation machinery step-by-step. While using AnyLogic software, we were able to simulate and visualize the entire evolution of the translation machines, amino acids, and the genetic code.
Collapse
Affiliation(s)
- Sankar Chatterjee
- Department of Geosciences, Museum of Texas Tech University, Box 43191, 3301 4th Street, Lubbock, TX 79409, USA.
| | - Surya Yadav
- Rawls College of Business, Texas Tech University, Box 42101, 703 Flint Avenue, Lubbock, TX 79409, USA.
| |
Collapse
|
41
|
Newton MS, Morrone DJ, Lee KH, Seelig B. Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced-Alphabet Libraries. Chembiochem 2019; 20:846-856. [PMID: 30511381 DOI: 10.1002/cbic.201800668] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Indexed: 11/08/2022]
Abstract
The universal genetic code of 20 amino acids is the product of evolution. It is believed that earlier versions of the code had fewer residues. Many theories for the order in which amino acids were integrated into the code have been proposed, considering factors ranging from prebiotic chemistry to codon capture. Several meta-analyses combined these theories to yield a feasible consensus chronology of the genetic code's evolution, but there is a dearth of experimental data to test the hypothesised order. We used combinatorial chemistry to synthesise libraries of random polypeptides that were based on different subsets of the 20 standard amino acids, thus representing different stages of a plausible history of the alphabet. Four libraries were comprised of the five, nine, and 16 most ancient amino acids, and all 20 extant residues for a direct side-by-side comparison. We characterised numerous variants from each library for their solubility and propensity to form secondary, tertiary or quaternary structures. Proteins from the two most ancient libraries were more likely to be soluble than those from the extant library. Several individual protein variants exhibited inducible protein folding and other traits typical of intrinsically disordered proteins. From these libraries, we can infer how primordial protein structure and function might have evolved with the genetic code.
Collapse
Affiliation(s)
- Matilda S Newton
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA.,BioTechnology Institute, University of Minnesota, 1479 Gortner Avenue, 140 Gortner Laboratory, St. Paul, MN, 55108-6106, USA
| | - Dana J Morrone
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA.,BioTechnology Institute, University of Minnesota, 1479 Gortner Avenue, 140 Gortner Laboratory, St. Paul, MN, 55108-6106, USA
| | - Kun-Hwa Lee
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA.,BioTechnology Institute, University of Minnesota, 1479 Gortner Avenue, 140 Gortner Laboratory, St. Paul, MN, 55108-6106, USA
| | - Burckhard Seelig
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA.,BioTechnology Institute, University of Minnesota, 1479 Gortner Avenue, 140 Gortner Laboratory, St. Paul, MN, 55108-6106, USA
| |
Collapse
|
42
|
Abstract
Photosynthesis and nitrogen fixation became evolutionarily immutable as “frozen metabolic accidents” because multiple interactions between the proteins and protein complexes involved led to their co-evolution in modules. This has impeded their adaptation to an oxidizing atmosphere, and reconfiguration now requires modification or replacement of whole modules, using either natural modules from exotic species or non-natural proteins with similar interaction potential. Ultimately, the relevant complexes might be reconstructed (almost) from scratch, starting either from appropriate precursor processes or by designing alternative pathways. These approaches will require advances in synthetic biology, laboratory evolution, and a better understanding of module functions.
Collapse
Affiliation(s)
- Dario Leister
- Faculty of Biology, Ludwig-Maximilians-University Munich, Großhaderner Str. 2, 82152, Planegg-Martinsried, Germany.
| |
Collapse
|
43
|
Rogers SO. Evolution of the genetic code based on conservative changes of codons, amino acids, and aminoacyl tRNA synthetases. J Theor Biol 2019; 466:1-10. [PMID: 30658052 DOI: 10.1016/j.jtbi.2019.01.022] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Revised: 01/10/2019] [Accepted: 01/14/2019] [Indexed: 11/30/2022]
Abstract
The genetic code, as arranged in the standard tabular form, displays a non-random structure relating to the characteristics of the amino acids. An alternative arrangement can be made by organizing the code according to aminoacyl-tRNA synthetases (aaRSs), codons, and reverse complement codons, which illuminates a coevolutionary process that led to the contemporary genetic code. As amino acids were added to the genetic code, they were recognized by aaRSs that interact with stereochemically similar amino acids. Single nucleotide changes in the codons and anticodons were favored over more extensive changes, such that there was a logical stepwise progression in the evolution of the genetic code. The model presented traces the evolution of the genetic code accounting for these steps. Amino acid frequencies in ancient proteins and the preponderance of GNN codons in mRNAs for ancient proteins indicate that the genetic code began with alanine, aspartate, glutamate, glycine, and valine, with alanine being in the highest proportions. In addition to being consistent in terms of conservative changes in codon nucleotides, the model also is consistent with respect to aaRS classes, aaRS attachment to the tRNA, amino acid stereochemistry, and to a large extent with amino acid physicochemistry, and biochemical pathways.
Collapse
Affiliation(s)
- Scott O Rogers
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH, United States.
| |
Collapse
|
44
|
The Ribosome as a Missing Link in Prebiotic Evolution III: Over-Representation of tRNA- and rRNA-Like Sequences and Plieofunctionality of Ribosome-Related Molecules Argues for the Evolution of Primitive Genomes from Ribosomal RNA Modules. Int J Mol Sci 2019; 20:ijms20010140. [PMID: 30609737 PMCID: PMC6337102 DOI: 10.3390/ijms20010140] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Revised: 12/21/2018] [Accepted: 12/23/2018] [Indexed: 12/18/2022] Open
Abstract
We propose that ribosomal RNA (rRNA) formed the basis of the first cellular genomes, and provide evidence from a review of relevant literature and proteonomic tests. We have proposed previously that the ribosome may represent the vestige of the first self-replicating entity in which rRNAs also functioned as genes that were transcribed into functional messenger RNAs (mRNAs) encoding ribosomal proteins. rRNAs also encoded polymerases to replicate itself and a full complement of the transfer RNAs (tRNAs) required to translate its genes. We explore here a further prediction of our “ribosome-first” theory: the ribosomal genome provided the basis for the first cellular genomes. Modern genomes should therefore contain an unexpectedly large percentage of tRNA- and rRNA-like modules derived from both sense and antisense reading frames, and these should encode non-ribosomal proteins, as well as ribosomal ones with key cell functions. Ribosomal proteins should also have been co-opted by cellular evolution to play extra-ribosomal functions. We review existing literature supporting these predictions. We provide additional, new data demonstrating that rRNA-like sequences occur at significantly higher frequencies than predicted on the basis of mRNA duplications or randomized RNA sequences. These data support our “ribosome-first” theory of cellular evolution.
Collapse
|
45
|
Wnętrzak M, Błażej P, Mackiewicz D, Mackiewicz P. The optimality of the standard genetic code assessed by an eight-objective evolutionary algorithm. BMC Evol Biol 2018; 18:192. [PMID: 30545289 PMCID: PMC6293558 DOI: 10.1186/s12862-018-1304-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 11/22/2018] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The standard genetic code (SGC) is a unique set of rules which assign amino acids to codons. Similar amino acids tend to have similar codons indicating that the code evolved to minimize the costs of amino acid replacements in proteins, caused by mutations or translational errors. However, if such optimization in fact occurred, many different properties of amino acids must have been taken into account during the code evolution. Therefore, this problem can be reformulated as a multi-objective optimization task, in which the selection constraints are represented by measures based on various amino acid properties. RESULTS To study the optimality of the SGC we applied a multi-objective evolutionary algorithm and we used the representatives of eight clusters, which grouped over 500 indices describing various physicochemical properties of amino acids. Thanks to that we avoided an arbitrary choice of amino acid features as optimization criteria. As a consequence, we were able to conduct a more general study on the properties of the SGC than the ones presented so far in other papers on this topic. We considered two models of the genetic code, one preserving the characteristic codon blocks structure of the SGC and the other without this restriction. The results revealed that the SGC could be significantly improved in terms of error minimization, hereby it is not fully optimized. Its structure differs significantly from the structure of the codes optimized to minimize the costs of amino acid replacements. On the other hand, using newly defined quality measures that placed the SGC in the global space of theoretical genetic codes, we showed that the SGC is definitely closer to the codes that minimize the costs of amino acids replacements than those maximizing them. CONCLUSIONS The standard genetic code represents most likely only partially optimized systems, which emerged under the influence of many different factors. Our findings can be useful to researchers involved in modifying the genetic code of the living organisms and designing artificial ones.
Collapse
Affiliation(s)
- Małgorzata Wnętrzak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland
| | - Paweł Błażej
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland.
| |
Collapse
|
46
|
Błażej P, Wnętrzak M, Mackiewicz D, Mackiewicz P. Optimization of the standard genetic code according to three codon positions using an evolutionary algorithm. PLoS One 2018; 13:e0201715. [PMID: 30092017 PMCID: PMC6084934 DOI: 10.1371/journal.pone.0201715] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 07/21/2018] [Indexed: 12/28/2022] Open
Abstract
Many biological systems are typically examined from the point of view of adaptation to certain conditions or requirements. One such system is the standard genetic code (SGC), which generally minimizes the cost of amino acid replacements resulting from mutations or mistranslations. However, no full consensus has been reached on the factors that caused the evolution of this feature. One of the hypotheses suggests that code optimality was directly selected as an advantage to preserve information about encoded proteins. An important feature that should be considered when studying the SGC is the different roles of the three codon positions. Therefore, we investigated the robustness of this code regarding the cost of amino acid replacements resulting from substitutions in these positions separately and the sum of these costs. We applied a modified evolutionary algorithm and included four models of the genetic code assuming various restrictions on its structure. The SGC was compared both with the codes that minimize the objective function and those that maximize it. This approach allowed us to place the SGC in the global space of possible codes, which is a more appropriate and unbiased comparison than that with randomly generated codes because they are characterized by relatively uniform amino acid assignments to codons. The SGC appeared to be well optimized at the global scale, but its individual positions were not fully optimized because there were codes that were optimized for only one codon position and simultaneously outperformed the SGC at the other positions. We also found that different code structures may lead to the same optimality and that random codes can show a tendency to minimize costs under some of the genetic code models. Our results suggest that the optimality of SGC could be a by-product of other processes.
Collapse
Affiliation(s)
- Paweł Błażej
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Małgorzata Wnętrzak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
- * E-mail:
| |
Collapse
|
47
|
Tripathi S, Deem MW. The Standard Genetic Code Facilitates Exploration of the Space of Functional Nucleotide Sequences. J Mol Evol 2018; 86:325-339. [PMID: 29959476 DOI: 10.1007/s00239-018-9852-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Accepted: 06/21/2018] [Indexed: 01/07/2023]
Abstract
The standard genetic code is well known to be optimized for minimizing the phenotypic effects of single-nucleotide substitutions, a property that was likely selected for during the emergence of a universal code. Given the fitness advantage afforded by high standing genetic diversity in a population in a dynamic environment, it is possible that selection to explore a large fraction of the space of functional proteins also occurred. To determine whether selection for such a property played a role during the emergence of the nearly universal standard genetic code, we investigated the number of functional variants of the Escherichia coli PhoQ protein explored at different time scales under translation using different genetic codes. We found that the standard genetic code is highly optimal for exploring a large fraction of the space of functional PhoQ variants at intermediate time scales as compared to random codes. Environmental changes, in response to which genetic diversity in a population provides a fitness advantage, are likely to have occurred at these intermediate time scales. Our results indicate that the ability of the standard code to explore a large fraction of the space of functional sequence variants arises from a balance between robustness and flexibility and is largely independent of the property of the standard code to minimize the phenotypic effects of mutations. We propose that selection to explore a large fraction of the functional sequence space while minimizing the phenotypic effects of mutations contributed toward the emergence of the standard code as the universal genetic code.
Collapse
Affiliation(s)
- Shubham Tripathi
- PhD Program in Systems, Synthetic, and Physical Biology, Rice University, Houston, TX, 77005, USA
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
| | - Michael W Deem
- PhD Program in Systems, Synthetic, and Physical Biology, Rice University, Houston, TX, 77005, USA.
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA.
- Department of Bioengineering, Rice University, Houston, TX, 77005, USA.
- Department of Physics and Astronomy, Rice University, Houston, TX, 77005, USA.
| |
Collapse
|
48
|
Di Giulio M. On Earth, there would be a number of fundamental kinds of primary cells – cellular domains – greater than or equal to four. J Theor Biol 2018; 443:10-17. [DOI: 10.1016/j.jtbi.2018.01.025] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 01/10/2018] [Accepted: 01/19/2018] [Indexed: 11/15/2022]
|
49
|
Carter CW, Wills PR. Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding. Mol Biol Evol 2018; 35:269-286. [PMID: 29077934 PMCID: PMC5850816 DOI: 10.1093/molbev/msx265] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma's emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene-replicase-translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland, New Zealand
| |
Collapse
|
50
|
The evolution of the genetic code: Impasses and challenges. Biosystems 2018; 164:217-225. [DOI: 10.1016/j.biosystems.2017.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 10/06/2017] [Accepted: 10/09/2017] [Indexed: 01/17/2023]
|