1
|
Patra SK, Randolph N, Kuhlman B, Dieckhaus H, Betts L, Douglas J, Wills PR, Carter CW. Aminoacyl-tRNA synthetase urzymes optimized by deep learning behave as a quasispecies. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2025; 12:024701. [PMID: 40290414 PMCID: PMC12033045 DOI: 10.1063/4.0000294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2025] [Accepted: 03/19/2025] [Indexed: 04/30/2025]
Abstract
Protein design plays a key role in our efforts to work out how genetic coding began. That effort entails urzymes. Urzymes are small, conserved excerpts from full-length aminoacyl-tRNA synthetases that remain active. Urzymes require design to connect disjoint pieces and repair naked nonpolar patches created by removing large domains. Rosetta allowed us to create the first urzymes, but those urzymes were only sparingly soluble. We could measure activity, but it was hard to concentrate those samples to levels required for structural biology. Here, we used the deep learning algorithms ProteinMPNN and AlphaFold2 to redesign a set of optimized LeuAC urzymes derived from leucyl-tRNA synthetase. We select a balanced, representative subset of eight variants for testing using principal component analysis. Most tested variants are much more soluble than the original LeuAC. They also span a range of catalytic proficiency and amino acid specificity. The data enable detailed statistical analyses of the sources of both solubility and specificity. In that way, we show how to begin to unwrap the elements of protein chemistry that were hidden within the neural networks. Deep learning networks have thus helped us surmount several vexing obstacles to further investigations into the nature of ancestral proteins. Finally, we discuss how the eight variants might resemble a sample drawn from a population similar to one subject to natural selection.
Collapse
Affiliation(s)
- Sourav Kumar Patra
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
| | - Nicholas Randolph
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
| | | | | | - Laurie Betts
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
| | - Jordan Douglas
- Department of Physics, University of Auckland, Auckland, New Zealand
| | - Peter R. Wills
- Department of Physics, University of Auckland, Auckland, New Zealand
| | | |
Collapse
|
2
|
Lei L, Burton ZF. Chemical Evolution of Life on Earth. Genes (Basel) 2025; 16:220. [PMID: 40004549 PMCID: PMC11854950 DOI: 10.3390/genes16020220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2025] [Revised: 02/04/2025] [Accepted: 02/08/2025] [Indexed: 02/27/2025] Open
Abstract
Background/Objectives: The origin of genes and genetics is the story of the coevolution of translation systems and the genetic code. Remarkably, the history of the origin of life on Earth was inscribed and preserved in the sequences of tRNAs. Methods: Sequence logos demonstrate the patterning of pre-life tRNA sequences. Results: The pre-life type I and type II tRNA sequences are known to the last nucleotide with only a few ambiguities. Type I and type II tRNAs evolved from ligation of three 31 nt minihelices of highly patterned and known sequence followed by closely related 9 nt internal deletion(s) within ligated acceptor stems. The D loop 17 nt core was a truncated UAGCC repeat. The anticodon and T 17 nt stem-loop-stems are homologous sequences with 5 nt stems and 7 nt U-turn loops that were selected in pre-life to resist ribozyme nucleases and to present a 3 nt anticodon with a single wobble position. The 7 nt T loop in tRNA was selected to interact with the D loop at the "elbow". The 5'-acceptor stem was based on a 7 nt truncated GCG repeat. The 3'-acceptor stem was based on a complementary 7 nt CGC repeat. In pre-life, ACCA-Gly was a primitive adapter molecule ligated to many RNAs, including tRNAs, to synthesize polyglycine. Conclusions: Analysis of sequence logos of tRNAs from an ancient Archaeon substantiates how the pre-life to life transition occurred on Earth. Polyglycine is posited to have aggregated complex molecular assemblies, including minihelices, tRNAs, cooperating molecules, and protocells, leading to the first life on Earth.
Collapse
Affiliation(s)
- Lei Lei
- School of Biological Sciences, University of New England, Biddeford, ME 04005, USA;
| | - Zachary Frome Burton
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
3
|
Carter CW, Tang GQ, Patra SK, Betts L, Dieckhaus H, Kuhlman B, Douglas J, Wills PR, Bouckaert R, Popovic M, Ditzler MA. WITHDRAWN: Structural Enzymology, Phylogenetics, Differentiation, and Symbolic Reflexivity at the Dawn of Biology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.12.17.628912. [PMID: 39763899 PMCID: PMC11702779 DOI: 10.1101/2024.12.17.628912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/18/2025]
Abstract
This manuscript was posted without the final consent of all authors. The authors have therefore withdrawn it. The authors do not wish this work to be cited as reference for the project. If you have any questions, please contact the corresponding author, carter@med.unc.edu .
Collapse
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Guo Qing Tang
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Sourav Kumar Patra
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Laurie Betts
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
| | - Henry Dieckhaus
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
- Lineberger Comprehensive Cancer Center, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
- Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Jordan Douglas
- Department of Physics, Auckland University, Auckland, NZ
- Department of Computer Science, Auckland University, Auckland, NZ
| | - Peter R. Wills
- Department of Physics, Auckland University, Auckland, NZ
| | - Remco Bouckaert
- Department of Computer Science, Auckland University, Auckland, NZ
| | | | | |
Collapse
|
4
|
Patra S, Douglas J, Wills P, Betts L, Qing T, Carter C. A genomic database furnishes minimal functional glycyl-tRNA synthetases homologous to other, designed class II urzymes. Nucleic Acids Res 2024; 52:13305-13324. [PMID: 39494520 PMCID: PMC11602164 DOI: 10.1093/nar/gkae992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 10/18/2024] [Indexed: 11/05/2024] Open
Abstract
The hypothesis that conserved core catalytic sites could represent ancestral aminoacyl-tRNA synthetases (AARS) drove the design of functional TrpRS, LeuRS, and HisRS 'urzymes'. We describe here new urzymes detected in the genomic record of the arctic fox, Vulpes lagopus. They are homologous to the α-subunit of bacterial heterotetrameric Class II glycyl-tRNA synthetase (GlyRS-B) enzymes. AlphaFold2 predicted that the N-terminal 81 amino acids would adopt a 3D structure nearly identical to our designed HisRS urzyme (HisCA1). We expressed and purified that N-terminal segment and the spliced open reading frame GlyCA1-2. Both exhibit robust single-turnover burst sizes and ATP consumption rates higher than those previously published for HisCA urzymes and comparable to those for LeuAC and TrpAC. GlyCA is more than twice as active in glycine activation by adenosine triphosphate as the full-length GlyRS-B α2 dimer. Michaelis-Menten rate constants for all three substrates reveal significant coupling between Exon2 and both substrates. GlyCA activation favors Class II amino acids that complement those favored by HisCA and LeuAC. Structural features help explain these results. These minimalist GlyRS catalysts are thus homologous to previously described urzymes. Their properties reinforce the notion that urzymes may have the requisite catalytic activities to implement a reduced, ancestral genetic coding alphabet.
Collapse
Affiliation(s)
- Sourav Kumar Patra
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jordan Douglas
- Department of Physics, The University of Auckland, Auckland 1042, New Zealand
- Centre for Computational Evolution, University of Auckland, 1010, New Zealand
| | - Peter R Wills
- Department of Physics, The University of Auckland, Auckland 1042, New Zealand
| | - Laurie Betts
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Tang Guo Qing
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
5
|
Carter CW. Base Pairing Promoted the Self-Organization of Genetic Coding, Catalysis, and Free-Energy Transduction. Life (Basel) 2024; 14:199. [PMID: 38398709 PMCID: PMC10890426 DOI: 10.3390/life14020199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 01/21/2024] [Accepted: 01/25/2024] [Indexed: 02/25/2024] Open
Abstract
How Nature discovered genetic coding is a largely ignored question, yet the answer is key to explaining the transition from biochemical building blocks to life. Other, related puzzles also fall inside the aegis enclosing the codes themselves. The peptide bond is unstable with respect to hydrolysis. So, it requires some form of chemical free energy to drive it. Amino acid activation and acyl transfer are also slow and must be catalyzed. All living things must thus also convert free energy and synchronize cellular chemistry. Most importantly, functional proteins occupy only small, isolated regions of sequence space. Nature evolved heritable symbolic data processing to seek out and use those sequences. That system has three parts: a memory of how amino acids behave in solution and inside proteins, a set of code keys to access that memory, and a scoring function. The code keys themselves are the genes for cognate pairs of tRNA and aminoacyl-tRNA synthetases, AARSs. The scoring function is the enzymatic specificity constant, kcat/kM, which measures both catalysis and specificity. The work described here deepens the evidence for and understanding of an unexpected consequence of ancestral bidirectional coding. Secondary structures occur in approximately the same places within antiparallel alignments of their gene products. However, the polar amino acids that define the molecular surface of one are reflected into core-defining non-polar side chains on the other. Proteins translated from base-paired coding strands fold up inside out. Bidirectional genes thus project an inverted structural duality into the proteome. I review how experimental data root the scoring functions responsible for the origins of coding and catalyzed activation of unfavorable chemical reactions in that duality.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
6
|
Douglas J, Bouckaert R, Carter CW, Wills P. Enzymic recognition of amino acids drove the evolution of primordial genetic codes. Nucleic Acids Res 2024; 52:558-571. [PMID: 38048305 PMCID: PMC10810186 DOI: 10.1093/nar/gkad1160] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 10/28/2023] [Accepted: 11/20/2023] [Indexed: 12/06/2023] Open
Abstract
How genetic information gained its exquisite control over chemical processes needed to build living cells remains an enigma. Today, the aminoacyl-tRNA synthetases (AARS) execute the genetic codes in all living systems. But how did the AARS that emerged over three billion years ago as low-specificity, protozymic forms then spawn the full range of highly-specific enzymes that distinguish between 22 diverse amino acids? A phylogenetic reconstruction of extant AARS genes, enhanced by analysing modular acquisitions, reveals six AARS with distinct bacterial, archaeal, eukaryotic, or organellar clades, resulting in a total of 36 families of AARS catalytic domains. Small structural modules that differentiate one AARS family from another played pivotal roles in discriminating between amino acid side chains, thereby expanding the genetic code and refining its precision. The resulting model shows a tendency for less elaborate enzymes, with simpler catalytic domains, to activate amino acids that were not synthesised until later in the evolution of the code. The most probable evolutionary route for an emergent amino acid type to establish a place in the code was by recruiting older, less specific AARS, rather than adapting contemporary lineages. This process, retrofunctionalisation, differs from previously described mechanisms through which amino acids would enter the code.
Collapse
Affiliation(s)
- Jordan Douglas
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, The University of Auckland, New Zealand
| | - Remco Bouckaert
- Centre for Computational Evolution, The University of Auckland, New Zealand
- School of Computer Science, The University of Auckland, New Zealand
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, USA
| | - Peter R Wills
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, The University of Auckland, New Zealand
| |
Collapse
|
7
|
Tang GQ, Elder JJH, Douglas J, Carter CW. Domain acquisition by class I aminoacyl-tRNA synthetase urzymes coordinated the catalytic functions of HVGH and KMSKS motifs. Nucleic Acids Res 2023; 51:8070-8084. [PMID: 37470821 PMCID: PMC10450160 DOI: 10.1093/nar/gkad590] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 06/23/2023] [Accepted: 07/11/2023] [Indexed: 07/21/2023] Open
Abstract
Leucyl-tRNA synthetase (LeuRS) is a Class I aminoacyl-tRNA synthetase (aaRS) that synthesizes leucyl-tRNAleu for codon-directed protein synthesis. Two signature sequences, HxGH and KMSKS help stabilize transition-states for amino acid activation and tRNA aminoacylation by all Class I aaRS. Separate alanine mutants of each signature, together with the double mutant, behave in opposite ways in Pyrococcus horikoshii LeuRS and the 129-residue urzyme ancestral model generated from it (LeuAC). Free energy coupling terms, Δ(ΔG‡), for both reactions are large and favourable for LeuRS, but unfavourable for LeuAC. Single turnover assays with 32Pα-ATP show correspondingly different internal products. These results implicate domain motion in catalysis by full-length LeuRS. The distributed thermodynamic cycle of mutational changes authenticates LeuAC urzyme catalysis far more convincingly than do single point mutations. Most importantly, the evolutionary gain of function induced by acquiring the anticodon-binding (ABD) and multiple insertion modules in the catalytic domain appears to be to coordinate the catalytic function of the HxGH and KMSKS signature sequences. The implication that backbone elements of secondary structures achieve a major portion of the overall transition-state stabilization by LeuAC is also consistent with coevolution of the genetic code and metabolic pathways necessary to produce histidine and lysine sidechains.
Collapse
Affiliation(s)
- Guo Qing Tang
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jessica J H Elder
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jordan Douglas
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
- Department of Physics, The University of Auckland, New Zealand
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
8
|
Abstract
The mechanism and the evolution of DNA replication and transcription, the key elements of the central dogma of biology, are fundamentally well explained by the physicochemical complementarity between strands of nucleic acids. However, the determinants that have shaped the third part of the dogma-the process of biological translation and the universal genetic code-remain unclear. We review and seek parallels between different proposals that view the evolution of translation through the prism of weak, noncovalent interactions between biological macromolecules. In particular, we focus on a recent proposal that there exists a hitherto unrecognized complementarity at the heart of biology, that between messenger RNA coding regions and the proteins that they encode, especially if the two are unstructured. Reflecting the idea that the genetic code evolved from intrinsic binding propensities between nucleotides and amino acids, this proposal promises to forge a link between the distant past and the present of biological systems.
Collapse
Affiliation(s)
- Bojan Zagrovic
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
| | - Marlene Adlhart
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
| | - Thomas H Kapral
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
| |
Collapse
|
9
|
Giegé R, Eriani G. The tRNA identity landscape for aminoacylation and beyond. Nucleic Acids Res 2023; 51:1528-1570. [PMID: 36744444 PMCID: PMC9976931 DOI: 10.1093/nar/gkad007] [Citation(s) in RCA: 73] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 12/21/2022] [Accepted: 01/03/2023] [Indexed: 02/07/2023] Open
Abstract
tRNAs are key partners in ribosome-dependent protein synthesis. This process is highly dependent on the fidelity of tRNA aminoacylation by aminoacyl-tRNA synthetases and relies primarily on sets of identities within tRNA molecules composed of determinants and antideterminants preventing mischarging by non-cognate synthetases. Such identity sets were discovered in the tRNAs of a few model organisms, and their properties were generalized as universal identity rules. Since then, the panel of identity elements governing the accuracy of tRNA aminoacylation has expanded considerably, but the increasing number of reported functional idiosyncrasies has led to some confusion. In parallel, the description of other processes involving tRNAs, often well beyond aminoacylation, has progressed considerably, greatly expanding their interactome and uncovering multiple novel identities on the same tRNA molecule. This review highlights key findings on the mechanistics and evolution of tRNA and tRNA-like identities. In addition, new methods and their results for searching sets of multiple identities on a single tRNA are discussed. Taken together, this knowledge shows that a comprehensive understanding of the functional role of individual and collective nucleotide identity sets in tRNA molecules is needed for medical, biotechnological and other applications.
Collapse
Affiliation(s)
- Richard Giegé
- Correspondence may also be addressed to Richard Giegé.
| | | |
Collapse
|
10
|
Opuu V, Simonson T. Enzyme redesign and genetic code expansion. Protein Eng Des Sel 2023; 36:gzad017. [PMID: 37879093 DOI: 10.1093/protein/gzad017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 09/10/2023] [Accepted: 09/19/2023] [Indexed: 10/27/2023] Open
Abstract
Enzyme design is an important application of computational protein design (CPD). It can benefit enormously from the additional chemistries provided by noncanonical amino acids (ncAAs). These can be incorporated into an 'expanded' genetic code, and introduced in vivo into target proteins. The key step for genetic code expansion is to engineer an aminoacyl-transfer RNA (tRNA) synthetase (aaRS) and an associated tRNA that handles the ncAA. Experimental directed evolution has been successfully used to engineer aaRSs and incorporate over 200 ncAAs into expanded codes. But directed evolution has severe limits, and is not yet applicable to noncanonical AA backbones. CPD can help address several of its limitations, and has begun to be applied to this problem. We review efforts to redesign aaRSs, studies that designed new proteins and functionalities with the help of ncAAs, and some of the method developments that have been used, such as adaptive landscape flattening Monte Carlo, which allows an enzyme to be redesigned with substrate or transition state binding as the design target.
Collapse
Affiliation(s)
- Vaitea Opuu
- Institut Chimie Biologie Innovation (CNRS UMR8231), Ecole Supérieure de Physique et Chimie de Paris (ESPCI), 75005 Paris, France
| | - Thomas Simonson
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, Institut Polytechnique de Paris, 91128 Palaiseau, France
| |
Collapse
|
11
|
A Leucyl-tRNA Synthetase Urzyme: Authenticity of tRNA Synthetase Catalytic Activities and Promiscuous Phosphorylation of Leucyl-5'AMP. Int J Mol Sci 2022; 23:ijms23084229. [PMID: 35457045 PMCID: PMC9026127 DOI: 10.3390/ijms23084229] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 03/30/2022] [Accepted: 03/31/2022] [Indexed: 02/05/2023] Open
Abstract
Aminoacyl-tRNA synthetase (aaRS)/tRNA cognate pairs translate the genetic code by synthesizing specific aminoacyl-tRNAs that are assembled on messenger RNA by the ribosome. Deconstruction of the two distinct aaRS superfamilies (Classes) has provided conceptual and experimental models for their early evolution. Urzymes, containing ~120–130 amino acids excerpted from regions where genetic coding sequence complementarities have been identified, are key experimental models motivated by the proposal of a single bidirectional ancestral gene. Previous reports that Class I and Class II urzymes accelerate both amino acid activation and tRNA aminoacylation have not been extended to other synthetases. We describe a third urzyme (LeuAC) prepared from the Class IA Pyrococcus horikoshii leucyl-tRNA synthetase. We adduce multiple lines of evidence for the authenticity of its catalysis of both canonical reactions, amino acid activation and tRNALeu aminoacylation. Mutation of the three active-site lysine residues to alanine causes significant, but modest reduction in both amino acid activation and aminoacylation. LeuAC also catalyzes production of ADP, a non-canonical enzymatic function that has been overlooked since it first was described for several full-length aaRS in the 1970s. Structural data suggest that the LeuAC active site accommodates two ATP conformations that are prominent in water but rarely seen bound to proteins, accounting for successive, in situ phosphorylation of the bound leucyl-5′AMP phosphate, accounting for ADP production. This unusual ATP consumption regenerates the transition state for amino acid activation and suggests, in turn, that in the absence of the editing and anticodon-binding domains, LeuAC releases leu-5′AMP unusually slowly, relative to the two phosphorylation reactions.
Collapse
|
12
|
Li DJ. Distributional features of triplet codons in genomes underlie the diversification of life. Biosystems 2022; 217:104681. [DOI: 10.1016/j.biosystems.2022.104681] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Revised: 04/04/2022] [Accepted: 04/07/2022] [Indexed: 11/02/2022]
|
13
|
Carter CW, Popinga A, Bouckaert R, Wills PR. Multidimensional Phylogenetic Metrics Identify Class I Aminoacyl-tRNA Synthetase Evolutionary Mosaicity and Inter-Modular Coupling. Int J Mol Sci 2022; 23:ijms23031520. [PMID: 35163448 PMCID: PMC8835825 DOI: 10.3390/ijms23031520] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Revised: 01/17/2022] [Accepted: 01/17/2022] [Indexed: 02/01/2023] Open
Abstract
The role of aminoacyl-tRNA synthetases (aaRS) in the emergence and evolution of genetic coding poses challenging questions concerning their provenance. We seek evidence about their ancestry from curated structure-based multiple sequence alignments of a structurally invariant “scaffold” shared by all 10 canonical Class I aaRS. Three uncorrelated phylogenetic metrics—mutation frequency, its uniformity, and row-by-row cladistic congruence—imply that the Class I scaffold is a mosaic assembled from successive genetic sources. Metrics for different modules vary in accordance with their presumed functionality. Sequences derived from the ATP– and amino acid– binding sites exhibit specific two-way coupling to those derived from Connecting Peptide 1, a third module whose metrics suggest later acquisition. The data help validate: (i) experimental fragmentations of the canonical Class I structure into three partitions that retain catalytic activities in proportion to their length; and (ii) evidence that the ancestral Class I aaRS gene also encoded a Class II ancestor in frame on the opposite strand. A 46-residue Class I “protozyme” roots the Class I tree prior to the adaptive radiation of the Rossmann dinucleotide binding fold that refined substrate discrimination. Such rooting implies near simultaneous emergence of genetic coding and the origin of the proteome, resolving a conundrum posed by previous inferences that Class I aaRS evolved after the genetic code had been implemented in an RNA world. Further, pinpointing discontinuous enhancements of aaRS fidelity establishes a timeline for the growth of coding from a binary amino acid alphabet.
Collapse
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
- Correspondence: ; Tel.: +1-919-966-3263
| | - Alex Popinga
- Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand; (A.P.); (R.B.)
| | - Remco Bouckaert
- Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand; (A.P.); (R.B.)
| | - Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand;
| |
Collapse
|
14
|
Furukawa R, Yokobori SI, Sato R, Kumagawa T, Nakagawa M, Katoh K, Yamagishi A. Amino Acid Specificity of Ancestral Aminoacyl-tRNA Synthetase Prior to the Last Universal Common Ancestor Commonote commonote. J Mol Evol 2022; 90:73-94. [PMID: 35084522 PMCID: PMC8821087 DOI: 10.1007/s00239-021-10043-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 12/16/2021] [Indexed: 11/24/2022]
Abstract
Extant organisms commonly use 20 amino acids in protein synthesis. In the translation system, aminoacyl-tRNA synthetase (ARS) selectively binds an amino acid and transfers it to the cognate tRNA. It is postulated that the amino acid repertoire of ARS expanded during the development of the translation system. In this study we generated composite phylogenetic trees for seven ARSs (SerRS, ProRS, ThrRS, GlyRS-1, HisRS, AspRS, and LysRS) which are thought to have diverged by gene duplication followed by mutation, before the evolution of the last universal common ancestor. The composite phylogenetic tree shows that the AspRS/LysRS branch diverged from the other five ARSs at the deepest node, with the GlyRS/HisRS branch and the other three ARSs (ThrRS, ProRS and SerRS) diverging at the second deepest node. ThrRS diverged next, and finally ProRS and SerRS diverged from each other. Based on the phylogenetic tree, sequences of the ancestral ARSs prior to the evolution of the last universal common ancestor were predicted. The amino acid specificity of each ancestral ARS was then postulated by comparison with amino acid recognition sites of ARSs of extant organisms. Our predictions demonstrate that ancestral ARSs had substantial specificity and that the number of amino acid types amino-acylated by proteinaceous ARSs was limited before the appearance of a fuller range of proteinaceous ARS species. From an assumption that 10 amino acid species are required for folding and function, proteinaceous ARS possibly evolved in a translation system composed of preexisting ribozyme ARSs, before the evolution of the last universal common ancestor.
Collapse
Affiliation(s)
- Ryutaro Furukawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan.,Faculty of Human Science, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Shin-Ichi Yokobori
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Riku Sato
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Taimu Kumagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Mizuho Nakagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Kazutaka Katoh
- Department of Genome Informatics, Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan.
| |
Collapse
|
15
|
Formation of the Codon Degeneracy during Interdependent Development between Metabolism and Replication. Genes (Basel) 2021; 12:genes12122023. [PMID: 34946975 PMCID: PMC8701183 DOI: 10.3390/genes12122023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 11/30/2021] [Accepted: 12/03/2021] [Indexed: 11/16/2022] Open
Abstract
Nirenberg's genetic code chart shows a profound correspondence between codons and amino acids. The aim of this article is to try to explain the primordial formation of the codon degeneracy. It remains a puzzle how informative molecules arose from the supposed prebiotic random sequences. If introducing an initial driving force based on the relative stabilities of triplex base pairs, the prebiotic sequence evolution became innately nonrandom. Thus, the primordial assignment of the 64 codons to the 20 amino acids has been explained in detail according to base substitutions during the coevolution of tRNAs with aaRSs; meanwhile, the classification of aaRSs has also been explained.
Collapse
|
16
|
Amino acid activation analysis of primitive aminoacyl-tRNA synthetases encoded by both strands of a single gene using the malachite green assay. Biosystems 2021; 208:104481. [PMID: 34245865 DOI: 10.1016/j.biosystems.2021.104481] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 07/07/2021] [Accepted: 07/07/2021] [Indexed: 12/19/2022]
Abstract
The Rodin-Ohno hypothesis postulates that two classes of aminoacyl-tRNA synthetases were encoded complementary to double-stranded DNA. Particularly, Geobacillus stearothermophilus tryptophanyl-tRNA synthetase (TrpRS, belonging to class I) and Escherichia coli histidyl-tRNA synthetase (HisRS, belonging to class II) show high complementarity of the middle base of the codons in the mRNA sequence encoding each ATP binding site. Here, for the reported 46-residue peptides designed from the three-dimensional structures of TrpRS and HisRS, amino acid activation analysis was performed using the malachite green assay, which detects the pyrophosphate departing from ATP in the forward reaction of the first step of tRNA aminoacylation. A maltose-binding protein fusion with the 46 residues of TrpRS (TrpRS46mer) exhibited high activation capacity for several amino acids in the presence of ATP and amino acids, but the activity of an alanine substitution mutant of the first histidine in the HIGH motif (TrpRS46merH15A) was largely reduced. In contrast, pyrophosphate release by HisRS46mer in the histidine activation step was lower than that in the case of TrpRS46mer. Both HisRS46mer and the alanine mutant at the 113th arginine (HisRS46merR113A) showed slightly higher levels of pyrophosphate release than the maltose-binding protein alone. These results do not rule out the Rodin-Ohno hypothesis, but may suggest the necessity of establishing unique evolutionary models from different perspectives.
Collapse
|
17
|
Abstract
Codon-dependent translation underlies genetics and phylogenetic inferences, but its origins pose two challenges. Prevailing narratives cannot account for the fact that aminoacyl-tRNA synthetases (aaRSs), which translate the genetic code, must collectively enforce the rules used to assemble themselves. Nor can they explain how specific assignments arose from rudimentary differentiation between ancestral aaRSs and corresponding transfer RNAs (tRNAs). Experimental deconstruction of the two aaRS superfamilies created new experimental tools with which to analyze the emergence of the code. Amino acid and tRNA substrate recognition are linked to phase transfer free energies of amino acids and arise largely from aaRS class-specific differences in secondary structure. Sensitivity to protein folding rules endowed ancestral aaRS-tRNA pairs with the feedback necessary to rapidly compare alternative genetic codes and coding sequences. These and other experimental data suggest that the aaRS bidirectional genetic ancestry stabilized the differentiation and interdependence required to initiate and elaborate the genetic coding table.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA;
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland 1142, New Zealand
| |
Collapse
|
18
|
Carter CW, Wills PR. Reciprocally-Coupled Gating: Strange Loops in Bioenergetics, Genetics, and Catalysis. Biomolecules 2021; 11:265. [PMID: 33670192 PMCID: PMC7916928 DOI: 10.3390/biom11020265] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 02/04/2021] [Accepted: 02/06/2021] [Indexed: 12/12/2022] Open
Abstract
Bioenergetics, genetic coding, and catalysis are all difficult to imagine emerging without pre-existing historical context. That context is often posed as a "Chicken and Egg" problem; its resolution is concisely described by de Grasse Tyson: "The egg was laid by a bird that was not a chicken". The concision and generality of that answer furnish no details-only an appropriate framework from which to examine detailed paradigms that might illuminate paradoxes underlying these three life-defining biomolecular processes. We examine experimental aspects here of five examples that all conform to the same paradigm. In each example, a paradox is resolved by coupling "if, and only if" conditions for reciprocal transitions between levels, such that the consequent of the first test is the antecedent for the second. Each condition thus restricts fluxes through, or "gates" the other. Reciprocally-coupled gating, in which two gated processes constrain one another, is self-referential, hence maps onto the formal structure of "strange loops". That mapping uncovers two different kinds of forces that may help unite the axioms underlying three phenomena that distinguish biology from chemistry. As a physical analog for Gödel's logic, biomolecular strange-loops provide a natural metaphor around which to organize a large body of experimental data, linking biology to information, free energy, and the second law of thermodynamics.
Collapse
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| | - Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand;
| |
Collapse
|
19
|
Abstract
What were the physico-chemical forces that drove the origins of life? We discuss four major prebiotic 'discoveries': persistent sampling of chemical reaction space; sequence-encodable foldable catalysts; assembly of functional pathways; and encapsulation and heritability. We describe how a 'proteins-first' world gives plausible mechanisms. We note the importance of hydrophobic and polar compositions of matter in these advances.
Collapse
Affiliation(s)
- K. A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
- Department of Chemistry, Stony Brook University, Stony Brook, NY, USA
- Department Physics and Astronomy, Stony Brook University, Stony Brook, NY, USA
| | - L. Agozzino
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| |
Collapse
|
20
|
Carter CW. Simultaneous codon usage, the origin of the proteome, and the emergence of de-novo proteins. Curr Opin Struct Biol 2021; 68:142-148. [PMID: 33529785 DOI: 10.1016/j.sbi.2021.01.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 01/05/2021] [Indexed: 12/21/2022]
Abstract
Genetic coding generally uses only one of a gene's two strands; its complement serving as template for replication. Aminoacyl-tRNA synthetases, aaRS, apparently first emerged as pairs on bidirectional genes, in which anticodons in the template strand served as codons for an entirely different protein. Interpreting both strands in frame constrained such genes sufficiently that it was rapidly superseded, leaving only traces in the elevated pairing between codon middle bases in antiparallel alignments. Codon assignments actually promote using information from both strands in multiple reading frames. Related phenomena, known as overprinting, are widely associated with viruses. In-frame bidirectional coding and overprinting nevertheless imply different structural and functional relationships, and different roles in generating folded proteins throughout the evolution of the proteome.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry, Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States.
| |
Collapse
|
21
|
Fontecilla-Camps JC. Primordial bioenergy sources: The two facets of adenosine triphosphate. J Inorg Biochem 2020; 216:111347. [PMID: 33450675 DOI: 10.1016/j.jinorgbio.2020.111347] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 12/14/2020] [Accepted: 12/21/2020] [Indexed: 01/10/2023]
Abstract
Life requires energy to exist, to reproduce and to survive. Two major hypotheses have been put forward concerning the source of this energy at the very early stages of life evolution: (i) abiotic organics either brought to Earth by comets and/or meteorites, or produced at its atmosphere, and (ii) mineral surface-dependent bioinorganic catalytic reactions. Considering the latter possibility, I propose that, besides being a precursor of nucleic acids, adenosine triphosphate (ATP), which probably was used very early to improve the fidelity of nucleic acid polymerization, played an essential role in the transition between mineral-bound protocells and their free counterparts. Indeed, phosphorylation by ATP renders carboxylate groups electrophilic enough to react with nucleophiles such as amines, an effect that, thanks to their Lewis acid character, also have dehydrated metal ions on mineral surfaces. Early ATP synthesis for metabolic processes most likely depended on substrate level phosphorylation. However, the exaptation of a hexameric helicase-like ATPase and a transmembrane H+ pump (which evolved to counteract the acidity caused by fermentation reactions within the protocell) generated a much more efficient membrane-bound ATP synthase that uses chemiosmosis to make ATP.
Collapse
|
22
|
Nesterov-Mueller A, Popov R, Seligmann H. Combinatorial Fusion Rules to Describe Codon Assignment in the Standard Genetic Code. Life (Basel) 2020; 11:life11010004. [PMID: 33374866 PMCID: PMC7824455 DOI: 10.3390/life11010004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 12/15/2020] [Accepted: 12/21/2020] [Indexed: 11/16/2022] Open
Abstract
We propose combinatorial fusion rules that describe the codon assignment in the standard genetic code simply and uniformly for all canonical amino acids. These rules become obvious if the origin of the standard genetic code is considered as a result of a fusion of four protocodes: Two dominant AU and GC protocodes and two recessive AU and GC protocodes. The biochemical meaning of the fusion rules consists of retaining the complementarity between cognate codons of the small hydrophobic amino acids and large charged or polar amino acids within the protocodes. The proto tRNAs were assembled in form of two kissing hairpins with 9-base and 10-base loops in the case of dominant protocodes and two 9-base loops in the case of recessive protocodes. The fusion rules reveal the connection between the stop codons, the non-canonical amino acids, pyrrolysine and selenocysteine, and deviations in the translation of mitochondria. Using fusion rules, we predicted the existence of additional amino acids that are essential for the development of the standard genetic code. The validity of the proposed partition of the genetic code into dominant and recessive protocodes is considered referring to state-of-the-art hypotheses. The formation of two aminoacyl-tRNA synthetase classes is compatible with four-protocode partition.
Collapse
Affiliation(s)
- Alexander Nesterov-Mueller
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany; (R.P.); (H.S.)
- Correspondence:
| | - Roman Popov
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany; (R.P.); (H.S.)
| | - Hervé Seligmann
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany; (R.P.); (H.S.)
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
- Laboratory AGEIS EA 7407, Team Tools for e-GnosisMedical & LabcomCNRS/UGA/OrangeLabs Telecoms4Health, Faculty of Medicine, Université Grenoble Alpes, F-38700 La Tronche, France
| |
Collapse
|
23
|
Wills PR, Carter CW. Impedance Matching and the Choice Between Alternative Pathways for the Origin of Genetic Coding. Int J Mol Sci 2020; 21:E7392. [PMID: 33036401 PMCID: PMC7582391 DOI: 10.3390/ijms21197392] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 09/28/2020] [Accepted: 09/30/2020] [Indexed: 01/07/2023] Open
Abstract
We recently observed that errors in gene replication and translation could be seen qualitatively to behave analogously to the impedances in acoustical and electronic energy transducing systems. We develop here quantitative relationships necessary to confirm that analogy and to place it into the context of the minimization of dissipative losses of both chemical free energy and information. The formal developments include expressions for the information transferred from a template to a new polymer, Iσ; an impedance parameter, Z; and an effective alphabet size, neff; all of which have non-linear dependences on the fidelity parameter, q, and the alphabet size, n. Surfaces of these functions over the {n,q} plane reveal key new insights into the origin of coding. Our conclusion is that the emergence and evolutionary refinement of information transfer in biology follow principles previously identified to govern physical energy flows, strengthening analogies (i) between chemical self-organization and biological natural selection, and (ii) between the course of evolutionary trajectories and the most probable pathways for time-dependent transitions in physics. Matching the informational impedance of translation to the four-letter alphabet of genes uncovers a pivotal role for the redundancy of triplet codons in preserving as much intrinsic genetic information as possible, especially in early stages when the coding alphabet size was small.
Collapse
Affiliation(s)
- Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
| | - Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
24
|
Abstract
The aminoacyl-tRNA synthetases are an essential and universally distributed family of enzymes that plays a critical role in protein synthesis, pairing tRNAs with their cognate amino acids for decoding mRNAs according to the genetic code. Synthetases help to ensure accurate translation of the genetic code by using both highly accurate cognate substrate recognition and stringent proofreading of noncognate products. While alterations in the quality control mechanisms of synthetases are generally detrimental to cellular viability, recent studies suggest that in some instances such changes facilitate adaption to stress conditions. Beyond their central role in translation, synthetases are also emerging as key players in an increasing number of other cellular processes, with far-reaching consequences in health and disease. The biochemical versatility of the synthetases has also proven pivotal in efforts to expand the genetic code, further emphasizing the wide-ranging roles of the aminoacyl-tRNA synthetase family in synthetic and natural biology.
Collapse
Affiliation(s)
- Miguel Angel Rubio Gomez
- Center for RNA Biology, The Ohio State University, Columbus, Ohio 43210, USA Department of Microbiology, The Ohio State University, Columbus, Ohio 43210, USA
| | - Michael Ibba
- Center for RNA Biology, The Ohio State University, Columbus, Ohio 43210, USA Department of Microbiology, The Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
25
|
Kaiser F, Krautwurst S, Salentin S, Haupt VJ, Leberecht C, Bittrich S, Labudde D, Schroeder M. The structural basis of the genetic code: amino acid recognition by aminoacyl-tRNA synthetases. Sci Rep 2020; 10:12647. [PMID: 32724042 PMCID: PMC7387524 DOI: 10.1038/s41598-020-69100-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 07/06/2020] [Indexed: 12/29/2022] Open
Abstract
Storage and directed transfer of information is the key requirement for the development of life. Yet any information stored on our genes is useless without its correct interpretation. The genetic code defines the rule set to decode this information. Aminoacyl-tRNA synthetases are at the heart of this process. We extensively characterize how these enzymes distinguish all natural amino acids based on the computational analysis of crystallographic structure data. The results of this meta-analysis show that the correct read-out of genetic information is a delicate interplay between the composition of the binding site, non-covalent interactions, error correction mechanisms, and steric effects.
Collapse
Affiliation(s)
- Florian Kaiser
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany. .,PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany.
| | - Sarah Krautwurst
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
| | | | - V Joachim Haupt
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany.,PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany
| | | | | | - Dirk Labudde
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
| | | |
Collapse
|
26
|
Wichmann S, Ardern Z. Optimality in the standard genetic code is robust with respect to comparison code sets. Biosystems 2019; 185:104023. [DOI: 10.1016/j.biosystems.2019.104023] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 08/22/2019] [Accepted: 08/24/2019] [Indexed: 01/22/2023]
|
27
|
Carter CW, Wills PR. Experimental solutions to problems defining the origin of codon-directed protein synthesis. Biosystems 2019; 183:103979. [PMID: 31176803 PMCID: PMC6693952 DOI: 10.1016/j.biosystems.2019.103979] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 05/27/2019] [Accepted: 05/29/2019] [Indexed: 12/13/2022]
Abstract
How genetic coding differentiated biology from chemistry is a long-standing challenge in Biology, for which there have been few experimental approaches, despite a wide-ranging speculative literature. We summarize five coordinated areas-experimental characterization of functional approximations to the minimal peptides (protozymes and urzymes) necessary to activate amino acids and acylate tRNA; showing that specificities of these experimental models match those expected from the synthetase Class division; population of disjoint regions of amino acid sequence space via bidirectional coding ancestry of the two synthetase Classes; showing that the phase transfer equilibria of amino acid side chains that form a two-dimensional basis set for protein folding are embedded in patterns of bases in the tRNA acceptor stem and anticodon; and identification of molecular signatures of ancestral synthetases and tRNAs necessary to define the earliest cognate synthetase:tRNA pairs-that now compose an extensive experimentally testable paradigm for progress toward understanding the coordinated emergence of the codon table and viable mRNA coding sequences. We briefly discuss recent progress toward identifying the remaining outstanding questions-the nature of the earliest amino acid alphabets and the origin of binding discrimination via distinct amino acid sequence-independent protein secondary structures-and how these, too, might be addressed experimentally.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States
| | - Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
| |
Collapse
|
28
|
Carter CW, Wills PR. Hierarchical groove discrimination by Class I and II aminoacyl-tRNA synthetases reveals a palimpsest of the operational RNA code in the tRNA acceptor-stem bases. Nucleic Acids Res 2019; 46:9667-9683. [PMID: 30016476 PMCID: PMC6182185 DOI: 10.1093/nar/gky600] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 07/12/2018] [Indexed: 01/01/2023] Open
Abstract
Class I and II aaRS recognition of opposite grooves was likely among the earliest determinants fixed in the tRNA acceptor stem bases. A new regression model identifies those determinants in bacterial tRNAs. Integral coefficients relate digital dependent to independent variables with perfect agreement between observed and calculated grooves for all twenty isoaccepting tRNAs. Recognition is mediated by the Discriminator base 73, the first base pair, and base 2 of the acceptor stem. Subsets of these coefficients also identically compute grooves recognized by smaller numbers of aaRS. Thus, the model is hierarchical, suggesting that new rules were added to pre-existing ones as new amino acids joined the coding alphabet. A thermodynamic rationale for the simplest model implies that Class-dependent aaRS secondary structures exploited differential tendencies of the acceptor stem to form the hairpin observed in Class I aaRS•tRNA complexes, enabling the earliest groove discrimination. Curiously, groove recognition also depends explicitly on the identity of base 2 in a manner consistent with the middle bases of the codon table, confirming a hidden ancestry of codon-anticodon pairing in the acceptor stem. That, and the lack of correlation with anticodon bases support prior productive coding interaction of tRNA minihelices with proto-mRNA.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| | - Peter R Wills
- Department of Physics, Centre for Computational Evolution, and Te Ao Marama Centre for Fundamental Enquiry, University of Auckland, PB 92109, Auckland 1142, New Zealand
| |
Collapse
|
29
|
Carter CW, Wills PR. Class I and II aminoacyl-tRNA synthetase tRNA groove discrimination created the first synthetase-tRNA cognate pairs and was therefore essential to the origin of genetic coding. IUBMB Life 2019; 71:1088-1098. [PMID: 31190358 DOI: 10.1002/iub.2094] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Revised: 04/14/2019] [Accepted: 04/15/2019] [Indexed: 12/20/2022]
Abstract
The genetic code likely arose when a bidirectional gene replicating as a quasi-species began to produce ancestral aminoacyl-tRNA synthetases (aaRS) capable of distinguishing between two distinct sets of amino acids. The synthetase class division therefore necessarily implies a mechanism by which the two ancestral synthetases could also discriminate between two different kinds of tRNA substrates. We used regression methods to uncover the possible patterns of base sequences capable of such discrimination and find that they appear to be related to thermodynamic differences in the relative stabilities of a hairpin necessary for recognition of tRNA substrates by Class I aaRS. The thermodynamic differences appear to be exploited by secondary structural differences between models for the ancestral aaRS called synthetase Urzymes and reinforced by packing of aromatic amino acid side chains against the nonpolar face of the ribose of A76 if and only if the tRNA CCA sequence forms a hairpin. The patterns of bases 1, 2, and 73 and stabilization of the hairpin by structural complementarity with Class I, but not Class II, aaRS Urzymes appear to be necessary and sufficient to have enabled the generation of the first two aaRS-tRNA cognate pairs, and the launch of a rudimentary binary genetic coding related recognizably to contemporary cognate pairs. As a consequence, it seems likely that nonrandom aminoacylation of tRNAs preceded the advent of the tRNA anticodon stem-loop. Consistent with this suggestion, coding rules in the acceptor-stem bases also reveal a palimpsest of the codon-anticodon interaction, as previously proposed. © 2019 IUBMB Life, 2019 © 2019 IUBMB Life, 71(8):1088-1098, 2019.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, Auckland, New Zealand
| |
Collapse
|
30
|
Demongeot J, Seligmann H. Bias for 3'-Dominant Codon Directional Asymmetry in Theoretical Minimal RNA Rings. J Comput Biol 2019; 26:1003-1012. [PMID: 31120344 DOI: 10.1089/cmb.2018.0256] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Aminoacyl tRNA synthetases ligate tRNAs specifically with their cognate amino acid. These synthetases are among life's earliest proteins, class II tRNA synthetases (cognates A, D, F, G, H, K, N, P, S, and T) presumably preceding class I tRNA synthetases (cognates C, E, I, L, M, Q, R, V, W, and Y). Classification of codons into palindromic (structure XYX), 5'-dominant (YXX), and 3'-dominant (XXY) (Codon Directional Asymmetry [CDA]) shows that class II tRNA synthetases aminoacylate amino acids associated with XXY. Our working hypothesis expects bias for XXY codons in primordial RNAs, such as theoretical minimal RNA rings, designed in silico to mimic life's earliest RNAs. Twenty-five RNA rings have been computed, which code over a minimal length (22 nucleotides) for a start codon, stop codon, and one and only one codon for each of the 20 amino acids, and form stem-loop hairpins preventing degradation; these 25 minimal RNAs are the only ones matching these constraints and they seem homologous to consensus tRNA sequences. This similarity defined candidate RNA ring anticodons and corresponding cognate amino acids. Here, analyses of RNA ring codon contents confirm bias for XXY codons in 13 among 14 RNA rings with unequal XXY and YXX codon numbers. This bias increases with the genetic code integration order of the RNA ring's cognate amino acid across and within tRNA synthetase classes, suggesting that evolutionary processes, and not physicochemical constraints, produced the association between CDA and tRNA synthetase classes. The self-referential hypothesis for genetic code origin, a very complete genetic code evolutionary hypothesis integrating many translational machinery components, predicts best among genetic code evolutionary hypotheses CDA biases in RNA rings. The RNA rings' simple design inadvertently reproduces CDAs predicted by the genetic code's structure, confirming theoretical minimal RNA rings as good proxies for life's earliest RNAs.
Collapse
Affiliation(s)
- Jacques Demongeot
- Laboratory AGEIS EA 7407, Faculty of Medicine, Team Tools for e-Gnosis Medical, Université Grenoble Alpes, La Tronche, France
| | - Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
31
|
Nyamai DW, Tastan Bishop Ö. Aminoacyl tRNA synthetases as malarial drug targets: a comparative bioinformatics study. Malar J 2019; 18:34. [PMID: 30728021 PMCID: PMC6366043 DOI: 10.1186/s12936-019-2665-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Accepted: 01/27/2019] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Treatment of parasitic diseases has been challenging due to evolution of drug resistant parasites, and thus there is need to identify new class of drugs and drug targets. Protein translation is important for survival of malarial parasite, Plasmodium, and the pathway is present in all of its life cycle stages. Aminoacyl tRNA synthetases are primary enzymes in protein translation as they catalyse amino acid addition to the cognate tRNA. This study sought to understand differences between Plasmodium and human aminoacyl tRNA synthetases through bioinformatics analysis. METHODS Plasmodium berghei, Plasmodium falciparum, Plasmodium fragile, Plasmodium knowlesi, Plasmodium malariae, Plasmodium ovale, Plasmodium vivax, Plasmodium yoelii and human aminoacyl tRNA synthetase sequences were retrieved from UniProt database and grouped into 20 families based on amino acid specificity. These families were further divided into two classes. Both families and classes were analysed. Motif discovery was carried out using the MEME software, sequence identity calculation was done using an in-house Python script, multiple sequence alignments were performed using PROMALS3D and TCOFFEE tools, and phylogenetic tree calculations were performed using MEGA vs 7.0 tool. Possible alternative binding sites were predicted using FTMap webserver and SiteMap tool. RESULTS Motif discovery revealed Plasmodium-specific motifs while phylogenetic tree calculations showed that Plasmodium proteins have different evolutionary history to the human homologues. Human aaRSs sequences showed low sequence identity (below 40%) compared to Plasmodium sequences. Prediction of alternative binding sites revealed potential druggable sites in PfArgRS, PfMetRS and PfProRS at regions that are weakly conserved when compared to the human homologues. Multiple sequence analysis, motif discovery, pairwise sequence identity calculations and phylogenetic tree analysis showed significant differences between parasite and human aaRSs proteins despite functional and structural conservation. These differences may provide a basis for further exploration of Plasmodium aminoacyl tRNA synthetases as potential drug targets. CONCLUSION This study showed that, despite, functional and structural conservation, Plasmodium aaRSs have key differences from the human homologues. These differences in Plasmodium aaRSs can be targeted to develop anti-malarial drugs with less toxicity to the host.
Collapse
Affiliation(s)
- Dorothy Wavinya Nyamai
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, 6140, South Africa
| | - Özlem Tastan Bishop
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, 6140, South Africa.
| |
Collapse
|
32
|
Opron K, Burton ZF. Ribosome Structure, Function, and Early Evolution. Int J Mol Sci 2018; 20:ijms20010040. [PMID: 30583477 PMCID: PMC6337491 DOI: 10.3390/ijms20010040] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Revised: 12/03/2018] [Accepted: 12/16/2018] [Indexed: 11/16/2022] Open
Abstract
Ribosomes are among the largest and most dynamic molecular motors. The structure and dynamics of translation initiation and elongation are reviewed. Three ribosome motions have been identified for initiation and translocation. A swivel motion between the head/beak and the body of the 30S subunit was observed. A tilting dynamic of the head/beak versus the body of the 30S subunit was detected using simulations. A reversible ratcheting motion was seen between the 30S and the 50S subunits that slide relative to one another. The 30S⁻50S intersubunit contacts regulate translocation. IF2, EF-Tu, and EF-G are homologous G-protein GTPases that cycle on and off the same site on the ribosome. The ribosome, aminoacyl-tRNA synthetase (aaRS) enzymes, transfer ribonucleic acid (tRNA), and messenger ribonucleic acid (mRNA) form the core of information processing in cells and are coevolved. Surprisingly, class I and class II aaRS enzymes, with distinct and incompatible folds, are homologs. Divergence of class I and class II aaRS enzymes and coevolution of the genetic code are described by analysis of ancient archaeal species.
Collapse
Affiliation(s)
- Kristopher Opron
- Bioinformatics Core, University of Michigan, Ann Arbor, MI 48109-0674, USA.
| | - Zachary F Burton
- Department of Biochemistry and Molecular Biology, 603 Wilson Rd., Michigan State University, MI 48824-1319, USA.
| |
Collapse
|
33
|
Abstract
Abundant and essential motifs, such as phosphate-binding loops (P-loops), are presumed to be the seeds of modern enzymes. The Walker-A P-loop is absolutely essential in modern NTPase enzymes, in mediating binding, and transfer of the terminal phosphate groups of NTPs. However, NTPase function depends on many additional active-site residues placed throughout the protein's scaffold. Can motifs such as P-loops confer function in a simpler context? We applied a phylogenetic analysis that yielded a sequence logo of the putative ancestral Walker-A P-loop element: a β-strand connected to an α-helix via the P-loop. Computational design incorporated this element into de novo designed β-α repeat proteins with relatively few sequence modifications. We obtained soluble, stable proteins that unlike modern P-loop NTPases bound ATP in a magnesium-independent manner. Foremost, these simple P-loop proteins avidly bound polynucleotides, RNA, and single-strand DNA, and mutations in the P-loop's key residues abolished binding. Binding appears to be facilitated by the structural plasticity of these proteins, including quaternary structure polymorphism that promotes a combined action of multiple P-loops. Accordingly, oligomerization enabled a 55-aa protein carrying a single P-loop to confer avid polynucleotide binding. Overall, our results show that the P-loop Walker-A motif can be implemented in small and simple β-α repeat proteins, primarily as a polynucleotide binding motif.
Collapse
|
34
|
Bittrich S, Schroeder M, Labudde D. Characterizing the relation of functional and Early Folding Residues in protein structures using the example of aminoacyl-tRNA synthetases. PLoS One 2018; 13:e0206369. [PMID: 30376559 PMCID: PMC6207335 DOI: 10.1371/journal.pone.0206369] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 10/11/2018] [Indexed: 01/10/2023] Open
Abstract
Proteins are chains of amino acids which adopt a three-dimensional structure and are then able to catalyze chemical reactions or propagate signals in organisms. Without external influence, many proteins fold into their native structure, and a small number of Early Folding Residues (EFR) have previously been shown to initiate the formation of secondary structure elements and guide their respective assembly. Using the two diverse superfamilies of aminoacyl-tRNA synthetases (aaRS), it is shown that the position of EFR is preserved over the course of evolution even when the corresponding sequence conservation is small. Folding initiation sites are positioned in the center of secondary structure elements, independent of aaRS class. In class I, the predicted position of EFR resembles an ancient structural packing motif present in many seemingly unrelated proteins. Furthermore, it is shown that EFR and functionally relevant residues in aaRS are almost entirely disjoint sets of residues. The Start2Fold database is used to investigate whether this separation of EFR and functional residues can be observed for other proteins. EFR are found to constitute crucial connectors of protein regions which are distant at sequence level. Especially, these residues exhibit a high number of non-covalent residue-residue contacts such as hydrogen bonds and hydrophobic interactions. This tendency also manifests as energetically stable local regions, as substantiated by a knowledge-based potential. Despite profound differences regarding how EFR and functional residues are embedded in protein structures, a strict separation of structurally and functionally relevant residues cannot be observed for a more general collection of proteins.
Collapse
Affiliation(s)
- Sebastian Bittrich
- Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, Mittweida, Saxony, Germany
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Saxony, Germany
| | - Michael Schroeder
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Saxony, Germany
| | - Dirk Labudde
- Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, Mittweida, Saxony, Germany
| |
Collapse
|
35
|
Kaiser F, Bittrich S, Salentin S, Leberecht C, Haupt VJ, Krautwurst S, Schroeder M, Labudde D. Backbone Brackets and Arginine Tweezers delineate Class I and Class II aminoacyl tRNA synthetases. PLoS Comput Biol 2018; 14:e1006101. [PMID: 29659563 PMCID: PMC5919687 DOI: 10.1371/journal.pcbi.1006101] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Revised: 04/26/2018] [Accepted: 03/20/2018] [Indexed: 12/22/2022] Open
Abstract
The origin of the machinery that realizes protein biosynthesis in all organisms is still unclear. One key component of this machinery are aminoacyl tRNA synthetases (aaRS), which ligate tRNAs to amino acids while consuming ATP. Sequence analyses revealed that these enzymes can be divided into two complementary classes. Both classes differ significantly on a sequence and structural level, feature different reaction mechanisms, and occur in diverse oligomerization states. The one unifying aspect of both classes is their function of binding ATP. We identified Backbone Brackets and Arginine Tweezers as most compact ATP binding motifs characteristic for each Class. Geometric analysis shows a structural rearrangement of the Backbone Brackets upon ATP binding, indicating a general mechanism of all Class I structures. Regarding the origin of aaRS, the Rodin-Ohno hypothesis states that the peculiar nature of the two aaRS classes is the result of their primordial forms, called Protozymes, being encoded on opposite strands of the same gene. Backbone Brackets and Arginine Tweezers were traced back to the proposed Protozymes and their more efficient successors, the Urzymes. Both structural motifs can be observed as pairs of residues in contemporary structures and it seems that the time of their addition, indicated by their placement in the ancient aaRS, coincides with the evolutionary trace of Proto- and Urzymes. Aminoacyl tRNA synthetases (aaRS) are primordial enzymes essential for interpretation and transfer of genetic information. Understanding the origin of the peculiarities observed with aaRS can explain what constituted the earliest life forms and how the genetic code was established. The increasing amount of experimentally determined three-dimensional structures of aaRS opens up new avenues for high-throughput analyses of molecular mechanisms. In this study, we present an exhaustive structural analysis of ATP binding motifs. We unveil an oppositional implementation of enzyme substrate binding in each aaRS Class. While Class I binds via interactions mediated by backbone hydrogen bonds, Class II uses a pair of arginine residues to establish salt bridges to its ATP ligand. We show how nature realized the binding of the same ligand species with completely different mechanisms. In addition, we demonstrate that sequence or even structure analysis for conserved residues may miss important functional aspects which can only be revealed by ligand interaction studies. Additionally, the placement of those key residues in the structure supports a popular hypothesis, which states that prototypic aaRS were once coded on complementary strands of the same gene.
Collapse
Affiliation(s)
- Florian Kaiser
- University of Applied Sciences Mittweida, Mittweida, Germany
- Biotechnology Center (BIOTEC), TU Dresden, Dresden, Germany
- * E-mail:
| | - Sebastian Bittrich
- University of Applied Sciences Mittweida, Mittweida, Germany
- Biotechnology Center (BIOTEC), TU Dresden, Dresden, Germany
| | | | - Christoph Leberecht
- University of Applied Sciences Mittweida, Mittweida, Germany
- Biotechnology Center (BIOTEC), TU Dresden, Dresden, Germany
| | | | | | | | - Dirk Labudde
- University of Applied Sciences Mittweida, Mittweida, Germany
| |
Collapse
|
36
|
Branciamore S, Gogoshin G, Di Giulio M, Rodin AS. Intrinsic Properties of tRNA Molecules as Deciphered via Bayesian Network and Distribution Divergence Analysis. Life (Basel) 2018; 8:life8010005. [PMID: 29419741 PMCID: PMC5871937 DOI: 10.3390/life8010005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2017] [Revised: 01/22/2018] [Accepted: 01/23/2018] [Indexed: 12/27/2022] Open
Abstract
The identity/recognition of tRNAs, in the context of aminoacyl tRNA synthetases (and other molecules), is a complex phenomenon that has major implications ranging from the origins and evolution of translation machinery and genetic code to the evolution and speciation of tRNAs themselves to human mitochondrial diseases to artificial genetic code engineering. Deciphering it via laboratory experiments, however, is difficult and necessarily time- and resource-consuming. In this study, we propose a mathematically rigorous two-pronged in silico approach to identifying and classifying tRNA positions important for tRNA identity/recognition, rooted in machine learning and information-theoretic methodology. We apply Bayesian Network modeling to elucidate the structure of intra-tRNA-molecule relationships, and distribution divergence analysis to identify meaningful inter-molecule differences between various tRNA subclasses. We illustrate the complementary application of these two approaches using tRNA examples across the three domains of life, and identify and discuss important (informative) positions therein. In summary, we deliver to the tRNA research community a novel, comprehensive methodology for identifying the specific elements of interest in various tRNA molecules, which can be followed up by the corresponding experimental work and/or high-resolution position-specific statistical analyses.
Collapse
Affiliation(s)
- Sergio Branciamore
- Department of Diabetes Complications and Metabolism, Diabetes and Metabolism Research Institute, City of Hope, Duarte, 91010 CA, USA.
| | - Grigoriy Gogoshin
- Department of Diabetes Complications and Metabolism, Diabetes and Metabolism Research Institute, City of Hope, Duarte, 91010 CA, USA.
| | - Massimo Di Giulio
- Early Evolution of Life Laboratory, Institute of Biosciences and Bioresources, CNR, 80131 Naples, Italy.
| | - Andrei S Rodin
- Department of Diabetes Complications and Metabolism, Diabetes and Metabolism Research Institute, City of Hope, Duarte, 91010 CA, USA.
| |
Collapse
|
37
|
Carter CW, Wills PR. Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding. Mol Biol Evol 2018; 35:269-286. [PMID: 29077934 PMCID: PMC5850816 DOI: 10.1093/molbev/msx265] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma's emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene-replicase-translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland, New Zealand
| |
Collapse
|
38
|
Opuu V, Silvert M, Simonson T. Computational design of fully overlapping coding schemes for protein pairs and triplets. Sci Rep 2017; 7:15873. [PMID: 29158504 PMCID: PMC5696523 DOI: 10.1038/s41598-017-16221-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 11/09/2017] [Indexed: 11/26/2022] Open
Abstract
Gene pairs that overlap in their coding regions are rare except in viruses. They may occur transiently in gene creation and are of biotechnological interest. We have examined the possibility to encode an arbitrary pair of protein domains as a dual gene, with the shorter coding sequence completely embedded in the longer one. For 500 × 500 domain pairs (X, Y), we computationally designed homologous pairs (X', Y') coded this way, using an algorithm that provably maximizes the sequence similarity between (X', Y') and (X, Y). Three schemes were considered, with X' and Y' coded on the same or complementary strands. For 16% of the pairs, an overlapping coding exists where the level of homology of X', Y' to the natural proteins represents an E-value of 10-10 or better. Thus, for an arbitrary domain pair, it is surprisingly easy to design homologous sequences that can be encoded as a fully-overlapping gene pair. The algorithm is general and was used to design 200 triple genes, with three proteins encoded by the same DNA segment. The ease of design suggests overlapping genes may have occurred frequently in evolution and could be readily used to compress or constrain artificial genomes.
Collapse
Affiliation(s)
- Vaitea Opuu
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
| | - Martin Silvert
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
| | - Thomas Simonson
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France.
| |
Collapse
|
39
|
Bijective codon transformations show genetic code symmetries centered on cytosine's coding properties. Theory Biosci 2017; 137:17-31. [PMID: 29147851 DOI: 10.1007/s12064-017-0258-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Accepted: 11/13/2017] [Indexed: 12/11/2022]
Abstract
Homology of some RNAs with template DNA requires systematic exchanges between nucleotides. Such exchanges produce 'swinger' RNA along 23 bijective transformations (nine symmetric, X ↔ Y; and 14 asymmetric, X → Y → Z → X, for example A ↔ C and A → C → G → A, respectively). Here, analyses compare amino acids coded by swinger-transformed codons to those coded by untransformed codons, defining coding invariance after transformations. Swinger transformations cluster according to coding invariance in four groups characterized by transformations into cytosine (C = C, T → C, A → C, and G → C). C's central mutational coding role shows that swinger transformations constrained genetic code genesis. Coding invariance post-transformations correlate positively/negatively with mitochondrial swinger transcription/lepidosaurian body temperature. Presumably, low/high temperatures stabilize/revert rare swinger polymerization modes, producing long swinger sequences/point mutations, respectively. Coding invariance after swinger transformations might compensate effects of swinger polymerizations in species with low body temperatures. Hypothetically, swinger transcription increased coding potential of RNA self-replicating protolife systems under heating/cooling cycles.
Collapse
|
40
|
Carter CW, Chandrasekaran SN, Weinreb V, Li L, Williams T. Combining multi-mutant and modular thermodynamic cycles to measure energetic coupling networks in enzyme catalysis. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2017; 4:032101. [PMID: 28191480 PMCID: PMC5272822 DOI: 10.1063/1.4974218] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 12/21/2016] [Indexed: 06/06/2023]
Abstract
We measured and cross-validated the energetics of networks in Bacillus stearothermophilus Tryptophanyl-tRNA synthetase (TrpRS) using both multi-mutant and modular thermodynamic cycles. Multi-dimensional combinatorial mutagenesis showed that four side chains from this "molecular switch" move coordinately with the active-site Mg2+ ion as the active site preorganizes to stabilize the transition state for amino acid activation. A modular thermodynamic cycle consisting of full-length TrpRS, its Urzyme, and the Urzyme plus each of the two domains deleted in the Urzyme gives similar energetics. These dynamic linkages, although unlikely to stabilize the transition-state directly, consign the active-site preorganization to domain motion, assuring coupled vectorial behavior.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill , Chapel Hill, North Carolina 27599-7260, USA
| | - Srinivas Niranj Chandrasekaran
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill , Chapel Hill, North Carolina 27599-7260, USA
| | - Violetta Weinreb
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill , Chapel Hill, North Carolina 27599-7260, USA
| | - Li Li
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill , Chapel Hill, North Carolina 27599-7260, USA
| | - Tishan Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill , Chapel Hill, North Carolina 27599-7260, USA
| |
Collapse
|
41
|
Self-Referential Encoding on Modules of Anticodon Pairs-Roots of the Biological Flow System. Life (Basel) 2017; 7:life7020016. [PMID: 28383509 PMCID: PMC5492138 DOI: 10.3390/life7020016] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Revised: 03/24/2017] [Accepted: 03/26/2017] [Indexed: 12/22/2022] Open
Abstract
The proposal that the genetic code was formed on the basis of (proto)tRNA Dimer-Directed Protein Synthesis is reviewed and updated. The tRNAs paired through the anticodon loops are an indication on the process. Dimers are considered mimics of the ribosomes-structures that hold tRNAs together and facilitate the transferase reaction, and of the translation process-anticodons are at the same time codons for each other. The primitive protein synthesis system gets stabilized when the product peptides are stable and apt to bind the producers therewith establishing a self-stimulating production cycle. The chronology of amino acid encoding starts with Glycine and Serine, indicating the metabolic support of the Glycine-Serine C1-assimilation pathway, which is also consistent with evidence on origins of bioenergetics mechanisms. Since it is not possible to reach for substrates simpler than C1 and compounds in the identified pathway are apt for generating the other central metabolic routes, it is considered that protein synthesis is the beginning and center of a succession of sink-effective mechanisms that drive the formation and evolution of the metabolic flow system. Plasticity and diversification of proteins construct the cellular system following the orientation given by the flow and implementing it. Nucleic acid monomers participate in bioenergetics and the polymers are conservative memory systems for the synthesis of proteins. Protoplasmic fission is the final sink-effective mechanism, part of cell reproduction, guaranteeing that proteins don't accumulate to saturation, which would trigger inhibition.
Collapse
|
42
|
Carter CW. High-Dimensional Mutant and Modular Thermodynamic Cycles, Molecular Switching, and Free Energy Transduction. Annu Rev Biophys 2017; 46:433-453. [PMID: 28375734 DOI: 10.1146/annurev-biophys-070816-033811] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Understanding how distinct parts of proteins produce coordinated behavior has driven and continues to drive advances in protein science and enzymology. However, despite consensus about the conceptual basis for allostery, the idiosyncratic nature of allosteric mechanisms resists general approaches. Computational methods can identify conformational transition states from structural changes, revealing common switching mechanisms that impose multistate behavior. Thermodynamic cycles use factorial perturbations to measure coupling energies between side chains in molecular switches that mediate shear during domain motion. Such cycles have now been complemented by modular cycles that measure energetic coupling between separable domains. For one model system, energetic coupling between domains has been shown to be quantitatively equivalent to that between dynamic side chains. Linkages between domain motion, switching residues, and catalysis make nucleoside triphosphate hydrolysis conditional on domain movement, confirming an essential yet neglected aspect of free energy transduction and suggesting the potential generality of these studies.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27514;
| |
Collapse
|
43
|
José MV, Zamudio GS, Morgado ER. A unified model of the standard genetic code. ROYAL SOCIETY OPEN SCIENCE 2017; 4:160908. [PMID: 28405378 PMCID: PMC5383835 DOI: 10.1098/rsos.160908] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Accepted: 01/30/2017] [Indexed: 06/07/2023]
Abstract
The Rodin-Ohno (RO) and the Delarue models divide the table of the genetic code into two classes of aminoacyl-tRNA synthetases (aaRSs I and II) with recognition from the minor or major groove sides of the tRNA acceptor stem, respectively. These models are asymmetric but they are biologically meaningful. On the other hand, the standard genetic code (SGC) can be derived from the primeval RNY code (R stands for purines, Y for pyrimidines and N any of them). In this work, the RO-model is derived by means of group actions, namely, symmetries represented by automorphisms, assuming that the SGC originated from a primeval RNY code. It turns out that the RO-model is symmetric in a six-dimensional (6D) hypercube. Conversely, using the same automorphisms, we show that the RO-model can lead to the SGC. In addition, the asymmetric Delarue model becomes symmetric by means of quotient group operations. We formulate isometric functions that convert the class aaRS I into the class aaRS II and vice versa. We show that the four polar requirement categories display a symmetrical arrangement in our 6D hypercube. Altogether these results cannot be attained, neither in two nor in three dimensions. We discuss the present unified 6D algebraic model, which is compatible with both the SGC (based upon the primeval RNY code) and the RO-model.
Collapse
Affiliation(s)
- Marco V. José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, MexicoD.F. 04510, Mexico
| | - Gabriel S. Zamudio
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, MexicoD.F. 04510, Mexico
| | - Eberto R. Morgado
- Facultad de Matemática, Física y Computación, Universidad Central ‘Marta Abreu’ de Las Villas, Santa Clara, Cuba
| |
Collapse
|
44
|
Carter CW. Coding of Class I and II Aminoacyl-tRNA Synthetases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2017; 966:103-148. [PMID: 28828732 PMCID: PMC5927602 DOI: 10.1007/5584_2017_93] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels-protozymes and Urzymes-associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric-middle base-pairing frequencies in sense/antisense alignments-that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically-active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7260, USA.
| |
Collapse
|
45
|
Chimeric mitochondrial peptides from contiguous regular and swinger RNA. Comput Struct Biotechnol J 2016; 14:283-97. [PMID: 27453772 PMCID: PMC4942731 DOI: 10.1016/j.csbj.2016.06.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Revised: 06/19/2016] [Accepted: 06/23/2016] [Indexed: 12/20/2022] Open
Abstract
Previous mass spectrometry analyses described human mitochondrial peptides entirely translated from swinger RNAs, RNAs where polymerization systematically exchanged nucleotides. Exchanges follow one among 23 bijective transformation rules, nine symmetric exchanges (X ↔ Y, e.g. A ↔ C) and fourteen asymmetric exchanges (X → Y → Z → X, e.g. A → C → G → A), multiplying by 24 DNA's protein coding potential. Abrupt switches from regular to swinger polymerization produce chimeric RNAs. Here, human mitochondrial proteomic analyses assuming abrupt switches between regular and swinger transcriptions, detect chimeric peptides, encoded by part regular, part swinger RNA. Contiguous regular- and swinger-encoded residues within single peptides are stronger evidence for translation of swinger RNA than previously detected, entirely swinger-encoded peptides: regular parts are positive controls matched with contiguous swinger parts, increasing confidence in results. Chimeric peptides are 200 × rarer than swinger peptides (3/100,000 versus 6/1000). Among 186 peptides with > 8 residues for each regular and swinger parts, regular parts of eleven chimeric peptides correspond to six among the thirteen recognized, mitochondrial protein-coding genes. Chimeric peptides matching partly regular proteins are rarer and less expressed than chimeric peptides matching non-coding sequences, suggesting targeted degradation of misfolded proteins. Present results strengthen hypotheses that the short mitogenome encodes far more proteins than hitherto assumed. Entirely swinger-encoded proteins could exist. Chimeric peptides are translated from contiguous regular and swinger RNA They are 200x rarer than mitochondrial swinger peptides Chimeric peptides integrated in regular mitochondrial proteins are downregulated Contiguous regular parts are matched positive controls for swinger parts The last point validates results beyond other statistical tests for robustness
Collapse
|
46
|
Sapienza PJ, Li L, Williams T, Lee AL, Carter CW. An Ancestral Tryptophanyl-tRNA Synthetase Precursor Achieves High Catalytic Rate Enhancement without Ordered Ground-State Tertiary Structures. ACS Chem Biol 2016; 11:1661-8. [PMID: 27008438 PMCID: PMC5461432 DOI: 10.1021/acschembio.5b01011] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Urzymes-short, active core modules derived from enzyme superfamilies-prepared from the two aminoacyl-tRNA synthetase (aaRS) classes contain only the modules shared by all related family members. They have been described as models for ancestral forms. Understanding them currently depends on inferences drawn from the crystal structures of the full-length enzymes. As aaRS Urzymes lack much of the mass of modern aaRS's, retaining only a small portion of the hydrophobic cores of the full-length enzymes, it is desirable to characterize their structures. We report preliminary characterization of (15)N tryptophanyl-tRNA synthetase Urzyme by heteronuclear single quantum coherence (HSQC) NMR spectroscopy supplemented by circular dichroism, thermal melting, and induced fluorescence of bound dye. The limited dispersion of (1)H chemical shifts (0.5 ppm) is inconsistent with a narrow ensemble of well-packed structures in either free or substrate-bound forms, although the number of resonances from the bound state increases, indicating a modest, ligand-dependent gain in structure. Circular dichroism spectroscopy shows the presence of helices and evidence of cold denaturation, and all ligation states induce Sypro Orange fluorescence at ambient temperatures. Although the term "molten globule" is difficult to define precisely, these characteristics are consistent with most such definitions. Active-site titration shows that a majority of molecules retain ∼60% of the transition state stabilization free energy observed in modern synthetases. In contrast to the conventional view that enzymes require stable tertiary structures, we conclude that a highly flexible ground-state ensemble can nevertheless bind tightly to the transition state for amino acid activation.
Collapse
Affiliation(s)
- Paul J. Sapienza
- Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy
| | - Li Li
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 25799
| | - Tishan Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 25799
| | - Andrew L. Lee
- Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy
| | - Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 25799
| |
Collapse
|
47
|
Abstract
Aminoacyl-tRNA synthetases (aaRSs) are modular enzymes globally conserved in the three kingdoms of life. All catalyze the same two-step reaction, i.e., the attachment of a proteinogenic amino acid on their cognate tRNAs, thereby mediating the correct expression of the genetic code. In addition, some aaRSs acquired other functions beyond this key role in translation. Genomics and X-ray crystallography have revealed great structural diversity in aaRSs (e.g., in oligomery and modularity, in ranking into two distinct groups each subdivided in 3 subgroups, by additional domains appended on the catalytic modules). AaRSs show huge structural plasticity related to function and limited idiosyncrasies that are kingdom or even species specific (e.g., the presence in many Bacteria of non discriminating aaRSs compensating for the absence of one or two specific aaRSs, notably AsnRS and/or GlnRS). Diversity, as well, occurs in the mechanisms of aaRS gene regulation that are not conserved in evolution, notably between distant groups such as Gram-positive and Gram-negative Bacteria. The review focuses on bacterial aaRSs (and their paralogs) and covers their structure, function, regulation, and evolution. Structure/function relationships are emphasized, notably the enzymology of tRNA aminoacylation and the editing mechanisms for correction of activation and charging errors. The huge amount of genomic and structural data that accumulated in last two decades is reviewed, showing how the field moved from essentially reductionist biology towards more global and integrated approaches. Likewise, the alternative functions of aaRSs and those of aaRS paralogs (e.g., during cell wall biogenesis and other metabolic processes in or outside protein synthesis) are reviewed. Since aaRS phylogenies present promiscuous bacterial, archaeal, and eukaryal features, similarities and differences in the properties of aaRSs from the three kingdoms of life are pinpointed throughout the review and distinctive characteristics of bacterium-like synthetases from organelles are outlined.
Collapse
Affiliation(s)
- Richard Giegé
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, IBMC, 67084 Strasbourg, France
| | - Mathias Springer
- Université Paris Diderot, Sorbonne Cité, UPR9073 CNRS, IBPC, 75005 Paris, France
| |
Collapse
|
48
|
Wills PR. The generation of meaningful information in molecular systems. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2016; 374:rsta.2015.0066. [PMID: 26857673 DOI: 10.1098/rsta.2015.0066] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 07/17/2015] [Indexed: 06/05/2023]
Abstract
The physico-chemical processes occurring inside cells are under the computational control of genetic (DNA) and epigenetic (internal structural) programming. The origin and evolution of genetic information (nucleic acid sequences) is reasonably well understood, but scant attention has been paid to the origin and evolution of the molecular biological interpreters that give phenotypic meaning to the sequence information that is quite faithfully replicated during cellular reproduction. The near universality and age of the mapping from nucleotide triplets to amino acids embedded in the functionality of the protein synthetic machinery speaks to the early development of a system of coding which is still extant in every living organism. We take the origin of genetic coding as a paradigm of the emergence of computation in natural systems, focusing on the requirement that the molecular components of an interpreter be synthesized autocatalytically. Within this context, it is seen that interpreters of increasing complexity are generated by series of transitions through stepped dynamic instabilities (non-equilibrium phase transitions). The early phylogeny of the amino acyl-tRNA synthetase enzymes is discussed in such terms, leading to the conclusion that the observed optimality of the genetic code is a natural outcome of the processes of self-organization that produced it.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PB 92019, Auckland 1142, Aotearoa, New Zealand
| |
Collapse
|
49
|
Tamura K. Origins and Early Evolution of the tRNA Molecule. Life (Basel) 2015; 5:1687-99. [PMID: 26633518 PMCID: PMC4695843 DOI: 10.3390/life5041687] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Revised: 11/25/2015] [Accepted: 11/26/2015] [Indexed: 11/16/2022] Open
Abstract
Modern transfer RNAs (tRNAs) are composed of ~76 nucleotides and play an important role as "adaptor" molecules that mediate the translation of information from messenger RNAs (mRNAs). Many studies suggest that the contemporary full-length tRNA was formed by the ligation of half-sized hairpin-like RNAs. A minihelix (a coaxial stack of the acceptor stem on the T-stem of tRNA) can function both in aminoacylation by aminoacyl tRNA synthetases and in peptide bond formation on the ribosome, indicating that it may be a vestige of the ancestral tRNA. The universal CCA-3' terminus of tRNA is also a typical characteristic of the molecule. "Why CCA?" is the fundamental unanswered question, but several findings give a comprehensive picture of its origin. Here, the origins and early evolution of tRNA are discussed in terms of various perspectives, including nucleotide ligation, chiral selectivity of amino acids, genetic code evolution, and the organization of the ribosomal peptidyl transferase center (PTC). The proto-tRNA molecules may have evolved not only as adaptors but also as contributors to the composition of the ribosome.
Collapse
Affiliation(s)
- Koji Tamura
- Department of Biological Science and Technology, Tokyo University of Science, 6-3-1 Niijuku, Katsushika-ku, Tokyo 125-8585, Japan.
- Research Institute for Science and Technology, Tokyo University of Science, 2641 Yamazaki, Noda, Chiba 278-8510, Japan.
| |
Collapse
|
50
|
Carter CW, Wolfenden R. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry. RNA Biol 2015; 13:145-51. [PMID: 26595350 DOI: 10.1080/15476286.2015.1112488] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Abstract
The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.
Collapse
Affiliation(s)
- Charles W Carter
- a Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , Chapel Hill , NC 27599-7260
| | - Richard Wolfenden
- a Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , Chapel Hill , NC 27599-7260
| |
Collapse
|