1. Demongeot J, Seligmann H. Why Is AUG the Start Codon?: Theoretical Minimal RNA Rings: Maximizing Coded Information Biases 1st Codon for the Universal Initiation Codon AUG. Bioessays 2020; 42:e1900201. [PMID: 32227358 DOI: 10.1002/bies.201900201] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 10/26/2019] [Revised: 02/09/2020] [Indexed: 01/04/2023]
Abstract
The rational design of theoretical minimal RNA rings predetermines AUG as the universal start codon. This design maximizes coded amino acid diversity over minimal sequence length, defining in silico theoretical minimal RNA rings, candidate ancestral genes. RNA rings code for 21 amino acids and a stop codon after three consecutive translation rounds, and form a degradation-delaying stem-loop hairpin. Twenty-five RNA rings match these constraints, and ten of them start with the universal initiation codon AUG. No first codon bias exists among the remaining RNA rings. RNA ring design predetermines AUG as the initiation codon. This is the only explanation yet for AUG as the start codon. RNA ring design determines additional RNA ring gene- and tRNA-like properties described previously, because it presumably mimics constraints on life's primordial RNAs.
Affiliation(s)
- Jacques Demongeot
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, F-38700, France
- Hervé Seligmann
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, F-38700, France
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, 91404, Israel
2. Seligmann H. Localized Context-Dependent Effects of the "Ambush" Hypothesis: More Off-Frame Stop Codons Downstream of Shifty Codons. DNA Cell Biol 2019; 38:786-795. [PMID: 31157984 DOI: 10.1089/dna.2019.4725] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Indexed: 12/13/2022]
Abstract
The ambush hypothesis speculates that off-frame stop codons increase translational efficiency after ribosomal frameshifts by stopping frameshifted translation early. Several lines of evidence fit this hypothesis: (1) synonymous codon usages increase with their potential contribution to off-frame stops; (2) the genetic code assigns frequent amino acids to codon families contributing to off-frame stops; (3) positive biases for off-frame stops (AT rich) occur despite adverse nucleotide (GC) biases; and (4) mitochondrial off-frame stop codon densities increase with ribosomal structural instability, a potential proxy of frameshift frequencies. In this study, analyses of vertebrate mitogenes and tRNA synthetase genes from all superkingdoms and viruses test a new prediction of the ambush hypothesis: sequences immediately downstream of frameshift-inducing homopolymer codons (AAA, CCC, GGG, and TTT) are off-frame stop rich. Codons immediately downstream of homopolymer codons form more off-frame stops than average; biases are stronger than for corresponding upstream distances and for any other group of synonymous codons. Sequences downstream of that high-density region are off-frame stop depleted. This decrease suggests that off-frame stops, combined with suppressor tRNAs, regulate translation of overlapping coding sequences. Results show the predictive power of the ambush hypothesis, from macroevolutionary (genetic code structure) to detailed gene sequence anatomy.
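As a rough illustration of the quantity this abstract refers to, the following Python sketch counts stop codons in the +1 and +2 frames of a coding sequence. The function name `count_off_frame_stops` and the default stop set (TAA, TAG, TGA of the standard genetic code) are assumptions made for illustration and are not taken from the study; vertebrate mitochondrial genes use a different stop set, which can be supplied through the `stops` parameter.

```python
# Assumed default: stop codons of the standard genetic code (DNA alphabet).
STANDARD_STOPS = frozenset({"TAA", "TAG", "TGA"})


def count_off_frame_stops(cds, stops=STANDARD_STOPS):
    """Count stop codons in the +1 and +2 frames of a coding sequence.

    `cds` is read as starting at the first base of the coding frame (frame 0).
    Returns a dict mapping the frame shift (1 or 2) to its stop codon count.
    """
    cds = cds.upper().replace("U", "T")  # accept RNA or DNA input
    counts = {}
    for shift in (1, 2):
        # Walk through the complete triplets of the shifted frame only.
        triplets = (cds[i:i + 3] for i in range(shift, len(cds) - 2, 3))
        counts[shift] = sum(1 for codon in triplets if codon in stops)
    return counts


# Frame 0 reads GCT AAG CTG ACC (no stop); frame +1 contains TGA, frame +2 contains TAA.
print(count_off_frame_stops("GCTAAGCTGACC"))  # {1: 1, 2: 1}
```

Comparing such counts for windows downstream and upstream of homopolymer codons would be one way to approach the prediction described above, although the window sizes and statistics used in the study are not given in the abstract.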
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel
3. Demongeot J, Seligmann H. Theoretical minimal RNA rings recapitulate the order of the genetic code's codon-amino acid assignments. J Theor Biol 2019; 471:108-116. [DOI: 10.1016/j.jtbi.2019.03.024] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Received: 06/04/2018] [Revised: 09/19/2018] [Accepted: 03/28/2019] [Indexed: 12/21/2022]
4. Demongeot J, Seligmann H. Bias for 3'-Dominant Codon Directional Asymmetry in Theoretical Minimal RNA Rings. J Comput Biol 2019; 26:1003-1012. [PMID: 31120344 DOI: 10.1089/cmb.2018.0256] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Indexed: 12/17/2022]
Abstract
Aminoacyl tRNA synthetases ligate tRNAs specifically with their cognate amino acid. These synthetases are among life's earliest proteins, class II tRNA synthetases (cognates A, D, F, G, H, K, N, P, S, and T) presumably preceding class I tRNA synthetases (cognates C, E, I, L, M, Q, R, V, W, and Y). Classification of codons into palindromic (structure XYX), 5'-dominant (YXX), and 3'-dominant (XXY) (Codon Directional Asymmetry [CDA]) shows that class II tRNA synthetases aminoacylate amino acids associated with XXY. Our working hypothesis expects bias for XXY codons in primordial RNAs, such as theoretical minimal RNA rings, designed in silico to mimic life's earliest RNAs. Twenty-five RNA rings have been computed, which code over a minimal length (22 nucleotides) for a start codon, stop codon, and one and only one codon for each of the 20 amino acids, and form stem-loop hairpins preventing degradation; these 25 minimal RNAs are the only ones matching these constraints and they seem homologous to consensus tRNA sequences. This similarity defined candidate RNA ring anticodons and corresponding cognate amino acids. Here, analyses of RNA ring codon contents confirm bias for XXY codons in 13 of the 14 RNA rings with unequal XXY and YXX codon numbers. This bias increases with the genetic code integration order of the RNA ring's cognate amino acid across and within tRNA synthetase classes, suggesting that evolutionary processes, and not physicochemical constraints, produced the association between CDA and tRNA synthetase classes. The self-referential hypothesis for genetic code origin, a very complete genetic code evolutionary hypothesis integrating many translational machinery components, predicts CDA biases in RNA rings better than other genetic code evolutionary hypotheses. The RNA rings' simple design inadvertently reproduces CDAs predicted by the genetic code's structure, confirming theoretical minimal RNA rings as good proxies for life's earliest RNAs.
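The arithmetic behind the 22-nucleotide design can be sketched in Python: three translation rounds over a 22-nucleotide ring read 66 nucleotides, that is, 22 codons, and because 22 is not a multiple of 3 each round reads the ring in a different frame. The helper `ring_codons` and the placeholder sequence below are hypothetical illustrations; the placeholder is not one of the 25 published RNA rings.

```python
def ring_codons(ring, rounds=3):
    """Return the codons read when a circular RNA is translated `rounds` times.

    Reading starts at the first nucleotide and wraps around the ring end.
    """
    total = len(ring) * rounds
    if total % 3 != 0:
        raise ValueError("rounds * ring length must be a multiple of 3")
    unrolled = ring * rounds  # linear copy of the ring, repeated once per round
    return [unrolled[i:i + 3] for i in range(0, total, 3)]


# Hypothetical 22-nucleotide placeholder, NOT one of the published RNA rings.
placeholder_ring = "ACGU" * 5 + "AC"
codons = ring_codons(placeholder_ring)
print(len(codons))  # 22
```

Checking a candidate ring against the constraints quoted in the abstract would additionally require translating these 22 codons and testing hairpin formation, which this sketch does not attempt.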
Affiliation(s)
- Jacques Demongeot
- Laboratory AGEIS EA 7407, Faculty of Medicine, Team Tools for e-Gnosis Medical, Université Grenoble Alpes, La Tronche, France
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel
5. Demongeot J, Seligmann H. More Pieces of Ancient than Recent Theoretical Minimal Proto-tRNA-Like RNA Rings in Genes Coding for tRNA Synthetases. J Mol Evol 2019; 87:152-174. [DOI: 10.1007/s00239-019-09892-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Received: 12/19/2018] [Accepted: 03/22/2019] [Indexed: 12/19/2022]
6. Seligmann H. Protein Sequences Recapitulate Genetic Code Evolution. Comput Struct Biotechnol J 2018; 16:177-189. [PMID: 30002789 PMCID: PMC6040577 DOI: 10.1016/j.csbj.2018.05.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 01/17/2018] [Revised: 05/14/2018] [Accepted: 05/17/2018] [Indexed: 12/16/2022]
Abstract
Several hypotheses predict ranks of amino acid assignments to the genetic code's codons. Analyses here show that average positions of amino acid species in proteins correspond to assignment ranks, in particular as predicted by Jukes' neutral mutation hypothesis for codon assignments. In all tested protein groups, including co- and post-translationally folding proteins, 'recent' amino acids are on average closer to gene 5' extremities than 'ancient' ones. Analyses of pairwise residue contact energy matrices suggest that early amino acids stereochemically selected late ones that stabilize residue interactions within protein cores, presumably producing 5'-late-to-3'-early amino acid protein sequence gradients. The gradient might reduce protein misfolding, also after mutations, extending principles of neutral mutations to protein folding. Presumably, in self-perpetuating and self-correcting systems like the genetic code, initial conditions produce similarities between evolution of the process (the genetic code) and 'ontogeny' of resulting structures (here proteins), producing apparent teleonomy between process and product.
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UMR MEPHI, Aix-Marseille Université, IRD, Assistance Publique-Hôpitaux de Marseille, Institut Hospitalo-Universitaire Méditerranée-Infection, 19-21 boulevard Jean Moulin, 13005 Marseille, France.
7. Seligmann H, Warthi G. Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes. Comput Struct Biotechnol J 2017; 15:412-424. [PMID: 28924459 PMCID: PMC5591391 DOI: 10.1016/j.csbj.2017.08.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Received: 05/25/2017] [Revised: 07/20/2017] [Accepted: 08/05/2017] [Indexed: 12/14/2022]
Abstract
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), and codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = -1/1, or CDA = -0.5/0.5 if Z and X are both purines or both pyrimidines; assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA signs (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
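The scoring rule stated in this abstract can be written as a short Python sketch. It covers only the cases the abstract spells out: palindromic codons score 0, and ZXX/XXZ codons score -1/+1, or -0.5/+0.5 when Z and X are both purines or both pyrimidines. How the paper scores codons with three different nucleotides is not stated here, so returning `None` for them is an assumption, as is the function name `codon_directional_asymmetry`.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CTU")


def same_chemical_class(a, b):
    """True if both nucleotides are purines or both are pyrimidines."""
    return {a, b} <= PURINES or {a, b} <= PYRIMIDINES


def codon_directional_asymmetry(codon):
    """Score a codon by the CDA rule as summarized in the abstract.

    Returns 0.0 for codons whose first and last nucleotides are identical,
    a negative value for ZXX, a positive value for XXZ, and None for codons
    with three different nucleotides (not covered by the abstract; assumption).
    """
    codon = codon.upper()
    if len(codon) != 3 or any(n not in PURINES | PYRIMIDINES for n in codon):
        raise ValueError(f"not a nucleotide triplet: {codon!r}")
    first, middle, last = codon
    if first == last:        # XZX (homopolymers XXX also satisfy this test)
        return 0.0
    if middle == last:       # ZXX: the odd nucleotide sits at the 5' end
        z, x, sign = first, middle, -1.0
    elif first == middle:    # XXZ: the odd nucleotide sits at the 3' end
        z, x, sign = last, middle, 1.0
    else:
        return None
    magnitude = 0.5 if same_chemical_class(z, x) else 1.0
    return sign * magnitude


for codon in ("GCG", "AUU", "UUA", "GAA", "AAG"):
    print(codon, codon_directional_asymmetry(codon))
```

For example, AUU (Z = A, a purine; X = U, a pyrimidine) scores -1.0, whereas GAA (both purines) scores -0.5.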
Affiliation(s)
- Hervé Seligmann
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
- Dept. Ecol Evol Behav, Alexander Silberman Inst Life Sci, The Hebrew University of Jerusalem, IL-91904 Jerusalem, Israel
- Ganesh Warthi
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
8. El Houmami N, Seligmann H. Evolution of Nucleotide Punctuation Marks: From Structural to Linear Signals. Front Genet 2017; 8:36. [PMID: 28396681 PMCID: PMC5366352 DOI: 10.3389/fgene.2017.00036] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Received: 10/14/2016] [Accepted: 03/13/2017] [Indexed: 01/13/2023]
Abstract
We present an evolutionary hypothesis assuming that signals marking nucleotide synthesis (DNA replication and RNA transcription) evolved from multi- to unidimensional structures, and were carried over from transcription to translation. This evolutionary scenario presumes that signals combining secondary and primary nucleotide structures are evolutionary transitions. Mitochondrial replication initiation fits this scenario. Some observations reported in the literature corroborate that several signals for nucleotide synthesis function in translation, and vice versa. (a) Polymerase-induced frameshift mutations occur preferentially at translational termination signals (nucleotide deletion is interpreted as termination of nucleotide polymerization, paralleling the role of stop codons in translation). (b) Stem-loop hairpin presence/absence modulates codon-amino acid assignments, showing that translational signals sometimes combine primary and secondary nucleotide structures (here codon and stem-loop). (c) Homopolymer nucleotide triplets (AAA, CCC, GGG, TTT) cause transcriptional and ribosomal frameshifts. Here we find that, in recently described human mitochondrial RNAs that systematically lack mono- or dinucleotides after each trinucleotide (delRNAs), delRNA triplets include twice as many homopolymers as mitogenome regions not covered by delRNAs. Further analyses of delRNAs show that the natural circular code X (a little-known group of 20 codons {AAC, AAT, ACC, ATC, ATT, CAG, CTC, CTG, GAA, GAC, GAG, GAT, GCC, GGC, GGT, GTA, GTC, GTT, TAC, TTC} that act as translational signals enabling ribosomal frame retrieval and are universally overrepresented in coding versus other frames of gene sequences) regulates frameshift in transcription and translation. This dual transcription and translation role confirms for X the hypothesis that translational signals were carried over from transcriptional signals.
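To make the frame-retrieval idea concrete, the Python sketch below counts codons of the circular code X, as listed in the abstract, in each of the three frames of a sequence and reports the frame richest in X codons. Picking the frame with the highest count is a simple heuristic assumed here for illustration, not the analysis performed in the paper; the function names are likewise hypothetical.

```python
# The 20 codons of the natural circular code X, as listed in the abstract.
CIRCULAR_CODE_X = frozenset("""
AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC
GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC
""".split())


def x_codon_counts(seq):
    """Return [n0, n1, n2]: the number of X codons in frames 0, 1 and 2."""
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA input
    counts = []
    for frame in range(3):
        # Walk through the complete triplets of this frame only.
        triplets = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
        counts.append(sum(1 for codon in triplets if codon in CIRCULAR_CODE_X))
    return counts


def most_x_rich_frame(seq):
    """Return the frame (0, 1 or 2) with most X codons; ties go to the lowest frame."""
    counts = x_codon_counts(seq)
    return counts.index(max(counts))


print(x_codon_counts("GAAGTCGGC"))      # [3, 0, 0]
print(most_x_rich_frame("AGAAGTCGGC"))  # 1
```

In the second call, one extra leading nucleotide shifts the X-rich frame from 0 to 1, which is the kind of signal a frame-retrieval mechanism could in principle exploit.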
Affiliation(s)
- Nawal El Houmami
- URMITE, Aix Marseille Université UM63, CNRS 7278, IRD 198, INSERM 1095, IHU - Méditerranée Infection Marseille, France
- Hervé Seligmann
- URMITE, Aix Marseille Université UM63, CNRS 7278, IRD 198, INSERM 1095, IHU - Méditerranée Infection Marseille, France
9. Seligmann H. Translation of mitochondrial swinger RNAs according to tri-, tetra- and pentacodons. Biosystems 2015; 140:38-48. [PMID: 26723232 DOI: 10.1016/j.biosystems.2015.11.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Received: 08/13/2015] [Revised: 11/08/2015] [Accepted: 11/23/2015] [Indexed: 10/22/2022]
Abstract
Transcriptomes and proteomes include RNA and protein fragments not matching regular transcription/translation. Some 'non-canonical' mitochondrial transcripts match mitogenomes after assuming one among 23 systematic exchanges between nucleotides, producing swinger RNAs (nine symmetric, X↔Y, example C↔T; 14 asymmetric, X→Y→Z→X, example A→T→G→A) in GenBank's EST database. Here, reanalyses of (a) public human mitochondrial transcriptome data (Illumina: RNA-seq) detected mitochondrial swinger RNAs for all 23 exchanges and (b) independent public human mitochondrial trypsinized proteomic mass spectrometry data detected peptides predicted from translation of parts of swinger-transformed mitogenomes covered by detected swinger reads. RNA-seq and previous EST swinger transcript data converge. Swinger RNA translation frequently inserts various amino acids at stop codons. Swinger RNA-peptide associations exist also for peptides matching systematically frameshifting translation, peptides entirely coded by tetra- and pentacodons (regular codons expanded by silent mononucleotides at 4th, and silent dinucleotides at 4th and 5th position(s), respectively). Swinger peptides differ from regular mitochondrial proteins: they are not membrane embedded and reflect warmer, anaerobic, low-resource conditions, reminiscent of a free-living ancestor. Tetra- and pentacoded peptides associate with low and high GC contents, respectively, suggesting that expanded codon translations associate with thermal stresses. Results experimentally confirm predicted swinger, tetra- and pentacoded mitochondrial peptides, increasing mitogenomic coding density.
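The counts quoted in this abstract (nine symmetric and 14 asymmetric exchanges, 23 in total) match the 4! - 1 = 23 non-identity permutations of the four nucleotides: a permutation that is its own inverse corresponds to a symmetric exchange, and every other one to an asymmetric exchange. The Python sketch below enumerates them and applies one exchange to a sequence. The function names are hypothetical, and the code illustrates the combinatorics only, not the read-mapping pipeline used in the study.

```python
from itertools import permutations

NUCLEOTIDES = "ACGT"


def classify_exchanges():
    """Split the non-identity nucleotide permutations into symmetric and asymmetric rules."""
    symmetric, asymmetric = [], []
    for image in permutations(NUCLEOTIDES):
        rule = dict(zip(NUCLEOTIDES, image))
        if all(rule[n] == n for n in NUCLEOTIDES):
            continue  # identity: regular transcription, not an exchange
        # Symmetric exchanges (X<->Y) undo themselves when applied twice.
        is_involution = all(rule[rule[n]] == n for n in NUCLEOTIDES)
        (symmetric if is_involution else asymmetric).append(rule)
    return symmetric, asymmetric


def apply_exchange(sequence, rule):
    """Rewrite a DNA sequence according to a nucleotide exchange rule."""
    return "".join(rule.get(n, n) for n in sequence.upper())


symmetric, asymmetric = classify_exchanges()
print(len(symmetric), len(asymmetric))  # 9 14

c_t_exchange = {"A": "A", "C": "T", "G": "G", "T": "C"}  # the C<->T example
print(apply_exchange("ACGT", c_t_exchange))  # ATGC
```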
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, Marseille, France.
10. Fang P, Guo M. Evolutionary Limitation and Opportunities for Developing tRNA Synthetase Inhibitors with 5-Binding-Mode Classification. Life (Basel) 2015; 5:1703-25. [PMID: 26670257 PMCID: PMC4695845 DOI: 10.3390/life5041703] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Received: 11/01/2015] [Revised: 11/24/2015] [Accepted: 11/25/2015] [Indexed: 12/30/2022]
Abstract
Aminoacyl-tRNA synthetases (aaRSs) are enzymes that catalyze the transfer of amino acids to their cognate tRNAs as building blocks for translation. Each of the aaRS families plays a pivotal role in protein biosynthesis and is indispensable for cell growth and survival. In addition, aaRSs in higher species have evolved important non-translational functions. These translational and non-translational functions of aaRSs are attractive for developing antibacterial, antifungal, and antiparasitic agents and for treating other human diseases. The interplay between amino acids, tRNA, ATP, EF-Tu and non-canonical binding partners has shaped each family with a distinct pattern of key sites for regulation, with characters varying among species across the path of evolution. These sporadic variations in the aaRSs offer a great opportunity to target these essential enzymes for therapy. To date, growing numbers of aaRS inhibitors have been discovered and developed. Here, we summarize the latest developments and structural studies of aaRS inhibitors, and classify them by their distinct binding modes into five categories.
Affiliation(s)
- Pengfei Fang
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, China.
- Department of Cancer Biology, The Scripps Research Institute, Scripps Florida, 130 Scripps Way, Jupiter, FL 33458, USA.
- Min Guo
- Department of Cancer Biology, The Scripps Research Institute, Scripps Florida, 130 Scripps Way, Jupiter, FL 33458, USA.
11. The relation between hairpin formation by mitochondrial WANCY tRNAs and the occurrence of the light strand replication origin in Lepidosauria. Gene 2014; 542:248-57. [DOI: 10.1016/j.gene.2014.02.021] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Received: 05/30/2013] [Revised: 12/27/2013] [Accepted: 02/17/2014] [Indexed: 01/28/2023]
12. Seligmann H. Pocketknife tRNA hypothesis: Anticodons in mammal mitochondrial tRNA side-arm loops translate proteins? Biosystems 2013; 113:165-76. [DOI: 10.1016/j.biosystems.2013.07.004] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Received: 04/17/2013] [Revised: 07/02/2013] [Accepted: 07/03/2013] [Indexed: 12/11/2022]
13. Seligmann H. Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes. Biosystems 2013; 111:156-74. [PMID: 23410796 DOI: 10.1016/j.biosystems.2013.01.011] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Received: 10/24/2012] [Revised: 01/24/2013] [Accepted: 01/29/2013] [Indexed: 12/23/2022]
Abstract
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, and self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential.
Affiliation(s)
- Hervé Seligmann
- National Natural History Museum Collections, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel.
14. Putative mitochondrial polypeptides coded by expanded quadruplet codons, decoded by antisense tRNAs with unusual anticodons. Biosystems 2012; 110:84-106. [DOI: 10.1016/j.biosystems.2012.09.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Received: 04/18/2012] [Revised: 09/20/2012] [Accepted: 09/26/2012] [Indexed: 11/19/2022]