1
Calaça Serrão A, Dänekamp FT, Meggyesi Z, Braun D. Replication elongates short DNA, reduces sequence bias and develops trimer structure. Nucleic Acids Res 2024; 52:1290-1297. [PMID: 38096089 PMCID: PMC10853772 DOI: 10.1093/nar/gkad1190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/13/2023] [Revised: 11/15/2023] [Accepted: 11/30/2023] [Indexed: 02/10/2024]
Abstract
The origin of molecular evolution required the replication of short oligonucleotides to form longer polymers. Prebiotically plausible oligonucleotide pools tend to contain more of some nucleobases than others. It has been unclear whether this initial bias persists and how it affects replication. To investigate this, we examined the evolution of 12-mer biased short DNA pools using an enzymatic model system. This allowed us to study the long timescales involved in evolution, since it is not yet possible with currently investigated prebiotic replication chemistries. Our analysis using next-generation sequencing from different time points revealed that the initial nucleotide bias of the pool disappeared in the elongated pool after isothermal replication. In contrast, the nucleotide composition at each position in the elongated sequences remained biased and varied with both position and initial bias. Furthermore, we observed the emergence of highly periodic dimer and trimer motifs in the rapidly elongated sequences. This shift in nucleotide composition and the emergence of structure through templated replication could help explain how biased prebiotic pools could undergo molecular evolution and lead to complex functional nucleic acids.
Affiliation(s)
- Adriana Calaça Serrão
- Systems Biophysics, Physics Department, Center for NanoScience, Ludwig-Maximilians-Universität München, Amalienstraße 54, 80799 Munich, Germany
- Felix T Dänekamp
- Systems Biophysics, Physics Department, Center for NanoScience, Ludwig-Maximilians-Universität München, Amalienstraße 54, 80799 Munich, Germany
- Zsófia Meggyesi
- Systems Biophysics, Physics Department, Center for NanoScience, Ludwig-Maximilians-Universität München, Amalienstraße 54, 80799 Munich, Germany
- Dieter Braun
- Systems Biophysics, Physics Department, Center for NanoScience, Ludwig-Maximilians-Universität München, Amalienstraße 54, 80799 Munich, Germany
2
Wu LF, Liu Z, Roberts SJ, Su M, Szostak JW, Sutherland JD. Template-Free Assembly of Functional RNAs by Loop-Closing Ligation. J Am Chem Soc 2022; 144:13920-13927. [PMID: 35880790 PMCID: PMC9354263 DOI: 10.1021/jacs.2c05601] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 11/30/2022]
Abstract
The first ribozymes are thought to have emerged at a time when RNA replication proceeded via nonenzymatic template copying processes. However, functional RNAs have stable folded structures, and such structures are much more difficult to copy than short unstructured RNAs. How can these conflicting requirements be reconciled? Also, how can the inhibition of ribozyme function by complementary template strands be avoided or minimized? Here, we show that short RNA duplexes with single-stranded overhangs can be converted into RNA stem loops by nonenzymatic cross-strand ligation. We then show that loop-closing ligation reactions enable the assembly of full-length functional ribozymes without any external template. Thus, one can envisage a potential pathway whereby structurally complex functional RNAs could have formed at an early stage of evolution when protocell genomes might have consisted only of collections of short replicating oligonucleotides.
Affiliation(s)
- Long-Fei Wu
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom; Department of Molecular Biology and Center for Computational and Integrative Biology, Howard Hughes Medical Institute, Massachusetts General Hospital, Boston, Massachusetts 02114, United States; Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, United States; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
- Ziwei Liu
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom
- Samuel J Roberts
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom
- Meng Su
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom
- Jack W Szostak
- Department of Molecular Biology and Center for Computational and Integrative Biology, Howard Hughes Medical Institute, Massachusetts General Hospital, Boston, Massachusetts 02114, United States; Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, United States; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
- John D Sutherland
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom
3
Zhou L, Ding D, Szostak JW. The virtual circular genome model for primordial RNA replication. RNA (New York, N.Y.) 2021; 27:1-11. [PMID: 33028653 PMCID: PMC7749632 DOI: 10.1261/rna.077693.120] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Received: 08/21/2020] [Accepted: 10/02/2020] [Indexed: 05/13/2023]
Abstract
We propose a model for the replication of primordial protocell genomes that builds upon recent advances in the nonenzymatic copying of RNA. We suggest that the original genomes consisted of collections of oligonucleotides beginning and ending at all possible positions on both strands of one or more virtual circular sequences. Replication is driven by feeding with activated monomers and by the activation of monomers and oligonucleotides in situ. A fraction of the annealed configurations of the protocellular oligonucleotides would allow for template-directed oligonucleotide growth by primer extension or ligation. Rearrangements of these annealed configurations, driven either by environmental fluctuations or occurring spontaneously, would allow for continued oligonucleotide elongation. Assuming that shorter oligonucleotides were more abundant than longer ones, replication of the entire genome could occur by the growth of all oligonucleotides by as little as one nucleotide on average. We consider possible scenarios that could have given rise to such protocell genomes, as well as potential routes to the emergence of catalytically active ribozymes and thus the more complex cells of the RNA World.
Affiliation(s)
- Lijun Zhou
- Howard Hughes Medical Institute, Department of Molecular Biology and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
- Dian Ding
- Howard Hughes Medical Institute, Department of Molecular Biology and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, USA
- Jack W Szostak
- Howard Hughes Medical Institute, Department of Molecular Biology and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, USA
4
Oliver CG, Reinharz V, Waldispühl J. On the emergence of structural complexity in RNA replicators. RNA (New York, N.Y.) 2019; 25:1579-1591. [PMID: 31467146 PMCID: PMC6859851 DOI: 10.1261/rna.070391.119] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 01/15/2019] [Accepted: 08/19/2019] [Indexed: 06/10/2023]
Abstract
The RNA world hypothesis relies on the ability of ribonucleic acids to spontaneously acquire complex structures capable of supporting essential biological functions. Multiple sophisticated evolutionary models have been proposed for their emergence, but they often assume specific conditions. In this work, we explore a simple and parsimonious scenario describing the emergence of complex molecular structures at the early stages of life. We show that at specific GC content regimes, an undirected replication model is sufficient to explain the apparition of multibranched RNA secondary structures-a structural signature of many essential ribozymes. We ran a large-scale computational study to map energetically stable structures on complete mutational networks of 50-nt-long RNA sequences. Our results reveal that the sequence landscape with stable structures is enriched with multibranched structures at a length scale coinciding with the appearance of complex structures in RNA databases. A random replication mechanism preserving a 50% GC content may suffice to explain a natural enrichment of stable complex structures in populations of functional RNAs. In contrast, an evolutionary mechanism eliciting the most stable folds at each generation appears to help reaching multibranched structures at highest GC content.
Affiliation(s)
- Carlos G Oliver
- School of Computer Science, McGill University, Montreal, QC H3A 2B3, Canada
- Vladimir Reinharz
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 34126, South Korea
- Jérôme Waldispühl
- School of Computer Science, McGill University, Montreal, QC H3A 2B3, Canada
5
Type-II tRNAs and Evolution of Translation Systems and the Genetic Code. Int J Mol Sci 2018; 19:ijms19103275. [PMID: 30360357 PMCID: PMC6214036 DOI: 10.3390/ijms19103275] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Received: 09/28/2018] [Revised: 10/12/2018] [Accepted: 10/18/2018] [Indexed: 12/23/2022]
Abstract
Because tRNA is the core biological intellectual property that was necessary to evolve translation systems, tRNAomes, ribosomes, aminoacyl-tRNA synthetases, and the genetic code, the evolution of tRNA is the core story in evolution of life on earth. We have previously described the evolution of type-I tRNAs. Here, we use the same model to describe the evolution of type-II tRNAs, with expanded V loops. The models are strongly supported by inspection of typical tRNA diagrams, measuring lengths of V loop expansions, and analyzing the homology of V loop sequences to tRNA acceptor stems. Models for tRNA evolution provide a pathway for the inanimate-to-animate transition and for the evolution of translation systems, the genetic code, and cellular life.
6
Slinger BL, Meyer MM. RNA regulators responding to ribosomal protein S15 are frequent in sequence space. Nucleic Acids Res 2016; 44:9331-9341. [PMID: 27580716 PMCID: PMC5100602 DOI: 10.1093/nar/gkw754] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 06/27/2016] [Accepted: 08/19/2016] [Indexed: 02/07/2023]
Abstract
There are several natural examples of distinct RNA structures that interact with the same ligand to regulate the expression of homologous genes in different organisms. One essential question regarding this phenomenon is whether such RNA regulators are the result of convergent or divergent evolution. Are the RNAs derived from some common ancestor and diverged to the point where we cannot identify the similarity, or have multiple solutions to the same biological problem arisen independently? A key variable in assessing these alternatives is how frequently such regulators arise within sequence space. Ribosomal protein S15 is autogenously regulated via an RNA regulator in many bacterial species; four apparently distinct regulators have been functionally validated in different bacterial phyla. Here, we explore how frequently such regulators arise within a partially randomized sequence population. We find many RNAs that interact specifically with ribosomal protein S15 from Geobacillus kaustophilus with biologically relevant dissociation constants. Furthermore, of the six sequences we characterize, four show regulatory activity in an Escherichia coli reporter assay. Subsequent footprinting and mutagenesis analysis indicates that protein binding proximal to regulatory features such as the Shine–Dalgarno sequence is sufficient to enable regulation, suggesting that regulation in response to S15 is relatively easily acquired.
Affiliation(s)
- Betty L Slinger
- Biology Department, Boston College, Chestnut Hill, MA 02467, USA
- Michelle M Meyer
- Biology Department, Boston College, Chestnut Hill, MA 02467, USA
7
Mallatt J, Chittenden KD. The GC content of LSU rRNA evolves across topological and functional regions of the ribosome in all three domains of life. Mol Phylogenet Evol 2014; 72:17-30. [PMID: 24394731 DOI: 10.1016/j.ympev.2013.12.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 05/01/2013] [Revised: 11/28/2013] [Accepted: 12/24/2013] [Indexed: 12/21/2022]
Abstract
Large-subunit rRNA is the ribozyme that catalyzes protein synthesis by translation, and many of its features vary along a deep-to-superficial gradient. By measuring the G+C proportions in this rRNA in all three domains of life (60 bacteria, 379 eukaryote, and 23 archaean sequences), we tested whether the proportion of GC nucleotides varies along this in-out gradient. The rRNA regions used were several zones identified by Bokov and Steinberg (2009) as being arranged from deep to superficial within the LSU. To the Bokov-Steinberg zones, we added the most superficial zone of all, the divergent domains (expansion segments), which are greatly enlarged in eukaryotes. Regression lines constructed from the hundreds of species of organisms revealed the expected in-out gradient, showing that species with high %GC (or high %AT) in their rRNA distribute more of these abundant nucleotides into the peripheral zones. This could be explained by the evolutionary rates of replacement of all nucleotides (A, C, G, T), because these latter rates are fastest at the periphery and slowest near the conserved core. As an overall explanation, we propose that when extrinsic factors (whole-genome nucleotide composition, or environmental temperature) demand the percentage of GC in the rRNA of a species be high or low, then the deep-lying zones are buffered against GC variation because they are the slowest to evolve. The deep, conserved zones are also the most involved in translation, hinting that stabilizing selection there prevents a high GC variability that would diminish LSU rRNA's core functions. We found only a few domain-specific trends in rRNA-GC distribution, which relate to many Archaea living at high temperatures or to the highly complex genes and adaptations of Eukaryota. Use of rRNA sequences in molecular phylogenetic studies, for reconstructing the relationships of organisms across the tree of life, requires accurate models of how rRNA evolves. 
The demonstration that GC distributes in regular patterns across rRNA regions can improve these tree-reconstruction models in the future and should yield phylogenies of greater accuracy.
Affiliation(s)
- Jon Mallatt
- School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, United States.
- Kevin D Chittenden
- School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, United States
8
Ivica NA, Obermayer B, Campbell GW, Rajamani S, Gerland U, Chen IA. The paradox of dual roles in the RNA world: resolving the conflict between stable folding and templating ability. J Mol Evol 2013; 77:55-63. [PMID: 24078151 PMCID: PMC12051476 DOI: 10.1007/s00239-013-9584-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Received: 06/10/2013] [Accepted: 09/12/2013] [Indexed: 10/26/2022]
Abstract
The hypothesized dual roles of RNA as both information carrier and biocatalyst during the earliest stages of life require a combination of features: good templating ability (for replication) and stable folding (for ribozymes). However, this poses the following paradox: well-folded sequences are poor templates for copying, but poorly folded sequences are unlikely to be good ribozymes. Here, we describe a strategy to overcome this dilemma through G:U wobble pairing in RNA. Unlike Watson-Crick base pairs, wobble pairs contribute highly to the energetic stability of the folded structure of their sequence, but only slightly, if at all, to the stability of the folded reverse complement. Sequences in the RNA World might thereby combine stable folding of the ribozyme with an unstructured, reverse-complementary genome, resulting in a "division of labor" between the strands. We demonstrate this strategy using computational simulations of RNA folding and an experimental model of early replication, nonenzymatic template-directed RNA primer extension. Additional study is needed to solve other problems associated with a complete replication cycle, including separation of strands after copying. Interestingly, viroid RNA sequences, which have been suggested to be relics of an RNA World (Diener, Proc Natl Acad Sci USA 86:9370-9374, 1989), also show significant asymmetry in folding energy between the infectious (+) and template (-) strands due to G:U pairing, suggesting that this strategy may even be used by replicators in the present day.
Affiliation(s)
- Nikola A. Ivica
- FAS Center for Systems Biology, Harvard University, Cambridge, MA, USA
- Gregory W. Campbell
- Department of Chemistry and Biochemistry and Program in Biomolecular Sciences and Engineering, University of California, Santa Barbara, CA, USA
- Sudha Rajamani
- FAS Center for Systems Biology, Harvard University, Cambridge, MA, USA
- Ulrich Gerland
- Department of Physics, Ludwig-Maximillian University, Munich, Germany
- Irene A. Chen
- FAS Center for Systems Biology, Harvard University, Cambridge, MA, USA
- Department of Chemistry and Biochemistry and Program in Biomolecular Sciences and Engineering, University of California, Santa Barbara, CA, USA
9
Derr J, Manapat ML, Rajamani S, Leu K, Xulvi-Brunet R, Joseph I, Nowak MA, Chen IA. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences. Nucleic Acids Res 2012; 40:4711-22. [PMID: 22319215 PMCID: PMC3378899 DOI: 10.1093/nar/gks065] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Indexed: 11/14/2022]
Abstract
During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life.
Affiliation(s)
- Julien Derr
- FAS Center for Systems Biology, Harvard University, Cambridge, MA 02138, USA
10
Leu K, Obermayer B, Rajamani S, Gerland U, Chen IA. The prebiotic evolutionary advantage of transferring genetic information from RNA to DNA. Nucleic Acids Res 2011; 39:8135-47. [PMID: 21724606 PMCID: PMC3185426 DOI: 10.1093/nar/gkr525] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Received: 04/20/2011] [Revised: 06/07/2011] [Accepted: 06/07/2011] [Indexed: 11/13/2022]
Abstract
In the early 'RNA world' stage of life, RNA stored genetic information and catalyzed chemical reactions. However, the RNA world eventually gave rise to the DNA-RNA-protein world, and this transition included the 'genetic takeover' of information storage by DNA. We investigated evolutionary advantages for using DNA as the genetic material. The error rate of replication imposes a fundamental limit on the amount of information that can be stored in the genome, as mutations degrade information. We compared misincorporation rates of RNA and DNA in experimental non-enzymatic polymerization and calculated the lowest possible error rates from a thermodynamic model. Both analyses found that RNA replication was intrinsically error-prone compared to DNA, suggesting that total genomic information could increase after the transition to DNA. Analysis of the transitional RNA/DNA hybrid duplexes showed that copying RNA into DNA had similar fidelity to RNA replication, so information could be maintained during the genetic takeover. However, copying DNA into RNA was very error-prone, suggesting that attempts to return to the RNA world would result in a considerable loss of information. Therefore, the genetic takeover may have been driven by a combination of increased chemical stability, increased genome size and irreversibility.
Affiliation(s)
- Kevin Leu
- FAS Center for Systems Biology, Department of Physics, Harvard University, Cambridge, MA 02138, USA and Department of Physics, Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Ludwig-Maximilians Universität München, Munich, Germany
- Benedikt Obermayer
- FAS Center for Systems Biology, Department of Physics, Harvard University, Cambridge, MA 02138, USA and Department of Physics, Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Ludwig-Maximilians Universität München, Munich, Germany
- Sudha Rajamani
- FAS Center for Systems Biology, Department of Physics, Harvard University, Cambridge, MA 02138, USA and Department of Physics, Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Ludwig-Maximilians Universität München, Munich, Germany
- Ulrich Gerland
- FAS Center for Systems Biology, Department of Physics, Harvard University, Cambridge, MA 02138, USA and Department of Physics, Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Ludwig-Maximilians Universität München, Munich, Germany
- Irene A. Chen
- FAS Center for Systems Biology, Department of Physics, Harvard University, Cambridge, MA 02138, USA and Department of Physics, Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Ludwig-Maximilians Universität München, Munich, Germany
11
Labean TH, Butt TR, Kauffman SA, Schultes EA. Protein folding absent selection. Genes (Basel) 2011; 2:608-26. [PMID: 24710212 PMCID: PMC3927614 DOI: 10.3390/genes2030608] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Received: 07/10/2011] [Revised: 08/05/2011] [Accepted: 08/11/2011] [Indexed: 11/16/2022]
Abstract
Biological proteins are known to fold into specific 3D conformations. However, the fundamental question has remained: Do they fold because they are biological, and evolution has selected sequences which fold? Or is folding a common trait, widespread throughout sequence space? To address this question arbitrary, unevolved, random-sequence proteins were examined for structural features found in folded, biological proteins. Libraries of long (71 residue), random-sequence polypeptides, with ensemble amino acid composition near the mean for natural globular proteins, were expressed as cleavable fusions with ubiquitin. The structural properties of both the purified pools and individual isolates were then probed using circular dichroism, fluorescence emission, and fluorescence quenching techniques. Despite this necessarily sparse "sampling" of sequence space, structural properties that define globular biological proteins, namely collapsed conformations, secondary structure, and cooperative unfolding, were found to be prevalent among unevolved sequences. Thus, for polypeptides the size of small proteins, natural selection is not necessary to account for the compact and cooperative folded states observed in nature.
Affiliation(s)
- Thomas H Labean
- Sequenomics LLC, 1428 Chanterelle Lane, Hillsborough, NC 27278, USA.
- Tauseef R Butt
- LifeSensors Inc., 271 Great Valley Parkway, Suite 100, Malvern, PA 19355, USA.
- Stuart A Kauffman
- Complex Systems Center University of Vermont, 200C Farrell Hall, 210 Colchester Ave., Burlington, VT 05405, USA.
- Erik A Schultes
- Sequenomics LLC, 1428 Chanterelle Lane, Hillsborough, NC 27278, USA.
12
Laing C, Schlick T. Computational approaches to RNA structure prediction, analysis, and design. Curr Opin Struct Biol 2011; 21:306-18. [PMID: 21514143 PMCID: PMC3112238 DOI: 10.1016/j.sbi.2011.03.015] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.2] [Received: 02/07/2011] [Revised: 03/24/2011] [Accepted: 03/29/2011] [Indexed: 12/19/2022]
Abstract
RNA molecules are important cellular components involved in many fundamental biological processes. Understanding the mechanisms behind their functions requires RNA tertiary structure knowledge. Although modeling approaches for the study of RNA structures and dynamics lag behind efforts in protein folding, much progress has been achieved in the past two years. Here, we review recent advances in RNA folding algorithms, RNA tertiary motif discovery, applications of graph theory approaches to RNA structure and function, and in silico generation of RNA sequence pools for aptamer design. Advances within each area can be combined to impact many problems in RNA structure and function.
Affiliation(s)
- Christian Laing
- Department of Chemistry, Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
- Tamar Schlick
- Department of Chemistry, Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
13
Stich M, Manrubia SC. Motif frequency and evolutionary search times in RNA populations. J Theor Biol 2011; 280:117-26. [PMID: 21419782 DOI: 10.1016/j.jtbi.2011.03.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Received: 08/11/2010] [Revised: 01/26/2011] [Accepted: 03/10/2011] [Indexed: 02/07/2023]
Abstract
RNA molecules, through their dual identity as sequence and structure, are an appropriate experimental and theoretical model to study the genotype-phenotype map and evolutionary processes taking place in simple replicator populations. In this computational study, we relate properties of the sequence-structure map, in particular the abundance of a given secondary structure in a random pool, with the number of replicative events that an initially random population of sequences needs to find that structure through mutation and selection. For common structures, this search process turns out to be much faster than for rare structures. Furthermore, search and fixation processes are more efficient in a wider range of mutation rates for common structures, thus indicating that evolvability of RNA populations is not simply determined by abundance. We also find significant differences in the search and fixation processes for structures of same abundance, and relate them with the number of base pairs forming the structure. Moreover, the influence of the nucleotide content of the RNA sequences on the search process is studied. Our results advance in the understanding of the distribution and attainability of RNA secondary structures. They hint at the fact that, beyond sequence length and sequence-to-function redundancy, the mutation rate that permits localization and fixation of a given phenotype strongly depends on its relative abundance and global, in general non-uniform, distribution in sequence space.
Affiliation(s)
- Michael Stich
- Centro de Astrobiología (CSIC-INTA), Ctra de Ajalvir km 4, 28850 Torrejón de Ardoz, Madrid, Spain.
14
Affiliation(s)
- Christien Kluwe
- Department of Chemistry and Biochemistry, University of Texas at Austin, Austin, TX 78712 USA
- Andrew D. Ellington
- Department of Chemistry and Biochemistry, University of Texas at Austin, Austin, TX 78712 USA
15
Majerfeld I, Chocholousova J, Malaiya V, Widmann J, McDonald D, Reeder J, Iyer M, Illangasekare M, Yarus M, Knight R. Nucleotides that are essential but not conserved; a sufficient L-tryptophan site in RNA. RNA (New York, N.Y.) 2010; 16:1915-24. [PMID: 20699302 PMCID: PMC2941100 DOI: 10.1261/rna.2220210] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Received: 04/13/2010] [Accepted: 06/22/2010] [Indexed: 05/20/2023]
Abstract
Conservation is often used to define essential sequences within RNA sites. However, conservation finds only invariant sequence elements that are necessary for function, rather than finding a set of sequence elements sufficient for function. Biochemical studies in several systems-including the hammerhead ribozyme and the purine riboswitch-find additional elements, such as loop-loop interactions, required for function yet not phylogenetically conserved. Here we define a critical test of sufficiency: We embed a minimal, apparently sufficient motif for binding the amino acid tryptophan in a random-sequence background and ask whether we obtain functional molecules. After a negative result, we use a combination of three-dimensional structural modeling, selection, designed mutations, high-throughput sequencing, and bioinformatics to explore functional insufficiency. This reveals an essential unpaired G in a diverse structural context, varied sequence, and flexible distance from the invariant internal loop binding site identified previously. Addition of the new element yields a sufficient binding site by the insertion criterion, binding tryptophan in 22 out of 23 tries. Random insertion testing for site sufficiency seems likely to be broadly revealing.
Affiliation(s)
- Irene Majerfeld
- Department of Molecular, Cellular and Developmental Biology, University of Colorado at Boulder, Boulder, Colorado 80309, USA