1
|
Zuber J, Mathews DH. Estimating RNA Secondary Structure Folding Free Energy Changes with efn2. Methods Mol Biol 2024; 2726:1-13. [PMID: 38780725 DOI: 10.1007/978-1-0716-3519-3_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
A number of analyses require estimates of the folding free energy changes of specific RNA secondary structures. These predictions are often based on a set of nearest neighbor parameters that models the folding stability of a RNA secondary structure as the sum of folding stabilities of the structural elements that comprise the secondary structure. In the software suite RNAstructure, the free energy change calculation is implemented in the program efn2. The efn2 program estimates the folding free energy change and the experimental uncertainty in the folding free energy change. It can be run through the graphical user interface for RNAstructure, from the command line, or a web server. This chapter provides detailed protocols for using efn2.
Collapse
Affiliation(s)
- Jeffrey Zuber
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA.
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY, USA.
| |
Collapse
|
2
|
Ward M, Sun H, Datta A, Wise M, Mathews DH. Determining parameters for non-linear models of multi-loop free energy change. Bioinformatics 2020; 35:4298-4306. [PMID: 30923811 DOI: 10.1093/bioinformatics/btz222] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 02/10/2019] [Accepted: 03/27/2019] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION Predicting the secondary structure of RNA is a fundamental task in bioinformatics. Algorithms that predict secondary structure given only the primary sequence, and a model to evaluate the quality of a structure, are an integral part of this. These algorithms have been updated as our model of RNA thermodynamics changed and expanded. An exception to this has been the treatment of multi-loops. Although more advanced models of multi-loop free energy change have been suggested, a simple, linear model has been used since the 1980s. However, recently, new dynamic programing algorithms for secondary structure prediction that could incorporate these models were presented. Unfortunately, these models appear to have lower accuracy for secondary structure prediction. RESULTS We apply linear regression and a new parameter optimization algorithm to find better parameters for the existing linear model and advanced non-linear multi-loop models. These include the Jacobson-Stockmayer and Aalberts & Nandagopal models. We find that the current linear model parameters may be near optimal for the linear model, and that no advanced model performs better than the existing linear model parameters even after parameter optimization. AVAILABILITY AND IMPLEMENTATION Source code and data is available at https://github.com/maxhwardg/advanced_multiloops. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Max Ward
- Computer Science & Software Engineering, The University of Western Australia, Crawley, WA, Australia
| | - Hongying Sun
- Department of Biochemistry & Biophysics, University of Rochester, Rochester, NY, USA.,Center for RNA Biology, University of Rochester, Rochester, NY, USA
| | - Amitava Datta
- Computer Science & Software Engineering, The University of Western Australia, Crawley, WA, Australia
| | - Michael Wise
- Computer Science & Software Engineering, The University of Western Australia, Crawley, WA, Australia.,The Marshall Centre for Infectious Diseases Research and Training, The University of Western Australia, Crawley, WA, Australia
| | - David H Mathews
- Department of Biostatistics & Computational Biology, University of Rochester, Rochester, NY, USA
| |
Collapse
|
3
|
Ermolenko DN, Mathews DH. Making ends meet: New functions of mRNA secondary structure. WILEY INTERDISCIPLINARY REVIEWS-RNA 2020; 12:e1611. [PMID: 32597020 DOI: 10.1002/wrna.1611] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 05/19/2020] [Accepted: 05/19/2020] [Indexed: 11/10/2022]
Abstract
The 5' cap and 3' poly(A) tail of mRNA are known to synergistically regulate mRNA translation and stability. Recent computational and experimental studies revealed that both protein-coding and non-coding RNAs will fold with extensive intramolecular secondary structure, which will result in close distances between the sequence ends. This proximity of the ends is a sequence-independent, universal property of most RNAs. Only low-complexity sequences without guanosines are without secondary structure and exhibit end-to-end distances expected for RNA random coils. The innate proximity of RNA ends might have important biological implications that remain unexplored. In particular, the inherent compactness of mRNA might regulate translation initiation by facilitating the formation of protein complexes that bridge mRNA 5' and 3' ends. Additionally, the proximity of mRNA ends might mediate coupling of 3' deadenylation to 5' end mRNA decay. This article is categorized under: RNA Structure and Dynamics > RNA Structure, Dynamics, and Chemistry RNA Structure and Dynamics > Influence of RNA Structure in Biological Systems Translation > Translation Regulation.
Collapse
Affiliation(s)
- Dmitri N Ermolenko
- Department of Biochemistry & Biophysics and Center for RNA Biology, School of Medicine and Dentistry, University of Rochester, Rochester, New York, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, School of Medicine and Dentistry, University of Rochester, Rochester, New York, USA
| |
Collapse
|
4
|
Kimchi O, Cragnolini T, Brenner MP, Colwell LJ. A Polymer Physics Framework for the Entropy of Arbitrary Pseudoknots. Biophys J 2019; 117:520-532. [PMID: 31353036 PMCID: PMC6697467 DOI: 10.1016/j.bpj.2019.06.037] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 06/21/2019] [Accepted: 06/27/2019] [Indexed: 11/18/2022] Open
Abstract
The accurate prediction of RNA secondary structure from primary sequence has had enormous impact on research from the past 40 years. Although many algorithms are available to make these predictions, the inclusion of non-nested loops, termed pseudoknots, still poses challenges arising from two main factors: 1) no physical model exists to estimate the loop entropies of complex intramolecular pseudoknots, and 2) their NP-complete enumeration has impeded their study. Here, we address both challenges. First, we develop a polymer physics model that can address arbitrarily complex pseudoknots using only two parameters corresponding to concrete physical quantities-over an order of magnitude fewer than the sparsest state-of-the-art phenomenological methods. Second, by coupling this model to exhaustive enumeration of the set of possible structures, we compute the entire free energy landscape of secondary structures resulting from a primary RNA sequence. We demonstrate that for RNA structures of ∼80 nucleotides, with minimal heuristics, the complete enumeration of possible secondary structures can be accomplished quickly despite the NP-complete nature of the problem. We further show that despite our loop entropy model's parametric sparsity, it performs better than or on par with previously published methods in predicting both pseudoknotted and non-pseudoknotted structures on a benchmark data set of RNA structures of ≤80 nucleotides. We suggest ways in which the accuracy of the model can be further improved.
Collapse
Affiliation(s)
- Ofer Kimchi
- Harvard Graduate Program in Biophysics, Harvard University, Cambridge, Massachusetts.
| | - Tristan Cragnolini
- Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Michael P Brenner
- School of Engineering and Applied Sciences, Cambridge, Massachusetts; Kavli Institute for Bionano Science and Technology, Harvard University, Cambridge, Massachusetts
| | - Lucy J Colwell
- Department of Chemistry, University of Cambridge, Cambridge, United Kingdom.
| |
Collapse
|
5
|
Mak CH, Phan ENH. Topological Constraints and Their Conformational Entropic Penalties on RNA Folds. Biophys J 2019; 114:2059-2071. [PMID: 29742400 PMCID: PMC5961522 DOI: 10.1016/j.bpj.2018.03.035] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Revised: 03/08/2018] [Accepted: 03/22/2018] [Indexed: 02/01/2023] Open
Abstract
Functional RNAs can fold into intricate structures using a number of different secondary and tertiary structural motifs. Many factors contribute to the overall free energy of the target fold. This study aims at quantifying the entropic costs coming from the loss of conformational freedom when the sugar-phosphate backbone is subjected to constraints imposed by secondary and tertiary contacts. Motivated by insights from topology theory, we design a diagrammatic scheme to represent different types of RNA structures so that constraints associated with a folded structure may be segregated into mutually independent subsets, enabling the total conformational entropy loss to be easily calculated as a sum of independent terms. We used high-throughput Monte Carlo simulations to simulate large ensembles of single-stranded RNA sequences in solution to validate the assumptions behind our diagrammatic scheme, examining the entropic costs for hairpin initiation and formation of many multiway junctions. Our diagrammatic scheme aids in the factorization of secondary/tertiary constraints into distinct topological classes and facilitates the discovery of interrelationships among multiple constraints on RNA folds. This perspective, which to our knowledge is novel, leads to useful insights into the inner workings of some functional RNA sequences, demonstrating how they might operate by transforming their structures among different topological classes.
Collapse
Affiliation(s)
- Chi H Mak
- Department of Chemistry, University of Southern California, Los Angeles, California; Center of Applied Mathematical Sciences, University of Southern California, Los Angeles, California; Department of Biological Sciences, University of Southern California, Los Angeles, California.
| | - Ethan N H Phan
- Department of Chemistry, University of Southern California, Los Angeles, California
| |
Collapse
|
6
|
Zuber J, Mathews DH. Estimating uncertainty in predicted folding free energy changes of RNA secondary structures. RNA (NEW YORK, N.Y.) 2019; 25:747-754. [PMID: 30952689 PMCID: PMC6521603 DOI: 10.1261/rna.069203.118] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Accepted: 04/02/2019] [Indexed: 06/09/2023]
Abstract
Nearest neighbor parameters for estimating the folding stability of RNA are commonly used in secondary structure prediction, for generating folding ensembles of structures, and for analyzing RNA function. Previously, we demonstrated that we could quantify the uncertainties in each nearest neighbor parameter by perturbing the underlying optical melting data within experimental error and rederiving the parameters, which accounts for the substantial correlations that exist between the parameters. In this contribution, we describe a method to estimate uncertainty in the estimated folding stabilities of RNA structures, accounting for correlations in the nearest neighbor parameters. This method is incorporated in the RNA structure software package.
Collapse
Affiliation(s)
- Jeffrey Zuber
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York, 14642, USA
| | - David H Mathews
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York, 14642, USA
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, New York, 14642, USA
| |
Collapse
|
7
|
Jiang G, Chen K, Sun J. Accurate prediction of secondary structure of tRNAs. Biochem Biophys Res Commun 2019; 509:64-68. [DOI: 10.1016/j.bbrc.2018.12.042] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 12/05/2018] [Indexed: 11/28/2022]
|
8
|
mRNAs and lncRNAs intrinsically form secondary structures with short end-to-end distances. Nat Commun 2018; 9:4328. [PMID: 30337527 PMCID: PMC6193969 DOI: 10.1038/s41467-018-06792-z] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 09/20/2018] [Indexed: 12/17/2022] Open
Abstract
The 5' and 3' termini of RNA play important roles in many cellular processes. Using Förster resonance energy transfer (FRET), we show that mRNAs and lncRNAs have an intrinsic propensity to fold in the absence of proteins into structures in which the 5' end and 3' end are ≤7 nm apart irrespective of mRNA length. Computational estimates suggest that the inherent proximity of the ends is a universal property of most mRNA and lncRNA sequences. Only guanosine-depleted RNA sequences with low sequence complexity are unstructured and exhibit end-to-end distances expected for the random coil conformation of RNA. While the biological implications remain to be explored, short end-to-end distances could facilitate the binding of protein factors that regulate translation initiation by bridging mRNA 5' and 3' ends. Furthermore, our studies provide the basis for measuring, computing and manipulating end-to-end distances and secondary structure in RNA in research and biotechnology.
Collapse
|
9
|
Ward M, Datta A, Wise M, Mathews DH. Advanced multi-loop algorithms for RNA secondary structure prediction reveal that the simplest model is best. Nucleic Acids Res 2017; 45:8541-8550. [PMID: 28586479 PMCID: PMC5737859 DOI: 10.1093/nar/gkx512] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Accepted: 05/31/2017] [Indexed: 01/08/2023] Open
Abstract
Algorithmic prediction of RNA secondary structure has been an area of active inquiry since the 1970s. Despite many innovations since then, our best techniques are not yet perfect. The workhorses of the RNA secondary structure prediction engine are recursions first described by Zuker and Stiegler in 1981. These have well understood caveats; a notable flaw is the ad-hoc treatment of multi-loops, also called helical-junctions, that persists today. While several advanced models for multi-loops have been proposed, it seems to have been assumed that incorporating them into the recursions would lead to intractability, and so no algorithms for these models exist. Some of these models include the classical model based on Jacobson–Stockmayer polymer theory, and another by Aalberts and Nadagopal that incorporates two-length-scale polymer physics. We have realized practical, tractable algorithms for each of these models. However, after implementing these algorithms, we found that no advanced model was better than the original, ad-hoc model used for multi-loops. While this is unexpected, it supports the praxis of the current model.
Collapse
Affiliation(s)
- Max Ward
- Computer Science & Software Engineering, The University of Western Australia, Australia
| | - Amitava Datta
- Computer Science & Software Engineering, The University of Western Australia, Australia
| | - Michael Wise
- Computer Science & Software Engineering, The University of Western Australia, Australia.,The Marshall Centre for Infectious Diseases Research and Training, The University of Western Australia, Australia
| | - David H Mathews
- Department of Biochemistry & Biophysics, Department of Biostatistics & Computational Biology, and Center for RNA Biology, University of Rochester, NY, USA
| |
Collapse
|
10
|
Hua L, Song Y, Kim N, Laing C, Wang JTL, Schlick T. CHSalign: A Web Server That Builds upon Junction-Explorer and RNAJAG for Pairwise Alignment of RNA Secondary Structures with Coaxial Helical Stacking. PLoS One 2016; 11:e0147097. [PMID: 26789998 PMCID: PMC4720362 DOI: 10.1371/journal.pone.0147097] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Accepted: 12/29/2015] [Indexed: 01/01/2023] Open
Abstract
RNA junctions are important structural elements of RNA molecules. They are formed when three or more helices come together in three-dimensional space. Recent studies have focused on the annotation and prediction of coaxial helical stacking (CHS) motifs within junctions. Here we exploit such predictions to develop an efficient alignment tool to handle RNA secondary structures with CHS motifs. Specifically, we build upon our Junction-Explorer software for predicting coaxial stacking and RNAJAG for modelling junction topologies as tree graphs to incorporate constrained tree matching and dynamic programming algorithms into a new method, called CHSalign, for aligning the secondary structures of RNA molecules containing CHS motifs. Thus, CHSalign is intended to be an efficient alignment tool for RNAs containing similar junctions. Experimental results based on thousands of alignments demonstrate that CHSalign can align two RNA secondary structures containing CHS motifs more accurately than other RNA secondary structure alignment tools. CHSalign yields a high score when aligning two RNA secondary structures with similar CHS motifs or helical arrangement patterns, and a low score otherwise. This new method has been implemented in a web server, and the program is also made freely available, at http://bioinformatics.njit.edu/CHSalign/.
Collapse
Affiliation(s)
- Lei Hua
- Bioinformatics Laboratory, Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, United States of America
| | - Yang Song
- Bioinformatics Laboratory, Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, United States of America
| | - Namhee Kim
- Department of Chemistry, New York University, New York, New York, United States of America
| | - Christian Laing
- Bioinformatics Laboratory, Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, United States of America
| | - Jason T. L. Wang
- Bioinformatics Laboratory, Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, United States of America
- * E-mail: (JW); (TS)
| | - Tamar Schlick
- Department of Chemistry, New York University, New York, New York, United States of America
- Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
- * E-mail: (JW); (TS)
| |
Collapse
|
11
|
Réblová M, Jaklitsch WM, Réblová K, Štěpánek V. Phylogenetic Reconstruction of the Calosphaeriales and Togniniales Using Five Genes and Predicted RNA Secondary Structures of ITS, and Flabellascus tenuirostris gen. et sp. nov. PLoS One 2015; 10:e0144616. [PMID: 26699541 PMCID: PMC4689446 DOI: 10.1371/journal.pone.0144616] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Accepted: 11/20/2015] [Indexed: 11/18/2022] Open
Abstract
The Calosphaeriales is revisited with new collection data, living cultures, morphological studies of ascoma centrum, secondary structures of the internal transcribed spacer (ITS) rDNA and phylogeny based on novel DNA sequences of five nuclear ribosomal and protein-coding loci. Morphological features, molecular evidence and information from predicted RNA secondary structures of ITS converged upon robust phylogenies of the Calosphaeriales and Togniniales. The current concept of the Calosphaeriales includes the Calosphaeriaceae and Pleurostomataceae encompassing five monophyletic genera, Calosphaeria, Flabellascus gen. nov., Jattaea, Pleurostoma and Togniniella, strongly supported by Bayesian and Maximum Likelihood methods. The structural elements of ITS1 form characteristic patterns that are phylogenetically conserved, corroborate observations based on morphology and have a high predictive value at the generic level. Three major clades containing 44 species of Phaeoacremonium were recovered in the closely related Togniniales based on ITS, actin and β-tubulin sequences. They are newly characterized by sexual and RNA structural characters and ecology. This approach is a first step towards understanding of the molecular systematics of Phaeoacremonium and possibly its new classification. In the Calosphaeriales, Jattaea aphanospora sp. nov. and J. ribicola sp. nov. are introduced, Calosphaeria taediosa is combined in Jattaea and epitypified. The sexual morph of Phaeoacremonium cinereum was encountered for the first time on decaying wood and obtained in vitro. In order to achieve a single nomenclature, the genera of asexual morphs linked with the Calosphaeriales are transferred to synonymy of their sexual morphs following the principle of priority, i.e. Calosphaeriophora to Calosphaeria, Phaeocrella to Togniniella and Pleurostomophora to Pleurostoma. Three new combinations are proposed, i.e. Pleurostoma ochraceum comb. nov., P. repens comb. nov. and P. richardsiae comb. nov. The morphology-based key is provided to facilitate identification of genera accepted in the Calosphaeriales.
Collapse
Affiliation(s)
- Martina Réblová
- Department of Taxonomy, Institute of Botany of the Academy of Sciences of the Czech Republic, Průhonice, Czech Republic
- * E-mail:
| | - Walter M. Jaklitsch
- Department of Forest and Soil Sciences, Forest Pathology and Forest Protection, Institute of Forest Entomology, BOKU-University of Natural Resources and Life Sciences, Vienna, Austria
- Department of Botany and Biodiversity Research, Division of Systematic and Evolutionary Botany, University of Vienna, Vienna, Austria
| | - Kamila Réblová
- Faculty of Medicine, Masaryk University, Brno, Czech Republic
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Václav Štěpánek
- Laboratory of Enzyme Technology, Institute of Microbiology of the Academy of Sciences of the Czech Republic, Prague, Czech Republic
| |
Collapse
|
12
|
Chen JL, Bellaousov S, Tubbs JD, Kennedy SD, Lopez MJ, Mathews DH, Turner DH. Nuclear Magnetic Resonance-Assisted Prediction of Secondary Structure for RNA: Incorporation of Direction-Dependent Chemical Shift Constraints. Biochemistry 2015; 54:6769-82. [PMID: 26451676 PMCID: PMC4666457 DOI: 10.1021/acs.biochem.5b00833] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Knowledge
of RNA
structure is necessary to determine structure–function relationships
and to facilitate design of potential therapeutics.
RNA secondary structure prediction can be improved by applying constraints
from nuclear magnetic resonance (NMR) experiments to a dynamic programming
algorithm. Imino proton walks from NOESY spectra reveal double-stranded
regions. Chemical shifts of protons in GH1, UH3, and UH5 of GU pairs,
UH3, UH5, and AH2 of AU pairs, and GH1 of GC pairs were analyzed to
identify constraints for the 5′ to 3′ directionality
of base pairs in helices. The 5′ to 3′ directionality
constraints were incorporated into an NMR-assisted prediction of secondary
structure (NAPSS-CS) program. When it was tested on 18 structures,
including nine pseudoknots, the sensitivity and positive predictive
value were improved relative to those of three unrestrained programs.
The prediction accuracy for the pseudoknots improved the most. The
program also facilitates assignment of chemical shifts to individual
nucleotides, a necessary step for determining three-dimensional structure.
Collapse
Affiliation(s)
- Jonathan L Chen
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States
| | - Stanislav Bellaousov
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry , Rochester, New York 14642, United States
| | - Jason D Tubbs
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States
| | - Scott D Kennedy
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry , Rochester, New York 14642, United States
| | - Michael J Lopez
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States
| | - David H Mathews
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry , Rochester, New York 14642, United States.,Center for RNA Biology, University of Rochester , Rochester, New York 14642, United States
| | - Douglas H Turner
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States.,Center for RNA Biology, University of Rochester , Rochester, New York 14642, United States
| |
Collapse
|
13
|
Abstract
Despite the success of RNA secondary structure prediction for simple, short RNAs, the problem of predicting RNAs with long-range tertiary folds remains. Furthermore, RNA 3D structure prediction is hampered by the lack of the knowledge about the tertiary contacts and their thermodynamic parameters. Low-resolution structural modeling enables us to estimate the conformational entropies for a number of tertiary folds through rigorous statistical mechanical calculations. The models lead to 3D tertiary folds at coarse-grained level. The coarse-grained structures serve as the initial structures for all-atom molecular dynamics refinement to build the final all-atom 3D structures. In this paper, we present an overview of RNA computational models for secondary and tertiary structures’ predictions and then focus on a recently developed RNA statistical mechanical model—the Vfold model. The main emphasis is placed on the physics behind the models, including the treatment of the non-canonical interactions in secondary and tertiary structure modelings, and the correlations to RNA functions.
Collapse
Affiliation(s)
- Xiaojun Xu
- />Department of Physics, University of Missouri, Columbia, MO 65211 USA
- />Department of Biochemistry, University of Missouri, Columbia, MO 65211 USA
- />Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| | - Shi-Jie Chen
- />Department of Physics, University of Missouri, Columbia, MO 65211 USA
- />Department of Biochemistry, University of Missouri, Columbia, MO 65211 USA
- />Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| |
Collapse
|
14
|
Structural analysis of a class III preQ1 riboswitch reveals an aptamer distant from a ribosome-binding site regulated by fast dynamics. Proc Natl Acad Sci U S A 2015; 112:E3485-94. [PMID: 26106162 DOI: 10.1073/pnas.1503955112] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
PreQ1-III riboswitches are newly identified RNA elements that control bacterial genes in response to preQ1 (7-aminomethyl-7-deazaguanine), a precursor to the essential hypermodified tRNA base queuosine. Although numerous riboswitches fold as H-type or HLout-type pseudoknots that integrate ligand-binding and regulatory sequences within a single folded domain, the preQ1-III riboswitch aptamer forms a HLout-type pseudoknot that does not appear to incorporate its ribosome-binding site (RBS). To understand how this unusual organization confers function, we determined the crystal structure of the class III preQ1 riboswitch from Faecalibacterium prausnitzii at 2.75 Å resolution. PreQ1 binds tightly (KD,app 6.5 ± 0.5 nM) between helices P1 and P2 of a three-way helical junction wherein the third helix, P4, projects orthogonally from the ligand-binding pocket, exposing its stem-loop to base pair with the 3' RBS. Biochemical analysis, computational modeling, and single-molecule FRET imaging demonstrated that preQ1 enhances P4 reorientation toward P1-P2, promoting a partially nested, H-type pseudoknot in which the RBS undergoes rapid docking (kdock ∼ 0.6 s(-1)) and undocking (kundock ∼ 1.1 s(-1)). Discovery of such dynamic conformational switching provides insight into how a riboswitch with bipartite architecture uses dynamics to modulate expression platform accessibility, thus expanding the known repertoire of gene control strategies used by regulatory RNAs.
Collapse
|
15
|
Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots. Proc Natl Acad Sci U S A 2013; 110:5498-503. [PMID: 23503844 DOI: 10.1073/pnas.1219988110] [Citation(s) in RCA: 253] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
A pseudoknot forms in an RNA when nucleotides in a loop pair with a region outside the helices that close the loop. Pseudoknots occur relatively rarely in RNA but are highly overrepresented in functionally critical motifs in large catalytic RNAs, in riboswitches, and in regulatory elements of viruses. Pseudoknots are usually excluded from RNA structure prediction algorithms. When included, these pairings are difficult to model accurately, especially in large RNAs, because allowing this structure dramatically increases the number of possible incorrect folds and because it is difficult to search the fold space for an optimal structure. We have developed a concise secondary structure modeling approach that combines SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) experimental chemical probing information and a simple, but robust, energy model for the entropic cost of single pseudoknot formation. Structures are predicted with iterative refinement, using a dynamic programming algorithm. This melded experimental and thermodynamic energy function predicted the secondary structures and the pseudoknots for a set of 21 challenging RNAs of known structure ranging in size from 34 to 530 nt. On average, 93% of known base pairs were predicted, and all pseudoknots in well-folded RNAs were identified.
Collapse
|
16
|
Meng Y, Aalberts D. Free energy cost of stretching mRNA hairpin loops inhibits small RNA binding. Biophys J 2013; 104:482-7. [PMID: 23442870 PMCID: PMC3552280 DOI: 10.1016/j.bpj.2012.12.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2012] [Revised: 11/22/2012] [Accepted: 12/05/2012] [Indexed: 11/18/2022] Open
Abstract
Small RNA-mRNA binding is an essential step in RNA interference, an important cellular regulatory process. Calculations of binding free energy have been used in binding site prediction, but the cost of stretching the mRNA loop when the small RNA-mRNA duplex forms requires further exploration. Here, using both polymer physics theory and simulations, we estimate the free energy of a stretched mRNA loop. We find loop stretching significantly increases the free energy of 3' supplementary/compensatory miRNA binding and siRNA binding to mRNA hairpin loops. We also make the observation that sites where 3' supplementary binding is available may bind at the seed only, and that loop stretching often favors seed-only binding over seed plus 3' supplementary binding in mRNA hairpins.
Collapse
|
17
|
Abstract
The free energy of an RNA fold is a combination of favorable base pairing and stacking interactions competing with entropic costs of forming loops. Here we show how loop entropy, surprisingly, can promote tertiary order. A general formula for the free energy of forming multibranch and other RNA loops is derived with a polymer-physics based theory. We also derive a formula for the free energy of coaxial stacking in the context of a loop. Simulations support the analytic formulas. The effects of stacking of unpaired bases are also studied with simulations.
Collapse
|
18
|
Laing C, Wen D, Wang JTL, Schlick T. Predicting coaxial helical stacking in RNA junctions. Nucleic Acids Res 2011; 40:487-98. [PMID: 21917853 PMCID: PMC3258123 DOI: 10.1093/nar/gkr629] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
RNA junctions are important structural elements that form when three or more helices come together in space in the tertiary structures of RNA molecules. Determining their structural configuration is important for predicting RNA 3D structure. We introduce a computational method to predict, at the secondary structure level, the coaxial helical stacking arrangement in junctions, as well as classify the junction topology. Our approach uses a data mining approach known as random forests, which relies on a set of decision trees trained using length, sequence and other variables specified for any given junction. The resulting protocol predicts coaxial stacking within three- and four-way junctions with an accuracy of 81% and 77%, respectively; the accuracy increases to 83% and 87%, respectively, when knowledge from the junction family type is included. Coaxial stacking predictions for the five to ten-way junctions are less accurate (60%) due to sparse data available for training. Additionally, our application predicts the junction family with an accuracy of 85% for three-way junctions and 74% for four-way junctions. Comparisons with other methods, as well applications to unsolved RNAs, are also presented. The web server Junction-Explorer to predict junction topologies is freely available at: http://bioinformatics.njit.edu/junction.
Collapse
Affiliation(s)
- Christian Laing
- Department of Chemistry, Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| | | | | | | |
Collapse
|
19
|
Liu B, Diamond JM, Mathews DH, Turner DH. Fluorescence competition and optical melting measurements of RNA three-way multibranch loops provide a revised model for thermodynamic parameters. Biochemistry 2011; 50:640-53. [PMID: 21133351 PMCID: PMC3032278 DOI: 10.1021/bi101470n] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
![]()
Three-way multibranch loops (junctions) are common in RNA secondary structures. Computer algorithms such as RNAstructure and MFOLD do not consider the identity of unpaired nucleotides in multibranch loops when predicting secondary structure. There is limited experimental data, however, to parametrize this aspect of these algorithms. In this study, UV optical melting and a fluorescence competition assay are used to measure stabilities of multibranch loops containing up to five unpaired adenosines or uridines or a loop E motif. These results provide a test of our understanding of the factors affecting multibranch loop stability and provide revised parameters for predicting stability. The results should help to improve predictions of RNA secondary structure.
Collapse
Affiliation(s)
- Biao Liu
- Department of Chemistry, University of Rochester, Rochester, New York 14627, United States
| | | | | | | |
Collapse
|