1
|
Courtney E, Datta A, Mathews DH, Ward M. memerna: Sparse RNA folding including coaxial stacking. J Mol Biol 2025; 437:168819. [PMID: 39427984 DOI: 10.1016/j.jmb.2024.168819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 09/16/2024] [Accepted: 10/08/2024] [Indexed: 10/22/2024]
Abstract
Determining RNA secondary structure is a core problem in computational biology. Fast algorithms for predicting secondary structure are fundamental to this task. We describe a modified formulation of the Zuker-Stiegler algorithm with coaxial stacking, a stabilising interaction in which the ends of helices in multi-loops are stacked. In particular, optimal coaxial stacking is computed as part of the dynamic programming state, rather than in an inner loop. We introduce a new notion of sparsity, which we call replaceability. Replaceability is a more general condition and applicable in more places than the triangle inequality that is used by previous sparse folding methods. We also introduce non-monotonic candidate lists as an additional sparsification tool. Existing usages of the triangle inequality for sparsification can be thought of as an application of both replaceability and monotonicity together. The modified recurrences along with replaceability allows sparsification to be applied to coaxial stacking as well, which increases the speed of the algorithm. We implemented this algorithm in software we call memerna, which we show to have the fastest exact (non-heuristic) implementation of RNA folding under the complete Turner 2004 model with coaxial stacking, out of several popular RNA folding tools supporting coaxial stacking. We also introduce a new notation for secondary structure which includes coaxial stacking, terminal mismatches, and dangles (CTDs) information. The memerna package 0.1 release is available at https://github.com/Edgeworth/memerna/tree/release/0.1.
Collapse
Affiliation(s)
- Eliot Courtney
- Department of Computer Science & Software Engineering, The University of Western Australia, Western Australia, Australia
| | - Amitava Datta
- Department of Computer Science & Software Engineering, The University of Western Australia, Western Australia, Australia
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY, USA; Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA; Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY, USA
| | - Max Ward
- Department of Computer Science & Software Engineering, The University of Western Australia, Western Australia, Australia.
| |
Collapse
|
2
|
Todisco M, Radakovic A, Szostak JW. RNA Complexes with Nicks and Gaps: Thermodynamic and Kinetic Effects of Coaxial Stacking and Dangling Ends. J Am Chem Soc 2024; 146:18083-18094. [PMID: 38904115 PMCID: PMC11229006 DOI: 10.1021/jacs.4c05115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Revised: 06/13/2024] [Accepted: 06/14/2024] [Indexed: 06/22/2024]
Abstract
Multiple RNA strands can interact in solution and assume a large variety of configurations dictated by their potential for base pairing. Although duplex formation from two complementary oligonucleotides has been studied in detail, we still lack a systematic characterization of the behavior of higher order complexes. Here, we focus on the thermodynamic and kinetic effects of an upstream oligonucleotide on the binding of a downstream oligonucleotide to a common template, as we vary the sequence and structure of the contact interface. We show that coaxial stacking in RNA is well correlated with but much more stabilizing than helix propagation over an analogous intact double helix step (median ΔΔG°37 °C ≈ 1.7 kcal/mol). Consequently, approximating coaxial stacking in RNA with the helix propagation term leads to large discrepancies between predictions and our experimentally determined melting temperatures, with an offset of ≈10 °C. Our kinetic study reveals that the hybridization of the downstream probe oligonucleotide is impaired (lower kon) by the presence of the upstream oligonucleotide, with the thermodynamic stabilization coming entirely from an extended lifetime (lower koff) of the bound downstream oligonucleotide, which can increase from seconds to months. Surprisingly, we show that the effect of nicks is dependent on the length of the stacking oligonucleotides, and we discuss the binding of ultrashort (1-4 nt) oligonucleotides that are relevant in the context of the origin of life. The thermodynamic and kinetic data obtained in this work allow for the prediction of the formation and stability of higher-order multistranded complexes.
Collapse
Affiliation(s)
- Marco Todisco
- Howard Hughes Medical Institute, Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Aleksandar Radakovic
- Howard Hughes Medical Institute, Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
- Harvard Medical School,Department of Genetics, Boston, Massachusetts 02115, United States
| | - Jack W Szostak
- Howard Hughes Medical Institute, Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
3
|
Chojnowski G, Zaborowski R, Magnus M, Mukherjee S, Bujnicki JM. RNA 3D structure modeling by fragment assembly with small-angle X-ray scattering restraints. Bioinformatics 2023; 39:btad527. [PMID: 37647627 PMCID: PMC10474949 DOI: 10.1093/bioinformatics/btad527] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 07/14/2023] [Accepted: 08/28/2023] [Indexed: 09/01/2023] Open
Abstract
SUMMARY Structure determination is a key step in the functional characterization of many non-coding RNA molecules. High-resolution RNA 3D structure determination efforts, however, are not keeping up with the pace of discovery of new non-coding RNA sequences. This increases the importance of computational approaches and low-resolution experimental data, such as from the small-angle X-ray scattering experiments. We present RNA Masonry, a computer program and a web service for a fully automated modeling of RNA 3D structures. It assemblies RNA fragments into geometrically plausible models that meet user-provided secondary structure constraints, restraints on tertiary contacts, and small-angle X-ray scattering data. We illustrate the method description with detailed benchmarks and its application to structural studies of viral RNAs with SAXS restraints. AVAILABILITY AND IMPLEMENTATION The program web server is available at http://iimcb.genesilico.pl/rnamasonry. The source code is available at https://gitlab.com/gchojnowski/rnamasonry.
Collapse
Affiliation(s)
- Grzegorz Chojnowski
- International Institute of Molecular and Cell Biology, Warsaw 02-109, Poland
- European Molecular Biology Laboratory, Hamburg Unit, Hamburg 22607, Germany
| | - Rafał Zaborowski
- International Institute of Molecular and Cell Biology, Warsaw 02-109, Poland
| | - Marcin Magnus
- ReMedy International Research Agenda Unit, IMol Polish Academy of Sciences, Warsaw, Poland
| | - Sunandan Mukherjee
- International Institute of Molecular and Cell Biology, Warsaw 02-109, Poland
| | - Janusz M Bujnicki
- International Institute of Molecular and Cell Biology, Warsaw 02-109, Poland
| |
Collapse
|
4
|
Abstract
RNAstructure is a user-friendly program for the prediction and analysis of RNA secondary structure. It is available as a web server, a program with a graphical user interface, or a set of command line tools. The programs are available for Microsoft Windows, macOS, or Linux. This article provides protocols for prediction of RNA secondary structure (using the web server, the graphical user interface, or the command line) and high-affinity oligonucleotide binding sites to a structured RNA target (using the graphical user interface). © 2023 Wiley Periodicals LLC. Basic Protocol 1: Predicting RNA secondary structure using the RNAstructure web server Alternate Protocol 1: Predicting secondary structure and base pair probabilities using the RNAstructure graphical user interface Alternate Protocol 2: Predicting secondary structure and base pair probabilities using the RNAstructure command line interface Basic Protocol 2: Predicting binding affinities of oligonucleotides complementary to an RNA target using OligoWalk.
Collapse
Affiliation(s)
- Sara E. Ali
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | - Abhinav Mittal
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | - David H. Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| |
Collapse
|
5
|
Monroy-Eklund A, Taylor C, Weidmann CA, Burch C, Laederach A. Structural analysis of MALAT1 long noncoding RNA in cells and in evolution. RNA (NEW YORK, N.Y.) 2023; 29:691-704. [PMID: 36792358 PMCID: PMC10159000 DOI: 10.1261/rna.079388.122] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 02/02/2023] [Indexed: 05/06/2023]
Abstract
Although not canonically polyadenylated, the long noncoding RNA MALAT1 (metastasis-associated lung adenocarcinoma transcript 1) is stabilized by a highly conserved 76-nt triple helix structure on its 3' end. The entire MALAT1 transcript is over 8000 nt long in humans. The strongest structural conservation signal in MALAT1 (as measured by covariation of base pairs) is in the triple helix structure. Primary sequence analysis of covariation alone does not reveal the degree of structural conservation of the entire full-length transcript, however. Furthermore, RNA structure is often context dependent; RNA binding proteins that are differentially expressed in different cell types may alter structure. We investigate here the in-cell and cell-free structures of the full-length human and green monkey (Chlorocebus sabaeus) MALAT1 transcripts in multiple tissue-derived cell lines using SHAPE chemical probing. Our data reveal levels of uniform structural conservation in different cell lines, in cells and cell-free, and even between species, despite significant differences in primary sequence. The uniformity of the structural conservation across the entire transcript suggests that, despite seeing covariation signals only in the triple helix junction of the lncRNA, the rest of the transcript's structure is remarkably conserved, at least in primates and across multiple cell types and conditions.
Collapse
Affiliation(s)
- Anais Monroy-Eklund
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Colin Taylor
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Chase A Weidmann
- Department of Biological Chemistry, University of Michigan Medical School, Center for RNA Biomedicine, Rogel Cancer Center, Ann Arbor, Michigan 48109, USA
| | - Christina Burch
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
6
|
Zhuo B, Ou X, Li J. Structure and Mechanical Stabilities of the Three-Way Junction Motifs in Prohead RNA. J Phys Chem B 2021; 125:12125-12134. [PMID: 34719230 DOI: 10.1021/acs.jpcb.1c04681] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The core structure of phi29 prohead RNA (pRNA) is composed of three major helices organized into three-way junction pRNA (3WJ-pRNA) and has stout structural rigidity along the coaxial helices. Prohead RNAs of the other Bacillus subtilis bacteriophages such as GA1 and SF5 share similar secondary structure and function with phi29; whether these pRNAs have similar mechanical rigidity remains to be elucidated. In this study, we constructed the tertiary structures of GA1 and SF5 3WJ-pRNAs by comparative modeling. Both GA1 and SF5 3WJ-pRNAs adopt a similar structure, in which three helices are organized as the three-way junction and two of the three helices are stacked coaxially. Moreover, detailed structural features of GA1 and SF5 3WJ-pRNAs are also similar to those of phi29 3WJ-pRNA: all of the bases of the coaxial helices are paired, and all of the adenines in the junction region are paired, which eliminates the interference of A-minor tertiary interactions. Hence, the coaxial helices tightly join to each other, and the major groove between them is very narrow. Two Mg2+ ions can thus fit into this major groove and form double Mg clamps. A steered molecular dynamics simulation was used to study the mechanical properties of these 3WJ-pRNAs. Both GA1 and SF5 3WJ-pRNAs show strong resistance to applied force in the direction of their coaxial helices. Such mechanical stability can be attributed to the Mg clamps.
Collapse
Affiliation(s)
- Boyang Zhuo
- Department of Physics, Zhejiang University, Hangzhou 310027, China
| | - Xinwen Ou
- Department of Physics, Zhejiang University, Hangzhou 310027, China
| | - Jingyuan Li
- Department of Physics, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|
7
|
Lackey L, Coria A, Ghosh AJ, Grayeski P, Hatfield A, Shankar V, Platig J, Xu Z, Ramos SBV, Silverman EK, Ortega VE, Cho MH, Hersh CP, Hobbs BD, Castaldi P, Laederach A. Alternative poly-adenylation modulates α1-antitrypsin expression in chronic obstructive pulmonary disease. PLoS Genet 2021; 17:e1009912. [PMID: 34784346 PMCID: PMC8631626 DOI: 10.1371/journal.pgen.1009912] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 11/30/2021] [Accepted: 10/25/2021] [Indexed: 01/07/2023] Open
Abstract
α1-anti-trypsin (A1AT), encoded by SERPINA1, is a neutrophil elastase inhibitor that controls the inflammatory response in the lung. Severe A1AT deficiency increases risk for Chronic Obstructive Pulmonary Disease (COPD), however, the role of A1AT in COPD in non-deficient individuals is not well known. We identify a 2.1-fold increase (p = 2.5x10-6) in the use of a distal poly-adenylation site in primary lung tissue RNA-seq in 82 COPD cases when compared to 64 controls and replicate this in an independent study of 376 COPD and 267 controls. This alternative polyadenylation event involves two sites, a proximal and distal site, 61 and 1683 nucleotides downstream of the A1AT stop codon. To characterize this event, we measured the distal ratio in human primary tissue short read RNA-seq data and corroborated our results with long read RNA-seq data. Integrating these results with 3' end RNA-seq and nanoluciferase reporter assay experiments we show that use of the distal site yields mRNA transcripts with over 50-fold decreased translation efficiency and A1AT expression. We identified seven RNA binding proteins using enhanced CrossLinking and ImmunoPrecipitation precipitation (eCLIP) with one or more binding sites in the SERPINA1 3' UTR. We combined these data with measurements of the distal ratio in shRNA knockdown experiments, nuclear and cytoplasmic fractionation, and chemical RNA structure probing. We identify Quaking Homolog (QKI) as a modulator of SERPINA1 mRNA translation and confirm the role of QKI in SERPINA1 translation with luciferase reporter assays. Analysis of single-cell RNA-seq showed differences in the distribution of the SERPINA1 distal ratio among hepatocytes, macrophages, αβ-Tcells and plasma cells in the liver. Alveolar Type 1,2, dendritic cells and macrophages also vary in their distal ratio in the lung. Our work reveals a complex post-transcriptional mechanism that regulates alternative polyadenylation and A1AT expression in COPD.
Collapse
Affiliation(s)
- Lela Lackey
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - Aaztli Coria
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Auyon J. Ghosh
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Phil Grayeski
- Curriculum in Genetics and Molecular Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Abigail Hatfield
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - Vijay Shankar
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - John Platig
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zhonghui Xu
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Silvia B. V. Ramos
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Edwin K. Silverman
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Victor E. Ortega
- Department of Internal Medicine, Division of Respiratory Medicine, Center for Individualized Medicine, Mayo Clinic, Scottsdale, Arizona, United States of America
| | - Michael H. Cho
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Craig P. Hersh
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Brian D. Hobbs
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Peter Castaldi
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Internal Medicine and Primary Care, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
8
|
Largy E, König A, Ghosh A, Ghosh D, Benabou S, Rosu F, Gabelica V. Mass Spectrometry of Nucleic Acid Noncovalent Complexes. Chem Rev 2021; 122:7720-7839. [PMID: 34587741 DOI: 10.1021/acs.chemrev.1c00386] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Nucleic acids have been among the first targets for antitumor drugs and antibiotics. With the unveiling of new biological roles in regulation of gene expression, specific DNA and RNA structures have become very attractive targets, especially when the corresponding proteins are undruggable. Biophysical assays to assess target structure as well as ligand binding stoichiometry, affinity, specificity, and binding modes are part of the drug development process. Mass spectrometry offers unique advantages as a biophysical method owing to its ability to distinguish each stoichiometry present in a mixture. In addition, advanced mass spectrometry approaches (reactive probing, fragmentation techniques, ion mobility spectrometry, ion spectroscopy) provide more detailed information on the complexes. Here, we review the fundamentals of mass spectrometry and all its particularities when studying noncovalent nucleic acid structures, and then review what has been learned thanks to mass spectrometry on nucleic acid structures, self-assemblies (e.g., duplexes or G-quadruplexes), and their complexes with ligands.
Collapse
Affiliation(s)
- Eric Largy
- Univ. Bordeaux, CNRS, INSERM, ARNA, UMR 5320, U1212, IECB, F-33600 Pessac, France
| | - Alexander König
- Univ. Bordeaux, CNRS, INSERM, ARNA, UMR 5320, U1212, IECB, F-33600 Pessac, France
| | - Anirban Ghosh
- Univ. Bordeaux, CNRS, INSERM, ARNA, UMR 5320, U1212, IECB, F-33600 Pessac, France
| | - Debasmita Ghosh
- Univ. Bordeaux, CNRS, INSERM, ARNA, UMR 5320, U1212, IECB, F-33600 Pessac, France
| | - Sanae Benabou
- Univ. Bordeaux, CNRS, INSERM, ARNA, UMR 5320, U1212, IECB, F-33600 Pessac, France
| | - Frédéric Rosu
- Univ. Bordeaux, CNRS, INSERM, IECB, UMS 3033, F-33600 Pessac, France
| | - Valérie Gabelica
- Univ. Bordeaux, CNRS, INSERM, ARNA, UMR 5320, U1212, IECB, F-33600 Pessac, France
| |
Collapse
|
9
|
Cheng Y, Zhang S, Xu X, Chen SJ. Vfold2D-MC: A Physics-Based Hybrid Model for Predicting RNA Secondary Structure Folding. J Phys Chem B 2021; 125:10108-10118. [PMID: 34473508 DOI: 10.1021/acs.jpcb.1c04731] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Accurate prediction of RNA structure and folding stability has a far-reaching impact on our understanding of RNA functions. Here we develop Vfold2D-MC, a new physics-based model, to predict RNA structure and folding thermodynamics from the sequence. The model employs virtual bond-based coarse-graining of RNA backbone conformation and generates RNA conformations through Monte Carlo sampling of the bond angles and torsional angles of the virtual bonds. Using a coarse-grained statistical potential derived from the known structures, we assign each conformation with a statistical weight. The weighted average over the conformational ensemble gives the entropy and free energy parameters for the hairpin, bulge, and internal loops, and multiway junctions. From the thermodynamic parameters, we predict RNA structures, melting curves, and structural changes from the sequence. Theory-experiment comparisons indicate that Vfold2D-MC not only gives improved structure predictions but also enables the interpretation of thermodynamic results for different RNA structures, including multibranched junctions. This new model sets a promising framework to treat more complicated RNA structures, such as pseudoknotted and intramolecular kissing loops, for which experimental thermodynamic parameters are often unavailable.
Collapse
Affiliation(s)
- Yi Cheng
- Department of Physics, Department of Biochemistry, and Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Sicheng Zhang
- Department of Physics, Department of Biochemistry, and Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| |
Collapse
|
10
|
Danaee P, Rouches M, Wiley M, Deng D, Huang L, Hendrix D. bpRNA: large-scale automated annotation and analysis of RNA secondary structure. Nucleic Acids Res 2019; 46:5381-5394. [PMID: 29746666 PMCID: PMC6009582 DOI: 10.1093/nar/gky285] [Citation(s) in RCA: 109] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Accepted: 04/11/2018] [Indexed: 01/04/2023] Open
Abstract
While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, ‘bpRNA-1m’, of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.
Collapse
Affiliation(s)
| | | | | | - Dezhong Deng
- School of Electrical Engineering and Computer Science
| | - Liang Huang
- School of Electrical Engineering and Computer Science
| | - David Hendrix
- School of Electrical Engineering and Computer Science.,Department of Biochemistry and Biophysics
| |
Collapse
|
11
|
Zuber J, Mathews DH. Estimating uncertainty in predicted folding free energy changes of RNA secondary structures. RNA (NEW YORK, N.Y.) 2019; 25:747-754. [PMID: 30952689 PMCID: PMC6521603 DOI: 10.1261/rna.069203.118] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Accepted: 04/02/2019] [Indexed: 06/09/2023]
Abstract
Nearest neighbor parameters for estimating the folding stability of RNA are commonly used in secondary structure prediction, for generating folding ensembles of structures, and for analyzing RNA function. Previously, we demonstrated that we could quantify the uncertainties in each nearest neighbor parameter by perturbing the underlying optical melting data within experimental error and rederiving the parameters, which accounts for the substantial correlations that exist between the parameters. In this contribution, we describe a method to estimate uncertainty in the estimated folding stabilities of RNA structures, accounting for correlations in the nearest neighbor parameters. This method is incorporated in the RNA structure software package.
Collapse
Affiliation(s)
- Jeffrey Zuber
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York, 14642, USA
| | - David H Mathews
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York, 14642, USA
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, New York, 14642, USA
| |
Collapse
|
12
|
Thiel BC, Beckmann IK, Kerpedjiev P, Hofacker IL. 3D based on 2D: Calculating helix angles and stacking patterns using forgi 2.0, an RNA Python library centered on secondary structure elements. F1000Res 2019; 8:ISCB Comm J-287. [PMID: 31069053 PMCID: PMC6480952 DOI: 10.12688/f1000research.18458.2] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/04/2019] [Indexed: 01/01/2023] Open
Abstract
We present forgi, a Python library to analyze the tertiary structure of RNA secondary structure elements. Our representation of an RNA molecule is centered on secondary structure elements (stems, bulges and loops). By fitting a cylinder to the helix axis, these elements are carried over into a coarse-grained 3D structure representation. Integration with Biopython allows for handling of all-atom 3D information. forgi can deal with a variety of file formats including dotbracket strings, PDB and MMCIF files. We can handle modified residues, missing residues, cofold and multifold structures as well as nucleotide numbers starting at arbitrary positions. We apply this library to the study of stacking helices in junctions and pseudoknots and investigate how far stacking helices in solved experimental structures can divert from coaxial geometries.
Collapse
Affiliation(s)
- Bernhard C. Thiel
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Irene K. Beckmann
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Peter Kerpedjiev
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, 02115, USA
| | - Ivo L. Hofacker
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
- Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, 1090, Austria
| |
Collapse
|
13
|
Thiel BC, Beckmann IK, Kerpedjiev P, Hofacker IL. 3D based on 2D: Calculating helix angles and stacking patterns using forgi 2.0, an RNA Python library centered on secondary structure elements. F1000Res 2019; 8:ISCB Comm J-287. [PMID: 31069053 PMCID: PMC6480952 DOI: 10.12688/f1000research.18458.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/06/2019] [Indexed: 10/12/2023] Open
Abstract
We present forgi, a Python library to analyze the tertiary structure of RNA secondary structure elements. Our representation of an RNA molecule is centered on secondary structure elements (stems, bulges and loops). By fitting a cylinder to the helix axis, these elements are carried over into a coarse-grained 3D structure representation. Integration with Biopython allows for handling of all-atom 3D information. forgi can deal with a variety of file formats including dotbracket strings, PDB and MMCIF files. We can handle modified residues, missing residues, cofold and multifold structures as well as nucleotide numbers starting at arbitrary positions. We apply this library to the study of stacking helices in junctions and pseudo knots and investigate how far stacking helices in solved experimental structures can divert from coaxial geometries.
Collapse
Affiliation(s)
- Bernhard C. Thiel
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Irene K. Beckmann
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Peter Kerpedjiev
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, 02115, USA
| | - Ivo L. Hofacker
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
- Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, 1090, Austria
| |
Collapse
|
14
|
Quantitative tests of a reconstitution model for RNA folding thermodynamics and kinetics. Proc Natl Acad Sci U S A 2017; 114:E7688-E7696. [PMID: 28839094 DOI: 10.1073/pnas.1703507114] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Decades of study of the architecture and function of structured RNAs have led to the perspective that RNA tertiary structure is modular, made of locally stable domains that retain their structure across RNAs. We formalize a hypothesis inspired by this modularity-that RNA folding thermodynamics and kinetics can be quantitatively predicted from separable energetic contributions of the individual components of a complex RNA. This reconstitution hypothesis considers RNA tertiary folding in terms of ΔGalign, the probability of aligning tertiary contact partners, and ΔGtert, the favorable energetic contribution from the formation of tertiary contacts in an aligned state. This hypothesis predicts that changes in the alignment of tertiary contacts from different connecting helices and junctions (ΔGHJH) or from changes in the electrostatic environment (ΔG+/-) will not affect the energetic perturbation from a mutation in a tertiary contact (ΔΔGtert). Consistent with these predictions, single-molecule FRET measurements of folding of model RNAs revealed constant ΔΔGtert values for mutations in a tertiary contact embedded in different structural contexts and under different electrostatic conditions. The kinetic effects of these mutations provide further support for modular behavior of RNA elements and suggest that tertiary mutations may be used to identify rate-limiting steps and dissect folding and assembly pathways for complex RNAs. Overall, our model and results are foundational for a predictive understanding of RNA folding that will allow manipulation of RNA folding thermodynamics and kinetics. Conversely, the approaches herein can identify cases where an independent, additive model cannot be applied and so require additional investigation.
Collapse
|
15
|
Hill AC, Schroeder SJ. Thermodynamic stabilities of three-way junction nanomotifs in prohead RNA. RNA (NEW YORK, N.Y.) 2017; 23:521-529. [PMID: 28069889 PMCID: PMC5340915 DOI: 10.1261/rna.059220.116] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/18/2016] [Accepted: 12/24/2016] [Indexed: 06/06/2023]
Abstract
The thermodynamic stabilities of four natural prohead or packaging RNA (pRNA) three-way junction (3WJ) nanomotifs and seven phi29 pRNA 3WJ deletion mutant nanomotifs were investigated using UV optical melting on a three-component RNA system. Our data reveal that some pRNA 3WJs are more stable than the phi29 pRNA 3WJ. The stability of the 3WJ contributes to the unique self-assembly properties of pRNA. Thus, ultrastable pRNA 3WJ motifs suggest new scaffolds for pRNA-based nanotechnology. We present data demonstrating that pRNA 3WJs differentially respond to the presence of metal ions. A comparison of our data with free energies predicted by currently available RNA secondary structure prediction programs shows that these programs do not accurately predict multibranch loop stabilities. These results will expand the existing parameters used for RNA secondary structure prediction from sequence in order to better inform RNA structure-function hypotheses and guide the rational design of functional RNA supramolecular assemblies.
Collapse
Affiliation(s)
| | - Susan J Schroeder
- Department of Microbiology and Plant Biology
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma 73019, USA
| |
Collapse
|
16
|
Sloma MF, Mathews DH. Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures. RNA (NEW YORK, N.Y.) 2016; 22:1808-1818. [PMID: 27852924 PMCID: PMC5113201 DOI: 10.1261/rna.053694.115] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Accepted: 09/08/2016] [Indexed: 05/10/2023]
Abstract
RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures.
Collapse
Affiliation(s)
- Michael F Sloma
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - David H Mathews
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| |
Collapse
|
17
|
Mutations Designed by Ensemble Defect to Misfold Conserved RNA Structures of Influenza A Segments 7 and 8 Affect Splicing and Attenuate Viral Replication in Cell Culture. PLoS One 2016; 11:e0156906. [PMID: 27272307 PMCID: PMC4896458 DOI: 10.1371/journal.pone.0156906] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 05/20/2016] [Indexed: 12/01/2022] Open
Abstract
Influenza A virus is a significant public health threat, but little is understood about the viral RNA structure and function. Current vaccines and therapeutic options to control influenza A virus infections are mostly protein-centric and of limited effectiveness. Here, we report using an ensemble defect approach to design mutations to misfold regions of conserved mRNA structures in influenza A virus segments 7 and 8. Influenza A mutant viruses inhibit pre-mRNA splicing and attenuate viral replication in cell culture, thus providing evidence for functions of the targeted regions. Targeting these influenza A viral RNA regions provides new possibilities for designing vaccines and therapeutics against this important human respiratory pathogen. The results also demonstrate that the ensemble defect approach is an efficient way to test for function of RNA sequences.
Collapse
|
18
|
Mustoe AM, Al-Hashimi HM, Brooks CL. Secondary structure encodes a cooperative tertiary folding funnel in the Azoarcus ribozyme. Nucleic Acids Res 2015; 44:402-12. [PMID: 26481360 PMCID: PMC4705646 DOI: 10.1093/nar/gkv1055] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2015] [Accepted: 10/03/2015] [Indexed: 12/20/2022] Open
Abstract
A requirement for specific RNA folding is that the free-energy landscape discriminate against non-native folds. While tertiary interactions are critical for stabilizing the native fold, they are relatively non-specific, suggesting additional mechanisms contribute to tertiary folding specificity. In this study, we use coarse-grained molecular dynamics simulations to explore how secondary structure shapes the tertiary free-energy landscape of the Azoarcus ribozyme. We show that steric and connectivity constraints posed by secondary structure strongly limit the accessible conformational space of the ribozyme, and that these so-called topological constraints in turn pose strong free-energy penalties on forming different tertiary contacts. Notably, native A-minor and base-triple interactions form with low conformational free energy, while non-native tetraloop/tetraloop–receptor interactions are penalized by high conformational free energies. Topological constraints also give rise to strong cooperativity between distal tertiary interactions, quantitatively matching prior experimental measurements. The specificity of the folding landscape is further enhanced as tertiary contacts place additional constraints on the conformational space, progressively funneling the molecule to the native state. These results indicate that secondary structure assists the ribozyme in navigating the otherwise rugged tertiary folding landscape, and further emphasize topological constraints as a key force in RNA folding.
Collapse
Affiliation(s)
- Anthony M Mustoe
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hashim M Al-Hashimi
- Department of Biochemistry and Chemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Charles L Brooks
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109, USA Department of Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
19
|
Schudoma C. It's a loop world - single strands in RNA as structural and functional elements. Biomol Concepts 2015; 2:171-81. [PMID: 25962027 DOI: 10.1515/bmc.2011.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Accepted: 03/25/2011] [Indexed: 01/31/2023] Open
Abstract
Unpaired regions in RNA molecules - loops - are centrally involved in defining the characteristic three-dimensional (3D) architecture of RNAs and are of high interest in RNA engineering and design. Loops adopt diverse, but specific conformations stabilised by complex tertiary structural interactions that provide structural flexibility to RNA structures that would otherwise not be possible if they only consisted of the rigid A-helical shapes usually formed by canonical base pairing. By participating in sequence-non-local contacts, they furthermore contribute to stabilising the overall fold of RNA molecules. Interactions between RNAs and other nucleic acids, proteins, or small molecules are also generally mediated by RNA loop structures. Therefore, the function of an RNA molecule is generally dependent on its loops. Examples include intermolecular interactions between RNAs as part of the microRNA processing pathways, ribozymatic activity, or riboswitch-ligand interactions. Bioinformatics approaches have been successfully applied to the identification of novel RNA structural motifs including loops, local and global RNA 3D structure prediction, and structural and conformational analysis of RNAs and have contributed to a better understanding of the sequence-structure-function relationships in RNA loops.
Collapse
|
20
|
|
21
|
Bonneau E, Girard N, Lemieux S, Legault P. The NMR structure of the II-III-VI three-way junction from the Neurospora VS ribozyme reveals a critical tertiary interaction and provides new insights into the global ribozyme structure. RNA (NEW YORK, N.Y.) 2015; 21:1621-32. [PMID: 26124200 PMCID: PMC4536322 DOI: 10.1261/rna.052076.115] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Accepted: 06/08/2015] [Indexed: 05/04/2023]
Abstract
As part of an effort to structurally characterize the complete Neurospora VS ribozyme, NMR solution structures of several subdomains have been previously determined, including the internal loops of domains I and VI, the I/V kissing-loop interaction and the III-IV-V junction. Here, we expand this work by determining the NMR structure of a 62-nucleotide RNA (J236) that encompasses the VS ribozyme II-III-VI three-way junction and its adjoining stems. In addition, we localize Mg(2+)-binding sites within this structure using Mn(2+)-induced paramagnetic relaxation enhancement. The NMR structure of the J236 RNA displays a family C topology with a compact core stabilized by continuous stacking of stems II and III, a cis WC/WC G•A base pair, two base triples and two Mg(2+) ions. Moreover, it reveals a remote tertiary interaction between the adenine bulges of stems II and VI. Additional NMR studies demonstrate that both this bulge-bulge interaction and Mg(2+) ions are critical for the stable folding of the II-III-VI junction. The NMR structure of the J236 RNA is consistent with biochemical studies on the complete VS ribozyme, but not with biophysical studies performed with a minimal II-III-VI junction that does not contain the II-VI bulge-bulge interaction. Together with previous NMR studies, our findings provide important new insights into the three-dimensional architecture of this unique ribozyme.
Collapse
Affiliation(s)
- Eric Bonneau
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montréal, Quebec H3C 3J7, Canada
| | - Nicolas Girard
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montréal, Quebec H3C 3J7, Canada
| | - Sébastien Lemieux
- Département d'Informatique et de Recherche Opérationnelle et Institut de Recherche en Immunologie et en Cancérologie, Université de Montréal, Montréal, Quebec H3C 3J7, Canada
| | - Pascale Legault
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montréal, Quebec H3C 3J7, Canada
| |
Collapse
|
22
|
Mustoe AM, Liu X, Lin PJ, Al-Hashimi HM, Fierke CA, Brooks CL. Noncanonical secondary structure stabilizes mitochondrial tRNA(Ser(UCN)) by reducing the entropic cost of tertiary folding. J Am Chem Soc 2015; 137:3592-9. [PMID: 25705930 PMCID: PMC4399864 DOI: 10.1021/ja5130308] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Mammalian mitochondrial tRNA(Ser(UCN)) (mt-tRNA(Ser)) and pyrrolysine tRNA (tRNA(Pyl)) fold to near-canonical three-dimensional structures despite having noncanonical secondary structures with shortened interhelical loops that disrupt the conserved tRNA tertiary interaction network. How these noncanonical tRNAs compensate for their loss of tertiary interactions remains unclear. Furthermore, in human mt-tRNA(Ser), lengthening the variable loop by the 7472insC mutation reduces mt-tRNA(Ser) concentration in vivo through poorly understood mechanisms and is strongly associated with diseases such as deafness and epilepsy. Using simulations of the TOPRNA coarse-grained model, we show that increased topological constraints encoded by the unique secondary structure of wild-type mt-tRNA(Ser) decrease the entropic cost of folding by ∼2.5 kcal/mol compared to canonical tRNA, offsetting its loss of tertiary interactions. Further simulations show that the pathogenic 7472insC mutation disrupts topological constraints and hence destabilizes the mutant mt-tRNA(Ser) by ∼0.6 kcal/mol relative to wild-type. UV melting experiments confirm that insertion mutations lower mt-tRNA(Ser) melting temperature by 6-9 °C and increase the folding free energy by 0.8-1.7 kcal/mol in a largely sequence- and salt-independent manner, in quantitative agreement with our simulation predictions. Our results show that topological constraints provide a quantitative framework for describing key aspects of RNA folding behavior and also provide the first evidence of a pathogenic mutation that is due to disruption of topological constraints.
Collapse
Affiliation(s)
- Anthony M. Mustoe
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109-1055, USA
| | - Xin Liu
- Department of Chemistry, University of Michigan, Ann Arbor, MI 48109-1055, USA
| | - Paul J. Lin
- Department of Chemistry, University of Michigan, Ann Arbor, MI 48109-1055, USA
| | - Hashim M. Al-Hashimi
- Departments of Biochemistry and Chemistry, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Carol A. Fierke
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109-1055, USA
- Department of Chemistry, University of Michigan, Ann Arbor, MI 48109-1055, USA
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109-1055, USA
| | - Charles L. Brooks
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109-1055, USA
- Department of Chemistry, University of Michigan, Ann Arbor, MI 48109-1055, USA
| |
Collapse
|
23
|
Bonneau E, Legault P. Nuclear magnetic resonance structure of the III-IV-V three-way junction from the Varkud satellite ribozyme and identification of magnesium-binding sites using paramagnetic relaxation enhancement. Biochemistry 2014; 53:6264-75. [PMID: 25238589 DOI: 10.1021/bi500826n] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The VS ribozyme is a catalytic RNA found within some natural isolates of Neurospora that is being used as a model system to improve our understanding of RNA structure, catalysis, and engineering. The catalytic domain contains five helical domains (SLII-SLVI) that are organized by two three-way junctions. The III-IV-V junction is required for high-affinity binding of the substrate domain (SLI) through formation of a kissing loop interaction with SLV. Here, we determine the high-resolution nuclear magnetic resonance (NMR) structure of a 47-nucleotide RNA containing the III-IV-V junction (J345). The J345 RNA adopts a Y-shaped fold typical of the family C three-way junctions, with coaxial stacking between stems III and IV and an acute angle between stems III and V. The NMR structure reveals that the core of the III-IV-V junction contains four stacked base triples, a U-turn motif, a cross-strand stacking interaction, an A-minor interaction, and a ribose zipper. In addition, the NMR structure shows that the cCUUGg tetraloop used to stabilize stem IV adopts a novel RNA tetraloop fold, different from the known gCUUGc tetraloop structure. Using Mn(2+)-induced paramagnetic relaxation enhancement, we identify six Mg(2+)-binding sites within J345, including one associated with the cCUUGg tetraloop and two with the junction core. The NMR structure of J345 likely represents the conformation of the III-IV-V junction in the context of the active VS ribozyme and suggests that this junction functions as a dynamic hinge that contributes to substrate recognition and catalysis. Moreover, this study highlights a new role for family C three-way junctions in long-range tertiary interactions.
Collapse
Affiliation(s)
- Eric Bonneau
- Département de Biochimie et Médecine Moléculaire, Université de Montréal , C.P. 6128, Succursale Centre-Ville, Montréal, QC, Canada H3C 3J7
| | | |
Collapse
|
24
|
Mustoe AM, Brooks CL, Al-Hashimi HM. Topological constraints are major determinants of tRNA tertiary structure and dynamics and provide basis for tertiary folding cooperativity. Nucleic Acids Res 2014; 42:11792-804. [PMID: 25217593 PMCID: PMC4191394 DOI: 10.1093/nar/gku807] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Recent studies have shown that basic steric and connectivity constraints encoded at the secondary structure level are key determinants of 3D structure and dynamics in simple two-way RNA junctions. However, the role of these topological constraints in higher order RNA junctions remains poorly understood. Here, we use a specialized coarse-grained molecular dynamics model to directly probe the thermodynamic contributions of topological constraints in defining the 3D architecture and dynamics of transfer RNA (tRNA). Topological constraints alone restrict tRNA's allowed conformational space by over an order of magnitude and strongly discriminate against formation of non-native tertiary contacts, providing a sequence independent source of folding specificity. Topological constraints also give rise to long-range correlations between the relative orientation of tRNA's helices, which in turn provides a mechanism for encoding thermodynamic cooperativity between distinct tertiary interactions. These aspects of topological constraints make it such that only several tertiary interactions are needed to confine tRNA to its native global structure and specify functionally important 3D dynamics. We further show that topological constraints are conserved across tRNA's different naturally occurring secondary structures. Taken together, our results emphasize the central role of secondary-structure-encoded topological constraints in defining RNA 3D structure, dynamics and folding.
Collapse
Affiliation(s)
- Anthony M Mustoe
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Charles L Brooks
- Department of Biophysics, University of Michigan, Ann Arbor, MI 48109, USA Department of Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hashim M Al-Hashimi
- Department of Biochemistry and Chemistry, Duke University School of Medicine, Durham, NC 27710, USA
| |
Collapse
|
25
|
Affiliation(s)
- David H. Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center Rochester New York
| |
Collapse
|
26
|
Abstract
RNA dynamics play a fundamental role in many cellular functions. However, there is no general framework to describe these complex processes, which typically consist of many structural maneuvers that occur over timescales ranging from picoseconds to seconds. Here, we classify RNA dynamics into distinct modes representing transitions between basins on a hierarchical free-energy landscape. These transitions include large-scale secondary-structural transitions at >0.1-s timescales, base-pair/tertiary dynamics at microsecond-to-millisecond timescales, stacking dynamics at timescales ranging from nanoseconds to microseconds, and other "jittering" motions at timescales ranging from picoseconds to nanoseconds. We review various modes within these three different tiers, the different mechanisms by which they are used to regulate function, and how they can be coupled together to achieve greater functional complexity.
Collapse
|
27
|
|
28
|
Mustoe AM, Al-Hashimi HM, Brooks CL. Coarse grained models reveal essential contributions of topological constraints to the conformational free energy of RNA bulges. J Phys Chem B 2014; 118:2615-27. [PMID: 24547945 PMCID: PMC3983386 DOI: 10.1021/jp411478x] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
![]()
Recent studies have shown that simple
stereochemical constraints
encoded at the RNA secondary structure level significantly restrict
the orientation of RNA helices across two-way junctions and yield
physically reasonable distributions of RNA 3D conformations. Here
we develop a new coarse-grain model, TOPRNA, that is optimized for
exploring detailed aspects of these topological constraints in complex
RNA systems. Unlike prior models, TOPRNA effectively treats RNAs as
collections of semirigid helices linked by freely rotatable single
strands, allowing us to isolate the effects of secondary structure
connectivity and sterics on 3D structure. Simulations of bulge junctions
show that TOPRNA captures new aspects of topological constraints,
including variations arising from deviations in local A-form structure,
translational displacements of the helices, and stereochemical constraints
imposed by bulge-linker nucleotides. Notably, these aspects of topological
constraints define free energy landscapes that coincide with the distribution
of bulge conformations in the PDB. Our simulations also quantitatively
reproduce NMR RDC measurements made on HIV-1 TAR at low salt concentrations,
although not for different TAR mutants or at high salt concentrations.
Our results confirm that topological constraints are an important
determinant of bulge conformation and dynamics and demonstrate the
utility of TOPRNA for studying the topological constraints of complex
RNAs.
Collapse
Affiliation(s)
- Anthony M Mustoe
- Departments of Biophysics and ‡Chemistry, University of Michigan , 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | | | | |
Collapse
|
29
|
Tan ZJ, Chen SJ. Ion-mediated RNA structural collapse: effect of spatial confinement. Biophys J 2013; 103:827-36. [PMID: 22947944 DOI: 10.1016/j.bpj.2012.06.048] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2012] [Revised: 06/25/2012] [Accepted: 06/27/2012] [Indexed: 12/28/2022] Open
Abstract
RNAs are negatively charged molecules that reside in cellular environments with macromolecular crowding. Macromolecular confinement can influence the ion effects in RNA folding. In this work, using the recently developed tightly bound ion model for ion fluctuation and correlation, we investigate the effect of confinement on ion-mediated RNA structural collapse for a simple model system. We find that for both Na(+) and Mg(2+), the ion efficiencies in mediating structural collapse/folding are significantly enhanced by the structural confinement. This enhancement of ion efficiency is attributed to the decreased electrostatic free-energy difference between the compact conformation ensemble and the (restricted) extended conformation ensemble due to the spatial restriction.
Collapse
Affiliation(s)
- Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, People's Republic of China.
| | | |
Collapse
|
30
|
Donghi D, Pechlaner M, Finazzo C, Knobloch B, Sigel RKO. The structural stabilization of the κ three-way junction by Mg(II) represents the first step in the folding of a group II intron. Nucleic Acids Res 2012; 41:2489-504. [PMID: 23275550 PMCID: PMC3575829 DOI: 10.1093/nar/gks1179] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Folding of group II introns is characterized by a first slow compaction of domain 1 (D1) followed by the rapid docking of other domains to this scaffold. D1 compaction initiates in a small subregion encompassing the κ and ζ elements. These two tertiary elements are also the major interaction sites with domain 5 to form the catalytic core. Here, we provide the first characterization of the structure adopted at an early folding step and show that the folding control element can be narrowed down to the three-way junction with the κ motif. In our nuclear magnetic resonance studies of this substructure derived from the yeast mitochondrial group II intron Sc.ai5γ, we show that a high affinity Mg(II) ion stabilizes the κ element and enables coaxial stacking between helices d′ and d′′, favoring a rigid duplex across the three-way junction. The κ-element folds into a stable GAAA-tetraloop motif and engages in A-minor interactions with helix d′. The addition of cobalt(III)hexammine reveals three distinct binding sites. The Mg(II)-promoted structural rearrangement and rigidification of the D1 core can be identified as the first micro-step of D1 folding.
Collapse
Affiliation(s)
- Daniela Donghi
- Institute of Inorganic Chemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | | | | | | | | |
Collapse
|
31
|
Wang Y, Manzour A, Shareghi P, Shaw TI, Li YW, Malmberg RL, Cai L. Stable stem enabled Shannon entropies distinguish non-coding RNAs from random backgrounds. BMC Bioinformatics 2012; 13 Suppl 5:S1. [PMID: 22537005 PMCID: PMC3358654 DOI: 10.1186/1471-2105-13-s5-s1] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background The computational identification of RNAs in genomic sequences requires the identification of signals of RNA sequences. Shannon base pairing entropy is an indicator for RNA secondary structure fold certainty in detection of structural, non-coding RNAs (ncRNAs). Under the Boltzmann ensemble of secondary structures, the probability of a base pair is estimated from its frequency across all the alternative equilibrium structures. However, such an entropy has yet to deliver the desired performance for distinguishing ncRNAs from random sequences. Developing novel methods to improve the entropy measure performance may result in more effective ncRNA gene finding based on structure detection. Results This paper shows that the measuring performance of base pairing entropy can be significantly improved with a constrained secondary structure ensemble in which only canonical base pairs are assumed to occur in energetically stable stems in a fold. This constraint actually reduces the space of the secondary structure and may lower the probabilities of base pairs unfavorable to the native fold. Indeed, base pairing entropies computed with this constrained model demonstrate substantially narrowed gaps of Z-scores between ncRNAs, as well as drastic increases in the Z-score for all 13 tested ncRNA sets, compared to shuffled sequences. Conclusions These results suggest the viability of developing effective structure-based ncRNA gene finding methods by investigating secondary structure ensembles of ncRNAs.
Collapse
Affiliation(s)
- Yingfeng Wang
- Department of Computer Science, University of Georgia, Athens, Georgia 30602, USA.
| | | | | | | | | | | | | |
Collapse
|
32
|
Cao S, Chen SJ. Statistical mechanical modeling of RNA folding: from free energy landscape to tertiary structural prediction. NUCLEIC ACIDS AND MOLECULAR BIOLOGY 2012; 27:185-212. [PMID: 27293312 DOI: 10.1007/978-3-642-25740-7_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
In spite of the success of computational methods for predicting RNA secondary structure, the problem of predicting RNA tertiary structure folding remains. Low-resolution structural models show promise as they allow for rigorous statistical mechanical computation for the conformational entropies, free energies, and the coarse-grained structures of tertiary folds. Molecular dynamics refinement of coarse-grained structures leads to all-atom 3D structures. Modeling based on statistical mechanics principles also has the unique advantage of predicting the full free energy landscape, including local minima and the global free energy minimum. The energy landscapes combined with the 3D structures form the basis for quantitative predictions of RNA functions. In this chapter, we present an overview of statistical mechanical models for RNA folding and then focus on a recently developed RNA statistical mechanical model -- the Vfold model. The main emphasis is placed on the physics underpinning the models, the computational strategies, and the connections to RNA biology.
Collapse
Affiliation(s)
- Song Cao
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, MO 65211
| | - Shi-Jie Chen
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, MO 65211
| |
Collapse
|
33
|
Lamiable A, Barth D, Denise A, Quessette F, Vial S, Westhof É. Automated prediction of three-way junction topological families in RNA secondary structures. Comput Biol Chem 2012; 37:1-5. [DOI: 10.1016/j.compbiolchem.2011.11.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2011] [Revised: 11/14/2011] [Accepted: 11/16/2011] [Indexed: 11/24/2022]
|
34
|
Parisien M, Major F. Determining RNA three-dimensional structures using low-resolution data. J Struct Biol 2012; 179:252-60. [PMID: 22387042 DOI: 10.1016/j.jsb.2011.12.024] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Revised: 11/29/2011] [Accepted: 12/06/2011] [Indexed: 11/25/2022]
Abstract
Knowing the 3-D structure of an RNA is fundamental to understand its biological function. Nowadays X-ray crystallography and NMR spectroscopy are systematically applied to newly discovered RNAs. However, the application of these high-resolution techniques is not always possible, and thus scientists must turn to lower resolution alternatives. Here, we introduce a pipeline to systematically generate atomic resolution 3-D structures that are consistent with low-resolution data sets. We compare and evaluate the discriminative power of a number of low-resolution experimental techniques to reproduce the structure of the Escherichia coli tRNA(VAL) and P4-P6 domain of the Tetrahymena thermophila group I intron. We test single and combinations of the most accessible low-resolution techniques, i.e. hydroxyl radical footprinting (OH), methidiumpropyl-EDTA (MPE), multiplexed hydroxyl radical cleavage (MOHCA), and small-angle X-ray scattering (SAXS). We show that OH-derived constraints are accurate to discriminate structures at the atomic level, whereas EDTA-based constraints apply to global shape determination. We provide a guide for choosing which experimental techniques or combination of thereof is best in which context. The pipeline represents an important step towards high-throughput low-resolution RNA structure determination.
Collapse
Affiliation(s)
- Marc Parisien
- Biochemistry Department, The University of Chicago, 929 E. 57th Street, Chicago, IL 60637, USA
| | | |
Collapse
|
35
|
Martin JS, Halvorsen M, Davis-Neulander L, Ritz J, Gopinath C, Beauregard A, Laederach A. Structural effects of linkage disequilibrium on the transcriptome. RNA (NEW YORK, N.Y.) 2012; 18:77-87. [PMID: 22109839 PMCID: PMC3261746 DOI: 10.1261/rna.029900.111] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
A majority of SNPs (single nucleotide polymorphisms) map to noncoding and intergenic regions of the genome. Noncoding SNPs are often identified in genome-wide association studies (GWAS) as strongly associated with human disease. Two such disease-associated SNPs in the 5' UTR of the human FTL (Ferritin Light Chain) gene are predicted to alter the ensemble of structures adopted by the mRNA. High-accuracy single nucleotide resolution chemical mapping reveals that these SNPs result in substantial changes in the structural ensemble in agreement with the computational prediction. Furthermore six rescue mutations are correctly predicted to restore the mRNA to its wild-type ensemble. Our data confirm that the FTL 5' UTR is a "RiboSNitch," an RNA that changes structure if a particular disease-associated SNP is present. The structural change observed is analogous to that of a bacterial Riboswitch in that it likely regulates translation. These data further suggest that specific pairs of SNPs in high linkage disequilibrium (LD) will form RNA structure-stabilizing haplotypes (SSHs). We identified 484 SNP pairs that form SSHs in UTRs of the human genome, and in eight of the 10 SSH-containing transcripts, SNP pairs stabilize RNA protein binding sites. The ubiquitous nature of SSHs in the transcriptome suggests that certain haplotypes are conserved to avoid RiboSNitch formation.
Collapse
Affiliation(s)
- Joshua S. Martin
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Matthew Halvorsen
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Lauren Davis-Neulander
- Developmental Genetics and Bioinformatics, Wadsworth Center, Albany, New York 12208, USA
| | - Justin Ritz
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Chetna Gopinath
- Biomedical Sciences Department, University at Albany, Albany, New York 12208, USA
| | - Arthur Beauregard
- Biomedical Sciences Department, University at Albany, Albany, New York 12208, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
- Corresponding author.E-mail .
| |
Collapse
|
36
|
Shareghi P, Wang Y, Malmberg R, Cai L. Simultaneous prediction of RNA secondary structure and helix coaxial stacking. BMC Genomics 2012; 13 Suppl 3:S7. [PMID: 22759616 PMCID: PMC3394421 DOI: 10.1186/1471-2164-13-s3-s7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Background RNA secondary structure plays a scaffolding role for RNA tertiary conformation. Accurate secondary structure prediction can not only identify double-stranded helices and single stranded-loops but also help provide information for potential tertiary interaction motifs critical to the 3D conformation. The average accuracy in ab initio prediction remains 70%; performance improvement has only been limited to short RNA sequences. The prediction of tertiary interaction motifs is difficult without multiple, related sequences that are usually not available. This paper presents research that aims to improve the secondary structure prediction performance and to develop a capability to predict coaxial stacking between helices. Coaxial stacking positions two helices on the same axis, a tertiary motif present in almost all junctions that account for a high percentage of RNA tertiary structures. Results This research identified energetic rules for coaxial stacks and geometric constraints on stack combinations, which were applied to developing an efficient dynamic programming application for simultaneous prediction of secondary structure and coaxial stacking. Results on a number of non-coding RNA data sets, of short and moderately long lengths, show a performance improvement (specially on tRNAs) for secondary structure prediction when compared with existing methods. The program also demonstrates a capability for prediction of coaxial stacking. Conclusions The significant leap of performance on tRNAs demonstrated in this work suggests that a breakthrough to a higher performance in RNA secondary structure prediction may lie in understanding contributions from tertiary motifs critical to the structure, as such information can be used to constrain geometrically as well as energetically the space of RNA secondary structure.
Collapse
|
37
|
Butcher SE, Pyle AM. The molecular interactions that stabilize RNA tertiary structure: RNA motifs, patterns, and networks. Acc Chem Res 2011; 44:1302-11. [PMID: 21899297 DOI: 10.1021/ar200098t] [Citation(s) in RCA: 253] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
RNA molecules adopt specific three-dimensional structures critical to their function. Many essential metabolic processes, including protein synthesis and RNA splicing, are carried out by RNA molecules with elaborate tertiary structures (e.g. 3QIQ, right). Indeed, the ribosome and self-splicing introns are complex RNA machines. But even the coding regions in messenger RNAs and viral RNAs are flanked by highly structured untranslated regions, which provide regulatory information necessary for gene expression. RNA tertiary structure is defined as the three-dimensional arrangement of RNA building blocks, which include helical duplexes, triple-stranded structures, and other components that are held together through connections collectively termed RNA tertiary interactions. The structural diversity of these interactions is now a subject of intense investigation, involving the techniques of NMR, X-ray crystallography, chemical genetics, and phylogenetic analysis. At the same time, many investigators are using biophysical techniques to elucidate the driving forces for tertiary structure formation and the mechanisms for its stabilization. RNA tertiary folding is promoted by maximization of base stacking, much like the hydrophobic effect that drives protein folding. RNA folding also requires electrostatic stabilization, both through charge screening and site binding of metals, and it is enhanced by desolvation of the phosphate backbone. In this Account, we provide an overview of the features that specify and stabilize RNA tertiary structure. A major determinant for overall tertiary RNA architecture is local conformation in secondary-structure junctions, which are regions from which two or more duplexes project. At junctions and other structures, such as pseudoknots and kissing loops, adjacent helices stack on one another, and these coaxial stacks play a major role in dictating the overall architectural form of an RNA molecule. In addition to RNA junction topology, a second determinant for RNA tertiary structure is the formation of sequence-specific interactions. Networks of triple helices, tetraloop-receptor interactions, and other sequence-specific contacts establish the framework for the overall tertiary fold. The third determinant of tertiary structure is the formation of stabilizing stacking and backbone interactions, and many are not sequence specific. For example, ribose zippers allow 2'-hydroxyl groups on different RNA strands to form networks of interdigitated hydrogen bonds, serving to seal strands together and thereby stabilize adjacent substructures. These motifs often require monovalent and divalent cations, which can interact diffusely or through chelation to specific RNA functional groups. As we learn more about the components of RNA tertiary structure, we will be able to predict the structures of RNA molecules from their sequences, thereby obtaining key information about biological function. Understanding and predicting RNA structure is particularly important given the recent discovery that although most of our genome is transcribed into RNA molecules, few of them have a known function. The prevalence of RNA viruses and pathogens with RNA genomes makes RNA drug discovery an active area of research. Finally, knowledge of RNA structure will facilitate the engineering of supramolecular RNA structures, which can be used as nanomechanical components for new materials. But all of this promise depends on a better understanding of the RNA parts list, and how the pieces fit together.
Collapse
Affiliation(s)
- Samuel E. Butcher
- Department of Biochemistry, University of Wisconsin—Madison, 433 Babcock
Drive, Madison, Wisconsin 53706-1544, United States
| | - Anna Marie Pyle
- Department of Molecular, Cellular
and Developmental Biology and Department of Chemistry, Yale University, New Haven, Connecticut, United States
- Howard Hughes Medical Institute
| |
Collapse
|
38
|
Höner zu Siederdissen C, Bernhart SH, Stadler PF, Hofacker IL. A folding algorithm for extended RNA secondary structures. Bioinformatics 2011; 27:i129-36. [PMID: 21685061 PMCID: PMC3117358 DOI: 10.1093/bioinformatics/btr220] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Motivation: RNA secondary structure contains many non-canonical base pairs of different pair families. Successful prediction of these structural features leads to improved secondary structures with applications in tertiary structure prediction and simultaneous folding and alignment. Results: We present a theoretical model capturing both RNA pair families and extended secondary structure motifs with shared nucleotides using 2-diagrams. We accompany this model with a number of programs for parameter optimization and structure prediction. Availability: All sources (optimization routines, RNA folding, RNA evaluation, extended secondary structure visualization) are published under the GPLv3 and available at www.tbi.univie.ac.at/software/rnawolf/. Contact:choener@tbi.univie.ac.at
Collapse
|
39
|
Wang Z, Parisien M, Scheets K, Miller WA. The cap-binding translation initiation factor, eIF4E, binds a pseudoknot in a viral cap-independent translation element. Structure 2011; 19:868-80. [PMID: 21645857 DOI: 10.1016/j.str.2011.03.013] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2011] [Revised: 03/11/2011] [Accepted: 03/13/2011] [Indexed: 01/15/2023]
Abstract
Eukaryotic initiation factor eIF4E performs a key early step in translation by specifically recognizing the m⁷GpppN cap structure at the 5' end of cellular mRNAs. Many viral mRNAs lack a 5' cap and thus bypass eIF4E. In contrast, we reported a cap-independent translation element (PTE) in Pea enation mosaic virus RNA2 that binds and requires eIF4E for translation initiation. To understand how this uncapped RNA is bound tightly by eIF4E, we employ SHAPE probing, phylogenetic comparisons with new PTEs discovered in panico- and carmoviruses, footprinting of the eIF4E binding site, and 3D RNA modeling using NAST, MC-Fold, and MC-Sym to predict a compact, 3D structure of the RNA. We propose that the cap-binding pocket of eIF4E clamps around a pseudoknot, placing a highly SHAPE-reactive guanosine in the pocket in place of the normal m⁷GpppN cap. This reveals a new mechanism of mRNA recognition by eIF4E.
Collapse
Affiliation(s)
- Zhaohui Wang
- Plant Pathology Department, and Biochemistry, Biophysics, and Molecular Biology Department, Iowa State University, Ames, IA 50011, USA
| | | | | | | |
Collapse
|
40
|
Laing C, Wen D, Wang JTL, Schlick T. Predicting coaxial helical stacking in RNA junctions. Nucleic Acids Res 2011; 40:487-98. [PMID: 21917853 PMCID: PMC3258123 DOI: 10.1093/nar/gkr629] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
RNA junctions are important structural elements that form when three or more helices come together in space in the tertiary structures of RNA molecules. Determining their structural configuration is important for predicting RNA 3D structure. We introduce a computational method to predict, at the secondary structure level, the coaxial helical stacking arrangement in junctions, as well as classify the junction topology. Our approach uses a data mining approach known as random forests, which relies on a set of decision trees trained using length, sequence and other variables specified for any given junction. The resulting protocol predicts coaxial stacking within three- and four-way junctions with an accuracy of 81% and 77%, respectively; the accuracy increases to 83% and 87%, respectively, when knowledge from the junction family type is included. Coaxial stacking predictions for the five to ten-way junctions are less accurate (60%) due to sparse data available for training. Additionally, our application predicts the junction family with an accuracy of 85% for three-way junctions and 74% for four-way junctions. Comparisons with other methods, as well applications to unsolved RNAs, are also presented. The web server Junction-Explorer to predict junction topologies is freely available at: http://bioinformatics.njit.edu/junction.
Collapse
Affiliation(s)
- Christian Laing
- Department of Chemistry, Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| | | | | | | |
Collapse
|
41
|
Seetin MG, Mathews DH. Automated RNA tertiary structure prediction from secondary structure and low-resolution restraints. J Comput Chem 2011; 32:2232-44. [PMID: 21509787 PMCID: PMC3288334 DOI: 10.1002/jcc.21806] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2010] [Revised: 12/07/2010] [Accepted: 03/06/2011] [Indexed: 11/05/2022]
Abstract
A novel protocol for all-atom RNA tertiary structure prediction is presented that uses restrained molecular mechanics and simulated annealing. The restraints are from secondary structure, covariation analysis, coaxial stacking predictions for helices in junctions, and, when available, cross-linking data. Results are demonstrated on the Alu domain of the mammalian signal recognition particle RNA, the Saccharomyces cerevisiae phenylalanine tRNA, the hammerhead ribozyme, the hepatitis C virus internal ribosomal entry site, and the P4-P6 domain of the Tetrahymena thermophila group I intron. The predicted structure is selected from a pool of decoy structures with a score that maximizes radius of gyration and base-base contacts, which was empirically found to select higher quality decoys. This simple ab initio approach is sufficient to make good predictions of the structure of RNAs compared to current crystal structures using both root mean square deviation and the accuracy of base-base contacts.
Collapse
Affiliation(s)
- Matthew G Seetin
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | | |
Collapse
|
42
|
Abstract
Current experiments on structural determination cannot keep up the pace with the steadily emerging RNA sequences and new functions. This underscores the request for an accurate model for RNA three-dimensional (3D) structural prediction. Although considerable progress has been made in mechanistic studies, accurate prediction for RNA tertiary folding from sequence remains an unsolved problem. The first and most important requirement for the prediction of RNA structure from physical principles is an accurate free energy model. A recently developed three-vector virtual bond-based RNA folding model ("Vfold") has allowed us to compute the chain entropy and predict folding free energies and structures for RNA secondary structures and simple pseudoknots. Here we develop a free energy-based method to predict larger more complex RNA tertiary folds. The approach is based on a multiscaling strategy: from the nucleotide sequence, we predict the two-dimensional (2D) structures (defined by the base pairs and tertiary contacts); based on the 2D structure, we construct a 3D scaffold; with the 3D scaffold as the initial state, we combine AMBER energy minimization and PDB-based fragment search to predict the all-atom structure. A key advantage of the approach is the statistical mechanical calculation for the conformational entropy of RNA structures, including those with cross-linked loops. Benchmark tests show that the model leads to significant improvements in RNA 3D structure prediction.
Collapse
Affiliation(s)
- Song Cao
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, MO 65211
| | - Shi-Jie Chen
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, MO 65211
| |
Collapse
|
43
|
Liu B, Diamond JM, Mathews DH, Turner DH. Fluorescence competition and optical melting measurements of RNA three-way multibranch loops provide a revised model for thermodynamic parameters. Biochemistry 2011; 50:640-53. [PMID: 21133351 PMCID: PMC3032278 DOI: 10.1021/bi101470n] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
![]()
Three-way multibranch loops (junctions) are common in RNA secondary structures. Computer algorithms such as RNAstructure and MFOLD do not consider the identity of unpaired nucleotides in multibranch loops when predicting secondary structure. There is limited experimental data, however, to parametrize this aspect of these algorithms. In this study, UV optical melting and a fluorescence competition assay are used to measure stabilities of multibranch loops containing up to five unpaired adenosines or uridines or a loop E motif. These results provide a test of our understanding of the factors affecting multibranch loop stability and provide revised parameters for predicting stability. The results should help to improve predictions of RNA secondary structure.
Collapse
Affiliation(s)
- Biao Liu
- Department of Chemistry, University of Rochester, Rochester, New York 14627, United States
| | | | | | | |
Collapse
|
44
|
Abstract
RNA molecules take advantage of prevalent structural motifs to fold and assemble into well-defined 3D architectures. The A-minor junction is a class of RNA motifs that specifically controls coaxial stacking of helices in natural RNAs. A sensitive self-assembling supra-molecular system was used as an assay to compare several natural and previously unidentified A-minor junctions by native polyacrylamide gel electrophoresis and atomic force microscopy. This class of modular motifs follows a topological rule that can accommodate a variety of interchangeable A-minor interactions with distinct local structural motifs. Overall, two different types of A-minor junctions can be distinguished based on their functional self-assembling behavior: one group makes use of triloops or GNRA and GNRA-like loops assembling with helices, while the other takes advantage of more complex tertiary receptors specific for the loop to gain higher stability. This study demonstrates how different structural motifs of RNA can contribute to the formation of topologically equivalent helical stacks. It also exemplifies the need of classifying RNA motifs based on their tertiary structural features rather than secondary structural features. The A-minor junction rule can be used to facilitate tertiary structure prediction of RNAs and rational design of RNA parts for nanobiotechnology and synthetic biology.
Collapse
Affiliation(s)
- Cody Geary
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA 93106-9510, USA
| | | | | |
Collapse
|
45
|
Flores SC, Altman RB. Turning limited experimental information into 3D models of RNA. RNA (NEW YORK, N.Y.) 2010; 16:1769-78. [PMID: 20651028 PMCID: PMC2924536 DOI: 10.1261/rna.2112110] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Our understanding of RNA functions in the cell is evolving rapidly. As for proteins, the detailed three-dimensional (3D) structure of RNA is often key to understanding its function. Although crystallography and nuclear magnetic resonance (NMR) can determine the atomic coordinates of some RNA structures, many 3D structures present technical challenges that make these methods difficult to apply. The great flexibility of RNA, its charged backbone, dearth of specific surface features, and propensity for kinetic traps all conspire with its long folding time, to challenge in silico methods for physics-based folding. On the other hand, base-pairing interactions (either in runs to form helices or isolated tertiary contacts) and motifs are often available from relatively low-cost experiments or informatics analyses. We present RNABuilder, a novel code that uses internal coordinate mechanics to satisfy user-specified base pairing and steric forces under chemical constraints. The code recapitulates the topology and characteristic L-shape of tRNA and obtains an accurate noncrystallographic structure of the Tetrahymena ribozyme P4/P6 domain. The algorithm scales nearly linearly with molecule size, opening the door to the modeling of significantly larger structures.
Collapse
|
46
|
Aalberts DP, Nandagopal N. A two-length-scale polymer theory for RNA loop free energies and helix stacking. RNA (NEW YORK, N.Y.) 2010; 16:1350-1355. [PMID: 20504955 PMCID: PMC2885684 DOI: 10.1261/rna.1831710] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2009] [Accepted: 03/22/2010] [Indexed: 05/29/2023]
Abstract
The reliability of RNA secondary structure predictions is subject to the accuracy of the underlying free energy model. Mfold and other RNA folding algorithms are based on the Turner model, whose weakest part is its formulation of loop free energies, particularly for multibranch loops. RNA loops contain single-strand and helix-crossing segments, so we develop an enhanced two-length freely jointed chain theory and revise it for self-avoidance. Our resulting universal formula for RNA loop entropy has fewer parameters than the Turner/Mfold model, and yet simulations show that the standard errors for multibranch loop free energies are reduced by an order of magnitude. We further note that coaxial stacking decreases the effective length of multibranch loops and provides, surprisingly, an entropic stabilization of the ordered configuration in addition to the enthalpic contribution of helix stacking. Our formula is in good agreement with measured hairpin free energies. We find that it also improves the accuracy of folding predictions.
Collapse
Affiliation(s)
- Daniel P Aalberts
- Department of Physics, Williams College, Williamstown, Massachusetts 01257, USA.
| | | |
Collapse
|
47
|
Andronescu MS, Pop C, Condon AE. Improved free energy parameters for RNA pseudoknotted secondary structure prediction. RNA (NEW YORK, N.Y.) 2010; 16:26-42. [PMID: 19933322 PMCID: PMC2802035 DOI: 10.1261/rna.1689910] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2009] [Accepted: 08/03/2009] [Indexed: 05/21/2023]
Abstract
Accurate prediction of RNA pseudoknotted secondary structures from the base sequence is a challenging computational problem. Since prediction algorithms rely on thermodynamic energy models to identify low-energy structures, prediction accuracy relies in large part on the quality of free energy change parameters. In this work, we use our earlier constraint generation and Boltzmann likelihood parameter estimation methods to obtain new energy parameters for two energy models for secondary structures with pseudoknots, namely, the Dirks-Pierce (DP) and the Cao-Chen (CC) models. To train our parameters, and also to test their accuracy, we create a large data set of both pseudoknotted and pseudoknot-free secondary structures. In addition to structural data our training data set also includes thermodynamic data, for which experimentally determined free energy changes are available for sequences and their reference structures. When incorporated into the HotKnots prediction algorithm, our new parameters result in significantly improved secondary structure prediction on our test data set. Specifically, the prediction accuracy when using our new parameters improves from 68% to 79% for the DP model, and from 70% to 77% for the CC model.
Collapse
Affiliation(s)
- Mirela S Andronescu
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | | | |
Collapse
|
48
|
Zhang J, Dundas J, Lin M, Chen R, Wang W, Liang J. Prediction of geometrically feasible three-dimensional structures of pseudoknotted RNA through free energy estimation. RNA (NEW YORK, N.Y.) 2009; 15:2248-63. [PMID: 19864433 PMCID: PMC2779689 DOI: 10.1261/rna.1723609] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2009] [Accepted: 09/05/2009] [Indexed: 05/07/2023]
Abstract
Accurate free energy estimation is essential for RNA structure prediction. The widely used Turner's energy model works well for nested structures. For pseudoknotted RNAs, however, there is no effective rule for estimation of loop entropy and free energy. In this work we present a new free energy estimation method, termed the pseudoknot predictor in three-dimensional space (pk3D), which goes beyond Turner's model. Our approach treats nested and pseudoknotted structures alike in one unifying physical framework, regardless of how complex the RNA structures are. We first test the ability of pk3D in selecting native structures from a large number of decoys for a set of 43 pseudoknotted RNA molecules, with lengths ranging from 23 to 113. We find that pk3D performs slightly better than the Dirks and Pierce extension of Turner's rule. We then test pk3D for blind secondary structure prediction, and find that pk3D gives the best sensitivity and comparable positive predictive value (related to specificity) in predicting pseudoknotted RNA secondary structures, when compared with other methods. A unique strength of pk3D is that it also generates spatial arrangement of structural elements of the RNA molecule. Comparison of three-dimensional structures predicted by pk3D with the native structure measured by nuclear magnetic resonance or X-ray experiments shows that the predicted spatial arrangement of stems and loops is often similar to that found in the native structure. These close-to-native structures can be used as starting points for further refinement to derive accurate three-dimensional structures of RNA molecules, including those with pseudoknots.
Collapse
Affiliation(s)
- Jian Zhang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois 60607, USA
| | | | | | | | | | | |
Collapse
|
49
|
Laing C, Jung S, Iqbal A, Schlick T. Tertiary motifs revealed in analyses of higher-order RNA junctions. J Mol Biol 2009; 393:67-82. [PMID: 19660472 DOI: 10.1016/j.jmb.2009.07.089] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2009] [Revised: 07/29/2009] [Accepted: 07/29/2009] [Indexed: 12/22/2022]
Abstract
RNA junctions are secondary-structure elements formed when three or more helices come together. They are present in diverse RNA molecules with various fundamental functions in the cell. To better understand the intricate architecture of three-dimensional (3D) RNAs, we analyze currently solved 3D RNA junctions in terms of base-pair interactions and 3D configurations. First, we study base-pair interaction diagrams for solved RNA junctions with 5 to 10 helices and discuss common features. Second, we compare these higher-order junctions to those containing 3 or 4 helices and identify global motif patterns such as coaxial stacking and parallel and perpendicular helical configurations. These analyses show that higher-order junctions organize their helical components in parallel and helical configurations similar to lower-order junctions. Their sub-junctions also resemble local helical configurations found in three- and four-way junctions and are stabilized by similar long-range interaction preferences such as A-minor interactions. Furthermore, loop regions within junctions are high in adenine but low in cytosine, and in agreement with previous studies, we suggest that coaxial stacking between helices likely forms when the common single-stranded loop is small in size; however, other factors such as stacking interactions involving noncanonical base pairs and proteins can greatly determine or disrupt coaxial stacking. Finally, we introduce the ribo-base interactions: when combined with the along-groove packing motif, these ribo-base interactions form novel motifs involved in perpendicular helix-helix interactions. Overall, these analyses suggest recurrent tertiary motifs that stabilize junction architecture, pack helices, and help form helical configurations that occur as sub-elements of larger junction networks. The frequent occurrence of similar helical motifs suggest nature's finite and perhaps limited repertoire of RNA helical conformation preferences. More generally, studies of RNA junctions and tertiary building blocks can ultimately help in the difficult task of RNA 3D structure prediction.
Collapse
Affiliation(s)
- Christian Laing
- Department of Chemistry, New York University, 251 Mercer Street, New York, NY 10012, USA
| | | | | | | |
Collapse
|
50
|
Analysis of four-way junctions in RNA structures. J Mol Biol 2009; 390:547-59. [PMID: 19445952 DOI: 10.1016/j.jmb.2009.04.084] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2009] [Revised: 04/21/2009] [Accepted: 04/30/2009] [Indexed: 11/24/2022]
Abstract
RNA secondary structures can be divided into helical regions composed of canonical Watson-Crick and related base pairs, as well as single-stranded regions such as hairpin loops, internal loops, and junctions. These elements function as building blocks in the design of diverse RNA molecules with various fundamental functions in the cell. To better understand the intricate architecture of three-dimensional (3D) RNAs, we analyze existing RNA four-way junctions in terms of base-pair interactions and 3D configurations. Specifically, we identify nine broad junction families according to coaxial stacking patterns and helical configurations. We find that helices within junctions tend to arrange in roughly parallel and perpendicular patterns and stabilize their conformations using common tertiary motifs such as coaxial stacking, loop-helix interaction, and helix packing interaction. Our analysis also reveals a number of highly conserved base-pair interaction patterns and novel tertiary motifs such as A-minor-coaxial stacking combinations and sarcin/ricin motif variants. Such analyses of RNA building blocks can ultimately help in the difficult task of RNA 3D structure prediction.
Collapse
|