1
|
Grzywacz K, Chełkowska-Pauszek A, Plucinska-Jankowska M, Żywicki M. The Evaluation of SHAPE-MaP RNA Structure Probing Protocols Reveals a Novel Role of Mn 2+ in the Detection of 2'-OH Adducts. Int J Mol Sci 2023; 24:ijms24097890. [PMID: 37175596 PMCID: PMC10178110 DOI: 10.3390/ijms24097890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 04/22/2023] [Accepted: 04/24/2023] [Indexed: 05/15/2023] Open
Abstract
Chemical probing, for decades, has been one of the most popular tools for studying the secondary structure of RNA molecules. Recently, protocols for simultaneous analysis of multiple RNAs have been developed, enabling in vivo transcriptome-wide interrogation of the RNA structure dynamics. One of the most popular methods is the selective 2'-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP). In this study, we describe the evaluation of this protocol by addressing the influence of the reverse transcription enzymes, buffer conditions, and chemical probes on the properties of the cDNA library and the quality of mutational profiling-derived structural signals. Our results reveal a SuperScript IV (SSIV) reverse transcriptase as a more efficient enzyme for mutational profiling of SHAPE adducts and shed new light on the role of Mn2+ cations in the modulation of SSIV readthrough efficiency.
Collapse
Affiliation(s)
- Kamilla Grzywacz
- Institute of Bioorganic Chemistry Polish Academy of Sciences, 61-704 Poznań, Poland
| | - Agnieszka Chełkowska-Pauszek
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, 61-614 Poznań, Poland
| | - Marianna Plucinska-Jankowska
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, 61-614 Poznań, Poland
| | - Marek Żywicki
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, 61-614 Poznań, Poland
| |
Collapse
|
2
|
RNA secondary structure packages evaluated and improved by high-throughput experiments. Nat Methods 2022; 19:1234-1242. [PMID: 36192461 PMCID: PMC9839360 DOI: 10.1038/s41592-022-01605-0] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Accepted: 08/10/2022] [Indexed: 01/17/2023]
Abstract
Despite the popularity of computer-aided study and design of RNA molecules, little is known about the accuracy of commonly used structure modeling packages in tasks sensitive to ensemble properties of RNA. Here, we demonstrate that the EternaBench dataset, a set of more than 20,000 synthetic RNA constructs designed on the RNA design platform Eterna, provides incisive discriminative power in evaluating current packages in ensemble-oriented structure prediction tasks. We find that CONTRAfold and RNAsoft, packages with parameters derived through statistical learning, achieve consistently higher accuracy than more widely used packages in their standard settings, which derive parameters primarily from thermodynamic experiments. We hypothesized that training a multitask model with the varied data types in EternaBench might improve inference on ensemble-based prediction tasks. Indeed, the resulting model, named EternaFold, demonstrated improved performance that generalizes to diverse external datasets including complete messenger RNAs, viral genomes probed in human cells and synthetic designs modeling mRNA vaccines.
Collapse
|
3
|
Leppek K, Byeon GW, Kladwang W, Wayment-Steele HK, Kerr CH, Xu AF, Kim DS, Topkar VV, Choe C, Rothschild D, Tiu GC, Wellington-Oguri R, Fujii K, Sharma E, Watkins AM, Nicol JJ, Romano J, Tunguz B, Diaz F, Cai H, Guo P, Wu J, Meng F, Shi S, Participants E, Dormitzer PR, Solórzano A, Barna M, Das R. Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics. Nat Commun 2022; 13:1536. [PMID: 35318324 PMCID: PMC8940940 DOI: 10.1038/s41467-022-28776-w] [Citation(s) in RCA: 158] [Impact Index Per Article: 52.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 02/07/2022] [Indexed: 02/07/2023] Open
Abstract
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop an RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that highly structured "superfolder" mRNAs can be designed to improve both stability and expression with further enhancement through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.
Collapse
Affiliation(s)
- Kathrin Leppek
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Gun Woo Byeon
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | | | - Craig H Kerr
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Adele F Xu
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Do Soon Kim
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - Ved V Topkar
- Program in Biophysics, Stanford University, Stanford, CA, 94305, USA
| | - Christian Choe
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Daphna Rothschild
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Gerald C Tiu
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | | | - Kotaro Fujii
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Eesha Sharma
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - John J Nicol
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA
| | - Jonathan Romano
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA
- Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, New York, 14260, USA
| | - Bojan Tunguz
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
- NVIDIA Corporation, 2788 San Tomas Expy, Santa Clara, CA, 95051, USA
| | - Fernando Diaz
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Hui Cai
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Pengbo Guo
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Jiewei Wu
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Fanyu Meng
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Shuai Shi
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Eterna Participants
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA
| | - Philip R Dormitzer
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
- GlaxoSmithKline, 1000 Winter St., Waltham, MA, 02453, USA
| | | | - Maria Barna
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA.
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA.
- Program in Biophysics, Stanford University, Stanford, CA, 94305, USA.
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
4
|
Wayment-Steele HK, Kladwang W, Watkins AM, Kim DS, Tunguz B, Reade W, Demkin M, Romano J, Wellington-Oguri R, Nicol JJ, Gao J, Onodera K, Fujikawa K, Mao H, Vandewiele G, Tinti M, Steenwinckel B, Ito T, Noumi T, He S, Ishi K, Lee Y, Öztürk F, Chiu KY, Öztürk E, Amer K, Fares M, Das R. Deep learning models for predicting RNA degradation via dual crowdsourcing. NAT MACH INTELL 2022; 4:1174-1184. [PMID: 36567960 PMCID: PMC9771809 DOI: 10.1038/s42256-022-00571-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 10/21/2022] [Indexed: 12/16/2022]
Abstract
Medicines based on messenger RNA (mRNA) hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition ('Stanford OpenVaccine') on Kaggle, involving single-nucleotide resolution measurements on 6,043 diverse 102-130-nucleotide RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504-1,588 nucleotides) with improved accuracy compared with previously published models. These results indicate that such models can represent in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for dataset creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales.
Collapse
Affiliation(s)
- Hannah K. Wayment-Steele
- grid.168010.e0000000419368956Department of Chemistry, Stanford University, Stanford, CA USA ,grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA
| | - Wipapat Kladwang
- grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA ,grid.168010.e0000000419368956Department of Biochemistry, Stanford University, Stanford, CA USA
| | - Andrew M. Watkins
- grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA ,grid.168010.e0000000419368956Department of Biochemistry, Stanford University, Stanford, CA USA ,grid.418158.10000 0004 0534 4718Prescient Design, Genentech, San Francisco, CA USA
| | - Do Soon Kim
- grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA ,grid.168010.e0000000419368956Department of Biochemistry, Stanford University, Stanford, CA USA
| | - Bojan Tunguz
- grid.168010.e0000000419368956Department of Biochemistry, Stanford University, Stanford, CA USA ,grid.451133.10000 0004 0458 4453NVIDIA Corporation, Santa Clara, CA USA
| | | | | | - Jonathan Romano
- grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA ,grid.168010.e0000000419368956Department of Biochemistry, Stanford University, Stanford, CA USA ,grid.273335.30000 0004 1936 9887Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, NY USA
| | | | - John J. Nicol
- grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA
| | | | | | | | | | - Gilles Vandewiele
- grid.5342.00000 0001 2069 7798IDLab, Ghent University, Technologiepark-Zwijnaarde, Gent, Belgium
| | - Michele Tinti
- grid.8241.f0000 0004 0397 2876The Wellcome Centre for Anti-Infectives Research, College of Life Sciences, University of Dundee, Dundee, UK
| | - Bram Steenwinckel
- grid.5342.00000 0001 2069 7798IDLab, Ghent University, Technologiepark-Zwijnaarde, Gent, Belgium
| | | | - Taiga Noumi
- grid.497111.b0000 0004 0570 906XKeyence Corporation, 1-3-14, Higashi-Nakajima, Higashi-Yodogawa-ku, Osaka, Japan
| | - Shujun He
- grid.264756.40000 0004 4687 2082Department of Chemical Engineering, Texas A&M University, College Station, TX USA
| | | | - Youhan Lee
- grid.418964.60000 0001 0742 3338Korea Atomic Energy Research Institute, Daejeon, Republic of Korea ,Kakao Brain Corp, Seongnam, Gyeonggi-do Republic of Korea
| | | | | | | | - Karim Amer
- grid.440877.80000 0004 0377 5987Center for Informatics Science, Nile University, Sheikh Zayed, Giza, Egypt
| | - Mohamed Fares
- grid.440877.80000 0004 0377 5987Center for Informatics Science, Nile University, Sheikh Zayed, Giza, Egypt ,grid.419725.c0000 0001 2151 8157National Research Centre, Dokki, Cairo, Egypt
| | | | - Rhiju Das
- grid.497584.30000 0004 6761 3573Eterna Massive Open Laboratory, Stanford, CA USA ,grid.168010.e0000000419368956Department of Biochemistry, Stanford University, Stanford, CA USA ,grid.168010.e0000000419368956Howard Hughes Medical Institute, Stanford University, Stanford, CA USA
| |
Collapse
|
5
|
Leppek K, Byeon GW, Kladwang W, Wayment-Steele HK, Kerr CH, Xu AF, Kim DS, Topkar VV, Choe C, Rothschild D, Tiu GC, Wellington-Oguri R, Fujii K, Sharma E, Watkins AM, Nicol JJ, Romano J, Tunguz B, Participants E, Barna M, Das R. Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.03.29.437587. [PMID: 33821271 PMCID: PMC8020971 DOI: 10.1101/2021.03.29.437587] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop a new RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that "superfolder" mRNAs can be designed to improve both stability and expression that are further enhanced through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.
Collapse
Affiliation(s)
- Kathrin Leppek
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Gun Woo Byeon
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, California 94305, USA
| | | | - Craig H Kerr
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Adele F Xu
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Do Soon Kim
- Department of Biochemistry, Stanford University, California 94305, USA
| | - Ved V Topkar
- Program in Biophysics, Stanford University, Stanford, California 94305, USA
| | - Christian Choe
- Department of Bioengineering, Stanford University, Stanford, California 94305, USA
| | - Daphna Rothschild
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Gerald C Tiu
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | | | - Kotaro Fujii
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Eesha Sharma
- Department of Biochemistry, Stanford University, California 94305, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University, California 94305, USA
| | | | - Jonathan Romano
- Eterna Massive Open Laboratory
- Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, New York, 14260, USA
| | - Bojan Tunguz
- Department of Biochemistry, Stanford University, California 94305, USA
| | | | - Maria Barna
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, California 94305, USA
| |
Collapse
|
6
|
Kladwang W, Topkar VV, Liu B, Rangan R, Hodges TL, Keane SC, Al-Hashimi H, Das R. Anomalous Reverse Transcription through Chemical Modifications in Polyadenosine Stretches. Biochemistry 2020; 59:2154-2170. [PMID: 32407625 DOI: 10.1021/acs.biochem.0c00020] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Thermostable reverse transcriptases are workhorse enzymes underlying nearly all modern techniques for RNA structure mapping and for the transcriptome-wide discovery of RNA chemical modifications. Despite their wide use, these enzymes' behaviors at chemical modified nucleotides remain poorly understood. Wellington-Oguri et al. recently reported an apparent loss of chemical modification within putatively unstructured polyadenosine stretches modified by dimethyl sulfate or 2' hydroxyl acylation, as probed by reverse transcription. Here, reanalysis of these and other publicly available data, capillary electrophoresis experiments on chemically modified RNAs, and nuclear magnetic resonance spectroscopy on (A)12 and variants show that this effect is unlikely to arise from an unusual structure of polyadenosine. Instead, tests of different reverse transcriptases on chemically modified RNAs and molecules synthesized with single 1-methyladenosines implicate a previously uncharacterized reverse transcriptase behavior: near-quantitative bypass through chemical modifications within polyadenosine stretches. All tested natural and engineered reverse transcriptases (MMLV; SuperScript II, III, and IV; TGIRT-III; and MarathonRT) exhibit this anomalous bypass behavior. Accurate DMS-guided structure modeling of the polyadenylated HIV-1 3' untranslated region requires taking into account this anomaly. Our results suggest that poly(rA-dT) hybrid duplexes can trigger an unexpectedly effective reverse transcriptase bypass and that chemical modifications in mRNA poly(A) tails may be generally undercounted.
Collapse
Affiliation(s)
- Wipapat Kladwang
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, United States
| | - Ved V Topkar
- Biophysics Program, Stanford University, Stanford, California 94305, United States
| | - Bei Liu
- Department of Biochemistry, Duke University School of Medicine, Durham, North Carolina 27710, United States
| | - Ramya Rangan
- Biophysics Program, Stanford University, Stanford, California 94305, United States
| | - Tracy L Hodges
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Sarah C Keane
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States.,Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Hashim Al-Hashimi
- Department of Biochemistry, Duke University School of Medicine, Durham, North Carolina 27710, United States.,Department of Chemistry, Duke University School of Medicine, Durham, North Carolina 27710, United States
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, United States.,Biophysics Program, Stanford University, Stanford, California 94305, United States.,Department of Physics, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
7
|
Wellington-Oguri R, Fisker E, Zada M, Wiley M, Townley J, Players E. Evidence of an Unusual Poly(A) RNA Signature Detected by High-Throughput Chemical Mapping. Biochemistry 2020; 59:2041-2046. [PMID: 32412236 DOI: 10.1021/acs.biochem.0c00215] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Homopolymeric adenosine RNA plays numerous roles in both cells and noncellular genetic material. We report herein an unusual poly(A) signature in chemical mapping data generated by the Eterna Massive Open Laboratory. Poly(A) sequences of length seven or more show unexpected results in the selective 2'-hydroxyl acylation read out by primer extension (SHAPE) and dimethyl sulfate (DMS) chemical probing. This unusual signature first appears in poly(A) sequences of length seven and grows to its maximum strength at length ∼10. In a long poly(A) sequence, the substitution of a single A by any other nucleotide disrupts the signature, but only for the 6 or so nucleotides on the 5' side of the substitution.
Collapse
Affiliation(s)
- Roger Wellington-Oguri
- Eterna Massive Open Laboratory, Stanford University, B400 Beckman Center, Stanford, California 93405, United States
| | - Eli Fisker
- Eterna Massive Open Laboratory, Stanford University, B400 Beckman Center, Stanford, California 93405, United States
| | - Mathew Zada
- Eterna Massive Open Laboratory, Stanford University, B400 Beckman Center, Stanford, California 93405, United States
| | - Michelle Wiley
- Eterna Massive Open Laboratory, Stanford University, B400 Beckman Center, Stanford, California 93405, United States
| | - Jill Townley
- Eterna Massive Open Laboratory, Stanford University, B400 Beckman Center, Stanford, California 93405, United States
| | - Eterna Players
- Eterna Massive Open Laboratory, Stanford University, B400 Beckman Center, Stanford, California 93405, United States
| |
Collapse
|
8
|
Yu B, Lu Y, Zhang QC, Hou L. Prediction and differential analysis of RNA secondary structure. QUANTITATIVE BIOLOGY 2020. [DOI: 10.1007/s40484-020-0205-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
9
|
Li X, Liang QX, Lin JR, Peng J, Yang JH, Yi C, Yu Y, Zhang QC, Zhou KR. Epitranscriptomic technologies and analyses. SCIENCE CHINA-LIFE SCIENCES 2020; 63:501-515. [DOI: 10.1007/s11427-019-1658-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 02/12/2020] [Indexed: 01/28/2023]
|
10
|
Abstract
RNA viruses encode the information required to usurp cellular metabolism and gene regulation and to enable their own replication in two ways: in the linear sequence of their RNA genomes and in higher-order structures that form when the genomic RNA strand folds back on itself. Application of high-resolution SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) structure probing to viral RNA genomes has identified numerous new regulatory elements, defined new principles by which viral RNAs interact with the cellular host and evade host immune responses, and revealed relationships between virus evolution and RNA structure. This review summarizes our current understanding of genome structure-function interrelationships for RNA viruses, as informed by SHAPE structure probing, and outlines opportunities for future studies.
Collapse
Affiliation(s)
- Mark A Boerneke
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA; , ,
| | - Jeffrey E Ehrhardt
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA; , ,
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA; , ,
| |
Collapse
|
11
|
Abstract
RNA performs and regulates a diverse range of cellular processes, with new functional roles being uncovered at a rapid pace. Interest is growing in how these functions are linked to RNA structures that form in the complex cellular environment. A growing suite of technologies that use advances in RNA structural probes, high-throughput sequencing and new computational approaches to interrogate RNA structure at unprecedented throughput are beginning to provide insights into RNA structures at new spatial, temporal and cellular scales.
Collapse
Affiliation(s)
- Eric J Strobel
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| | - Angela M Yu
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
12
|
Busan S, Weidmann CA, Sengupta A, Weeks KM. Guidelines for SHAPE Reagent Choice and Detection Strategy for RNA Structure Probing Studies. Biochemistry 2019; 58:2655-2664. [PMID: 31117385 DOI: 10.1021/acs.biochem.8b01218] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Chemical probing is an important tool for characterizing the complex folded structures of RNA molecules, many of which play key cellular roles. Electrophilic SHAPE reagents create adducts at the 2'-hydroxyl position on the RNA backbone of flexible ribonucleotides with relatively little dependence on nucleotide identity. Strategies for adduct detection such as mutational profiling (MaP) allow accurate, automated calculation of relative adduct frequencies for each nucleotide in a given RNA or group of RNAs. A number of alternative reagents and adduct detection strategies have been proposed, especially for use in living cells. Here we evaluate five SHAPE reagents: three previously well-validated reagents 1M7 (1-methyl-7-nitroisatoic anhydride), 1M6 (1-methyl-6-nitroisatoic anhydride), and NMIA ( N-methylisatoic anhydride), one more recently proposed NAI (2-methylnicotinic acid imidazolide), and one novel reagent 5NIA (5-nitroisatoic anhydride). We clarify the importance of carefully designed software in reading out SHAPE experiments using massively parallel sequencing approaches. We examine SHAPE modification in living cells in diverse cell lines, compare MaP and reverse transcription-truncation as SHAPE adduct detection strategies, make recommendations for SHAPE reagent choice, and outline areas for future development.
Collapse
Affiliation(s)
- Steven Busan
- Department of Chemistry , University of North Carolina , Chapel Hill , North Carolina 27599-3290 , United States
| | - Chase A Weidmann
- Department of Chemistry , University of North Carolina , Chapel Hill , North Carolina 27599-3290 , United States
| | - Arnab Sengupta
- Department of Chemistry , University of North Carolina , Chapel Hill , North Carolina 27599-3290 , United States
| | - Kevin M Weeks
- Department of Chemistry , University of North Carolina , Chapel Hill , North Carolina 27599-3290 , United States
| |
Collapse
|
13
|
Prediction of secondary and tertiary structures of human BC200 RNA (BCYRN1) based on experimental and bioinformatic cross-validation. Biochem J 2018; 475:2727-2748. [PMID: 30072491 DOI: 10.1042/bcj20180239] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Revised: 07/25/2018] [Accepted: 08/01/2018] [Indexed: 11/17/2022]
Abstract
Based on experimental and bioinformatic approaches, we present the first empirically established complete secondary structure of human BC200 RNA. BC200 RNA is a brain-specific non-messenger RNA with a confirmed regulatory role in dendritic translation in neurons. Although the involvement of human BC200 RNA in various types of tumour and Alzheimer's disease has been repeatedly confirmed, the exact secondary structure remains not fully elucidated. To determine the secondary structure of BC200 RNA in vitro, we performed partial hydrolysis with sequence-specific nucleases and lead-induced cleavage. We also examined the availabilities of putative single-stranded regions and base-pairing interactions via specific DNAzymes and RNase H assay. To determine the complete spatial folding of BC200 RNA, we used experimental data as constraints in structure prediction programs and performed a comparison of results obtained by several algorithms using different criteria. Based on the experimental-derived secondary structure of BC200 RNA, we also predicted the tertiary structure of BC200 RNA. The presented combination of experimental and bioinformatic approaches not only enabled the determination of the most reliable secondary and tertiary structures of human BC200 RNA (largely in agreement with the previous phylogenetic model), but also verified the compatibility and potential disadvantages of utilizing in silico structure prediction programs.
Collapse
|
14
|
Saus E, Willis JR, Pryszcz LP, Hafez A, Llorens C, Himmelbauer H, Gabaldón T. nextPARS: parallel probing of RNA structures in Illumina. RNA (NEW YORK, N.Y.) 2018; 24:609-619. [PMID: 29358234 PMCID: PMC5855959 DOI: 10.1261/rna.063073.117] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 01/03/2018] [Indexed: 06/07/2023]
Abstract
RNA molecules play important roles in virtually every cellular process. These functions are often mediated through the adoption of specific structures that enable RNAs to interact with other molecules. Thus, determining the secondary structures of RNAs is central to understanding their function and evolution. In recent years several sequencing-based approaches have been developed that allow probing structural features of thousands of RNA molecules present in a sample. Here, we describe nextPARS, a novel Illumina-based implementation of in vitro parallel probing of RNA structures. Our approach achieves comparable accuracy to previous implementations, while enabling higher throughput and sample multiplexing.
Collapse
Affiliation(s)
- Ester Saus
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Jesse R Willis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Leszek P Pryszcz
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Ahmed Hafez
- Biotechvana, 46980 Paterna (Valencia), Spain
| | | | - Heinz Himmelbauer
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- Department of Biotechnology, University of Natural Resources and Life Sciences Vienna (BOKU), 1190 Vienna, Austria
| | - Toni Gabaldón
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
| |
Collapse
|
15
|
Ritchey LE, Su Z, Tang Y, Tack DC, Assmann SM, Bevilacqua PC. Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo. Nucleic Acids Res 2017. [PMID: 28637286 PMCID: PMC5737731 DOI: 10.1093/nar/gkx533] [Citation(s) in RCA: 92] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology.
Collapse
Affiliation(s)
- Laura E Ritchey
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA.,Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Zhao Su
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Yin Tang
- Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, PA 16802, USA
| | - David C Tack
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Philip C Bevilacqua
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA.,Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA.,Department of Biochemistry & Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
16
|
RNA structure inference through chemical mapping after accidental or intentional mutations. Proc Natl Acad Sci U S A 2017; 114:9876-9881. [PMID: 28851837 DOI: 10.1073/pnas.1619897114] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Despite the critical roles RNA structures play in regulating gene expression, sequencing-based methods for experimentally determining RNA base pairs have remained inaccurate. Here, we describe a multidimensional chemical-mapping method called "mutate-and-map read out through next-generation sequencing" (M2-seq) that takes advantage of sparsely mutated nucleotides to induce structural perturbations at partner nucleotides and then detects these events through dimethyl sulfate (DMS) probing and mutational profiling. In special cases, fortuitous errors introduced during DNA template preparation and RNA transcription are sufficient to give M2-seq helix signatures; these signals were previously overlooked or mistaken for correlated double-DMS events. When mutations are enhanced through error-prone PCR, in vitro M2-seq experimentally resolves 33 of 68 helices in diverse structured RNAs including ribozyme domains, riboswitch aptamers, and viral RNA domains with a single false positive. These inferences do not require energy minimization algorithms and can be made by either direct visual inspection or by a neural-network-inspired algorithm called M2-net. Measurements on the P4-P6 domain of the Tetrahymena group I ribozyme embedded in Xenopus egg extract demonstrate the ability of M2-seq to detect RNA helices in a complex biological environment.
Collapse
|
17
|
Dawn of the in vivo RNA structurome and interactome. Biochem Soc Trans 2017; 44:1395-1410. [PMID: 27911722 DOI: 10.1042/bst20160075] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Revised: 06/19/2016] [Accepted: 07/04/2016] [Indexed: 12/11/2022]
Abstract
RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases.
Collapse
|
18
|
Abstract
Biological functions of RNA molecules are dependent upon sustained specific three-dimensional (3D) structures of RNA, with or without the help of proteins. Understanding of RNA structure is frequently based on 2D structures, which describe only the Watson-Crick (WC) base pairs. Here, we hierarchically review the structural elements of RNA and how they contribute to RNA 3D structure. We focus our analysis on the non-WC base pairs and on RNA modules. Several computer programs have now been designed to predict RNA modules. We describe the RNA-Puzzles initiative, which is a community-wide, blind assessment of RNA 3D structure prediction programs to determine the capabilities and bottlenecks of current predictions. The assessment metrics used in RNA-Puzzles are briefly described. The detection of RNA 3D modules from sequence data and their automatic implementation belong to the current challenges in RNA 3D structure prediction.
Collapse
Affiliation(s)
- Zhichao Miao
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de Biologie Moléculaire et Cellulaire du CNRS, 67000 Strasbourg, France; ,
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de Biologie Moléculaire et Cellulaire du CNRS, 67000 Strasbourg, France; ,
| |
Collapse
|
19
|
Abstract
The discoveries of myriad non-coding RNA molecules, each transiting through multiple flexible states in cells or virions, present major challenges for structure determination. Advances in high-throughput chemical mapping give new routes for characterizing entire transcriptomes in vivo, but the resulting one-dimensional data generally remain too information-poor to allow accurate de novo structure determination. Multidimensional chemical mapping (MCM) methods seek to address this challenge. Mutate-and-map (M2), RNA interaction groups by mutational profiling (RING-MaP and MaP-2D analysis) and multiplexed •OH cleavage analysis (MOHCA) measure how the chemical reactivities of every nucleotide in an RNA molecule change in response to modifications at every other nucleotide. A growing body of in vitro blind tests and compensatory mutation/rescue experiments indicate that MCM methods give consistently accurate secondary structures and global tertiary structures for ribozymes, ribosomal domains and ligand-bound riboswitch aptamers up to 200 nucleotides in length. Importantly, MCM analyses provide detailed information on structurally heterogeneous RNA states, such as ligand-free riboswitches that are functionally important but difficult to resolve with other approaches. The sequencing requirements of currently available MCM protocols scale at least quadratically with RNA length, precluding general application to transcriptomes or viral genomes at present. We propose a modify-cross-link-map (MXM) expansion to overcome this and other current limitations to resolving the in vivo 'RNA structurome'.
Collapse
|
20
|
Choudhary K, Deng F, Aviran S. Comparative and integrative analysis of RNA structural profiling data: current practices and emerging questions. QUANTITATIVE BIOLOGY 2017; 5:3-24. [PMID: 28717530 PMCID: PMC5510538 DOI: 10.1007/s40484-017-0093-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 12/08/2016] [Accepted: 12/15/2016] [Indexed: 12/30/2022]
Abstract
BACKGROUND Structure profiling experiments provide single-nucleotide information on RNA structure. Recent advances in chemistry combined with application of high-throughput sequencing have enabled structure profiling at transcriptome scale and in living cells, creating unprecedented opportunities for RNA biology. Propelled by these experimental advances, massive data with ever-increasing diversity and complexity have been generated, which give rise to new challenges in interpreting and analyzing these data. RESULTS We review current practices in analysis of structure profiling data with emphasis on comparative and integrative analysis as well as highlight emerging questions. Comparative analysis has revealed structural patterns across transcriptomes and has become an integral component of recent profiling studies. Additionally, profiling data can be integrated into traditional structure prediction algorithms to improve prediction accuracy. CONCLUSIONS To keep pace with experimental developments, methods to facilitate, enhance and refine such analyses are needed. Parallel advances in analysis methodology will complement profiling technologies and help them reach their full potential.
Collapse
Affiliation(s)
| | | | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA 95616, USA
| |
Collapse
|
21
|
Incarnato D, Oliviero S. The RNA Epistructurome: Uncovering RNA Function by Studying Structure and Post-Transcriptional Modifications. Trends Biotechnol 2016; 35:318-333. [PMID: 27988057 DOI: 10.1016/j.tibtech.2016.11.002] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2016] [Revised: 11/11/2016] [Accepted: 11/21/2016] [Indexed: 01/15/2023]
Abstract
A large fraction of higher metazoan genomes transcribe RNA molecules whose functions extend far beyond carrying instructions for protein synthesis. Although RNA is apparently a simple molecule, the ways in which it performs many of its functions have remained highly elusive for decades. As learned from studying ribosomal and transfer RNAs, two of the key features influencing the function of RNA are its structure and post-transcriptional modifications. A deep understanding of RNA function therefore requires rapid and straightforward approaches to study the complex and intricate landscape of RNA structures and modifications. In this review we summarize and discuss the most recent methods and findings in the field of RNA biology, with an eye toward new frontiers and open questions.
Collapse
Affiliation(s)
- Danny Incarnato
- Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Torino, Italy; Dipartimento di Scienze della Vita e Biologia dei Sistemi, Università di Torino, Via Accademia Albertina 13, Torino, Italy.
| | - Salvatore Oliviero
- Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Torino, Italy; Dipartimento di Scienze della Vita e Biologia dei Sistemi, Università di Torino, Via Accademia Albertina 13, Torino, Italy.
| |
Collapse
|
22
|
Choudhary K, Shih NP, Deng F, Ledda M, Li B, Aviran S. Metrics for rapid quality control in RNA structure probing experiments. Bioinformatics 2016; 32:3575-3583. [PMID: 27497441 DOI: 10.1093/bioinformatics/btw501] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Revised: 07/02/2016] [Accepted: 07/26/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The diverse functionalities of RNA can be attributed to its capacity to form complex and varied structures. The recent proliferation of new structure probing techniques coupled with high-throughput sequencing has helped RNA studies expand in both scope and depth. Despite differences in techniques, most experiments face similar challenges in reproducibility due to the stochastic nature of chemical probing and sequencing. As these protocols expand to transcriptome-wide studies, quality control becomes a more daunting task. General and efficient methodologies are needed to quantify variability and quality in the wide range of current and emerging structure probing experiments. RESULTS We develop metrics to rapidly and quantitatively evaluate data quality from structure probing experiments, demonstrating their efficacy on both small synthetic libraries and transcriptome-wide datasets. We use a signal-to-noise ratio concept to evaluate replicate agreement, which has the capacity to identify high-quality data. We also consider and compare two methods to assess variability inherent in probing experiments, which we then utilize to evaluate the coverage adjustments needed to meet desired quality. The developed metrics and tools will be useful in summarizing large-scale datasets and will help standardize quality control in the field. AVAILABILITY AND IMPLEMENTATION The data and methods used in this article are freely available at: http://bme.ucdavis.edu/aviranlab/SPEQC_software CONTACT: saviran@ucdavis.eduSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Nathan P Shih
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Fei Deng
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Mirko Ledda
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Bo Li
- Center for RNA Systems Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| |
Collapse
|
23
|
Watters KE, Yu AM, Strobel EJ, Settle AH, Lucks JB. Characterizing RNA structures in vitro and in vivo with selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq). Methods 2016; 103:34-48. [PMID: 27064082 DOI: 10.1016/j.ymeth.2016.04.002] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Revised: 04/01/2016] [Accepted: 04/04/2016] [Indexed: 01/08/2023] Open
Abstract
RNA molecules adopt a wide variety of structures that perform many cellular functions, including, among others, catalysis, small molecule sensing, and cellular defense. Our ability to characterize, predict, and design RNA structures are key factors for understanding and controlling the biological roles of RNAs. Fortunately, there has been rapid progress in this area, especially with respect to experimental methods that can characterize RNA structures in a high throughput fashion using chemical probing and next-generation sequencing. Here, we describe one such method, selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), which measures nucleotide resolution flexibility information for RNAs in vitro and in vivo. We outline the process of designing and performing a SHAPE-Seq experiment and describe methods for using experimental SHAPE-Seq data to restrain computational folding algorithms to generate more accurate predictions of RNA secondary structure. We also provide a number of examples of SHAPE-Seq reactivity spectra obtained in vitro and in vivo and discuss important considerations for performing SHAPE-Seq experiments, both in terms of collecting and analyzing data. Finally, we discuss improvements and extensions of these experimental and computational techniques that promise to deepen our knowledge of RNA folding and function.
Collapse
Affiliation(s)
- Kyle E Watters
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States
| | - Angela M Yu
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States; Tri-Institutional Program in Computational Biology and Medicine, Cornell University, Ithaca, New York, Weill Cornell Medical College, New York, New York, Memorial Sloan-Kettering Cancer Center, New York, New York, United States; Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, United States
| | - Eric J Strobel
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States
| | - Alex H Settle
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States
| | - Julius B Lucks
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States.
| |
Collapse
|
24
|
Anderson-Lee J, Fisker E, Kosaraju V, Wu M, Kong J, Lee J, Lee M, Zada M, Treuille A, Das R. Principles for Predicting RNA Secondary Structure Design Difficulty. J Mol Biol 2016; 428:748-757. [PMID: 26902426 PMCID: PMC4833017 DOI: 10.1016/j.jmb.2015.11.013] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2015] [Revised: 11/04/2015] [Accepted: 11/10/2015] [Indexed: 11/27/2022]
Abstract
Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications.
Collapse
Affiliation(s)
| | | | - Vineet Kosaraju
- Eterna Massive Open Laboratory; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - Michelle Wu
- Eterna Massive Open Laboratory; Program in Biomedical Informatics, Stanford University, Stanford, CA 94305, USA
| | - Justin Kong
- Eterna Massive Open Laboratory; Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Jeehyung Lee
- Eterna Massive Open Laboratory; Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Minjae Lee
- Eterna Massive Open Laboratory; Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | | | - Adrien Treuille
- Eterna Massive Open Laboratory; Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Rhiju Das
- Eterna Massive Open Laboratory; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA; Department of Physics, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
25
|
Silverman IM, Berkowitz ND, Gosai SJ, Gregory BD. Genome-Wide Approaches for RNA Structure Probing. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 907:29-59. [PMID: 27256381 DOI: 10.1007/978-3-319-29073-7_2] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
RNA molecules of all types fold into complex secondary and tertiary structures that are important for their function and regulation. Structural and catalytic RNAs such as ribosomal RNA (rRNA) and transfer RNA (tRNA) are central players in protein synthesis, and only function through their proper folding into intricate three-dimensional structures. Studies of messenger RNA (mRNA) regulation have also revealed that structural elements embedded within these RNA species are important for the proper regulation of their total level in the transcriptome. More recently, the discovery of microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) has shed light on the importance of RNA structure to genome, transcriptome, and proteome regulation. Due to the relatively small number, high conservation, and importance of structural and catalytic RNAs to all life, much early work in RNA structure analysis mapped out a detailed view of these molecules. Computational and physical methods were used in concert with enzymatic and chemical structure probing to create high-resolution models of these fundamental biological molecules. However, the recent expansion in our knowledge of the importance of RNA structure to coding and regulatory RNAs has left the field in need of faster and scalable methods for high-throughput structural analysis. To address this, nuclease and chemical RNA structure probing methodologies have been adapted for genome-wide analysis. These methods have been deployed to globally characterize thousands of RNA structures in a single experiment. Here, we review these experimental methodologies for high-throughput RNA structure determination and discuss the insights gained from each approach.
Collapse
Affiliation(s)
- Ian M Silverman
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA.,Cell and Molecular Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Nathan D Berkowitz
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA.,Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Sager J Gosai
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Brian D Gregory
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA. .,Cell and Molecular Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA. .,Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
26
|
Abstract
In recent years a wide variety of RNA molecules regulating fundamental cellular processes has been discovered. Therefore, RNA structure determination is experiencing a boost and many more RNA structures are likely to be determined in the years to come. The broader availability of experimentally determined RNA structures implies that molecular replacement (MR) will be used more and more frequently as a method for phasing future crystallographic structures. In this report we describe various aspects relative to RNA structure determination by MR. First, we describe how to select and create MR search models for nucleic acids. Second, we describe how to perform MR searches on RNA using available crystallographic software. Finally, we describe how to refine and interpret the successful MR solutions. These protocols are applicable to determine novel RNA structures as well as to establish structural-functional relationships on existing RNA structures.
Collapse
|
27
|
Bioinformatics tools for lncRNA research. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2016; 1859:23-30. [DOI: 10.1016/j.bbagrm.2015.07.014] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 07/07/2015] [Accepted: 07/14/2015] [Indexed: 12/28/2022]
|
28
|
Abstract
Long non-coding RNAs (lncRNAs) are a class of RNA molecules that are changing how researchers view eukaryotic gene regulation. Once considered to be non-functional products of low-level aberrant transcription from non-coding regions of the genome, lncRNAs are now viewed as important epigenetic regulators and several lncRNAs have now been demonstrated to be critical players in the development and/or maintenance of cancer. Similarly, the emerging variety of interactions between lncRNAs and MYC, a well-known oncogenic transcription factor linked to most types of cancer, have caught the attention of many biomedical researchers. Investigations exploring the dynamic interactions between lncRNAs and MYC, referred to as the lncRNA-MYC network, have proven to be especially complex. Genome-wide studies have shown that MYC transcriptionally regulates many lncRNA genes. Conversely, recent reports identified lncRNAs that regulate MYC expression both at the transcriptional and post-transcriptional levels. These findings are of particular interest because they suggest roles of lncRNAs as regulators of MYC oncogenic functions and the possibility that targeting lncRNAs could represent a novel avenue to cancer treatment. Here, we briefly review the current understanding of how lncRNAs regulate chromatin structure and gene transcription, and then focus on the new developments in the emerging field exploring the lncRNA-MYC network in cancer.
Collapse
Affiliation(s)
- Michael J. Hamilton
- Department of Biochemistry, University of California, Riverside, CA 92521, USA
| | - Matthew D. Young
- Department of Biochemistry, University of California, Riverside, CA 92521, USA
| | - Silvia Sauer
- Department of Biochemistry, University of California, Riverside, CA 92521, USA
| | - Ernest Martinez
- Department of Biochemistry, University of California, Riverside, CA 92521, USA
| |
Collapse
|
29
|
Cordero P, Das R. Rich RNA Structure Landscapes Revealed by Mutate-and-Map Analysis. PLoS Comput Biol 2015; 11:e1004473. [PMID: 26566145 PMCID: PMC4643908 DOI: 10.1371/journal.pcbi.1004473] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Accepted: 07/20/2015] [Indexed: 11/19/2022] Open
Abstract
Landscapes exhibiting multiple secondary structures arise in natural RNA molecules that modulate gene expression, protein synthesis, and viral infection [corrected]. We report herein that high-throughput chemical experiments can isolate an RNA's multiple alternative secondary structures as they are stabilized by systematic mutagenesis (mutate-and-map, M2) and that a computational algorithm, REEFFIT, enables unbiased reconstruction of these states' structures and populations. In an in silico benchmark on non-coding RNAs with complex landscapes, M2-REEFFIT recovers 95% of RNA helices present with at least 25% population while maintaining a low false discovery rate (10%) and conservative error estimates. In experimental benchmarks, M2-REEFFIT recovers the structure landscapes of a 35-nt MedLoop hairpin, a 110-nt 16S rRNA four-way junction with an excited state, a 25-nt bistable hairpin, and a 112-nt three-state adenine riboswitch with its expression platform, molecules whose characterization previously required expert mutational analysis and specialized NMR or chemical mapping experiments. With this validation, M2-REEFFIT enabled tests of whether artificial RNA sequences might exhibit complex landscapes in the absence of explicit design. An artificial flavin mononucleotide riboswitch and a randomly generated RNA sequence are found to interconvert between three or more states, including structures for which there was no design, but that could be stabilized through mutations. These results highlight the likely pervasiveness of rich landscapes with multiple secondary structures in both natural and artificial RNAs and demonstrate an automated chemical/computational route for their empirical characterization.
Collapse
Affiliation(s)
- Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, California, United States of America
- Biochemistry Department, Stanford University, Stanford, California, United States of America
| | - Rhiju Das
- Biomedical Informatics Program, Stanford University, Stanford, California, United States of America
- Biochemistry Department, Stanford University, Stanford, California, United States of America
- Physics Department, Stanford University, Stanford, California, United States of America
| |
Collapse
|
30
|
Cheng CY, Chou FC, Kladwang W, Tian S, Cordero P, Das R. Consistent global structures of complex RNA states through multidimensional chemical mapping. eLife 2015; 4:e07600. [PMID: 26035425 PMCID: PMC4495719 DOI: 10.7554/elife.07600] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 06/02/2015] [Indexed: 11/13/2022] Open
Abstract
Accelerating discoveries of non-coding RNA (ncRNA) in myriad biological processes pose major challenges to structural and functional analysis. Despite progress in secondary structure modeling, high-throughput methods have generally failed to determine ncRNA tertiary structures, even at the 1-nm resolution that enables visualization of how helices and functional motifs are positioned in three dimensions. We report that integrating a new method called MOHCA-seq (Multiplexed •OHCleavage Analysis with paired-end sequencing) with mutate-and-map secondary structure inference guides Rosetta 3D modeling to consistent 1-nm accuracy for intricately folded ncRNAs with lengths up to 188 nucleotides, including a blind RNA-puzzle challenge, the lariat-capping ribozyme. This multidimensional chemical mapping (MCM) pipeline resolves unexpected tertiary proximities for cyclic-di-GMP, glycine, and adenosylcobalamin riboswitch aptamers without their ligands and a loose structure for the recently discovered human HoxA9D internal ribosome entry site regulon. MCM offers a sequencing-based route to uncovering ncRNA 3D structure, applicable to functionally important but potentially heterogeneous states. DOI:http://dx.doi.org/10.7554/eLife.07600.001 Our genetic material, in the form of molecules of DNA, provides instructions for many different processes in our cells. To issue these instructions, particular sections of DNA are copied to make a type of molecule called ribonucleic acid (RNA). Some of these RNA molecules contain instructions to make proteins, but others—known as non-coding RNAs—regulate the activity of genes in cells. The genetic information within RNA is encoded by the sequence of four different chemical parts called ‘nucleotides’. RNA can exist as a single strand of nucleotides, but the nucleotides can also pair up in specific combinations to form sections of double-stranded RNA. Therefore, a single strand of non-coding RNA can fold into a complex three-dimensional shape that contains loops, twists, and bulges. The three-dimensional structures of non-coding RNAs are crucial for their roles in cells, but the variety and complexity of shapes that they can form makes it technically difficult to study them. In 2008, researchers developed a new method called MOHCA that can map the positions of nucleotides that are close together in the three-dimensional structure. Highly reactive chemicals are attached to the nucleotides and these can react with, and damage, other nearby nucleotides. By detecting which nucleotides have been damaged, it is possible to map the positions of these nucleotides and decipher the structure of the RNA molecule using computer algorithms. MOHCA is a promising approach, but the initial methods to find the damaged nucleotides were tedious and required specialized equipment. Now, Cheng, Das et al.—including some of the researchers involved in the 2008 work—have developed an improved version of MOHCA that uses readily available RNA sequencing techniques to find the damaged nucleotides. The RNA sequencing data are then analyzed by a new algorithm in the Rosetta computer modeling software. Cheng, Das et al. used this newly developed ‘MOHCA-seq’ and Rosetta to reveal the structures of a human non-coding RNA and several other non-coding RNA molecules to a much higher level of detail than before. Together, MOHCA-seq and Rosetta provide a rapid method for researchers to decipher the three-dimensional structure of non-coding RNAs. This method is likely to speed up the analysis of the complex structures of non-coding RNAs. It will be useful in future efforts to work out what roles these RNAs play in cells, including their activity in cancer, neurodegeneration, and other diseases. DOI:http://dx.doi.org/10.7554/eLife.07600.002
Collapse
Affiliation(s)
- Clarence Yu Cheng
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, United States
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, United States
| |
Collapse
|
31
|
Miao Z, Adamiak RW, Blanchet MF, Boniecki M, Bujnicki JM, Chen SJ, Cheng C, Chojnowski G, Chou FC, Cordero P, Cruz JA, Ferré-D'Amaré AR, Das R, Ding F, Dokholyan NV, Dunin-Horkawicz S, Kladwang W, Krokhotin A, Lach G, Magnus M, Major F, Mann TH, Masquida B, Matelska D, Meyer M, Peselis A, Popenda M, Purzycka KJ, Serganov A, Stasiewicz J, Szachniuk M, Tandon A, Tian S, Wang J, Xiao Y, Xu X, Zhang J, Zhao P, Zok T, Westhof E. RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures. RNA (NEW YORK, N.Y.) 2015; 21:1066-84. [PMID: 25883046 PMCID: PMC4436661 DOI: 10.1261/rna.049502.114] [Citation(s) in RCA: 141] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2015] [Accepted: 02/12/2015] [Indexed: 05/04/2023]
Abstract
This paper is a report of a second round of RNA-Puzzles, a collective and blind experiment in three-dimensional (3D) RNA structure prediction. Three puzzles, Puzzles 5, 6, and 10, represented sequences of three large RNA structures with limited or no homology with previously solved RNA molecules. A lariat-capping ribozyme, as well as riboswitches complexed to adenosylcobalamin and tRNA, were predicted by seven groups using RNAComposer, ModeRNA/SimRNA, Vfold, Rosetta, DMD, MC-Fold, 3dRNA, and AMBER refinement. Some groups derived models using data from state-of-the-art chemical-mapping methods (SHAPE, DMS, CMCT, and mutate-and-map). The comparisons between the predictions and the three subsequently released crystallographic structures, solved at diffraction resolutions of 2.5-3.2 Å, were carried out automatically using various sets of quality indicators. The comparisons clearly demonstrate the state of present-day de novo prediction abilities as well as the limitations of these state-of-the-art methods. All of the best prediction models have similar topologies to the native structures, which suggests that computational methods for RNA structure prediction can already provide useful structural information for biological problems. However, the prediction accuracy for non-Watson-Crick interactions, key to proper folding of RNAs, is low and some predicted models had high Clash Scores. These two difficulties point to some of the continuing bottlenecks in RNA structure prediction. All submitted models are available for download at http://ahsoka.u-strasbg.fr/rnapuzzles/.
Collapse
Affiliation(s)
- Zhichao Miao
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 67000 Strasbourg, France
| | - Ryszard W Adamiak
- Department of Structural Chemistry and Biology of Nucleic Acids, Structural Chemistry of Nucleic Acids Laboratory, Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Marc-Frédérick Blanchet
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec, Canada H3C 3J7
| | - Michal Boniecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Shi-Jie Chen
- Department of Physics and Astronomy, Department of Biochemistry, and Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Clarence Cheng
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Grzegorz Chojnowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Fang-Chieh Chou
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Pablo Cordero
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - José Almeida Cruz
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 67000 Strasbourg, France
| | | | - Rhiju Das
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Feng Ding
- Department of Physics and Astronomy, College of Engineering and Science, Clemson University, Clemson, South Carolina 29634, USA
| | - Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Stanislaw Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Wipapat Kladwang
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Andrey Krokhotin
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Grzegorz Lach
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Marcin Magnus
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - François Major
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec, Canada H3C 3J7
| | - Thomas H Mann
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Benoît Masquida
- Génétique Moléculaire Génomique Microbiologie, Institut de physiologie et de la chimie biologique, 67084 Strasbourg, France
| | - Dorota Matelska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Mélanie Meyer
- Institut de génétique et de biologie moléculaire et cellulaire, 67400 Strasbourg, France
| | - Alla Peselis
- Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, New York 10016, USA
| | - Mariusz Popenda
- Department of Structural Chemistry and Biology of Nucleic Acids, Structural Chemistry of Nucleic Acids Laboratory, Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Katarzyna J Purzycka
- Department of Structural Chemistry and Biology of Nucleic Acids, Structural Chemistry of Nucleic Acids Laboratory, Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Alexander Serganov
- Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, New York 10016, USA
| | - Juliusz Stasiewicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Marta Szachniuk
- Poznan University of Technology, Institute of Computing Science, 60-965 Poznan, Poland
| | - Arpit Tandon
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Siqi Tian
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Jian Wang
- Department of Physics, Huazhong University of Science and Technology, 430074 Wuhan, China
| | - Yi Xiao
- Department of Physics, Huazhong University of Science and Technology, 430074 Wuhan, China
| | - Xiaojun Xu
- Department of Physics and Astronomy, Department of Biochemistry, and Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Jinwei Zhang
- National Heart, Lung and Blood Institute, Bethesda, Maryland 20892-8012, USA
| | - Peinan Zhao
- Department of Physics and Astronomy, Department of Biochemistry, and Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Tomasz Zok
- Poznan University of Technology, Institute of Computing Science, 60-965 Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 67000 Strasbourg, France
| |
Collapse
|
32
|
Poulsen LD, Kielpinski LJ, Salama SR, Krogh A, Vinther J. SHAPE Selection (SHAPES) enrich for RNA structure signal in SHAPE sequencing-based probing data. RNA (NEW YORK, N.Y.) 2015; 21:1042-52. [PMID: 25805860 PMCID: PMC4408784 DOI: 10.1261/rna.047068.114] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2014] [Accepted: 02/04/2015] [Indexed: 05/24/2023]
Abstract
Selective 2' Hydroxyl Acylation analyzed by Primer Extension (SHAPE) is an accurate method for probing of RNA secondary structure. In existing SHAPE methods, the SHAPE probing signal is normalized to a no-reagent control to correct for the background caused by premature termination of the reverse transcriptase. Here, we introduce a SHAPE Selection (SHAPES) reagent, N-propanone isatoic anhydride (NPIA), which retains the ability of SHAPE reagents to accurately probe RNA structure, but also allows covalent coupling between the SHAPES reagent and a biotin molecule. We demonstrate that SHAPES-based selection of cDNA-RNA hybrids on streptavidin beads effectively removes the large majority of background signal present in SHAPE probing data and that sequencing-based SHAPES data contain the same amount of RNA structure data as regular sequencing-based SHAPE data obtained through normalization to a no-reagent control. Moreover, the selection efficiently enriches for probed RNAs, suggesting that the SHAPES strategy will be useful for applications with high-background and low-probing signal such as in vivo RNA structure probing.
Collapse
Affiliation(s)
- Line Dahl Poulsen
- Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | | | - Sofie R Salama
- Center for Biomolecular Science and Engineering, and Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Anders Krogh
- Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | - Jeppe Vinther
- Department of Biology, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
33
|
The RNA structurome: transcriptome-wide structure probing with next-generation sequencing. Trends Biochem Sci 2015; 40:221-32. [DOI: 10.1016/j.tibs.2015.02.005] [Citation(s) in RCA: 122] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 02/16/2015] [Accepted: 02/17/2015] [Indexed: 01/16/2023]
|
34
|
Kielpinski LJ, Sidiropoulos N, Vinther J. Reproducible Analysis of Sequencing-Based RNA Structure Probing Data with User-Friendly Tools. Methods Enzymol 2015; 558:153-180. [PMID: 26068741 DOI: 10.1016/bs.mie.2015.01.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2022]
Abstract
RNA structure-probing data can improve the prediction of RNA secondary and tertiary structure and allow structural changes to be identified and investigated. In recent years, massive parallel sequencing has dramatically improved the throughput of RNA structure probing experiments, but at the same time also made analysis of the data challenging for scientists without formal training in computational biology. Here, we discuss different strategies for data analysis of massive parallel sequencing-based structure-probing data. To facilitate reproducible and standardized analysis of this type of data, we have made a collection of tools, which allow raw sequencing reads to be converted to normalized probing values using different published strategies. In addition, we also provide tools for visualization of the probing data in the UCSC Genome Browser and for converting RNA coordinates to genomic coordinates and vice versa. The collection is implemented as functions in the R statistical environment and as tools in the Galaxy platform, making them easily accessible for the scientific community. We demonstrate the usefulness of the collection by applying it to the analysis of sequencing-based hydroxyl radical probing data and comparing different normalization strategies.
Collapse
Affiliation(s)
- Lukasz Jan Kielpinski
- Section for RNA and Computational Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Nikolaos Sidiropoulos
- Section for RNA and Computational Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Jeppe Vinther
- Section for RNA and Computational Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
35
|
Ge P, Zhang S. Computational analysis of RNA structures with chemical probing data. Methods 2015; 79-80:60-6. [PMID: 25687190 DOI: 10.1016/j.ymeth.2015.02.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Revised: 01/16/2015] [Accepted: 02/09/2015] [Indexed: 11/28/2022] Open
Abstract
RNAs play various roles, not only as the genetic codes to synthesize proteins, but also as the direct participants of biological functions determined by their underlying high-order structures. Although many computational methods have been proposed for analyzing RNA structures, their accuracy and efficiency are limited, especially when applied to the large RNAs and the genome-wide data sets. Recently, advances in parallel sequencing and high-throughput chemical probing technologies have prompted the development of numerous new algorithms, which can incorporate the auxiliary structural information obtained from those experiments. Their potential has been revealed by the secondary structure prediction of ribosomal RNAs and the genome-wide ncRNA function annotation. In this review, the existing probing-directed computational methods for RNA secondary and tertiary structure analysis are discussed.
Collapse
Affiliation(s)
- Ping Ge
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816-2362, USA
| | - Shaojie Zhang
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816-2362, USA.
| |
Collapse
|
36
|
Abstract
Reliable modeling of RNA tertiary structures is key to both understanding these structures' roles in complex biological machines and to eventually facilitating their design for molecular computing and robotics. In recent years, a concerted effort to improve computational prediction of RNA structure through the RNA-Puzzles blind prediction trials has accelerated advances in the field. Among other approaches, the versatile and expanding Rosetta molecular modeling software now permits modeling of RNAs in the 100-300 nucleotide size range at consistent subhelical (~1 nm) resolution. Our laboratory's current state-of-the-art methods for RNAs in this size range involve Fragment Assembly of RNA with Full-Atom Refinement (FARFAR), which optimizes RNA conformations in the context of a physically realistic energy function, as well as hybrid techniques that leverage experimental data to inform computational modeling. In this chapter, we give a practical guide to our current workflow for modeling RNA three-dimensional structures using FARFAR, including strategies for using data from multidimensional chemical mapping experiments to focus sampling and select accurate conformations.
Collapse
Affiliation(s)
- Clarence Yu Cheng
- Department of Biochemistry, Stanford University, Stanford, California, USA
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University, Stanford, California, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California, USA; Department of Physics, Stanford University, Stanford, California, USA.
| |
Collapse
|
37
|
Sloma MF, Mathews DH. Improving RNA secondary structure prediction with structure mapping data. Methods Enzymol 2015; 553:91-114. [PMID: 25726462 DOI: 10.1016/bs.mie.2014.10.053] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Methods to probe RNA secondary structure, such as small molecule modifying agents, secondary structure-specific nucleases, inline probing, and SHAPE chemistry, are widely used to study the structure of functional RNA. Computational secondary structure prediction programs can incorporate probing data to predict structure with high accuracy. In this chapter, an overview of current methods for probing RNA secondary structure is provided, including modern high-throughput methods. Methods for guiding secondary structure prediction algorithms using these data are explained, and best practices for using these data are provided. This chapter concludes by listing a number of open questions about how to best use probing data, and what these data can provide.
Collapse
Affiliation(s)
- Michael F Sloma
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Box 712, Rochester, New York, USA; Center for RNA Biology, University of Rochester Medical Center, Box 712, Rochester, New York, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Box 712, Rochester, New York, USA; Center for RNA Biology, University of Rochester Medical Center, Box 712, Rochester, New York, USA.
| |
Collapse
|
38
|
Xue S, Tian S, Fujii K, Kladwang W, Das R, Barna M. RNA regulons in Hox 5' UTRs confer ribosome specificity to gene regulation. Nature 2015; 517:33-8. [PMID: 25409156 PMCID: PMC4353651 DOI: 10.1038/nature14010] [Citation(s) in RCA: 239] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Accepted: 10/30/2014] [Indexed: 02/07/2023]
Abstract
Emerging evidence suggests that the ribosome has a regulatory function in directing how the genome is translated in time and space. However, how this regulation is encoded in the messenger RNA sequence remains largely unknown. Here we uncover unique RNA regulons embedded in homeobox (Hox) 5' untranslated regions (UTRs) that confer ribosome-mediated control of gene expression. These structured RNA elements, resembling viral internal ribosome entry sites (IRESs), are found in subsets of Hox mRNAs. They facilitate ribosome recruitment and require the ribosomal protein RPL38 for their activity. Despite numerous layers of Hox gene regulation, these IRES elements are essential for converting Hox transcripts into proteins to pattern the mammalian body plan. This specialized mode of IRES-dependent translation is enabled by an additional regulatory element that we term the translation inhibitory element (TIE), which blocks cap-dependent translation of transcripts. Together, these data uncover a new paradigm for ribosome-mediated control of gene expression and organismal development.
Collapse
Affiliation(s)
- Shifeng Xue
- Department of Developmental Biology, Stanford University, Stanford, California 94305, USA
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
| | - Kotaro Fujii
- Department of Developmental Biology, Stanford University, Stanford, California 94305, USA
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Maria Barna
- Department of Developmental Biology, Stanford University, Stanford, California 94305, USA
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
39
|
Aviran S, Pachter L. Rational experiment design for sequencing-based RNA structure mapping. RNA (NEW YORK, N.Y.) 2014; 20:1864-1877. [PMID: 25332375 PMCID: PMC4238353 DOI: 10.1261/rna.043844.113] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2013] [Accepted: 09/07/2014] [Indexed: 05/30/2023]
Abstract
Structure mapping is a classic experimental approach for determining nucleic acid structure that has gained renewed interest in recent years following advances in chemistry, genomics, and informatics. The approach encompasses numerous techniques that use different means to introduce nucleotide-level modifications in a structure-dependent manner. Modifications are assayed via cDNA fragment analysis, using electrophoresis or next-generation sequencing (NGS). The recent advent of NGS has dramatically increased the throughput, multiplexing capacity, and scope of RNA structure mapping assays, thereby opening new possibilities for genome-scale, de novo, and in vivo studies. From an informatics standpoint, NGS is more informative than prior technologies by virtue of delivering direct molecular measurements in the form of digital sequence counts. Motivated by these new capabilities, we introduce a novel model-based in silico approach for quantitative design of large-scale multiplexed NGS structure mapping assays, which takes advantage of the direct and digital nature of NGS readouts. We use it to characterize the relationship between controllable experimental parameters and the precision of mapping measurements. Our results highlight the complexity of these dependencies and shed light on relevant tradeoffs and pitfalls, which can be difficult to discern by intuition alone. We demonstrate our approach by quantitatively assessing the robustness of SHAPE-Seq measurements, obtained by multiplexing SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) chemistry in conjunction with NGS. We then utilize it to elucidate design considerations in advanced genome-wide approaches for probing the transcriptome, which recently obtained in vivo information using dimethyl sulfate (DMS) chemistry.
Collapse
Affiliation(s)
- Sharon Aviran
- Biomedical Engineering Department and Genome Center, University of California at Davis, Davis, California 95616, USA
| | - Lior Pachter
- Center for Computational Biology and Departments of Molecular and Cell Biology and Mathematics, University of California at Berkeley, Berkeley, California 94720, USA
| |
Collapse
|
40
|
Tian S, Cordero P, Kladwang W, Das R. High-throughput mutate-map-rescue evaluates SHAPE-directed RNA structure and uncovers excited states. RNA (NEW YORK, N.Y.) 2014; 20:1815-26. [PMID: 25183835 PMCID: PMC4201832 DOI: 10.1261/rna.044321.114] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The three-dimensional conformations of noncoding RNAs underpin their biochemical functions but have largely eluded experimental characterization. Here, we report that integrating a classic mutation/rescue strategy with high-throughput chemical mapping enables rapid RNA structure inference with unusually strong validation. We revisit a 16S rRNA domain for which SHAPE (selective 2'-hydroxyl acylation with primer extension) and limited mutational analysis suggested a conformational change between apo- and holo-ribosome conformations. Computational support estimates, data from alternative chemical probes, and mutate-and-map (M(2)) experiments highlight issues of prior methodology and instead give a near-crystallographic secondary structure. Systematic interrogation of single base pairs via a high-throughput mutation/rescue approach then permits incisive validation and refinement of the M(2)-based secondary structure. The data further uncover the functional conformation as an excited state (20 ± 10% population) accessible via a single-nucleotide register shift. These results correct an erroneous SHAPE inference of a ribosomal conformational change, expose critical limitations of conventional structure mapping methods, and illustrate practical steps for more incisively dissecting RNA dynamic structure landscapes.
Collapse
Affiliation(s)
- Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
| | - Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, California 94305, USA
| | - Wipapat Kladwang
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA Biomedical Informatics Program, Stanford University, Stanford, California 94305, USA Department of Physics, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
41
|
Loughrey D, Watters KE, Settle AH, Lucks JB. SHAPE-Seq 2.0: systematic optimization and extension of high-throughput chemical probing of RNA secondary structure with next generation sequencing. Nucleic Acids Res 2014; 42:gku909. [PMID: 25303992 PMCID: PMC4245970 DOI: 10.1093/nar/gku909] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
RNA structure is a primary determinant of its function, and methods that merge chemical probing with next generation sequencing have created breakthroughs in the throughput and scale of RNA structure characterization. However, little work has been done to examine the effects of library preparation and sequencing on the measured chemical probe reactivities that encode RNA structural information. Here, we present the first analysis and optimization of these effects for selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq). We first optimize SHAPE-Seq, and show that it provides highly reproducible reactivity data over a wide range of RNA structural contexts with no apparent biases. As part of this optimization, we present SHAPE-Seq v2.0, a ‘universal’ method that can obtain reactivity information for every nucleotide of an RNA without having to use or introduce a specific reverse transcriptase priming site within the RNA. We show that SHAPE-Seq v2.0 is highly reproducible, with reactivity data that can be used as constraints in RNA folding algorithms to predict structures on par with those generated using data from other SHAPE methods. We anticipate SHAPE-Seq v2.0 to be broadly applicable to understanding the RNA sequence–structure relationship at the heart of some of life's most fundamental processes.
Collapse
Affiliation(s)
- David Loughrey
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14850, USA
| | - Kyle E Watters
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14850, USA
| | - Alexander H Settle
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14850, USA
| | - Julius B Lucks
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14850, USA
| |
Collapse
|
42
|
Treuille A, Das R. Scientific rigor through videogames. Trends Biochem Sci 2014; 39:507-9. [PMID: 25300714 DOI: 10.1016/j.tibs.2014.08.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Revised: 08/11/2014] [Accepted: 08/15/2014] [Indexed: 10/24/2022]
Abstract
Hypothesis-driven experimentation - the scientific method - can be subverted by fraud, irreproducibility, and lack of rigorous predictive tests. A robust solution to these problems may be the 'massive open laboratory' model, recently embodied in the internet-scale videogame EteRNA. Deploying similar platforms throughout biology could enforce the scientific method more broadly.
Collapse
Affiliation(s)
- Adrien Treuille
- Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15206, USA.
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA; Department of Physics, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
43
|
Hector RD, Burlacu E, Aitken S, Le Bihan T, Tuijtel M, Zaplatina A, Cook AG, Granneman S. Snapshots of pre-rRNA structural flexibility reveal eukaryotic 40S assembly dynamics at nucleotide resolution. Nucleic Acids Res 2014; 42:12138-54. [PMID: 25200078 PMCID: PMC4231735 DOI: 10.1093/nar/gku815] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Ribosome assembly in eukaryotes involves the activity of hundreds of assembly factors that direct the hierarchical assembly of ribosomal proteins and numerous ribosomal RNA folding steps. However, detailed insights into the function of assembly factors and ribosomal RNA folding events are lacking. To address this, we have developed ChemModSeq, a method that combines structure probing, high-throughput sequencing and statistical modeling, to quantitatively measure RNA structural rearrangements during the assembly of macromolecular complexes. By applying ChemModSeq to purified 40S assembly intermediates we obtained nucleotide-resolution maps of ribosomal RNA flexibility revealing structurally distinct assembly intermediates and mechanistic insights into assembly dynamics not readily observed in cryo-electron microscopy reconstructions. We show that RNA restructuring events coincide with the release of assembly factors and predict that completion of the head domain is required before the Rio1 kinase enters the assembly pathway. Collectively, our results suggest that 40S assembly factors regulate the timely incorporation of ribosomal proteins by delaying specific folding steps in the 3' major domain of the 20S pre-ribosomal RNA.
Collapse
Affiliation(s)
- Ralph D Hector
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3JD, UK
| | - Elena Burlacu
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3JD, UK Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, EH9 3JR, UK
| | - Stuart Aitken
- MRC Human Genetics Unit, University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU, UK
| | - Thierry Le Bihan
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3JD, UK
| | - Maarten Tuijtel
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3JD, UK
| | - Alina Zaplatina
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3JD, UK
| | - Atlanta G Cook
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, EH9 3JR, UK
| | - Sander Granneman
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3JD, UK
| |
Collapse
|
44
|
Abstract
A comprehensive understanding of RNA structure will provide fundamental insights into the cellular function of both coding and non-coding RNAs. Although many RNA structures have been analysed by traditional biophysical and biochemical methods, the low-throughput nature of these approaches has prevented investigation of the vast majority of cellular transcripts. Triggered by advances in sequencing technology, genome-wide approaches for probing the transcriptome are beginning to reveal how RNA structure affects each step of protein expression and RNA stability. In this Review, we discuss the emerging relationships between RNA structure and the regulation of gene expression.
Collapse
|
45
|
Kladwang W, Mann TH, Becka A, Tian S, Kim H, Yoon S, Das R. Standardization of RNA chemical mapping experiments. Biochemistry 2014; 53:3063-5. [PMID: 24766159 PMCID: PMC4033625 DOI: 10.1021/bi5003426] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
![]()
Chemical
mapping experiments offer powerful information about RNA
structure but currently involve ad hoc assumptions in data processing.
We show that simple dilutions, referencing standards (GAGUA hairpins),
and HiTRACE/MAPseeker analysis allow rigorous overmodification correction,
background subtraction, and normalization for electrophoretic data
and a ligation bias correction needed for accurate deep sequencing
data. Comparisons across six noncoding RNAs stringently test the proposed
standardization of dimethyl sulfate (DMS), 2′-OH acylation
(SHAPE), and carbodiimide measurements. Identification of new signatures
for extrahelical bulges and DMS “hot spot” pockets (including
tRNA A58, methylated in vivo) illustrates the utility
and necessity of standardization for quantitative RNA mapping.
Collapse
Affiliation(s)
- Wipapat Kladwang
- Department of Biochemistry, Stanford University , Stanford, California 94305, United States
| | | | | | | | | | | | | |
Collapse
|
46
|
Kielpinski LJ, Vinther J. Massive parallel-sequencing-based hydroxyl radical probing of RNA accessibility. Nucleic Acids Res 2014; 42:e70. [PMID: 24569351 PMCID: PMC4005689 DOI: 10.1093/nar/gku167] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Hydroxyl Radical Footprinting (HRF) is a tried-and-tested method for analysis of the tertiary structure of RNA and for identification of protein footprints on RNA. The hydroxyl radical reaction breaks accessible parts of the RNA backbone, thereby allowing ribose accessibility to be determined by detection of reverse transcriptase termination sites. Current methods for HRF rely on reverse transcription of a single primer and detection by fluorescent fragments by capillary electrophoresis. Here, we describe an accurate and efficient massive parallel-sequencing-based method for probing RNA accessibility with hydroxyl radicals, called HRF-Seq. Using random priming and a novel barcoding scheme, we show that HRF-Seq dramatically increases the throughput of HRF experiments and facilitates the parallel analysis of multiple RNAs or experimental conditions. Moreover, we demonstrate that HRF-Seq data for the Escherichia coli 16S rRNA correlates well with the ribose accessible surface area as determined by X-ray crystallography and have a resolution that readily allows the difference in accessibility caused by exposure of one side of RNA helices to be observed.
Collapse
Affiliation(s)
- Lukasz Jan Kielpinski
- Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| | | |
Collapse
|
47
|
Abstract
Self-assembling RNA molecules present compelling substrates for the rational interrogation and control of living systems. However, imperfect in silico models--even at the secondary structure level--hinder the design of new RNAs that function properly when synthesized. Here, we present a unique and potentially general approach to such empirical problems: the Massive Open Laboratory. The EteRNA project connects 37,000 enthusiasts to RNA design puzzles through an online interface. Uniquely, EteRNA participants not only manipulate simulated molecules but also control a remote experimental pipeline for high-throughput RNA synthesis and structure mapping. We show herein that the EteRNA community leveraged dozens of cycles of continuous wet laboratory feedback to learn strategies for solving in vitro RNA design problems on which automated methods fail. The top strategies--including several previously unrecognized negative design rules--were distilled by machine learning into an algorithm, EteRNABot. Over a rigorous 1-y testing phase, both the EteRNA community and EteRNABot significantly outperformed prior algorithms in a dozen RNA secondary structure design tests, including the creation of dendrimer-like structures and scaffolds for small molecule sensors. These results show that an online community can carry out large-scale experiments, hypothesis generation, and algorithm design to create practical advances in empirical science.
Collapse
|