1
|
Wang J. Genome-Wide Analysis of Stable RNA Secondary Structures across Multiple Organisms Using Chemical Probing Data: Insights into Short Structural Motifs and RNA-Targeting Therapeutics. Biochemistry 2025; 64:1817-1827. [PMID: 40131856 PMCID: PMC12005188 DOI: 10.1021/acs.biochem.4c00764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Revised: 03/10/2025] [Accepted: 03/14/2025] [Indexed: 03/27/2025]
Abstract
Small molecules targeting specific RNA-binding sites, including stable and transient RNA structures, are emerging as effective pharmacological approaches for modulating gene expression. However, little is understood about how stable RNA secondary structures are shared across organisms, which is an important factor in controlling drug selectivity. In this study, I provide an analytical pipeline named RNA secondary structure finder (R2S-Finder) to discover short, stable RNA structural motifs in humans, Escherichia coli (E. coli), SARS-CoV-2, and Zika virus by leveraging existing in vivo and in vitro genome-wide chemical RNA-probing datasets. I found several common features across the organisms. For example, apart from the well-documented tetraloops, AU-rich tetraloops are widely present in different organisms. I also validated that the 5' untranslated region (UTR) contains a higher proportion of stable structures than the coding sequences in humans and Zika virus. In general, stable structures predicted from in vitro (protein-free) and in vivo datasets are consistent across different organisms, indicating that stable structure formation is mostly driven by RNA folding, while a larger variation was found between in vitro and in vivo data for certain RNA types, such as human long intergenic noncoding RNAs (lincRNAs). Finally, I predicted stable three- and four-way RNA junctions that exist under both in vivo and in vitro conditions and can potentially serve as drug targets. All results of stable structures, stem-loops, internal loops, bulges, and n-way junctions have been collated in the R2S-Finder database (https://github.com/JingxinWangLab/R2S-Finder), which is coded in hyperlinked HTML pages and tabulated in CSV files.
Collapse
Affiliation(s)
- Jingxin Wang
- Section of Genetic Medicine,
Department of Medicine, Biological Sciences Division, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
2
|
Mu K, Fei Y, Xu Y, Zhang QC. RASP v2.0: an updated atlas for RNA structure probing data. Nucleic Acids Res 2025; 53:D211-D219. [PMID: 39546630 PMCID: PMC11701657 DOI: 10.1093/nar/gkae1117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 10/16/2024] [Accepted: 11/14/2024] [Indexed: 11/17/2024] Open
Abstract
RNA molecules function in numerous biological processes by folding into intricate structures. Here we present RASP v2.0, an updated database for RNA structure probing data featuring a substantially expanded collection of datasets along with enhanced online structural analysis functionalities. Compared to the previous version, RASP v2.0 includes the following improvements: (i) the number of RNA structure datasets has increased from 156 to 438, comprising 216 transcriptome-wide RNA structure datasets, 141 target-specific RNA structure datasets, and 81 RNA-RNA interaction datasets, thereby broadening species coverage from 18 to 24, (ii) a deep learning-based model has been implemented to impute missing structural signals for 59 transcriptome-wide RNA structure datasets with low structure score coverage, significantly enhancing data quality, particularly for low-abundance RNAs, (iii) three new online analysis modules have been deployed to assist RNA structure studies, including missing structure score imputation, RNA secondary and tertiary structure prediction, and RNA binding protein (RBP) binding prediction. By providing a resource of much more comprehensive RNA structure data, RASP v2.0 is poised to facilitate the exploration of RNA structure-function relationships across diverse biological processes. RASP v2.0 is freely accessible at http://rasp2.zhanglab.net/.
Collapse
Affiliation(s)
- Kunting Mu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Yuhan Fei
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Yiran Xu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Qiangfeng Cliff Zhang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China
- Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| |
Collapse
|
3
|
Szyjka CE, Kelly SL, Strobel EJ. Sequential structure probing of cotranscriptional RNA folding intermediates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.14.618260. [PMID: 39464030 PMCID: PMC11507761 DOI: 10.1101/2024.10.14.618260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
Cotranscriptional RNA folding pathways typically involve the sequential formation of folding intermediates. Existing methods for cotranscriptional RNA structure probing map the structure of nascent RNA in the context of a terminally arrested transcription elongation complex. Consequently, the rearrangement of RNA structures as nucleotides are added to the transcript can be inferred but is not assessed directly. To address this limitation, we have developed linked-multipoint Transcription Elongation Complex RNA structure probing (TECprobe-LM), which assesses the cotranscriptional rearrangement of RNA structures by sequentially positioning E. coli RNAP at two or more points within a DNA template so that nascent RNA can be chemically probed. We validated TECprobe-LM by measuring known folding events that occur within the E. coli signal recognition particle RNA, Clostridium beijerinckii pfl ZTP riboswitch, and Bacillus cereus crcB fluoride riboswitch folding pathways. Our findings establish TECprobe-LM as a strategy for detecting cotranscriptional RNA folding events directly using chemical probing.
Collapse
Affiliation(s)
- Courtney E. Szyjka
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY 14260, USA
| | - Skyler L. Kelly
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY 14260, USA
| | - Eric J. Strobel
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY 14260, USA
| |
Collapse
|
4
|
Wang J. Genome-Wide Identification of Stable RNA Secondary Structures Across Multiple Organisms Using Chemical Probing Data: Insights into Short Structural Motifs and RNA-Targeting Therapeutics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.08.617329. [PMID: 39416040 PMCID: PMC11482745 DOI: 10.1101/2024.10.08.617329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Small molecules targeting specific RNA binding sites, including stable and transient RNA structures, are emerging as effective pharmacological approaches for modulating gene expression. However, little is understood about how stable RNA secondary structures are shared across organisms, an important factor in controlling drug selectivity. In this study, I provide an analytical pipeline named RNA Secondary Structure Finder (R2S-Finder) to discover short, stable RNA structural motifs for humans, Escherichia coli ( E. coli ), SARS-CoV-2, and Zika virus by leveraging existing in vivo and in vitro genome-wide chemical RNA-probing datasets. I found several common features across organisms. For example, apart from the well-documented tetraloops, AU-rich tetraloops are widely present in different organisms. I also found that the 5' untranslated region (UTR) contains a higher proportion of stable structures than the coding sequences in humans, SARS-CoV-2, and Zika virus. In general, stable structures predicted from in vitro (protein-free) and in vivo datasets are consistent in humans, E. coli , and SARS-CoV-2, indicating that most stable structure formation were driven by RNA folding alone, while a larger variation was found between in vitro and in vivo data with certain RNA types, such as human long intergenic non-coding RNAs (lincRNAs). Finally, I predicted stable three- and four-way RNA junctions that exist both in vivo and in vitro conditions, which can potentially serve as drug targets. All results of stable sequences, stem-loops, internal loops, bulges, and three- and four-way junctions have been collated in the R2S-Finder database ( https://github.com/JingxinWangLab/R2S-Finder ), which is coded in hyperlinked HTML pages and tabulated in CSV files.
Collapse
|
5
|
Greenwood T, Heitsch CE. How Parameters Influence SHAPE-Directed Predictions. Methods Mol Biol 2024; 2726:105-124. [PMID: 38780729 DOI: 10.1007/978-1-0716-3519-3_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
The structure of an RNA sequence encodes information about its biological function. Dynamic programming algorithms are often used to predict the conformation of an RNA molecule from its sequence alone, and adding experimental data as auxiliary information improves prediction accuracy. This auxiliary data is typically incorporated into the nearest neighbor thermodynamic model22 by converting the data into pseudoenergies. Here, we look at how much of the space of possible structures auxiliary data allows prediction methods to explore. We find that for a large class of RNA sequences, auxiliary data shifts the predictions significantly. Additionally, we find that predictions are highly sensitive to the parameters which define the auxiliary data pseudoenergies. In fact, the parameter space can typically be partitioned into regions where different structural predictions predominate.
Collapse
|
6
|
Szyjka CE, Strobel EJ. Observation of coordinated RNA folding events by systematic cotranscriptional RNA structure probing. Nat Commun 2023; 14:7839. [PMID: 38030633 PMCID: PMC10687018 DOI: 10.1038/s41467-023-43395-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 11/08/2023] [Indexed: 12/01/2023] Open
Abstract
RNA begins to fold as it is transcribed by an RNA polymerase. Consequently, RNA folding is constrained by the direction and rate of transcription. Understanding how RNA folds into secondary and tertiary structures therefore requires methods for determining the structure of cotranscriptional folding intermediates. Cotranscriptional RNA chemical probing methods accomplish this by systematically probing the structure of nascent RNA that is displayed from an RNA polymerase. Here, we describe a concise, high-resolution cotranscriptional RNA chemical probing procedure called variable length Transcription Elongation Complex RNA structure probing (TECprobe-VL). We demonstrate the accuracy and resolution of TECprobe-VL by replicating and extending previous analyses of ZTP and fluoride riboswitch folding and mapping the folding pathway of a ppGpp-sensing riboswitch. In each system, we show that TECprobe-VL identifies coordinated cotranscriptional folding events that mediate transcription antitermination. Our findings establish TECprobe-VL as an accessible method for mapping cotranscriptional RNA folding pathways.
Collapse
Affiliation(s)
- Courtney E Szyjka
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY, 14260, USA
| | - Eric J Strobel
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY, 14260, USA.
| |
Collapse
|
7
|
Szyjka CE, Strobel EJ. Observation of coordinated cotranscriptional RNA folding events. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.21.529405. [PMID: 36865203 PMCID: PMC9980086 DOI: 10.1101/2023.02.21.529405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
RNA begins to fold as it is transcribed by an RNA polymerase. Consequently, RNA folding is constrained by the direction and rate of transcription. Understanding how RNA folds into secondary and tertiary structures therefore requires methods for determining the structure of cotranscriptional folding intermediates. Cotranscriptional RNA chemical probing methods accomplish this by systematically probing the structure of nascent RNA that is displayed from RNA polymerase. Here, we have developed a concise, high-resolution cotranscriptional RNA chemical probing procedure called Transcription Elongation Complex RNA structure probing-Multilength (TECprobe-ML). We validated TECprobe-ML by replicating and extending previous analyses of ZTP and fluoride riboswitch folding, and mapped the folding pathway of a ppGpp-sensing riboswitch. In each system, TECprobe-ML identified coordinated cotranscriptional folding events that mediate transcription antitermination. Our findings establish TECprobe-ML as an accessible method for mapping cotranscriptional RNA folding pathways.
Collapse
Affiliation(s)
- Courtney E. Szyjka
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY 14260, USA
| | - Eric J. Strobel
- Department of Biological Sciences, The University at Buffalo, Buffalo, NY 14260, USA
| |
Collapse
|
8
|
Zhao Q, Mao Q, Zhao Z, Yuan W, He Q, Sun Q, Yao Y, Fan X. RNA independent fragment partition method based on deep learning for RNA secondary structure prediction. Sci Rep 2023; 13:2861. [PMID: 36801945 PMCID: PMC9938198 DOI: 10.1038/s41598-023-30124-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 02/16/2023] [Indexed: 02/19/2023] Open
Abstract
The non-coding RNA secondary structure largely determines its function. Hence, accuracy in structure acquisition is of great importance. Currently, this acquisition primarily relies on various computational methods. The prediction of the structures of long RNA sequences with high precision and reasonable computational cost remains challenging. Here, we propose a deep learning model, RNA-par, which could partition an RNA sequence into several independent fragments (i-fragments) based on its exterior loops. Each i-fragment secondary structure predicted individually could be further assembled to acquire the complete RNA secondary structure. In the examination of our independent test set, the average length of the predicted i-fragments was 453 nt, which was considerably shorter than that of complete RNA sequences (848 nt). The accuracy of the assembled structures was higher than that of the structures predicted directly using the state-of-the-art RNA secondary structure prediction methods. This proposed model could serve as a preprocessing step for RNA secondary structure prediction for enhancing the predictive performance (especially for long RNA sequences) and reducing the computational cost. In the future, predicting the secondary structure of long-sequence RNA with high accuracy can be enabled by developing a framework combining RNA-par with various existing RNA secondary structure prediction algorithms. Our models, test codes and test data are provided at https://github.com/mianfei71/RNAPar .
Collapse
Affiliation(s)
- Qi Zhao
- grid.412252.20000 0004 0368 6968College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, 110169 Liaoning China
| | - Qian Mao
- grid.411356.40000 0000 9339 3042College of Light Industry, Liaoning University, Shenyang, 110036 Liaoning China
| | - Zheng Zhao
- grid.440686.80000 0001 0543 8253College of Artificial Intelligence, Dalian Maritime University, Dalian, 116026 Liaoning China
| | - Wenxuan Yuan
- grid.412252.20000 0004 0368 6968College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, 110169 Liaoning China
| | - Qiang He
- grid.412252.20000 0004 0368 6968College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, 110169 Liaoning China
| | - Qixuan Sun
- grid.412252.20000 0004 0368 6968College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, 110169 Liaoning China
| | - Yudong Yao
- grid.217309.e0000 0001 2180 0654Department of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, NJ 07030 USA
| | - Xiaoya Fan
- School of Software, Dalian University of Technology, Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, Dalian, 116620, Liaoning, China.
| |
Collapse
|
9
|
Trinity L, Wark I, Lansing L, Jabbari H, Stege U. Shapify: Paths to SARS-CoV-2 frameshifting pseudoknot. PLoS Comput Biol 2023; 19:e1010922. [PMID: 36854032 PMCID: PMC10004594 DOI: 10.1371/journal.pcbi.1010922] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 03/10/2023] [Accepted: 02/05/2023] [Indexed: 03/02/2023] Open
Abstract
Multiple coronaviruses including MERS-CoV causing Middle East Respiratory Syndrome, SARS-CoV causing SARS, and SARS-CoV-2 causing COVID-19, use a mechanism known as -1 programmed ribosomal frameshifting (-1 PRF) to replicate. SARS-CoV-2 possesses a unique RNA pseudoknotted structure that stimulates -1 PRF. Targeting -1 PRF in SARS-CoV-2 to impair viral replication can improve patients' prognoses. Crucial to developing these therapies is understanding the structure of the SARS-CoV-2 -1 PRF pseudoknot. Our goal is to expand knowledge of -1 PRF structural conformations. Following a structural alignment approach, we identify similarities in -1 PRF pseudoknots of SARS-CoV-2, SARS-CoV, and MERS-CoV. We provide in-depth analysis of the SARS-CoV-2 and MERS-CoV -1 PRF pseudoknots, including reference and noteworthy mutated sequences. To better understand the impact of mutations, we provide insight on -1 PRF pseudoknot sequence mutations and their effect on resulting structures. We introduce Shapify, a novel algorithm that given an RNA sequence incorporates structural reactivity (SHAPE) data and partial structure information to output an RNA secondary structure prediction within a biologically sound hierarchical folding approach. Shapify enhances our understanding of SARS-CoV-2 -1 PRF pseudoknot conformations by providing energetically favourable predictions that are relevant to structure-function and may correlate with -1 PRF efficiency. Applied to the SARS-CoV-2 -1 PRF pseudoknot, Shapify unveils previously unknown paths from initial stems to pseudoknotted structures. By contextualizing our work with available experimental data, our structure predictions motivate future RNA structure-function research and can aid 3-D modeling of pseudoknots.
Collapse
Affiliation(s)
- Luke Trinity
- Department of Computer Science, University of Victoria, Victoria, British Columbia, Canada
| | - Ian Wark
- Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
| | - Lance Lansing
- Department of Computer Science, University of Victoria, Victoria, British Columbia, Canada
| | - Hosna Jabbari
- Department of Computer Science, University of Victoria, Victoria, British Columbia, Canada
- Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
- Institute on Aging and Lifelong Health, Victoria, British Columbia, Canada
| | - Ulrike Stege
- Department of Computer Science, University of Victoria, Victoria, British Columbia, Canada
| |
Collapse
|
10
|
Loganathan T, Doss C GP. Non-coding RNAs in human health and disease: potential function as biomarkers and therapeutic targets. Funct Integr Genomics 2023; 23:33. [PMID: 36625940 PMCID: PMC9838419 DOI: 10.1007/s10142-022-00947-4] [Citation(s) in RCA: 77] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 12/14/2022] [Accepted: 12/15/2022] [Indexed: 01/11/2023]
Abstract
Human diseases have been a critical threat from the beginning of human history. Knowing the origin, course of action and treatment of any disease state is essential. A microscopic approach to the molecular field is a more coherent and accurate way to explore the mechanism, progression, and therapy with the introduction and evolution of technology than a macroscopic approach. Non-coding RNAs (ncRNAs) play increasingly important roles in detecting, developing, and treating all abnormalities related to physiology, pathology, genetics, epigenetics, cancer, and developmental diseases. Noncoding RNAs are becoming increasingly crucial as powerful, multipurpose regulators of all biological processes. Parallel to this, a rising amount of scientific information has revealed links between abnormal noncoding RNA expression and human disorders. Numerous non-coding transcripts with unknown functions have been found in addition to advancements in RNA-sequencing methods. Non-coding linear RNAs come in a variety of forms, including circular RNAs with a continuous closed loop (circRNA), long non-coding RNAs (lncRNA), and microRNAs (miRNA). This comprises specific information on their biogenesis, mode of action, physiological function, and significance concerning disease (such as cancer or cardiovascular diseases and others). This study review focuses on non-coding RNA as specific biomarkers and novel therapeutic targets.
Collapse
Affiliation(s)
- Tamizhini Loganathan
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore- 632014, Tamil Nadu, India
| | - George Priya Doss C
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore- 632014, Tamil Nadu, India.
| |
Collapse
|
11
|
RNA secondary structure packages evaluated and improved by high-throughput experiments. Nat Methods 2022; 19:1234-1242. [PMID: 36192461 PMCID: PMC9839360 DOI: 10.1038/s41592-022-01605-0] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Accepted: 08/10/2022] [Indexed: 01/17/2023]
Abstract
Despite the popularity of computer-aided study and design of RNA molecules, little is known about the accuracy of commonly used structure modeling packages in tasks sensitive to ensemble properties of RNA. Here, we demonstrate that the EternaBench dataset, a set of more than 20,000 synthetic RNA constructs designed on the RNA design platform Eterna, provides incisive discriminative power in evaluating current packages in ensemble-oriented structure prediction tasks. We find that CONTRAfold and RNAsoft, packages with parameters derived through statistical learning, achieve consistently higher accuracy than more widely used packages in their standard settings, which derive parameters primarily from thermodynamic experiments. We hypothesized that training a multitask model with the varied data types in EternaBench might improve inference on ensemble-based prediction tasks. Indeed, the resulting model, named EternaFold, demonstrated improved performance that generalizes to diverse external datasets including complete messenger RNAs, viral genomes probed in human cells and synthetic designs modeling mRNA vaccines.
Collapse
|
12
|
Programmable antivirals targeting critical conserved viral RNA secondary structures from influenza A virus and SARS-CoV-2. Nat Med 2022; 28:1944-1955. [PMID: 35982307 PMCID: PMC10132811 DOI: 10.1038/s41591-022-01908-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2017] [Accepted: 06/20/2022] [Indexed: 12/18/2022]
Abstract
Influenza A virus's (IAV's) frequent genetic changes challenge vaccine strategies and engender resistance to current drugs. We sought to identify conserved and essential RNA secondary structures within IAV's genome that are predicted to have greater constraints on mutation in response to therapeutic targeting. We identified and genetically validated an RNA structure (packaging stem-loop 2 (PSL2)) that mediates in vitro packaging and in vivo disease and is conserved across all known IAV isolates. A PSL2-targeting locked nucleic acid (LNA), administered 3 d after, or 14 d before, a lethal IAV inoculum provided 100% survival in mice, led to the development of strong immunity to rechallenge with a tenfold lethal inoculum, evaded attempts to select for resistance and retained full potency against neuraminidase inhibitor-resistant virus. Use of an analogous approach to target SARS-CoV-2, prophylactic administration of LNAs specific for highly conserved RNA structures in the viral genome, protected hamsters from efficient transmission of the SARS-CoV-2 USA_WA1/2020 variant. These findings highlight the potential applicability of this approach to any virus of interest via a process we term 'programmable antivirals', with implications for antiviral prophylaxis and post-exposure therapy.
Collapse
|
13
|
Ye L, Gribling-Burrer AS, Bohn P, Kibe A, Börtlein C, Ambi UB, Ahmad S, Olguin-Nava M, Smith M, Caliskan N, von Kleist M, Smyth RP. Short- and long-range interactions in the HIV-1 5' UTR regulate genome dimerization and packaging. Nat Struct Mol Biol 2022; 29:306-319. [PMID: 35347312 PMCID: PMC9010304 DOI: 10.1038/s41594-022-00746-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 02/14/2022] [Indexed: 11/09/2022]
Abstract
RNA dimerization is the noncovalent association of two human immunodeficiency virus-1 (HIV-1) genomes. It is a conserved step in the HIV-1 life cycle and assumed to be a prerequisite for binding to the viral structural protein Pr55Gag during genome packaging. Here, we developed functional analysis of RNA structure-sequencing (FARS-seq) to comprehensively identify sequences and structures within the HIV-1 5' untranslated region (UTR) that regulate this critical step. Using FARS-seq, we found nucleotides important for dimerization throughout the HIV-1 5' UTR and identified distinct structural conformations in monomeric and dimeric RNA. In the dimeric RNA, key functional domains, such as stem-loop 1 (SL1), polyadenylation signal (polyA) and primer binding site (PBS), folded into independent structural motifs. In the monomeric RNA, SL1 was reconfigured into long- and short-range base pairings with polyA and PBS, respectively. We show that these interactions disrupt genome packaging, and additionally show that the PBS-SL1 interaction unexpectedly couples the PBS with dimerization and Pr55Gag binding. Altogether, our data provide insights into late stages of HIV-1 life cycle and a mechanistic explanation for the link between RNA dimerization and packaging.
Collapse
Affiliation(s)
- Liqing Ye
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Anne-Sophie Gribling-Burrer
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Patrick Bohn
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Anuja Kibe
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Charlene Börtlein
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Uddhav B. Ambi
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Shazeb Ahmad
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Marco Olguin-Nava
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Maureen Smith
- grid.13652.330000 0001 0940 3744P5 Systems Medicine of Infectious Disease, Robert Koch-Institute, Berlin, Germany
| | - Neva Caliskan
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany ,grid.8379.50000 0001 1958 8658Faculty of Medicine, University of Würzburg, Würzburg, Germany
| | - Max von Kleist
- grid.13652.330000 0001 0940 3744P5 Systems Medicine of Infectious Disease, Robert Koch-Institute, Berlin, Germany
| | - Redmond P. Smyth
- grid.498164.6Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany ,grid.8379.50000 0001 1958 8658Faculty of Medicine, University of Würzburg, Würzburg, Germany
| |
Collapse
|
14
|
Cao J, Xue Y. Characteristic chemical probing patterns of loop motifs improve prediction accuracy of RNA secondary structures. Nucleic Acids Res 2021; 49:4294-4307. [PMID: 33849076 PMCID: PMC8096282 DOI: 10.1093/nar/gkab250] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 03/24/2021] [Accepted: 04/10/2021] [Indexed: 12/14/2022] Open
Abstract
RNA structures play a fundamental role in nearly every aspect of cellular physiology and pathology. Gaining insights into the functions of RNA molecules requires accurate predictions of RNA secondary structures. However, the existing thermodynamic folding models remain less accurate than desired, even when chemical probing data, such as selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) reactivities, are used as restraints. Unlike most SHAPE-directed algorithms that only consider SHAPE restraints for base pairing, we extract two-dimensional structural features encoded in SHAPE data and establish robust relationships between characteristic SHAPE patterns and loop motifs of various types (hairpin, internal, and bulge) and lengths (2-11 nucleotides). Such characteristic SHAPE patterns are closely related to the sugar pucker conformations of loop residues. Based on these patterns, we propose a computational method, SHAPELoop, which refines the predicted results of the existing methods, thereby further improving their prediction accuracy. In addition, SHAPELoop can provide information about local or global structural rearrangements (including pseudoknots) and help researchers to easily test their hypothesized secondary structures.
Collapse
Affiliation(s)
- Jingyi Cao
- School of Life Sciences, Tsinghua-Peking Joint Center for Life Sciences, Beijing Advanced Innovation Center for Structural Biology, Tsinghua University, Beijing 100084, China
| | - Yi Xue
- School of Life Sciences, Tsinghua-Peking Joint Center for Life Sciences, Beijing Advanced Innovation Center for Structural Biology, Tsinghua University, Beijing 100084, China
| |
Collapse
|
15
|
Calonaci N, Jones A, Cuturello F, Sattler M, Bussi G. Machine learning a model for RNA structure prediction. NAR Genom Bioinform 2021; 2:lqaa090. [PMID: 33575634 PMCID: PMC7671377 DOI: 10.1093/nargab/lqaa090] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 10/06/2020] [Accepted: 10/20/2020] [Indexed: 01/04/2023] Open
Abstract
RNA function crucially depends on its structure. Thermodynamic models currently used for secondary structure prediction rely on computing the partition function of folding ensembles, and can thus estimate minimum free-energy structures and ensemble populations. These models sometimes fail in identifying native structures unless complemented by auxiliary experimental data. Here, we build a set of models that combine thermodynamic parameters, chemical probing data (DMS and SHAPE) and co-evolutionary data (direct coupling analysis) through a network that outputs perturbations to the ensemble free energy. Perturbations are trained to increase the ensemble populations of a representative set of known native RNA structures. In the chemical probing nodes of the network, a convolutional window combines neighboring reactivities, enlightening their structural information content and the contribution of local conformational ensembles. Regularization is used to limit overfitting and improve transferability. The most transferable model is selected through a cross-validation strategy that estimates the performance of models on systems on which they are not trained. With the selected model we obtain increased ensemble populations for native structures and more accurate predictions in an independent validation set. The flexibility of the approach allows the model to be easily retrained and adapted to incorporate arbitrary experimental information.
Collapse
Affiliation(s)
- Nicola Calonaci
- International School for Advanced Studies, via Bonomea 265, 34136 Trieste, Italy
| | - Alisha Jones
- Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg 85764, Germany
| | - Francesca Cuturello
- International School for Advanced Studies, via Bonomea 265, 34136 Trieste, Italy
| | - Michael Sattler
- Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg 85764, Germany
| | - Giovanni Bussi
- International School for Advanced Studies, via Bonomea 265, 34136 Trieste, Italy
| |
Collapse
|
16
|
Hurst T, Chen SJ. Sieving RNA 3D Structures with SHAPE and Evaluating Mechanisms Driving Sequence-Dependent Reactivity Bias. J Phys Chem B 2021; 125:1156-1166. [PMID: 33497570 DOI: 10.1021/acs.jpcb.0c11365] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemical probing provides local RNA flexibility information at single-nucleotide resolution. In general, SHAPE is thought of as a secondary structure (2D) technology, but we find evidence that robust tertiary structure (3D) information is contained in SHAPE data. Here, we report a new model that achieves a higher correlation between SHAPE data and native RNA 3D structures than the previous 3D structure-SHAPE relationship model. Furthermore, we demonstrate that the new model improves our ability to discern between SHAPE-compatible and -incompatible structures on model decoys. After identifying sequence-dependent bias in SHAPE experiments, we propose a mechanism driving sequence-dependent bias in SHAPE experiments, using replica-exchange umbrella sampling simulations to confirm that the SHAPE sequence bias is largely explained by the stability of the unreacted SHAPE reagent in the binding pocket. Taken together, this work represents multiple practical advances in our mechanistic and predictive understanding of SHAPE technology.
Collapse
Affiliation(s)
- Travis Hurst
- Department of Physics, Department of Biochemistry, and Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| |
Collapse
|
17
|
Li P, Zhou X, Xu K, Zhang QC. RASP: an atlas of transcriptome-wide RNA secondary structure probing data. Nucleic Acids Res 2021; 49:D183-D191. [PMID: 33068412 PMCID: PMC7779053 DOI: 10.1093/nar/gkaa880] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 09/13/2020] [Accepted: 09/26/2020] [Indexed: 02/06/2023] Open
Abstract
RNA molecules fold into complex structures that are important across many biological processes. Recent technological developments have enabled transcriptome-wide probing of RNA secondary structure using nucleases and chemical modifiers. These approaches have been widely applied to capture RNA secondary structure in many studies, but gathering and presenting such data from very different technologies in a comprehensive and accessible way has been challenging. Existing RNA structure probing databases usually focus on low-throughput or very specific datasets. Here, we present a comprehensive RNA structure probing database called RASP (RNA Atlas of Structure Probing) by collecting 161 deduplicated transcriptome-wide RNA secondary structure probing datasets from 38 papers. RASP covers 18 species across animals, plants, bacteria, fungi, and also viruses, and categorizes 18 experimental methods including DMS-seq, SHAPE-Seq, SHAPE-MaP, and icSHAPE, etc. Specially, RASP curates the up-to-date datasets of several RNA secondary structure probing studies for the RNA genome of SARS-CoV-2, the RNA virus that caused the on-going COVID-19 pandemic. RASP also provides a user-friendly interface to query, browse, and visualize RNA structure profiles, offering a shortcut to accessing RNA secondary structures grounded in experimental data. The database is freely available at http://rasp.zhanglab.net.
Collapse
MESH Headings
- Animals
- COVID-19/epidemiology
- COVID-19/prevention & control
- COVID-19/virology
- Computational Biology/methods
- Computational Biology/statistics & numerical data
- Databases, Genetic/statistics & numerical data
- Genome, Viral/genetics
- High-Throughput Nucleotide Sequencing/methods
- High-Throughput Nucleotide Sequencing/statistics & numerical data
- Humans
- Nucleic Acid Conformation
- Pandemics
- RNA/chemistry
- RNA/genetics
- RNA Probes/genetics
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Fungal/chemistry
- RNA, Fungal/genetics
- RNA, Plant/chemistry
- RNA, Plant/genetics
- RNA, Viral/chemistry
- RNA, Viral/genetics
- SARS-CoV-2/genetics
- SARS-CoV-2/physiology
- Transcriptome
Collapse
Affiliation(s)
- Pan Li
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Xiaolin Zhou
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Kui Xu
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Qiangfeng Cliff Zhang
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
18
|
Pinkney HR, Wright BM, Diermeier SD. The lncRNA Toolkit: Databases and In Silico Tools for lncRNA Analysis. Noncoding RNA 2020; 6:E49. [PMID: 33339309 PMCID: PMC7768357 DOI: 10.3390/ncrna6040049] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Revised: 12/14/2020] [Accepted: 12/15/2020] [Indexed: 02/07/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) are a rapidly expanding field of research, with many new transcripts identified each year. However, only a small subset of lncRNAs has been characterized functionally thus far. To aid investigating the mechanisms of action by which new lncRNAs act, bioinformatic tools and databases are invaluable. Here, we review a selection of computational tools and databases for the in silico analysis of lncRNAs, including tissue-specific expression, protein coding potential, subcellular localization, structural conformation, and interaction partners. The assembled lncRNA toolkit is aimed primarily at experimental researchers as a useful starting point to guide wet-lab experiments, mainly containing multi-functional, user-friendly interfaces. With more and more new lncRNA analysis tools available, it will be essential to provide continuous updates and maintain the availability of key software in the future.
Collapse
Affiliation(s)
| | | | - Sarah D. Diermeier
- Department of Biochemistry, University of Otago, Dunedin 9016, New Zealand; (H.R.P.); (B.M.W.)
| |
Collapse
|
19
|
Palka C, Forino NM, Hentschel J, Das R, Stone MD. Folding heterogeneity in the essential human telomerase RNA three-way junction. RNA (NEW YORK, N.Y.) 2020; 26:1787-1800. [PMID: 32817241 PMCID: PMC7668248 DOI: 10.1261/rna.077255.120] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2020] [Accepted: 07/29/2020] [Indexed: 06/11/2023]
Abstract
Telomeres safeguard the genome by suppressing illicit DNA damage responses at chromosome termini. To compensate for incomplete DNA replication at telomeres, most continually dividing cells, including many cancers, express the telomerase ribonucleoprotein (RNP) complex. Telomerase maintains telomere length by catalyzing de novo synthesis of short DNA repeats using an internal telomerase RNA (TR) template. TRs from diverse species harbor structurally conserved domains that contribute to RNP biogenesis and function. In vertebrate TRs, the conserved regions 4 and 5 (CR4/5) fold into a three-way junction (TWJ) that binds directly to the telomerase catalytic protein subunit and is required for telomerase function. We have analyzed the structural properties of the human TR (hTR) CR4/5 domain using a combination of in vitro chemical mapping, secondary structural modeling, and single-molecule structural analysis. Our data suggest the essential P6.1 stem-loop within CR4/5 is not stably folded in the absence of the telomerase reverse transcriptase in vitro. Rather, the hTR CR4/5 domain adopts a heterogeneous ensemble of conformations. Finally, single-molecule FRET measurements of CR4/5 and a mutant designed to stabilize the P6.1 stem demonstrate that TERT binding selects for a structural conformation of CR4/5 that is not the dominant state of the TERT-free in vitro RNA ensemble.
Collapse
Affiliation(s)
- Christina Palka
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, California 95064, USA
| | - Nicholas M Forino
- Department of Molecular, Cell, and Developmental Biology, University of California, Santa Cruz, California 95064, USA
| | - Jendrik Hentschel
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, California 95064, USA
| | - Rhiju Das
- Biophysics Program, Stanford University, Stanford, California 94305, USA
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Michael D Stone
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, California 95064, USA
- Center for Molecular Biology of RNA, University of California, Santa Cruz, California 95064, USA
| |
Collapse
|
20
|
Zhang W, Wu Q. Applications of phage-derived RNA-based technologies in synthetic biology. Synth Syst Biotechnol 2020; 5:343-360. [PMID: 33083579 PMCID: PMC7564126 DOI: 10.1016/j.synbio.2020.09.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 09/22/2020] [Accepted: 09/27/2020] [Indexed: 12/20/2022] Open
Abstract
As the most abundant biological entities with incredible diversity, bacteriophages (also known as phages) have been recognized as an important source of molecular machines for the development of genetic-engineering tools. At the same time, phages are crucial for establishing and improving basic theories of molecular biology. Studies on phages provide rich sources of essential elements for synthetic circuit design as well as powerful support for the improvement of directed evolution platforms. Therefore, phages play a vital role in the development of new technologies and central scientific concepts. After the RNA world hypothesis was proposed and developed, novel biological functions of RNA continue to be discovered. RNA and its related elements are widely used in many fields such as metabolic engineering and medical diagnosis, and their versatility led to a major role of RNA in synthetic biology. Further development of RNA-based technologies will advance synthetic biological tools as well as provide verification of the RNA world hypothesis. Most synthetic biology efforts are based on reconstructing existing biological systems, understanding fundamental biological processes, and developing new technologies. RNA-based technologies derived from phages will offer abundant sources for synthetic biological components. Moreover, phages as well as RNA have high impact on biological evolution, which is pivotal for understanding the origin of life, building artificial life-forms, and precisely reprogramming biological systems. This review discusses phage-derived RNA-based technologies terms of phage components, the phage lifecycle, and interactions between phages and bacteria. The significance of RNA-based technology derived from phages for synthetic biology and for understanding the earliest stages of biological evolution will be highlighted.
Collapse
Affiliation(s)
- Wenhui Zhang
- MOE Key Lab. Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Qiong Wu
- MOE Key Lab. Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Center for Synthetic and Systems Biology, Tsinghua University, Beijing, 100084, China
- Corresponding author. MOE Key Lab. Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| |
Collapse
|
21
|
Saaidi A, Allouche D, Regnier M, Sargueil B, Ponty Y. IPANEMAP: integrative probing analysis of nucleic acids empowered by multiple accessibility profiles. Nucleic Acids Res 2020; 48:8276-8289. [PMID: 32735675 PMCID: PMC7470984 DOI: 10.1093/nar/gkaa607] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 07/03/2020] [Accepted: 07/29/2020] [Indexed: 11/13/2022] Open
Abstract
The manual production of reliable RNA structure models from chemical probing experiments benefits from the integration of information derived from multiple protocols and reagents. However, the interpretation of multiple probing profiles remains a complex task, hindering the quality and reproducibility of modeling efforts. We introduce IPANEMAP, the first automated method for the modeling of RNA structure from multiple probing reactivity profiles. Input profiles can result from experiments based on diverse protocols, reagents, or collection of variants, and are jointly analyzed to predict the dominant conformations of an RNA. IPANEMAP combines sampling, clustering and multi-optimization, to produce secondary structure models that are both stable and well-supported by experimental evidences. The analysis of multiple reactivity profiles, both publicly available and produced in our study, demonstrates the good performances of IPANEMAP, even in a mono probing setting. It confirms the potential of integrating multiple sources of probing data, informing the design of informative probing assays.
Collapse
Affiliation(s)
- Afaf Saaidi
- CNRS UMR 7161, LIX, Ecole Polytechnique, Institut Polytechnique de Paris, 1 rue Estienne d'Orves, 91120 Palaiseau, France
| | - Delphine Allouche
- CNRS UMR 8038, CitCoM, Université de Paris, 4 avenue de l'observatoire, 75006 Paris, France
| | - Mireille Regnier
- CNRS UMR 7161, LIX, Ecole Polytechnique, Institut Polytechnique de Paris, 1 rue Estienne d'Orves, 91120 Palaiseau, France
| | - Bruno Sargueil
- CNRS UMR 8038, CitCoM, Université de Paris, 4 avenue de l'observatoire, 75006 Paris, France
| | - Yann Ponty
- CNRS UMR 7161, LIX, Ecole Polytechnique, Institut Polytechnique de Paris, 1 rue Estienne d'Orves, 91120 Palaiseau, France
| |
Collapse
|
22
|
Bandaru S, Tsuji MH, Shimizu Y, Usami K, Lee S, Takei NK, Yoshitome K, Nishimura Y, Otsuki T, Ito T. Structure-based design of gRNA for Cas13. Sci Rep 2020; 10:11610. [PMID: 32665590 PMCID: PMC7360764 DOI: 10.1038/s41598-020-68459-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Accepted: 06/01/2020] [Indexed: 12/26/2022] Open
Abstract
Cas13 endonuclease activity depends on the RNA local secondary structure with strong preference for single-stranded (SS) regions. Hence, it becomes indispensable to identify the SS regions for effective Cas13 mediated RNA knockdown. We herein present rational gRNA design by integrating experimental structure-seq data and predicted structural models. Utilizing structure-seq data for XIST transcript, we observed that gRNAs targeting the SS regions significantly induce transcript knockdown and cleavage than those targeting double-stranded (DS) regions. Further, we identified the "central seed region" in the gRNA that upon targeting the SS regions efficiently facilitates Cas13 mediated cleavage. In our following pursuits, we considered the scenario wherein experimental structure-seq data is not available, hence we used SS18-SSX2 fusion transcript indicated in synovial sarcomas and computationally predicted its structure. We observed that gRNAs targeting the SS regions predicted from the structure, efficiently induced necrosis compared to gRNAs that target the DS regions. In conclusion, for the effective RNA knockdown, the Cas13 mediated targeting strategy presented herein emphasizes the designing of gRNAs specifically targeting SS regions by utilizing structural information. Further, this strategy, in turn, can be anticipated to narrow the search space for gRNA design (by exclusively targeting SS regions) especially when lncRNAs are the targets.
Collapse
Affiliation(s)
- Srinivas Bandaru
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan
| | - Mika Higashide Tsuji
- Department of Applied Chemistry and Biotechnology, Faculty of Engineering, Graduate School of Engineering, University of Fukui, Bunkyo 3-9-1, Fukui, Fukui, 910-8507, Japan
| | - Yurika Shimizu
- Department of Pathophysiology, Periodontal Science, Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences, Okayama, Okayama, 700-8558, Japan
| | - Kaya Usami
- Okayama University Medical School, Okayama, 700-8558, Japan
| | - Suni Lee
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan
| | - Naoko Kumagai Takei
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan
| | - Kei Yoshitome
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan
| | - Yasumitsu Nishimura
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan
| | - Takemi Otsuki
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan
| | - Tatsuo Ito
- Department of Hygiene, Kawasaki Medical University, 577 Matsushima, Kurashiki, Okayama, 701-0192, Japan.
| |
Collapse
|
23
|
Yu B, Lu Y, Zhang QC, Hou L. Prediction and differential analysis of RNA secondary structure. QUANTITATIVE BIOLOGY 2020. [DOI: 10.1007/s40484-020-0205-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
24
|
Yu H, Zhang Y, Sun Q, Gao H, Tao S. RSVdb: a comprehensive database of transcriptome RNA structure. Brief Bioinform 2020; 22:5831476. [PMID: 32382747 DOI: 10.1093/bib/bbaa071] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/04/2020] [Accepted: 04/07/2020] [Indexed: 11/14/2022] Open
Abstract
RNA fulfills a crucial regulatory role in cells by folding into a complex RNA structure. To date, a chemical compound, dimethyl sulfate (DMS), has been developed to probe the RNA structure at the transcriptome level effectively. We proposed a database, RSVdb (https://taolab.nwafu.edu.cn/rsvdb/), for the browsing and visualization of transcriptome RNA structures. RSVdb, including 626 225 RNAs with validated DMS reactivity from 178 samples in eight species, supports four main functions: information retrieval, research overview, structure prediction and resource download. Users can search for species, studies, transcripts and genes of interest; browse the quality control of sequencing data and statistical charts of RNA structure information; preview and perform online prediction of RNA structures in silico and under DMS restraint of different experimental treatments and download RNA structure data for species and studies. Together, RSVdb provides a reference for RNA structure and will support future research on the function of RNA structure at the transcriptome level.
Collapse
|
25
|
Duncan D, Garner R, Zrantchev I, Ard T, Newman B, Saslow A, Wanserski E, Toga AW. Using Virtual Reality to Improve Performance and User Experience in Manual Correction of MRI Segmentation Errors by Non-experts. J Digit Imaging 2020; 32:97-104. [PMID: 30030766 DOI: 10.1007/s10278-018-0108-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Segmentation of MRI scans is a critical part of the workflow process before we can further analyze neuroimaging data. Although there are several automatic tools for segmentation, no segmentation software is perfectly accurate, and manual correction by visually inspecting the segmentation errors is required. The process of correcting these errors is tedious and time-consuming, so we present a novel method of performing this task in a head-mounted virtual reality interactive system with a new software, Virtual Brain Segmenter (VBS). We provide the results of user testing on 30 volunteers to show the benefits of our tool as a more efficient, intuitive, and engaging alternative compared with the current method of correcting segmentation errors.
Collapse
Affiliation(s)
- Dominique Duncan
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, 2025 Zonal Ave., Los Angeles, CA, 90033, USA.
| | - Rachael Garner
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, 2025 Zonal Ave., Los Angeles, CA, 90033, USA
| | - Ivan Zrantchev
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, 2025 Zonal Ave., Los Angeles, CA, 90033, USA
| | - Tyler Ard
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, 2025 Zonal Ave., Los Angeles, CA, 90033, USA
| | - Bradley Newman
- RareFaction Interactive, 1725 Camino Palmero Street, 410, Los Angeles, CA, 90046, USA
| | - Adam Saslow
- RareFaction Interactive, 1725 Camino Palmero Street, 410, Los Angeles, CA, 90046, USA
| | - Emily Wanserski
- RareFaction Interactive, 1725 Camino Palmero Street, 410, Los Angeles, CA, 90046, USA
| | - Arthur W Toga
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, 2025 Zonal Ave., Los Angeles, CA, 90033, USA
| |
Collapse
|
26
|
Du P, Cai P, Huang B, Jiang C, Wu Q, Li B, Qu K. SMAtool reveals sequences and structural principles of protein-RNA interaction. Biochem Biophys Res Commun 2020; 525:S0006-291X(20)30336-3. [PMID: 32070491 DOI: 10.1016/j.bbrc.2020.02.068] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Accepted: 02/09/2020] [Indexed: 11/21/2022]
Abstract
Protein binding events on RNA are highly related to RNA secondary structure, which affects post-transcriptional regulation and translation. However, it remains challenging to describe the association between RNA secondary structure and protein binding events. Here, we present Structure Motif Analysis tool (SMAtool), a pipeline that integrates RNA secondary structure and protein binding site information to profile the binding structure preference of each protein. As an example of applying SMAtool, we extracted the RNA-structure and binding site information respectively from the DMS-seq and eCLIP-seq data of the K562 cell-line, and used SMAtool to analyze the structure motif of each RNA binding protein (RBP). This new approach provided results consistent with X-ray crystallography data from the protein data bank (PDB) database, demonstrating that it can help researchers investigate the structure preference of RBP, and understand the role of RNA secondary structure in gene expression. Availability and implementation: https://github.com/QuKunLab/SMAtool.
Collapse
Affiliation(s)
- Pengcheng Du
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China
| | - Pengfei Cai
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China
| | - Beibei Huang
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China
| | - Chen Jiang
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China
| | - Quan Wu
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China.
| | - Bin Li
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China.
| | - Kun Qu
- The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, 230001, China.
| |
Collapse
|
27
|
Xue AY, Yu AM, Lucks JB, Bagheri N. DUETT quantitatively identifies known and novel events in nascent RNA structural dynamics from chemical probing data. Bioinformatics 2019; 35:5103-5112. [PMID: 31389563 PMCID: PMC6954663 DOI: 10.1093/bioinformatics/btz449] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Revised: 04/29/2019] [Accepted: 07/19/2019] [Indexed: 01/23/2023] Open
Abstract
MOTIVATION RNA molecules can undergo complex structural dynamics, especially during transcription, which influence their biological functions. Recently developed high-throughput chemical probing experiments that study RNA cotranscriptional folding generate nucleotide-resolution 'reactivities' for each length of a growing nascent RNA that reflect structural dynamics. However, the manual annotation and qualitative interpretation of reactivity across these large datasets can be nuanced, laborious, and difficult for new practitioners. We developed a quantitative and systematic approach to automatically detect RNA folding events from these datasets to reduce human bias/error, standardize event discovery and generate hypotheses about RNA folding trajectories for further analysis and experimental validation. RESULTS Detection of Unknown Events with Tunable Thresholds (DUETT) identifies RNA structural transitions in cotranscriptional RNA chemical probing datasets. DUETT employs a feedback control-inspired method and a linear regression approach and relies on interpretable and independently tunable parameter thresholds to match qualitative user expectations with quantitatively identified folding events. We validate the approach by identifying known RNA structural transitions within the cotranscriptional folding pathways of the Escherichia coli signal recognition particle RNA and the Bacillus cereus crcB fluoride riboswitch. We identify previously overlooked features of these datasets such as heightened reactivity patterns in the signal recognition particle RNA about 12 nt lengths before base-pair rearrangement. We then apply a sensitivity analysis to identify tradeoffs when choosing parameter thresholds. Finally, we show that DUETT is tunable across a wide range of contexts, enabling flexible application to study broad classes of RNA folding mechanisms. AVAILABILITY AND IMPLEMENTATION https://github.com/BagheriLab/DUETT. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Albert Y Xue
- Department of Chemical & Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
| | - Angela M Yu
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Julius B Lucks
- Department of Chemical & Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
| | - Neda Bagheri
- Department of Chemical & Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
| |
Collapse
|
28
|
Strobel EJ, Cheng L, Berman KE, Carlson PD, Lucks JB. A ligand-gated strand displacement mechanism for ZTP riboswitch transcription control. Nat Chem Biol 2019; 15:1067-1076. [PMID: 31636437 PMCID: PMC6814202 DOI: 10.1038/s41589-019-0382-7] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Revised: 07/31/2019] [Accepted: 08/22/2019] [Indexed: 01/14/2023]
Abstract
Cotranscriptional folding is an obligate step of RNA biogenesis that can guide RNA structure formation and function through transient intermediate folds. This process is particularly important for transcriptional riboswitches in which the formation of ligand-dependent structures during transcription regulates downstream gene expression. However, the intermediate structures that comprise cotranscriptional RNA folding pathways, and the mechanisms that enable transit between them, remain largely unknown. Here, we determine the series of cotranscriptional folds and rearrangements that mediate antitermination by the Clostridium beijerinckii pfl ZTP riboswitch in response to the purine biosynthetic intermediate ZMP. We uncover sequence and structural determinants that modulate an internal RNA strand displacement process and identify biases within natural ZTP riboswitch sequences that promote on-pathway folding. Our findings establish a mechanism for pfl riboswitch antitermination and suggest general strategies by which nascent RNA molecules navigate cotranscriptional folding pathways.
Collapse
Affiliation(s)
- Eric J Strobel
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.
| | - Luyi Cheng
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
| | - Katherine E Berman
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
| | - Paul D Carlson
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL, USA.
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
29
|
Boccaletto P, Magnus M, Almeida C, Zyla A, Astha A, Pluta R, Baginski B, Jankowska E, Dunin-Horkawicz S, Wirecki TK, Boniecki MJ, Stefaniak F, Bujnicki JM. RNArchitecture: a database and a classification system of RNA families, with a focus on structural information. Nucleic Acids Res 2019; 46:D202-D205. [PMID: 29069520 PMCID: PMC5753356 DOI: 10.1093/nar/gkx966] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 10/16/2017] [Indexed: 01/23/2023] Open
Abstract
RNArchitecture is a database that provides a comprehensive description of relationships between known families of structured non-coding RNAs, with a focus on structural similarities. The classification is hierarchical and similar to the system used in the SCOP and CATH databases of protein structures. Its central level is Family, which builds on the Rfam catalog and gathers closely related RNAs. Consensus structures of Families are described with a reduced secondary structure representation. Evolutionarily related Families are grouped into Superfamilies. Similar structures are further grouped into Architectures. The highest level, Class, organizes families into very broad structural categories, such as simple or complex structured RNAs. Some groups at different levels of the hierarchy are currently labeled as ‘unclassified’. The classification is expected to evolve as new data become available. For each Family with an experimentally determined three-diemsional (3D) structure(s), a representative one is provided. RNArchitecture also presents theoretical models of RNA 3D structure and is open for submission of structural models by users. Compared to other databases, RNArchitecture is unique in its focus on structure-based RNA classification, and in providing a platform for storing RNA 3D structure predictions. RNArchitecture can be accessed at http://iimcb.genesilico.pl/RNArchitecture/.
Collapse
Affiliation(s)
- Pietro Boccaletto
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Marcin Magnus
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Catarina Almeida
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Adriana Zyla
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Astha Astha
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Radoslaw Pluta
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Blazej Baginski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Elzbieta Jankowska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Stanislaw Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Tomasz K Wirecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Michal J Boniecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Filip Stefaniak
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.,Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, PL-61-614 Poznan, Poland
| |
Collapse
|
30
|
Yesselman JD, Tian S, Liu X, Shi L, Li JB, Das R. Updates to the RNA mapping database (RMDB), version 2. Nucleic Acids Res 2019; 46:D375-D379. [PMID: 30053264 PMCID: PMC5753257 DOI: 10.1093/nar/gkx873] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 09/19/2017] [Indexed: 12/20/2022] Open
Abstract
Chemical mapping is a broadly utilized technique for probing the structure and function of RNAs. The volume of chemical mapping data continues to grow as more researchers routinely employ this information and as experimental methods increase in throughput and information content. To create a central location for these data, we established an RNA mapping database (RMDB) 5 years ago. The RMDB, which is available at http://rmdb.stanford.edu, now contains chemical mapping data for over 800 entries, involving 134 000 natural and engineered RNAs, in vitro and in cellulo. The entries include large data sets from multidimensional techniques that focus on RNA tertiary structure and co-transcriptional folding, resulting in over 15 million residues probed. The database interface has been redesigned and now offers interactive graphical browsing of structural, thermodynamic and kinetic data at single-nucleotide resolution. The front-end interface now uses the force-directed RNA applet for secondary structure visualization and other JavaScript-based views of bar graphs and annotations. A new interface also streamlines the process for depositing new chemical mapping data to the RMDB.
Collapse
Affiliation(s)
- Joseph D Yesselman
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
| | - Siqi Tian
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
| | - Xin Liu
- Department of Genetics, Stanford University School of Medicine, Stanford CA 94305, USA
| | | | - Jin Billy Li
- Department of Genetics, Stanford University School of Medicine, Stanford CA 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA.,Department of Physics, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
31
|
Abstract
RNA performs and regulates a diverse range of cellular processes, with new functional roles being uncovered at a rapid pace. Interest is growing in how these functions are linked to RNA structures that form in the complex cellular environment. A growing suite of technologies that use advances in RNA structural probes, high-throughput sequencing and new computational approaches to interrogate RNA structure at unprecedented throughput are beginning to provide insights into RNA structures at new spatial, temporal and cellular scales.
Collapse
Affiliation(s)
- Eric J Strobel
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| | - Angela M Yu
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
32
|
Investigating targets for neuropharmacological intervention by molecular dynamics simulations. Biochem Soc Trans 2019; 47:909-918. [PMID: 31085614 DOI: 10.1042/bst20190048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Revised: 04/11/2019] [Accepted: 04/23/2019] [Indexed: 01/09/2023]
Abstract
Medical research has identified over 500 brain disorders. Among these, there are still only very few neuropathologies whose causes are fully understood and, consequently, very few drugs whose mechanism of action is known. No FDA drug has been identified for major neurodegenerative diseases, such as Alzheimer's and Parkinson's. We still lack effective treatments and strategies for modulating progression or even early neurodegenerative disease onset diagnostic tools. A great support toward the highly needed identification of neuroactive drugs comes from computer simulation methods and, in particular, from molecular dynamics (MD). This provides insight into structure-function relationship of a target and predicts structure, dynamics and energetics of ligand/target complexes under biologically relevant conditions like temperature and physiological saline concentration. Here, we present examples of the predictive power of MD for neuroactive ligands/target complexes. This brief survey from our own research shows the usefulness of partnerships between academia and industry, and from joint efforts between experimental and theoretical groups.
Collapse
|
33
|
Frezza E, Courban A, Allouche D, Sargueil B, Pasquali S. The interplay between molecular flexibility and RNA chemical probing reactivities analyzed at the nucleotide level via an extensive molecular dynamics study. Methods 2019; 162-163:108-127. [PMID: 31145972 DOI: 10.1016/j.ymeth.2019.05.021] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 05/22/2019] [Accepted: 05/22/2019] [Indexed: 12/20/2022] Open
Abstract
Determination of the tridimensional structure of ribonucleic acid molecules is fundamental for understanding their function in the cell. A common method to investigate RNA structures of large molecules is the use of chemical probes such as SHAPE (2'-hydroxyl acylation analyzed by primer extension) reagents, DMS (dimethyl sulfate) and CMCT (1-cyclohexyl-3-(2-morpholinoethyl) carbodiimide metho-p-toluene sulfate), the reaction of which is dependent on the local structural properties of each nucleotide. In order to understand the interplay between local flexibility, sugar pucker, canonical pairing and chemical reactivity of the probes, we performed all-atom molecular dynamics simulations on a set of RNA molecules for which both tridimensional structure and chemical probing data are available and we analyzed the correlations between geometrical parameters and the chemical reactivity. Our study confirms that SHAPE reactivity is guided by the local flexibility of the different chemical moieties but suggests that a combination of multiple parameters is needed to better understand the implications of the reactivity at the molecular level. This is also the case for DMS and CMCT for which the reactivity appears to be more complex than commonly accepted.
Collapse
Affiliation(s)
- Elisa Frezza
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France.
| | - Antoine Courban
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France
| | - Delphine Allouche
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France
| | - Bruno Sargueil
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France.
| | - Samuela Pasquali
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France.
| |
Collapse
|
34
|
Choudhary K, Lai YH, Tran EJ, Aviran S. dStruct: identifying differentially reactive regions from RNA structurome profiling data. Genome Biol 2019; 20:40. [PMID: 30791935 PMCID: PMC6385470 DOI: 10.1186/s13059-019-1641-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 01/24/2019] [Indexed: 12/16/2022] Open
Abstract
RNA biology is revolutionized by recent developments of diverse high-throughput technologies for transcriptome-wide profiling of molecular RNA structures. RNA structurome profiling data can be used to identify differentially structured regions between groups of samples. Existing methods are limited in scope to specific technologies and/or do not account for biological variation. Here, we present dStruct which is the first broadly applicable method for differential analysis accounting for biological variation in structurome profiling data. dStruct is compatible with diverse profiling technologies, is validated with experimental data and simulations, and outperforms existing methods.
Collapse
Affiliation(s)
- Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California, Davis, One Shields Avenue, Davis, 95616 CA USA
| | - Yu-Hsuan Lai
- Department of Biochemistry, Purdue University, BCHM 305, 175 S. University Street, West Lafayette, 47907-2063 IN USA
| | - Elizabeth J. Tran
- Department of Biochemistry, Purdue University, BCHM 305, 175 S. University Street, West Lafayette, 47907-2063 IN USA
- Purdue University Center for Cancer Research, Purdue University, Hansen Life Sciences Research Building, Room 141, 201 S. University Street, West Lafayette, 47907-2064 IN USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California, Davis, One Shields Avenue, Davis, 95616 CA USA
| |
Collapse
|
35
|
Schroeder SJ. Challenges and approaches to predicting RNA with multiple functional structures. RNA (NEW YORK, N.Y.) 2018; 24:1615-1624. [PMID: 30143552 PMCID: PMC6239171 DOI: 10.1261/rna.067827.118] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The revolution in sequencing technology demands new tools to interpret the genetic code. As in vivo transcriptome-wide chemical probing techniques advance, new challenges emerge in the RNA folding problem. The emphasis on one sequence folding into a single minimum free energy structure is fading as a new focus develops on generating RNA structural ensembles and identifying functional structural features in ensembles. This review describes an efficient combinatorially complete method and three free energy minimization approaches to predicting RNA structures with more than one functional fold, as well as two methods for analysis of a thermodynamics-based Boltzmann ensemble of structures. The review then highlights two examples of viral RNA 3'-UTR regions that fold into more than one conformation and have been characterized by single molecule fluorescence energy resonance transfer or NMR spectroscopy. These examples highlight the different approaches and challenges in predicting structure and function from sequence for RNA with multiple biological roles and folds. More well-defined examples and new metrics for measuring differences in RNA structures will guide future improvements in prediction of RNA structure and function from sequence.
Collapse
Affiliation(s)
- Susan J Schroeder
- Department of Chemistry and Biochemistry, Department of Microbiology and Plant Biology, University of Oklahoma, Norman, Oklahoma 73019, USA
| |
Collapse
|
36
|
Wright PR, Mann M, Backofen R. Structure and Interaction Prediction in Prokaryotic RNA Biology. Microbiol Spectr 2018; 6:10.1128/microbiolspec.rwr-0001-2017. [PMID: 29676245 PMCID: PMC11633574 DOI: 10.1128/microbiolspec.rwr-0001-2017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2017] [Indexed: 01/01/2023] Open
Abstract
Many years of research in RNA biology have soundly established the importance of RNA-based regulation far beyond most early traditional presumptions. Importantly, the advances in "wet" laboratory techniques have produced unprecedented amounts of data that require efficient and precise computational analysis schemes and algorithms. Hence, many in silico methods that attempt topological and functional classification of novel putative RNA-based regulators are available. In this review, we technically outline thermodynamics-based standard RNA secondary structure and RNA-RNA interaction prediction approaches that have proven valuable to the RNA research community in the past and present. For these, we highlight their usability with a special focus on prokaryotic organisms and also briefly mention recent advances in whole-genome interactomics and how this may influence the field of predictive RNA research.
Collapse
Affiliation(s)
| | | | - Rolf Backofen
- Bioinformatics Group
- Center for Biological Signaling Studies (BIOSS), University of Freiburg, Freiburg, Germany
| |
Collapse
|
37
|
Lackey L, Coria A, Woods C, McArthur E, Laederach A. Allele-specific SHAPE-MaP assessment of the effects of somatic variation and protein binding on mRNA structure. RNA (NEW YORK, N.Y.) 2018; 24:513-528. [PMID: 29317542 PMCID: PMC5855952 DOI: 10.1261/rna.064469.117] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 01/04/2018] [Indexed: 05/22/2023]
Abstract
The impact of inherited and somatic mutations on messenger RNA (mRNA) structure remains poorly understood. Recent technological advances that leverage next-generation sequencing to obtain experimental structure data, such as SHAPE-MaP, can reveal structural effects of mutations, especially when these data are incorporated into structure modeling. Here, we analyze the ability of SHAPE-MaP to detect the relatively subtle structural changes caused by single-nucleotide mutations. We find that allele-specific sorting greatly improved our detection ability. Thus, we used SHAPE-MaP with a novel combination of clone-free robotic mutagenesis and allele-specific sorting to perform a rapid, comprehensive survey of noncoding somatic and inherited riboSNitches in two cancer-associated mRNAs, TPT1 and LCP1 Using rigorous thermodynamic modeling of the Boltzmann suboptimal ensemble, we identified a subset of mutations that change TPT1 and LCP1 RNA structure, with approximately 14% of all variants identified as riboSNitches. To confirm that these in vitro structures were biologically relevant, we tested how dependent TPT1 and LCP1 mRNA structures were on their environments. We performed SHAPE-MaP on TPT1 and LCP1 mRNAs in the presence or absence of cellular proteins and found that both mRNAs have similar overall folds in all conditions. RiboSNitches identified within these mRNAs in vitro likely exist under biological conditions. Overall, these data reveal a robust mRNA structural landscape where differences in environmental conditions and most sequence variants do not significantly alter RNA structural ensembles. Finally, predicting riboSNitches in mRNAs from sequence alone remains particularly challenging; these data will provide the community with benchmarks for further algorithmic development.
Collapse
Affiliation(s)
- Lela Lackey
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Aaztli Coria
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Chanin Woods
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Evonne McArthur
- School of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
38
|
Woods CT, Laederach A. Classification of RNA structure change by 'gazing' at experimental data. Bioinformatics 2018; 33:1647-1655. [PMID: 28130241 PMCID: PMC5447233 DOI: 10.1093/bioinformatics/btx041] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 01/20/2017] [Indexed: 11/12/2022] Open
Abstract
Motivation Mutations (or Single Nucleotide Variants) in folded RiboNucleic Acid structures that cause local or global conformational change are riboSNitches. Predicting riboSNitches is challenging, as it requires making two, albeit related, structure predictions. The data most often used to experimentally validate riboSNitch predictions is Selective 2' Hydroxyl Acylation by Primer Extension, or SHAPE. Experimentally establishing a riboSNitch requires the quantitative comparison of two SHAPE traces: wild-type (WT) and mutant. Historically, SHAPE data was collected on electropherograms and change in structure was evaluated by 'gel gazing.' SHAPE data is now routinely collected with next generation sequencing and/or capillary sequencers. We aim to establish a classifier capable of simulating human 'gazing' by identifying features of the SHAPE profile that human experts agree 'looks' like a riboSNitch. Results We find strong quantitative agreement between experts when RNA scientists 'gaze' at SHAPE data and identify riboSNitches. We identify dynamic time warping and seven other features predictive of the human consensus. The classSNitch classifier reported here accurately reproduces human consensus for 167 mutant/WT comparisons with an Area Under the Curve (AUC) above 0.8. When we analyze 2019 mutant traces for 17 different RNAs, we find that features of the WT SHAPE reactivity allow us to improve thermodynamic structure predictions of riboSNitches. This is significant, as accurate RNA structural analysis and prediction is likely to become an important aspect of precision medicine. Availability and Implementation The classSNitch R package is freely available at http://classsnitch.r-forge.r-project.org . Contact alain@email.unc.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chanin Tolson Woods
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
39
|
Tian S, Kladwang W, Das R. Allosteric mechanism of the V. vulnificus adenine riboswitch resolved by four-dimensional chemical mapping. eLife 2018; 7:29602. [PMID: 29446752 PMCID: PMC5847336 DOI: 10.7554/elife.29602] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2017] [Accepted: 02/13/2018] [Indexed: 12/23/2022] Open
Abstract
The structural interconversions that mediate the gene regulatory functions of RNA molecules may be different from classic models of allostery, but the relevant structural correlations have remained elusive in even intensively studied systems. Here, we present a four-dimensional expansion of chemical mapping called lock-mutate-map-rescue (LM2R), which integrates multiple layers of mutation with nucleotide-resolution chemical mapping. This technique resolves the core mechanism of the adenine-responsive V. vulnificus add riboswitch, a paradigmatic system for which both Monod-Wyman-Changeux (MWC) conformational selection models and non-MWC alternatives have been proposed. To discriminate amongst these models, we locked each functionally important helix through designed mutations and assessed formation or depletion of other helices via compensatory rescue evaluated by chemical mapping. These LM2R measurements give strong support to the pre-existing correlations predicted by MWC models, disfavor alternative models, and suggest additional structural heterogeneities that may be general across ligand-free riboswitches.
Collapse
Affiliation(s)
- Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Rhiju Das
- Department of Physics, Stanford University, Stanford, United States
| |
Collapse
|
40
|
Hawkes EJ, Hennelly SP, Novikova IV, Irwin JA, Dean C, Sanbonmatsu KY. COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures. Cell Rep 2018; 16:3087-3096. [PMID: 27653675 DOI: 10.1016/j.celrep.2016.08.045] [Citation(s) in RCA: 103] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Revised: 06/03/2016] [Accepted: 08/12/2016] [Indexed: 01/07/2023] Open
Abstract
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. We investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probing and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.
Collapse
Affiliation(s)
- Emily J Hawkes
- John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Scott P Hennelly
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA; New Mexico Consortium, Los Alamos, NM 87544, USA
| | - Irina V Novikova
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA; Pacific Northwest National Laboratory, Environmental Molecular Sciences Laboratory, Richland, WA 99354, USA
| | - Judith A Irwin
- John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Caroline Dean
- John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Karissa Y Sanbonmatsu
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA; New Mexico Consortium, Los Alamos, NM 87544, USA.
| |
Collapse
|
41
|
Mlýnský V, Bussi G. Molecular Dynamics Simulations Reveal an Interplay between SHAPE Reagent Binding and RNA Flexibility. J Phys Chem Lett 2018; 9:313-318. [PMID: 29265824 PMCID: PMC5830694 DOI: 10.1021/acs.jpclett.7b02921] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Accepted: 12/21/2017] [Indexed: 05/10/2023]
Abstract
The function of RNA molecules usually depends on their overall fold and on the presence of specific structural motifs. Chemical probing methods are routinely used in combination with nearest-neighbor models to determine RNA secondary structure. Among the available methods, SHAPE is relevant due to its capability to probe all RNA nucleotides and the possibility to be used in vivo. However, the structural determinants for SHAPE reactivity and its mechanism of reaction are still unclear. Here molecular dynamics simulations and enhanced sampling techniques are used to predict the accessibility of nucleotide analogs and larger RNA structural motifs to SHAPE reagents. We show that local RNA reconformations are crucial in allowing reagents to reach the 2'-OH group of a particular nucleotide and that sugar pucker is a major structural factor influencing SHAPE reactivity.
Collapse
Affiliation(s)
- Vojtěch Mlýnský
- Scuola Internazionale Superiore di
Studi Avanzati, SISSA, via Bonomea 265, 34136 Trieste, Italy
| | - Giovanni Bussi
- Scuola Internazionale Superiore di
Studi Avanzati, SISSA, via Bonomea 265, 34136 Trieste, Italy
| |
Collapse
|
42
|
Watters KE, Lucks JB. Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq). Methods Mol Biol 2018; 1490:135-62. [PMID: 27665597 DOI: 10.1007/978-1-4939-6433-8_9] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.
Collapse
Affiliation(s)
- Kyle E Watters
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, 14850, USA
| | - Julius B Lucks
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, 14850, USA.
| |
Collapse
|
43
|
Strobel EJ, Watters KE, Nedialkov Y, Artsimovitch I, Lucks JB. Distributed biotin-streptavidin transcription roadblocks for mapping cotranscriptional RNA folding. Nucleic Acids Res 2017; 45:e109. [PMID: 28398514 PMCID: PMC5499547 DOI: 10.1093/nar/gkx233] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 04/02/2017] [Indexed: 01/24/2023] Open
Abstract
RNA folding during transcription directs an order of folding that can determine RNA structure and function. However, the experimental study of cotranscriptional RNA folding has been limited by the lack of easily approachable methods that can interrogate nascent RNA structure at nucleotide resolution. To address this, we previously developed cotranscriptional selective 2΄-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) to simultaneously probe all intermediate RNA transcripts during transcription by stalling elongation complexes at catalytically dead EcoRIE111Q roadblocks. While effective, the distribution of elongation complexes using EcoRIE111Q requires laborious PCR using many different oligonucleotides for each sequence analyzed. Here, we improve the broad applicability of cotranscriptional SHAPE-Seq by developing a sequence-independent biotin-streptavidin (SAv) roadblocking strategy that simplifies the preparation of roadblocking DNA templates. We first determine the properties of biotin-SAv roadblocks. We then show that randomly distributed biotin-SAv roadblocks can be used in cotranscriptional SHAPE-Seq experiments to identify the same RNA structural transitions related to a riboswitch decision-making process that we previously identified using EcoRIE111Q. Lastly, we find that EcoRIE111Q maps nascent RNA structure to specific transcript lengths more precisely than biotin-SAv and propose guidelines to leverage the complementary strengths of each transcription roadblock in cotranscriptional SHAPE-Seq.
Collapse
Affiliation(s)
- Eric J. Strobel
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60201, USA
| | - Kyle E. Watters
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
| | - Yuri Nedialkov
- Department of Microbiology, The Ohio State University, Columbus, OH 43210, USA
- The Center for RNA Biology, The Ohio State University, Columbus, OH 43210, USA
| | - Irina Artsimovitch
- Department of Microbiology, The Ohio State University, Columbus, OH 43210, USA
- The Center for RNA Biology, The Ohio State University, Columbus, OH 43210, USA
| | - Julius B. Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60201, USA
- To whom correspondence should be addressed. Tel: +1 847 467 2943; Fax: +1 847 491 3728;
| |
Collapse
|
44
|
Woods CT, Lackey L, Williams B, Dokholyan NV, Gotz D, Laederach A. Comparative Visualization of the RNA Suboptimal Conformational Ensemble In Vivo. Biophys J 2017. [PMID: 28625696 PMCID: PMC5529173 DOI: 10.1016/j.bpj.2017.05.031] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
When a ribonucleic acid (RNA) molecule folds, it often does not adopt a single, well-defined conformation. The folding energy landscape of an RNA is highly dependent on its nucleotide sequence and molecular environment. Cellular molecules sometimes alter the energy landscape, thereby changing the ensemble of likely low-energy conformations. The effects of these energy landscape changes on the conformational ensemble are particularly challenging to visualize for large RNAs. We have created a robust approach for visualizing the conformational ensemble of RNAs that is well suited for in vitro versus in vivo comparisons. Our method creates a stable map of conformational space for a given RNA sequence. We first identify single point mutations in the RNA that maximally sample suboptimal conformational space based on the ensemble’s partition function. Then, we cluster these diverse ensembles to identify the most diverse partition functions for Boltzmann stochastic sampling. By using, to our knowledge, a novel nestedness distance metric, we iteratively add mutant suboptimal ensembles to converge on a stable 2D map of conformational space. We then compute the selective 2′ hydroxyl acylation by primer extension (SHAPE)-directed ensemble for the RNA folding under different conditions, and we project these ensembles on the map to visualize. To validate our approach, we established a conformational map of the Vibrio vulnificus add adenine riboswitch that reveals five classes of structures. In the presence of adenine, projection of the SHAPE-directed sampling correctly identified the on-conformation; without the ligand, only off-conformations were visualized. We also collected the whole-transcript in vitro and in vivo SHAPE-MaP for human β-actin messenger RNA that revealed similar global folds in both conditions. Nonetheless, a comparison of in vitro and in vivo data revealed that specific regions exhibited significantly different SHAPE-MaP profiles indicative of structural rearrangements, including rearrangement consistent with binding of the zipcode protein in a region distal to the stop codon.
Collapse
Affiliation(s)
- Chanin T Woods
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Lela Lackey
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Benfeard Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - David Gotz
- Carolina Health Informatics Program, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; School of Information and Library Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Alain Laederach
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina.
| |
Collapse
|
45
|
Miao Z, Adamiak RW, Antczak M, Batey RT, Becka AJ, Biesiada M, Boniecki MJ, Bujnicki JM, Chen SJ, Cheng CY, Chou FC, Ferré-D'Amaré AR, Das R, Dawson WK, Ding F, Dokholyan NV, Dunin-Horkawicz S, Geniesse C, Kappel K, Kladwang W, Krokhotin A, Łach GE, Major F, Mann TH, Magnus M, Pachulska-Wieczorek K, Patel DJ, Piccirilli JA, Popenda M, Purzycka KJ, Ren A, Rice GM, Santalucia J, Sarzynska J, Szachniuk M, Tandon A, Trausch JJ, Tian S, Wang J, Weeks KM, Williams B, Xiao Y, Xu X, Zhang D, Zok T, Westhof E. RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme. RNA (NEW YORK, N.Y.) 2017; 23:655-672. [PMID: 28138060 PMCID: PMC5393176 DOI: 10.1261/rna.060368.116] [Citation(s) in RCA: 140] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2016] [Accepted: 01/26/2017] [Indexed: 05/21/2023]
Abstract
RNA-Puzzles is a collective experiment in blind 3D RNA structure prediction. We report here a third round of RNA-Puzzles. Five puzzles, 4, 8, 12, 13, 14, all structures of riboswitch aptamers and puzzle 7, a ribozyme structure, are included in this round of the experiment. The riboswitch structures include biological binding sites for small molecules (S-adenosyl methionine, cyclic diadenosine monophosphate, 5-amino 4-imidazole carboxamide riboside 5'-triphosphate, glutamine) and proteins (YbxF), and one set describes large conformational changes between ligand-free and ligand-bound states. The Varkud satellite ribozyme is the most recently solved structure of a known large ribozyme. All puzzles have established biological functions and require structural understanding to appreciate their molecular mechanisms. Through the use of fast-track experimental data, including multidimensional chemical mapping, and accurate prediction of RNA secondary structure, a large portion of the contacts in 3D have been predicted correctly leading to similar topologies for the top ranking predictions. Template-based and homology-derived predictions could predict structures to particularly high accuracies. However, achieving biological insights from de novo prediction of RNA 3D structures still depends on the size and complexity of the RNA. Blind computational predictions of RNA structures already appear to provide useful structural information in many cases. Similar to the previous RNA-Puzzles Round II experiment, the prediction of non-Watson-Crick interactions and the observed high atomic clash scores reveal a notable need for an algorithm of improvement. All prediction models and assessment results are available at http://ahsoka.u-strasbg.fr/rnapuzzles/.
Collapse
Affiliation(s)
- Zhichao Miao
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 67000 Strasbourg, France;
| | - Ryszard W Adamiak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Poznan University of Technology, Institute of Computing Science, 60-965 Poznan, Poland
| | - Maciej Antczak
- Poznan University of Technology, Institute of Computing Science, 60-965 Poznan, Poland
| | - Robert T Batey
- Department of Chemistry and Biochemistry, University of Colorado at Boulder, Boulder, Colorado 80309-0596, USA
| | - Alexander J Becka
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Marcin Biesiada
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Michał J Boniecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Shi-Jie Chen
- Department of Physics and Astronomy, Department of Biochemistry, and Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Clarence Yu Cheng
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | | | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Wayne K Dawson
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Stanisław Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Kalli Kappel
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Andrey Krokhotin
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Grzegorz E Łach
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - François Major
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec, H3C 3J7, Canada
| | - Thomas H Mann
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Marcin Magnus
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | | | - Dinshaw J Patel
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
| | - Joseph A Piccirilli
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 60637, USA
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, USA
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Katarzyna J Purzycka
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Aiming Ren
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
- Life Sciences Institute, Zhejiang University, Hangzhou 310058, China
| | - Greggory M Rice
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-3290, USA
| | - John Santalucia
- Department of Chemistry, Wayne State University, Detroit, Michigan 48202, USA
- DNA Software, Ann Arbor, Michigan 48104, USA
| | - Joanna Sarzynska
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Marta Szachniuk
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Poznan University of Technology, Institute of Computing Science, 60-965 Poznan, Poland
| | - Arpit Tandon
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Jeremiah J Trausch
- Department of Chemistry and Biochemistry, University of Colorado at Boulder, Boulder, Colorado 80309-0596, USA
| | - Siqi Tian
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Jian Wang
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-3290, USA
| | - Benfeard Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Yi Xiao
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Xiaojun Xu
- Department of Physics and Astronomy, Department of Biochemistry, and Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Dong Zhang
- Department of Physics and Astronomy, Department of Biochemistry, and Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Tomasz Zok
- Poznan University of Technology, Institute of Computing Science, 60-965 Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 67000 Strasbourg, France;
| |
Collapse
|
46
|
Abstract
The discoveries of myriad non-coding RNA molecules, each transiting through multiple flexible states in cells or virions, present major challenges for structure determination. Advances in high-throughput chemical mapping give new routes for characterizing entire transcriptomes in vivo, but the resulting one-dimensional data generally remain too information-poor to allow accurate de novo structure determination. Multidimensional chemical mapping (MCM) methods seek to address this challenge. Mutate-and-map (M2), RNA interaction groups by mutational profiling (RING-MaP and MaP-2D analysis) and multiplexed •OH cleavage analysis (MOHCA) measure how the chemical reactivities of every nucleotide in an RNA molecule change in response to modifications at every other nucleotide. A growing body of in vitro blind tests and compensatory mutation/rescue experiments indicate that MCM methods give consistently accurate secondary structures and global tertiary structures for ribozymes, ribosomal domains and ligand-bound riboswitch aptamers up to 200 nucleotides in length. Importantly, MCM analyses provide detailed information on structurally heterogeneous RNA states, such as ligand-free riboswitches that are functionally important but difficult to resolve with other approaches. The sequencing requirements of currently available MCM protocols scale at least quadratically with RNA length, precluding general application to transcriptomes or viral genomes at present. We propose a modify-cross-link-map (MXM) expansion to overcome this and other current limitations to resolving the in vivo 'RNA structurome'.
Collapse
|
47
|
Rebijith KB, Asokan R, Hande HR, Krishna Kumar NK. The First Report of miRNAs from a Thysanopteran Insect, Thrips palmi Karny Using High-Throughput Sequencing. PLoS One 2016; 11:e0163635. [PMID: 27685664 PMCID: PMC5042526 DOI: 10.1371/journal.pone.0163635] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Accepted: 09/12/2016] [Indexed: 12/15/2022] Open
Abstract
Thrips palmi Karny (Thysanoptera: Thripidae) is the sole vector of Watermelon bud necrosis tospovirus, where the crop loss has been estimated to be around USD 50 million annually. Chemical insecticides are of limited use in the management of T. palmi due to the thigmokinetic behaviour and development of high levels of resistance to insecticides. There is an urgent need to find out an effective futuristic management strategy, where the small RNAs especially microRNAs hold great promise as a key player in the growth and development. miRNAs are a class of short non-coding RNAs involved in regulation of gene expression either by mRNA cleavage or by translational repression. We identified and characterized a total of 77 miRNAs from T. palmi using high-throughput deep sequencing. Functional classifications of the targets for these miRNAs revealed that majority of them are involved in the regulation of transcription and translation, nucleotide binding and signal transduction. We have also validated few of these miRNAs employing stem-loop RT-PCR, qRT-PCR and Northern blot. The present study not only provides an in-depth understanding of the biological and physiological roles of miRNAs in governing gene expression but may also lead as an invaluable tool for the management of thysanopteran insects in the future.
Collapse
Affiliation(s)
- K. B. Rebijith
- Division of Biotechnology, Indian Institute of Horticultural Research, Bangalore, India
- * E-mail: ;
| | - R. Asokan
- Division of Biotechnology, Indian Institute of Horticultural Research, Bangalore, India
- * E-mail: ;
| | - H. Ranjitha Hande
- Division of Biotechnology, Indian Institute of Horticultural Research, Bangalore, India
| | - N. K. Krishna Kumar
- Division of Horticultural Science, Indian Council of Agricultural Research, New Delhi, India
| |
Collapse
|
48
|
Kutchko KM, Laederach A. Transcending the prediction paradigm: novel applications of SHAPE to RNA function and evolution. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 8. [PMID: 27396578 PMCID: PMC5179297 DOI: 10.1002/wrna.1374] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 04/29/2016] [Accepted: 05/23/2016] [Indexed: 12/31/2022]
Abstract
Selective 2′‐hydroxyl acylation analyzed by primer extension (SHAPE) provides information on RNA structure at single‐nucleotide resolution. It is most often used in conjunction with RNA secondary structure prediction algorithms as a probabilistic or thermodynamic restraint. With the recent advent of ultra‐high‐throughput approaches for collecting SHAPE data, the applications of this technology are extending beyond structure prediction. In this review, we discuss recent applications of SHAPE data in the transcriptomic context and how this new experimental paradigm is changing our understanding of these experiments and RNA folding in general. SHAPE experiments probe both the secondary and tertiary structure of an RNA, suggesting that model‐free approaches for within and comparative RNA structure analysis can provide significant structural insight without the need for a full structural model. New methods incorporating SHAPE at different nucleotide resolutions are required to parse these transcriptomic data sets to transcend secondary structure modeling with global structural metrics. These ‘multiscale’ approaches provide deeper insights into RNA global structure, evolution, and function in the cell. WIREs RNA 2017, 8:e1374. doi: 10.1002/wrna.1374 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Katrina M Kutchko
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
49
|
Barquist L, Burge SW, Gardner PP. Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families. CURRENT PROTOCOLS IN BIOINFORMATICS 2016; 54:12.13.1-12.13.25. [PMID: 27322404 PMCID: PMC5010141 DOI: 10.1002/cpbi.4] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remain difficult. This unit introduces methods developed by the Rfam database for identifying "families" of homologous ncRNAs starting from single "seed" sequences, using manually curated sequence alignments to build powerful statistical models of sequence and structure conservation known as covariance models (CMs), implemented in the Infernal software package. We provide a step-by-step iterative protocol for identifying ncRNA homologs and then constructing an alignment and corresponding CM. We also work through an example for the bacterial small RNA MicA, discovering a previously unreported family of divergent MicA homologs in genus Xenorhabdus in the process. © 2016 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Lars Barquist
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, D-97080 Germany
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA United Kingdom; Fax: +44 (0)1223 494919
| | - Sarah W. Burge
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA United Kingdom; Fax: +44 (0)1223 494919
| | - Paul P. Gardner
- School of Biological Sciences, University of Canterbury, Private Bag 4800, Christchurch, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Private Bag 4800, Christchurch, New Zealand
| |
Collapse
|
50
|
Sahoo S, Świtnicki MP, Pedersen JS. ProbFold: a probabilistic method for integration of probing data in RNA secondary structure prediction. Bioinformatics 2016; 32:2626-35. [PMID: 27153612 DOI: 10.1093/bioinformatics/btw175] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Accepted: 03/28/2016] [Indexed: 12/22/2022] Open
Abstract
MOTIVATION Recently, new RNA secondary structure probing techniques have been developed, including Next Generation Sequencing based methods capable of probing transcriptome-wide. These techniques hold great promise for improving structure prediction accuracy. However, each new data type comes with its own signal properties and biases, which may even be experiment specific. There is therefore a growing need for RNA structure prediction methods that can be automatically trained on new data types and readily extended to integrate and fully exploit multiple types of data. RESULTS Here, we develop and explore a modular probabilistic approach for integrating probing data in RNA structure prediction. It can be automatically trained given a set of known structures with probing data. The approach is demonstrated on SHAPE datasets, where we evaluate and selectively model specific correlations. The approach often makes superior use of the probing data signal compared to other methods. We illustrate the use of ProbFold on multiple data types using both simulations and a small set of structures with both SHAPE, DMS and CMCT data. Technically, the approach combines stochastic context-free grammars (SCFGs) with probabilistic graphical models. This approach allows rapid adaptation and integration of new probing data types. AVAILABILITY AND IMPLEMENTATION ProbFold is implemented in C ++. Models are specified using simple textual formats. Data reformatting is done using separate C ++ programs. Source code, statically compiled binaries for x86 Linux machines, C ++ programs, example datasets and a tutorial is available from http://moma.ki.au.dk/prj/probfold/ CONTACT : jakob.skou@clin.au.dk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sudhakar Sahoo
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
| | - Michał P Świtnicki
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
| | - Jakob Skou Pedersen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark Bioinformatics Research Centre, Aarhus University, Aarhus C DK-8000, Denmark
| |
Collapse
|