1
|
Srivastava M, Dukeshire MR, Mir Q, Omoru OB, Manzourolajdad A, Janga SC. Experimental and computational methods for studying the dynamics of RNA-RNA interactions in SARS-COV2 genomes. Brief Funct Genomics 2024; 23:46-54. [PMID: 36752040 PMCID: PMC10799312 DOI: 10.1093/bfgp/elac050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 10/24/2022] [Accepted: 11/11/2022] [Indexed: 02/09/2023] Open
Abstract
Long-range ribonucleic acid (RNA)-RNA interactions (RRI) are prevalent in positive-strand RNA viruses, including Beta-coronaviruses, and these take part in regulatory roles, including the regulation of sub-genomic RNA production rates. Crosslinking of interacting RNAs and short read-based deep sequencing of resulting RNA-RNA hybrids have shown that these long-range structures exist in severe acute respiratory syndrome coronavirus (SARS-CoV)-2 on both genomic and sub-genomic levels and in dynamic topologies. Furthermore, co-evolution of coronaviruses with their hosts is navigated by genetic variations made possible by its large genome, high recombination frequency and a high mutation rate. SARS-CoV-2's mutations are known to occur spontaneously during replication, and thousands of aggregate mutations have been reported since the emergence of the virus. Although many long-range RRIs have been experimentally identified using high-throughput methods for the wild-type SARS-CoV-2 strain, evolutionary trajectory of these RRIs across variants, impact of mutations on RRIs and interaction of SARS-CoV-2 RNAs with the host have been largely open questions in the field. In this review, we summarize recent computational tools and experimental methods that have been enabling the mapping of RRIs in viral genomes, with a specific focus on SARS-CoV-2. We also present available informatics resources to navigate the RRI maps and shed light on the impact of mutations on the RRI space in viral genomes. Investigating the evolution of long-range RNA interactions and that of virus-host interactions can contribute to the understanding of new and emerging variants as well as aid in developing improved RNA therapeutics critical for combating future outbreaks.
Collapse
Affiliation(s)
- Mansi Srivastava
- Department of BioHealth Informatics, School of Informatics and Computing, Indiana University Purdue University, 535 West Michigan Street, Indianapolis, Indiana 46202, USA
- Department of Biology, Indiana University, 1001 East 3 St, Bloomington, Indiana 47405, USA
| | - Matthew R Dukeshire
- Department of BioHealth Informatics, School of Informatics and Computing, Indiana University Purdue University, 535 West Michigan Street, Indianapolis, Indiana 46202, USA
| | - Quoseena Mir
- Department of BioHealth Informatics, School of Informatics and Computing, Indiana University Purdue University, 535 West Michigan Street, Indianapolis, Indiana 46202, USA
| | - Okiemute Beatrice Omoru
- Department of BioHealth Informatics, School of Informatics and Computing, Indiana University Purdue University, 535 West Michigan Street, Indianapolis, Indiana 46202, USA
| | - Amirhossein Manzourolajdad
- Department of BioHealth Informatics, School of Informatics and Computing, Indiana University Purdue University, 535 West Michigan Street, Indianapolis, Indiana 46202, USA
- Department of Computer Science, Colgate University, Hamilton, NY, USA
| | - Sarath Chandra Janga
- Department of BioHealth Informatics, School of Informatics and Computing, Indiana University Purdue University, 535 West Michigan Street, Indianapolis, Indiana 46202, USA
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Medical Research and Library Building, 975 West Walnut Street, Indianapolis, Indiana 46202, USA
- Centre for Computational Biology and Bioinformatics, Indiana University School of Medicine, 5021 Health Information and Translational Sciences (HITS), 410 West 10th Street, Indianapolis, Indiana 46202, USA
| |
Collapse
|
2
|
Beckmann IK, Waldl M, Will S, Hofacker IL. 3D feasibility of 2D RNA-RNA interaction paths by stepwise folding simulations. RNA (NEW YORK, N.Y.) 2024; 30:113-123. [PMID: 38071473 PMCID: PMC10798244 DOI: 10.1261/rna.079756.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 11/16/2023] [Indexed: 01/18/2024]
Abstract
The structure of an RNA, and even more so its interactions with other RNAs, provide valuable information about its function. Secondary structure-based tools for RNA-RNA interaction predictions provide a quick way to identify possible interaction targets and structures. However, these tools ignore the effect of steric hindrance on the tertiary (3D) structure level, and do not consider whether a suitable folding pathway exists to form the interaction. As a consequence, these tools often predict interactions that are unrealistically long and could be formed (in three dimensions) only by going through highly entangled intermediates. Here, we present a computational pipeline to assess whether a proposed secondary (2D) structure interaction is sterically feasible and reachable along a plausible folding pathway. To this end, we simulate the folding of a series of 3D structures along a given 2D folding path. To avoid the complexity of large-scale atomic resolution simulations, our pipeline uses coarse-grained 3D modeling and breaks up the folding path into small steps, each corresponding to the extension of the interaction by 1 or 2 bp. We apply our pipeline to analyze RNA-RNA interaction formation for three selected RNA-RNA complexes. We find that kissing hairpins, in contrast to interactions in the exterior loop, are difficult to extend and tend to get stuck at an interaction length of 6 bp. Our tool, including source code, documentation, and sample data, is available at www.github.com/irenekb/RRI-3D.
Collapse
Affiliation(s)
- Irene K Beckmann
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, 1090 Wien, Austria
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, A-1030 Vienna, Austria
| | - Maria Waldl
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, 1090 Wien, Austria
- Vienna Doctoral School in Chemistry (DoSChem), University of Vienna, 1090 Vienna, Austria
- Center for Anatomy and Cell Biology, Medical University of Vienna, 1090 Vienna, Austria
| | - Sebastian Will
- LIX - Batiment Turing, Ecole Polytechnique, 91120 Palaiseau, France
| | - Ivo L Hofacker
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, 1090 Wien, Austria
- Faculty of Computer Science, Research Group Bioinformatics and Computational Biology, University of Vienna, 1090 Vienna, Austria
| |
Collapse
|
3
|
Raden M, Miladi M. How to do RNA-RNA Interaction Prediction? A Use-Case Driven Handbook Using IntaRNA. Methods Mol Biol 2024; 2726:209-234. [PMID: 38780733 DOI: 10.1007/978-1-0716-3519-3_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
Computational prediction of RNA-RNA interactions (RRI) is a central methodology for the specific investigation of inter-molecular RNA interactions and regulatory effects of non-coding RNAs like eukaryotic microRNAs or prokaryotic small RNAs. Available methods can be classified according to their underlying prediction strategies, each implicating specific capabilities and restrictions often not transparent to the non-expert user. Within this work, we review seven classes of RRI prediction strategies and discuss the advantages and limitations of respective tools, since such knowledge is essential for selecting the right tool in the first place.Among the RRI prediction strategies, accessibility-based approaches have been shown to provide the most reliable predictions. Here, we describe how IntaRNA, as one of the state-of-the-art accessibility-based tools, can be applied in various use cases for the task of computational RRI prediction. Detailed hands-on examples for individual RRI predictions as well as large-scale target prediction scenarios are provided. We illustrate the flexibility and capabilities of IntaRNA through the examples. Each example is designed using real-life data from the literature and is accompanied by instructions on interpreting the respective results from IntaRNA output. Our use-case driven instructions enable non-expert users to comprehensively understand and utilize IntaRNA's features for effective RRI predictions.
Collapse
Affiliation(s)
- Martin Raden
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany.
| | - Milad Miladi
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
| |
Collapse
|
4
|
Tieng FYF, Abdullah-Zawawi MR, Md Shahri NAA, Mohamed-Hussein ZA, Lee LH, Mutalib NSA. A Hitchhiker's guide to RNA-RNA structure and interaction prediction tools. Brief Bioinform 2023; 25:bbad421. [PMID: 38040490 PMCID: PMC10753535 DOI: 10.1093/bib/bbad421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/16/2023] [Accepted: 10/26/2023] [Indexed: 12/03/2023] Open
Abstract
RNA biology has risen to prominence after a remarkable discovery of diverse functions of noncoding RNA (ncRNA). Most untranslated transcripts often exert their regulatory functions into RNA-RNA complexes via base pairing with complementary sequences in other RNAs. An interplay between RNAs is essential, as it possesses various functional roles in human cells, including genetic translation, RNA splicing, editing, ribosomal RNA maturation, RNA degradation and the regulation of metabolic pathways/riboswitches. Moreover, the pervasive transcription of the human genome allows for the discovery of novel genomic functions via RNA interactome investigation. The advancement of experimental procedures has resulted in an explosion of documented data, necessitating the development of efficient and precise computational tools and algorithms. This review provides an extensive update on RNA-RNA interaction (RRI) analysis via thermodynamic- and comparative-based RNA secondary structure prediction (RSP) and RNA-RNA interaction prediction (RIP) tools and their general functions. We also highlighted the current knowledge of RRIs and the limitations of RNA interactome mapping via experimental data. Then, the gap between RSP and RIP, the importance of RNA homologues, the relationship between pseudoknots, and RNA folding thermodynamics are discussed. It is hoped that these emerging prediction tools will deepen the understanding of RNA-associated interactions in human diseases and hasten treatment processes.
Collapse
Affiliation(s)
- Francis Yew Fu Tieng
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur 56000, Malaysia
| | | | - Nur Alyaa Afifah Md Shahri
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur 56000, Malaysia
| | - Zeti-Azura Mohamed-Hussein
- Institute of Systems Biology (INBIOSIS), UKM, Selangor 43600, Malaysia
- Department of Applied Physics, Faculty of Science and Technology, UKM, Selangor 43600, Malaysia
| | - Learn-Han Lee
- Sunway Microbiomics Centre, School of Medical and Life Sciences, Sunway University, Sunway City 47500, Malaysia
- Novel Bacteria and Drug Discovery Research Group, Microbiome and Bioresource Research Strength, Jeffrey Cheah School of Medicine and Health Sciences, Monash University of Malaysia, Selangor 47500, Malaysia
| | - Nurul-Syakima Ab Mutalib
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur 56000, Malaysia
- Novel Bacteria and Drug Discovery Research Group, Microbiome and Bioresource Research Strength, Jeffrey Cheah School of Medicine and Health Sciences, Monash University of Malaysia, Selangor 47500, Malaysia
- Faculty of Health Sciences, UKM, Kuala Lumpur 50300, Malaysia
| |
Collapse
|
5
|
Zhang H, Li S, Dai N, Zhang L, Mathews DH, Huang L. LinearCoFold and LinearCoPartition: linear-time algorithms for secondary structure prediction of interacting RNA molecules. Nucleic Acids Res 2023; 51:e94. [PMID: 37650626 PMCID: PMC10570024 DOI: 10.1093/nar/gkad664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 06/15/2023] [Accepted: 08/17/2023] [Indexed: 09/01/2023] Open
Abstract
Many RNAs function through RNA-RNA interactions. Fast and reliable RNA structure prediction with consideration of RNA-RNA interaction is useful, however, existing tools are either too simplistic or too slow. To address this issue, we present LinearCoFold, which approximates the complete minimum free energy structure of two strands in linear time, and LinearCoPartition, which approximates the cofolding partition function and base pairing probabilities in linear time. LinearCoFold and LinearCoPartition are orders of magnitude faster than RNAcofold. For example, on a sequence pair with combined length of 26,190 nt, LinearCoFold is 86.8× faster than RNAcofold MFE mode, and LinearCoPartition is 642.3× faster than RNAcofold partition function mode. Surprisingly, LinearCoFold and LinearCoPartition's predictions have higher PPV and sensitivity of intermolecular base pairs. Furthermore, we apply LinearCoFold to predict the RNA-RNA interaction between SARS-CoV-2 genomic RNA (gRNA) and human U4 small nuclear RNA (snRNA), which has been experimentally studied, and observe that LinearCoFold's prediction correlates better with the wet lab results than RNAcofold's.
Collapse
Affiliation(s)
- He Zhang
- Baidu Research, Sunnyvale, CA, USA
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - Sizhen Li
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - Ning Dai
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - Liang Zhang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics,Rochester, NY 14642, USA
- Center for RNA Biology, Rochester, NY 14642, USA
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 14642, USA
| | - Liang Huang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| |
Collapse
|
6
|
Abstract
RNAstructure is a user-friendly program for the prediction and analysis of RNA secondary structure. It is available as a web server, a program with a graphical user interface, or a set of command line tools. The programs are available for Microsoft Windows, macOS, or Linux. This article provides protocols for prediction of RNA secondary structure (using the web server, the graphical user interface, or the command line) and high-affinity oligonucleotide binding sites to a structured RNA target (using the graphical user interface). © 2023 Wiley Periodicals LLC. Basic Protocol 1: Predicting RNA secondary structure using the RNAstructure web server Alternate Protocol 1: Predicting secondary structure and base pair probabilities using the RNAstructure graphical user interface Alternate Protocol 2: Predicting secondary structure and base pair probabilities using the RNAstructure command line interface Basic Protocol 2: Predicting binding affinities of oligonucleotides complementary to an RNA target using OligoWalk.
Collapse
Affiliation(s)
- Sara E. Ali
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | - Abhinav Mittal
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | - David H. Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| |
Collapse
|
7
|
Fopase R, Panda C, Rajendran AP, Uludag H, Pandey LM. Potential of siRNA in COVID-19 therapy: Emphasis on in silico design and nanoparticles based delivery. Front Bioeng Biotechnol 2023; 11:1112755. [PMID: 36814718 PMCID: PMC9939533 DOI: 10.3389/fbioe.2023.1112755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 01/13/2023] [Indexed: 02/09/2023] Open
Abstract
Small interfering RNA (siRNA)-mediated mRNA degradation approach have imparted its eminence against several difficult-to-treat genetic disorders and other allied diseases. Viral outbreaks and resulting pandemics have repeatedly threatened public health and questioned human preparedness at the forefront of drug design and biomedical readiness. During the recent pandemic caused by the SARS-CoV-2, mRNA-based vaccination strategies have paved the way for a new era of RNA therapeutics. RNA Interference (RNAi) based approach using small interfering RNA may complement clinical management of the COVID-19. RNA Interference approach will primarily work by restricting the synthesis of the proteins required for viral replication, thereby hampering viral cellular entry and trafficking by targeting host as well as protein factors. Despite promising benefits, the stability of small interfering RNA in the physiological environment is of grave concern as well as site-directed targeted delivery and evasion of the immune system require immediate attention. In this regard, nanotechnology offers viable solutions for these challenges. The review highlights the potential of small interfering RNAs targeted toward specific regions of the viral genome and the features of nanoformulations necessary for the entrapment and delivery of small interfering RNAs. In silico design of small interfering RNA for different variants of SARS-CoV-2 has been discussed. Various nanoparticles as promising carriers of small interfering RNAs along with their salient properties, including surface functionalization, are summarized. This review will help tackle the real-world challenges encountered by the in vivo delivery of small interfering RNAs, ensuring a safe, stable, and readily available drug candidate for efficient management of SARS-CoV-2 in the future.
Collapse
Affiliation(s)
- Rushikesh Fopase
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| | - Chinmaya Panda
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| | - Amarnath P. Rajendran
- Department of Chemical & Materials Engineering, Faculty of Engineering, University of Alberta, Edmonton, AB, Canada
| | - Hasan Uludag
- Department of Chemical & Materials Engineering, Faculty of Engineering, University of Alberta, Edmonton, AB, Canada
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB, Canada
- Department of Biomedical Engineering, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB, Canada
| | - Lalit M. Pandey
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| |
Collapse
|
8
|
Hess JM, Jannen WK, Aalberts DP. The four mRNA bases have quite different (un)folding free energies, applications to RNA splicing and translation initiation with BindOligoNet. J Mol Biol 2022; 434:167578. [DOI: 10.1016/j.jmb.2022.167578] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 03/31/2022] [Accepted: 04/01/2022] [Indexed: 12/12/2022]
|
9
|
Kang Q, Meng J, Su C, Luan Y. Mining plant endogenous target mimics from miRNA-lncRNA interactions based on dual-path parallel ensemble pruning method. Brief Bioinform 2021; 23:6399881. [PMID: 34662389 DOI: 10.1093/bib/bbab440] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 09/07/2021] [Accepted: 09/24/2021] [Indexed: 12/14/2022] Open
Abstract
The interactions between microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) play important roles in biological activities. Specially, lncRNAs as endogenous target mimics (eTMs) can bind miRNAs to regulate the expressions of target messenger RNAs (mRNAs). A growing number of studies focus on animals, but the studies on plants are scarce and many functions of plant eTMs are unknown. This study proposes a novel ensemble pruning protocol for predicting plant miRNA-lncRNA interactions at first. It adaptively prunes the base models based on dual-path parallel ensemble method to meet the challenge of cross-species prediction. Then potential eTMs are mined from predicted results. The expression levels of RNAs are identified through biological experiment to construct the lncRNA-miRNA-mRNA regulatory network, and the functions of potential eTMs are inferred through enrichment analysis. Experiment results show that the proposed protocol outperforms existing methods and state-of-the-art predictors on various plant species. A total of 17 potential eTMs are verified by biological experiment to involve in 22 regulations, and 14 potential eTMs are inferred by Gene Ontology enrichment analysis to involve in 63 functions, which is significant for further research.
Collapse
Affiliation(s)
- Qiang Kang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Chenglin Su
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning, 116024 China
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning, 116024 China
| |
Collapse
|
10
|
Qian K, Xu JX, Deng Y, Peng H, Peng J, Ou CM, Liu Z, Jiang LH, Tai YH. Signaling pathways of genetic variants and miRNAs in the pathogenesis of myasthenia gravis. Gland Surg 2020; 9:1933-1944. [PMID: 33447544 PMCID: PMC7804555 DOI: 10.21037/gs-20-39] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 09/30/2020] [Indexed: 01/06/2023]
Abstract
BACKGROUND Myasthenia gravis (MG) is a chronic autoimmune neuromuscular disorder causing muscle weakness and characterized by a defect in synaptic transmission at the neuromuscular junction. The pathogenesis of this disease remains unclear. We aimed to predict the key signaling pathways of genetic variants and miRNAs in the pathogenesis of MG, and identify the key genes among them. METHODS We searched published information regarding associated single nucleotide polymorphisms (SNPs) and differentially-expressed miRNAs in MG cases. We search of SNPs and miRNAs in literature databases about MG, then we used bioinformatic tools to predict target genes of miRNAs. Moreover, functional enrichment analysis for key genes was carried out utilizing the Cytoscape-plugin, known as ClueGO. These key genes were mapped to STRING database to construct a protein-protein interaction (PPI) network. Then a miRNA-target gene regulatory network was established to screen key genes. RESULTS Five genes containing SNPs associated with MG risk were involved in the inflammatory bowel disease (IBD) signaling pathway, and FoxP3 was the key gene. MAPK1, SMAD4, SMAD2 and BCL2 were predicted to be targeted by the 18 miRNAs and to act as the key genes in adherens, junctions, apoptosis, or cancer-related pathways respectively. These five key genes containing SNPs or targeted by miRNAs were found to be involved in negative regulation of T cell differentiation. CONCLUSIONS We speculate that SNPs cause the genes to be defective or the miRNAs to downregulate the factors that subsequently negatively regulate regulatory T cells and trigger the onset of MG.
Collapse
Affiliation(s)
- Kai Qian
- Faculty of Life and Biotechnology, Kunming University of Science and Technology, Kunming, China
- Department of Thoracic Surgery, Institute of The First People’s Hospital of Yunnan Province, Kunming, China
| | - Jia-Xin Xu
- Department of Cardiovascular surgery, Yan’ an Affiliated Hospital of Kunming Medical University, Kunming, China
| | - Yi Deng
- Department of Oncology, Institute of Surgery Research, Daping Hospital, Army Medical University, Chongqing, China
| | - Hao Peng
- Department of Thoracic Surgery, Institute of The First People’s Hospital of Yunnan Province, Kunming, China
| | - Jun Peng
- Department of Thoracic Surgery, Institute of The First People’s Hospital of Yunnan Province, Kunming, China
| | - Chun-Mei Ou
- Department of Cardiovascular surgery, Institute of the First People’s Hospital of Yunnan Province, Kunming, China
| | - Zu Liu
- Department of Cardiovascular surgery, Yan’ an Affiliated Hospital of Kunming Medical University, Kunming, China
| | - Li-Hong Jiang
- Department of Thoracic Surgery, Institute of The First People’s Hospital of Yunnan Province, Kunming, China
| | - Yong-Hang Tai
- School of Electronic Information in the Yunnan Normal University, Kunming, China
| |
Collapse
|
11
|
Zhang H, Zhang L, Mathews DH, Huang L. LinearPartition: linear-time approximation of RNA folding partition function and base-pairing probabilities. Bioinformatics 2020; 36:i258-i267. [PMID: 32657379 PMCID: PMC7355276 DOI: 10.1093/bioinformatics/btaa460] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION RNA secondary structure prediction is widely used to understand RNA function. Recently, there has been a shift away from the classical minimum free energy methods to partition function-based methods that account for folding ensembles and can therefore estimate structure and base pair probabilities. However, the classical partition function algorithm scales cubically with sequence length, and is therefore prohibitively slow for long sequences. This slowness is even more severe than cubic-time free energy minimization due to a substantially larger constant factor in runtime. RESULTS Inspired by the success of our recent LinearFold algorithm that predicts the approximate minimum free energy structure in linear time, we design a similar linear-time heuristic algorithm, LinearPartition, to approximate the partition function and base-pairing probabilities, which is shown to be orders of magnitude faster than Vienna RNAfold and CONTRAfold (e.g. 2.5 days versus 1.3 min on a sequence with length 32 753 nt). More interestingly, the resulting base-pairing probabilities are even better correlated with the ground-truth structures. LinearPartition also leads to a small accuracy improvement when used for downstream structure prediction on families with the longest length sequences (16S and 23S rRNAs), as well as a substantial improvement on long-distance base pairs (500+ nt apart). AVAILABILITY AND IMPLEMENTATION Code: http://github.com/LinearFold/LinearPartition; Server: http://linearfold.org/partition. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- He Zhang
- Baidu Research, Sunnyvale, CA 94089, USA
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| | - Liang Zhang
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 48306, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 48306, USA
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 48306, USA
| | - Liang Huang
- Baidu Research, Sunnyvale, CA 94089, USA
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| |
Collapse
|
12
|
Willmott D, Murrugarra D, Ye Q. Improving RNA secondary structure prediction via state inference with deep recurrent neural networks. COMPUTATIONAL AND MATHEMATICAL BIOPHYSICS 2020. [DOI: 10.1515/cmb-2020-0002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Abstract
The problem of determining which nucleotides of an RNA sequence are paired or unpaired in the secondary structure of an RNA, which we call RNA state inference, can be studied by different machine learning techniques. Successful state inference of RNA sequences can be used to generate auxiliary information for data-directed RNA secondary structure prediction. Typical tools for state inference, such as hidden Markov models, exhibit poor performance in RNA state inference, owing in part to their inability to recognize nonlocal dependencies. Bidirectional long short-term memory (LSTM) neural networks have emerged as a powerful tool that can model global nonlinear sequence dependencies and have achieved state-of-the-art performances on many different classification problems.
This paper presents a practical approach to RNA secondary structure inference centered around a deep learning method for state inference. State predictions from a deep bidirectional LSTM are used to generate synthetic SHAPE data that can be incorporated into RNA secondary structure prediction via the Nearest Neighbor Thermodynamic Model (NNTM). This method produces predicted secondary structures for a diverse test set of 16S ribosomal RNA that are, on average, 25 percentage points more accurate than undirected MFE structures. Accuracy is highly dependent on the success of our state inference method, and investigating the global features of our state predictions reveals that accuracy of both our state inference and structure inference methods are highly dependent on the similarity of pairing patterns of the sequence to the training dataset. Availability of a large training dataset is critical to the success of this approach. Code available at https://github.com/dwillmott/rna-state-inf.
Collapse
Affiliation(s)
| | - David Murrugarra
- Department of Mathematics , University of Kentucky , Lexington , KY 40506-0027 USA
| | - Qiang Ye
- Department of Mathematics , University of Kentucky , Lexington , KY 40506-0027 USA
| |
Collapse
|
13
|
Raden M, Müller T, Mautner S, Gelhausen R, Backofen R. The impact of various seed, accessibility and interaction constraints on sRNA target prediction- a systematic assessment. BMC Bioinformatics 2020; 21:15. [PMID: 31931703 PMCID: PMC6956497 DOI: 10.1186/s12859-019-3143-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Accepted: 10/09/2019] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Seed and accessibility constraints are core features to enable highly accurate sRNA target screens based on RNA-RNA interaction prediction. Currently, available tools provide different (sets of) constraints and default parameter sets. Thus, it is hard to impossible for users to estimate the influence of individual restrictions on the prediction results. RESULTS Here, we present a systematic assessment of the impact of established and new constraints on sRNA target prediction both on a qualitative as well as computational level. This is done exemplarily based on the performance of IntaRNA, one of the most exact sRNA target prediction tools. IntaRNA provides various ways to constrain considered seed interactions, e.g. based on seed length, its accessibility, minimal unpaired probabilities, or energy thresholds, beside analogous constraints for the overall interaction. Thus, our results reveal the impact of individual constraints and their combinations. CONCLUSIONS This provides both a guide for users what is important and recommendations for existing and upcoming sRNA target prediction approaches.We show on a large sRNA target screen benchmark data set that only by altering the parameter set, IntaRNA recovers 30% more verified interactions while becoming 5-times faster. This exemplifies the potential of seed, accessibility and interaction constraints for sRNA target prediction.
Collapse
Affiliation(s)
- Martin Raden
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, Freiburg, 79110, Germany.
| | - Teresa Müller
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, Freiburg, 79110, Germany
| | - Stefan Mautner
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, Freiburg, 79110, Germany
| | - Rick Gelhausen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, Freiburg, 79110, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, Freiburg, 79110, Germany.,Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Schaenzlestr. 18, Freiburg, 79104, Germany
| |
Collapse
|
14
|
Antonov IV, Mazurov E, Borodovsky M, Medvedeva YA. Prediction of lncRNAs and their interactions with nucleic acids: benchmarking bioinformatics tools. Brief Bioinform 2019; 20:551-564. [PMID: 29697742 DOI: 10.1093/bib/bby032] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2017] [Revised: 03/26/2018] [Indexed: 01/22/2023] Open
Abstract
The genomes of mammalian species are pervasively transcribed producing as many noncoding as protein-coding RNAs. There is a growing body of evidence supporting their functional role. Long noncoding RNA (lncRNA) can bind both nucleic acids and proteins through several mechanisms. A reliable computational prediction of the most probable mechanism of lncRNA interaction can facilitate experimental validation of its function. In this study, we benchmarked computational tools capable to discriminate lncRNA from mRNA and predict lncRNA interactions with other nucleic acids. We assessed the performance of 9 tools for distinguishing protein-coding from noncoding RNAs, as well as 19 tools for prediction of RNA-RNA and RNA-DNA interactions. Our conclusions about the considered tools were based on their performances on the entire genome/transcriptome level, as it is the most common task nowadays. We found that FEELnc and CPAT distinguish between coding and noncoding mammalian transcripts in the most accurate manner. ASSA, RIBlast and LASTAL, as well as Triplexator, turned out to be the best predictors of RNA-RNA and RNA-DNA interactions, respectively. We showed that the normalization of the predicted interaction strength to the transcript length and GC content may improve the accuracy of inferring RNA interactions. Yet, all the current tools have difficulties to make accurate predictions of short-trans RNA-RNA interactions-stretches of sparse contacts. All over, there is still room for improvement in each category, especially for predictions of RNA interactions.
Collapse
Affiliation(s)
- Ivan V Antonov
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Science, Moscow, Russian Federation.,Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation
| | | | - Mark Borodovsky
- Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation
| | - Yulia A Medvedeva
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Science, Moscow, Russian Federation.,Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation.,Department of Computational Biology, Vavilov Institute of General Genetics, Russian Academy of Science, Moscow, Russian Federation
| |
Collapse
|
15
|
Li P, Wei Y, Mei M, Tang L, Sun L, Huang W, Zhou J, Zou C, Zhang S, Qin CF, Jiang T, Dai J, Tan X, Zhang QC. Integrative Analysis of Zika Virus Genome RNA Structure Reveals Critical Determinants of Viral Infectivity. Cell Host Microbe 2018; 24:875-886.e5. [PMID: 30472207 DOI: 10.1016/j.chom.2018.10.011] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2018] [Revised: 09/21/2018] [Accepted: 10/24/2018] [Indexed: 01/25/2023]
Abstract
Zika virus (ZIKV) strains can be classified into the ancestral African and contemporary Asian lineages, with the latter responsible for recent epidemics associated with neurological conditions. To understand how Asian strains lead to exacerbated disease, a crucial step is identifying genomic variations that affect infectivity and pathogenicity. Here we use two high-throughput sequencing approaches to assess RNA secondary structures and intramolecular RNA-RNA interactions in vivo for the RNA genomes of Asian and African ZIKV lineages. Our analysis identified functional RNA structural elements and a functional long-range intramolecular interaction specific for the Asian epidemic strains. Mutants that disrupt this extended RNA interaction between the 5' UTR and the E protein coding region reduce virus infectivity, which is partially rescued with compensatory mutants, restoring this RNA-RNA interaction. These findings illuminate the structural basis of ZIKV regulation and provide a resource for the discovery of RNA structural elements important for ZIKV infection.
Collapse
Affiliation(s)
- Pan Li
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Yifan Wei
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Miao Mei
- School of Pharmaceutical Sciences, Center for Infectious Disease Research, School of Medicine, Tsinghua University, Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Lei Tang
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lei Sun
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Wenze Huang
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Jianyu Zhou
- Bioinformatics Division, BNRIST, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
| | - Chunlin Zou
- Institutes of Biology and Medical Sciences, Jiangsu Key Laboratory of Infection and Immunity, Soochow University, Suzhou 215123, China
| | - Shaojun Zhang
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Cheng-Feng Qin
- Department of Virology, State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100071, China
| | - Tao Jiang
- Bioinformatics Division, BNRIST, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China; Department of Computer Science and Engineering, University of California, Riverside, CA 92521, USA
| | - Jianfeng Dai
- Institutes of Biology and Medical Sciences, Jiangsu Key Laboratory of Infection and Immunity, Soochow University, Suzhou 215123, China
| | - Xu Tan
- School of Pharmaceutical Sciences, Center for Infectious Disease Research, School of Medicine, Tsinghua University, Tsinghua-Peking Center for Life Sciences, Beijing 100084, China.
| | - Qiangfeng Cliff Zhang
- MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology, Center for Synthetic and Systems Biology, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
16
|
Qian X, Zhao J, Yeung PY, Zhang QC, Kwok CK. Revealing lncRNA Structures and Interactions by Sequencing-Based Approaches. Trends Biochem Sci 2018; 44:33-52. [PMID: 30459069 DOI: 10.1016/j.tibs.2018.09.012] [Citation(s) in RCA: 319] [Impact Index Per Article: 45.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Revised: 09/11/2018] [Accepted: 09/19/2018] [Indexed: 11/28/2022]
Abstract
Long noncoding RNAs (lncRNAs) have emerged as significant players in almost every level of gene function and regulation. Thus, characterizing the structures and interactions of lncRNAs is essential for understanding their mechanistic roles in cells. Through a combination of (bio)chemical approaches and automated capillary and high-throughput sequencing (HTS), the complexity and diversity of RNA structures and interactions has been revealed in the transcriptomes of multiple species. These methods have uncovered important biological insights into the mechanistic and functional roles of lncRNA in gene expression and RNA metabolism, as well as in development and disease. In this review, we summarize the latest sequencing strategies to reveal RNA structure, RNA-RNA, RNA-DNA, and RNA-protein interactions, and highlight the recent applications of these approaches to map functional lncRNAs. We discuss the advantages and limitations of these strategies, and provide recommendations to further advance methodologies capable of mapping RNA structure and interactions in order to discover new biology of lncRNAs and decipher their molecular mechanisms and implication in diseases.
Collapse
Affiliation(s)
- Xingyang Qian
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; These authors contributed equally to this work
| | - Jieyu Zhao
- Department of Chemistry, City University of Hong Kong, Kowloon Tong, Hong Kong SAR, China; These authors contributed equally to this work
| | - Pui Yan Yeung
- Department of Chemistry, City University of Hong Kong, Kowloon Tong, Hong Kong SAR, China; These authors contributed equally to this work
| | - Qiangfeng Cliff Zhang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| | - Chun Kit Kwok
- Department of Chemistry, City University of Hong Kong, Kowloon Tong, Hong Kong SAR, China.
| |
Collapse
|
17
|
Mihailovic MK, Vazquez-Anderson J, Li Y, Fry V, Vimalathas P, Herrera D, Lease RA, Powell WB, Contreras LM. High-throughput in vivo mapping of RNA accessible interfaces to identify functional sRNA binding sites. Nat Commun 2018; 9:4084. [PMID: 30287822 PMCID: PMC6172242 DOI: 10.1038/s41467-018-06207-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 08/10/2018] [Indexed: 12/18/2022] Open
Abstract
Herein we introduce a high-throughput method, INTERFACE, to reveal the capacity of contiguous RNA nucleotides to establish in vivo intermolecular RNA interactions for the purpose of functional characterization of intracellular RNA. INTERFACE enables simultaneous accessibility interrogation of an unlimited number of regions by coupling regional hybridization detection to transcription elongation outputs measurable by RNA-seq. We profile over 900 RNA interfaces in 71 validated, but largely mechanistically under-characterized, Escherichia coli sRNAs in the presence and absence of a global regulator, Hfq, and find that two-thirds of tested sRNAs feature Hfq-dependent regions. Further, we identify in vivo hybridization patterns that hallmark functional regions to uncover mRNA targets. In this way, we biochemically validate 25 mRNA targets, many of which are not captured by typically tested, top-ranked computational predictions. We additionally discover direct mRNA binding activity within the GlmY terminator, highlighting the information value of high-throughput RNA accessibility data. Mapping RNA accessibility is valuable for identifying functional/regulatory RNA regions. Here the authors introduce INTERFACE, an intracellular method that quantifies antisense hybridization efficacy of any number of RNA regions simultaneously via a transcriptional elongation output, measurable via RNA-seq
Collapse
Affiliation(s)
- Mia K Mihailovic
- McKetta Department of Chemical Engineering, University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX, 78712, USA
| | - Jorge Vazquez-Anderson
- McKetta Department of Chemical Engineering, University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX, 78712, USA
| | - Yan Li
- Department of Operations Research and Financial Engineering, Princeton University, Sherrerd Hall, Charlton St., Princeton, NJ, 08544, USA
| | - Victoria Fry
- McKetta Department of Chemical Engineering, University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX, 78712, USA
| | - Praveen Vimalathas
- McKetta Department of Chemical Engineering, University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX, 78712, USA
| | - Daniel Herrera
- Department of Computer Science, University of Texas at Austin, 2317 Speedway Stop D9500, Austin, TX, 78712, USA
| | - Richard A Lease
- Department of Chemical and Biomolecular Engineering, The Ohio State University, 151W. Woodruff Ave, Columbus, OH, 43210, USA.,Department of Chemistry and Biochemistry, The Ohio State University, 100W. 18th Ave, Columbus, OH, 43210, USA
| | - Warren B Powell
- Department of Operations Research and Financial Engineering, Princeton University, Sherrerd Hall, Charlton St., Princeton, NJ, 08544, USA
| | - Lydia M Contreras
- McKetta Department of Chemical Engineering, University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX, 78712, USA.
| |
Collapse
|
18
|
Antonov I, Marakhonov A, Zamkova M, Medvedeva Y. ASSA: Fast identification of statistically significant interactions between long RNAs. J Bioinform Comput Biol 2018; 16:1840001. [PMID: 29375012 DOI: 10.1142/s0219720018400012] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The discovery of thousands of long noncoding RNAs (lncRNAs) in mammals raises a question about their functionality. It has been shown that some of them are involved in post-transcriptional regulation of other RNAs and form inter-molecular duplexes with their targets. Sequence alignment tools have been used for transcriptome-wide prediction of RNA-RNA interactions. However, such approaches have poor prediction accuracy since they ignore RNA's secondary structure. Application of the thermodynamics-based algorithms to long transcripts is not computationally feasible on a large scale. Here, we describe a new computational pipeline ASSA that combines sequence alignment and thermodynamics-based tools for efficient prediction of RNA-RNA interactions between long transcripts. To measure the hybridization strength, the sum energy of all the putative duplexes is computed. The main novelty implemented in ASSA is the ability to quickly estimate the statistical significance of the observed interaction energies. Most of the functional hybridizations between long RNAs were classified as statistically significant. ASSA outperformed 11 other tools in terms of the Area Under the Curve on two out of four test sets. Additionally, our results emphasized a unique property of the [Formula: see text] repeats with respect to the RNA-RNA interactions in the human transcriptome. ASSA is available at https://sourceforge.net/projects/assa/.
Collapse
Affiliation(s)
- Ivan Antonov
- * Institute of Bioengineering, Federal Research Center Fundamentals of Biotechnology RAS, Moscow 117312, Russia.,† Department of Molecular and Biological Physics & Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region 141701, Russia
| | - Andrey Marakhonov
- ‡ Laboratory of Functional Analysis of the Genome, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region 141701, Russia.,§ Federal State Scientific Budgetary Institution, Research Centre for Medical Genetics, Moscow 115478, Russia
| | - Maria Zamkova
- ¶ Russian N.N. Blokhin Cancer Research Center, Moscow 115478, Russia
| | - Yulia Medvedeva
- * Institute of Bioengineering, Federal Research Center Fundamentals of Biotechnology RAS, Moscow 117312, Russia.,† Department of Molecular and Biological Physics & Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region 141701, Russia.,∥ Vavilov Institute of General Genetics, RAS, Moscow 119333, Russia
| |
Collapse
|
19
|
Vazquez-Anderson J, Mihailovic MK, Baldridge KC, Reyes KG, Haning K, Cho SH, Amador P, Powell WB, Contreras LM. Optimization of a novel biophysical model using large scale in vivo antisense hybridization data displays improved prediction capabilities of structurally accessible RNA regions. Nucleic Acids Res 2017; 45:5523-5538. [PMID: 28334800 PMCID: PMC5435917 DOI: 10.1093/nar/gkx115] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 02/14/2017] [Indexed: 11/17/2022] Open
Abstract
Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA–RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA–RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA–mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs.
Collapse
Affiliation(s)
- Jorge Vazquez-Anderson
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Mia K Mihailovic
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Kevin C Baldridge
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Kristofer G Reyes
- Department of Operations Research and Financial Engineering, Princeton University, Sherrerd Hall, Charlton St., Princeton, NJ 08544, USA
| | - Katie Haning
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Seung Hee Cho
- Institute for Cellular & Molecular Biology, The University of Texas at Austin, 2500 Speedway, Stop A4800, Austin, TX 78712, USA
| | - Paul Amador
- Institute for Cellular & Molecular Biology, The University of Texas at Austin, 2500 Speedway, Stop A4800, Austin, TX 78712, USA
| | - Warren B Powell
- Department of Operations Research and Financial Engineering, Princeton University, Sherrerd Hall, Charlton St., Princeton, NJ 08544, USA
| | - Lydia M Contreras
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| |
Collapse
|
20
|
Umu SU, Gardner PP. A comprehensive benchmark of RNA-RNA interaction prediction tools for all domains of life. Bioinformatics 2017; 33:988-996. [PMID: 27993777 PMCID: PMC5408919 DOI: 10.1093/bioinformatics/btw728] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Accepted: 11/13/2016] [Indexed: 12/15/2022] Open
Abstract
Motivation The aim of this study is to assess the performance of RNA-RNA interaction prediction tools for all domains of life. Results Minimum free energy (MFE) and alignment methods constitute most of the current RNA interaction prediction algorithms. The MFE tools that include accessibility (i.e. RNAup, IntaRNA and RNAplex) to the final predicted binding energy have better true positive rates (TPRs) with a high positive predictive values (PPVs) in all datasets than other methods. They can also differentiate almost half of the native interactions from background. The algorithms that include effects of internal binding energies to their model and alignment methods seem to have high TPR but relatively low associated PPV compared to accessibility based methods. Availability and Implementation We shared our wrapper scripts and datasets at Github (github.com/UCanCompBio/RNA_Interactions_Benchmark). All parameters are documented for personal use. Contact sinan.umu@pg.canterbury.ac.nz. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sinan Ugur Umu
- School of Biological Sciences.,Biomolecular Interaction Centre
| | - Paul P Gardner
- School of Biological Sciences.,Biomolecular Interaction Centre.,Bio-Protection Research Centre, University of Canterbury, Christchurch, New Zealand
| |
Collapse
|
21
|
Abstract
RNA-RNA binding is a required step for many regulatory and catalytic processes in the cell. Identifying RNA-RNA hybridization sites is challenging because of the competition between intramolecular and intermolecular structure formation. A complete picture of RNA-RNA binding includes an understanding of single-stranded folding and binding site accessibility, and is strongly concentration-dependent. This chapter provides guidance for using RNAstructure to predict RNA-RNA binding sites and RNA-RNA structures, utilizing free energy minimization and partition function calculations. RNAstructure is freely available at http://rna.urmc.rochester.edu/RNAstructure.html .
Collapse
Affiliation(s)
- Laura DiChiacchio
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA
- Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA.
- Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA.
| |
Collapse
|