1
|
Takizawa H, Iwakiri J, Asai K. RintC: fast and accuracy-aware decomposition of distributions of RNA secondary structures with extended logsumexp. BMC Bioinformatics 2020; 21:210. [PMID: 32448174 PMCID: PMC7245837 DOI: 10.1186/s12859-020-3535-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Accepted: 05/05/2020] [Indexed: 11/22/2022] Open
Abstract
Background Analysis of secondary structures is essential for understanding the functions of RNAs. Because RNA molecules thermally fluctuate, it is necessary to analyze the probability distributions of their secondary structures. Existing methods, however, are not applicable to long RNAs owing to their high computational complexity. Additionally, previous research has suffered from two numerical difficulties: overflow and significant numerical errors. Result In this research, we reduced the computational complexity of calculating the landscape of the probability distribution of secondary structures by introducing a maximum-span constraint. In addition, we resolved numerical computation problems through two techniques: extended logsumexp and accuracy-guaranteed numerical computation. We analyzed the stability of the secondary structures of 16S ribosomal RNAs at various temperatures without overflow. The results obtained are consistent with previous research on thermophilic bacteria, suggesting that our method is applicable in thermal stability analysis. Furthermore, we quantitatively assessed numerical stability using our method.. Conclusion These results demonstrate that the proposed method is applicable to long RNAs..
Collapse
Affiliation(s)
- Hiroki Takizawa
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Junichi Iwakiri
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Kiyoshi Asai
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan. .,Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan.
| |
Collapse
|
2
|
Steger G, Riesner D. Viroid research and its significance for RNA technology and basic biochemistry. Nucleic Acids Res 2019; 46:10563-10576. [PMID: 30304486 PMCID: PMC6237808 DOI: 10.1093/nar/gky903] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Accepted: 09/24/2018] [Indexed: 12/27/2022] Open
Abstract
Viroids were described 47 years ago as the smallest RNA molecules capable of infecting plants and autonomously self-replicating without an encoded protein. Work on viroids initiated the development of a number of innovative methods. Novel chromatographic and gelelectrophoretic methods were developed for the purification and characterization of viroids; these methods were later used in molecular biology, gene technology and in prion research. Theoretical and experimental studies of RNA folding demonstrated the general biological importance of metastable structures, and nuclear magnetic resonance spectroscopy of viroid RNA showed the partially covalent nature of hydrogen bonds in biological macromolecules. RNA biochemistry and molecular biology profited from viroid research, such as in the detection of RNA as template of DNA-dependent polymerases and in mechanisms of gene silencing. Viroids, the first circular RNA detected in nature, are important for studies on the much wider spectrum of circular RNAs and other non-coding RNAs.
Collapse
Affiliation(s)
- Gerhard Steger
- Department of Biology, Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Universitätsstr. 1, 40225 Düsseldorf, Germany
| | - Detlev Riesner
- Department of Biology, Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Universitätsstr. 1, 40225 Düsseldorf, Germany
| |
Collapse
|
3
|
Fukunaga T, Hamada M. Computational approaches for alternative and transient secondary structures of ribonucleic acids. Brief Funct Genomics 2018; 18:182-191. [PMID: 30689706 DOI: 10.1093/bfgp/ely042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Transient and alternative structures of ribonucleic acids (RNAs) play essential roles in various regulatory processes, such as translation regulation in living cells. Because experimental analyses for RNA structures are difficult and time-consuming, computational approaches based on RNA secondary structures are promising. In this article, we review computational methods for detecting and analyzing transient/alternative secondary structures of RNAs, including static approaches based on probabilistic distributions of RNA secondary structures and dynamic approaches such as kinetic folding and folding pathway predictions.
Collapse
|
4
|
Clote P, Bayegan AH. RNA folding kinetics using Monte Carlo and Gillespie algorithms. J Math Biol 2017; 76:1195-1227. [PMID: 28780735 DOI: 10.1007/s00285-017-1169-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 07/09/2017] [Indexed: 11/26/2022]
Abstract
RNA secondary structure folding kinetics is known to be important for the biological function of certain processes, such as the hok/sok system in E. coli. Although linear algebra provides an exact computational solution of secondary structure folding kinetics with respect to the Turner energy model for tiny ([Formula: see text]20 nt) RNA sequences, the folding kinetics for larger sequences can only be approximated by binning structures into macrostates in a coarse-grained model, or by repeatedly simulating secondary structure folding with either the Monte Carlo algorithm or the Gillespie algorithm. Here we investigate the relation between the Monte Carlo algorithm and the Gillespie algorithm. We prove that asymptotically, the expected time for a K-step trajectory of the Monte Carlo algorithm is equal to [Formula: see text] times that of the Gillespie algorithm, where [Formula: see text] denotes the Boltzmann expected network degree. If the network is regular (i.e. every node has the same degree), then the mean first passage time (MFPT) computed by the Monte Carlo algorithm is equal to MFPT computed by the Gillespie algorithm multiplied by [Formula: see text]; however, this is not true for non-regular networks. In particular, RNA secondary structure folding kinetics, as computed by the Monte Carlo algorithm, is not equal to the folding kinetics, as computed by the Gillespie algorithm, although the mean first passage times are roughly correlated. Simulation software for RNA secondary structure folding according to the Monte Carlo and Gillespie algorithms is publicly available, as is our software to compute the expected degree of the network of secondary structures of a given RNA sequence-see http://bioinformatics.bc.edu/clote/RNAexpNumNbors .
Collapse
Affiliation(s)
- Peter Clote
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA.
| | - Amir H Bayegan
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA
| |
Collapse
|
5
|
Michálik J, Touzet H, Ponty Y. Efficient approximations of RNA kinetics landscape using non-redundant sampling. Bioinformatics 2017; 33:i283-i292. [PMID: 28882001 PMCID: PMC5870705 DOI: 10.1093/bioinformatics/btx269] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
MOTIVATION Kinetics is key to understand many phenomena involving RNAs, such as co-transcriptional folding and riboswitches. Exact out-of-equilibrium studies induce extreme computational demands, leading state-of-the-art methods to rely on approximated kinetics landscapes, obtained using sampling strategies that strive to generate the key landmarks of the landscape topology. However, such methods are impeded by a large level of redundancy within sampled sets. Such a redundancy is uninformative, and obfuscates important intermediate states, leading to an incomplete vision of RNA dynamics. RESULTS We introduce RNANR, a new set of algorithms for the exploration of RNA kinetics landscapes at the secondary structure level. RNANR considers locally optimal structures, a reduced set of RNA conformations, in order to focus its sampling on basins in the kinetic landscape. Along with an exhaustive enumeration, RNANR implements a novel non-redundant stochastic sampling, and offers a rich array of structural parameters. Our tests on both real and random RNAs reveal that RNANR allows to generate more unique structures in a given time than its competitors, and allows a deeper exploration of kinetics landscapes. AVAILABILITY AND IMPLEMENTATION RNANR is freely available at https://project.inria.fr/rnalands/rnanr . CONTACT yann.ponty@lix.polytechnique.fr.
Collapse
Affiliation(s)
- Juraj Michálik
- AMIB project, Inria Saclay, Palaiseau, France
- LIX CNRS UMR 7161, Ecole Polytechnique, Palaiseau, France
| | - Hélène Touzet
- CNRS, CRIStAL (UMR 9189, University of Lille), Villeneuve d’Ascq, France
- Bonsai project, Inria Lille-Nord Europe, Villeneuve d’Ascq, France
| | - Yann Ponty
- AMIB project, Inria Saclay, Palaiseau, France
- LIX CNRS UMR 7161, Ecole Polytechnique, Palaiseau, France
| |
Collapse
|
6
|
Li J, Xu C, Wang L, Liang H, Feng W, Cai Z, Wang Y, Cong W, Liu Y. PSRna: Prediction of small RNA secondary structures based on reverse complementary folding method. J Bioinform Comput Biol 2016; 14:1643001. [PMID: 27045556 DOI: 10.1142/s0219720016430010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Prediction of RNA secondary structures is an important problem in computational biology and bioinformatics, since RNA secondary structures are fundamental for functional analysis of RNA molecules. However, small RNA secondary structures are scarce and few algorithms have been specifically designed for predicting the secondary structures of small RNAs. Here we propose an algorithm named "PSRna" for predicting small-RNA secondary structures using reverse complementary folding and characteristic hairpin loops of small RNAs. Unlike traditional algorithms that usually generate multi-branch loops and 5[Formula: see text] end self-folding, PSRna first estimated the maximum number of base pairs of RNA secondary structures based on the dynamic programming algorithm and a path matrix is constructed at the same time. Second, the backtracking paths are extracted from the path matrix based on backtracking algorithm, and each backtracking path represents a secondary structure. To improve accuracy, the predicted RNA secondary structures are filtered based on their free energy, where only the secondary structure with the minimum free energy was identified as the candidate secondary structure. Our experiments on real data show that the proposed algorithm is superior to two popular methods, RNAfold and RNAstructure, in terms of sensitivity, specificity and Matthews correlation coefficient (MCC).
Collapse
Affiliation(s)
- Jin Li
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Chengzhen Xu
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Lei Wang
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Hong Liang
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Weixing Feng
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Zhongxi Cai
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Ying Wang
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Wang Cong
- * College of Automation, Harbin Engineering University, Harbin, P. R. China
| | - Yunlong Liu
- * College of Automation, Harbin Engineering University, Harbin, P. R. China.,† Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 410 West 10th Street, Suite 5000, Indianapolis, IN 46202, USA
| |
Collapse
|
7
|
Abstract
In this article, we introduce the software suite Hermes, which provides fast, novel algorithms for RNA secondary structure kinetics. Using the fast Fourier transform to efficiently compute the Boltzmann probability that a secondary structure S of a given RNA sequence has base pair distance x (resp. y) from reference structure A (resp. B), Hermes computes the exact kinetics of folding from A to B in this coarse-grained model. In particular, Hermes computes the mean first passage time from the transition probability matrix by using matrix inversion, and also computes the equilibrium time from the rate matrix by using spectral decomposition. Due to the model granularity and the speed of Hermes, it is capable of determining secondary structure refolding kinetics for large RNA sequences, beyond the range of other methods. Comparative benchmarking of Hermes with other methods indicates that Hermes provides refolding kinetics of accuracy suitable for use in the computational design of RNA, an important area of synthetic biology. Source code and documentation for Hermes are available.
Collapse
Affiliation(s)
- Evan Senter
- Department of Biology, Boston College , Chestnut Hill, Massachusetts
| | | |
Collapse
|