1
|
Abdel-Maksoud MA, Askar MA, Abdel-rahman IY, Gharib M, Aufy M. Integrating Network Pharmacology and Molecular Docking Approach to Elucidate the Mechanism of Commiphora wightii for the Treatment of Rheumatoid Arthritis. Bioinform Biol Insights 2024; 18:11779322241247634. [PMID: 38765022 PMCID: PMC11102677 DOI: 10.1177/11779322241247634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 03/28/2024] [Indexed: 05/21/2024] Open
Abstract
Background Rheumatoid arthritis (RA) is considered a notable prolonged inflammatory condition with no proper cure. Synovial inflammation and synovial pannus are crucial in the onset of RA. The "tumor-like" invading proliferation of new arteries is a keynote of RA. Commiphora wightii (C wightii) is a perennial, deciduous, and trifoliate plant used in several areas of southeast Asia to cure numerous ailments, including arthritis, diabetes, obesity, and asthma. Several in vitro investigations have indicated C wightii's therapeutic efficacy in the treatment of arthritis. However, the precise molecular action is yet unknown. Material and methods In this study, a network pharmacology approach was applied to uncover potential targets, active therapeutic ingredients and signaling pathways in C wightii for the treatment of arthritis. In the groundwork of this research, we examined the active constituent-compound-target-pathway network and evaluated that (Guggulsterol-V, Myrrhahnone B, and Campesterol) decisively donated to the development of arthritis by affecting tumor necrosis factor (TNF), PIK3CA, and MAPK3 genes. Later on, docking was employed to confirm the active components' efficiency against the potential targets. Results According to molecular-docking research, several potential targets of RA bind tightly with the corresponding key active ingredient of C wightii. With the aid of network pharmacology techniques, we conclude that the signaling pathways and biological processes involved in C wightii had an impact on the prevention of arthritis. The outcomes of molecular docking also serve as strong recommendations for future research. In the context of this study, network pharmacology combined with molecular docking analysis showed that C wightii acted on arthritis-related signaling pathways to exhibit a promising preventive impact on arthritis. Conclusion These results serve as the basis for grasping the mechanism of the antiarthritis activity of C wightii. However, further in vivo/in vitro study is needed to verify the reliability of these targets for the treatment of arthritis.
Collapse
Affiliation(s)
- Mostafa A Abdel-Maksoud
- Department of Botany and Microbiology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Mostafa A Askar
- Radiation Biology Department, National Centre for Radiation Research and Technology (NCRRT), Egyptian Atomic Energy Authority (EAEA), Cairo, Egypt
| | - Ibrahim Y Abdel-rahman
- Radiation Biology Department, National Centre for Radiation Research and Technology (NCRRT), Egyptian Atomic Energy Authority (EAEA), Cairo, Egypt
| | - Mustafa Gharib
- Radiation Biology Department, National Centre for Radiation Research and Technology (NCRRT), Egyptian Atomic Energy Authority (EAEA), Cairo, Egypt
| | - Mohammed Aufy
- Division of Pharmacology and Toxicology, Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
| |
Collapse
|
2
|
Rao A, Gollapalli P, Shetty NP. Gene expression profile analysis unravelled the systems level association of renal cell carcinoma with diabetic nephropathy and Matrix-metalloproteinase-9 as a potential therapeutic target. J Biomol Struct Dyn 2023; 41:7535-7550. [PMID: 36106961 DOI: 10.1080/07391102.2022.2122567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 09/03/2022] [Indexed: 10/14/2022]
Abstract
Type 2 diabetes (T2D) and cancer share many common risk factors. However, the potential biological link that connects the two at the molecular level is still unclear. The experimental evidence suggests that several genes and their pathways may be involved in developing cancerous conditions associated with diabetes. In this study, we identified the protein-protein interaction (PPI) networks and the hub protein(s) that interlink T2D and cancer using genome-scale differential gene expression profiles. Further, the PPI network of AMP-activated protein kinase (AMPK) in cancer was analyzed to explore novel insights into the molecular association between the two conditions. The densely connected regions were analyzed by constructing the backbone and subnetworks with key nodes and shortest pathways, respectively. The PPI network studies identified Matrix-metalloproteinase-9 (MMP-9) as a hub protein playing a vital role in glomerulonephritis tubular diseases and some genetic kidney diseases. MMP-9 was also associated with different growth factors, like tumor necrosis factor (TNF-α), transforming growth factor 1 (TGF-1), and pathways like chemokine signaling, NOD-like receptor signaling, etc. Further, the molecular docking and molecular dynamic simulation studies supported the druggability of MMP-9, suggesting it as a potential therapeutic target in treating renal cell carcinoma linked with diabetic kidney disease.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Aditya Rao
- Plant Cell Biotechnology Department, CSIR-Central Food Technological Research Institute, Mysore, Karnataka, India
| | - Pavan Gollapalli
- Center for Bioinformatics and Biostatistics, Nitte (Deemed to be University), Mangalore, Karnataka, India
| | - Nandini Prasad Shetty
- Plant Cell Biotechnology Department, CSIR-Central Food Technological Research Institute, Mysore, Karnataka, India
| |
Collapse
|
3
|
Li C, Feng C, Ma G, Fu S, Chen M, Zhang W, Li J. Time-course RNA-seq analysis reveals stage-specific and melatonin-triggered gene expression patterns during the hair follicle growth cycle in Capra hircus. BMC Genomics 2022; 23:140. [PMID: 35172715 PMCID: PMC8848980 DOI: 10.1186/s12864-022-08331-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 01/19/2022] [Indexed: 12/29/2022] Open
Abstract
Background Cashmere goat is famous for its high-quality fibers. The growth of cashmere in secondary hair follicles exhibits a seasonal pattern arising from circannual changes in the natural photoperiod. Although several studies have compared and analyzed the differences in gene expression between different hair follicle growth stages, the selection of samples in these studies relies on research experience or morphological evidence. Distinguishing hair follicle growth cycle according to gene expression patterns may help to explore the regulation mechanisms related to cashmere growth and the effect of melatonin from a molecular level more accurately. Results In this study, we applied RNA-sequencing to the hair follicles of three normal and three melatonin-treated Inner Mongolian cashmere goats sampled every month during a whole hair follicle growth cycle. A total of 3559 and 988 genes were subjected as seasonal changing genes (SCGs) in the control and treated groups, respectively. The SCGs in the normal group were divided into three clusters, and their specific expression patterns help to group the hair follicle growth cycle into anagen, catagen and telogen stages. Some canonical pathways such as Wnt, TGF-beta and Hippo signaling pathways were detected as promoting the hair follicle growth, while Cell adhesion molecules (CAMs), Cytokine-cytokine receptor interaction, Jak-STAT, Fc epsilon RI, NOD-like receptor, Rap1, PI3K-Akt, cAMP, NF-kappa B and many immune-related pathways were detected in the catagen and telogen stages. The PI3K-Akt signaling, ECM-receptor interaction and Focal adhesion were found in the transition stage between telogen to anagen, which may serve as candidate biomarkers for telogen-anagen regeneration. A total of 16 signaling pathways, 145 pathway mRNAs, and 93 lncRNAs were enrolled to construct the pathway-mRNA-lncRNA network, which indicated the function of lncRNAs through interacting with their co-expressed mRNAs. Pairwise comparisons between the control and melatonin-treated groups also indicated 941 monthly differentially expressed genes (monthly DEGs). These monthly DEGs were mainly distributed from April and September, which revealed a potential signal pathway map regulating the anagen stage triggered by melatonin. Enrichment analysis showed that Wnt, Hedgehog, ECM, Chemokines and NF-kappa B signaling pathways may be involved in the regulation of non-quiescence and secondary shedding under the influence of melatonin. Conclusions Our study decoded the key regulators of the whole hair follicle growth cycle, laying the foundation for the control of hair follicle growth and improvement of cashmere yield. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08331-z.
Collapse
Affiliation(s)
- Chun Li
- College of Animal Science and Technology, Inner Mongolia Minzu University, Tongliao, 028000, China
| | - Cong Feng
- College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Guangyuan Ma
- College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Shaoyin Fu
- Inner Mongolia Academy of Agricultural & Animal Husbandry Sciences, Hohhot, 010018, China
| | - Ming Chen
- College of Life Sciences, Zhejiang University, Hangzhou, 310058, China. .,College of Life Science and Food Engineering, Inner Mongolia Minzu University, Tongliao, 028000, China.
| | - Wenguang Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, 010018, China.
| | - Jinquan Li
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, 010018, China.
| |
Collapse
|
4
|
Deep Learning Method for RNA Secondary Structure Prediction with Pseudoknots Based on Large-Scale Data. JOURNAL OF HEALTHCARE ENGINEERING 2021. [DOI: 10.1155/2021/6699996] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Traditional machine learning methods are widely used in the field of RNA secondary structure prediction and have achieved good results. However, with the emergence of large-scale data, deep learning methods have more advantages than traditional machine learning methods. As the number of network layers increases in deep learning, there will often be problems such as increased parameters and overfitting. We used two deep learning models, GoogLeNet and TCN, to predict RNA secondary results. And from the perspective of the depth and width of the network, improvements are made based on the neural network model, which can effectively improve the computational efficiency while extracting more feature information. We process the existing real RNA data through experiments, use deep learning models to extract useful features from a large amount of RNA sequence data and structure data, and then predict the extracted features to obtain each base’s pairing probability. The characteristics of RNA secondary structure and dynamic programming methods are used to process the base prediction results, and the structure with the largest sum of the probability of each base pairing is obtained, and this structure will be used as the optimal RNA secondary structure. We, respectively, evaluated GoogLeNet and TCN models based on 5sRNA, tRNA data, and tmRNA data, and compared them with other standard prediction algorithms. The sensitivity and specificity of the GoogLeNet model on the 5sRNA and tRNA data sets are about 16% higher than the best prediction results in other algorithms. The sensitivity and specificity of the GoogLeNet model on the tmRNA dataset are about 9% higher than the best prediction results in other algorithms. As deep learning algorithms’ performance is related to the size of the data set, as the scale of RNA data continues to expand, the prediction accuracy of deep learning methods for RNA secondary structure will continue to improve.
Collapse
|
5
|
The histone methyltransferase inhibitor A-366 enhances hemoglobin expression in erythroleukemia cells upon co-exposure with chemical inducers in culture. ACTA ACUST UNITED AC 2021; 28:2. [PMID: 33407944 PMCID: PMC7788816 DOI: 10.1186/s40709-020-00132-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Revised: 12/20/2020] [Accepted: 12/23/2020] [Indexed: 01/02/2023]
Abstract
Background Erythroleukemia is caused by the uncontrolled multiplication of immature erythroid progenitor cells which fail to differentiate into erythrocytes. By directly targeting this class of malignant cells, the induction of terminal erythroid differentiation represents a vital therapeutic strategy for this disease. Erythroid differentiation involves the execution of a well-orchestrated gene expression program in which epigenetic enzymes play critical roles. In order to identify novel epigenetic mediators of differentiation, this study explores the effects of multiple, highly specific, epigenetic enzyme inhibitors, in murine and human erythroleukemia cell lines. Results We used a group of compounds designed to uniquely target the following epigenetic enzymes: G9a/GLP, EZH1/2, SMYD2, PRMT3, WDR5, SETD7, SUV420H1 and DOT1L. The majority of the probes had a negative impact on both cell proliferation and differentiation. On the contrary, one of the compounds, A-366, demonstrated the opposite effect by promoting erythroid differentiation of both cell models. A-366 is a selective inhibitor of the G9a methyltransferase and the chromatin reader Spindlin1. Investigation of the molecular mechanism of action revealed that A-366 forced cells to exit from the cell cycle, a fact that favored erythroid differentiation. Further analysis led to the identification of a group of genes that mediate the A-366 effects and include CDK2, CDK4 and CDK6. Conclusions A-366, a selective inhibitor of G9a and Spindlin1, demonstrates a compelling role in the erythroid maturation process by promoting differentiation, a fact that is highly beneficial for patients suffering from erythroleukemia. In conclusion, this data calls for further investigation towards the delivery of epigenetic drugs and especially A-366 in hematopoietic disorders.
Collapse
|
6
|
Afanasyeva A, Nagao C, Mizuguchi K. Prediction of the secondary structure of short DNA aptamers. Biophys Physicobiol 2019; 16:287-294. [PMID: 31984183 PMCID: PMC6975895 DOI: 10.2142/biophysico.16.0_287] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 07/29/2019] [Indexed: 12/20/2022] Open
Abstract
Aptamers have a spectrum of applications in biotechnology and drug design, because of the relative simplicity of experimental protocols and advantages of stability and specificity associated with their structural properties. However, to understand the structure-function relationships of aptamers, robust structure modeling tools are necessary. Several such tools have been developed and extensively tested, although most of them target various forms of biological RNA. In this study, we tested the performance of three tools in application to DNA aptamers, since DNA aptamers are the focus of many studies, particularly in drug discovery. We demonstrated that in most cases, the secondary structure of DNA can be reconstructed with acceptable accuracy by at least one of the three tools tested (Mfold, RNAfold, and CentroidFold), although the G-quadruplex motif found in many of the DNA aptamer structures complicates the prediction, as well as the pseudoknot interaction. This problem should be addressed more carefully to improve prediction accuracy.
Collapse
Affiliation(s)
- Arina Afanasyeva
- Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka 567-0085, Japan
| | - Chioko Nagao
- Laboratory of In-silico Drug Design, Center for Drug Design Research (CDDR), National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka 567-0085, Japan
| | - Kenji Mizuguchi
- Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka 567-0085, Japan.,Laboratory of In-silico Drug Design, Center for Drug Design Research (CDDR), National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka 567-0085, Japan
| |
Collapse
|
7
|
Jabbari H, Wark I, Montemagno C, Will S. Knotty: efficient and accurate prediction of complex RNA pseudoknot structures. Bioinformatics 2019; 34:3849-3856. [PMID: 29868872 DOI: 10.1093/bioinformatics/bty420] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Accepted: 05/24/2018] [Indexed: 12/14/2022] Open
Abstract
Motivation The computational prediction of RNA secondary structure by free energy minimization has become an important tool in RNA research. However in practice, energy minimization is mostly limited to pseudoknot-free structures or rather simple pseudoknots, not covering many biologically important structures such as kissing hairpins. Algorithms capable of predicting sufficiently complex pseudoknots (for sequences of length n) used to have extreme complexities, e.g. Pknots has O(n6) time and O(n4) space complexity. The algorithm CCJ dramatically improves the asymptotic run time for predicting complex pseudoknots (handling almost all relevant pseudoknots, while being slightly less general than Pknots), but this came at the cost of large constant factors in space and time, which strongly limited its practical application (∼200 bases already require 256 GB space). Results We present a CCJ-type algorithm, Knotty, that handles the same comprehensive pseudoknot class of structures as CCJ with improved space complexity of Θ(n3+Z)-due to the applied technique of sparsification, the number of 'candidates', Z, appears to grow significantly slower than n4 on our benchmark set (which include pseudoknotted RNAs up to 400 nt). In terms of run time over this benchmark, Knotty clearly outperforms Pknots and the original CCJ implementation, CCJ 1.0; Knotty's space consumption fundamentally improves over CCJ 1.0, being on a par with the space-economic Pknots. By comparing to CCJ 2.0, our unsparsified Knotty variant, we demonstrate the isolated effect of sparsification. Moreover, Knotty employs the state-of-the-art energy model of 'HotKnots DP09', which results in superior prediction accuracy over Pknots. Availability and implementation Our software is available at https://github.com/HosnaJabbari/Knotty. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hosna Jabbari
- Department of Computer Science, University of Vermont, Burlington, VT, USA
| | - Ian Wark
- Ingenuity Lab, Department of Chemical and Materials Engineering, University of Alberta, Edmonton, Canada
| | - Carlo Montemagno
- Department of Electrical and Computer Engineering, Southern Illinois University, Carbondale, IL, USA
| | - Sebastian Will
- Theoretical Biochemistry Group (TBI), Institute for Theoretical Chemistry, University of Vienna, Vienna, Austria
| |
Collapse
|
8
|
Jabbari H, Wark I, Montemagno C. RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model. PLoS One 2018; 13:e0194583. [PMID: 29621250 PMCID: PMC5886407 DOI: 10.1371/journal.pone.0194583] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 03/06/2018] [Indexed: 11/25/2022] Open
Abstract
Motivation RNA is a biopolymer with various applications inside the cell and in biotechnology. Structure of an RNA molecule mainly determines its function and is essential to guide nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is relatively simpler than its tertiary structure and provides information about its tertiary structure, therefore, RNA secondary structure prediction has received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off on computational cost of the prediction algorithm versus the generality of the method. Results The aim of this work is to argue when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in prediction accuracy of a given method as the underlying energy model is also as of great value. Therefore we encourage researchers to pay particular attention in comparing methods with different energy models.
Collapse
Affiliation(s)
- Hosna Jabbari
- Department of Computer Science, University of Vermont, Burlington, Vermont, United States of America
- * E-mail:
| | - Ian Wark
- Ingenuity Lab, Department of Chemical and Materials Engineering, University of Alberta, Edmonton, Alberta, Canada
| | - Carlo Montemagno
- Southern Illinois University Carbondale, Carbondale, Illinois, United States of America
| |
Collapse
|
9
|
Legendre A, Angel E, Tahi F. Bi-objective integer programming for RNA secondary structure prediction with pseudoknots. BMC Bioinformatics 2018; 19:13. [PMID: 29334887 PMCID: PMC5769536 DOI: 10.1186/s12859-018-2007-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Accepted: 01/02/2018] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND RNA structure prediction is an important field in bioinformatics, and numerous methods and tools have been proposed. Pseudoknots are specific motifs of RNA secondary structures that are difficult to predict. Almost all existing methods are based on a single model and return one solution, often missing the real structure. An alternative approach would be to combine different models and return a (small) set of solutions, maximizing its quality and diversity in order to increase the probability that it contains the real structure. RESULTS We propose here an original method for predicting RNA secondary structures with pseudoknots, based on integer programming. We developed a generic bi-objective integer programming algorithm allowing to return optimal and sub-optimal solutions optimizing simultaneously two models. This algorithm was then applied to the combination of two known models of RNA secondary structure prediction, namely MEA and MFE. The resulting tool, called BiokoP, is compared with the other methods in the literature. The results show that the best solution (structure with the highest F1-score) is, in most cases, given by BiokoP. Moreover, the results of BiokoP are homogeneous, regardless of the pseudoknot type or the presence or not of pseudoknots. Indeed, the F1-scores are always higher than 70% for any number of solutions returned. CONCLUSION The results obtained by BiokoP show that combining the MEA and the MFE models, as well as returning several optimal and several sub-optimal solutions, allow to improve the prediction of secondary structures. One perspective of our work is to combine better mono-criterion models, in particular to combine a model based on the comparative approach with the MEA and the MFE models. This leads to develop in the future a new multi-objective algorithm to combine more than two models. BiokoP is available on the EvryRNA platform: https://EvryRNA.ibisc.univ-evry.fr .
Collapse
Affiliation(s)
- Audrey Legendre
- IBISC, Univ Evry, Université Paris-Saclay, Evry, 91025, France
| | - Eric Angel
- IBISC, Univ Evry, Université Paris-Saclay, Evry, 91025, France
| | - Fariza Tahi
- IBISC, Univ Evry, Université Paris-Saclay, Evry, 91025, France.
| |
Collapse
|
10
|
Chen J, Gong S, Wang Y, Zhang W. Kinetic partitioning mechanism of HDV ribozyme folding. J Chem Phys 2014; 140:025102. [DOI: 10.1063/1.4861037] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
|
11
|
Gupta A, Rahman R, Li K, Gribskov M. Identifying complete RNA structural ensembles including pseudoknots. RNA Biol 2012; 9:187-99. [PMID: 22418849 DOI: 10.4161/rna.18386] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The close relationship between RNA structure and function underlines the significance of accurately predicting RNA structures from sequence information. Structural topologies such as pseudoknots are of particular interest due to their ubiquity and direct involvement in RNA function, but identifying pseudoknots is a computationally challenging problem and existing heuristic approaches usually perform poorly for RNA sequences of even a few hundred bases. We survey the performance of pseudoknot prediction methods on a data set of full-length RNA sequences representing varied sequence lengths, and biological RNA classes such as RNase P RNA, Group I Intron, tmRNA and tRNA. Pseudoknot prediction methods are compared with minimum free energy and suboptimal secondary structure prediction methods in terms of correct base-pairs, stems and pseudoknots and we find that the ensemble of suboptimal structure predictions succeeds in identifying correct structural elements in RNA that are usually missed in MFE and pseudoknot predictions. We propose a strategy to identify a comprehensive set of non-redundant stems in the suboptimal structure space of a RNA molecule by applying heuristics that reduce the structural redundancy of the predicted suboptimal structures by merging slightly varying stems that are predicted to form in local sequence regions. This reduced-redundancy set of structural elements consistently outperforms more specialized approaches.in data sets. Thus, the suboptimal folding space can be used to represent the structural diversity of an RNA molecule more comprehensively than optimal structure prediction approaches alone.
Collapse
Affiliation(s)
- Aditi Gupta
- Department of Biological Sciences; Purdue University; West Lafayette, IN, USA
| | | | | | | |
Collapse
|
12
|
Sato K, Kato Y, Hamada M, Akutsu T, Asai K. IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming. ACTA ACUST UNITED AC 2011; 27:i85-93. [PMID: 21685106 PMCID: PMC3117384 DOI: 10.1093/bioinformatics/btr215] [Citation(s) in RCA: 181] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
MOTIVATION Pseudoknots found in secondary structures of a number of functional RNAs play various roles in biological processes. Recent methods for predicting RNA secondary structures cover certain classes of pseudoknotted structures, but only a few of them achieve satisfying predictions in terms of both speed and accuracy. RESULTS We propose IPknot, a novel computational method for predicting RNA secondary structures with pseudoknots based on maximizing expected accuracy of a predicted structure. IPknot decomposes a pseudoknotted structure into a set of pseudoknot-free substructures and approximates a base-pairing probability distribution that considers pseudoknots, leading to the capability of modeling a wide class of pseudoknots and running quite fast. In addition, we propose a heuristic algorithm for refining base-paring probabilities to improve the prediction accuracy of IPknot. The problem of maximizing expected accuracy is solved by using integer programming with threshold cut. We also extend IPknot so that it can predict the consensus secondary structure with pseudoknots when a multiple sequence alignment is given. IPknot is validated through extensive experiments on various datasets, showing that IPknot achieves better prediction accuracy and faster running time as compared with several competitive prediction methods. AVAILABILITY The program of IPknot is available at http://www.ncrna.org/software/ipknot/. IPknot is also available as a web server at http://rna.naist.jp/ipknot/. CONTACT satoken@k.u-tokyo.ac.jp; ykato@is.naist.jp SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kengo Sato
- Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan.
| | | | | | | | | |
Collapse
|
13
|
Sperschneider J, Datta A. DotKnot: pseudoknot prediction using the probability dot plot under a refined energy model. Nucleic Acids Res 2010; 38:e103. [PMID: 20123730 PMCID: PMC2853144 DOI: 10.1093/nar/gkq021] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
RNA pseudoknots are functional structure elements with key roles in viral and cellular processes. Prediction of a pseudoknotted minimum free energy structure is an NP-complete problem. Practical algorithms for RNA structure prediction including restricted classes of pseudoknots suffer from high runtime and poor accuracy for longer sequences. A heuristic approach is to search for promising pseudoknot candidates in a sequence and verify those. Afterwards, the detected pseudoknots can be further analysed using bioinformatics or laboratory techniques. We present a novel pseudoknot detection method called DotKnot that extracts stem regions from the secondary structure probability dot plot and assembles pseudoknot candidates in a constructive fashion. We evaluate pseudoknot free energies using novel parameters, which have recently become available. We show that the conventional probability dot plot makes a wide class of pseudoknots including those with bulged stems manageable in an explicit fashion. The energy parameters now become the limiting factor in pseudoknot prediction. DotKnot is an efficient method for long sequences, which finds pseudoknots with higher accuracy compared to other known prediction algorithms. DotKnot is accessible as a web server at http://dotknot.csse.uwa.edu.au.
Collapse
Affiliation(s)
- Jana Sperschneider
- School of Computer Science and Software Engineering, The University of Western Australia, Perth, WA 6009, Australia.
| | | |
Collapse
|
14
|
|
15
|
Euprosterna elaeasa virus genome sequence and evolution of the Tetraviridae family: emergence of bipartite genomes and conservation of the VPg signal with the dsRNA Birnaviridae family. Virology 2009; 397:145-54. [PMID: 19954807 DOI: 10.1016/j.virol.2009.10.042] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2009] [Revised: 10/08/2009] [Accepted: 10/28/2009] [Indexed: 11/21/2022]
Abstract
The Tetraviridae is a family of non-enveloped positive-stranded RNA insect viruses that is defined by the T=4 symmetry of virions. We report the complete Euprosterna elaeasa virus (EeV) genome sequence of 5698 nt with no poly(A) tail and two overlapping open reading frames, encoding the replicase and capsid precursor, with approximately 67% amino acid identity to Thosea asigna virus (TaV). The N-terminally positioned 17 kDa protein is released from the capsid precursor by a NPGP motif. EeV has 40 nm non-enveloped isometric particles composed of 58 and 7 kDa proteins. The 3'-end of TaV/EeV is predicted to form a conserved pseudoknot. Replicases of TaV and EeV include a newly delineated VPg signal mediating the protein priming of RNA synthesis in dsRNA Birnaviridae. Results of rooted phylogenetic analysis of replicase and capsid proteins are presented to implicate recombination between monopartite tetraviruses, involving autonomization of a sgRNA, in the emergence of bipartite tetraviruses. They are also used to revise the Tetraviridae taxonomy.
Collapse
|
16
|
Cao S, Chen SJ. Predicting structures and stabilities for H-type pseudoknots with interhelix loops. RNA (NEW YORK, N.Y.) 2009; 15:696-706. [PMID: 19237463 PMCID: PMC2661829 DOI: 10.1261/rna.1429009] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2008] [Accepted: 01/10/2009] [Indexed: 05/20/2023]
Abstract
RNA pseudoknots play a critical role in RNA-related biology from the assembly of ribosome to the regulation of viral gene expression. A predictive model for pseudoknot structure and stability is essential for understanding and designing RNA structure and function. A previous statistical mechanical theory allows us to treat canonical H-type RNA pseudoknots that contain no intervening loop between the helices (see S. Cao and S.J. Chen [2006] in Nucleic Acids Research, Vol. 34; pp. 2634-2652). Biologically significant RNA pseudoknots often contain interhelix loops. Predicting the structure and stability for such more-general pseudoknots remains an unsolved problem. In the present study, we develop a predictive model for pseudoknots with interhelix loops. The model gives conformational entropy, stability, and the free-energy landscape from RNA sequences. The main features of this new model are the computation of the conformational entropy and folding free-energy base on the complete conformational ensemble and rigorous treatment for the excluded volume effects. Extensive tests for the structural predictions show overall good accuracy with average sensitivity and specificity equal to 0.91 and 0.91, respectively. The theory developed here may be a solid starting point for first-principles modeling of more complex, larger RNAs.
Collapse
Affiliation(s)
- Song Cao
- Department of Physics, University of Missouri, Columbia, 65211, USA
| | | |
Collapse
|
17
|
Kleesiek J, Torda AE. RNA secondary structure prediction using a self-consistent mean field approach. J Comput Chem 2009; 31:1135-42. [DOI: 10.1002/jcc.21398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
18
|
Chen X, He SM, Bu D, Zhang F, Wang Z, Chen R, Gao W. FlexStem: improving predictions of RNA secondary structures with pseudoknots by reducing the search space. ACTA ACUST UNITED AC 2008; 24:1994-2001. [PMID: 18586700 DOI: 10.1093/bioinformatics/btn327] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION RNA secondary structures with pseudoknots are often predicted by minimizing free energy, which is proved to be NP-hard. Due to kinetic reasons the real RNA secondary structure often has local instead of global minimum free energy. This implies that we may improve the performance of RNA secondary structure prediction by taking kinetics into account and minimize free energy in a local area. RESULT we propose a novel algorithm named FlexStem to predict RNA secondary structures with pseudoknots. Still based on MFE criterion, FlexStem adopts comprehensive energy models that allow complex pseudoknots. Unlike classical thermodynamic methods, our approach aims to simulate the RNA folding process by successive addition of maximal stems, reducing the search space while maintaining or even improving the prediction accuracy. This reduced space is constructed by our maximal stem strategy and stem-adding rule induced from elaborate statistical experiments on real RNA secondary structures. The strategy and the rule also reflect the folding characteristic of RNA from a new angle and help compensate for the deficiency of merely relying on MFE in RNA structure prediction. We validate FlexStem by applying it to tRNAs, 5SrRNAs and a large number of pseudoknotted structures and compare it with the well-known algorithms such as RNAfold, PKNOTS, PknotsRG, HotKnots and ILM according to their overall sensitivities and specificities, as well as positive and negative controls on pseudoknots. The results show that FlexStem significantly increases the prediction accuracy through its local search strategy. AVAILABILITY Software is available at http://pfind.ict.ac.cn/FlexStem/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiang Chen
- Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.
| | | | | | | | | | | | | |
Collapse
|
19
|
Sperschneider J, Datta A. KnotSeeker: heuristic pseudoknot detection in long RNA sequences. RNA (NEW YORK, N.Y.) 2008; 14:630-640. [PMID: 18314500 PMCID: PMC2271355 DOI: 10.1261/rna.968808] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2007] [Accepted: 01/11/2008] [Indexed: 05/26/2023]
Abstract
Pseudoknots are folded structures in RNA molecules that perform essential functions as part of cellular transcription machinery and regulatory processes. The prediction of these structures in RNA molecules has important implications in antiviral drug design. It has been shown that the prediction of pseudoknots is an NP-complete problem. Practical structure prediction algorithms based on free energy minimization employ a restricted problem class and dynamic programming. However, these algorithms are computationally very expensive, and their accuracy deteriorates if the input sequence containing the pseudoknot is too long. Heuristic methods can be more efficient, but do not guarantee an optimal solution in regards to the minimum free energy model. We present KnotSeeker, a new heuristic algorithm for the detection of pseudoknots in RNA sequences as a preliminary step for structure prediction. Our method uses a hybrid sequence matching and free energy minimization approach to perform a screening of the primary sequence. We select short sequence fragments as possible candidates that may contain pseudoknots and verify them by using an existing dynamic programming algorithm and a minimum weight independent set calculation. KnotSeeker is significantly more accurate in detecting pseudoknots compared to other common methods as reported in the literature. It is very efficient and therefore a practical tool, especially for long sequences. The algorithm has been implemented in Python and it also uses C/C++ code from several other known techniques. The code is available from http://www.csse.uwa.edu.au/~datta/pseudoknot.
Collapse
Affiliation(s)
- Jana Sperschneider
- School of Computer Science and Software Engineering, University of Western Australia, Perth, WA 6009, Australia.
| | | |
Collapse
|
20
|
Smit S, Rother K, Heringa J, Knight R. From knotted to nested RNA structures: a variety of computational methods for pseudoknot removal. RNA (NEW YORK, N.Y.) 2008; 14:410-6. [PMID: 18230758 PMCID: PMC2248259 DOI: 10.1261/rna.881308] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Pseudoknots are abundant in RNA structures. Many computational analyses require pseudoknot-free structures, which means that some of the base pairs in the knotted structure must be disregarded to obtain a nested structure. There is a surprising diversity of methods to perform this pseudoknot removal task, but these methods are often poorly described and studies can therefore be difficult to reproduce (in part, because different procedures may be intuitively obvious to different investigators). Here we provide a variety of algorithms for pseudoknot removal, some of which can incorporate sequence or alignment information in the removal process. We demonstrate that different methods lead to different results, which might affect structure-based analyses. This work thus provides a starting point for discussion of the extent to which these different methods recapture the underlying biological reality. We provide access to reference implementations through a web interface (at http://www.ibi.vu.nl/programs/k2nwww), and the source code is available in the PyCogent project.
Collapse
Affiliation(s)
- Sandra Smit
- Centre for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| | | | | | | |
Collapse
|
21
|
Abstract
Understanding how RNA folds and what causes it to unfold has become more important as knowledge of the diverse functions of RNA has increased. Here we review the contributions of single-molecule experiments to providing answers to questions such as: How much energy is required to unfold a secondary or tertiary structure? How fast is the process? How do helicases unwind double helices? Are the unwinding activities of RNA-dependent RNA polymerases and of ribosomes different from other helicases? We discuss the use of optical tweezers to monitor the unfolding activities of helicases, polymerases, and ribosomes, and to apply force to unfold RNAs directly. We also review the applications of fluorescence and fluorescence resonance energy transfer to measure RNA dynamics.
Collapse
Affiliation(s)
- Pan T X Li
- Department of Biological Sciences, University at Albany, State University of New York, Albany, NY 12222, USA.
| | | | | |
Collapse
|