1
|
Long R, Guo Z, Han D, Liu B, Yuan X, Chen G, Heng PA, Zhang L. siRNADiscovery: a graph neural network for siRNA efficacy prediction via deep RNA sequence analysis. Brief Bioinform 2024; 25:bbae563. [PMID: 39503523 PMCID: PMC11539000 DOI: 10.1093/bib/bbae563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Revised: 09/28/2024] [Accepted: 10/21/2024] [Indexed: 11/08/2024] Open
Abstract
The clinical adoption of small interfering RNAs (siRNAs) has prompted the development of various computational strategies for siRNA design, from traditional data analysis to advanced machine learning techniques. However, previous studies have inadequately considered the full complexity of the siRNA silencing mechanism, neglecting critical elements such as siRNA positioning on mRNA, RNA base-pairing probabilities, and RNA-AGO2 interactions, thereby limiting the insight and accuracy of existing models. Here, we introduce siRNADiscovery, a Graph Neural Network (GNN) framework that leverages both non-empirical and empirical rule-based features of siRNA and mRNA to effectively capture the complex dynamics of gene silencing. On multiple internal datasets, siRNADiscovery achieves state-of-the-art performance. Significantly, siRNADiscovery also outperforms existing methodologies in in vitro studies and on an externally validated dataset. Additionally, we develop a new data-splitting methodology that addresses the data leakage issue, a frequently overlooked problem in previous studies, ensuring the robustness and stability of our model under various experimental settings. Through rigorous testing, siRNADiscovery has demonstrated remarkable predictive accuracy and robustness, making significant contributions to the field of gene silencing. Furthermore, our approach to redefining data-splitting standards aims to set new benchmarks for future research in the domain of predictive biological modeling for siRNA.
Collapse
Affiliation(s)
- Rongzhuo Long
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, 211198, Nanjing, China
| | - Ziyu Guo
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Central Ave, Hong Kong SAR, China
| | - Da Han
- Institute of Molecular Medicine (IMM) and Department of Laboratory Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, 200240, Shanghai, China
- Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, 310022, Hangzhou, Zhejiang, China
| | - Boxiang Liu
- Department of Pharmacy, Faculty of Science, National University of Singapore, Singapore, 117543, Singapore
| | - Xudong Yuan
- ACON Pharmaceuticals, 2557 Route 130 S, Ste 3, Cranbury, NJ 08512, USA
| | | | - Pheng-Ann Heng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Central Ave, Hong Kong SAR, China
| | - Liang Zhang
- Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, 310022, Hangzhou, Zhejiang, China
| |
Collapse
|
2
|
Liu T, Huang J, Luo D, Ren L, Ning L, Huang J, Lin H, Zhang Y. Cm-siRPred: Predicting chemically modified siRNA efficiency based on multi-view learning strategy. Int J Biol Macromol 2024; 264:130638. [PMID: 38460652 DOI: 10.1016/j.ijbiomac.2024.130638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 12/27/2023] [Accepted: 03/03/2024] [Indexed: 03/11/2024]
Abstract
The rational modification of siRNA molecules is crucial for ensuring their drug-like properties. Machine learning-based prediction of chemically modified siRNA (cm-siRNA) efficiency can significantly optimize the design process of siRNA chemical modifications, saving time and cost in siRNA drug development. However, existing in-silico methods suffer from limitations such as small datasets, inadequate data representation capabilities, and lack of interpretability. Therefore, in this study, we developed the Cm-siRPred algorithm based on a multi-view learning strategy. The algorithm employs a multi-view strategy to represent the double-strand sequences, chemical modifications, and physicochemical properties of cm-siRNA. It incorporates a cross-attention model to globally correlate different representation vectors and a two-layer CNN module to learn local correlation features. The algorithm demonstrates exceptional performance in cross-validation experiments, independent dataset, and case studies on approved siRNA drugs, and showcasing its robustness and generalization ability. In addition, we developed a user-friendly webserver that enables efficient prediction of cm-siRNA efficiency and assists in the design of siRNA drug chemical modifications. In summary, Cm-siRPred is a practical tool that offers valuable technical support for siRNA chemical modification and drug efficiency research, while effectively assisting in the development of novel small nucleic acid drugs. Cm-siRPred is freely available at https://cellknowledge.com.cn/sirnapredictor/.
Collapse
Affiliation(s)
- Tianyuan Liu
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Junyang Huang
- School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Delun Luo
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China; Chengdu Jingrunze Gene Technology Company Limited, Chengdu 611138, China
| | - Liping Ren
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Lin Ning
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.
| |
Collapse
|
3
|
Dong X, Zheng W. Cheminformatics Modeling of Gene Silencing for Both Natural and Chemically Modified siRNAs. Molecules 2022; 27:6412. [PMID: 36234948 PMCID: PMC9570765 DOI: 10.3390/molecules27196412] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 09/23/2022] [Accepted: 09/25/2022] [Indexed: 11/17/2022] Open
Abstract
In designing effective siRNAs for a specific mRNA target, it is critically important to have predictive models for the potency of siRNAs. None of the published methods characterized the chemical structures of individual nucleotides constituting a siRNA molecule; therefore, they cannot predict the potency of gene silencing by chemically modified siRNAs (cm-siRNA). We propose a new approach that can predict the potency of gene silencing by cm-siRNAs, which characterizes each nucleotide (NT) using 12 BCUT cheminformatics descriptors describing its charge distribution, hydrophobic and polar properties. Thus, a 21-NT siRNA molecule is described by 252 descriptors resulting from concatenating all the BCUT values of its composing nucleotides. Partial Least Square is employed to develop statistical models. The Huesken data (2431 natural siRNA molecules) were used to perform model building and evaluation for natural siRNAs. Our results were comparable with or superior to those from Huesken's algorithm. The Bramsen dataset (48 cm-siRNAs) was used to build and test the models for cm-siRNAs. The predictive r2 of the resulting models reached 0.65 (or Pearson r values of 0.82). Thus, this new method can be used to successfully model gene silencing potency by both natural and chemically modified siRNA molecules.
Collapse
Affiliation(s)
| | - Weifan Zheng
- BRITE Institute and Department of Pharmaceutical Sciences, College of Health and Sciences (CHAS), North Carolina Central University, Durham, NC 27707, USA
| |
Collapse
|
4
|
Abstract
Small interfering RNA (siRNA) is a clinically approved therapeutic modality, which has attracted widespread attention not only from basic research but also from pharmaceutical industry. As siRNA can theoretically modulate any disease-related gene's expression, plenty of siRNA therapeutic pipelines have been established by tens of biotechnology companies. The drug performance of siRNA heavily depends on the sequence, the chemical modification, and the delivery of siRNA. Here, we describe the rational design protocol of siRNA, and provide some modification patterns that can enhance siRNA's stability and reduce its off-target effect. Also, the delivery method based on N-acetylgalactosamine (GalNAc)-siRNA conjugate that is widely employed to develop therapeutic regimens for liver-related diseases is also recapitulated.
Collapse
Affiliation(s)
- Mei Lu
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China
| | - Mengjie Zhang
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China
| | - Bo Hu
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China
| | - Yuanyu Huang
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China.
| |
Collapse
|
5
|
He B, Huang J, Chen H. PVsiRNAPred: Prediction of plant exclusive virus-derived small interfering RNAs by deep convolutional neural network. J Bioinform Comput Biol 2020; 17:1950039. [PMID: 32019412 DOI: 10.1142/s0219720019500392] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Plant exclusive virus-derived small interfering RNAs (vsiRNAs) regulate various biological processes, especially important in antiviral immunity. The identification of plant vsiRNAs is important for understanding the biogenesis and function mechanisms of vsiRNAs and further developing anti-viral plants. In this study, we extracted plant vsiRNA sequences from the PVsiRNAdb database. We then utilized deep convolutional neural network (CNN) to develop a deep learning algorithm for predicting plant vsiRNAs based on vsiRNA sequence composition, known as PVsiRNAPred. The key part of PVsiRNAPred is the CNN module, which automatically learns hierarchical representations of vsiRNA sequences related to vsiRNA profiles in plants. When evaluated using an independent testing dataset, the accuracy of the model was 65.70%, which was higher than those of five conventional machine learning method-based classifiers. In addition, PVsiRNAPred obtained a sensitivity of 67.11%, specificity of 64.26% and Matthews correlation coefficient (MCC) of 0.31, and the area under the receiver operating characteristic (ROC) curve (AUC) of PVsiRNAPred was 0.71 in the independent test. The permutation test with 1000 shuffles resulted in a p value of<0.001. The above results reveal that PVsiRNAPred has favorable generalization capabilities. We hope PVsiRNAPred, the first bioinformatics algorithm for predicting plant vsiRNAs, will allow efficient discovery of new vsiRNAs.
Collapse
Affiliation(s)
- Bifang He
- Medical College, Guizhou University, Jiaxiu Road, Huaxi Zone, Guiyang 550025, P. R. China.,Center for Informational Biology, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu 611731, P. R. China
| | - Jian Huang
- Center for Informational Biology, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu 611731, P. R. China
| | - Heng Chen
- Medical College, Guizhou University, Jiaxiu Road, Huaxi Zone, Guiyang 550025, P. R. China
| |
Collapse
|
6
|
Gatta AK, Hariharapura RC, Udupa N, Reddy MS, Josyula VR. Strategies for improving the specificity of siRNAs for enhanced therapeutic potential. Expert Opin Drug Discov 2018; 13:709-725. [PMID: 29902093 DOI: 10.1080/17460441.2018.1480607] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
INTRODUCTION RNA interference has become a tool of choice in the development of drugs in various therapeutic areas of Post Transcriptional Gene Silencing (PTGS). The critical element in developing successful RNAi therapeutics lies in designing small interfering RNA (siRNA) using an efficient algorithm satisfying the designing criteria. Further, translation of siRNA from bench-side to bedside needs an efficient delivery system and/or chemical modification. Areas covered: This review emphasizes the importance of dicer, the criteria for efficient siRNA design, the currently available algorithms and strategies to overcome off-target effects, immune stimulatory effects and endosomal trap. Expert opinion: Specificity and stability are the primary concerns for siRNA therapeutics. The design criteria and algorithms should be chosen rationally to have a siRNA sequence that binds to the corresponding mRNA as it happens in the Watson and Crick base pairing. However, it must evade a few more hurdles (Endocytosis, Serum stability etc.) to be functional in the cytosol.
Collapse
Affiliation(s)
- Aditya Kiran Gatta
- a Cell and Molecular Biology lab, Department of Pharmaceutical Biotechnology , Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education , Manipal , Karnataka , India
| | - Raghu Chandrashekhar Hariharapura
- a Cell and Molecular Biology lab, Department of Pharmaceutical Biotechnology , Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education , Manipal , Karnataka , India
| | - Nayanabhirama Udupa
- b Research Directorate of Health Sciences , Manipal Academy of Higher Education , Manipal , Karnataka , India
| | - Meka Sreenivasa Reddy
- c Department of Pharmaceutics, Manipal College of Pharmaceutical Sciences , Manipal Academy of Higher Education , Manipal , Karnataka , India
| | - Venkata Rao Josyula
- a Cell and Molecular Biology lab, Department of Pharmaceutical Biotechnology , Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education , Manipal , Karnataka , India
| |
Collapse
|
7
|
ASPsiRNA: A Resource of ASP-siRNAs Having Therapeutic Potential for Human Genetic Disorders and Algorithm for Prediction of Their Inhibitory Efficacy. G3-GENES GENOMES GENETICS 2017; 7:2931-2943. [PMID: 28696921 PMCID: PMC5592921 DOI: 10.1534/g3.117.044024] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Allele-specific siRNAs (ASP-siRNAs) have emerged as promising therapeutic molecules owing to their selectivity to inhibit the mutant allele or associated single-nucleotide polymorphisms (SNPs) sparing the expression of the wild-type counterpart. Thus, a dedicated bioinformatics platform encompassing updated ASP-siRNAs and an algorithm for the prediction of their inhibitory efficacy will be helpful in tackling currently intractable genetic disorders. In the present study, we have developed the ASPsiRNA resource (http://crdd.osdd.net/servers/aspsirna/) covering three components viz (i) ASPsiDb, (ii) ASPsiPred, and (iii) analysis tools like ASP-siOffTar. ASPsiDb is a manually curated database harboring 4543 (including 422 chemically modified) ASP-siRNAs targeting 78 unique genes involved in 51 different diseases. It furnishes comprehensive information from experimental studies on ASP-siRNAs along with multidimensional genetic and clinical information for numerous mutations. ASPsiPred is a two-layered algorithm to predict efficacy of ASP-siRNAs for fully complementary mutant (Effmut) and wild-type allele (Effwild) with one mismatch by ASPsiPredSVM and ASPsiPredmatrix, respectively. In ASPsiPredSVM, 922 unique ASP-siRNAs with experimentally validated quantitative Effmut were used. During 10-fold cross-validation (10nCV) employing various sequence features on the training/testing dataset (T737), the best predictive model achieved a maximum Pearson’s correlation coefficient (PCC) of 0.71. Further, the accuracy of the classifier to predict Effmut against novel genes was assessed by leave one target out cross-validation approach (LOTOCV). ASPsiPredmatrix was constructed from rule-based studies describing the effect of single siRNA:mRNA mismatches on the efficacy at 19 different locations of siRNA. Thus, ASPsiRNA encompasses the first database, prediction algorithm, and off-target analysis tool that is expected to accelerate research in the field of RNAi-based therapeutics for human genetic diseases.
Collapse
|
8
|
Utilizing Selected Di- and Trinucleotides of siRNA to Predict RNAi Activity. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2017; 2017:5043984. [PMID: 28243313 PMCID: PMC5294759 DOI: 10.1155/2017/5043984] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Accepted: 12/15/2016] [Indexed: 02/04/2023]
Abstract
Small interfering RNAs (siRNAs) induce posttranscriptional gene silencing in various organisms. siRNAs targeted to different positions of the same gene show different effectiveness; hence, predicting siRNA activity is a crucial step. In this paper, we developed and evaluated a powerful tool named “siRNApred” with a new mixed feature set to predict siRNA activity. To improve the prediction accuracy, we proposed 2-3NTs as our new features. A Random Forest siRNA activity prediction model was constructed using the feature set selected by our proposed Binary Search Feature Selection (BSFS) algorithm. Experimental data demonstrated that the binding site of the Argonaute protein correlates with siRNA activity. “siRNApred” is effective for selecting active siRNAs, and the prediction results demonstrate that our method can outperform other current siRNA activity prediction methods in terms of prediction accuracy.
Collapse
|
9
|
Abstract
Post-transcriptional gene silencing is a widely used method to suppress gene expression. Unfortunately only a portion of siRNAs do successfully reduce gene expression. Target mRNA secondary structures and siRNA-mRNA thermodynamic features are believed to contribute to the silencing activity. However, there is still an open discussion as to what determines siRNA efficacy. In this retrospective study, we analysed the target accessibility comparing very high (VH) compared with low (L) efficacy siRNA sequences obtained from the siRecords Database. We determined the contribution of mRNA target local secondary structures on silencing efficacy. Both the univariable and the multivariable logistic regression evidenced no relationship between siRNA efficacy and mRNA target secondary structures. Moreover, none of the thermodynamic and sequence-base parameters taken into consideration (H-b index, ΔG°overall, ΔG°duplex, ΔG°break-target and GC%) was associated with siRNA efficacy. We found that features believed to be predictive of silencing efficacy are not confirmed to be so when externally evaluated in a large heterogeneous sample. Although it was proposed that silencing efficacy could be influenced by local target accessibility we show that this could be not generalizable because of the diversity of experimental setting that may not be representative of biological systems especially in view of the many local protein factors, usually not taken into consideration, which could hamper the silencing process. We analysed several siRNA-mRNA target features involved in silencing efficacy. We found out that features believed to be predictive of silencing efficacy are not such when transferred to a larger dataset of experiments and different experimental settings.
Collapse
|
10
|
Thang BN, Ho TB, Kanda T. A semi-supervised tensor regression model for siRNA efficacy prediction. BMC Bioinformatics 2015; 16:80. [PMID: 25888201 PMCID: PMC4379720 DOI: 10.1186/s12859-015-0495-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Accepted: 02/10/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Short interfering RNAs (siRNAs) can knockdown target genes and thus have an immense impact on biology and pharmacy research. The key question of which siRNAs have high knockdown ability in siRNA research remains challenging as current known results are still far from expectation. RESULTS This work aims to develop a generic framework to enhance siRNA knockdown efficacy prediction. The key idea is first to enrich siRNA sequences by incorporating them with rules found for designing effective siRNAs and representing them as enriched matrices, then to employ the bilinear tensor regression to predict knockdown efficacy of those matrices. Experiments show that the proposed method achieves better results than existing models in most cases. CONCLUSIONS Our model not only provides a suitable siRNA representation but also can predict siRNA efficacy more accurate and stable than most of state-of-the-art models. Source codes are freely available on the web at: http://www.jaist.ac.jp/\~bao/BiLTR/ .
Collapse
Affiliation(s)
- Bui Ngoc Thang
- School of Knowledge Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa, Japan.
- University of Engineering and Technology, Vietnam National University Hanoi, 144 Xuan Thuy, Cau Giay, Hanoi, Vietnam.
| | - Tu Bao Ho
- School of Knowledge Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa, Japan.
- John von Neumann Institute, Vietnam National University Ho at Chi Minh City, Quarter 6, Linh Trung Ward, Thu Duc District, Ho Chi Minh, Vietnam.
| | - Tatsuo Kanda
- Graduate School of Medicine, Chiba University, 1-8-1 Inohahan, Chuo-ku, Chiba, Japan.
| |
Collapse
|
11
|
Abstract
RNA interference mediated by small interfering RNAs is a powerful tool for investigation of gene functions and is increasingly used as a therapeutic agent. However, not all siRNAs are equally potent, and although simple rules for the selection of good siRNAs were proposed early on, siRNAs are still plagued with widely fluctuating efficiency. Recently, new design tools incorporating both the structural features of the targeted RNAs and the sequence features of the siRNAs substantially improved the efficacy of siRNAs. In this chapter we will present a review of sequence and structure-based algorithms behind them.
Collapse
Affiliation(s)
- Hakim Tafer
- Institut fur Informatik, Universitat Leipzig, Leipzig, Germany
| |
Collapse
|
12
|
Aragão FJ, Nogueira EO, Tinoco MLP, Faria JC. Molecular characterization of the first commercial transgenic common bean immune to the Bean golden mosaic virus. J Biotechnol 2013; 166:42-50. [DOI: 10.1016/j.jbiotec.2013.04.009] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Revised: 04/18/2013] [Accepted: 04/22/2013] [Indexed: 11/29/2022]
|
13
|
Das S, Ghosal S, Kozak K, Chakrabarti J. An siRNA designing tool with a unique functional off-target filtering approach. J Biomol Struct Dyn 2012; 31:1343-57. [PMID: 23140209 DOI: 10.1080/07391102.2012.736758] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Investigations have revealed that silencing unwanted transcripts or off-targeting can induce false positive phenotype during RNA interference (RNAi)-based gene function study. But still the standard computational approaches towards small interfering RNA (siRNA) off-target minimization fall short in terms of addressing this false positive phenotype issue. Some of these off-targets may interfere with the biochemical pathway being investigated. It may also inadvertently target cell's metabolic pathways with unquantifiable consequences on the processes of user's interest. Here, we report the development of a siRNA selection tool that, for the first time, implements a functional off-target filtering that aims to minimize false positive phenotypes arising from inadvertent targets that are functionally similar or related to the direct target gene, along with a multi-parametric classifier (support vector machine) for optimized selection of potent siRNAs. The functional off-target filtering minimizes the number of off-target genes which are functionally related to the direct target gene, i.e. involved in a common biological process and may have similar phenotype. A text-mining algorithm is used to find related biological processes associated with the direct target and each off-target transcript by comparison of the biological processes associated with these genes. It also gives the user a choice to select one or more off-targets that may be potentially more harmful, from a predicted off-target gene list to be filtered out. Testing with huge set of biologically validated siRNAs from three different sources showed consistent good performance of our tool in terms of effective siRNA selection. It outperformed four potent siRNA selection algorithms of present day in terms of specificity in the selection of highly efficient siRNAs when compared on a common test set. A genome wide testing with potent siRNAs used in high-content screening confirmed validation of 2767 designed siRNAs in terms of phenotypic output. This tool presently supports siRNA designs for human genes and is freely available at http://gyanxet-beta.com .
Collapse
Affiliation(s)
- Shaoli Das
- a Indian Association for the Cultivation of Science , Kolkata , West Bengal , 700032 , India
| | | | | | | |
Collapse
|
14
|
Liu Q, Zhou H, Cui J, Cao Z, Xu Y. Reconsideration of in-silico siRNA design based on feature selection: a cross-platform data integration perspective. PLoS One 2012; 7:e37879. [PMID: 22655076 PMCID: PMC3360065 DOI: 10.1371/journal.pone.0037879] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2012] [Accepted: 04/25/2012] [Indexed: 01/24/2023] Open
Abstract
RNA interference via exogenous short interference RNAs (siRNA) is increasingly more widely employed as a tool in gene function studies, drug target discovery and disease treatment. Currently there is a strong need for rational siRNA design to achieve more reliable and specific gene silencing; and to keep up with the increasing needs for a wider range of applications. While progress has been made in the ability to design siRNAs with specific targets, we are clearly at an infancy stage towards achieving rational design of siRNAs with high efficacy. Among the many obstacles to overcome, lack of general understanding of what sequence features of siRNAs may affect their silencing efficacy and of large-scale homogeneous data needed to carry out such association analyses represents two challenges. To address these issues, we investigated a feature-selection based in-silico siRNA design from a novel cross-platform data integration perspective. An integration analysis of 4,482 siRNAs from ten meta-datasets was conducted for ranking siRNA features, according to their possible importance to the silencing efficacy of siRNAs across heterogeneous data sources. Our ranking analysis revealed for the first time the most relevant features based on cross-platform experiments, which compares favorably with the traditional in-silico siRNA feature screening based on the small samples of individual platform data. We believe that our feature ranking analysis can offer more creditable suggestions to help improving the design of siRNA with specific silencing targets. Data and scripts are available at http://csbl.bmb.uga.edu/publications/materials/qiliu/siRNA.html.
Collapse
Affiliation(s)
- Qi Liu
- Department of Bioinformatics, Tongji University, Shanghai, China
| | - Han Zhou
- Department of Bioinformatics, Tongji University, Shanghai, China
| | - Juan Cui
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, and Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
| | - Zhiwei Cao
- Department of Bioinformatics, Tongji University, Shanghai, China
- * E-mail: (ZC); (YX)
| | - Ying Xu
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, and Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
- College of Computer Science and Technology, Jilin University, Changchun, China
- * E-mail: (ZC); (YX)
| |
Collapse
|
15
|
DEsi: A design engine of siRNA that integrates SVMs prediction and feature filters. BIOCATALYSIS AND AGRICULTURAL BIOTECHNOLOGY 2012. [DOI: 10.1016/j.bcab.2012.01.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
16
|
Thakur N, Qureshi A, Kumar M. VIRsiRNAdb: a curated database of experimentally validated viral siRNA/shRNA. Nucleic Acids Res 2012; 40:D230-6. [PMID: 22139916 PMCID: PMC3245049 DOI: 10.1093/nar/gkr1147] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Revised: 10/04/2011] [Accepted: 11/09/2011] [Indexed: 12/22/2022] Open
Abstract
RNAi technology has been emerging as a potential modality to inhibit viruses during past decade. In literature a few siRNA databases have been reported that focus on targeting human and mammalian genes but experimentally validated viral siRNA databases are lacking. We have developed VIRsiRNAdb, a manually curated database having comprehensive details of 1358 siRNA/shRNA targeting viral genome regions. Further, wherever available, information regarding alternative efficacies of above 300 siRNAs derived from different assays has also been incorporated. Important fields included in the database are siRNA sequence, virus subtype, target genome region, cell type, target object, experimental assay, efficacy, off-target and siRNA matching with reference viral sequences. Database also provides the users with facilities of advance search, browsing, data submission, linking to external databases and useful siRNA analysis tools especially siTarAlign which align the siRNA with reference viral genomes or user defined sequences. VIRsiRNAdb contains extensive details of siRNA/shRNA targeting 42 important human viruses including influenza virus, hepatitis B virus, HPV and SARS Corona virus. VIRsiRNAdb would prove useful for researchers in picking up the best viral siRNA for antiviral therapeutics development and also for developing better viral siRNA design tools. The database is freely available at http://crdd.osdd.net/servers/virsirnadb.
Collapse
Affiliation(s)
| | | | - Manoj Kumar
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh-160036, India
| |
Collapse
|
17
|
ElHefnawi M, Hassan N, Kamar M, Siam R, Remoli AL, El-Azab I, AlAidy O, Marsili G, Sgarbanti M. The design of optimal therapeutic small interfering RNA molecules targeting diverse strains of influenza A virus. Bioinformatics 2011; 27:3364-70. [DOI: 10.1093/bioinformatics/btr555] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
18
|
mRNA turnover rate limits siRNA and microRNA efficacy. Mol Syst Biol 2011; 6:433. [PMID: 21081925 PMCID: PMC3010119 DOI: 10.1038/msb.2010.89] [Citation(s) in RCA: 84] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2010] [Accepted: 10/07/2010] [Indexed: 12/22/2022] Open
Abstract
Based on a simple model of the mRNA life cycle, we predict that mRNAs with high turnover rates in the cell are more difficult to perturb with RNAi. We test this hypothesis using a luciferase reporter system and obtain additional evidence from a variety of large-scale data sets, including microRNA overexpression experiments and RT–qPCR-based efficacy measurements for thousands of siRNAs. Our results suggest that mRNA half-lives will influence how mRNAs are differentially perturbed whenever small RNA levels change in the cell, not only after transfection but also during differentiation, pathogenesis and normal cell physiology.
What determines how strongly an mRNA responds to a microRNA or an siRNA? We know that properties of the sequence match between the small RNA and the mRNA are crucial. However, large-scale validations of siRNA efficacies have shown that certain transcripts remain recalcitrant to perturbation even after repeated redesign of the siRNA (Krueger et al, 2007). Weak response to RNAi may thus be an inherent property of the mRNA, but the underlying factors have proven difficult to uncover. siRNAs induce degradation by sequence-specific cleavage of their target mRNAs (Elbashir et al, 2001). MicroRNAs, too, induce mRNA degradation, and ∼80% of their effect on protein levels can be explained by changes in transcript abundance (Hendrickson et al, 2009; Guo et al, 2010). Given that multiple factors act simultaneously to degrade individual mRNAs, we here consider whether variable responses to micro/siRNA regulation may, in part, be explained simply by the basic dynamics of mRNA turnover. If a transcript is already under strong destabilizing regulation, it is theoretically possible that the relative change in abundance after the addition of a novel degrading factor would be less pronounced compared with a stable transcript (Figure 1). mRNA turnover is achieved by a multitude of factors, and the influence of such factors on targetability can be explored. However, their combined action, including yet unknown factors, is summarized into a single property: the mRNA decay rate. First, we explored the theoretical relationship between the pre-existing turnover rate of an mRNA, and its expected susceptibility to perturbation by a small RNA. We assumed a basic model of the mRNA life cycle, in which the rate of transcription is constant and the rate of degradation is described by first-order kinetics. Under this model, the relative change in steady-state expression level will become smaller as the pre-existing decay rate grows larger, independent of the transcription rate. This relationship persists also if we assume various degrees of synergy and antagonism between the pre-existing factors and the external factor, with increasing synergism leading to transcripts being more equally targetable, regardless of their pre-existing decay rate. We next generated a series of four luciferase reporter constructs with destabilizing AU-rich elements (AREs) of various strengths incorporated into their 3′ UTRs. To evaluate how the different constructs would respond to perturbation, we performed co-transfections with an siRNA targeted at the coding region of the luciferase gene. This reduced the signal of the non-destabilized construct to 26% compared with a control siRNA. In contrast, the most destabilized construct showed 42% remaining reporter activity, and we could observe a dose–response relationship across the series. The reporter experiment encouraged an investigation of this effect on real-world mRNAs. We analyzed a set of 2622 siRNAs, for which individual efficacies were determined using RT–qPCR 48 h post-transfection in HeLa cells (www.appliedbiosystems.com). Of these, 1778 could be associated with an experimentally determined decay rate (Figure 4A). Although the overall correlation between the two variables was modest (Spearman's rank correlation rs=0.22, P<1e−20), we found that siRNAs directed at high-turnover (t1/2<200 min) and medium-turnover (200<t1/2<1000 min) mRNAs caused significantly less repression than those targeting long-lived (t1/2>1000 min) transcripts (P<8e−11 and 4e−9, respectively, two-tailed KS-test, Figure 4B). While 41.6% (498/1196) of the siRNAs directed at low-turnover transcripts reached 10% remaining expression or better, only 16.7% (31/186) of the siRNAs that targeted high-turnover mRNAs reached this high degree of silencing (Figure 4B). Reduced targetability (25.2%, 100/396) was also seen for transcripts with medium-turnover rate. Our results based on siRNA data suggested that turnover rates could also influence microRNA targeting. By assembling genome-wide mRNA expression data from 20 published microRNA transfections in HeLa cells, we found that predicted target mRNAs with short and medium half-life were significantly less repressed after transfection than their long-lived counterparts (P<8e−5 and P<0.03, respectively, two-tailed KS-test). Specifically, 10.2% (293/2874) of long-lived targets versus 4.4% (41/942) of short-lived targets were strongly (z-score <−3) repressed. siRNAs are known to cause off-target effects that are mediated, in part, by microRNA-like seed complementarity (Jackson et al, 2006). We analyzed changes in transcript levels after transfection of seven different siRNAs, each with a unique seed region (Jackson et al, 2006). Putative ‘off-targets' were identified by mapping of non-conserved seed matches in 3′ UTRs. We found that low-turnover mRNAs (t1/2 >1000 min) were more affected by seed-mediated off-target silencing than high-turnover mRNAs (t1/2 <200 min), with twice as many long-lived seed-containing transcripts (3.8 versus 1.9%) being strongly (z-score <−3) repressed. In summary, mRNA turnover rates have an important influence on the changes exerted by small RNAs on mRNA levels. It can be assumed that mRNA half-lives will influence how mRNAs are differentially perturbed whenever small RNA levels change in the cell, not only after transfection but also during differentiation, pathogenesis and normal cell physiology. The microRNA pathway participates in basic cellular processes and its discovery has enabled the development of si/shRNAs as powerful investigational tools and potential therapeutics. Based on a simple kinetic model of the mRNA life cycle, we hypothesized that mRNAs with high turnover rates may be more resistant to RNAi-mediated silencing. The results of a simple reporter experiment strongly supported this hypothesis. We followed this with a genome-wide scale analysis of a rich corpus of experiments, including RT–qPCR validation data for thousands of siRNAs, siRNA/microRNA overexpression data and mRNA stability data. We find that short-lived transcripts are less affected by microRNA overexpression, suggesting that microRNA target prediction would be improved if mRNA turnover rates were considered. Similarly, short-lived transcripts are more difficult to silence using siRNAs, and our results may explain why certain transcripts are inherently recalcitrant to perturbation by small RNAs.
Collapse
|
19
|
Ebalunode JO, Jagun C, Zheng W. Informatics approach to the rational design of siRNA libraries. Methods Mol Biol 2011; 672:341-58. [PMID: 20838976 DOI: 10.1007/978-1-60761-839-3_14] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
This chapter surveys the literature for state-of-the-art methods for the rational design of siRNA libraries. It identifies and presents major milestones in the field of computational modeling of siRNA's gene silencing efficacy. Commonly used features of siRNAs are summarized along with major machine learning techniques employed to build the predictive models. It has also outlined several web-enabled siRNA design tools. To face the challenge of modeling and rational design of chemically modified siRNAs, it also proposes a new cheminformatics approach for the representation and characterization of siRNA molecules. Some preliminary results with this new approach are presented to demonstrate the promising potential of this method for the modeling of siRNA's efficacy. Together with novel delivery technologies and chemical modification techniques, rational siRNA design algorithms will ultimately contribute to chemical biology research and the efficient development of siRNA therapeutics.
Collapse
Affiliation(s)
- Jerry O Ebalunode
- Department of Pharmaceutical Sciences, BRITE Institute, North Carolina Central University, Durham, NC, USA
| | | | | |
Collapse
|
20
|
Xu H, Schaniel C, Lemischka IR, Ma’ayan A. Toward a complete in silico, multi-layered embryonic stem cell regulatory network. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2010; 2:708-33. [PMID: 20890967 PMCID: PMC2951283 DOI: 10.1002/wsbm.93] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Recent efforts in systematically profiling embryonic stem (ES) cells have yielded a wealth of high-throughput data. Complementarily, emerging databases and computational tools facilitate ES cell studies and further pave the way toward the in silico reconstruction of regulatory networks encompassing multiple molecular layers. Here, we briefly survey databases, algorithms, and software tools used to organize and analyze high-throughput experimental data collected to study mammalian cellular systems with a focus on ES cells. The vision of using heterogeneous data to reconstruct a complete multi-layered ES cell regulatory network is discussed. This review also provides an accompanying manually extracted dataset of different types of regulatory interactions from low-throughput experimental ES cell studies available at http://amp.pharm.mssm.edu/iscmid/literature.
Collapse
Affiliation(s)
- Huilei Xu
- Department of Gene and Cell Medicine and The Black Family Stem Cell Institute, Mount Sinai School of Medicine, New York, NY 10029, USA
- Department of Pharmacology and System Therapeutics and Systems Biology Center New York (SBCNY), Mount Sinai School of Medicine, New York, NY 10029, USA
| | - Christoph Schaniel
- Department of Gene and Cell Medicine and The Black Family Stem Cell Institute, Mount Sinai School of Medicine, New York, NY 10029, USA
| | - Ihor R. Lemischka
- Department of Gene and Cell Medicine and The Black Family Stem Cell Institute, Mount Sinai School of Medicine, New York, NY 10029, USA
| | - Avi Ma’ayan
- Department of Pharmacology and System Therapeutics and Systems Biology Center New York (SBCNY), Mount Sinai School of Medicine, New York, NY 10029, USA
| |
Collapse
|
21
|
Liu Q, Xu Q, Zheng VW, Xue H, Cao Z, Yang Q. Multi-task learning for cross-platform siRNA efficacy prediction: an in-silico study. BMC Bioinformatics 2010; 11:181. [PMID: 20380733 PMCID: PMC2873531 DOI: 10.1186/1471-2105-11-181] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2009] [Accepted: 04/10/2010] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Gene silencing using exogenous small interfering RNAs (siRNAs) is now a widespread molecular tool for gene functional study and new-drug target identification. The key mechanism in this technique is to design efficient siRNAs that incorporated into the RNA-induced silencing complexes (RISC) to bind and interact with the mRNA targets to repress their translations to proteins. Although considerable progress has been made in the computational analysis of siRNA binding efficacy, few joint analysis of different RNAi experiments conducted under different experimental scenarios has been done in research so far, while the joint analysis is an important issue in cross-platform siRNA efficacy prediction. A collective analysis of RNAi mechanisms for different datasets and experimental conditions can often provide new clues on the design of potent siRNAs. RESULTS An elegant multi-task learning paradigm for cross-platform siRNA efficacy prediction is proposed. Experimental studies were performed on a large dataset of siRNA sequences which encompass several RNAi experiments recently conducted by different research groups. By using our multi-task learning method, the synergy among different experiments is exploited and an efficient multi-task predictor for siRNA efficacy prediction is obtained. The 19 most popular biological features for siRNA according to their jointly importance in multi-task learning were ranked. Furthermore, the hypothesis is validated out that the siRNA binding efficacy on different messenger RNAs(mRNAs) have different conditional distribution, thus the multi-task learning can be conducted by viewing tasks at an "mRNA"-level rather than at the "experiment"-level. Such distribution diversity derived from siRNAs bound to different mRNAs help indicate that the properties of target mRNA have important implications on the siRNA binding efficacy. CONCLUSIONS The knowledge gained from our study provides useful insights on how to analyze various cross-platform RNAi data for uncovering of their complex mechanism.
Collapse
Affiliation(s)
- Qi Liu
- College of Life Science and Biotechnology, Tongji University, China
- Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong
| | - Qian Xu
- Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong
| | - Vincent W Zheng
- Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong
| | - Hong Xue
- Department of Biochemistry, Hong Kong University of Science and Technology, Hong Kong
| | - Zhiwei Cao
- College of Life Science and Biotechnology, Tongji University, China
- Shanghai Center for Bioinformation Technology, China
| | - Qiang Yang
- Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong
| |
Collapse
|
22
|
Abstract
RNA interference, mediated by small interfering RNAs (siRNAs), is a powerful tool for investigation of gene functions and it is increasingly being used as a therapeutic agent. However, not all siRNAs are equally potent - although simple rules for the selection of good siRNAs were proposed early on, siRNAs are still plagued with widely fluctuating efficiency. Recently, new design tools that incorporate both the structural features of the targeted RNAs and the sequence features of the siRNAs have substantially improved the efficacy of siRNAs. In this chapter, we present the algorithms behind these accessibility-aided tools and show how to design efficient siRNAs with their help.
Collapse
Affiliation(s)
- Ivo L Hofacker
- Institute for Theoretical Chemistry, University Vienna, Vienna, Austria
| | | |
Collapse
|
23
|
Ng KL, Liu HC, Lee SC. ncRNAppi--a tool for identifying disease-related miRNA and siRNA targeting pathways. Bioinformatics 2009; 25:3199-201. [DOI: 10.1093/bioinformatics/btp574] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
|
24
|
Hajeri PB, Singh SK. siRNAs: their potential as therapeutic agents--Part I. Designing of siRNAs. Drug Discov Today 2009; 14:851-8. [PMID: 19540928 DOI: 10.1016/j.drudis.2009.06.001] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2009] [Accepted: 06/08/2009] [Indexed: 12/25/2022]
Abstract
RNA interference (RNAi) is a novel and essential biological process, as well as a powerful experimental tool with the potential to be used in therapeutic development. RNAi-based strategies have the capability of being able to be driven from bench to bedside. It is very important to develop the precise tools for designing the siRNAs to get the most efficient knockdown of the target genes and to reduce any off-target effects. In this review we have discussed the strategies and parameters required for effective siRNA designing and synthesis, based on already published literature.
Collapse
Affiliation(s)
- Praveensingh B Hajeri
- Section of Infectious Diseases & Immunobiology, Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad-500007, AP, India
| | | |
Collapse
|
25
|
Secondary RNA Structure and its Role in RNA Interference to Silence the Respiratory Syncytial Virus Fusion Protein Gene. Mol Biotechnol 2009; 43:200-11. [DOI: 10.1007/s12033-009-9190-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2009] [Accepted: 05/16/2009] [Indexed: 12/25/2022]
|
26
|
Gong W, Ren Y, Zhou H, Wang Y, Kang S, Li T. siDRM: an effective and generally applicable online siRNA design tool. ACTA ACUST UNITED AC 2008; 24:2405-6. [PMID: 18718944 DOI: 10.1093/bioinformatics/btn442] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Small interfering RNAs (siRNAs) have become an indispensable tool for the investigation of gene functions. Most existing siRNA design tools were trained on datasets assembled from confined origins, incompatible with the diverse siRNA laboratory practice to which these tools will ultimately be applied. We have performed an updated analysis using the disjunctive rule merging (DRM) approach on a large and diverse dataset compiled from siRecords, and implemented the resulting rule sets in siDRM, a new online siRNA design tool. siDRM also implements a few high-sensitivity rule sets and fast rule sets, links to siRecords, and uses several filters to check unwanted detrimental effects, including innate immune responses, cell toxic effects and off-target activities in selecting siRNAs. A performance comparison using an independent dataset indicated that siDRM outperforms 19 existing siRNA design tools in identifying effective siRNAs.
Collapse
Affiliation(s)
- Wuming Gong
- Department of Neuroscience, University of Minnesota, Minneapolis, MN 55455, USA
| | | | | | | | | | | |
Collapse
|
27
|
Vareková RS, Bradác I, Plchút M, Skrdla M, Wacenovsky M, Mahr H, Mayer G, Tanner H, Brugger H, Withalm J, Lederer P, Huber H, Gierlinger G, Graf R, Tafer H, Hofacker I, Schuster P, Polcík M. www.rnaworkbench.com: A new program for analyzing RNA interference. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2008; 90:89-94. [PMID: 18207283 DOI: 10.1016/j.cmpb.2007.12.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2007] [Revised: 10/30/2007] [Accepted: 12/01/2007] [Indexed: 05/25/2023]
Abstract
RNA interference (RNAi) has become an important tool to study and utilize gene silencing by introducing short interfering RNA (siRNA). In order to predict the most efficient siRNAs, a new software tool, RNA Workbench (RNAWB), has been designed and is freely available (after registration) on http://www.rnaworkbench.com. In addition to the standard selection rules, RNAWB includes the possibility of statistical analyses of the applied selection rules (criteria). The role of RNA secondary structures in the RNA interference process as well as the application of sequence rules are discussed to show the applicability of the software.
Collapse
|
28
|
Ichihara M, Murakumo Y, Masuda A, Matsuura T, Asai N, Jijiwa M, Ishida M, Shinmi J, Yatsuya H, Qiao S, Takahashi M, Ohno K. Thermodynamic instability of siRNA duplex is a prerequisite for dependable prediction of siRNA activities. Nucleic Acids Res 2007; 35:e123. [PMID: 17884914 PMCID: PMC2094068 DOI: 10.1093/nar/gkm699] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
We developed a simple algorithm, i-Score (inhibitory-Score), to predict active siRNAs by applying a linear regression model to 2431 siRNAs. Our algorithm is exclusively comprised of nucleotide (nt) preferences at each position, and no other parameters are taken into account. Using a validation dataset comprised of 419 siRNAs, we found that the prediction accuracy of i-Score is as good as those of s-Biopredsi, ThermoComposition21 and DSIR, which employ a neural network model or more parameters in a linear regression model. Reynolds and Katoh also predict active siRNAs efficiently, but the numbers of siRNAs predicted to be active are less than one-eighth of that of i-Score. We additionally found that exclusion of thermostable siRNAs, whose whole stacking energy (ΔG) is less than −34.6 kcal/mol, improves the prediction accuracy in i-Score, s-Biopredsi, ThermoComposition21 and DSIR. We also developed a universal target vector, pSELL, with which we can assay an siRNA activity of any sequence in either the sense or antisense direction. We assayed 86 siRNAs in HEK293 cells using pSELL, and validated applicability of i-Score and the whole ΔG value in designing siRNAs.
Collapse
Affiliation(s)
- Masatoshi Ichihara
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Yoshiki Murakumo
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Akio Masuda
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Toru Matsuura
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Naoya Asai
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Mayumi Jijiwa
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Maki Ishida
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Jun Shinmi
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Hiroshi Yatsuya
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Shanlou Qiao
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Masahide Takahashi
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Kinji Ohno
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
- *To whom correspondence should be addressed. +81 52 744 2446+81 52 744 2449
| |
Collapse
|
29
|
sIR: siRNA Information Resource, a web-based tool for siRNA sequence design and analysis and an open access siRNA database. BMC Bioinformatics 2007; 8:178. [PMID: 17540034 PMCID: PMC1896181 DOI: 10.1186/1471-2105-8-178] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2006] [Accepted: 05/31/2007] [Indexed: 11/23/2022] Open
Abstract
Background RNA interference has revolutionized our ability to study the effects of altering the expression of single genes in mammalian (and other) cells through targeted knockdown of gene expression. In this report we describe a web-based computational tool, siRNA Information Resource (sIR), which consists of a new open source database that contains validation information about published siRNA sequences and also provides a user-friendly interface to design and analyze siRNA sequences against a chosen target sequence. Results The siRNA design tool described in this paper employs empirically determined rules derived from a meta-analysis of the published data; it uses a weighted scoring system that determines the optimal sequence within a target mRNA and thus aids in the rational selection of siRNA sequences. This scoring system shows a non-linear correlation with the knockdown efficiency of siRNAs. sIR provides a fast, customized BLAST output for all selected siRNA sequences against a variety of databases so that the user can verify the uniqueness of the design. We have pre-designed siRNAs for all the known human genes (24,502) in the Refseq database. These siRNAs were pre-BLASTed against the human Unigene database to estimate the target specificity and all results are available online. Conclusion Although most of the rules for this scoring system were influenced by previously published rules, the weighted scoring system provides better flexibility in designing an appropriate siRNA when compared to the un-weighted scoring system. sIR is not only a comprehensive tool used to design siRNA sequences and lookup pre-designed siRNAs, but it is also a platform where researchers can share information on siRNA design and use.
Collapse
|
30
|
Bradác I, Svobodová Vareková R, Wacenovsky M, Skrdla M, Plchút M, Polcík M. siRNA selection criteria--statistical analyses of applicability and significance. Biochem Biophys Res Commun 2007; 359:83-7. [PMID: 17524355 DOI: 10.1016/j.bbrc.2007.05.056] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2007] [Accepted: 05/09/2007] [Indexed: 12/13/2022]
Abstract
RNA interference is a powerful tool for gene silencing, which is mediated by introducing siRNA. In the present study, statistical analyses of published siRNA selection criteria, the interpretation of some criteria and systematic searching for new criteria have been carried out for CGB siRNA and siRecords databases. The results of the analyses are as follows: (i) Our study supports the two-state model of the RNA-induced silencing complex (RISC). (ii) Stable 5'-S ends of a siRNA sequence, higher stability of the whole siRNA, and low breaking energy of siRNA duplex occurs in effective siRNA sequences. Also low internal stability of the 5'-AS terminus is preferred. (iii) Secondary structure can be successfully used as an RNAi selection criterion. (iv) Several published sequence criteria have been confirmed and also new criteria have been developed. (v) Also a Target Patterns criterion, which is comparable or better than the best known criteria, has been created.
Collapse
Affiliation(s)
- Ivan Bradác
- Bioinformatics group, ANF DATA (subsidiary of Siemens), Herspická 5, 639 00 Brno, Czech Republic
| | | | | | | | | | | |
Collapse
|
31
|
Vert JP, Foveau N, Lajaunie C, Vandenbrouck Y. An accurate and interpretable model for siRNA efficacy prediction. BMC Bioinformatics 2006; 7:520. [PMID: 17137497 PMCID: PMC1698581 DOI: 10.1186/1471-2105-7-520] [Citation(s) in RCA: 216] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2006] [Accepted: 11/30/2006] [Indexed: 12/21/2022] Open
Abstract
Background The use of exogenous small interfering RNAs (siRNAs) for gene silencing has quickly become a widespread molecular tool providing a powerful means for gene functional study and new drug target identification. Although considerable progress has been made recently in understanding how the RNAi pathway mediates gene silencing, the design of potent siRNAs remains challenging. Results We propose a simple linear model combining basic features of siRNA sequences for siRNA efficacy prediction. Trained and tested on a large dataset of siRNA sequences made recently available, it performs as well as more complex state-of-the-art models in terms of potency prediction accuracy, with the advantage of being directly interpretable. The analysis of this linear model allows us to detect and quantify the effect of nucleotide preferences at particular positions, including previously known and new observations. We also detect and quantify a strong propensity of potent siRNAs to contain short asymmetric motifs in their sequence, and show that, surprisingly, these motifs alone contain at least as much relevant information for potency prediction as the nucleotide preferences for particular positions. Conclusion The model proposed for prediction of siRNA potency is as accurate as a state-of-the-art nonlinear model and is easily interpretable in terms of biological features. It is freely available on the web at
Collapse
Affiliation(s)
- Jean-Philippe Vert
- Centre for Computational Biology, Ecole des Mines de Paris, 35 rue Saint-Honoré, 77300 Fontainebleau, France
| | - Nicolas Foveau
- Laboratoire de Biologie, Informatique, Mathématiques, Département Réponse et Dynamique Cellulaire, CEA Grenoble, 17 rue des Martyrs, 38054 Grenoble, France
| | - Christian Lajaunie
- Centre for Computational Biology, Ecole des Mines de Paris, 35 rue Saint-Honoré, 77300 Fontainebleau, France
| | - Yves Vandenbrouck
- Laboratoire de Biologie, Informatique, Mathématiques, Département Réponse et Dynamique Cellulaire, CEA Grenoble, 17 rue des Martyrs, 38054 Grenoble, France
| |
Collapse
|
32
|
Integrated siRNA design based on surveying of features associated with high RNAi effectiveness. BMC Bioinformatics 2006; 7:516. [PMID: 17129386 PMCID: PMC1698580 DOI: 10.1186/1471-2105-7-516] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2006] [Accepted: 11/27/2006] [Indexed: 12/17/2022] Open
Abstract
Background Short interfering RNAs have allowed the development of clean and easily regulated methods for disruption of gene expression. However, while these methods continue to grow in popularity, designing effective siRNA experiments can be challenging. The various existing siRNA design guidelines suffer from two problems: they differ considerably from each other, and they produce high levels of false-positive predictions when tested on data of independent origins. Results Using a distinctly large set of siRNA efficacy data assembled from a vast diversity of origins (the siRecords data, containing records of 3,277 siRNA experiments targeting 1,518 genes, derived from 1,417 independent studies), we conducted extensive analyses of all known features that have been implicated in increasing RNAi effectiveness. A number of features having positive impacts on siRNA efficacy were identified. By performing quantitative analyses on cooperative effects among these features, then applying a disjunctive rule merging (DRM) algorithm, we developed a bundle of siRNA design rule sets with the false positive problem well curbed. A comparison with 15 online siRNA design tools indicated that some of the rule sets we developed surpassed all of these design tools commonly used in siRNA design practice in positive predictive values (PPVs). Conclusion The availability of the large and diverse siRNA dataset from siRecords and the approach we describe in this report have allowed the development of highly effective and generally applicable siRNA design rule sets. Together with ever improving RNAi lab techniques, these design rule sets are expected to make siRNAs a more useful tool for molecular genetics, functional genomics, and drug discovery studies.
Collapse
|
33
|
Holen T. Efficient prediction of siRNAs with siRNArules 1.0: an open-source JAVA approach to siRNA algorithms. RNA (NEW YORK, N.Y.) 2006; 12:1620-5. [PMID: 16870995 PMCID: PMC1557693 DOI: 10.1261/rna.81006] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
RNAi interference and siRNA have become useful tools for investigation of gene function. However, the discovery that not all siRNA are equally efficient made necessary screens or design algorithms to obtain high activity siRNA candidates. Several algorithms have been published, but they remain inefficient, obscure, or commercially restricted. This article describes an open-source JAVA program that is surprisingly efficient at predicting active siRNAs (Pearson correlation coefficient r = 0.52, n = 526 siRNAs). Furthermore, this version 1.0 sets the stage for further improvement of the free code by the open-source community (http://sourceforge.net/).
Collapse
Affiliation(s)
- Torgeir Holen
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo, Oslo, Norway.
| |
Collapse
|
34
|
Abstract
Small interfering RNAs (siRNAs) have been widely exploited for sequence-specific gene knockdown, predominantly to investigate gene function in cultured vertebrate cells, and also hold promise as therapeutic agents. Because not all siRNAs that are cognate to a given target mRNA are equally effective, computational tools have been developed based on experimental data to increase the likelihood of selecting effective siRNAs. Furthermore, because target-complementary siRNAs can also target other mRNAs containing sequence segments that are partially complementary to the siRNA, most computational tools include ways to reduce potential off-target effects in the siRNA selection process. Though these methods facilitate selection of functional siRNAs, they do not yet alleviate the need for experimental validation. This perspective provides a practical guide based on current wisdom for selecting siRNAs.
Collapse
Affiliation(s)
- Yi Pei
- Howard Hughes Medical Institute, Laboratory of RNA Molecular Biology, The Rockefeller University, 1230 York Avenue, Box 186, New York, New York 10021, USA
| | | |
Collapse
|