1
|
Bai Y, Zhong H, Wang T, Lu ZJ. OligoFormer: an accurate and robust prediction method for siRNA design. Bioinformatics 2024; 40:btae577. [PMID: 39321261 PMCID: PMC11494384 DOI: 10.1093/bioinformatics/btae577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 08/14/2024] [Accepted: 09/23/2024] [Indexed: 09/27/2024] Open
Abstract
MOTIVATION RNA interference (RNAi) has become a widely used experimental approach for post-transcriptional regulation and is increasingly showing its potential as future targeted drugs. However, the prediction of highly efficient siRNAs (small interfering RNAs) is still hindered by dataset biases, the inadequacy of prediction methods, and the presence of off-target effects. To overcome these limitations, we propose an accurate and robust prediction method, OligoFormer, for siRNA design. RESULTS OligoFormer comprises three different modules including thermodynamic calculation, RNA-FM module, and Oligo encoder. Oligo encoder is the core module based on the transformer encoder. Taking siRNA and mRNA sequences as input, OligoFormer can obtain thermodynamic parameters, RNA-FM embedding, and Oligo embedding through these three modules, respectively. We carefully benchmarked OligoFormer against six comparable methods on siRNA efficacy datasets. OligoFormer outperforms all the other methods, with an average improvement of 9% in AUC, 6.6% in PRC, 9.8% in F1 score, and 5.1% in PCC compared to the best method among them in our inter-dataset validation. We also provide a comprehensive pipeline with prediction of siRNA efficacy and off-target effects using PITA score and TargetScan score. The ablation study shows RNA-FM module and thermodynamic parameters improved the performance and accelerated convergence of OligoFormer. The saliency maps by gradient backpropagation and base preference maps show certain base preferences in initial and terminal region of siRNAs. AVAILABILITY AND IMPLEMENTATION The source code of OligoFormer is freely available on GitHub at: https://github.com/lulab/OligoFormer. Docker image of OligoFormer is freely available on the docker hub at https://hub.docker.com/r/yilanbai/oligoformer.
Collapse
Affiliation(s)
- Yilan Bai
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
| | - Haochen Zhong
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
| | - Taiwei Wang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
- Academy for Advanced Interdisciplinary Studies (AAIS), and Peking University–Tsinghua University–National Institute of Biological Sciences Joint Graduate Program (PTN), Peking University, Beijing, 100871, China
| | - Zhi John Lu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
2
|
Chavez-Pena C. RNAi-Mediated Silencing in the Insect Cell-Baculovirus Expression System. Methods Mol Biol 2024; 2829:91-107. [PMID: 38951329 DOI: 10.1007/978-1-0716-3961-0_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/03/2024]
Abstract
RNA interference (RNAi) serves as an indispensable tool for gene function studies and has been substantiated through extensive research for its practical applications in the baculovirus expression vector system (BEVS). This chapter expands the RNAi toolkit in insect cell culture by including small interfering RNA (siRNA) in the protocol, in addition to the conventional use of double-stranded RNA (dsRNA). This chapter also brings attention to key design and reporting considerations, based on Minimum Information About an RNAi Experiment (MIARE) guidelines. Recommendations regarding online tools for dsRNA and siRNA design are provided, along with guidance on choosing suitable methods for measuring silencing outcomes.
Collapse
|
3
|
Dong X, Zheng W. Cheminformatics Modeling of Gene Silencing for Both Natural and Chemically Modified siRNAs. Molecules 2022; 27:6412. [PMID: 36234948 PMCID: PMC9570765 DOI: 10.3390/molecules27196412] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 09/23/2022] [Accepted: 09/25/2022] [Indexed: 11/17/2022] Open
Abstract
In designing effective siRNAs for a specific mRNA target, it is critically important to have predictive models for the potency of siRNAs. None of the published methods characterized the chemical structures of individual nucleotides constituting a siRNA molecule; therefore, they cannot predict the potency of gene silencing by chemically modified siRNAs (cm-siRNA). We propose a new approach that can predict the potency of gene silencing by cm-siRNAs, which characterizes each nucleotide (NT) using 12 BCUT cheminformatics descriptors describing its charge distribution, hydrophobic and polar properties. Thus, a 21-NT siRNA molecule is described by 252 descriptors resulting from concatenating all the BCUT values of its composing nucleotides. Partial Least Square is employed to develop statistical models. The Huesken data (2431 natural siRNA molecules) were used to perform model building and evaluation for natural siRNAs. Our results were comparable with or superior to those from Huesken's algorithm. The Bramsen dataset (48 cm-siRNAs) was used to build and test the models for cm-siRNAs. The predictive r2 of the resulting models reached 0.65 (or Pearson r values of 0.82). Thus, this new method can be used to successfully model gene silencing potency by both natural and chemically modified siRNA molecules.
Collapse
Affiliation(s)
| | - Weifan Zheng
- BRITE Institute and Department of Pharmaceutical Sciences, College of Health and Sciences (CHAS), North Carolina Central University, Durham, NC 27707, USA
| |
Collapse
|
4
|
Abstract
Small interfering RNA (siRNA) is a clinically approved therapeutic modality, which has attracted widespread attention not only from basic research but also from pharmaceutical industry. As siRNA can theoretically modulate any disease-related gene's expression, plenty of siRNA therapeutic pipelines have been established by tens of biotechnology companies. The drug performance of siRNA heavily depends on the sequence, the chemical modification, and the delivery of siRNA. Here, we describe the rational design protocol of siRNA, and provide some modification patterns that can enhance siRNA's stability and reduce its off-target effect. Also, the delivery method based on N-acetylgalactosamine (GalNAc)-siRNA conjugate that is widely employed to develop therapeutic regimens for liver-related diseases is also recapitulated.
Collapse
Affiliation(s)
- Mei Lu
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China
| | - Mengjie Zhang
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China
| | - Bo Hu
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China
| | - Yuanyu Huang
- School of Life Science, Advanced Research Institute of Multidisciplinary Science, and Institute of Engineering Medicine, Key Laboratory of Molecular Medicine and Biotherapy, Beijing Institute of Technology, Beijing, China.
| |
Collapse
|
5
|
Wu C, Li Q, Xing R, Fan GL. Using the Chou’s Pseudo Component to Predict the ncRNA Locations Based on the Improved K-Nearest Neighbor (iKNN) Classifier. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191003142406] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
The non-coding RNA identification at the organelle genome level is a
challenging task. In our previous work, an ncRNA dataset with less than 80% sequence identity
was built, and a method incorporating an increment of diversity combining with support vector
machine method was proposed.
Objective:
Based on the ncRNA_361 dataset, a novel decision-making method-an improved
KNN (iKNN) classifier was proposed.
Methods:
In this paper, based on the iKNN algorithm, the physicochemical features of nucleotides,
the degeneracy of genetic codons, and topological secondary structure were selected to represent
the effective ncRNA characters. Then, the incremental feature selection method was utilized to optimize
the feature set.
Results:
The results of iKNN indicated that the decision-making method of mean value is distinctly
superior to the traditional decision-making method of majority vote the Increment of Diversity
Combining Support Vector Machine (ID-SVM). The iKNN algorithm achieved an overall accuracy
of 97.368% in the jackknife test, when k=3.
Conclusion:
It should be noted that the triplets of the structure-sequence mode under reading
frames not only contains the entire sequence information but also reflects whether the base was
paired or not, and the secondary structural topological parameters further describe the ncRNA secondary
structure on the spatial level. The ncRNA dataset and the iKNN classifier are freely available
at http://202.207.14.87:8032/fuwu/iKNN/index.asp.
Collapse
Affiliation(s)
- Chengyan Wu
- Baotou Teacher’s College, Inner Mongolia University of Science and Technology, Baotou, China
| | - Qianzhong Li
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| | - Ru Xing
- Baotou Teacher’s College, Inner Mongolia University of Science and Technology, Baotou, China
| | - Guo-Liang Fan
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| |
Collapse
|
6
|
Fischer B, Ly TD, Schmidt V, Hendig D, Kuhn J, Knabbe C, Faust I. Xylosyltransferase-deficient human HEK293 cells show a strongly reduced proliferation capacity and viability. Biochem Biophys Res Commun 2020; 521:507-513. [PMID: 31677793 DOI: 10.1016/j.bbrc.2019.10.148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 10/22/2019] [Indexed: 01/08/2023]
Abstract
Human xylosyltransferases-I and -II (XT-I and XT-II) catalyze the initial and rate-limiting step in proteoglycan (PG)-biosynthesis. Because PG are major components of the extracellular matrix (ECM), an alternated XT expression is associated with the manifestation of ECM-related diseases. While Drosophila melanogaster and Caenorhabditis elegans only harbor one XT-isoform, all higher organisms contain two isoforms, which are expressed in a tissue-specific manner. The reason for the appearance of two isoenzymes remains unexplained and remarkable, as all other enzymes involved in the synthesis of the tetrasaccharid linker, which connects the PG core protein with attached glycosaminoglycans, only show one isoform. In human, mutations in the XYLT genes cause diseases affecting the homeostasis of the ECM, such as skeletal dysplasias. We investigated for the first time whether already XT-I-deficient human embryonic kidney (HEK293) cells can compensate for decreased expression levels of both XT-isoforms. A siRNA-mediated XYLT2 mRNA knockdown led to reduced cellular proliferation rates and a partially increased cellular senescence of treated HEK293 cells. These results were verified by conducting a stable CRISPR/Cas9-mediated XYLT2 knockout, which revealed that only cells expressing at least partially functional XT-II proteins remain proliferative. Our study, therefore, shows for the first time that cells lacking both XT-isoforms are not viable and clearly indicates the importance of the XT concerning the cellular metabolism.
Collapse
Affiliation(s)
- Bastian Fischer
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany.
| | - Thanh-Diep Ly
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany
| | - Vanessa Schmidt
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany
| | - Doris Hendig
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany
| | - Joachim Kuhn
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany
| | - Cornelius Knabbe
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany
| | - Isabel Faust
- Institut für Laboratoriums- und Transfusionsmedizin, Herz- und Diabeteszentrum Nordrhein-Westfalen, Universitätsklinik der Ruhr-Universität Bochum, Georgstrasse 11, 32545, Bad Oeynhausen, Germany
| |
Collapse
|
7
|
Abstract
Small silencing RNAs have provided powerful reverse genetics tools and have opened new areas of research. This introduction describes the use of RNAi to suppress expression of individual genes for loss-of-function analysis. It also summarizes methods for measuring specific and global changes in small RNA expression, as well as methods to inhibit the function of individual endogenous small RNA species such as miRNAs.
Collapse
|
8
|
Kwon OS, Kwon SJ, Kim JS, Lee G, Maeng HJ, Lee J, Hwang GS, Cha HJ, Chun KH. Designing Tyrosinase siRNAs by Multiple Prediction Algorithms and Evaluation of Their Anti-Melanogenic Effects. Biomol Ther (Seoul) 2018; 26:282-289. [PMID: 29223142 PMCID: PMC5933895 DOI: 10.4062/biomolther.2017.115] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 08/03/2017] [Accepted: 08/07/2017] [Indexed: 11/05/2022] Open
Abstract
Melanin is a pigment produced from tyrosine in melanocytes. Although melanin has a protective role against UVB radiation-induced damage, it is also associated with the development of melanoma and darker skin tone. Tyrosinase is a key enzyme in melanin synthesis, which regulates the rate-limiting step during conversion of tyrosine into DOPA and dopaquinone. To develop effective RNA interference therapeutics, we designed a melanin siRNA pool by applying multiple prediction programs to reduce human tyrosinase levels. First, 272 siRNAs passed the target accessibility evaluation using the RNAxs program. Then we selected 34 siRNA sequences with ΔG ≥-34.6 kcal/mol, i-Score value ≥65, and siRNA scales score ≤30. siRNAs were designed as 19-bp RNA duplexes with an asymmetric 3' overhang at the 3' end of the antisense strand. We tested if these siRNAs effectively reduced tyrosinase gene expression using qRT-PCR and found that 17 siRNA sequences were more effective than commercially available siRNA. Three siRNAs further tested showed an effective visual color change in MNT-1 human cells without cytotoxic effects, indicating these sequences are anti-melanogenic. Our study revealed that human tyrosinase siRNAs could be efficiently designed using multiple prediction algorithms.
Collapse
Affiliation(s)
- Ok-Seon Kwon
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Soo-Jung Kwon
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Jin Sang Kim
- Leaders Cosmetics Co., Ltd., Anseong 17599, Republic of Korea
| | - Gunbong Lee
- Leaders Cosmetics Co., Ltd., Anseong 17599, Republic of Korea
| | - Han-Joo Maeng
- Gachon Institute of Pharmaceutical Sciences, College of Pharmacy, Gachon University, Incheon 21936, Republic of Korea
| | - Jeongmi Lee
- School of Pharmacy, Sungkyunkwan University, Suwon 16419, Republic of Korea
| | - Gwi Seo Hwang
- Laboratory of Cell Differentiation Research, College of Korean Medicine, Gachon University, Seongnam 13120, Republic of Korea
| | - Hyuk-Jin Cha
- Department of Life Sciences, Sogang University, Seoul 04107, Republic of Korea
| | - Kwang-Hoon Chun
- Gachon Institute of Pharmaceutical Sciences, College of Pharmacy, Gachon University, Incheon 21936, Republic of Korea
| |
Collapse
|
9
|
ElHefnawi M, Kim T, Kamar MA, Min S, Hassan NM, El-Ahwany E, Kim H, Zada S, Amer M, Windisch MP. In Silico Design and Experimental Validation of siRNAs Targeting Conserved Regions of Multiple Hepatitis C Virus Genotypes. PLoS One 2016; 11:e0159211. [PMID: 27441640 PMCID: PMC4956106 DOI: 10.1371/journal.pone.0159211] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2016] [Accepted: 06/28/2016] [Indexed: 12/16/2022] Open
Abstract
RNA interference (RNAi) is a post-transcriptional gene silencing mechanism that mediates the sequence-specific degradation of targeted RNA and thus provides a tremendous opportunity for development of oligonucleotide-based drugs. Here, we report on the design and validation of small interfering RNAs (siRNAs) targeting highly conserved regions of the hepatitis C virus (HCV) genome. To aim for therapeutic applications by optimizing the RNAi efficacy and reducing potential side effects, we considered different factors such as target RNA variations, thermodynamics and accessibility of the siRNA and target RNA, and off-target effects. This aim was achieved using an in silico design and selection protocol complemented by an automated MysiRNA-Designer pipeline. The protocol included the design and filtration of siRNAs targeting highly conserved and accessible regions within the HCV internal ribosome entry site, and adjacent core sequences of the viral genome with high-ranking efficacy scores. Off-target analysis excluded siRNAs with potential binding to human mRNAs. Under this strict selection process, two siRNAs (HCV353 and HCV258) were selected based on their predicted high specificity and potency. These siRNAs were tested for antiviral efficacy in HCV genotype 1 and 2 replicon cell lines. Both in silico-designed siRNAs efficiently inhibited HCV RNA replication, even at low concentrations and for short exposure times (24h); they also exceeded the antiviral potencies of reference siRNAs targeting HCV. Furthermore, HCV353 and HCV258 siRNAs also inhibited replication of patient-derived HCV genotype 4 isolates in infected Huh-7 cells. Prolonged treatment of HCV replicon cells with HCV353 did not result in the appearance of escape mutant viruses. Taken together, these results reveal the accuracy and strength of our integrated siRNA design and selection protocols. These protocols could be used to design highly potent and specific RNAi-based therapeutic oligonucleotide interventions.
Collapse
Affiliation(s)
- Mahmoud ElHefnawi
- Informatics and Systems Department, Biomedical Informatics and Chemo-Informatics Group, Centre of Excellence for Advanced Sciences (CEAS), Division of Engineering Research, National Research Centre, Cairo, Egypt
- Centre for Informatics, Nile University, Shiekh Zayed City, Egypt
- Yousef-Jameel Science and Technology Research Centre, American University in Cairo, New Cairo, Egypt
- * E-mail: (MEH); (MPW)
| | - TaeKyu Kim
- Hepatitis Research Laboratory, Institut Pasteur Korea, 696 Sampyung-dong, Bundang-gu, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Mona A. Kamar
- Yousef-Jameel Science and Technology Research Centre, American University in Cairo, New Cairo, Egypt
| | - Saehong Min
- Hepatitis Research Laboratory, Institut Pasteur Korea, 696 Sampyung-dong, Bundang-gu, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Nafisa M. Hassan
- Yousef-Jameel Science and Technology Research Centre, American University in Cairo, New Cairo, Egypt
| | - Eman El-Ahwany
- Biology Department, American University in Cairo, New Cairo, Egypt
- Immunology Department, Theodor Bilharz Research Institute, Giza, Egypt
| | - Heeyoung Kim
- Hepatitis Research Laboratory, Institut Pasteur Korea, 696 Sampyung-dong, Bundang-gu, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Suher Zada
- Yousef-Jameel Science and Technology Research Centre, American University in Cairo, New Cairo, Egypt
- Biology Department, American University in Cairo, New Cairo, Egypt
| | - Marwa Amer
- Biology Department, American University in Cairo, New Cairo, Egypt
- Faculty of Biotechnology, Misr University for Science and Technology, 6 of October City, Egypt
| | - Marc P. Windisch
- Hepatitis Research Laboratory, Institut Pasteur Korea, 696 Sampyung-dong, Bundang-gu, Seongnam-si, Gyeonggi-do, Republic of Korea
- * E-mail: (MEH); (MPW)
| |
Collapse
|
10
|
Effect of siRNA pre-Exposure on Subsequent Response to siRNA Therapy. Pharm Res 2015; 32:3813-26. [DOI: 10.1007/s11095-015-1741-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 06/10/2015] [Indexed: 12/13/2022]
|
11
|
Murali R, John PG, Peter S D. Soft computing model for optimized siRNA design by identifying off target possibilities using artificial neural network model. Gene 2015; 562:152-8. [PMID: 25725126 DOI: 10.1016/j.gene.2015.02.067] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Revised: 01/25/2015] [Accepted: 02/22/2015] [Indexed: 01/22/2023]
Abstract
The ability of small interfering RNA (siRNA) to do posttranscriptional gene regulation by knocking down targeted genes is an important research topic in functional genomics, biomedical research and in cancer therapeutics. Many tools had been developed to design exogenous siRNA with high experimental inhibition. Even though considerable amount of work has been done in designing exogenous siRNA, design of effective siRNA sequences is still a challenging work because the target mRNAs must be selected such that their corresponding siRNAs are likely to be efficient against that target and unlikely to accidentally silence other transcripts due to sequence similarity. In some cases, siRNAs may tolerate mismatches with the target mRNA, but knockdown of genes other than the intended target could make serious consequences. Hence to design siRNAs, two important concepts must be considered: the ability in knocking down target genes and the off target possibility on any nontarget genes. So before doing gene silencing by siRNAs, it is essential to analyze their off target effects in addition to their inhibition efficacy against a particular target. Only a few methods have been developed by considering both efficacy and off target possibility of siRNA against a gene. In this paper we present a new design of neural network model with whole stacking energy (ΔG) that enables to identify the efficacy and off target effect of siRNAs against target genes. The tool lists all siRNAs against a particular target with their inhibition efficacy and number of matches or sequence similarity with other genes in the database. We could achieve an excellent performance of Pearson Correlation Coefficient (R=0. 74) and Area Under Curve (AUC=0.906) when the threshold of whole stacking energy is ≥-34.6 kcal/mol. To the best of the author's knowledge, this is one of the best score while considering the "combined efficacy and off target possibility" of siRNA for silencing a gene. The proposed model shall be useful for designing exogenous siRNA for therapeutic applications and gene silencing techniques in the area of bioinformatics. The software is developed as a desktop application and available at http://opsid.in/opsid/.
Collapse
Affiliation(s)
- Reena Murali
- Department of Computer Science and Engineering, Rajiv Gandhi Institute of Technology, Kerala, India.
| | - Philips George John
- Department of Computer Science and Engineering, Rajiv Gandhi Institute of Technology, Kerala, India
| | - David Peter S
- Department of Computer Science and Engineering, Cochin University of Science & Technology, Kerala, India
| |
Collapse
|
12
|
Malefyt AP, Wu M, Vocelle DB, Kappes SJ, Lindeman SD, Chan C, Walton SP. Improved asymmetry prediction for short interfering RNAs. FEBS J 2014; 281:320-30. [PMID: 24393396 DOI: 10.1111/febs.12599] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Revised: 08/28/2013] [Accepted: 09/26/2013] [Indexed: 01/10/2023]
Abstract
In the development of RNA interference therapeutics, merely selecting short interfering RNA (siRNA) sequences that are complementary to the mRNA target does not guarantee target silencing. Current algorithms for selecting siRNAs rely on many parameters, one of which is asymmetry, often predicted through calculation of the relative thermodynamic stabilities of the two ends of the siRNA. However, we have previously shown that highly active siRNA sequences are likely to have particular nucleotides at each 5'-end, independently of their thermodynamic asymmetry. Here, we describe an algorithm for predicting highly active siRNA sequences based only on these two asymmetry parameters. The algorithm uses end-sequence nucleotide preferences and predicted thermodynamic stabilities, each weighted on the basis of training data from the literature, to rank the probability that an siRNA sequence will have high or low activity. The algorithm successfully predicts weakly and highly active sequences for enhanced green fluorescent protein and protein kinase R. Use of these two parameters in combination improves the prediction of siRNA activity over current approaches for predicting asymmetry. Going forward, we anticipate that this approach to siRNA asymmetry prediction will be incorporated into the next generation of siRNA selection algorithms.
Collapse
Affiliation(s)
- Amanda P Malefyt
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA
| | | | | | | | | | | | | |
Collapse
|
13
|
Angart P, Vocelle D, Chan C, Walton SP. Design of siRNA Therapeutics from the Molecular Scale. Pharmaceuticals (Basel) 2013; 6:440-68. [PMID: 23976875 PMCID: PMC3749788 DOI: 10.3390/ph6040440] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
While protein-based therapeutics is well-established in the market, development of nucleic acid therapeutics has lagged. Short interfering RNAs (siRNAs) represent an exciting new direction for the pharmaceutical industry. These small, chemically synthesized RNAs can knock down the expression of target genes through the use of a native eukaryotic pathway called RNA interference (RNAi). Though siRNAs are routinely used in research studies of eukaryotic biological processes, transitioning the technology to the clinic has proven challenging. Early efforts to design an siRNA therapeutic have demonstrated the difficulties in generating a highly-active siRNA with good specificity and a delivery vehicle that can protect the siRNA as it is transported to a specific tissue. In this review article, we discuss design considerations for siRNA therapeutics, identifying criteria for choosing therapeutic targets, producing highly-active siRNA sequences, and designing an optimized delivery vehicle. Taken together, these design considerations provide logical guidelines for generating novel siRNA therapeutics.
Collapse
Affiliation(s)
- Phillip Angart
- Department of Chemical Engineering and Materials Science, Michigan State University, 428 S. Shaw Lane, Room 2527, East Lansing, MI 48824, USA; (P.A.); (D.V.); (C.C.)
| | | | | | | |
Collapse
|
14
|
Liu L, Li QZ, Lin H, Zuo YC. The effect of regions flanking target site on siRNA potency. Genomics 2013; 102:215-22. [PMID: 23891614 DOI: 10.1016/j.ygeno.2013.07.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2012] [Revised: 07/14/2013] [Accepted: 07/16/2013] [Indexed: 11/28/2022]
Abstract
For a successful RNA interference (RNAi) experiment, selecting the small interference RNA (siRNA) candidates which maximize the knock down effect of the given gene is the critical step. Although various computational approaches have been attempted, the design of efficient siRNA candidates is far from satisfactory yet. In this study, we proposed a novel feature selection algorithm of combined random forest and support vector machine to predict active siRNAs. Using a publically available dataset, we demonstrated that the predictive accuracy would be markedly improved when the context sequence features outside the target site were included. The Pearson correlation coefficient for regression is as high as 0.721, compared to 0.671, 0.668, 0.680, and 0.645, for Biopredsi, i-score, ThermoComposition21 and DSIR, respectively. It revealed that siRNA-target interaction requires appropriate sequence context not only in the target site but also in a broad region flanking the target site.
Collapse
Affiliation(s)
- Li Liu
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.
| | | | | | | |
Collapse
|
15
|
What parameters to consider and which software tools to use for target selection and molecular design of small interfering RNAs. Methods Mol Biol 2013; 942:1-16. [PMID: 23027043 DOI: 10.1007/978-1-62703-119-6_1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
The design of small gene silencing RNAs with a high probability of being efficient still has some elements of an art, especially when the lowest concentration of small molecules needs to be utilized. The design of highly target-specific small interfering RNAs or short hairpin RNAs is even a greater challenging task. Some logical schemes and software tools that can be used for simplifying both tasks are presented here. In addition, sequence motifs and sequence composition biases of small interfering RNAs that have to be avoided because of specificity concerns are also detailed.
Collapse
|
16
|
Takasaki S. Methods for selecting effective siRNA target sequences using a variety of statistical and analytical techniques. Methods Mol Biol 2013; 942:17-55. [PMID: 23027044 DOI: 10.1007/978-1-62703-119-6_2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Short interfering RNA (siRNA) has been widely used for studying gene function in mammalian cells but varies markedly in its gene silencing efficacy. Although many design rules/guidelines for effective siRNAs based on various criteria have been reported recently, there are only a few consistencies among them. This makes it difficult to select effective siRNA sequences in mammalian genes. This chapter first reviews the recently reported siRNA design guidelines and then proposes new methods for selecting effective siRNA sequences from many possible candidates by using decision tree learning, Bayes' theorem, and average silencing probability on the basis of a large number of known effective siRNAs. These methods differ from the previous score-based siRNA design techniques and can predict the probability that a candidate siRNA sequence will be effective. Evaluation of these methods by applying them to recently reported effective and ineffective siRNA sequences for a number of genes indicates that they would be useful for many other genes. They should, therefore, be of general utility for selecting effective siRNA sequences for mammalian genes. The chapter also describes another method using a hidden Markov model to select the optimal functional siRNAs and discusses the frequencies of combinations of two successive nucleotides as an important characteristic of effective siRNA sequences.
Collapse
|
17
|
Abstract
Synthetic small interfering RNAs (siRNAs) have revolutionized functional genomics in mammalian cell cultures due to their reliability, efficiency, and ease of use. This success, however, has not fully translated into siRNA applications in vivo and in siRNA therapeutics where initial optimism has been dampened by a lack of efficient delivery strategies and reports of siRNA off-target effects and immunogenicity. Encouragingly, most aspects of siRNA behavior can be addressed by careful engineering of siRNAs incorporating beneficial chemical modifications into discrete nucleotide positions during siRNA synthesis. Here, we review the literature (Subheadings 1 -3) and provide a quick guide (Subheading 4) to how the performance of siRNA can be improved by chemical modification to suit specific applications in vitro and in vivo.
Collapse
Affiliation(s)
- Jesper B Bramsen
- Department of Molecular Biology and Genetics, Interdisciplinary Nanoscience Center (iNANO), University of Aarhus, Aarhus, Denmark.
| | | |
Collapse
|
18
|
Abstract
RNA interference (RNAi) mediated by small interfering RNA (siRNA) is now widely used to knock down gene expression in a sequence-specific manner, making it a powerful tool not only for studying gene functions but also for therapeutic applications. siRNA decreases the expression level of the intended target gene with complete complementarity by cleaving its mRNA. However, the efficacy of each siRNA widely varies depending on its sequence in mammalian cells; only a limited fraction of randomly designed siRNAs is functional. Moreover, off-target silencing effects arise when the siRNA has partial complementarity in the seed region with unintended genes. Here, we describe the rational designing of functional, off-target effect-reduced siRNAs using siDirect 2.0 Web server (http://siDirect2.RNAi.jp/). By using the default parameters, siDirect 2.0 can design at least one qualified siRNA for >94% of human mRNA sequences in the RefSeq database.
Collapse
Affiliation(s)
- Yuki Naito
- Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Tokyo, Japan
| | | |
Collapse
|
19
|
Liu Q, Zhou H, Zhu R, Xu Y, Cao Z. Reconsideration of in silico siRNA design from a perspective of heterogeneous data integration: problems and solutions. Brief Bioinform 2012; 15:292-305. [PMID: 23275634 DOI: 10.1093/bib/bbs073] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The success of RNA interference (RNAi) depends on the interaction between short interference RNAs (siRNAs) and mRNAs. Design of highly efficient and specific siRNAs has become a challenging issue in applications of RNAi. Here, we present a detailed survey on the state-of-the-art siRNAs design, focusing on several key issues with the current in silico RNAi studies, including: (i) inconsistencies among the proposed guidelines for siRNAs design and the incomplete list of siRNAs features, (ii) improper integration of the heterogeneous cross-platform siRNAs data, (iii) inadequate consideration of the binding specificity of the target mRNAs and (iv) reduction in the 'off-target' effect in siRNAs design. With these considerations, the popular in silico siRNAs design rules are reexamined and several inconsistent viewpoints toward siRNAs feature identifications are clarified. In addition, novel computational models for siRNAs design using state-of-art machine learning techniques are discussed, which focus on heterogeneous data integration, joint feature selection and customized siRNAs screening toward highly specific targets. We believe that addressing such issues in siRNA study will provide new clues for further improved design of more efficient and specific siRNAs in RNAi.
Collapse
Affiliation(s)
- Qi Liu
- Department of Biochemistry and Molecular Biology A110, Life Science Building, 120 Green Street, University of Georgia, Athens, GA 30602-7229, USA. Tel.: +706-542-9779; Fax: +706-542-9751/7782; ; Zhiwei Cao, School of Life Sciences and Technology, Tongji University, 1239 Siping Road, Shanghai 200092, China. Tel.: +86-21-54065003; Fax: +86-21-65980296; E-mail:
| | | | | | | | | |
Collapse
|
20
|
Chen J, Zhang W. Kinetic analysis of the effects of target structure on siRNA efficiency. J Chem Phys 2012; 137:225102. [DOI: 10.1063/1.4769821] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
21
|
|
22
|
Mini-clusters with mean probabilities for identifying effective siRNAs. BMC Res Notes 2012; 5:512. [PMID: 22988973 PMCID: PMC3499396 DOI: 10.1186/1756-0500-5-512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2011] [Accepted: 08/07/2012] [Indexed: 11/25/2022] Open
Abstract
Background The distinction between the effective siRNAs and the ineffective ones is in high demand for gene knockout technology. To design effective siRNAs, many approaches have been proposed. Those approaches attempt to classify the siRNAs into effective and ineffective classes but they are difficult to decide the boundary between these two classes. Findings Here, we try to split effective and ineffective siRNAs into many smaller subclasses by RMP-MiC(the relative mean probabilities of siRNAs with the mini-clusters algorithm). The relative mean probabilities of siRNAs are the modified arithmetic mean value of three probabilities, which come from three Markov chain of effective siRNAs. The mini-clusters algorithm is a modified version of micro-cluster algorithm. Conclusions When the RMP-MiC was applied to the experimental siRNAs, the result shows that all effective siRNAs can be identified correctly, and no more than 9% ineffective siRNAs are misidentified as effective ones. We observed that the efficiency of those misidentified ineffective siRNAs exceed 70%, which is very closed to the used efficiency threshold. From the analysis of the siRNAs data, we suggest that the mini-clusters algorithm with relative mean probabilities can provide new insights to the applications for distinguishing effective siRNAs from ineffective ones.
Collapse
|
23
|
Matveeva OV, Nazipova NN, Ogurtsov AY, Shabalina SA. Optimized models for design of efficient miR30-based shRNAs. Front Genet 2012; 3:163. [PMID: 22952469 PMCID: PMC3429853 DOI: 10.3389/fgene.2012.00163] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 08/10/2012] [Indexed: 11/13/2022] Open
Abstract
Small hairpin RNAs (shRNAs) became an important research tool in cell biology. Reliable design of these molecules is essential for the needs of large functional genomics projects. To optimize the design of efficient shRNAs, we performed comparative, thermodynamic, and correlation analyses of ~18,000 miR30-based shRNAs with known functional efficiencies, derived from the Sensor Assay project (Fellmann et al., 2011). We identified features of the shRNA guide strand that significantly correlate with the silencing efficiency and performed multiple regression analysis, using 4/5 of the data for training purposes and 1/5 for cross validation. A model that included the position-dependent nucleotide preferences was predictive in the cross-validation data subset (R = 0.39). However, a model, which in addition to the nucleotide preferences included thermodynamic shRNA features such as a thermodynamic duplex stability and position-dependent thermodynamic profile (dinucleotide free energy) was performing better (R = 0.53). Software "miR_Scan" was developed based upon the optimized models. Calculated mRNA target secondary structure stability showed correlation with shRNA silencing efficiency but failed to improve the model. Correlation analysis demonstrates that our algorithm for identification of efficient miR30-based shRNA molecules performs better than approaches that were developed for design of chemically synthesized siRNAs (R(max) = 0.36).
Collapse
Affiliation(s)
- Olga V Matveeva
- Department of Human Genetics, University of Utah Salt Lake City, UT, USA
| | | | | | | |
Collapse
|
24
|
Bramsen JB, Kjems J. Development of Therapeutic-Grade Small Interfering RNAs by Chemical Engineering. Front Genet 2012; 3:154. [PMID: 22934103 PMCID: PMC3422727 DOI: 10.3389/fgene.2012.00154] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 07/31/2012] [Indexed: 12/25/2022] Open
Abstract
Recent successes in clinical trials have provided important proof of concept that small interfering RNAs (siRNAs) indeed constitute a new promising class of therapeutics. Although great efforts are still needed to ensure efficient means of delivery in vivo, the siRNA molecule itself has been successfully engineered by chemical modification to meet initial challenges regarding specificity, stability, and immunogenicity. To date, a great wealth of siRNA architectures and types of chemical modification are available for promoting safe siRNA-mediated gene silencing in vivo and, consequently, the choice of design and modification types can be challenging to individual experimenters. Here we review the literature and devise how to improve siRNA performance by structural design and specific chemical modification to ensure potent and specific gene silencing without unwarranted side-effects and hereby complement the ongoing efforts to improve cell targeting and delivery by other carrier molecules.
Collapse
Affiliation(s)
- Jesper B Bramsen
- Interdisciplinary Nanoscience Center, Department of Molecular Biology and Genetics, Aarhus University Aarhus C, Denmark
| | | |
Collapse
|
25
|
Naito Y, Ui-Tei K. siRNA Design Software for a Target Gene-Specific RNA Interference. Front Genet 2012; 3:102. [PMID: 22701467 PMCID: PMC3371628 DOI: 10.3389/fgene.2012.00102] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2012] [Accepted: 05/20/2012] [Indexed: 01/05/2023] Open
Abstract
RNA interference (RNAi) is a mechanism through which small interfering RNA (siRNA) induces sequence-specific posttranscriptional gene silencing. RNAi is commonly recognized as a powerful tool not only for functional genomics but also for therapeutic applications. Twenty-one-nucleotide-long siRNA suppresses the expression of the intended gene whose transcript possesses perfect complementarity to the siRNA guide strand. Hence, its silencing effect has been assumed to be extremely specific. However, accumulated evidences revealed that siRNA could downregulate unintended genes with partial complementarities mainly to the seven-nucleotide seed region of siRNA. This phenomenon is referred to as off-target effect. We have revealed that the capability to induce off-target effect is strongly correlated to the thermodynamic stability in siRNA seed-target duplex. For understanding accurate target gene function and successful therapeutic application, it may be critical to select a target gene-specific siRNA with minimized off-target effect. Here we present our siRNA design software for a target-specific RNAi. In addition, we also introduce the software programs open to the public for designing functional siRNAs.
Collapse
Affiliation(s)
- Yuki Naito
- Department of Biophysics and Biochemistry, Graduate School of Science, The University of Tokyo Tokyo, Japan
| | | |
Collapse
|
26
|
Mysara M, Elhefnawi M, Garibaldi JM. MysiRNA: Improving siRNA efficacy prediction using a machine-learning model combining multi-tools and whole stacking energy (ΔG). J Biomed Inform 2012; 45:528-34. [DOI: 10.1016/j.jbi.2012.02.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Revised: 01/25/2012] [Accepted: 02/15/2012] [Indexed: 10/28/2022]
|
27
|
Abstract
The design of small interfering RNA (siRNA) is a multi factorial problem that has gained the attention of many researchers in the area of therapeutic and functional genomics. MysiRNA score was previously introduced that improves the correlation of siRNA activity prediction considering state of the art algorithms. In this paper, a new program, MysiRNA-Designer, is described which integrates several factors in an automated work-flow considering mRNA transcripts variations, siRNA and mRNA target accessibility, and both near-perfect and partial off-target matches. It also features the MysiRNA score, a highly ranked correlated siRNA efficacy prediction score for ranking the designed siRNAs, in addition to top scoring models Biopredsi, DISR, Thermocomposition21 and i-Score, and integrates them in a unique siRNA score-filtration technique. This multi-score filtration layer filters siRNA that passes the 90% thresholds calculated from experimental dataset features. MysiRNA-Designer takes an accession, finds conserved regions among its transcript space, finds accessible regions within the mRNA, designs all possible siRNAs for these regions, filters them based on multi-scores thresholds, and then performs SNP and off-target filtration. These strict selection criteria were tested against human genes in which at least one active siRNA was designed from 95.7% of total genes. In addition, when tested against an experimental dataset, MysiRNA-Designer was found capable of rejecting 98% of the false positive siRNAs, showing superiority over three state of the art siRNA design programs. MysiRNA is a freely accessible (Microsoft Windows based) desktop application that can be used to design siRNA with a high accuracy and specificity. We believe that MysiRNA-Designer has the potential to play an important role in this area.
Collapse
|
28
|
Abstract
Chemically synthesized siRNAs are widely used for gene silencing. For in vitro applications, stability, delivery, and immunological issues are rarely problematic, but for in vivo applications the situation is different. Limited stability, undesirable pharmacokinetic behaviour, and unanticipated side effects from the immune system call for more careful structural siRNA design and inclusion of chemical modifications at selected positions. Also the notion that siRNA induces significant off-target silencing of many non-related genes has promted new effective measures to enhance specificity. The scope of this review is to provide a simple guide to successful chemical and structural modification of siRNAs with improved activity, stability, specificity, and low toxicity.
Collapse
|
29
|
Wang L, Huang C, Yang JY. Predicting siRNA potency with random forests and support vector machines. BMC Genomics 2010; 11 Suppl 3:S2. [PMID: 21143784 PMCID: PMC2999347 DOI: 10.1186/1471-2164-11-s3-s2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Background Short interfering RNAs (siRNAs) can be used to knockdown gene expression in functional genomics. For a target gene of interest, many siRNA molecules may be designed, whereas their efficiency of expression inhibition often varies. Results To facilitate gene functional studies, we have developed a new machine learning method to predict siRNA potency based on random forests and support vector machines. Since there were many potential sequence features, random forests were used to select the most relevant features affecting gene expression inhibition. Support vector machine classifiers were then constructed using the selected sequence features for predicting siRNA potency. Interestingly, gene expression inhibition is significantly affected by nucleotide dimer and trimer compositions of siRNA sequence. Conclusions The findings in this study should help design potent siRNAs for functional genomics, and might also provide further insights into the molecular mechanism of RNA interference.
Collapse
Affiliation(s)
- Liangjiang Wang
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, USA.
| | | | | |
Collapse
|
30
|
Takasaki S. Efficient prediction methods for selecting effective siRNA sequences. Comput Biol Med 2010; 40:149-58. [PMID: 20022002 DOI: 10.1016/j.compbiomed.2009.11.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2008] [Revised: 09/19/2009] [Accepted: 11/18/2009] [Indexed: 10/20/2022]
Abstract
Although short interfering RNA (siRNA) has been widely used for studying gene functions in mammalian cells, its gene silencing efficacy varies markedly and there are only a few consistencies among the recently reported design rules/guidelines for selecting siRNA sequences effective for mammalian genes. Another shortcoming of the previously reported methods is that they cannot estimate the probability that a candidate sequence will silence the target gene. This paper first reviewed the recently reported siRNA design guidelines and clarified the problems concerning the guidelines. It then proposed two prediction methods-Radial Basis Function (RBF) network and decision tree learning-and their combined method for selecting effective siRNA target sequences from many possible candidate sequences. They are quite different from the previous score-based siRNA design techniques and can predict the probability that a candidate siRNA sequence will be effective. The methods imply high estimation accuracy for selecting candidate siRNA sequences.
Collapse
|
31
|
Matveeva OV, Kang Y, Spiridonov AN, Sætrom P, Nemtsov VA, Ogurtsov AY, Nechipurenko YD, Shabalina SA. Optimization of duplex stability and terminal asymmetry for shRNA design. PLoS One 2010; 5:e10180. [PMID: 20422034 PMCID: PMC2857877 DOI: 10.1371/journal.pone.0010180] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2009] [Accepted: 03/01/2010] [Indexed: 12/17/2022] Open
Abstract
Prediction of efficient oligonucleotides for RNA interference presents a serious challenge, especially for the development of genome-wide RNAi libraries which encounter difficulties and limitations due to ambiguities in the results and the requirement for significant computational resources. Here we present a fast and practical algorithm for shRNA design based on the thermodynamic parameters. In order to identify shRNA and siRNA features universally associated with high silencing efficiency, we analyzed structure-activity relationships in thousands of individual RNAi experiments from publicly available databases (ftp://ftp.ncbi.nlm.nih.gov/pub/shabalin/siRNA/si_shRNA_selector/). Using this statistical analysis, we found free energy ranges for the terminal duplex asymmetry and for fully paired duplex stability, such that shRNAs or siRNAs falling in both ranges have a high probability of being efficient. When combined, these two parameters yield a approximately 72% success rate on shRNAs from the siRecords database, with the target RNA levels reduced to below 20% of the control. Two other parameters correlate well with silencing efficiency: the stability of target RNA and the antisense strand secondary structure. Both parameters also correlate with the short RNA duplex stability; as a consequence, adding these parameters to our prediction scheme did not substantially improve classification accuracy. To test the validity of our predictions, we designed 83 shRNAs with optimal terminal asymmetry, and experimentally verified that small shifts in duplex stability strongly affected silencing efficiency. We showed that shRNAs with short fully paired stems could be successfully selected by optimizing only two parameters: terminal duplex asymmetry and duplex stability of the hypothetical cleavage product, which also relates to the specificity of mRNA target recognition. Our approach performs at the level of the best currently utilized algorithms that take into account prediction of the secondary structure of the target and antisense RNAs, but at significantly lower computational costs. Based on this study, we created the si-shRNA Selector program that predicts both highly efficient shRNAs and functional siRNAs (ftp://ftp.ncbi.nlm.nih.gov/pub/shabalin/siRNA/si_shRNA_selector/).
Collapse
Affiliation(s)
- Olga V. Matveeva
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, United States of America
- * E-mail: (OVM); (SAS)
| | - Yibin Kang
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, United States of America
| | - Alexey N. Spiridonov
- Department of Applied Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Pål Sætrom
- Department of Computer and Information Science, Norwegian University of Science and Technology, Trondheim, Norway
- Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
| | | | - Aleksey Y. Ogurtsov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Yury D. Nechipurenko
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Svetlana A. Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (OVM); (SAS)
| |
Collapse
|
32
|
Wang X, Wang X, Varma RK, Beauchamp L, Magdaleno S, Sendera TJ. Selection of hyperfunctional siRNAs with improved potency and specificity. Nucleic Acids Res 2010; 37:e152. [PMID: 19846596 PMCID: PMC2794195 DOI: 10.1093/nar/gkp864] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
One critical step in RNA interference (RNAi) experiments is to design small interfering RNAs (siRNAs) that can greatly reduce the expression of the target transcripts, but not of other unintended targets. Although various statistical and computational approaches have been attempted, this remains a challenge facing RNAi researchers. Here, we present a new experimentally validated method for siRNA design. By analyzing public siRNA data and focusing on hyperfunctional siRNAs, we identified a set of sequence features as potency selection criteria to build an siRNA design algorithm with support vector machines. Additional bioinformatics filters were also included in the algorithm to increase RNAi specificity by reducing potential sequence cross-hybridization or microRNA-like effects. Independent validation experiments were performed, which indicated that the newly designed siRNAs have significantly improved performance, and worked effectively even at low concentrations. Furthermore, our cell-based studies demonstrated that the siRNA off-target effects were significantly reduced when the siRNAs were delivered into cells at the 3 nM concentration compared to 30 nM. Thus, the capability of our new design program to select highly potent siRNAs also renders increased RNAi specificity because these siRNAs can be used at a much lower concentration. The siRNA design web server is available at http://www5.appliedbiosystems.com/tools/siDesign/.
Collapse
Affiliation(s)
- Xiaowei Wang
- Department of Radiation Oncology, Washington University School of Medicine, St Louis, MO 63108, USA.
| | | | | | | | | | | |
Collapse
|
33
|
Abstract
RNA interference, mediated by small interfering RNAs (siRNAs), is a powerful tool for investigation of gene functions and it is increasingly being used as a therapeutic agent. However, not all siRNAs are equally potent - although simple rules for the selection of good siRNAs were proposed early on, siRNAs are still plagued with widely fluctuating efficiency. Recently, new design tools that incorporate both the structural features of the targeted RNAs and the sequence features of the siRNAs have substantially improved the efficacy of siRNAs. In this chapter, we present the algorithms behind these accessibility-aided tools and show how to design efficient siRNAs with their help.
Collapse
Affiliation(s)
- Ivo L Hofacker
- Institute for Theoretical Chemistry, University Vienna, Vienna, Austria
| | | |
Collapse
|
34
|
Abstract
RNA interference (RNAi) involves sequence-specific downregulation of target genes, leading to gene silencing in vitro and in vivo. Synthetic small interfering RNAs (siRNAs), formulated with appropriate delivery agents, can serve as effective tools for RNAi-based therapeutics. The potential of siRNA to provide antiviral activity has been studied extensively in many respiratory viruses, including influenza virus, wherein specific siRNAs target highly-conserved regions of influenza viral genome, leading to potent inhibition of viral RNA replication. Despite various delivery strategies, such as polycations and liposomes that have been employed to formulate siRNAs, effective delivery modalities are still needed. Although current strategies can provide significant biodistribution and delivery into lungs allowing gene silencing, complete protection and prolonged survival rates against multiple strains of influenza virus still remains a key challenge. Here, we describe methods and procedures pertaining to screening and selection of highly effective influenza-specific siRNAs in cell culture, in mice, and in the ferret model. This will be potentially useful to evaluate RNAi as a therapeutic modality for future clinical application.
Collapse
|
35
|
Huang F, Zhou J, Yang Z, Cui L, Zhang W, Yuan C, Yang S, Zhu J, Hua X. RNA interference inhibits hepatitis E virus mRNA accumulation and protein synthesis in vitro. Vet Microbiol 2009; 142:261-7. [PMID: 19963327 PMCID: PMC7117326 DOI: 10.1016/j.vetmic.2009.10.023] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2009] [Revised: 10/20/2009] [Accepted: 10/28/2009] [Indexed: 11/25/2022]
Abstract
Hepatitis E virus (HEV) is a zoonotic pathogen to which several species, including human beings, pigs and rodents, are reported to be susceptible. To date, vaccines developed against HEV still need to be improved and a structural gene (ORF2), which encodes a capsid protein with high sequence conservation found across HEV genotypes, is a potential candidate. To exploit the possibility of using RNA interference (RNAi) as a strategy against HEV infection, four small interference RNA (siRNA) duplex targeting ORF2 gene were constructed. A challenge against HEV infection by RNAi was performed in A549 cells. Real-Time quantitative polymerase chain reaction (Real-Time qPCR) and Western blot assay demonstrated that four HEV specific siRNAs (si-ORF2-1, si-ORF2-2, si-ORF2-3 and si-ORF2-4) were capable of protecting cells against HEV infection with very high specificity and efficiency. The results suggest that RNAi is a potent anti-HEV infection prophylaxis strategy.
Collapse
Affiliation(s)
- Fen Huang
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, PR China
| | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Comparing artificial neural networks, general linear models and support vector machines in building predictive models for small interfering RNAs. PLoS One 2009; 4:e7522. [PMID: 19847297 PMCID: PMC2760777 DOI: 10.1371/journal.pone.0007522] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2008] [Accepted: 07/22/2009] [Indexed: 12/20/2022] Open
Abstract
Background Exogenous short interfering RNAs (siRNAs) induce a gene knockdown effect in cells by interacting with naturally occurring RNA processing machinery. However not all siRNAs induce this effect equally. Several heterogeneous kinds of machine learning techniques and feature sets have been applied to modeling siRNAs and their abilities to induce knockdown. There is some growing agreement to which techniques produce maximally predictive models and yet there is little consensus for methods to compare among predictive models. Also, there are few comparative studies that address what the effect of choosing learning technique, feature set or cross validation approach has on finding and discriminating among predictive models. Principal Findings Three learning techniques were used to develop predictive models for effective siRNA sequences including Artificial Neural Networks (ANNs), General Linear Models (GLMs) and Support Vector Machines (SVMs). Five feature mapping methods were also used to generate models of siRNA activities. The 2 factors of learning technique and feature mapping were evaluated by complete 3×5 factorial ANOVA. Overall, both learning techniques and feature mapping contributed significantly to the observed variance in predictive models, but to differing degrees for precision and accuracy as well as across different kinds and levels of model cross-validation. Conclusions The methods presented here provide a robust statistical framework to compare among models developed under distinct learning techniques and feature sets for siRNAs. Further comparisons among current or future modeling approaches should apply these or other suitable statistically equivalent methods to critically evaluate the performance of proposed models. ANN and GLM techniques tend to be more sensitive to the inclusion of noisy features, but the SVM technique is more robust under large numbers of features for measures of model precision and accuracy. Features found to result in maximally predictive models are not consistent across learning techniques, suggesting care should be taken in the interpretation of feature relevance. In the models developed here, there are statistically differentiable combinations of learning techniques and feature mapping methods where the SVM technique under a specific combination of features significantly outperforms all the best combinations of features within the ANN and GLM techniques.
Collapse
|
37
|
Abstract
Background RNA interference (RNAi) is a cellular mechanism in which a short/small double stranded RNA induces the degradation of its sequence specific target mRNA, leading to specific gene silencing. Since its discovery, RNAi has become a powerful biological technique for gene function studies and drug discovery. The very first requirement of applying RNAi is to design functional small interfering RNA (siRNA) that can uniquely induce the degradation of the targeted mRNA. It has been shown that many functional synthetic siRNAs share some common characteristics, such as GC content limitation and free energy preferences at both terminals, etc. Results Our three-phase algorithm was developed to design siRNA on a whole-genome scale based on those identified characteristics of functional siRNA. When this algorithm was applied to design short hairpin RNA (shRNA), the validated success rate of shRNAs was over 70%, which was almost double the rate reported for TRC library. This indicates that the designs of siRNA and shRNA may share the same concerns. Further analysis of the shRNA dataset of 444 designs reveals that the high free energy states of the two terminals have the largest positive impact on the shRNA efficacy. Enforcing these energy characteristics of both terminals can further improve the shRNA design success rate to 83.1%. We also found that functional shRNAs have less probability for their 3' terminals to be involved in mRNA secondary structure formation. Conclusion Functional shRNAs prefer high free energy states at both terminals. High free energy states of the two terminals were found to be the largest positive impact factor on shRNA efficacy. In addition, the accessibility of the 3' terminal is another key factor to shRNA efficacy.
Collapse
Affiliation(s)
- Hong Zhou
- Saint Joseph College, West Hartford, CT 06117, USA.
| | | |
Collapse
|
38
|
Klingelhoefer JW, Moutsianas L, Holmes C. Approximate Bayesian feature selection on a large meta-dataset offers novel insights on factors that effect siRNA potency. Bioinformatics 2009; 25:1594-601. [PMID: 19420052 PMCID: PMC2940241 DOI: 10.1093/bioinformatics/btp284] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2009] [Revised: 04/19/2009] [Accepted: 04/22/2009] [Indexed: 11/15/2022] Open
Abstract
MOTIVATION Short interfering RNA (siRNA)-induced RNA interference is an endogenous pathway in sequence-specific gene silencing. The potency of different siRNAs to inhibit a common target varies greatly and features affecting inhibition are of high current interest. The limited success in predicting siRNA potency being reported so far could originate in the small number and the heterogeneity of available datasets in addition to the knowledge-driven, empirical basis on which features thought to be affecting siRNA potency are often chosen. We attempt to overcome these problems by first constructing a meta-dataset of 6483 publicly available siRNAs (targeting mammalian mRNA), the largest to date, and then applying a Bayesian analysis which accommodates feature set uncertainty. A stochastic logistic regression-based algorithm is designed to explore a vast model space of 497 compositional, structural and thermodynamic features, identifying associations with siRNA potency. RESULTS Our algorithm reveals a number of features associated with siRNA potency that are, to the best of our knowledge, either under reported in literature, such as anti-sense 5' -3' motif 'UCU', or not reported at all, such as the anti-sense 5' -3' motif 'ACGA'. These findings should aid in improving future siRNA potency predictions and might offer further insights into the working of the RNA-induced silencing complex (RISC).
Collapse
|
39
|
Takasaki S. Methods for selecting effective siRNA sequences by using statistical and clustering techniques. Methods Mol Biol 2009; 487:1-39. [PMID: 19301640 DOI: 10.1007/978-1-60327-547-7_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Short interfering RNAs (siRNAs) have been widely used for studying gene functions in mammalian cells but vary markedly in their gene-silencing efficacy. Although many design rules/guidelines for effective siRNAs based on various criteria have been reported recently, there are only a few consistencies among them. This makes it difficult to select effective siRNA sequences targeting mammalian genes. This chapter first reviews the reported siRNA design guidelines and clarifies the problems concerning the current guidelines. It then describes the recently reported new scoring methods for selecting effective siRNA sequences by using statistics and clustering techniques such as the self-organizing map (SOM) and the radial basis function (RBF) network. In the proposed three methods, individual scores are defined as a gene degradation measure based on position-specific statistical significances. The effectiveness of the methods was confirmed by evaluating effective and ineffective siRNAs for recently reported genes and comparison with other reported scoring methods. The sizes (values) of these scores are closely correlated with the degree of gene degradation, and the scores can easily be used for selecting high-potential siRNA candidates. The evaluation results indicate that the proposed new methods are useful for selecting siRNA sequences targeting mammalian mRNA sequences.
Collapse
|
40
|
Lu ZJ, Mathews DH. OligoWalk: an online siRNA design tool utilizing hybridization thermodynamics. Nucleic Acids Res 2008; 36:W104-8. [PMID: 18490376 PMCID: PMC2447759 DOI: 10.1093/nar/gkn250] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Given an mRNA sequence as input, the OligoWalk web server generates a list of small interfering RNA (siRNA) candidate sequences, ranked by the probability of being efficient siRNA (silencing efficacy greater than 70%). To accomplish this, the server predicts the free energy changes of the hybridization of an siRNA to a target mRNA, considering both siRNA and mRNA self-structure. The free energy changes of the structures are rigorously calculated using a partition function calculation. By changing advanced options, the free energy changes can also be calculated using less rigorous lowest free energy structure or suboptimal structure prediction methods for the purpose of comparison. Considering the predicted free energy changes and local siRNA sequence features, the server selects efficient siRNA with high accuracy using a support vector machine. On average, the fraction of efficient siRNAs selected by the server that will be efficient at silencing is 78.6%. The OligoWalk web server is freely accessible through internet at http://rna.urmc.rochester.edu/servers/oligowalk.
Collapse
Affiliation(s)
- Zhi John Lu
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, NY 14642, USA
| | | |
Collapse
|
41
|
Lu ZJ, Mathews DH. Fundamental differences in the equilibrium considerations for siRNA and antisense oligodeoxynucleotide design. Nucleic Acids Res 2008; 36:3738-45. [PMID: 18483081 PMCID: PMC2441788 DOI: 10.1093/nar/gkn266] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Both siRNA and antisense oligodeoxynucleotides (ODNs) inhibit the expression of a complementary gene. In this study, fundamental differences in the considerations for RNA interference and antisense ODNs are reported. In siRNA and antisense ODN databases, positive correlations are observed between the cost to open the mRNA target self-structure and the stability of the duplex to be formed, meaning the sites along the mRNA target with highest potential to form strong duplexes with antisense strands also have the greatest tendency to be involved in pre-existing structure. Efficient siRNA have less stable siRNA-target duplex stability than inefficient siRNA, but the opposite is true for antisense ODNs. It is, therefore, more difficult to avoid target self-structure in antisense ODN design. Self-structure stabilities of oligonucleotide and target correlate to the silencing efficacy of siRNA. Oligonucleotide self-structure correlations to efficacy of antisense ODNs, conversely, are insignificant. Furthermore, self-structure in the target appears to correlate with antisense ODN efficacy, but such that more effective antisense ODNs appear to target mRNA regions with greater self-structure. Therefore, different criteria are suggested for the design of efficient siRNA and antisense ODNs and the design of antisense ODNs is more challenging.
Collapse
Affiliation(s)
- Zhi John Lu
- Department of Biochemistry and Biophysics and Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Box 712, 601 Elmwood Avenue, Rochester, NY 14642, USA
| | | |
Collapse
|
42
|
The impact of target site accessibility on the design of effective siRNAs. Nat Biotechnol 2008; 26:578-83. [PMID: 18438400 DOI: 10.1038/nbt1404] [Citation(s) in RCA: 215] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2007] [Accepted: 04/07/2008] [Indexed: 11/08/2022]
Abstract
Small-interfering RNAs (siRNAs) assemble into RISC, the RNA-induced silencing complex, which cleaves complementary mRNAs. Despite their fluctuating efficacy, siRNAs are widely used to assess gene function. Although this limitation could be ascribed, in part, to variations in the assembly and activation of RISC, downstream events in the RNA interference (RNAi) pathway, such as target site accessibility, have so far not been investigated extensively. In this study we present a comprehensive analysis of target RNA structure effects on RNAi by computing the accessibility of the target site for interaction with the siRNA. Based on our observations, we developed a novel siRNA design tool, RNAxs, by combining known siRNA functionality criteria with target site accessibility. We calibrated our method on two data sets comprising 573 siRNAs for 38 genes, and tested it on an independent set of 360 siRNAs targeting four additional genes. Overall, RNAxs proves to be a robust siRNA selection tool that substantially improves the prediction of highly efficient siRNAs.
Collapse
|
43
|
McSwiggen JA, Seth S. A potential treatment for pandemic influenza using siRNAs targeting conserved regions of influenza A. Expert Opin Biol Ther 2008; 8:299-313. [DOI: 10.1517/14712598.8.3.299] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
44
|
Abstract
Small interfering RNA (siRNA) are widely used to infer gene function. Here, insights in the equilibrium of siRNA-target hybridization are used for selection of efficient siRNA. The accessibilities of siRNA and target mRNA for hybridization, as measured by folding free energy change, are shown to be significantly correlated with efficacy. For this study, a partition function calculation that considers all possible secondary structures is used to predict target site accessibility; a significant improvement over calculations that consider only the predicted lowest free energy structure or a set of low free energy structures. The predicted thermodynamic features, in addition to siRNA sequence features, are used as input for a support vector machine that selects functional siRNA. The method works well for predicting efficient siRNA (efficacy >70%) in a large siRNA data set from Novartis. The positive predictive value (percentage of sites predicted to be efficient for silencing that are) is as high as 87.6%. The sensitivity and specificity are 22.7 and 96.5%, respectively. When tested on data from different sources, the positive predictive value increased 8.1% by adding equilibrium terms to 25 local sequence features. Prediction of hybridization affinity using partition functions is now available in the RNAstructure software package.
Collapse
Affiliation(s)
- Zhi John Lu
- Department of Biochemistry & Biophysics and Department of Biostatistics & Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, NY 14642, USA
| | | |
Collapse
|
45
|
Birmingham A, Anderson E, Sullivan K, Reynolds A, Boese Q, Leake D, Karpilow J, Khvorova A. A protocol for designing siRNAs with high functionality and specificity. Nat Protoc 2007; 2:2068-78. [PMID: 17853862 DOI: 10.1038/nprot.2007.278] [Citation(s) in RCA: 152] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Effective gene silencing by the RNA interference (RNAi) pathway requires a comprehensive understanding of the elements that influence small interfering RNA (siRNA) functionality and specificity. These include (i) sequence space restrictions that define the boundaries of siRNA targeting, (ii) structural and sequence features required for efficient siRNA performance, (iii) mechanisms that underlie nonspecific gene modulation and (iv) additional features specific to the intended use (i.e., inclusion of native sugar or base chemical modifications for increased stability or specificity, vector design, etc.). Attention to each of these factors enhances siRNA performance and heightens overall confidence in the output of RNAi-mediated functional genomic studies. Here, we provide a detailed protocol explaining the methodologies used for manual and web-based design of siRNAs.
Collapse
|
46
|
Ichihara M, Murakumo Y, Masuda A, Matsuura T, Asai N, Jijiwa M, Ishida M, Shinmi J, Yatsuya H, Qiao S, Takahashi M, Ohno K. Thermodynamic instability of siRNA duplex is a prerequisite for dependable prediction of siRNA activities. Nucleic Acids Res 2007; 35:e123. [PMID: 17884914 PMCID: PMC2094068 DOI: 10.1093/nar/gkm699] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
We developed a simple algorithm, i-Score (inhibitory-Score), to predict active siRNAs by applying a linear regression model to 2431 siRNAs. Our algorithm is exclusively comprised of nucleotide (nt) preferences at each position, and no other parameters are taken into account. Using a validation dataset comprised of 419 siRNAs, we found that the prediction accuracy of i-Score is as good as those of s-Biopredsi, ThermoComposition21 and DSIR, which employ a neural network model or more parameters in a linear regression model. Reynolds and Katoh also predict active siRNAs efficiently, but the numbers of siRNAs predicted to be active are less than one-eighth of that of i-Score. We additionally found that exclusion of thermostable siRNAs, whose whole stacking energy (ΔG) is less than −34.6 kcal/mol, improves the prediction accuracy in i-Score, s-Biopredsi, ThermoComposition21 and DSIR. We also developed a universal target vector, pSELL, with which we can assay an siRNA activity of any sequence in either the sense or antisense direction. We assayed 86 siRNAs in HEK293 cells using pSELL, and validated applicability of i-Score and the whole ΔG value in designing siRNAs.
Collapse
Affiliation(s)
- Masatoshi Ichihara
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Yoshiki Murakumo
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Akio Masuda
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Toru Matsuura
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Naoya Asai
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Mayumi Jijiwa
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Maki Ishida
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Jun Shinmi
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Hiroshi Yatsuya
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Shanlou Qiao
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Masahide Takahashi
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
| | - Kinji Ohno
- Department of Biomedical Sciences, College of Life and Health Sciences, Chubu University, 1200 Matsumoto, Kasugai 487-8501, Department of Pathology, Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Department of Public Health/Health Information Dynamics, Field of Social Life Science, Program in Health and Community Medicine and Division of Molecular Pathology, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan
- *To whom correspondence should be addressed. +81 52 744 2446+81 52 744 2449
| |
Collapse
|
47
|
Hägerlöf M, Hedman H, Elmroth SKC. Platination of the siRNA sense-strand modulates both efficacy and selectivity in vitro. Biochem Biophys Res Commun 2007; 361:14-9. [PMID: 17632077 DOI: 10.1016/j.bbrc.2007.06.131] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2007] [Accepted: 06/19/2007] [Indexed: 11/17/2022]
Abstract
The use of short interfering RNAs (siRNA) for selective suppression of protein production has rapidly become a commonly used technique for transient modulation of protein levels. In the present paper, we investigate whether introduction of platinated bases in the sense strand can be used to modulate the efficacy of siRNAs. Four different siRNAs were studied, all targeting the initial AU-rich 3' UTR of Wnt-5a mRNA. The siRNAs were characterized with respect to melting properties and translational inhibitory effect in vitro using luciferase as a reporter gene. The translation inhibition studies reveal that all platinated siRNA remain efficient. For an siRNA with partial complementarity to the luciferase gene, platination was shown to reduce the off-target effects. All siRNAs were found to be active in cellular in vitro translation systems, reaching suppression levels well above 80% for the majority of siRNAs investigated.
Collapse
Affiliation(s)
- Margareta Hägerlöf
- Biokemi, Kemicentrum, Lunds Universitet, Box 124, SE-221 00 Lund, Sweden
| | | | | |
Collapse
|
48
|
Jiang P, Wu H, Da Y, Sang F, Wei J, Sun X, Lu Z. RFRCDB-siRNA: improved design of siRNAs by random forest regression model coupled with database searching. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2007; 87:230-8. [PMID: 17644215 DOI: 10.1016/j.cmpb.2007.06.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2007] [Revised: 06/01/2007] [Accepted: 06/01/2007] [Indexed: 05/16/2023]
Abstract
Although the observations concerning the factors which influence the siRNA efficacy give clues to the mechanism of RNAi, the quantitative prediction of the siRNA efficacy is still a challenge task. In this paper, we introduced a novel non-linear regression method: random forest regression (RFR), to quantitatively estimate siRNAs efficacy values. Compared with an alternative machine learning regression algorithm, support vector machine regression (SVR) and four other score-based algorithms [A. Reynolds, D. Leake, Q. Boese, S. Scaringe, W.S. Marshall, A. Khvorova, Rational siRNA design for RNA interference, Nat. Biotechnol. 22 (2004) 326-330; K. Ui-Tei, Y. Naito, F. Takahashi, T. Haraguchi, H. Ohki-Hamazaki, A. Juni, R. Ueda, K. Saigo, Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference, Nucleic Acids Res. 32 (2004) 936-948; A.C. Hsieh, R. Bo, J. Manola, F. Vazquez, O. Bare, A. Khvorova, S. Scaringe, W.R. Sellers, A library of siRNA duplexes targeting the phosphoinositide 3-kinase pathway: determinants of gene silencing for use in cell-based screens, Nucleic Acids Res. 32 (2004) 893-901; M. Amarzguioui, H. Prydz, An algorithm for selection of functional siRNA sequences, Biochem. Biophys. Res. Commun. 316 (2004) 1050-1058) our RFR model achieved the best performance of all. A web-server, RFRCDB-siRNA (http://www.bioinf.seu.edu.cn/siRNA/index.htm), has been developed. RFRCDB-siRNA consists of two modules: a siRNA-centric database and a RFR prediction system. RFRCDB-siRNA works as follows: (1) Instead of directly predicting the gene silencing activity of siRNAs, the service takes these siRNAs as queries to search against the siRNA-centric database. The matched sequences with the exceeding the user defined functionality value threshold are kept. (2) The mismatched sequences are then processed into the RFR prediction system for further analysis.
Collapse
Affiliation(s)
- Peng Jiang
- State Key Laboratory of Bioelectronics, Department of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, PR China
| | | | | | | | | | | | | |
Collapse
|
49
|
Peek AS. Improving model predictions for RNA interference activities that use support vector machine regression by combining and filtering features. BMC Bioinformatics 2007; 8:182. [PMID: 17553157 PMCID: PMC1906837 DOI: 10.1186/1471-2105-8-182] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Accepted: 06/06/2007] [Indexed: 12/29/2022] Open
Abstract
Background RNA interference (RNAi) is a naturally occurring phenomenon that results in the suppression of a target RNA sequence utilizing a variety of possible methods and pathways. To dissect the factors that result in effective siRNA sequences a regression kernel Support Vector Machine (SVM) approach was used to quantitatively model RNA interference activities. Results Eight overall feature mapping methods were compared in their abilities to build SVM regression models that predict published siRNA activities. The primary factors in predictive SVM models are position specific nucleotide compositions. The secondary factors are position independent sequence motifs (N-grams) and guide strand to passenger strand sequence thermodynamics. Finally, the factors that are least contributory but are still predictive of efficacy are measures of intramolecular guide strand secondary structure and target strand secondary structure. Of these, the site of the 5' most base of the guide strand is the most informative. Conclusion The capacity of specific feature mapping methods and their ability to build predictive models of RNAi activity suggests a relative biological importance of these features. Some feature mapping methods are more informative in building predictive models and overall t-test filtering provides a method to remove some noisy features or make comparisons among datasets. Together, these features can yield predictive SVM regression models with increased predictive accuracy between predicted and observed activities both within datasets by cross validation, and between independently collected RNAi activity datasets. Feature filtering to remove features should be approached carefully in that it is possible to reduce feature set size without substantially reducing predictive models, but the features retained in the candidate models become increasingly distinct. Software to perform feature prediction and SVM training and testing on nucleic acid sequences can be found at the following site: .
Collapse
Affiliation(s)
- Andrew S Peek
- Department of Bioinformatics, Integrated DNA Technologies, Inc., Coralville, IA 52241, USA.
| |
Collapse
|