1
|
Mosquera S, Ginésy M, Bocos-Asenjo IT, Amin H, Diez-Hermano S, Diez JJ, Niño-Sánchez J. Spray-induced gene silencing to control plant pathogenic fungi: A step-by-step guide. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2025; 67:801-825. [PMID: 39912551 DOI: 10.1111/jipb.13848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 12/31/2024] [Indexed: 02/07/2025]
Abstract
RNA interference (RNAi)-based control technologies are gaining popularity as potential alternatives to synthetic fungicides in the ongoing effort to manage plant pathogenic fungi. Among these methods, spray-induced gene silencing (SIGS) emerges as particularly promising due to its convenience and feasibility for development. This approach is a new technology for plant disease management, in which double-stranded RNAs (dsRNAs) targeting essential or virulence genes are applied to plants or plant products and subsequently absorbed by plant pathogens, triggering a gene silencing effect and the inhibition of the infection process. Spray-induced gene silencing has demonstrated efficacy in laboratory settings against various fungal pathogens. However, as research progressed from the laboratory to the greenhouse and field environments, novel challenges arose, such as ensuring the stability of dsRNAs and their effective delivery to fungal targets. Here, we provide a practical guide to SIGS for the control of plant pathogenic fungi. This guide outlines the essential steps and considerations needed for designing and assessing dsRNA molecules. It also addresses key challenges inherent to SIGS, including delivery and stability of dsRNA molecules, and how nanoencapsulation of dsRNAs can aid in overcoming these obstacles. Additionally, the guide underscores existing knowledge gaps that warrant further research and aims to provide assistance to researchers, especially those new to the field, encouraging the advancement of SIGS for the control of a broad range of fungal pathogens.
Collapse
Affiliation(s)
- Sandra Mosquera
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| | - Mireille Ginésy
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| | - Irene Teresa Bocos-Asenjo
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| | - Huma Amin
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| | - Sergio Diez-Hermano
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| | - Julio Javier Diez
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| | - Jonatan Niño-Sánchez
- Department of Plant Production and Forest Resources, Sustainable Forest Management Research Institute (iuFOR), College of Agricultural Engineering (ETSIIAA), University of Valladolid, Palencia, 34004, Spain
| |
Collapse
|
2
|
Adib AA, Karim MM. Design of therapeutic siRNAs for potential application to infection with chikungunya virus. Heliyon 2025; 11:e41824. [PMID: 39897885 PMCID: PMC11782961 DOI: 10.1016/j.heliyon.2025.e41824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 01/08/2025] [Accepted: 01/08/2025] [Indexed: 02/04/2025] Open
Abstract
Emergence of the Chikungunya virus (CHIKV) is a new threat in the world. The disastrous effect of this virus and the unavailability of specific drugs complicated the control and management of the disease. The development of a siRNA-based drug using multiple computational tools could be a way out as one of its therapeutics. Currently, very few siRNAs against CHIKV have been computationally designed and published. Here, we considered various parts of the CHIKV genome encoding different essential protein-coding genes for designing siRNAs with a view to silencing them, thereby rendering the virus inactive. Seven potential primary siRNAs were constructed, of which, five are hereafter recommended to be used as a therapeutic tool against the virus.
Collapse
Affiliation(s)
- Ahmed Ahsan Adib
- Department of Microbiology, University of Dhaka, Dhaka, 1100, Bangladesh
| | | |
Collapse
|
3
|
Welden JR, Margvelani G, Miaro M, Mathews D, Rodgers DW, Stamm S. An oligo walk to identify siRNAs against the circular Tau 12->7 RNA. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.27.635119. [PMID: 39974901 PMCID: PMC11838314 DOI: 10.1101/2025.01.27.635119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Circular RNAs are associated with numerous diseases and recent evidence shows that they can be translated into proteins after undergoing RNA modification. Circular RNAs differ from their 'linear' mRNA counterparts in their backsplice site, allowing selective targeting using RNA interference, which however limits the options to place the siRNA. We tested all possible siRNAs against the backsplice site of the circTau 12->7 RNA after it was subjected to adenosine to inosine RNA editing, a modification that promotes translation of the circRNA. Most siRNAs reduced the circRNA and protein abundance, which however did not correlate. We identified an siRNA with an IC50 of 750 pmol efficacy on protein expression. This circRNA fulfilled six of the eight criteria for siRNAs targeting mRNAs. Thus, modified circRNAs expressing protein can be targeted with siRNAs, but their optimal sequence needs to be determined empirically.
Collapse
|
4
|
Yuan C, Sun Y, Chen J, Xu Q, Zhou X, Zou Z, Xia Q. Haemaphysalis longicornis subolesin controls the infection and transmission of severe fever with thrombocytopenia syndrome virus. NPJ Vaccines 2025; 10:17. [PMID: 39856151 PMCID: PMC11761454 DOI: 10.1038/s41541-024-01061-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Accepted: 12/28/2024] [Indexed: 01/27/2025] Open
Abstract
Severe fever with thrombocytopenia syndrome (SFTS) caused by the SFTS virus (SFTSV) is an emerging tick-borne disease with a high mortality rate. Haemaphysalis longicornis is the primary reservoir and vector of SFTSV. Here, we found that targeting subolesin (SUB), an anti-tick vaccine candidate, affects the infection and transmission of SFTSV in H. longicornis. RNAi-mediated knockdown of SUB repressed SFTSV infection in the salivary glands but not in the gut of H. longicornis, which may be associated with the modulation of protein processing in endoplasmic reticulum revealed by transcriptomic analysis. Knockdown of SUB decreased the survival and engorgement rates of ticks and impaired the horizontal and co-feeding transmission of SFTSV. Furthermore, active immunization with recombinant SUB inhibited the co-feeding transmission of SFTSV, although it had no significant effect on the blood-feeding behavior of infected ticks. Collectively, these results provide a potential target for controlling SFTS and other tick-borne viral diseases.
Collapse
Affiliation(s)
- Chuanfei Yuan
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, China.
| | - Yu Sun
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, China
| | - Jingjing Chen
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, China
| | - Qiong Xu
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, China
| | - Xiang Zhou
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, China
| | - Zhen Zou
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Qianfeng Xia
- NHC Key Laboratory of Tropical Disease Control, School of Tropical Medicine, Hainan Medical University, Haikou, China.
| |
Collapse
|
5
|
Zhang Y, Yang T, Yang Y, Xu D, Hu Y, Zhang S, Luo N, Ning L, Ren L. siRNAEfficacyDB: An experimentally supported small interfering RNA efficacy database. IET Syst Biol 2024; 18:199-207. [PMID: 39541343 DOI: 10.1049/syb2.12102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 09/26/2024] [Accepted: 10/19/2024] [Indexed: 11/16/2024] Open
Abstract
Small interfering RNA (siRNA) has revolutionised biomedical research and drug development through precise post-transcriptional gene silencing technology. Despite its immense potential, siRNA therapy still faces technical challenges, such as delivery efficiency, targeting specificity, and molecular stability. To address these challenges and facilitate siRNA drug development, we have developed siRNAEfficacyDB, a comprehensive database that integrates experimentally validated siRNA efficacy data. This database contains 3544 siRNA records, covering 42 target genes and 5 cell lines. It provides detailed information on siRNA sequences, target genes, inhibition efficiencies, experimental techniques, cell lines, siRNA concentrations, and incubation times. siRNAEfficacyDB offers a user-friendly web interface that makes it easy to query, browse and analyse data, enabling efficient access to siRNA-related information. In summary, siRNAEfficacyDB provides a useful data foundation for siRNA drug design and optimisation, serving as a valuable resource for advancing computer-aided siRNA design research and nucleic acid drug development. siRNAEfficacyDB is freely available at https://cellknowledge.com.cn/siRNAEfficacy for non-commercial use.
Collapse
Affiliation(s)
- Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Ting Yang
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Yu Yang
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Dongsheng Xu
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Yucheng Hu
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Shuo Zhang
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Nanchao Luo
- School of Computer Science and Technology, Aba Teachers College, Aba, Sichuan, China
| | - Lin Ning
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Liping Ren
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| |
Collapse
|
6
|
Mittal A, Ali SE, Mathews DH. Using the RNAstructure Software Package to Predict Conserved RNA Structures. Curr Protoc 2024; 4:e70054. [PMID: 39540715 DOI: 10.1002/cpz1.70054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC. Basic Protocol 1: Predicting a structure conserved in three or more sequences with the RNAstructure web server Basic Protocol 2: Predicting a structure conserved in two sequences with the RNAstructure web server Alternative Protocol 1: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface Alternative Protocol 2: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface Alternative Protocol 3: Running TurboFold on the command line.
Collapse
Affiliation(s)
- Abhinav Mittal
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York
| | - Sara E Ali
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York
| | - David H Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York
| |
Collapse
|
7
|
Bai Y, Zhong H, Wang T, Lu ZJ. OligoFormer: an accurate and robust prediction method for siRNA design. Bioinformatics 2024; 40:btae577. [PMID: 39321261 PMCID: PMC11494384 DOI: 10.1093/bioinformatics/btae577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 08/14/2024] [Accepted: 09/23/2024] [Indexed: 09/27/2024] Open
Abstract
MOTIVATION RNA interference (RNAi) has become a widely used experimental approach for post-transcriptional regulation and is increasingly showing its potential as future targeted drugs. However, the prediction of highly efficient siRNAs (small interfering RNAs) is still hindered by dataset biases, the inadequacy of prediction methods, and the presence of off-target effects. To overcome these limitations, we propose an accurate and robust prediction method, OligoFormer, for siRNA design. RESULTS OligoFormer comprises three different modules including thermodynamic calculation, RNA-FM module, and Oligo encoder. Oligo encoder is the core module based on the transformer encoder. Taking siRNA and mRNA sequences as input, OligoFormer can obtain thermodynamic parameters, RNA-FM embedding, and Oligo embedding through these three modules, respectively. We carefully benchmarked OligoFormer against six comparable methods on siRNA efficacy datasets. OligoFormer outperforms all the other methods, with an average improvement of 9% in AUC, 6.6% in PRC, 9.8% in F1 score, and 5.1% in PCC compared to the best method among them in our inter-dataset validation. We also provide a comprehensive pipeline with prediction of siRNA efficacy and off-target effects using PITA score and TargetScan score. The ablation study shows RNA-FM module and thermodynamic parameters improved the performance and accelerated convergence of OligoFormer. The saliency maps by gradient backpropagation and base preference maps show certain base preferences in initial and terminal region of siRNAs. AVAILABILITY AND IMPLEMENTATION The source code of OligoFormer is freely available on GitHub at: https://github.com/lulab/OligoFormer. Docker image of OligoFormer is freely available on the docker hub at https://hub.docker.com/r/yilanbai/oligoformer.
Collapse
Affiliation(s)
- Yilan Bai
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
| | - Haochen Zhong
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
| | - Taiwei Wang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
- Academy for Advanced Interdisciplinary Studies (AAIS), and Peking University–Tsinghua University–National Institute of Biological Sciences Joint Graduate Program (PTN), Peking University, Beijing, 100871, China
| | - Zhi John Lu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Institute for Precision Medicine, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
8
|
Martinelli DD. From sequences to therapeutics: Using machine learning to predict chemically modified siRNA activity. Genomics 2024; 116:110815. [PMID: 38431033 DOI: 10.1016/j.ygeno.2024.110815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 01/01/2024] [Accepted: 02/25/2024] [Indexed: 03/05/2024]
Abstract
Small interfering RNAs (siRNAs) exemplify the promise of genetic medicine in the discovery of novel therapeutic modalities. Their ability to selectively suppress gene expression makes them ideal candidates for the development of oligonucleotide pharmaceuticals. Recent advancements in machine learning (ML) have facilitated the design of unmodified siRNA and efficacy prediction. However, a model trained to predict the silencing activity of siRNAs with diverse chemical modification patterns is yet to be published despite the importance of such modifications in designing siRNAs with the potential to reach the level of clinical use. This study presents the first application of ML to efficiently classify chemically modified siRNAs on the basis of sequence and chemical modification patterns alone. Three algorithms were evaluated at three classification thresholds and compared according to sensitivity, specificity, consistency of feature weights with empirical knowledge, and performance using an external validation dataset. Finally, possible directions for future research were proposed.
Collapse
|
9
|
Müller T, Mautner S, Videm P, Eggenhofer F, Raden M, Backofen R. CheRRI-Accurate classification of the biological relevance of putative RNA-RNA interaction sites. Gigascience 2024; 13:giae022. [PMID: 38837942 PMCID: PMC11152173 DOI: 10.1093/gigascience/giae022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 03/04/2024] [Accepted: 04/22/2024] [Indexed: 06/07/2024] Open
Abstract
BACKGROUND RNA-RNA interactions are key to a wide range of cellular functions. The detection of potential interactions helps to understand the underlying processes. However, potential interactions identified via in silico or experimental high-throughput methods can lack precision because of a high false-positive rate. RESULTS We present CheRRI, the first tool to evaluate the biological relevance of putative RNA-RNA interaction sites. CheRRI filters candidates via a machine learning-based model trained on experimental RNA-RNA interactome data. Its unique setup combines interactome data and an established thermodynamic prediction tool to integrate experimental data with state-of-the-art computational models. Applying these data to an automated machine learning approach provides the opportunity to not only filter data for potential false positives but also tailor the underlying interaction site model to specific needs. CONCLUSIONS CheRRI is a stand-alone postprocessing tool to filter either predicted or experimentally identified potential RNA-RNA interactions on a genomic level to enhance the quality of interaction candidates. It is easy to install (via conda, pip packages), use (via Galaxy), and integrate into existing RNA-RNA interaction pipelines.
Collapse
Affiliation(s)
- Teresa Müller
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Stefan Mautner
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Pavankumar Videm
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Florian Eggenhofer
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Martin Raden
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
- Signalling Research Centre CIBSS, University of Freiburg, Schaenzlestr. 18, 79104 Freiburg, Germany
| |
Collapse
|
10
|
Zuber J, Mathews DH. Estimating RNA Secondary Structure Folding Free Energy Changes with efn2. Methods Mol Biol 2024; 2726:1-13. [PMID: 38780725 DOI: 10.1007/978-1-0716-3519-3_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
A number of analyses require estimates of the folding free energy changes of specific RNA secondary structures. These predictions are often based on a set of nearest neighbor parameters that models the folding stability of a RNA secondary structure as the sum of folding stabilities of the structural elements that comprise the secondary structure. In the software suite RNAstructure, the free energy change calculation is implemented in the program efn2. The efn2 program estimates the folding free energy change and the experimental uncertainty in the folding free energy change. It can be run through the graphical user interface for RNAstructure, from the command line, or a web server. This chapter provides detailed protocols for using efn2.
Collapse
Affiliation(s)
- Jeffrey Zuber
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA.
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY, USA.
| |
Collapse
|
11
|
Chavez-Pena C. RNAi-Mediated Silencing in the Insect Cell-Baculovirus Expression System. Methods Mol Biol 2024; 2829:91-107. [PMID: 38951329 DOI: 10.1007/978-1-0716-3961-0_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/03/2024]
Abstract
RNA interference (RNAi) serves as an indispensable tool for gene function studies and has been substantiated through extensive research for its practical applications in the baculovirus expression vector system (BEVS). This chapter expands the RNAi toolkit in insect cell culture by including small interfering RNA (siRNA) in the protocol, in addition to the conventional use of double-stranded RNA (dsRNA). This chapter also brings attention to key design and reporting considerations, based on Minimum Information About an RNAi Experiment (MIARE) guidelines. Recommendations regarding online tools for dsRNA and siRNA design are provided, along with guidance on choosing suitable methods for measuring silencing outcomes.
Collapse
|
12
|
Hara K, Iwano N, Fukunaga T, Hamada M. DeepRaccess: high-speed RNA accessibility prediction using deep learning. FRONTIERS IN BIOINFORMATICS 2023; 3:1275787. [PMID: 37881622 PMCID: PMC10597636 DOI: 10.3389/fbinf.2023.1275787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 09/29/2023] [Indexed: 10/27/2023] Open
Abstract
RNA accessibility is a useful RNA secondary structural feature for predicting RNA-RNA interactions and translation efficiency in prokaryotes. However, conventional accessibility calculation tools, such as Raccess, are computationally expensive and require considerable computational time to perform transcriptome-scale analysis. In this study, we developed DeepRaccess, which predicts RNA accessibility based on deep learning methods. DeepRaccess was trained to take artificial RNA sequences as input and to predict the accessibility of these sequences as calculated by Raccess. Simulation and empirical dataset analyses showed that the accessibility predicted by DeepRaccess was highly correlated with the accessibility calculated by Raccess. In addition, we confirmed that DeepRaccess could predict protein abundance in E.coli with moderate accuracy from the sequences around the start codon. We also demonstrated that DeepRaccess achieved tens to hundreds of times software speed-up in a GPU environment. The source codes and the trained models of DeepRaccess are freely available at https://github.com/hmdlab/DeepRaccess.
Collapse
Affiliation(s)
- Kaisei Hara
- Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory, AIST-Waseda University, Tokyo, Japan
| | - Natsuki Iwano
- Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Tokyo, Japan
| | - Michiaki Hamada
- Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory, AIST-Waseda University, Tokyo, Japan
- Graduate School of Medicine, Nippon Medical School, Tokyo, Japan
| |
Collapse
|
13
|
Monopoli KR, Korkin D, Khvorova A. Asymmetric trichotomous partitioning overcomes dataset limitations in building machine learning models for predicting siRNA efficacy. MOLECULAR THERAPY. NUCLEIC ACIDS 2023; 33:93-109. [PMID: 37456778 PMCID: PMC10338369 DOI: 10.1016/j.omtn.2023.06.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 06/09/2023] [Indexed: 07/18/2023]
Abstract
Chemically modified small interfering RNAs (siRNAs) are promising therapeutics guiding sequence-specific silencing of disease genes. Identifying chemically modified siRNA sequences that effectively silence target genes remains challenging. Such determinations necessitate computational algorithms. Machine learning is a powerful predictive approach for tackling biological problems but typically requires datasets significantly larger than most available siRNA datasets. Here, we describe a framework applying machine learning to a small dataset (356 modified sequences) for siRNA efficacy prediction. To overcome noise and biological limitations in siRNA datasets, we apply a trichotomous, two-threshold, partitioning approach, producing several combinations of classification threshold pairs. We then test the effects of different thresholds on random forest machine learning model performance using a novel evaluation metric accounting for class imbalances. We identify thresholds yielding a model with high predictive power, outperforming a linear model generated from the same data, that was predictive upon experimental evaluation. Using a novel model feature extraction method, we observe target site base importances and base preferences consistent with our current understanding of the siRNA-mediated silencing mechanism, with the random forest providing higher resolution than the linear model. This framework applies to any classification challenge involving small biological datasets, providing an opportunity to develop high-performing design algorithms for oligonucleotide therapies.
Collapse
Affiliation(s)
- Kathryn R. Monopoli
- Department of Bioinformatics & Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
- RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| | - Dmitry Korkin
- Department of Bioinformatics & Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Anastasia Khvorova
- RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| |
Collapse
|
14
|
Abstract
RNAstructure is a user-friendly program for the prediction and analysis of RNA secondary structure. It is available as a web server, a program with a graphical user interface, or a set of command line tools. The programs are available for Microsoft Windows, macOS, or Linux. This article provides protocols for prediction of RNA secondary structure (using the web server, the graphical user interface, or the command line) and high-affinity oligonucleotide binding sites to a structured RNA target (using the graphical user interface). © 2023 Wiley Periodicals LLC. Basic Protocol 1: Predicting RNA secondary structure using the RNAstructure web server Alternate Protocol 1: Predicting secondary structure and base pair probabilities using the RNAstructure graphical user interface Alternate Protocol 2: Predicting secondary structure and base pair probabilities using the RNAstructure command line interface Basic Protocol 2: Predicting binding affinities of oligonucleotides complementary to an RNA target using OligoWalk.
Collapse
Affiliation(s)
- Sara E. Ali
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | - Abhinav Mittal
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| | - David H. Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642
| |
Collapse
|
15
|
Zhang H, Li S, Zhang L, Mathews D, Huang L. LazySampling and LinearSampling: fast stochastic sampling of RNA secondary structure with applications to SARS-CoV-2. Nucleic Acids Res 2022; 51:e7. [PMID: 36401871 PMCID: PMC9881153 DOI: 10.1093/nar/gkac1029] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 09/22/2022] [Accepted: 10/21/2022] [Indexed: 11/21/2022] Open
Abstract
Many RNAs fold into multiple structures at equilibrium, and there is a need to sample these structures according to their probabilities in the ensemble. The conventional sampling algorithm suffers from two limitations: (i) the sampling phase is slow due to many repeated calculations; and (ii) the end-to-end runtime scales cubically with the sequence length. These issues make it difficult to be applied to long RNAs, such as the full genomes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). To address these problems, we devise a new sampling algorithm, LazySampling, which eliminates redundant work via on-demand caching. Based on LazySampling, we further derive LinearSampling, an end-to-end linear time sampling algorithm. Benchmarking on nine diverse RNA families, the sampled structures from LinearSampling correlate better with the well-established secondary structures than Vienna RNAsubopt and RNAplfold. More importantly, LinearSampling is orders of magnitude faster than standard tools, being 428× faster (72 s versus 8.6 h) than RNAsubopt on the full genome of SARS-CoV-2 (29 903 nt). The resulting sample landscape correlates well with the experimentally guided secondary structure models, and is closer to the alternative conformations revealed by experimentally driven analysis. Finally, LinearSampling finds 23 regions of 15 nt with high accessibilities in the SARS-CoV-2 genome, which are potential targets for COVID-19 diagnostics and therapeutics.
Collapse
Affiliation(s)
- He Zhang
- Baidu Research, Sunnyvale, CA, USA,School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - Sizhen Li
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - Liang Zhang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 14642, USA,Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 14642, USA,Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 14642, USA
| | | |
Collapse
|
16
|
Abstract
The highly specific induction of RNA interference-mediated gene knockdown, based on the direct application of small interfering RNAs (siRNAs), opens novel avenues towards innovative therapies. Two decades after the discovery of the RNA interference mechanism, the first siRNA drugs received approval for clinical use by the US Food and Drug Administration and the European Medicines Agency between 2018 and 2022. These are mainly based on an siRNA conjugation with a targeting moiety for liver hepatocytes, N-acetylgalactosamine, and cover the treatment of acute hepatic porphyria, transthyretin-mediated amyloidosis, hypercholesterolemia, and primary hyperoxaluria type 1. Still, the development of siRNA therapeutics faces several challenges and issues, including the definition of optimal siRNAs in terms of target, sequence, and chemical modifications, siRNA delivery to its intended site of action, and the absence of unspecific off-target effects. Further siRNA drugs are in clinical studies, based on different delivery systems and covering a wide range of different pathologies including metabolic diseases, hematology, infectious diseases, oncology, ocular diseases, and others. This article reviews the knowledge on siRNA design and chemical modification, as well as issues related to siRNA delivery that may be addressed using different delivery systems. Details on the mode of action and clinical status of the various siRNA therapeutics are provided, before giving an outlook on issues regarding the future of siRNA drugs and on their potential as one emerging standard modality in pharmacotherapy. Notably, this may also cover otherwise un-druggable diseases, the definition of non-coding RNAs as targets, and novel concepts of personalized and combination treatment regimens.
Collapse
Affiliation(s)
- Maik Friedrich
- Faculty of Leipzig, Institute of Clinical Immunology, Max-Bürger-Forschungszentrum (MBFZ), University of Leipzig, Leipzig, Germany.,Department of Vaccines and Infection Models, Fraunhofer Institute for Cell Therapy and Immunology IZI, Leipzig, Germany
| | - Achim Aigner
- Rudolf-Boehm Institute for Pharmacology and Toxicology, Clinical Pharmacology, University of Leipzig, Haertelstrasse 16-18, 04107, Leipzig, Germany.
| |
Collapse
|
17
|
Asadbeigi A, Norouzi M, Vafaei Sadi MS, Saffari M, Bakhtiarizadeh MR. CaSilico: A versatile CRISPR package for in silico CRISPR RNA designing for Cas12, Cas13, and Cas14. Front Bioeng Biotechnol 2022; 10:957131. [PMID: 36017348 PMCID: PMC9395711 DOI: 10.3389/fbioe.2022.957131] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 07/04/2022] [Indexed: 12/13/2022] Open
Abstract
The efficiency of the CRISPR-Cas system is highly dependent on well-designed CRISPR RNA (crRNA). To facilitate the use of various types of CRISPR-Cas systems, there is a need for the development of computational tools to design crRNAs which cover different CRISPR-Cas systems with off-target analysis capability. Numerous crRNA design tools have been developed, but nearly all of them are dedicated to design crRNA for genome editing. Hence, we developed a tool matching the needs of both beginners and experts, named CaSilico, which was inspired by the limitations of the current crRNA design tools for designing crRNAs for Cas12, Cas13, and Cas14 CRISPR-Cas systems. This tool considers a comprehensive list of the principal rules that are not yet well described to design crRNA for these types. Using a list of important features such as mismatch tolerance rules, self-complementarity, GC content, frequency of cleaving base around the target site, target accessibility, and PFS (protospacer flanking site) or PAM (protospacer adjacent motif) requirement, CaSilico searches all potential crRNAs in a user-input sequence. Considering these features help users to rank all crRNAs for a sequence and make an informed decision about whether a crRNA is suited for an experiment or not. Our tool is sufficiently flexible to tune some key parameters governing the design of crRNA and identification of off-targets, which can lead to an increase in the chances of successful CRISPR-Cas experiments. CaSilico outperforms previous crRNA design tools in the following aspects: 1) supporting any reference genome/gene/transcriptome for which an FASTA file is available; 2) designing crRNAs that simultaneously target multiple sequences through conserved region detection among a set of sequences; 3) considering new CRISPR-Cas subtypes; and 4) reporting a list of different features for each candidate crRNA, which can help the user to select the best one. Given these capabilities, CaSilico addresses end-user concerns arising from the use of sophisticated bioinformatics algorithms and has a wide range of potential research applications in different areas, especially in the design of crRNA for pathogen diagnosis. CaSilico was successfully applied to design crRNAs for different genes in the SARS-CoV-2 genome, as some of the crRNAs have been experimentally tested in the previous studies.
Collapse
Affiliation(s)
- Adnan Asadbeigi
- Department of Medical Genetics, Cancer Institute, Faculty of Medicine, Tehran University of Medical Sciences (TUMS), Tehran, Iran
| | - Milad Norouzi
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | | | - Mojtaba Saffari
- Department of Medical Genetics, Faculty of Medicine, Tehran University of Medical Sciences (TUMS), Tehran, Iran
| | - Mohammad Reza Bakhtiarizadeh
- Department of Animal and Poultry Science, College of Aburaihan, University of Tehran, Tehran, Iran
- *Correspondence: Mohammad Reza Bakhtiarizadeh,
| |
Collapse
|
18
|
Zuber J, Schroeder SJ, Sun H, Turner DH, Mathews DH. Nearest neighbor rules for RNA helix folding thermodynamics: improved end effects. Nucleic Acids Res 2022; 50:5251-5262. [PMID: 35524574 PMCID: PMC9122537 DOI: 10.1093/nar/gkac261] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 03/29/2022] [Accepted: 04/08/2022] [Indexed: 12/26/2022] Open
Abstract
Nearest neighbor parameters for estimating the folding stability of RNA secondary structures are in widespread use. For helices, current parameters penalize terminal AU base pairs relative to terminal GC base pairs. We curated an expanded database of helix stabilities determined by optical melting experiments. Analysis of the updated database shows that terminal penalties depend on the sequence identity of the adjacent penultimate base pair. New nearest neighbor parameters that include this additional sequence dependence accurately predict the measured values of 271 helices in an updated database with a correlation coefficient of 0.982. This refined understanding of helix ends facilitates fitting terms for base pair stacks with GU pairs. Prior parameter sets treated 5′GGUC3′ paired to 3′CUGG5′ separately from other 5′GU3′/3′UG5′ stacks. The improved understanding of helix end stability, however, makes the separate treatment unnecessary. Introduction of the additional terms was tested with three optical melting experiments. The average absolute difference between measured and predicted free energy changes at 37°C for these three duplexes containing terminal adjacent AU and GU pairs improved from 1.38 to 0.27 kcal/mol. This confirms the need for the additional sequence dependence in the model.
Collapse
Affiliation(s)
- Jeffrey Zuber
- Alnylam Pharmaceuticals, Inc., Cambridge, MA 02142, USA
| | - Susan J Schroeder
- Department of Chemistry and Biochemistry, and Department of Microbiology and Plant Biology, University of Oklahoma, Norman, OK 73019, USA
| | - Hongying Sun
- Department of Biochemistry & Biophysics, University of Rochester, Rochester, NY 14642, USA.,Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA
| | - Douglas H Turner
- Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA.,Department of Chemistry, University of Rochester, Rochester, NY 14627, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester, Rochester, NY 14642, USA.,Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA.,Department of Biostatistics & Computational Biology, University of Rochester, Rochester, NY 14642, USA
| |
Collapse
|
19
|
Design of siRNA molecules for silencing of membrane glycoprotein, nucleocapsid phosphoprotein, and surface glycoprotein genes of SARS-CoV2. J Genet Eng Biotechnol 2022; 20:65. [PMID: 35482116 PMCID: PMC9047631 DOI: 10.1186/s43141-022-00346-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 04/18/2022] [Indexed: 12/24/2022]
Abstract
The global COVID-19 pandemic caused by SARS-CoV2 infected millions of people and resulted in more than 4 million deaths worldwide. Apart from vaccines and drugs, RNA silencing is a novel approach for treating COVID-19. In the present study, siRNAs were designed for the conserved regions targeting three structural genes, M, N, and S, from forty whole-genome sequences of SARS-CoV2 using four different software, RNAxs, siDirect, i-Score Designer, and OligoWalk. Only siRNAs which were predicted in common by all the four servers were considered for further shortlisting. A multistep filtering approach has been adopted in the present study for the final selection of siRNAs by the usage of different online tools, viz., siRNA scales, MaxExpect, DuplexFold, and SMEpred. All these web-based tools consider several important parameters for designing functional siRNAs, e.g., target-site accessibility, duplex stability, position-specific nucleotide preference, inhibitory score, thermodynamic parameters, GC content, and efficacy in cleaving the target. In addition, a few parameters like GC content and dG value of the entire siRNA were also considered for shortlisting of the siRNAs. Antisense strands were subjected to check for any off-target similarities using BLAST. Molecular docking was carried out to study the interactions of guide strands with AGO2 protein. A total of six functional siRNAs (two for each gene) have been finally selected for targeting M, N, and S genes of SARS-CoV2. The siRNAs have not shown any off-target effects, interacted with the domain(s) of AGO2 protein, and were efficacious in cleaving the target mRNA. However, the siRNAs designed in the present study need to be tested in vitro and in vivo in the future.
Collapse
|
20
|
Palit P, Chowdhury FT, Baruah N, Sarkar B, Mou SN, Kamal M, Siddiqua TJ, Noor Z, Ahmed T. A Comprehensive Computational Investigation into the Conserved Virulent Proteins of Shigella species Unveils Potential Small-Interfering RNA Candidates as a New Therapeutic Strategy against Shigellosis. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27061936. [PMID: 35335300 PMCID: PMC8950558 DOI: 10.3390/molecules27061936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 12/19/2021] [Accepted: 12/27/2021] [Indexed: 11/16/2022]
Abstract
Shigella species account for the second-leading cause of deaths due to diarrheal diseases among children of less than 5 years of age. The emergence of multi-drug-resistant Shigella isolates and the lack of availability of Shigella vaccines have led to the pertinence in the efforts made for the development of new therapeutic strategies against shigellosis. Consequently, designing small-interfering RNA (siRNA) candidates against such infectious agents represents a novel approach to propose new therapeutic candidates to curb the rampant rise of anti-microbial resistance in such pathogens. In this study, we analyzed 264 conserved sequences from 15 different conserved virulence genes of Shigella sp., through extensive rational validation using a plethora of first-generation and second-generation computational algorithms for siRNA designing. Fifty-eight siRNA candidates were obtained by using the first-generation algorithms, out of which only 38 siRNA candidates complied with the second-generation rules of siRNA designing. Further computational validation showed that 16 siRNA candidates were found to have a substantial functional efficiency, out of which 11 siRNA candidates were found to be non-immunogenic. Finally, three siRNA candidates exhibited a sterically feasible three-dimensional structure as exhibited by parameters of nucleic acid geometry such as: the probability of wrong sugar puckers, bad backbone confirmations, bad bonds, and bad angles being within the accepted threshold for stable tertiary structure. Although the findings of our study require further wet-lab validation and optimization for therapeutic use in the treatment of shigellosis, the computationally validated siRNA candidates are expected to suppress the expression of the virulence genes, namely: IpgD (siRNA 9) and OspB (siRNA 15 and siRNA 17) and thus act as a prospective tool in the RNA interference (RNAi) pathway. However, the findings of our study require further wet-lab validation and optimization for regular therapeutic use for treatment of shigellosis.
Collapse
Affiliation(s)
- Parag Palit
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka 1212, Bangladesh; (P.P.); (M.K.); (T.J.S.); (T.A.)
| | - Farhana Tasnim Chowdhury
- Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh; (F.T.C.); (B.S.); (S.N.M.)
| | - Namrata Baruah
- Department of Biological Sciences and Bioengineering, Indian Institute of Technology, Kanpur 208016, Uttar Pradesh, India;
| | - Bonoshree Sarkar
- Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh; (F.T.C.); (B.S.); (S.N.M.)
| | - Sadia Noor Mou
- Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh; (F.T.C.); (B.S.); (S.N.M.)
| | - Mehnaz Kamal
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka 1212, Bangladesh; (P.P.); (M.K.); (T.J.S.); (T.A.)
| | - Towfida Jahan Siddiqua
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka 1212, Bangladesh; (P.P.); (M.K.); (T.J.S.); (T.A.)
| | - Zannatun Noor
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka 1212, Bangladesh; (P.P.); (M.K.); (T.J.S.); (T.A.)
- Correspondence:
| | - Tahmeed Ahmed
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka 1212, Bangladesh; (P.P.); (M.K.); (T.J.S.); (T.A.)
| |
Collapse
|
21
|
Secondary structure prediction for RNA sequences including N 6-methyladenosine. Nat Commun 2022; 13:1271. [PMID: 35277476 PMCID: PMC8917230 DOI: 10.1038/s41467-022-28817-4] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 02/10/2022] [Indexed: 01/22/2023] Open
Abstract
There is increasing interest in the roles of covalently modified nucleotides in RNA. There has been, however, an inability to account for modifications in secondary structure prediction because of a lack of software and thermodynamic parameters. We report the solution for these issues for N6-methyladenosine (m6A), allowing secondary structure prediction for an alphabet of A, C, G, U, and m6A. The RNAstructure software now works with user-defined nucleotide alphabets of any size. We also report a set of nearest neighbor parameters for helices and loops containing m6A, using experiments. Interestingly, N6-methylation decreases folding stability for adenosines in the middle of a helix, has little effect on folding stability for adenosines at the ends of helices, and increases folding stability for unpaired adenosines stacked on a helix. We demonstrate predictions for an N6-methylation-activated protein recognition site from MALAT1 and human transcriptome-wide effects of N6-methylation on the probability of adenosine being buried in a helix. RNA folding free energy nearest neighbor parameters were determined for sequences with the nucleotide m6A. The RNAstructure software package can accommodate modified nucleotides, enabling secondary structure prediction of sequences with m6A.
Collapse
|
22
|
Zambrano RAI, Hernandez-Perez C, Takahashi MK. RNA Structure Prediction, Analysis, and Design: An Introduction to Web-Based Tools. Methods Mol Biol 2022; 2518:253-269. [PMID: 35666450 DOI: 10.1007/978-1-0716-2421-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Understanding RNA structure has become critical in the study of RNA in their roles as mediators of biological processes. To aid in these studies, computational algorithms that utilize thermodynamics have been developed to predict RNA secondary structure. Due to the importance of intermolecular interactions, the algorithms have been expanded to determine and predict RNA-RNA hybridization. This chapter discusses popular webservers with the tools for RNA secondary structure prediction, RNA-RNA hybridization, and design. We address key features that distinguish common-functioning programs and their purposes for the interests of the user. Ultimately, we hope this review elucidates web-based tools researchers may take advantage of in their investigations of RNA structure and function.
Collapse
Affiliation(s)
| | | | - Melissa K Takahashi
- Department of Biology, California State University Northridge, Northridge, CA, USA.
| |
Collapse
|
23
|
Li S, Zhang H, Zhang L, Liu K, Liu B, Mathews DH, Huang L. LinearTurboFold: Linear-time global prediction of conserved structures for RNA homologs with applications to SARS-CoV-2. Proc Natl Acad Sci U S A 2021; 118:e2116269118. [PMID: 34887342 PMCID: PMC8719904 DOI: 10.1073/pnas.2116269118] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/05/2021] [Indexed: 12/26/2022] Open
Abstract
The constant emergence of COVID-19 variants reduces the effectiveness of existing vaccines and test kits. Therefore, it is critical to identify conserved structures in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genomes as potential targets for variant-proof diagnostics and therapeutics. However, the algorithms to predict these conserved structures, which simultaneously fold and align multiple RNA homologs, scale at best cubically with sequence length and are thus infeasible for coronaviruses, which possess the longest genomes (∼30,000 nt) among RNA viruses. As a result, existing efforts on modeling SARS-CoV-2 structures resort to single-sequence folding as well as local folding methods with short window sizes, which inevitably neglect long-range interactions that are crucial in RNA functions. Here we present LinearTurboFold, an efficient algorithm for folding RNA homologs that scales linearly with sequence length, enabling unprecedented global structural analysis on SARS-CoV-2. Surprisingly, on a group of SARS-CoV-2 and SARS-related genomes, LinearTurboFold's purely in silico prediction not only is close to experimentally guided models for local structures, but also goes far beyond them by capturing the end-to-end pairs between 5' and 3' untranslated regions (UTRs) (∼29,800 nt apart) that match perfectly with a purely experimental work. Furthermore, LinearTurboFold identifies undiscovered conserved structures and conserved accessible regions as potential targets for designing efficient and mutation-insensitive small-molecule drugs, antisense oligonucleotides, small interfering RNAs (siRNAs), CRISPR-Cas13 guide RNAs, and RT-PCR primers. LinearTurboFold is a general technique that can also be applied to other RNA viruses and full-length genome studies and will be a useful tool in fighting the current and future pandemics.
Collapse
Affiliation(s)
- Sizhen Li
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR 97331
| | - He Zhang
- Baidu Research, Sunnyvale, CA 94089
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR 97331
| | - Liang Zhang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR 97331
- Baidu Research, Sunnyvale, CA 94089
| | - Kaibo Liu
- Baidu Research, Sunnyvale, CA 94089
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR 97331
| | | | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 14642;
- Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 14642
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 14642
| | - Liang Huang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR 97331;
- Baidu Research, Sunnyvale, CA 94089
| |
Collapse
|
24
|
Rahman A, Gupta SD, Rahman MA, Tamanna S. An in-silico approach to design potential siRNAs against the ORF57 of Kaposi's sarcoma-associated herpesvirus. Genomics Inform 2021; 19:e47. [PMID: 35012290 PMCID: PMC8752988 DOI: 10.5808/gi.21057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 11/25/2021] [Accepted: 12/09/2021] [Indexed: 12/21/2022] Open
Abstract
Kaposi's sarcoma-associated herpesvirus (KSHV) is one of the few human oncogenic viruses, which causes a variety of malignancies, including Kaposi's sarcoma, multicentric Castleman disease, and primary effusion lymphoma, particularly in human immunodeficiency virus patients. The currently available treatment options cannot always prevent the invasion and dissemination of this virus. In recent times, siRNA-based therapeutics are gaining prominence over conventional medications as siRNA can be designed to target almost any gene of interest. The ORF57 is a crucial regulatory protein for lytic gene expression of KSHV. Disruption of this gene translation will inevitably inhibit the replication of the virus in the host cell. Therefore, the ORF57 of KSHV could be a potential target for designing siRNA-based therapeutics. Considering both sequence preferences and target site accessibility, several online tools (i-SCORE Designer, Sfold web server) had been utilized to predict the siRNA guide strand against the ORF57. Subsequently, off-target filtration (BLAST), conservancy test (fuzznuc), and thermodynamics analysis (RNAcofold, RNAalifold, and RNA Structure web server) were also performed to select the most suitable siRNA sequences. Finally, two siRNAs were identified that passed all of the filtration phases and fulfilled the thermodynamic criteria. We hope that the siRNAs predicted in this study would be helpful for the development of new effective therapeutics against KSHV.
Collapse
Affiliation(s)
- Anisur Rahman
- Department of Biotechnology and Genetic Engineering, Faculty of Science, Noakhali Science and Technology University, Noakhali 3814, Bangladesh
| | - Shipan Das Gupta
- Department of Biotechnology and Genetic Engineering, Faculty of Science, Noakhali Science and Technology University, Noakhali 3814, Bangladesh
| | - Md. Anisur Rahman
- Department of Biotechnology and Genetic Engineering, Faculty of Science, Noakhali Science and Technology University, Noakhali 3814, Bangladesh
| | - Saheda Tamanna
- Department of Biotechnology and Genetic Engineering, Faculty of Science, Noakhali Science and Technology University, Noakhali 3814, Bangladesh
| |
Collapse
|
25
|
Zhang H, Zhang L, Li S, Mathews DH, Huang L. LazySampling and LinearSampling: Fast Stochastic Sampling of RNA Secondary Structure with Applications to SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2020.12.29.424617. [PMID: 33398265 PMCID: PMC7781300 DOI: 10.1101/2020.12.29.424617] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Many RNAs fold into multiple structures at equilibrium. The classical stochastic sampling algorithm can sample secondary structures according to their probabilities in the Boltzmann ensemble, and is widely used. However, this algorithm, consisting of a bottom-up partition function phase followed by a top-down sampling phase, suffers from three limitations: (a) the formulation and implementation of the sampling phase are unnecessarily complicated; (b) the sampling phase repeatedly recalculates many redundant recursions already done during the partition function phase; (c) the partition function runtime scales cubically with the sequence length. These issues prevent stochastic sampling from being used for very long RNAs such as the full genomes of SARS-CoV-2. To address these problems, we first adopt a hypergraph framework under which the sampling algorithm can be greatly simplified. We then present three sampling algorithms under this framework, among which the LazySampling algorithm is the fastest by eliminating redundant work in the sampling phase via on-demand caching. Based on LazySampling, we further replace the cubic-time partition function by a linear-time approximate one, and derive LinearSampling, an end-to-end linear-time sampling algorithm that is orders of magnitude faster than the standard one. For instance, LinearSampling is 176Ã- faster (38.9s vs. 1.9h) than Vienna RNAsubopt on the full genome of Ebola virus (18,959 nt ). More importantly, LinearSampling is the first RNA structure sampling algorithm to scale up to the full-genome of SARS-CoV-2 without local window constraints, taking only 69.2 seconds on its reference sequence (29,903 nt ). The resulting sample correlates well with the experimentally-guided structures. On the SARS-CoV-2 genome, LinearSampling finds 23 regions of 15 nt with high accessibilities, which are potential targets for COVID-19 diagnostics and drug design. See code: https://github.com/LinearFold/LinearSampling.
Collapse
|
26
|
Li S, Zhang H, Zhang L, Liu K, Liu B, Mathews DH, Huang L. LinearTurboFold: Linear-Time Global Prediction of Conserved Structures for RNA Homologs with Applications to SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2020.11.23.393488. [PMID: 34816262 PMCID: PMC8609897 DOI: 10.1101/2020.11.23.393488] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The constant emergence of COVID-19 variants reduces the effectiveness of existing vaccines and test kits. Therefore, it is critical to identify conserved structures in SARS-CoV-2 genomes as potential targets for variant-proof diagnostics and therapeutics. However, the algorithms to predict these conserved structures, which simultaneously fold and align multiple RNA homologs, scale at best cubically with sequence length, and are thus infeasible for coronaviruses, which possess the longest genomes (∼30,000 nt ) among RNA viruses. As a result, existing efforts on modeling SARS-CoV-2 structures resort to single sequence folding as well as local folding methods with short window sizes, which inevitably neglect long-range interactions that are crucial in RNA functions. Here we present LinearTurboFold, an efficient algorithm for folding RNA homologs that scales linearly with sequence length, enabling unprecedented global structural analysis on SARS-CoV-2. Surprisingly, on a group of SARS-CoV-2 and SARS-related genomes, LinearTurbo-Fold's purely in silico prediction not only is close to experimentally-guided models for local structures, but also goes far beyond them by capturing the end-to-end pairs between 5' and 3' UTRs (∼29,800 nt apart) that match perfectly with a purely experimental work. Furthermore, LinearTurboFold identifies novel conserved structures and conserved accessible regions as potential targets for designing efficient and mutation-insensitive small-molecule drugs, antisense oligonucleotides, siRNAs, CRISPR-Cas13 guide RNAs and RT-PCR primers. LinearTurboFold is a general technique that can also be applied to other RNA viruses and full-length genome studies, and will be a useful tool in fighting the current and future pandemics. SIGNIFICANCE STATEMENT Conserved RNA structures are critical for designing diagnostic and therapeutic tools for many diseases including COVID-19. However, existing algorithms are much too slow to model the global structures of full-length RNA viral genomes. We present LinearTurboFold, a linear-time algorithm that is orders of magnitude faster, making it the first method to simultaneously fold and align whole genomes of SARS-CoV-2 variants, the longest known RNA virus (∼30 kilobases). Our work enables unprecedented global structural analysis and captures long-range interactions that are out of reach for existing algorithms but crucial for RNA functions. LinearTurboFold is a general technique for full-length genome studies and can help fight the current and future pandemics.
Collapse
Affiliation(s)
- Sizhen Li
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR
| | - He Zhang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR
- Baidu Research, Sunnyvale, CA
| | - Liang Zhang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR
- Baidu Research, Sunnyvale, CA
| | - Kaibo Liu
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR
- Baidu Research, Sunnyvale, CA
| | | | - David H. Mathews
- Department of Biochemistry & Biophysics, Center for RNA Biology, and Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY
| | - Liang Huang
- School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR
- Baidu Research, Sunnyvale, CA
| |
Collapse
|
27
|
Bao C, Ermolenko DN. Ribosome as a Translocase and Helicase. BIOCHEMISTRY (MOSCOW) 2021; 86:992-1002. [PMID: 34488575 PMCID: PMC8294220 DOI: 10.1134/s0006297921080095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
During protein synthesis, ribosome moves along mRNA to decode one codon after the other. Ribosome translocation is induced by a universally conserved protein, elongation factor G (EF-G) in bacteria and elongation factor 2 (EF-2) in eukaryotes. EF-G-induced translocation results in unwinding of the intramolecular secondary structures of mRNA by three base pairs at a time that renders the translating ribosome a processive helicase. Professor Alexander Sergeevich Spirin has made numerous seminal contributions to understanding the molecular mechanism of translocation. Here, we review Spirin's insights into the ribosomal translocation and recent advances in the field that stemmed from Spirin's pioneering work. We also discuss key remaining challenges in studies of translocase and helicase activities of the ribosome.
Collapse
Affiliation(s)
- Chen Bao
- Department of Biochemistry & Biophysics, School of Medicine and Dentistry and Center for RNA Biology, University of Rochester, Rochester, NY, USA.
| | - Dmitri N Ermolenko
- Department of Biochemistry & Biophysics, School of Medicine and Dentistry and Center for RNA Biology, University of Rochester, Rochester, NY, USA.
| |
Collapse
|
28
|
Zhao Q, Zhao Z, Fan X, Yuan Z, Mao Q, Yao Y. Review of machine learning methods for RNA secondary structure prediction. PLoS Comput Biol 2021; 17:e1009291. [PMID: 34437528 PMCID: PMC8389396 DOI: 10.1371/journal.pcbi.1009291] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Secondary structure plays an important role in determining the function of noncoding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagnated in the last decade. Recently, with the increasing availability of RNA structure data, new methods based on machine learning (ML) technologies, especially deep learning, have alleviated the issue. In this review, we provide a comprehensive overview of RNA secondary structure prediction methods based on ML technologies and a tabularized summary of the most important methods in this field. The current pending challenges in the field of RNA secondary structure prediction and future trends are also discussed.
Collapse
Affiliation(s)
- Qi Zhao
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, Liaoning, China
| | - Zheng Zhao
- School of Information Science and Technology, Dalian Maritime University, Dalian, Liaoning, China
| | - Xiaoya Fan
- School of Software, Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, Dalian University of Technology, Dalian, Liaoning, China
| | - Zhengwei Yuan
- Key Laboratory of Health Ministry for Congenital Malformation, Shengjing Hospital of China Medical University, Shenyang, Liaoning, China
| | - Qian Mao
- College of Light Industry, Liaoning University, Shenyang, Liaoning, China
- Key Laboratory of Agroproducts Processing Technology, Changchun University, Changchun, Jilin, China
| | - Yudong Yao
- Department of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, New Jersey, United States of America
| |
Collapse
|
29
|
In Vitro Inhibitory Analysis of Rationally Designed siRNAs against MERS-CoV Replication in Huh7 Cells. Molecules 2021; 26:molecules26092610. [PMID: 33947034 PMCID: PMC8125306 DOI: 10.3390/molecules26092610] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 04/23/2021] [Accepted: 04/27/2021] [Indexed: 11/17/2022] Open
Abstract
MERS-CoV was identified for the first time in Jeddah, Saudi Arabia in 2012 in a hospitalized patient. This virus subsequently spread to 27 countries with a total of 939 deaths and 2586 confirmed cases and now has become a serious concern globally. Camels are well known for the transmission of the virus to the human population. In this report, we have discussed the prediction, designing, and evaluation of potential siRNA targeting the ORF1ab gene for the inhibition of MERS-CoV replication. The online software, siDirect 2.0 was used to predict and design the siRNAs, their secondary structure and their target accessibility. ORF1ab gene folding was performed by RNAxs and RNAfold software. A total of twenty-one siRNAs were selected from 462 siRNAs according to their scoring and specificity. siRNAs were evaluated in vitro for their cytotoxicity and antiviral efficacy in Huh7 cell line. No significant cytotoxicity was observed for all siRNAs in Huh7 cells. The in vitro study showed the inhibition of viral replication by three siRNAs. The data generated in this study provide preliminary and encouraging information to evaluate the siRNAs separately as well as in combination against MERS-CoV replication in other cell lines. The prediction of siRNAs using online software resulted in the filtration and selection of potential siRNAs with high accuracy and strength. This computational approach resulted in three effective siRNAs that can be taken further to in vivo animal studies and can be used to develop safe and effective antiviral therapies for other prevalent disease-causing viruses.
Collapse
|
30
|
Sato K, Akiyama M, Sakakibara Y. RNA secondary structure prediction using deep learning with thermodynamic integration. Nat Commun 2021; 12:941. [PMID: 33574226 PMCID: PMC7878809 DOI: 10.1038/s41467-021-21194-4] [Citation(s) in RCA: 178] [Impact Index Per Article: 44.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 01/15/2021] [Indexed: 12/23/2022] Open
Abstract
Accurate predictions of RNA secondary structures can help uncover the roles of functional non-coding RNAs. Although machine learning-based models have achieved high performance in terms of prediction accuracy, overfitting is a common risk for such highly parameterized models. Here we show that overfitting can be minimized when RNA folding scores learnt using a deep neural network are integrated together with Turner’s nearest-neighbor free energy parameters. Training the model with thermodynamic regularization ensures that folding scores and the calculated free energy are as close as possible. In computational experiments designed for newly discovered non-coding RNAs, our algorithm (MXfold2) achieves the most robust and accurate predictions of RNA secondary structures without sacrificing computational efficiency compared to several other algorithms. The results suggest that integrating thermodynamic information could help improve the robustness of deep learning-based predictions of RNA secondary structure. Accurately predicting the secondary structure of non-coding RNAs can help unravel their function. Here the authors propose a method integrating thermodynamic information and deep learning to improve the robustness of RNA secondary structure prediction compared to several existing algorithms.
Collapse
Affiliation(s)
- Kengo Sato
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Japan.
| | - Manato Akiyama
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Japan
| | - Yasubumi Sakakibara
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Japan
| |
Collapse
|
31
|
Perez-Mendez M, Zárate-Segura P, Salas-Benito J, Bastida-González F. siRNA Design to Silence the 3'UTR Region of Zika Virus. BIOMED RESEARCH INTERNATIONAL 2020; 2020:6759346. [PMID: 32802864 PMCID: PMC7421096 DOI: 10.1155/2020/6759346] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Revised: 05/16/2020] [Accepted: 07/02/2020] [Indexed: 12/25/2022]
Abstract
The disease caused by the Zika virus (ZIKV) has positioned itself as one of the main public health problems in Mexico. One of the main reasons is it causes microcephaly and other birth defects. The transmission of ZIKV is through Aedes aegypti and Ae. albopictus mosquitoes, which are found in a larger space of the national territory. In addition, it can also be transmitted via blood transfusion, sexual relations, and maternal-fetal route. So far, there are no vaccines or specific treatments to deal with this infection. Currently, some new therapeutics such as small interfering RNAs (siRNAs) are able to regulate or suppress transcription in viruses. Therefore, in this project, an in silico siRNA was designed for the 3'UTR region of ZIKV via bioinformatics tools. The designed siRNA was synthesized and transfected into the C6/36 cell line, previously infected with ZIKV in order to assess the ability of the siRNA to inhibit viral replication. The designed siRNA was able to inhibit significantly (p < 0.05) ZIKV replication; this siRNA could be considered a potential therapeutic towards the disease that causes ZIKV and the medical problems generated.
Collapse
Affiliation(s)
- María Perez-Mendez
- Laboratorio de Medicina Traslacional, Escuela Superior de Medicina, Instituto Politécnico Nacional, St. Salvador Díaz Mirón Esquina Plan de San Luis, Santo Tomas, Miguel Hidalgo, CDMX 11340, Mexico
| | - Paola Zárate-Segura
- Laboratorio de Medicina Traslacional, Escuela Superior de Medicina, Instituto Politécnico Nacional, St. Salvador Díaz Mirón Esquina Plan de San Luis, Santo Tomas, Miguel Hidalgo, CDMX 11340, Mexico
| | - Juan Salas-Benito
- Laboratorio de Biomedicina Molecular 3, Escuela Nacional de Medicina y Homeopatía, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, La Escalera-Ticomán, Gustavo A. Madero, CDMX 07329, Mexico
| | - Fernando Bastida-González
- Laboratorio de Biología Molecular, Laboratorio Estatal de Salud Pública del Estado de México, Paseo Tollocan s/n, La Moderna de la Cruz, EDOMEX, Toluca, 50180, Mexico
| |
Collapse
|
32
|
Zhang H, Zhang L, Mathews DH, Huang L. LinearPartition: linear-time approximation of RNA folding partition function and base-pairing probabilities. Bioinformatics 2020; 36:i258-i267. [PMID: 32657379 PMCID: PMC7355276 DOI: 10.1093/bioinformatics/btaa460] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION RNA secondary structure prediction is widely used to understand RNA function. Recently, there has been a shift away from the classical minimum free energy methods to partition function-based methods that account for folding ensembles and can therefore estimate structure and base pair probabilities. However, the classical partition function algorithm scales cubically with sequence length, and is therefore prohibitively slow for long sequences. This slowness is even more severe than cubic-time free energy minimization due to a substantially larger constant factor in runtime. RESULTS Inspired by the success of our recent LinearFold algorithm that predicts the approximate minimum free energy structure in linear time, we design a similar linear-time heuristic algorithm, LinearPartition, to approximate the partition function and base-pairing probabilities, which is shown to be orders of magnitude faster than Vienna RNAfold and CONTRAfold (e.g. 2.5 days versus 1.3 min on a sequence with length 32 753 nt). More interestingly, the resulting base-pairing probabilities are even better correlated with the ground-truth structures. LinearPartition also leads to a small accuracy improvement when used for downstream structure prediction on families with the longest length sequences (16S and 23S rRNAs), as well as a substantial improvement on long-distance base pairs (500+ nt apart). AVAILABILITY AND IMPLEMENTATION Code: http://github.com/LinearFold/LinearPartition; Server: http://linearfold.org/partition. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- He Zhang
- Baidu Research, Sunnyvale, CA 94089, USA
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| | - Liang Zhang
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 48306, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 48306, USA
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 48306, USA
| | - Liang Huang
- Baidu Research, Sunnyvale, CA 94089, USA
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| |
Collapse
|
33
|
Ermolenko DN, Mathews DH. Making ends meet: New functions of mRNA secondary structure. WILEY INTERDISCIPLINARY REVIEWS-RNA 2020; 12:e1611. [PMID: 32597020 DOI: 10.1002/wrna.1611] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 05/19/2020] [Accepted: 05/19/2020] [Indexed: 11/10/2022]
Abstract
The 5' cap and 3' poly(A) tail of mRNA are known to synergistically regulate mRNA translation and stability. Recent computational and experimental studies revealed that both protein-coding and non-coding RNAs will fold with extensive intramolecular secondary structure, which will result in close distances between the sequence ends. This proximity of the ends is a sequence-independent, universal property of most RNAs. Only low-complexity sequences without guanosines are without secondary structure and exhibit end-to-end distances expected for RNA random coils. The innate proximity of RNA ends might have important biological implications that remain unexplored. In particular, the inherent compactness of mRNA might regulate translation initiation by facilitating the formation of protein complexes that bridge mRNA 5' and 3' ends. Additionally, the proximity of mRNA ends might mediate coupling of 3' deadenylation to 5' end mRNA decay. This article is categorized under: RNA Structure and Dynamics > RNA Structure, Dynamics, and Chemistry RNA Structure and Dynamics > Influence of RNA Structure in Biological Systems Translation > Translation Regulation.
Collapse
Affiliation(s)
- Dmitri N Ermolenko
- Department of Biochemistry & Biophysics and Center for RNA Biology, School of Medicine and Dentistry, University of Rochester, Rochester, New York, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics and Center for RNA Biology, School of Medicine and Dentistry, University of Rochester, Rochester, New York, USA
| |
Collapse
|
34
|
van den Berg FT, Ely A, Arbuthnot P. Generating DNA Expression Cassettes Encoding Multimeric Artificial MicroRNA Precursors. Methods Mol Biol 2020; 2115:185-197. [PMID: 32006402 DOI: 10.1007/978-1-0716-0290-4_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
RNA interference (RNAi) is a promising tool for the treatment of chronic viral infection, such as that caused by the hepatitis B virus (HBV). RNAi activators, including expressed primary microRNA (pri-miRNA) mimics, can effectively silence viral gene expression and thereby inhibit viral replication. Here we describe a protocol for the design, generation and functional assessment of cassettes encoding effective single and multimeric pri-miRNA mimics. Artificial miRNAs targeting viral genes can be identified in silico and used to design corresponding pri-miRNA mimics. A two-step generation and TA cloning protocol can be used to produce single mimics, while the strategic use of restriction sites enables concatenation of mimics in a sub-cloning protocol. Basic gene silencing function of pri-miRNA mimics in cell culture can then be assessed using a dual luciferase assay and appropriate minimal targets. The methods described here for the generation of effective pri-miRNA mimics targeting HBV can be applied in the silencing of other viral or endogenous genes.
Collapse
Affiliation(s)
- Fiona T van den Berg
- Wits-SAMRC Antiviral Gene Therapy Research Unit, Department of Molecular Medicine & Haematology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Abdullah Ely
- Wits-SAMRC Antiviral Gene Therapy Research Unit, Department of Molecular Medicine & Haematology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Patrick Arbuthnot
- Wits-SAMRC Antiviral Gene Therapy Research Unit, Department of Molecular Medicine & Haematology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa.
| |
Collapse
|
35
|
Huang L, Zhang H, Deng D, Zhao K, Liu K, Hendrix DA, Mathews DH. LinearFold: linear-time approximate RNA folding by 5'-to-3' dynamic programming and beam search. Bioinformatics 2019; 35:i295-i304. [PMID: 31510672 PMCID: PMC6681470 DOI: 10.1093/bioinformatics/btz375] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION Predicting the secondary structure of an ribonucleic acid (RNA) sequence is useful in many applications. Existing algorithms [based on dynamic programming] suffer from a major limitation: their runtimes scale cubically with the RNA length, and this slowness limits their use in genome-wide applications. RESULTS We present a novel alternative O(n3)-time dynamic programming algorithm for RNA folding that is amenable to heuristics that make it run in O(n) time and O(n) space, while producing a high-quality approximation to the optimal solution. Inspired by incremental parsing for context-free grammars in computational linguistics, our alternative dynamic programming algorithm scans the sequence in a left-to-right (5'-to-3') direction rather than in a bottom-up fashion, which allows us to employ the effective beam pruning heuristic. Our work, though inexact, is the first RNA folding algorithm to achieve linear runtime (and linear space) without imposing constraints on the output structure. Surprisingly, our approximate search results in even higher overall accuracy on a diverse database of sequences with known structures. More interestingly, it leads to significantly more accurate predictions on the longest sequence families in that database (16S and 23S Ribosomal RNAs), as well as improved accuracies for long-range base pairs (500+ nucleotides apart), both of which are well known to be challenging for the current models. AVAILABILITY AND IMPLEMENTATION Our source code is available at https://github.com/LinearFold/LinearFold, and our webserver is at http://linearfold.org (sequence limit: 100 000nt). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liang Huang
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
- Baidu Research USA, Sunnyvale, CA, USA
| | - He Zhang
- Baidu Research USA, Sunnyvale, CA, USA
| | - Dezhong Deng
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
| | - Kai Zhao
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
| | - Kaibo Liu
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
- Baidu Research USA, Sunnyvale, CA, USA
| | - David A Hendrix
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
- Department of Biochemistry & Biophysics, Oregon State University, University of Rochester Medical Center, Rochester, NY, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY, USA
| |
Collapse
|
36
|
Myeloid Cell Leukemia-1 (MCL-1) siRNA Therapy Showed Cytotoxic Effect on T Cells Acute Lymphoblastic Leukemia. INTERNATIONAL JOURNAL OF CANCER MANAGEMENT 2019. [DOI: 10.5812/ijcm.87773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
37
|
Guru Vishnu P, Bhattacharya TK, Bhushan B, Kumar P, Chatterjee RN, Paswan C, Dushyanth K, Divya D, Prasad AR. In silico prediction of short hairpin RNA and in vitro silencing of activin receptor type IIB in chicken embryo fibroblasts by RNA interference. Mol Biol Rep 2019; 46:2947-2959. [PMID: 30879273 DOI: 10.1007/s11033-019-04756-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 03/08/2019] [Indexed: 12/26/2022]
Abstract
Gene silencing by RNA interference is extensively used reverse genetic approach to analyse the implications of any gene in mammalian systems. The silencing of the Activin type IIB receptor belonging to transforming growth factor beta superfamily has demonstrated increase in muscle growth in many species. We designed five short hairpin RNA constructs targeting coding region of chicken ACTRIIB. All the shRNAs were transfected into chicken embryo fibroblast cells and evaluated their silencing efficiency by real time PCR and western blotting. Initially the computational analysis of target region and shRNA constructs was undertaken to predict sequence based features (secondary structures, GC% and H-b index) and thermodynamic features (ΔGoverall, ΔGduplex, ΔGbreak-target, ΔGintra-oligomer, ΔGinter-oligomer and ΔΔGends). We determined that all these predicted features were associated with shRNA efficacy. The invitro analysis of shRNA constructs exhibited significant (P < 0.05) reduction in the levels of ACTRIIB at mRNA and protein level. The knock down efficiency of shRNAs varied significantly (P < 0.001) from 83% (shRNA 1) to 43% (shRNA 5). All the shRNAs up regulated the myogenic pathway associated genes (MyoD and MyoG) significantly (P < 0.05). There was significant (P < 0.05) up-regulation of IFNA, IFNB and MHCII transcripts. The ACTRIIB expression was inversely associated with the expression of myogenic pathway and immune response genes. The anti ACTRIIB shRNA construct 1 and 3 exhibited maximum knock down efficiency with minimal interferon response, and can be used for generating ACTRIIB knockdown chicken with higher muscle mass.
Collapse
Affiliation(s)
- P Guru Vishnu
- Sri Venkateswara Veterinary University, Tirupathi, A.P., India.
| | | | - Bharat Bhushan
- Division of Animal Genetics & Breeding, Indian Veterinary Research Institute, Izatnagar, U.P., India
| | - Pushpendra Kumar
- Division of Animal Genetics & Breeding, Indian Veterinary Research Institute, Izatnagar, U.P., India
| | | | | | - K Dushyanth
- ICAR-Directorate of Poultry Research, Hyderabad, India
| | - D Divya
- ICAR-Directorate of Poultry Research, Hyderabad, India
| | - A Rajendra Prasad
- Division of Animal Genetics & Breeding, Indian Veterinary Research Institute, Izatnagar, U.P., India
| |
Collapse
|
38
|
Bayer LV, Omar OS, Bratu DP, Catrina IE. PinMol: Python application for designing molecular beacons for live cell imaging of endogenous mRNAs. RNA (NEW YORK, N.Y.) 2019; 25:305-318. [PMID: 30573696 PMCID: PMC6380279 DOI: 10.1261/rna.069542.118] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 12/04/2018] [Indexed: 06/09/2023]
Abstract
Molecular beacons are nucleic acid oligomers labeled with a fluorophore and a quencher that fold in a hairpin-shaped structure, which fluoresce only when bound to their target RNA. They are used for the visualization of endogenous mRNAs in live cells. Here, we report a Python program (PinMol) that designs molecular beacons best suited for live cell imaging by using structural information from secondary structures of the target RNA, predicted via energy minimization approaches. PinMol takes into account the accessibility of the targeted regions, as well as the inter- and intramolecular interactions of each selected probe. To demonstrate its applicability, we synthesized an oskar mRNA-specific molecular beacon (osk1236), which is selected by PinMol to target a more accessible region than a manually designed oskar-specific molecular beacon (osk2216). We previously demonstrated osk2216 to be efficient in detecting oskar mRNA in in vivo experiments. Here, we show that osk1236 outperformed osk2216 in live cell imaging experiments.
Collapse
Affiliation(s)
- Livia V Bayer
- Biological Sciences Department, Hunter College, City University of New York, New York, New York 10065, USA
- Program in Molecular, Cellular, and Developmental Biology, The Graduate Center, City University of New York, New York, New York 10016, USA
| | - Omar S Omar
- Biological Sciences Department, Hunter College, City University of New York, New York, New York 10065, USA
- Program in Molecular, Cellular, and Developmental Biology, The Graduate Center, City University of New York, New York, New York 10016, USA
| | - Diana P Bratu
- Biological Sciences Department, Hunter College, City University of New York, New York, New York 10065, USA
- Program in Molecular, Cellular, and Developmental Biology, The Graduate Center, City University of New York, New York, New York 10016, USA
| | - Irina E Catrina
- Biological Sciences Department, Hunter College, City University of New York, New York, New York 10065, USA
| |
Collapse
|
39
|
mRNAs and lncRNAs intrinsically form secondary structures with short end-to-end distances. Nat Commun 2018; 9:4328. [PMID: 30337527 PMCID: PMC6193969 DOI: 10.1038/s41467-018-06792-z] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 09/20/2018] [Indexed: 12/17/2022] Open
Abstract
The 5' and 3' termini of RNA play important roles in many cellular processes. Using Förster resonance energy transfer (FRET), we show that mRNAs and lncRNAs have an intrinsic propensity to fold in the absence of proteins into structures in which the 5' end and 3' end are ≤7 nm apart irrespective of mRNA length. Computational estimates suggest that the inherent proximity of the ends is a universal property of most mRNA and lncRNA sequences. Only guanosine-depleted RNA sequences with low sequence complexity are unstructured and exhibit end-to-end distances expected for the random coil conformation of RNA. While the biological implications remain to be explored, short end-to-end distances could facilitate the binding of protein factors that regulate translation initiation by bridging mRNA 5' and 3' ends. Furthermore, our studies provide the basis for measuring, computing and manipulating end-to-end distances and secondary structure in RNA in research and biotechnology.
Collapse
|
40
|
Antunes D, Jorge NAN, Caffarena ER, Passetti F. Using RNA Sequence and Structure for the Prediction of Riboswitch Aptamer: A Comprehensive Review of Available Software and Tools. Front Genet 2018; 8:231. [PMID: 29403526 PMCID: PMC5780412 DOI: 10.3389/fgene.2017.00231] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 12/21/2017] [Indexed: 12/14/2022] Open
Abstract
RNA molecules are essential players in many fundamental biological processes. Prokaryotes and eukaryotes have distinct RNA classes with specific structural features and functional roles. Computational prediction of protein structures is a research field in which high confidence three-dimensional protein models can be proposed based on the sequence alignment between target and templates. However, to date, only a few approaches have been developed for the computational prediction of RNA structures. Similar to proteins, RNA structures may be altered due to the interaction with various ligands, including proteins, other RNAs, and metabolites. A riboswitch is a molecular mechanism, found in the three kingdoms of life, in which the RNA structure is modified by the binding of a metabolite. It can regulate multiple gene expression mechanisms, such as transcription, translation initiation, and mRNA splicing and processing. Due to their nature, these entities also act on the regulation of gene expression and detection of small metabolites and have the potential to helping in the discovery of new classes of antimicrobial agents. In this review, we describe software and web servers currently available for riboswitch aptamer identification and secondary and tertiary structure prediction, including applications.
Collapse
Affiliation(s)
- Deborah Antunes
- Scientific Computing Program (PROCC), Computational Biophysics and Molecular Modeling Group, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Natasha A N Jorge
- Laboratory of Functional Genomics and Bioinformatics, Oswaldo Cruz Institute, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil.,Laboratory of Gene Expression Regulation, Carlos Chagas Institute, Fundação Oswaldo Cruz, Curitiba, Brazil
| | - Ernesto R Caffarena
- Scientific Computing Program (PROCC), Computational Biophysics and Molecular Modeling Group, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Fabio Passetti
- Laboratory of Functional Genomics and Bioinformatics, Oswaldo Cruz Institute, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil.,Laboratory of Gene Expression Regulation, Carlos Chagas Institute, Fundação Oswaldo Cruz, Curitiba, Brazil
| |
Collapse
|
41
|
Vazquez-Anderson J, Mihailovic MK, Baldridge KC, Reyes KG, Haning K, Cho SH, Amador P, Powell WB, Contreras LM. Optimization of a novel biophysical model using large scale in vivo antisense hybridization data displays improved prediction capabilities of structurally accessible RNA regions. Nucleic Acids Res 2017; 45:5523-5538. [PMID: 28334800 PMCID: PMC5435917 DOI: 10.1093/nar/gkx115] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 02/14/2017] [Indexed: 11/17/2022] Open
Abstract
Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA–RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA–RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA–mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs.
Collapse
Affiliation(s)
- Jorge Vazquez-Anderson
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Mia K Mihailovic
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Kevin C Baldridge
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Kristofer G Reyes
- Department of Operations Research and Financial Engineering, Princeton University, Sherrerd Hall, Charlton St., Princeton, NJ 08544, USA
| | - Katie Haning
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| | - Seung Hee Cho
- Institute for Cellular & Molecular Biology, The University of Texas at Austin, 2500 Speedway, Stop A4800, Austin, TX 78712, USA
| | - Paul Amador
- Institute for Cellular & Molecular Biology, The University of Texas at Austin, 2500 Speedway, Stop A4800, Austin, TX 78712, USA
| | - Warren B Powell
- Department of Operations Research and Financial Engineering, Princeton University, Sherrerd Hall, Charlton St., Princeton, NJ 08544, USA
| | - Lydia M Contreras
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E. Dean Keeton St., Stop C0400, Austin, TX 78712, USA
| |
Collapse
|
42
|
Liu Y, Fu QZ, Pu L, Song LL, Wang YY, Liu J, Wang ZL, Wang ZM. Effect of RNA interference of the expression of HMGA2 on the proliferation and invasion ability of ACHN renal cell carcinoma cells. Mol Med Rep 2017; 16:5107-5112. [PMID: 28849119 PMCID: PMC5647043 DOI: 10.3892/mmr.2017.7258] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2016] [Accepted: 04/04/2017] [Indexed: 11/06/2022] Open
Abstract
This aim of the present study was to observe the effect of high mobility group AT-hook 2 (HMGA2) on the proliferation and invasion ability of ACHN renal cell carcinoma (RCC) cells. Human ACHN cells, an RCC cell line, and HKC normal human renal tubular epithelial cells were cultured. HMGA2 small interfering (si)RNA, Mock-siRNA and their negative control group were designed and synthesized. Subsequently, the ACHN cells were transiently transfected using RNA interference technology. Finally, the mRNA and protein expression levels of HMGA2 were detected using reverse transcription-polymerase chain reaction and western blot analyses. The proliferation ability of the ACHN cells was determined using MTT, and ACHN cell invasion ability was detected using the Transwell method. The results showed that the mRNA and protein expression levels of HMGA2 in the ACHN cells were considerably higher, compared with those in the HKC cells (P<0.01). The RCC cells, in which the expression of HMGA2 was specifically silenced, was successfully constructed. The proliferation rate of cells in the HMGA2-siRNA group was significantly lower, compared with that of cells in the Mock-siRNA group and control group at 24, 48, 72 and 96 h post-transfection (P<0.05). The invasion ability of cells in the HMGA2-siRNA group was significantly lower, compared with that of cells in the Mock-siRNA group and control group (P<0.05) 48 h following transfection. Therefore, the HMGA2 gene may function as an oncogene in the occurrence and development of RCC, and provide specific targets for the targeted therapy of RCC. Further detailed investigations of the HMGA2 gene are important for future gene therapy of RCC.
Collapse
Affiliation(s)
- Ying Liu
- Xi'an Jiaotong University Health Science Center, Xi'an, Shanxi 710049, P.R. China
| | - Qi-Zhong Fu
- Department of Urological Surgery, The Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning 116001, P.R. China
| | - Lin Pu
- Department of Urological Surgery, The Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning 116001, P.R. China
| | - Ling-Ling Song
- Department of Urological Surgery, The Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning 116001, P.R. China
| | - Yi-Yun Wang
- Department of Urological Surgery, The Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning 116001, P.R. China
| | - Jing Liu
- Department of Urological Surgery, The Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning 116001, P.R. China
| | - Zhen-Long Wang
- Department of Urological Surgery, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shanxi 710004, P.R. China
| | - Zi-Ming Wang
- Department of Urological Surgery, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shanxi 710004, P.R. China
| |
Collapse
|
43
|
Mathews DH, Turner DH, Watson RM. RNA Secondary Structure Prediction. ACTA ACUST UNITED AC 2016; 67:11.2.1-11.2.19. [DOI: 10.1002/cpnc.19] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
44
|
Sloma MF, Mathews DH. Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures. RNA (NEW YORK, N.Y.) 2016; 22:1808-1818. [PMID: 27852924 PMCID: PMC5113201 DOI: 10.1261/rna.053694.115] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Accepted: 09/08/2016] [Indexed: 05/10/2023]
Abstract
RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures.
Collapse
Affiliation(s)
- Michael F Sloma
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - David H Mathews
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| |
Collapse
|
45
|
Angart PA, Carlson RJ, Adu-Berchie K, Walton SP. Terminal Duplex Stability and Nucleotide Identity Differentially Control siRNA Loading and Activity in RNA Interference. Nucleic Acid Ther 2016; 26:309-317. [PMID: 27399870 PMCID: PMC5067871 DOI: 10.1089/nat.2016.0612] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 05/06/2016] [Indexed: 01/17/2023] Open
Abstract
Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5' terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5' terminus (Nucleotides: 1-2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)-specific activity was found to be improved by lower hybridization stability in the 5' terminus (Nucleotides: 3-4) of the loaded siRNA strand and greater hybridization stability toward the 3' terminus (Nucleotides: 17-18). Concomitantly, specific recognition of the 5' terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand.
Collapse
Affiliation(s)
- Phillip A Angart
- Department of Chemical Engineering and Materials Science, Michigan State University , East Lansing, Michigan
| | - Rebecca J Carlson
- Department of Chemical Engineering and Materials Science, Michigan State University , East Lansing, Michigan
| | - Kwasi Adu-Berchie
- Department of Chemical Engineering and Materials Science, Michigan State University , East Lansing, Michigan
| | - S Patrick Walton
- Department of Chemical Engineering and Materials Science, Michigan State University , East Lansing, Michigan
| |
Collapse
|
46
|
Abstract
RNA-RNA binding is a required step for many regulatory and catalytic processes in the cell. Identifying RNA-RNA hybridization sites is challenging because of the competition between intramolecular and intermolecular structure formation. A complete picture of RNA-RNA binding includes an understanding of single-stranded folding and binding site accessibility, and is strongly concentration-dependent. This chapter provides guidance for using RNAstructure to predict RNA-RNA binding sites and RNA-RNA structures, utilizing free energy minimization and partition function calculations. RNAstructure is freely available at http://rna.urmc.rochester.edu/RNAstructure.html .
Collapse
Affiliation(s)
- Laura DiChiacchio
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA
- Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA.
- Center for RNA Biology, University of Rochester Medical Center, 601 Elmwood Avenue, 712, Rochester, NY, 14642, USA.
| |
Collapse
|
47
|
DiChiacchio L, Sloma MF, Mathews DH. AccessFold: predicting RNA-RNA interactions with consideration for competing self-structure. Bioinformatics 2015; 32:1033-9. [PMID: 26589271 DOI: 10.1093/bioinformatics/btv682] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 11/15/2015] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION There are numerous examples of RNA-RNA complexes, including microRNA-mRNA and small RNA-mRNA duplexes for regulation of translation, guide RNA interactions with target RNA for post-transcriptional modification and small nuclear RNA duplexes for splicing. Predicting the base pairs formed between two interacting sequences remains difficult, at least in part because of the competition between unimolecular and bimolecular structure. RESULTS Two algorithms were developed for improved prediction of bimolecular RNA structure that consider the competition between self-structure and bimolecular structure. These algorithms utilize two novel approaches to evaluate accessibility: free energy density minimization and pseudo-energy minimization. Free energy density minimization minimizes the folding free energy change per nucleotide involved in an intermolecular secondary structure. Pseudo-energy minimization (called AccessFold) minimizes the sum of free energy change and a pseudo-free energy penalty for bimolecular pairing of nucleotides that are unlikely to be accessible for bimolecular structure. The pseudo-free energy, derived from unimolecular pairing probabilities, is applied per nucleotide in bimolecular pairs, and this approach is able to predict binding sites that are split by unimolecular structures. A benchmark set of 17 bimolecular RNA structures was assembled to assess structure prediction. Pseudo-energy minimization provides a statistically significant improvement in sensitivity over the method that was found in a benchmark to be the most accurate previously available method, with an improvement from 36.8% to 57.8% in mean sensitivity for base pair prediction. AVAILABILITY AND IMPLEMENTATION Pseudo-energy minimization is available for download as AccessFold, under an open-source license and as part of the RNAstructure package, at: http://rna.urmc.rochester.edu/RNAstructure.html CONTACT david_mathews@urmc.rochester.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Laura DiChiacchio
- Department of Biochemistry and Biophysics and Center for RNA Biology and
| | - Michael F Sloma
- Department of Biochemistry and Biophysics and Center for RNA Biology and
| | - David H Mathews
- Department of Biochemistry and Biophysics and Center for RNA Biology and Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY 14642, USA
| |
Collapse
|
48
|
Sloma MF, Mathews DH. Improving RNA secondary structure prediction with structure mapping data. Methods Enzymol 2015; 553:91-114. [PMID: 25726462 DOI: 10.1016/bs.mie.2014.10.053] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Methods to probe RNA secondary structure, such as small molecule modifying agents, secondary structure-specific nucleases, inline probing, and SHAPE chemistry, are widely used to study the structure of functional RNA. Computational secondary structure prediction programs can incorporate probing data to predict structure with high accuracy. In this chapter, an overview of current methods for probing RNA secondary structure is provided, including modern high-throughput methods. Methods for guiding secondary structure prediction algorithms using these data are explained, and best practices for using these data are provided. This chapter concludes by listing a number of open questions about how to best use probing data, and what these data can provide.
Collapse
Affiliation(s)
- Michael F Sloma
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Box 712, Rochester, New York, USA; Center for RNA Biology, University of Rochester Medical Center, Box 712, Rochester, New York, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Box 712, Rochester, New York, USA; Center for RNA Biology, University of Rochester Medical Center, Box 712, Rochester, New York, USA.
| |
Collapse
|
49
|
Abstract
RNA-binding proteins (RBPs) are important regulators of eukaryotic gene expression. Genomes typically encode dozens to hundreds of proteins containing RNA-binding domains, which collectively recognize diverse RNA sequences and structures. Recent advances in high-throughput methods for assaying the targets of RBPs in vitro and in vivo allow large-scale derivation of RNA-binding motifs as well as determination of RNA–protein interactions in living cells. In parallel, many computational methods have been developed to analyze and interpret these data. The interplay between RNA secondary structure and RBP binding has also been a growing theme. Integrating RNA–protein interaction data with observations of post-transcriptional regulation will enhance our understanding of the roles of these important proteins.
Collapse
|
50
|
Montes C, Castro Á, Barba P, Rubio J, Sánchez E, Carvajal D, Aguirre C, Tapia E, DelÍ Orto P, Decroocq V, Prieto H. Differential RNAi responses of Nicotiana benthamiana individuals transformed with a hairpin-inducing construct during Plum pox virus challenge. Virus Genes 2014; 49:325-38. [PMID: 24964777 DOI: 10.1007/s11262-014-1093-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Accepted: 05/30/2014] [Indexed: 10/25/2022]
Abstract
Gene silencing and large-scale small RNA analysis can be used to develop RNA interference (RNAi)-based resistance strategies for Plum pox virus (PPV), a high impact disease of Prunus spp. In this study, a pPPViRNA hairpin-inducing vector harboring two silencing motif-rich regions of the PPV coat protein (CP) gene was evaluated in transgenic Nicotiana benthamiana (NB) plants. Wild-type NB plants infected with a chimeric PPV virus (PPV::GFP) exhibited affected leaves with mosaic chlorosis congruent to GFP fluorescence at 21 day post-inoculation; transgenic lines depicted a range of phenotypes from fully resistant to susceptible. ELISA values and GFP fluorescence intensities were used to select transgenic-resistant (TG-R) and transgenic-susceptible (TG-S) lines for further characterization of small interfering RNAs (siRNAs) by large-scale small RNA sequencing. In infected TG-S and untransformed (WT) plants, the observed siRNAs were nearly exclusively 21- and 22-nt siRNAs that targeted the whole PPV::GFP genome; 24-nt siRNAs were absent in these individuals. Challenged TG-R plants accumulated a full set of 21- to 24-nt siRNAs that were primarily associated with the selected motif-rich regions, indicating that a trans-acting siRNAs process prevented viral multiplication. BLAST analysis identified 13 common siRNA clusters targeting the CP gene. 21-nt siRNA sequences were associated with the 22-nt siRNAs and the scarce 23- and 24-nt molecules in TG-S plants and with most of the observed 22-, 23-, and 24-nt siRNAs in TG-R individuals. These results validate the use of a multi-hot spot silencing vector against PPV and elucidate the molecules by which hairpin-inducing vectors initiate RNAi in vivo.
Collapse
Affiliation(s)
- Christian Montes
- Biotechnology Laboratory, Instituto de Investigaciones Agropecuarias, La Platina Research Station, Avenida Santa Rosa 11610, La Pintana, 8831314, Santiago, Chile
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|