1
|
Hara K, Iwano N, Fukunaga T, Hamada M. DeepRaccess: high-speed RNA accessibility prediction using deep learning. FRONTIERS IN BIOINFORMATICS 2023; 3:1275787. [PMID: 37881622 PMCID: PMC10597636 DOI: 10.3389/fbinf.2023.1275787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 09/29/2023] [Indexed: 10/27/2023] Open
Abstract
RNA accessibility is a useful RNA secondary structural feature for predicting RNA-RNA interactions and translation efficiency in prokaryotes. However, conventional accessibility calculation tools, such as Raccess, are computationally expensive and require considerable computational time to perform transcriptome-scale analysis. In this study, we developed DeepRaccess, which predicts RNA accessibility based on deep learning methods. DeepRaccess was trained to take artificial RNA sequences as input and to predict the accessibility of these sequences as calculated by Raccess. Simulation and empirical dataset analyses showed that the accessibility predicted by DeepRaccess was highly correlated with the accessibility calculated by Raccess. In addition, we confirmed that DeepRaccess could predict protein abundance in E.coli with moderate accuracy from the sequences around the start codon. We also demonstrated that DeepRaccess achieved tens to hundreds of times software speed-up in a GPU environment. The source codes and the trained models of DeepRaccess are freely available at https://github.com/hmdlab/DeepRaccess.
Collapse
Affiliation(s)
- Kaisei Hara
- Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory, AIST-Waseda University, Tokyo, Japan
| | - Natsuki Iwano
- Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Tokyo, Japan
| | - Michiaki Hamada
- Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory, AIST-Waseda University, Tokyo, Japan
- Graduate School of Medicine, Nippon Medical School, Tokyo, Japan
| |
Collapse
|
2
|
RNA Secondary Structure Alteration Caused by Single Nucleotide Variants. Methods Mol Biol 2023; 2586:107-120. [PMID: 36705901 DOI: 10.1007/978-1-0716-2768-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
A point mutation in coding RNA can cause not only an amino acid substitution but also a dynamic change of RNA secondary structure, leading to a dysfunctional RNA. Although in silico structure prediction has been used to detect structure-disrupting point mutations known as riboSNitches, exhaustive simulation of long RNAs is needed to detect a significant enrichment or depletion of riboSNitches in functional RNAs. Here, we have developed a novel algorithm Radiam (RNA secondary structure Analysis with Deletion, Insertion, And substitution Mutations) for a comprehensive riboSNitch analysis of long RNAs. Radiam is based on the ParasoR framework, which efficiently computes local RNA secondary structures for long RNAs. ParasoR can compute a variety of structure scores over globally consistent structures with maximal span constraints for the base pair distance. Using the reusable structure database made by ParasoR, Radiam performs an efficient recomputation of the secondary structures for mutated sequences. An exhaustive simulation of Radiam is expected to find reliable riboSNitch candidates on long RNAs by evaluating their statistical significance in terms of the change of local structure stability.
Collapse
|
3
|
Ono Y, Asai K. Rtools: A Web Server for Various Secondary Structural Analyses on Single RNA Sequences. Methods Mol Biol 2023; 2586:1-14. [PMID: 36705895 DOI: 10.1007/978-1-0716-2768-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Predicting the secondary structures of RNA molecules is an essential step to characterize their functions, but the thermodynamic probability of any prediction is generally small. On the other hand, there are a few tools for calculating and visualizing various secondary structural information from RNA sequences. We implemented a web server that calculates in parallel various features of secondary structures: different types of secondary structure predictions, the marginal probabilities for local structural contexts, accessibilities of the subsequences, the energy changes by arbitrary base mutations, and the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp , which integrates software tools, CentroidFold, CentroidHomfold, IPknot, CapR, Raccess, Rchange, RintD, and RintW.
Collapse
Affiliation(s)
- Yukiteru Ono
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | - Kiyoshi Asai
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan.
| |
Collapse
|
4
|
Zhao YH, Zhou T, Wang JX, Li Y, Fang MF, Liu JN, Li ZH. Evolution and structural variations in chloroplast tRNAs in gymnosperms. BMC Genomics 2021; 22:750. [PMID: 34663228 PMCID: PMC8524817 DOI: 10.1186/s12864-021-08058-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 10/06/2021] [Indexed: 11/22/2022] Open
Abstract
Background Chloroplast transfer RNAs (tRNAs) can participate in various vital processes. Gymnosperms have important ecological and economic value, and they are the dominant species in forest ecosystems in the Northern Hemisphere. However, the evolution and structural changes in chloroplast tRNAs in gymnosperms remain largely unclear. Results In this study, we determined the nucleotide evolution, phylogenetic relationships, and structural variations in 1779 chloroplast tRNAs in gymnosperms. The numbers and types of tRNA genes present in the chloroplast genomes of different gymnosperms did not differ greatly, where the average number of tRNAs was 33 and the frequencies of occurrence for various types of tRNAs were generally consistent. Nearly half of the anticodons were absent. Molecular sequence variation analysis identified the conserved secondary structures of tRNAs. About a quarter of the tRNA genes were found to contain precoded 3′ CCA tails. A few tRNAs have undergone novel structural changes that are closely related to their minimum free energy, and these structural changes affect the stability of the tRNAs. Phylogenetic analysis showed that tRNAs have evolved from multiple common ancestors. The transition rate was higher than the transversion rate in gymnosperm chloroplast tRNAs. More loss events than duplication events have occurred in gymnosperm chloroplast tRNAs during their evolutionary process. Conclusions These findings provide novel insights into the molecular evolution and biological characteristics of chloroplast tRNAs in gymnosperms. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-08058-3.
Collapse
Affiliation(s)
- Yu-He Zhao
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an, 710069, China
| | - Tong Zhou
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an, 710069, China
| | - Jiu-Xia Wang
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an, 710069, China
| | - Yan Li
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an, 710069, China
| | - Min-Feng Fang
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an, 710069, China
| | - Jian-Ni Liu
- State Key Laboratory of Continental Dynamics, Department of Geology, Early Life Institute, Northwest University, Xi'an, 710069, China
| | - Zhong-Hu Li
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an, 710069, China.
| |
Collapse
|
5
|
Shatoff E, Bundschuh R. Single nucleotide polymorphisms affect RNA-protein interactions at a distance through modulation of RNA secondary structures. PLoS Comput Biol 2020; 16:e1007852. [PMID: 32379750 PMCID: PMC7237046 DOI: 10.1371/journal.pcbi.1007852] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2019] [Revised: 05/19/2020] [Accepted: 04/06/2020] [Indexed: 11/19/2022] Open
Abstract
Single nucleotide polymorphisms are widely associated with disease, but the ways in which they cause altered phenotypes are often unclear, especially when they appear in non-coding regions. One way in which non-coding polymorphisms could cause disease is by affecting crucial RNA-protein interactions. While it is clear that changing a protein binding motif will alter protein binding, it has been shown that single nucleotide polymorphisms can affect RNA secondary structure, and here we show that single nucleotide polymorphisms can affect RNA-protein interactions from outside binding motifs through altered RNA secondary structure. By using a modified version of the Vienna Package and PAR-CLIP data for HuR (ELAVL1) in humans we characterize the genome-wide effect of single nucleotide polymorphisms on HuR binding and show that they can have a many-fold effect on the affinity of HuR binding to RNA transcripts from tens of bases away. We also find some evidence that the effect of single nucleotide polymorphisms on protein binding might be under selection, with the non-reference alleles tending to make it harder for a protein to bind.
Collapse
Affiliation(s)
- Elan Shatoff
- Department of Physics, The Ohio State University, Columbus, Ohio, United States of America
- Center for RNA Biology, The Ohio State University, Columbus, Ohio, United States of America
| | - Ralf Bundschuh
- Department of Physics, The Ohio State University, Columbus, Ohio, United States of America
- Center for RNA Biology, The Ohio State University, Columbus, Ohio, United States of America
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio, United States of America
- Division of Hematology, Department of Internal Medicine, The Ohio State University, Columbus, Ohio, United States of America
| |
Collapse
|
6
|
Behloul N, Wei W, Baha S, Liu Z, Wen J, Meng J. Effects of mRNA secondary structure on the expression of HEV ORF2 proteins in Escherichia coli. Microb Cell Fact 2017; 16:200. [PMID: 29137642 PMCID: PMC5686824 DOI: 10.1186/s12934-017-0812-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 11/06/2017] [Indexed: 12/21/2022] Open
Abstract
Background Viral protein expression in Escherichia coli (E. coli) is a powerful tool for structural/functional studies as well as for vaccine and diagnostics development. However, numerous factors such as codon bias, mRNA secondary structure and nucleotides distribution, have been indentified to hamper this heterologous expression. Results In this study, we combined computational and biochemical methods to analyze the influence of these factors on the expression of different segments of hepatitis E virus (HEV) ORF 2 protein and hepatitis B virus surface antigen (HBsAg). Three out of five HEV antigens were expressed while all three HBsAg fragments were not. The computational analysis revealed a significant difference in nucleotide distribution between expressed and non-expressed genes; and all these non-expressing constructs shared similar stable 5′-end mRNA secondary structures that affected the accessibility of both Shine-Dalgarno (SD) sequence and start codon AUG. By modifying the 5′-end of HEV and HBV non-expressed genes, there was a significant increase in the total free energy of the mRNA secondary structures that permitted the exposure of the SD sequence and the start codon, which in turn, led to the successful expression of these genes in E. coli. Conclusions This study demonstrates that the mRNA secondary structure near the start codon is the key limiting factor for an efficient expression of HEV ORF2 proteins in E. coli. It describes also a simple and effective strategy for the production of viral proteins of different lengths for immunogenicity/antigenicity comparative studies during vaccine and diagnostics development. Electronic supplementary material The online version of this article (10.1186/s12934-017-0812-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nouredine Behloul
- Department of Microbiology and Immunology, School of Medicine, Southeast University, 87 DingJiaQiao Road, Nanjing, 210009, Jiangsu, China
| | - Wenjuan Wei
- Department of Microbiology and Immunology, School of Medicine, Southeast University, 87 DingJiaQiao Road, Nanjing, 210009, Jiangsu, China
| | - Sarra Baha
- Department of Microbiology and Immunology, School of Medicine, Southeast University, 87 DingJiaQiao Road, Nanjing, 210009, Jiangsu, China
| | - Zhenzhen Liu
- Department of Microbiology and Immunology, School of Medicine, Southeast University, 87 DingJiaQiao Road, Nanjing, 210009, Jiangsu, China
| | - Jiyue Wen
- Department of Pharmacology, Anhui Medical University, Hefei, 230032, China
| | - Jihong Meng
- Department of Microbiology and Immunology, School of Medicine, Southeast University, 87 DingJiaQiao Road, Nanjing, 210009, Jiangsu, China.
| |
Collapse
|
7
|
Hamada M. In silico approaches to RNA aptamer design. Biochimie 2017; 145:8-14. [PMID: 29032056 DOI: 10.1016/j.biochi.2017.10.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 10/09/2017] [Indexed: 10/18/2022]
Abstract
RNA aptamers are ribonucleic acids that bind to specific target molecules. An RNA aptamer for a disease-related protein has great potential for development into a new drug. However, huge time and cost investments are required to develop an RNA aptamer into a pharmaceutical. Recently, SELEX combined with high-throughput sequencers (i.e., HT-SELEX) has been widely used to select candidate RNA aptamers that bind to a target protein with high affinity and specificity. After candidate selection, further optimizations such as shortening and modifying candidate sequences are performed. In these steps, in silico approaches are expected to reduce the time and cost associated with aptamer drug development. In this article, we review existing in silico approaches to RNA aptamer development, including a method for ranking the candidates of RNA aptamers from HT-SELEX data, clustering a huge number of aptamer sequences, and finding motifs amidst a set of significant RNA aptamers. It is expected that further studies in addition to these methods will be utilized for in silico RNA aptamer design, permitting a minimal number of experiments to be performed through the utilization of sophisticated computational methods.
Collapse
Affiliation(s)
- Michiaki Hamada
- Bioinformatics Laboratory, Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan; Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 63-520, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan; Institute for Medical-oriented Structural Biology, Waseda University, 2-2, Wakamatsu-cho Shinjuku-ku, Tokyo 162-8480, Japan; Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26, Aomi, Koto-ku, Tokyo 135-0064, Japan; Graduate School of Medicine, Nippon Medical School, 1-1-5, Sendagi, Bunkyo-ku, Tokyo 113-8602, Japan.
| |
Collapse
|
8
|
Hamada M, Ono Y, Kiryu H, Sato K, Kato Y, Fukunaga T, Mori R, Asai K. Rtools: a web server for various secondary structural analyses on single RNA sequences. Nucleic Acids Res 2016; 44:W302-7. [PMID: 27131356 PMCID: PMC4987903 DOI: 10.1093/nar/gkw337] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2016] [Accepted: 04/15/2016] [Indexed: 11/12/2022] Open
Abstract
The secondary structures, as well as the nucleotide sequences, are the important features of RNA molecules to characterize their functions. According to the thermodynamic model, however, the probability of any secondary structure is very small. As a consequence, any tool to predict the secondary structures of RNAs has limited accuracy. On the other hand, there are a few tools to compensate the imperfect predictions by calculating and visualizing the secondary structural information from RNA sequences. It is desirable to obtain the rich information from those tools through a friendly interface. We implemented a web server of the tools to predict secondary structures and to calculate various structural features based on the energy models of secondary structures. By just giving an RNA sequence to the web server, the user can get the different types of solutions of the secondary structures, the marginal probabilities such as base-paring probabilities, loop probabilities and accessibilities of the local bases, the energy changes by arbitrary base mutations as well as the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp, which integrates software tools, CentroidFold, CentroidHomfold, IPKnot, CapR, Raccess, Rchange and RintD.
Collapse
Affiliation(s)
- Michiaki Hamada
- Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, 135-0064 Tokyo, Japan
| | - Yukiteru Ono
- IMSBIO Co., Ltd, 4-21-1-601 Higashi-Ikebukuro, Toshima-ku, Tokyo 170-0013, Japan
| | - Hisanori Kiryu
- Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8562, Japan
| | - Kengo Sato
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan
| | - Yuki Kato
- Center for iPS Cell Research and Application (CiRA), Kyoto University, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan
| | - Tsukasa Fukunaga
- Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8562, Japan
| | - Ryota Mori
- Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8562, Japan
| | - Kiyoshi Asai
- Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, 135-0064 Tokyo, Japan Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8562, Japan
| |
Collapse
|
9
|
Bioinformatics tools for lncRNA research. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2016; 1859:23-30. [DOI: 10.1016/j.bbagrm.2015.07.014] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 07/07/2015] [Accepted: 07/14/2015] [Indexed: 12/28/2022]
|
10
|
Pei S, Anthony JS, Meyer MM. Sampled ensemble neutrality as a feature to classify potential structured RNAs. BMC Genomics 2015; 16:35. [PMID: 25649229 PMCID: PMC4333902 DOI: 10.1186/s12864-014-1203-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2014] [Accepted: 12/22/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely heavily on identification of sequences sharing a common stable secondary structure. However, correctly distinguishing structured RNAs from surrounding genomic sequence remains challenging, especially during de novo discovery. RNA also has a long history as a computational model for evolution due to the direct link between genotype (sequence) and phenotype (structure). From these studies it is clear that evolved RNA structures, like protein structures, can be considered robust to point mutations. In this context, an RNA sequence is considered robust if its neutrality (extent to which single mutant neighbors maintain the same secondary structure) is greater than that expected for an artificial sequence with the same minimum free energy structure. RESULTS In this work, we bring concepts from evolutionary biology to bear on the structured RNA de novo discovery process. We hypothesize that alignments corresponding to structured RNAs should consist of neutral sequences. We evaluate several measures of neutrality for their ability to distinguish between alignments of structured RNA sequences drawn from Rfam and various decoy alignments. We also introduce a new measure of RNA structural neutrality, the structure ensemble neutrality (SEN). SEN seeks to increase the biological relevance of existing neutrality measures in two ways. First, it uses information from an alignment of homologous sequences to identify a conserved biologically relevant structure for comparison. Second, it only counts base-pairs of the original structure that are absent in the comparison structure and does not penalize the formation of additional base-pairs. CONCLUSION We find that several measures of neutrality are effective at separating structured RNAs from decoy sequences, including both shuffled alignments and flanking genomic sequence. Furthermore, as an independent feature classifier to identify structured RNAs, SEN yields comparable performance to current approaches that consider a variety of features including stability and sequence identity. Finally, SEN outperforms other measures of neutrality at detecting mutational robustness in bacterial regulatory RNA structures.
Collapse
Affiliation(s)
- Shermin Pei
- Boston College, 140 Commonwealth Ave., Chestnut Hill, 02467, MA, USA.
| | - Jon S Anthony
- Boston College, 140 Commonwealth Ave., Chestnut Hill, 02467, MA, USA.
| | - Michelle M Meyer
- Boston College, 140 Commonwealth Ave., Chestnut Hill, 02467, MA, USA.
| |
Collapse
|
11
|
Li MJ, Wang J. Current trend of annotating single nucleotide variation in humans--A case study on SNVrap. Methods 2014; 79-80:32-40. [PMID: 25308971 DOI: 10.1016/j.ymeth.2014.10.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Revised: 09/25/2014] [Accepted: 10/02/2014] [Indexed: 12/16/2022] Open
Abstract
As high throughput methods, such as whole genome genotyping arrays, whole exome sequencing (WES) and whole genome sequencing (WGS), have detected huge amounts of genetic variants associated with human diseases, function annotation of these variants is an indispensable step in understanding disease etiology. Large-scale functional genomics projects, such as The ENCODE Project and Roadmap Epigenomics Project, provide genome-wide profiling of functional elements across different human cell types and tissues. With the urgent demands for identification of disease-causal variants, comprehensive and easy-to-use annotation tool is highly in demand. Here we review and discuss current progress and trend of the variant annotation field. Furthermore, we introduce a comprehensive web portal for annotating human genetic variants. We use gene-based features and the latest functional genomics datasets to annotate single nucleotide variation (SNVs) in human, at whole genome scale. We further apply several function prediction algorithms to annotate SNVs that might affect different biological processes, including transcriptional gene regulation, alternative splicing, post-transcriptional regulation, translation and post-translational modifications. The SNVrap web portal is freely available at http://jjwanglab.org/snvrap.
Collapse
Affiliation(s)
- Mulin Jun Li
- Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, China
| | - Junwen Wang
- Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, China.
| |
Collapse
|
12
|
Bao G, Dong H, Zhu Y, Mao S, Zhang T, Zhang Y, Chen Z, Li Y. Comparative genomic and proteomic analyses of Clostridium acetobutylicum Rh8 and its parent strain DSM 1731 revealed new understandings on butanol tolerance. Biochem Biophys Res Commun 2014; 450:1612-8. [DOI: 10.1016/j.bbrc.2014.07.052] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Accepted: 07/09/2014] [Indexed: 10/25/2022]
|
13
|
Sabarinathan R, Tafer H, Seemann SE, Hofacker IL, Stadler PF, Gorodkin J. RNAsnp: efficient detection of local RNA secondary structure changes induced by SNPs. Hum Mutat 2013; 34:546-56. [PMID: 23315997 PMCID: PMC3708107 DOI: 10.1002/humu.22273] [Citation(s) in RCA: 105] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2012] [Accepted: 12/18/2012] [Indexed: 02/05/2023]
Abstract
Structural characteristics are essential for the functioning of many noncoding RNAs and cis-regulatory elements of mRNAs. SNPs may disrupt these structures, interfere with their molecular function, and hence cause a phenotypic effect. RNA folding algorithms can provide detailed insights into structural effects of SNPs. The global measures employed so far suffer from limited accuracy of folding programs on large RNAs and are computationally too demanding for genome-wide applications. Here, we present a strategy that focuses on the local regions of maximal structural change between mutant and wild-type. These local regions are approximated in a “screening mode” that is intended for genome-wide applications. Furthermore, localized regions are identified as those with maximal discrepancy. The mutation effects are quantified in terms of empirical P values. To this end, the RNAsnp software uses extensive precomputed tables of the distribution of SNP effects as function of length and GC content. RNAsnp thus achieves both a noise reduction and speed-up of several orders of magnitude over shuffling-based approaches. On a data set comprising 501 SNPs associated with human-inherited diseases, we predict 54 to have significant local structural effect in the untranslated region of mRNAs. RNAsnp is available at http://rth.dk/resources/rnasnp.
Collapse
|
14
|
Sükösd Z, Knudsen B, Anderson JWJ, Novák A, Kjems J, Pedersen CNS. Characterising RNA secondary structure space using information entropy. BMC Bioinformatics 2013; 14 Suppl 2:S22. [PMID: 23368905 PMCID: PMC3549843 DOI: 10.1186/1471-2105-14-s2-s22] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Comparative methods for RNA secondary structure prediction use evolutionary information from RNA alignments to increase prediction accuracy. The model is often described in terms of stochastic context-free grammars (SCFGs), which generate a probability distribution over secondary structures. It is, however, unclear how this probability distribution changes as a function of the input alignment. As prediction programs typically only return a single secondary structure, better characterisation of the underlying probability space of RNA secondary structures is of great interest. In this work, we show how to efficiently compute the information entropy of the probability distribution over RNA secondary structures produced for RNA alignments by a phylo-SCFG, and implement it for the PPfold model. We also discuss interpretations and applications of this quantity, including how it can clarify reasons for low prediction reliability scores. PPfold and its source code are available from http://birc.au.dk/software/ppfold/.
Collapse
Affiliation(s)
- Zsuzsanna Sükösd
- Bioinformatics Research Center, Aarhus University, Aarhus, Denmark.
| | | | | | | | | | | |
Collapse
|
15
|
Basu M, Das T, Ghosh A, Majumder S, Maji AK, Kanjilal SD, Mukhopadhyay I, Roychowdhury S, Banerjee S, Sengupta S. Gene-gene interaction and functional impact of polymorphisms on innate immune genes in controlling Plasmodium falciparum blood infection level. PLoS One 2012; 7:e46441. [PMID: 23071570 PMCID: PMC3470565 DOI: 10.1371/journal.pone.0046441] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2012] [Accepted: 08/30/2012] [Indexed: 12/19/2022] Open
Abstract
Genetic variations in toll-like receptors and cytokine genes of the innate immune pathways have been implicated in controlling parasite growth and the pathogenesis of Plasmodium falciparum mediated malaria. We previously published genetic association of TLR4 non-synonymous and TNF-α promoter polymorphisms with P.falciparum blood infection level and here we extend the study considerably by (i) investigating genetic dependence of parasite-load on interleukin-12B polymorphisms, (ii) reconstructing gene-gene interactions among candidate TLRs and cytokine loci, (iii) exploring genetic and functional impact of epistatic models and (iv) providing mechanistic insights into functionality of disease-associated regulatory polymorphisms. Our data revealed that carriage of AA (P = 0.0001) and AC (P = 0.01) genotypes of IL12B 3′UTR polymorphism was associated with a significant increase of mean log-parasitemia relative to rare homozygous genotype CC. Presence of IL12B+1188 polymorphism in five of six multifactor models reinforced its strong genetic impact on malaria phenotype. Elevation of genetic risk in two-component models compared to the corresponding single locus and reduction of IL12B (2.2 fold) and lymphotoxin-α (1.7 fold) expressions in patients'peripheral-blood-mononuclear-cells under TLR4Thr399Ile risk genotype background substantiated the role of Multifactor Dimensionality Reduction derived models. Marked reduction of promoter activity of TNF-α risk haplotype (C-C-G-G) compared to wild-type haplotype (T-C-G-G) with (84%) and without (78%) LPS stimulation and the loss of binding of transcription factors detected in-silico supported a causal role of TNF-1031. Significantly lower expression of IL12B+1188 AA (5 fold) and AC (9 fold) genotypes compared to CC and under-representation (P = 0.0048) of allele A in transcripts of patients' PBMCs suggested an Allele-Expression-Imbalance. Allele (A+1188C) dependent differential stability (2 fold) of IL12B-transcripts upon actinomycin-D treatment and observed structural modulation (P = 0.013) of RNA-ensemble were the plausible explanations for AEI. In conclusion, our data provides functional support to the hypothesis that de-regulated receptor-cytokine axis of innate immune pathway influences blood infection level in P. falciparum malaria.
Collapse
Affiliation(s)
- Madhumita Basu
- Department of Biochemistry, University of Calcutta, Kolkata, West Bengal, India
| | - Tania Das
- Cancer & Cell Biology Division, Indian Institute of Chemical Biology, Kolkata, West Bengal, India
| | - Alip Ghosh
- Centre for Liver Research, The Institute of Post-Graduate Medical Education & Research, Kolkata, West Bengal, India
| | - Subhadipa Majumder
- Department of Biochemistry, University of Calcutta, Kolkata, West Bengal, India
| | - Ardhendu Kumar Maji
- Department of Protozoology, The Calcutta School of Tropical Medicine, Kolkata, West Bengal, India
| | - Sumana Datta Kanjilal
- Department of Pediatric Medicine, Calcutta National Medical College, Kolkata, West Bengal, India
| | | | - Susanta Roychowdhury
- Cancer & Cell Biology Division, Indian Institute of Chemical Biology, Kolkata, West Bengal, India
| | - Soma Banerjee
- Centre for Liver Research, The Institute of Post-Graduate Medical Education & Research, Kolkata, West Bengal, India
| | - Sanghamitra Sengupta
- Department of Biochemistry, University of Calcutta, Kolkata, West Bengal, India
- * E-mail:
| |
Collapse
|