1
|
Kirven K, Bevilacqua P, Assmann S. VariantFoldRNA: a flexible, containerized, and scalable pipeline for genome-wide riboSNitch prediction. NAR Genom Bioinform 2025; 7:lqaf066. [PMID: 40443739 PMCID: PMC12121482 DOI: 10.1093/nargab/lqaf066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2024] [Revised: 03/05/2025] [Accepted: 05/15/2025] [Indexed: 06/02/2025] Open
Abstract
Single nucleotide polymorphisms (SNPs) can alter RNA structure by changing the proportions of existing conformations or leading to new conformations in the structural ensemble. Such structure-changing SNPs, or riboSNitches, have been associated with diseases in humans and climate adaptation in plants. While several computational tools are available for predicting whether an SNP is a riboSNitch, these tools were generally developed to analyze individual RNAs and are not optimized for genome-wide analyses. To fill this gap, we developed VariantFoldRNA, a flexible, containerized, and automated pipeline for genome-wide prediction of riboSNitches. Our pipeline automatically installs all dependencies, can be run locally or on high-performance clusters, and is modular, enabling the user to customize the analysis for the research question of interest. VariantFoldRNA can predict riboSNitches genome-wide at user-specified temperatures and splicing conditions, opening the door to novel analyses. The pipeline is an open-source command-line tool that is freely available at https://github.com/The-Bevilacqua-Lab/variantfoldrna.
Collapse
Affiliation(s)
- Kobie J Kirven
- Graduate Program in Bioinformatics and Genomics, Pennsylvania State University, University Park, PA 16802, United States
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, United States
| | - Philip C Bevilacqua
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, United States
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, United States
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, United States
| | - Sarah M Assmann
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, United States
- Department of Biology, Pennsylvania State University, University Park, PA 16802, United States
| |
Collapse
|
2
|
Zhu T, Chu Y, Niu G, Pan R, Chen M, Cheng Y, Zhang Y, Li Z, Jiang S, Hao L, Zou D, Xu T, Zhang Z. Editome Disease Knowledgebase v2.0: an updated resource of editome-disease associations through literature curation and integrative analysis. BIOINFORMATICS ADVANCES 2025; 5:vbaf012. [PMID: 39968378 PMCID: PMC11835235 DOI: 10.1093/bioadv/vbaf012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2024] [Revised: 01/09/2025] [Accepted: 01/23/2025] [Indexed: 02/20/2025]
Abstract
Motivation Editome Disease Knowledgebase (EDK) is a curated resource of knowledge between RNA editome and human diseases. Since its first release in 2018, a number of studies have discovered previously uncharacterized editome-disease associations and generated an abundance of RNA editing datasets. Thus, it is desirable to make significant updates for EDK by incorporating more editome-disease associations as well as their related editing profiles. Results Here, we present EDK v2.0, an updated version of editome-disease associations based on both literature curation and integrative analysis. EDK v2.0 incorporates a curated collection of 1097 editome-disease associations involving 115 diseases from 321 publications. Meanwhile, based on a standardized pipeline, EDK v2.0 provides RNA editing profiles from 48 datasets covering 2536 samples across 55 diseases. Through differential analysis on RNA editing, it further identifies a total of 7190 differential edited genes and 86 242 differential editing sites (DESs), leading to 266 339 DES-disease associations. Moreover, a curated list of 28 160 cis-RNA editing QTL associations, 458 187 DES-RNA binding protein associations, and 21 DES-RNA secondary structure associations are annotated and added to EDK v2.0. Additionally, it is equipped with a series of user-friendly tools to facilitate RNA editing online analysis. Availability and implementation https://ngdc.cncb.ac.cn/edk/.
Collapse
Affiliation(s)
- Tongtong Zhu
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuan Chu
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guangyi Niu
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Rong Pan
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ming Chen
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuanyuan Cheng
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuansheng Zhang
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhao Li
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shuai Jiang
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Lili Hao
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Dong Zou
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tianyi Xu
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhang Zhang
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
3
|
Tants JN, Schlundt A. The role of structure in regulatory RNA elements. Biosci Rep 2024; 44:BSR20240139. [PMID: 39364891 PMCID: PMC11499389 DOI: 10.1042/bsr20240139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 10/02/2024] [Accepted: 10/04/2024] [Indexed: 10/05/2024] Open
Abstract
Regulatory RNA elements fulfill functions such as translational regulation, control of transcript levels, and regulation of viral genome replication. Trans-acting factors (i.e., RNA-binding proteins) bind the so-called cis elements and confer functionality to the complex. The specificity during protein-RNA complex (RNP) formation often exploits the structural plasticity of RNA. Functional integrity of cis-trans pairs depends on the availability of properly folded RNA elements, and RNA conformational transitions can cause diseases. Knowledge of RNA structure and the conformational space is needed for understanding complex formation and deducing functional effects. However, structure determination of RNAs under in vivo conditions remains challenging. This review provides an overview of structured eukaryotic and viral RNA cis elements and discusses the effect of RNA structural equilibria on RNP formation. We showcase implications of RNA structural changes for diseases, outline strategies for RNA structure-based drug targeting, and summarize the methodological toolbox for deciphering RNA structures.
Collapse
Affiliation(s)
- Jan-Niklas Tants
- Institute for Molecular Biosciences and Biomolecular Resonance Center (BMRZ), Goethe University Frankfurt, Max-von-Laue-Str. 7-9, 60438 Frankfurt, Germany
| | - Andreas Schlundt
- Institute for Molecular Biosciences and Biomolecular Resonance Center (BMRZ), Goethe University Frankfurt, Max-von-Laue-Str. 7-9, 60438 Frankfurt, Germany
- University of Greifswald, Institute of Biochemistry, Felix-Hausdorff-Str. 4, 17489 Greifswald, Germany
| |
Collapse
|
4
|
Xu Q, Bao X, Lin Z, Tang L, He LN, Ren J, Zuo Z, Hu K. AStruct: detection of allele-specific RNA secondary structure in structuromic probing data. BMC Bioinformatics 2024; 25:91. [PMID: 38429654 PMCID: PMC11264973 DOI: 10.1186/s12859-024-05704-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 02/14/2024] [Indexed: 03/03/2024] Open
Abstract
BACKGROUND Uncovering functional genetic variants from an allele-specific perspective is of paramount importance in advancing our understanding of gene regulation and genetic diseases. Recently, various allele-specific events, such as allele-specific gene expression, allele-specific methylation, and allele-specific binding, have been explored on a genome-wide scale due to the development of high-throughput sequencing methods. RNA secondary structure, which plays a crucial role in multiple RNA-associated processes like RNA modification, translation and splicing, has emerged as an essential focus of relevant research. However, tools to identify genetic variants associated with allele-specific RNA secondary structures are still lacking. RESULTS Here, we develop a computational tool called 'AStruct' that enables us to detect allele-specific RNA secondary structure (ASRS) from RT-stop based structuromic probing data. AStruct shows robust performance in both simulated datasets and public icSHAPE datasets. We reveal that single nucleotide polymorphisms (SNPs) with higher AStruct scores are enriched in coding regions and tend to be functional. These SNPs are highly conservative, have the potential to disrupt sites involved in m6A modification or protein binding, and are frequently associated with disease. CONCLUSIONS AStruct is a tool dedicated to invoke allele-specific RNA secondary structure events at heterozygous SNPs in RT-stop based structuromic probing data. It utilizes allelic variants, base pairing and RT-stop information under different cell conditions to detect dynamic and functional ASRS. Compared to sequence-based tools, AStruct considers dynamic cell conditions and outperforms in detecting functional variants. AStruct is implemented in JAVA and is freely accessible at: https://github.com/canceromics/AStruct .
Collapse
Affiliation(s)
- Qingru Xu
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Xiaoqiong Bao
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Zhuobin Lin
- Guangdong Key Laboratory of Liver Disease Research, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Lin Tang
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Li-Na He
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Jian Ren
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Zhixiang Zuo
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China.
| | - Kunhua Hu
- Guangdong Key Laboratory of Liver Disease Research, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
5
|
Assmann SM, Chou HL, Bevilacqua PC. Rock, scissors, paper: How RNA structure informs function. THE PLANT CELL 2023; 35:1671-1707. [PMID: 36747354 PMCID: PMC10226581 DOI: 10.1093/plcell/koad026] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 01/05/2023] [Accepted: 01/30/2023] [Indexed: 05/30/2023]
Abstract
RNA can fold back on itself to adopt a wide range of structures. These range from relatively simple hairpins to intricate 3D folds and can be accompanied by regulatory interactions with both metabolites and macromolecules. The last 50 yr have witnessed elucidation of an astonishing array of RNA structures including transfer RNAs, ribozymes, riboswitches, the ribosome, the spliceosome, and most recently entire RNA structuromes. These advances in RNA structural biology have deepened insight into fundamental biological processes including gene editing, transcription, translation, and structure-based detection and response to temperature and other environmental signals. These discoveries reveal that RNA can be relatively static, like a rock; that it can have catalytic functions of cutting bonds, like scissors; and that it can adopt myriad functional shapes, like paper. We relate these extraordinary discoveries in the biology of RNA structure to the plant way of life. We trace plant-specific discovery of ribozymes and riboswitches, alternative splicing, organellar ribosomes, thermometers, whole-transcriptome structuromes and pan-structuromes, and conclude that plants have a special set of RNA structures that confer unique types of gene regulation. We finish with a consideration of future directions for the RNA structure-function field.
Collapse
Affiliation(s)
- Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Hong-Li Chou
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Philip C Bevilacqua
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
6
|
How does precursor RNA structure influence RNA processing and gene expression? Biosci Rep 2023; 43:232489. [PMID: 36689327 PMCID: PMC9977717 DOI: 10.1042/bsr20220149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 01/17/2023] [Accepted: 01/23/2023] [Indexed: 01/24/2023] Open
Abstract
RNA is a fundamental biomolecule that has many purposes within cells. Due to its single-stranded and flexible nature, RNA naturally folds into complex and dynamic structures. Recent technological and computational advances have produced an explosion of RNA structural data. Many RNA structures have regulatory and functional properties. Studying the structure of nascent RNAs is particularly challenging due to their low abundance and long length, but their structures are important because they can influence RNA processing. Precursor RNA processing is a nexus of pathways that determines mature isoform composition and that controls gene expression. In this review, we examine what is known about human nascent RNA structure and the influence of RNA structure on processing of precursor RNAs. These known structures provide examples of how other nascent RNAs may be structured and show how novel RNA structures may influence RNA processing including splicing and polyadenylation. RNA structures can be targeted therapeutically to treat disease.
Collapse
|
7
|
RNA Secondary Structure Alteration Caused by Single Nucleotide Variants. Methods Mol Biol 2023; 2586:107-120. [PMID: 36705901 DOI: 10.1007/978-1-0716-2768-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
A point mutation in coding RNA can cause not only an amino acid substitution but also a dynamic change of RNA secondary structure, leading to a dysfunctional RNA. Although in silico structure prediction has been used to detect structure-disrupting point mutations known as riboSNitches, exhaustive simulation of long RNAs is needed to detect a significant enrichment or depletion of riboSNitches in functional RNAs. Here, we have developed a novel algorithm Radiam (RNA secondary structure Analysis with Deletion, Insertion, And substitution Mutations) for a comprehensive riboSNitch analysis of long RNAs. Radiam is based on the ParasoR framework, which efficiently computes local RNA secondary structures for long RNAs. ParasoR can compute a variety of structure scores over globally consistent structures with maximal span constraints for the base pair distance. Using the reusable structure database made by ParasoR, Radiam performs an efficient recomputation of the secondary structures for mutated sequences. An exhaustive simulation of Radiam is expected to find reliable riboSNitch candidates on long RNAs by evaluating their statistical significance in terms of the change of local structure stability.
Collapse
|
8
|
Yang W, Lyu Y, Xiang R, Yang J. Long Noncoding RNAs in the Pathogenesis of Insulin Resistance. Int J Mol Sci 2022; 23:ijms232416054. [PMID: 36555704 PMCID: PMC9785789 DOI: 10.3390/ijms232416054] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 12/10/2022] [Accepted: 12/14/2022] [Indexed: 12/23/2022] Open
Abstract
Insulin resistance (IR), designated as the blunted response of insulin target tissues to physiological level of insulin, plays crucial roles in the development and progression of diabetes, nonalcoholic fatty liver disease (NAFLD) and other diseases. So far, the distinct mechanism(s) of IR still needs further exploration. Long non-coding RNA (lncRNA) is a class of non-protein coding RNA molecules with a length greater than 200 nucleotides. LncRNAs are widely involved in many biological processes including cell differentiation, proliferation, apoptosis and metabolism. More recently, there has been increasing evidence that lncRNAs participated in the pathogenesis of IR, and the dysregulated lncRNA profile played important roles in the pathogenesis of metabolic diseases including obesity, diabetes and NAFLD. For example, the lncRNAs MEG3, H19, MALAT1, GAS5, lncSHGL and several other lncRNAs have been shown to regulate insulin signaling and glucose/lipid metabolism in various tissues. In this review, we briefly introduced the general features of lncRNA and the methods for lncRNA research, and then summarized and discussed the recent advances on the roles and mechanisms of lncRNAs in IR, particularly focused on liver, skeletal muscle and adipose tissues.
Collapse
Affiliation(s)
- Weili Yang
- Beijing Key Laboratory of Diabetes Research and Care, Beijing Diabetes Institute, Beijing Tongren Hospital, Capital Medical University, Beijing 100730, China
| | - Yixiang Lyu
- Department of Physiology and Pathophysiology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China
- Key Laboratory of Cardiovascular Science of the Ministry of Education, Center for Non-Coding RNA Medicine, Beijing 100191, China
| | - Rui Xiang
- Department of Physiology and Pathophysiology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China
- Key Laboratory of Cardiovascular Science of the Ministry of Education, Center for Non-Coding RNA Medicine, Beijing 100191, China
| | - Jichun Yang
- Department of Physiology and Pathophysiology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China
- Key Laboratory of Cardiovascular Science of the Ministry of Education, Center for Non-Coding RNA Medicine, Beijing 100191, China
- Correspondence:
| |
Collapse
|
9
|
Waldern JM, Kumar J, Laederach A. Disease-associated human genetic variation through the lens of precursor and mature RNA structure. Hum Genet 2022; 141:1659-1672. [PMID: 34741198 PMCID: PMC9072596 DOI: 10.1007/s00439-021-02395-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 10/26/2021] [Indexed: 12/14/2022]
Abstract
Disease-associated variants (DAVs) are commonly considered either through a genomic lens that describes variant function at the DNA level, or at the protein function level if the variant is translated. Although the genomic and proteomic effects of variation are well-characterized, genetic variants disrupting post-transcriptional regulation is another mechanism of disease that remains understudied. Specific RNA sequence motifs mediate post-transcriptional regulation both in the nucleus and cytoplasm of eukaryotic cells, often by binding to RNA-binding proteins or other RNAs. However, many DAVs map far from these motifs, which suggests deeper layers of post-transcriptional mechanistic control. Here, we consider a transcriptomic framework to outline the importance of post-transcriptional regulation as a mechanism of disease-causing single-nucleotide variation in the human genome. We first describe the composition of the human transcriptome and the importance of abundant yet overlooked components such as introns and untranslated regions (UTRs) of messenger RNAs (mRNAs). We present an analysis of Human Gene Mutation Database variants mapping to mRNAs and examine the distribution of causative disease-associated variation across the transcriptome. Although our analysis confirms the importance of post-transcriptional regulatory motifs, a majority of DAVs do not directly map to known regulatory motifs. Therefore, we review evidence that regions outside these well-characterized motifs can regulate function by RNA structure-mediated mechanisms in all four elements of an mRNA: exons, introns, 5' and 3' UTRs. To this end, we review published examples of riboSNitches, which are single-nucleotide variants that result in a change in RNA structure that is causative of the disease phenotype. In this review, we present the current state of knowledge of how DAVs act at the transcriptome level, both through altering post-transcriptional regulatory motifs and by the effects of RNA structure.
Collapse
Affiliation(s)
- Justin M Waldern
- Department of Biology, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Jayashree Kumar
- Department of Biology, University of North Carolina, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, NC, 27599, USA.
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| |
Collapse
|
10
|
G R, Mitra A, Pk V. Predicting functional riboSNitches in the context of alternative splicing. Gene X 2022; 837:146694. [PMID: 35738445 DOI: 10.1016/j.gene.2022.146694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 05/11/2022] [Accepted: 06/17/2022] [Indexed: 11/19/2022] Open
Abstract
RNAs are the major regulators of gene expression, and their secondary structures play crucial roles at different levels. RiboSNitches are disease-associated SNPs that cause changes in the pre-mRNA secondary structural ensemble. Several riboSNitches have been detected in the 5' and 3' untranslated regions and lncRNA. Although cases of secondary structural elements playing a regulatory role in alternative splicing are known, regions specific to splicing events, such as splice junctions have not received much attention. We tested splice-site mutations for their efficiency in disrupting the secondary structure and hypothesized that these could play a crucial role in alternative splicing. Multiple riboSNitch prediction methods were applied to obtain overlapping results that are potentially more reliable. Putative riboSNitches were identified from aberrant 5' and 3' splice site mutations, cancer-causing somatic mutations, and genes that harbor the regulatory RNA secondary structural elements. Our workflow for predicting riboSNitches associated with alternative splicing is novel and paves the way for subsequent experimental validation.
Collapse
Affiliation(s)
- Ramya G
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India.
| | - Abhijit Mitra
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India.
| | - Vinod Pk
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India.
| |
Collapse
|
11
|
Ferrero-Serrano Á, Sylvia MM, Forstmeier PC, Olson AJ, Ware D, Bevilacqua PC, Assmann SM. Experimental demonstration and pan-structurome prediction of climate-associated riboSNitches in Arabidopsis. Genome Biol 2022; 23:101. [PMID: 35440059 PMCID: PMC9017077 DOI: 10.1186/s13059-022-02656-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 03/20/2022] [Indexed: 11/23/2022] Open
Abstract
Background Genome-wide association studies (GWAS) aim to correlate phenotypic changes with genotypic variation. Upon transcription, single nucleotide variants (SNVs) may alter mRNA structure, with potential impacts on transcript stability, macromolecular interactions, and translation. However, plant genomes have not been assessed for the presence of these structure-altering polymorphisms or “riboSNitches.” Results We experimentally demonstrate the presence of riboSNitches in transcripts of two Arabidopsis genes, ZINC RIBBON 3 (ZR3) and COTTON GOLGI-RELATED 3 (CGR3), which are associated with continentality and temperature variation in the natural environment. These riboSNitches are also associated with differences in the abundance of their respective transcripts, implying a role in regulating the gene's expression in adaptation to local climate conditions. We then computationally predict riboSNitches transcriptome-wide in mRNAs of 879 naturally inbred Arabidopsis accessions. We characterize correlations between SNPs/riboSNitches in these accessions and 434 climate descriptors of their local environments, suggesting a role of these variants in local adaptation. We integrate this information in CLIMtools V2.0 and provide a new web resource, T-CLIM, that reveals associations between transcript abundance variation and local environmental variation. Conclusion We functionally validate two plant riboSNitches and, for the first time, demonstrate riboSNitch conditionality dependent on temperature, coining the term “conditional riboSNitch.” We provide the first pan-genome-wide prediction of riboSNitches in plants. We expand our previous CLIMtools web resource with riboSNitch information and with 1868 additional Arabidopsis genomes and 269 additional climate conditions, which will greatly facilitate in silico studies of natural genetic variation, its phenotypic consequences, and its role in local adaptation. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02656-4.
Collapse
Affiliation(s)
- Ángel Ferrero-Serrano
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| | - Megan M Sylvia
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Peter C Forstmeier
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Andrew J Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.,USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | - Philip C Bevilacqua
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Department of Chemistry, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA. .,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| |
Collapse
|
12
|
Dynamic Molecular Epidemiology Reveals Lineage-Associated Single-Nucleotide Variants That Alter RNA Structure in Chikungunya Virus. Genes (Basel) 2021; 12:genes12020239. [PMID: 33567556 PMCID: PMC7914560 DOI: 10.3390/genes12020239] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 01/29/2021] [Accepted: 02/04/2021] [Indexed: 01/21/2023] Open
Abstract
Chikungunya virus (CHIKV) is an emerging Alphavirus which causes millions of human infections every year. Outbreaks have been reported in Africa and Asia since the early 1950s, from three CHIKV lineages: West African, East Central South African, and Asian Urban. As new outbreaks occurred in the Americas, individual strains from the known lineages have evolved, creating new monophyletic groups that generated novel geographic-based lineages. Building on a recently updated phylogeny of CHIKV, we report here the availability of an interactive CHIKV phylodynamics dataset, which is based on more than 900 publicly available CHIKV genomes. We provide an interactive view of CHIKV molecular epidemiology built on Nextstrain, a web-based visualization framework for real-time tracking of pathogen evolution. CHIKV molecular epidemiology reveals single nucleotide variants that change the stability and fold of locally stable RNA structures. We propose alternative RNA structure formation in different CHIKV lineages by predicting more than a dozen RNA elements that are subject to perturbation of the structure ensemble upon variation of a single nucleotide.
Collapse
|