1
|
Szikszai M, Magnus M, Sanghi S, Kadyan S, Bouatta N, Rivas E. RNA3DB: A structurally-dissimilar dataset split for training and benchmarking deep learning models for RNA structure prediction. J Mol Biol 2024:168552. [PMID: 38552946 DOI: 10.1016/j.jmb.2024.168552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 03/19/2024] [Accepted: 03/22/2024] [Indexed: 04/09/2024]
Abstract
With advances in protein structure prediction thanks to deep learning models like AlphaFold, RNA structure prediction has recently received increased attention from deep learning researchers. RNAs introduce substantial challenges due to the sparser availability and lower structural diversity of the experimentally resolved RNA structures in comparison to protein structures. These challenges are often poorly addressed by the existing literature, many of which report inflated performance due to using training and testing sets with significant structural overlap. Further, the most recent Critical Assessment of Structure Prediction (CASP15) has shown that deep learning models for RNA structure are currently outperformed by traditional methods. In this paper we present RNA3DB, a dataset of structured RNAs, derived from the Protein Data Bank (PDB), that is designed for training and benchmarking deep learning models. The RNA3DB method arranges the RNA 3D chains into distinct groups (Components) that are non-redundant both with regard to sequence as well as structure, providing a robust way of dividing training, validation, and testing sets. Any split of these structurally-dissimilar Components are guaranteed to produce test and validations sets that are distinct by sequence and structure from those in the training set. We provide the RNA3DB dataset, a particular train/test split of the RNA3DB Components (in an approximate 70/30 ratio) that will be updated periodically. We also provide the RNA3DB methodology along with the source-code, with the goal of creating a reproducible and customizable tool for producing structurally-dissimilar dataset splits for structural RNAs.
Collapse
Affiliation(s)
- Marcell Szikszai
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Marcin Magnus
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Siddhant Sanghi
- Department of Systems Biology, Columbia University, New York 10027, NY, USA; College of Biological Sciences, UC Davis, Davis 95616, CA, USA
| | - Sachin Kadyan
- Department of Systems Biology, Columbia University, New York 10027, NY, USA
| | - Nazim Bouatta
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston 02115, MA, USA
| | - Elena Rivas
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| |
Collapse
|
2
|
Xie S, He J, Feng B, Rao D, Wang S, He Y. A potential biological signature of 7-methylguanosine-related lncRNA to predict the immunotherapy effects in bladder cancer. Heliyon 2023; 9:e15897. [PMID: 37215925 PMCID: PMC10199227 DOI: 10.1016/j.heliyon.2023.e15897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 04/12/2023] [Accepted: 04/25/2023] [Indexed: 05/24/2023] Open
Abstract
Background Bladder urothelial carcinoma (BLCA) is the second prevalent genitourinary carcinoma globally. N7-methylguanosine (m7G) is important for tumorigenesis and progression. This study aimed to build a predictive model for m7G-related long non-coding RNAs (lncRNAs), elucidate their role in the tumor immune microenvironment (TIME), and predict immunotherapy response in BLCA. Methods We first used univariate Cox regression and coexpression analyses to identify m7G-related lncRNAs. Next, the prognostic model was built by utilizing LASSO regression analysis. Then, the prognostic significance of the model was examined utilizing Kaplan-Meier survival analysis, receiver operating characteristic (ROC) curves, nomogram, and univariate, multivariate Cox regression. We also analyzed Gene set enrichment analyses (GSEA), immune analysis and principal component analysis (PCA) in risk groups. To further predict immunotherapy effectiveness, we evaluated the predictive ability for immunotherapy in 2 risk groups and clusters using tumor immune dysfunction and exclusion (TIDE) score and Immunophenoscore (IPS). Results Seven lncRNAs related to m7G were used to create a model. The calibration plots for the model suggested a strong fit with the prediction of overall survival (OS). The area under the curve (AUC) for first, second, and third years was respectively, 0.722, 0.711, and 0.686. In addition, the risk score had strong correlation with TIME features and genes linked to immune checkpoint blockade (ICB). TIDE scores were dramatically different between two risk groups (p < 0.05), and IPS scores were markedly different between two clusters (p < 0.05). Conclusion Our research constructed a novel m7G-related lncRNAs that could be used to predict patient outcomes and the effectiveness of immunotherapy in BLCA. Immunotherapy may be more effective for the low-risk group and cluster 2.
Collapse
Affiliation(s)
- Shangxun Xie
- Department of Urology, The Second Affiliated Hospital and Yuying Children's Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province 325000, People's Republic of China
| | - Jibao He
- Department of Urology, Jiangsu Province Academy of Traditional Chinese Medicine, Nanjing, Jiangsu Province 210028, People's Republic of China
| | - Baofu Feng
- Nanjing Medical University, Nanjing, Jiangsu Province 210028, People's Republic of China
| | - Dapang Rao
- Department of Urology, The Second Affiliated Hospital and Yuying Children's Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province 325000, People's Republic of China
| | - Shuaibin Wang
- Department of Urology, The Second Affiliated Hospital and Yuying Children's Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province 325000, People's Republic of China
| | - Youhua He
- Department of Urology, The Second Affiliated Hospital and Yuying Children's Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province 325000, People's Republic of China
| |
Collapse
|
3
|
Zhang C, Xia J, Zhang S, Li J, Zhou T, Hu K. Expression pattern, tumor immune landscape, and prognostic value of N7‑methylguanosine regulators in bladder urothelial carcinoma. Oncol Lett 2023; 25:169. [PMID: 36960192 PMCID: PMC10028492 DOI: 10.3892/ol.2023.13755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 02/17/2023] [Indexed: 03/12/2023] Open
Abstract
N7-Methylguanosine (m7G) modification is important in post-transcriptional regulation. dysregulation of m7G RNA modification has been reported to be markedly associated with cancer. However, its importance in bladder urothelial carcinoma (BLCA) remains poorly characterized. The present study systematically analyzed mRNA gene expression data and clinical information from The Cancer Genome Atlas and further constructed robust risk signatures for the four regulators of m7G RNA modification (nudix hydrolase 11, gem nuclear organelle-associated protein 5, eukaryotic translation initiation factor 3 subunit D and cytoplasmic FMR1 interacting protein 1). The differential expression and cell function of m7G-related genes in bladder cancer cells were verified by reverse transcription-quantitative PCR, Cell Counting Kit-8 and colony formation assays. The four-gene-based model could accurately predict the prognosis of BLCA. Nomogram-based clinical decisions had a higher net benefit compared with that of individual predictors. Through immune infiltration analysis, it was found that immune cell infiltration affected the prognosis of patients with BLCA. Finally, the present study identified potential therapeutics that differ between high and low-risk groups based on four genes. In summary, the current findings revealed an essential role for m7G RNA modification regulators in BLCA, and developed risk signatures as promising prognostic markers in patients with BLCA.
Collapse
Affiliation(s)
- Chi Zhang
- Department of Oncology, Dongfang Hospital, Beijing University of Chinese Medicine, Beijing 100078, P.R. China
| | - Jiangnan Xia
- School of Basic Medical Sciences, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong 510006, P.R. China
| | - Simiao Zhang
- School of Chinese Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, P.R. China
| | - Jing Li
- Department of Oncology, The First Hospital of Hunan University of Chinese Medicine, Changsha, Hunan 410021, P.R. China
| | - Tian Zhou
- Department of Oncology, Dongfang Hospital, Beijing University of Chinese Medicine, Beijing 100078, P.R. China
| | - Kaiwen Hu
- Department of Oncology, Dongfang Hospital, Beijing University of Chinese Medicine, Beijing 100078, P.R. China
- Correspondence to: Dr Kaiwen Hu, Department of Oncology, Dongfang Hospital, Beijing University of Chinese Medicine, 6 Fangxingyuan, Fengtai, Beijing 100078, P.R. China, E-mail:
| |
Collapse
|
4
|
Proteostasis Deregulation in Neurodegeneration and Its Link with Stress Granules: Focus on the Scaffold and Ribosomal Protein RACK1. Cells 2022; 11:cells11162590. [PMID: 36010666 PMCID: PMC9406587 DOI: 10.3390/cells11162590] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 08/09/2022] [Accepted: 08/17/2022] [Indexed: 12/12/2022] Open
Abstract
The role of protein misfolding, deposition, and clearance has been the dominant topic in the last decades of investigation in the field of neurodegeneration. The impairment of protein synthesis, along with RNA metabolism and RNA granules, however, are significantly emerging as novel potential targets for the comprehension of the molecular events leading to neuronal deficits. Indeed, defects in ribosome activity, ribosome stalling, and PQC—all ribosome-related processes required for proteostasis regulation—can contribute to triggering stress conditions and promoting the formation of stress granules (SGs) that could evolve in the formation of pathological granules, usually occurring during neurodegenerating effects. In this review, the interplay between proteostasis, mRNA metabolism, and SGs has been explored in a neurodegenerative context with a focus on Alzheimer’s disease (AD), although some defects in these same mechanisms can also be found in frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS), which are discussed here. Finally, we highlight the role of the receptor for activated C kinase 1 (RACK1) in these pathologies and note that, besides its well characterized function as a scaffold protein, it has an important role in translation and can associate to stress granules (SGs) determining cell fate in response to diverse stress stimuli.
Collapse
|
5
|
Xiao Y, Yang J, Yang M, Len J, Yu Y. Comprehensive analysis of 7-methylguanosine and immune microenvironment characteristics in clear cell renal cell carcinomas. Front Genet 2022; 13:866819. [PMID: 36003341 PMCID: PMC9393245 DOI: 10.3389/fgene.2022.866819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 07/08/2022] [Indexed: 11/13/2022] Open
Abstract
Clear cell renal cell carcinoma (ccRCC) is one of the most common tumors in the urinary system. ccRCC has obvious immunological characteristics, and the infiltration of immune cells is related to the prognosis of ccRCC. The effect of immune checkpoint therapy is related to the dynamic changes of the tumor immune microenvironment (TIM). The 7-methylguanosine (m7G) is an additional mRNA modification ability besides m6A, which is closely related to the TIM and affects the occurrence and development of tumors. At present, the correlations between m7G and the immune microenvironment, treatment, and prognosis of ccRCC are not clear. As far as we know, there was no study on the relationship between m7G and the immune microenvironment and survival of clear cell renal cell carcinomas. A comprehensive analysis of the correlations between them and the construction of a prognosis model are helpful to improve the treatment strategy. Two different molecular subtypes were identified in 539 ccRCC samples by describing the differences of 29 m7G-related genes. It was found that the clinical features, TIM, and prognosis of ccRCC patients were correlated with the m7G-related genes. We found that there were significant differences in the expression of PD-1, CTLA4, and PD-L1 between high- and low-risk groups. To sum up, m7G-related genes play a potential role in the TIM, treatment, and prognosis of ccRCC. Our results provide new findings for ccRCC and help to improve the immunotherapy strategies and prognosis of patients.
Collapse
Affiliation(s)
- Yu Xiao
- The Affiliated Hospital, Kunming University of Science and Technology, Kunming, China
- Department of Urology, The First People’s Hospital of Yunnan Province, Kunming, YN, China
| | - Junfeng Yang
- The Affiliated Hospital, Kunming University of Science and Technology, Kunming, China
- Department of Urology, The First People’s Hospital of Yunnan Province, Kunming, YN, China
| | - Maolin Yang
- The Affiliated Hospital, Kunming University of Science and Technology, Kunming, China
- Department of Urology, The First People’s Hospital of Yunnan Province, Kunming, YN, China
| | - Jinjun Len
- The Affiliated Hospital, Kunming University of Science and Technology, Kunming, China
- Department of Urology, The First People’s Hospital of Yunnan Province, Kunming, YN, China
| | - Yanhong Yu
- The Affiliated Hospital, Kunming University of Science and Technology, Kunming, China
- Department of Urology, The First People’s Hospital of Yunnan Province, Kunming, YN, China
- *Correspondence: Yanhong Yu,
| |
Collapse
|
6
|
Towards SINEUP-based therapeutics: Design of an in vitro synthesized SINEUP RNA. MOLECULAR THERAPY. NUCLEIC ACIDS 2022; 27:1092-1102. [PMID: 35228902 PMCID: PMC8857549 DOI: 10.1016/j.omtn.2022.01.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 01/28/2022] [Indexed: 12/28/2022]
Abstract
SINEUPs are a novel class of natural and synthetic non-coding antisense RNA molecules able to increase the translation of a target mRNA. They present a modular organization comprising an unstructured antisense target-specific domain, which sets the specificity of each individual SINEUP, and a structured effector domain, which is responsible for the translation enhancement. In order to design a fully functional in vitro transcribed SINEUP for therapeutics applications, SINEUP RNAs were synthesized in vitro with a variety of chemical modifications and screened for their activity on endogenous target mRNA upon transfection. Three combinations of modified ribonucleotides-2'O methyl-ATP (Am), N6 methyl-ATP (m6A), and pseudo-UTP (ψ)-conferred SINEUP activity to naked RNA. The best combination tested in this study was fully modified with m6A and ψ. Aside from functionality, this combination conferred improved stability upon transfection and higher thermal stability. Common structural determinants of activity were identified by circular dichroisms, defining a core functional structure that is achieved with different combinations of modifications.
Collapse
|
7
|
SINEUPs: a novel toolbox for RNA therapeutics. Essays Biochem 2021; 65:775-789. [PMID: 34623427 PMCID: PMC8564737 DOI: 10.1042/ebc20200114] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 07/22/2021] [Accepted: 08/23/2021] [Indexed: 12/17/2022]
Abstract
RNA molecules have emerged as a new class of promising therapeutics to expand the range of druggable targets in the genome. In addition to ‘canonical’ protein-coding mRNAs, the emerging richness of sense and antisense long non-coding RNAs (lncRNAs) provides a new reservoir of molecular tools for RNA-based drugs. LncRNAs are composed of modular structural domains with specific activities involving the recruitment of protein cofactors or directly interacting with nucleic acids. A single therapeutic RNA transcript can then be assembled combining domains with defined secondary structures and functions, and antisense sequences specific for the RNA/DNA target of interest. As the first representative molecules of this new pharmacology, we have identified SINEUPs, a new functional class of natural antisense lncRNAs that increase the translation of partially overlapping mRNAs. Their activity is based on the combination of two domains: an embedded mouse inverted SINEB2 element that enhances mRNA translation (effector domain) and an overlapping antisense region that provides specificity for the target sense transcript (binding domain). By genetic engineering, synthetic SINEUPs can potentially target any mRNA of interest increasing translation and therefore the endogenous level of the encoded protein. In this review, we describe the state-of-the-art knowledge of SINEUPs and discuss recent publications showing their potential application in diseases where a physiological increase of endogenous protein expression can be therapeutic.
Collapse
|
8
|
Abstract
Systematics is described for annotation of variations in RNA molecules. The conceptual framework is part of Variation Ontology (VariO) and facilitates depiction of types of variations, their functional and structural effects and other consequences in any RNA molecule in any organism. There are more than 150 RNA related VariO terms in seven levels, which can be further combined to generate even more complicated and detailed annotations. The terms are described together with examples, usually for variations and effects in human and in diseases. RNA variation type has two subcategories: variation classification and origin with subterms. Altogether six terms are available for function description. Several terms are available for affected RNA properties. The ontology contains also terms for structural description for affected RNA type, post-transcriptional RNA modifications, secondary and tertiary structure effects and RNA sugar variations. Together with the DNA and protein concepts and annotations, RNA terms allow comprehensive description of variations of genetic and non-genetic origin at all possible levels. The VariO annotations are readable both for humans and computer programs for advanced data integration and mining.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
9
|
Cinget B, Bélanger RR. Discovery of new group I-D introns leads to creation of subtypes and link to an adaptive response of the mitochondrial genome in fungi. RNA Biol 2020; 17:1252-1260. [PMID: 32449459 PMCID: PMC7595605 DOI: 10.1080/15476286.2020.1763024] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Group I catalytic introns are widespread in bacterial, archaeal, viral, organellar, and some eukaryotic genomes, where they are reported to provide regulatory functions. The group I introns are currently divided into five types (A-E), which are themselves distributed into several subtypes, with the exception of group I type D intron (GI-D). GI-D introns belong to the rarest group with only 17 described to date, including only one with a putative role reported in fungi, where it would interfere with an adaptive response in the cytochrome b (COB) gene to quinone outside inhibitor (QoI) fungicide resistance. Using homology search methods taking into account both conserved sequences and RNA secondary structures, we analysed the mitochondrial genomes or COB genes of 169 fungal species, including some frequently under QoI selection pressure. These analyses have led to the identification of 216 novel GI-D introns, and the definition of three distinct subtypes, one of which being linked with a functional activity. We have further uncovered a homing site for this GI-D intron type, which helps refine the accepted model of quinone outside inhibitor resistance, whereby mobility of the intron across fungal mitochondrial genomes, would influence a fungus ability to develop resistance to QoIs.
Collapse
Affiliation(s)
- Benjamin Cinget
- Département de Phytologie, Faculty of Agriculture and Food Sciences, Centre de Recherche en Innovation des Végétaux (CRIV), Université Laval , Québec, Québec, Canada
| | - Richard R Bélanger
- Département de Phytologie, Faculty of Agriculture and Food Sciences, Centre de Recherche en Innovation des Végétaux (CRIV), Université Laval , Québec, Québec, Canada
| |
Collapse
|
10
|
Magnus M, Antczak M, Zok T, Wiedemann J, Lukasiak P, Cao Y, Bujnicki JM, Westhof E, Szachniuk M, Miao Z. RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools. Nucleic Acids Res 2020; 48:576-588. [PMID: 31799609 PMCID: PMC7145511 DOI: 10.1093/nar/gkz1108] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 11/06/2019] [Accepted: 11/15/2019] [Indexed: 12/12/2022] Open
Abstract
Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods during the succeeding challenges of RNA-Puzzles, a community-wide effort on the assessment of blind prediction of RNA tertiary structures. The RNA-Puzzles contest has shown, among others, that the development and validation of computational methods for RNA fold prediction strongly depend on the benchmark datasets and the structure comparison algorithms. Yet, there has been no systematic benchmark set or decoy structures available for the 3D structure prediction of RNA, hindering the standardization of comparative tests in the modeling of RNA structure. Furthermore, there has not been a unified set of tools that allows deep and complete RNA structure analysis, and at the same time, that is easy to use. Here, we present RNA-Puzzles toolkit, a computational resource including (i) decoy sets generated by different RNA 3D structure prediction methods (raw, for-evaluation and standardized datasets), (ii) 3D structure normalization, analysis, manipulation, visualization tools (RNA_format, RNA_normalizer, rna-tools) and (iii) 3D structure comparison metric tools (RNAQUA, MCQ4Structures). This resource provides a full list of computational tools as well as a standard RNA 3D structure prediction assessment protocol for the community.
Collapse
Affiliation(s)
- Marcin Magnus
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- ReMedy-International Research Agenda Unit, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Maciej Antczak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
| | - Jakub Wiedemann
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Piotr Lukasiak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, PR China
| | - Janusz M Bujnicki
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 12 allée Konrad Roentgen, 67084 Strasbourg, France
| | - Marta Szachniuk
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence and Department of Anesthesiology, Shanghai Fourth People's Hospital Affiliated to Tongji University School of Medicine, Shanghai 200081, China
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
11
|
Magnus M, Kappel K, Das R, Bujnicki JM. RNA 3D structure prediction guided by independent folding of homologous sequences. BMC Bioinformatics 2019; 20:512. [PMID: 31640563 PMCID: PMC6806525 DOI: 10.1186/s12859-019-3120-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Accepted: 10/01/2019] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND The understanding of the importance of RNA has dramatically changed over recent years. As in the case of proteins, the function of an RNA molecule is encoded in its tertiary structure, which in turn is determined by the molecule's sequence. The prediction of tertiary structures of complex RNAs is still a challenging task. RESULTS Using the observation that RNA sequences from the same RNA family fold into conserved structure, we test herein whether parallel modeling of RNA homologs can improve ab initio RNA structure prediction. EvoClustRNA is a multi-step modeling process, in which homologous sequences for the target sequence are selected using the Rfam database. Subsequently, independent folding simulations using Rosetta FARFAR and SimRNA are carried out. The model of the target sequence is selected based on the most common structural arrangement of the common helical fragments. As a test, on two blind RNA-Puzzles challenges, EvoClustRNA predictions ranked as the first of all submissions for the L-glutamine riboswitch and as the second for the ZMP riboswitch. Moreover, through a benchmark of known structures, we discovered several cases in which particular homologs were unusually amenable to structure recovery in folding simulations compared to the single original target sequence. CONCLUSION This work, for the first time to our knowledge, demonstrates the importance of the selection of the target sequence from an alignment of an RNA family for the success of RNA 3D structure prediction. These observations prompt investigations into a new direction of research for checking 3D structure "foldability" or "predictability" of related RNA sequences to obtain accurate predictions. To support new research in this area, we provide all relevant scripts in a documented and ready-to-use form. By exploring new ideas and identifying limitations of the current RNA 3D structure prediction methods, this work is bringing us closer to the near-native computational RNA 3D models.
Collapse
Affiliation(s)
- Marcin Magnus
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA USA
| | - Rhiju Das
- Biophysics Program, Stanford University, Stanford, CA USA
- Department of Biochemistry, Stanford University, Stanford, CA USA
- Department of Physics, Stanford University, Stanford, CA USA
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Poznan, Poland
| |
Collapse
|
12
|
Prats-Ejarque G, Lu L, Salazar VA, Moussaoui M, Boix E. Evolutionary Trends in RNA Base Selectivity Within the RNase A Superfamily. Front Pharmacol 2019; 10:1170. [PMID: 31649540 PMCID: PMC6794472 DOI: 10.3389/fphar.2019.01170] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Accepted: 09/12/2019] [Indexed: 11/13/2022] Open
Abstract
There is a growing interest in the pharmaceutical industry to design novel tailored drugs for RNA targeting. The vertebrate-specific RNase A superfamily is nowadays one of the best characterized family of enzymes and comprises proteins involved in host defense with specific cytotoxic and immune-modulatory properties. We observe within the family a structural variability at the substrate-binding site associated to a diversification of biological properties. In this work, we have analyzed the enzyme specificity at the secondary base binding site. Towards this end, we have performed a kinetic characterization of the canonical RNase types together with a molecular dynamic simulation of selected representative family members. The RNases' catalytic activity and binding interactions have been compared using UpA, UpG and UpI dinucleotides. Our results highlight an evolutionary trend from lower to higher order vertebrates towards an enhanced discrimination power of selectivity for adenine respect to guanine at the secondary base binding site (B2). Interestingly, the shift from guanine to adenine preference is achieved in all the studied family members by equivalent residues through distinct interaction modes. We can identify specific polar and charged side chains that selectively interact with donor or acceptor purine groups. Overall, we observe selective bidentate polar and electrostatic interactions: Asn to N1/N6 and N6/N7 adenine groups in mammals versus Glu/Asp and Arg to N1/N2, N1/O6 and O6/N7 guanine groups in non-mammals. In addition, kinetic and molecular dynamics comparative results on UpG versus UpI emphasize the main contribution of Glu/Asp interactions to N1/N2 group for guanine selectivity in lower order vertebrates. A close inspection at the B2 binding pocket also highlights the principal contribution of the protein ß6 and L4 loop regions. Significant differences in the orientation and extension of the L4 loop could explain how the same residues can participate in alternative binding modes. The analysis suggests that within the RNase A superfamily an evolution pressure has taken place at the B2 secondary binding site to provide novel substrate-recognition patterns. We are confident that a better knowledge of the enzymes' nucleotide recognition pattern would contribute to identify their physiological substrate and eventually design applied therapies to modulate their biological functions.
Collapse
Affiliation(s)
- Guillem Prats-Ejarque
- Department of Biochemistry and Molecular Biology, Faculty of Biosciences, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Lu Lu
- Department of Biochemistry and Molecular Biology, Faculty of Biosciences, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Vivian A Salazar
- Department of Biochemistry and Molecular Biology, Faculty of Biosciences, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Mohammed Moussaoui
- Department of Biochemistry and Molecular Biology, Faculty of Biosciences, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Ester Boix
- Department of Biochemistry and Molecular Biology, Faculty of Biosciences, Universitat Autònoma de Barcelona, Barcelona, Spain
| |
Collapse
|
13
|
Jedrzejczyk D, Chworos A. Self-Assembling RNA Nanoparticle for Gene Expression Regulation in a Model System. ACS Synth Biol 2019; 8:491-497. [PMID: 30649860 DOI: 10.1021/acssynbio.8b00319] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
In the search for enzymatically processed RNA fragments, we found the novel three-way junction motif. The structure prediction suggested the arrangement of helices at acute angle approx. 60°. This allows the design of a trimeric RNA nanoparticle that can be functionalized with multiple regulatory fragments. Such RNA nano-object of equilateral triangular shape was applied for gene expression regulation studies in two independent cellular systems. Biochemical and functional studies confirmed the predicted shape and structure of the nanoparticle. The regulatory siRNA fragments incorporated into the nanoparticle were effectively released and triggered gene silencing. The regulatory effect was prolonged when induced with structuralized RNA compared to unstructured siRNAs. In these studies, the enzymatic processing of the motif was utilized for function release from the nanoparticle, enabling simultaneous delivery of different regulatory functions. This methodology of sequence search, RNA structural prediction, and application for rational design opens a new way for creating enzymatically processed RNA nanoparticles.
Collapse
Affiliation(s)
- Dominika Jedrzejczyk
- Centre of Molecular and Macromolecular Studies , Polish Academy of Sciences , Sienkiewicza 112 , 90-363 Lodz , Poland
| | - Arkadiusz Chworos
- Centre of Molecular and Macromolecular Studies , Polish Academy of Sciences , Sienkiewicza 112 , 90-363 Lodz , Poland
| |
Collapse
|
14
|
The State of Long Non-Coding RNA Biology. Noncoding RNA 2018; 4:ncrna4030017. [PMID: 30103474 PMCID: PMC6162524 DOI: 10.3390/ncrna4030017] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 07/30/2018] [Accepted: 08/07/2018] [Indexed: 12/15/2022] Open
Abstract
Transcriptomic studies have demonstrated that the vast majority of the genomes of mammals and other complex organisms is expressed in highly dynamic and cell-specific patterns to produce large numbers of intergenic, antisense and intronic long non-protein-coding RNAs (lncRNAs). Despite well characterized examples, their scaling with developmental complexity, and many demonstrations of their association with cellular processes, development and diseases, lncRNAs are still to be widely accepted as major players in gene regulation. This may reflect an underappreciation of the extent and precision of the epigenetic control of differentiation and development, where lncRNAs appear to have a central role, likely as organizational and guide molecules: most lncRNAs are nuclear-localized and chromatin-associated, with some involved in the formation of specialized subcellular domains. I suggest that a reassessment of the conceptual framework of genetic information and gene expression in the 4-dimensional ontogeny of spatially organized multicellular organisms is required. Together with this and further studies on their biology, the key challenges now are to determine the structure–function relationships of lncRNAs, which may be aided by emerging evidence of their modular structure, the role of RNA editing and modification in enabling epigenetic plasticity, and the role of RNA signaling in transgenerational inheritance of experience.
Collapse
|
15
|
Rigden DJ, Fernández XM. The 2018 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2018; 46:D1-D7. [PMID: 29316735 PMCID: PMC5753253 DOI: 10.1093/nar/gkx1235] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 11/29/2017] [Indexed: 12/20/2022] Open
Abstract
The 2018 Nucleic Acids Research Database Issue contains 181 papers spanning molecular biology. Among them, 82 are new and 84 are updates describing resources that appeared in the Issue previously. The remaining 15 cover databases most recently published elsewhere. Databases in the area of nucleic acids include 3DIV for visualisation of data on genome 3D structure and RNArchitecture, a hierarchical classification of RNA families. Protein databases include the established SMART, ELM and MEROPS while GPCRdb and the newcomer STCRDab cover families of biomedical interest. In the area of metabolism, HMDB and Reactome both report new features while PULDB appears in NAR for the first time. This issue also contains reports on genomics resources including Ensembl, the UCSC Genome Browser and ENCODE. Update papers from the IUPHAR/BPS Guide to Pharmacology and DrugBank are highlights of the drug and drug target section while a number of proteomics databases including proteomicsDB are also covered. The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). The NAR online Molecular Biology Database Collection has been updated, reviewing 138 entries, adding 88 new resources and eliminating 47 discontinued URLs, bringing the current total to 1737 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
Collapse
Affiliation(s)
- Daniel J Rigden
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | | |
Collapse
|