1
|
Appasamy SD, Zirbel CL. R3DMCS: a web server for visualizing structural variation in RNA motifs across experimental 3D structures from the same organism or across species. Bioinformatics 2024; 40:btae682. [PMID: 39546379 PMCID: PMC11588024 DOI: 10.1093/bioinformatics/btae682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 10/17/2024] [Accepted: 11/13/2024] [Indexed: 11/17/2024] Open
Abstract
MOTIVATION The recent progress in RNA structure determination methods has resulted in a surge of newly solved RNA 3D structures. However, there is an absence of a user-friendly browser-based tool that can facilitate the comparison and visualization of RNA motifs across multiple 3D structures. RESULTS We introduce R3DMCS, a web server that allows users to compare selected RNA nucleotides across all 3D structures of a given molecule from a given species, or across all 3D structures mapped to a single Rfam family. Starting from one instance of the motif, R3DMCS retrieves, aligns, annotates, organizes, and displays 3D coordinates of corresponding sets of nucleotides from other 3D structures. With R3DMCS, one can explore conformational changes of motifs due to 3D structures being solved in different functional states or different experimental conditions. One can also investigate conservation of 3D structure across species, or changes in 3D structure due to changes in sequence. AVAILABILITY AND IMPLEMENTATION R3DMCS is open-source software and freely available at https://rna.bgsu.edu/correspondence/ and https://github.com/BGSU-RNA/RNA-3D-correspondence.
Collapse
Affiliation(s)
- Sri Devan Appasamy
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH 43403, United States
| | - Craig L Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, United States
| |
Collapse
|
2
|
Toews S, Wacker A, Faison EM, Duchardt-Ferner E, Richter C, Mathieu D, Bottaro S, Zhang Q, Schwalbe H. The 5'-terminal stem-loop RNA element of SARS-CoV-2 features highly dynamic structural elements that are sensitive to differences in cellular pH. Nucleic Acids Res 2024; 52:7971-7986. [PMID: 38842942 PMCID: PMC11260494 DOI: 10.1093/nar/gkae477] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 05/16/2024] [Accepted: 05/22/2024] [Indexed: 07/23/2024] Open
Abstract
We present the nuclear magnetic resonance spectroscopy (NMR) solution structure of the 5'-terminal stem loop 5_SL1 (SL1) of the SARS-CoV-2 genome. SL1 contains two A-form helical elements and two regions with non-canonical structure, namely an apical pyrimidine-rich loop and an asymmetric internal loop with one and two nucleotides at the 5'- and 3'-terminal part of the sequence, respectively. The conformational ensemble representing the averaged solution structure of SL1 was validated using NMR residual dipolar coupling (RDC) and small-angle X-ray scattering (SAXS) data. We show that the internal loop is the major binding site for fragments of low molecular weight. This internal loop of SL1 can be stabilized by an A12-C28 interaction that promotes the transient formation of an A+•C base pair. As a consequence, the pKa of the internal loop adenosine A12 is shifted to 5.8, compared to a pKa of 3.63 of free adenosine. Furthermore, applying a recently developed pH-differential mutational profiling (PD-MaP) approach, we not only recapitulated our NMR findings of SL1 but also unveiled multiple sites potentially sensitive to pH across the 5'-UTR of SARS-CoV-2.
Collapse
Affiliation(s)
- Sabrina Toews
- Institute of Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
- Center for Biomolecular Magnetic Resonance (BMRZ), Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
| | - Anna Wacker
- Institute of Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
- Center for Biomolecular Magnetic Resonance (BMRZ), Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
| | - Edgar M Faison
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC27599, USA
| | - Elke Duchardt-Ferner
- Center for Biomolecular Magnetic Resonance (BMRZ), Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
- Institute of Molecular Biosciences, Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
| | - Christian Richter
- Institute of Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
- Center for Biomolecular Magnetic Resonance (BMRZ), Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
| | - Daniel Mathieu
- Bruker BioSpin GmbH, Ettlingen, Baden-Württemberg 76275, Germany
| | - Sandro Bottaro
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| | - Qi Zhang
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC27599, USA
| | - Harald Schwalbe
- Institute of Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
- Center for Biomolecular Magnetic Resonance (BMRZ), Johann Wolfgang Goethe-University Frankfurt, Frankfurt/Main, Hesse 60438, Germany
| |
Collapse
|
3
|
Bohdan DR, Voronina VV, Bujnicki JM, Baulin EF. A comprehensive survey of long-range tertiary interactions and motifs in non-coding RNA structures. Nucleic Acids Res 2023; 51:8367-8382. [PMID: 37471030 PMCID: PMC10484739 DOI: 10.1093/nar/gkad605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 07/07/2023] [Indexed: 07/21/2023] Open
Abstract
Understanding the 3D structure of RNA is key to understanding RNA function. RNA 3D structure is modular and can be seen as a composition of building blocks of various sizes called tertiary motifs. Currently, long-range motifs formed between distant loops and helical regions are largely less studied than the local motifs determined by the RNA secondary structure. We surveyed long-range tertiary interactions and motifs in a non-redundant set of non-coding RNA 3D structures. A new dataset of annotated LOng-RAnge RNA 3D modules (LORA) was built using an approach that does not rely on the automatic annotations of non-canonical interactions. An original algorithm, ARTEM, was developed for annotation-, sequence- and topology-independent superposition of two arbitrary RNA 3D modules. The proposed methods allowed us to identify and describe the most common long-range RNA tertiary motifs. Along with the prevalent canonical A-minor interactions, a large number of previously undescribed staple interactions were observed. The most frequent long-range motifs were found to belong to three main motif families: planar staples, tilted staples, and helical packing motifs.
Collapse
Affiliation(s)
- Davyd R Bohdan
- Department of Innovation and High Technology, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Valeria V Voronina
- Department of Information Systems, Ulyanovsk State Technical University, Ulyanovsk 432027, Russia
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw 02-109, Poland
| | - Eugene F Baulin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw 02-109, Poland
| |
Collapse
|
4
|
Saon MS, Kirkpatrick CC, Znosko BM. Identification and characterization of RNA pentaloop sequence families. NAR Genom Bioinform 2023; 5:lqac102. [PMID: 36632613 PMCID: PMC9830547 DOI: 10.1093/nargab/lqac102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 10/28/2022] [Accepted: 12/12/2022] [Indexed: 01/11/2023] Open
Abstract
One of the current methods for predicting RNA tertiary structure is fragment-based homology, which predicts tertiary structure from secondary structure. For a successful prediction, this method requires a library of the tertiary structures of small motifs clipped from previously solved RNA 3D structures. Because of the limited number of available tertiary structures, it is not practical to find structures for all sequences of all motifs. Identifying sequence families for motifs can fill the gaps because all sequences within a family are expected to have similar structural features. Currently, a collection of well-characterized sequence families has been identified for tetraloops. Because of their prevalence and biological functions, pentaloop structures should also be well-characterized. In this study, 10 pentaloop sequence families are identified. For each family, the common and distinguishing structural features are highlighted. These sequence families can be used to predict the tertiary structure of pentaloop sequences for which a solved structure is not available.
Collapse
Affiliation(s)
- Md Sharear Saon
- Department of Chemistry, Saint Louis University, Saint Louis, MO 63103, USA
| | | | - Brent M Znosko
- Department of Chemistry, Saint Louis University, Saint Louis, MO 63103, USA
| |
Collapse
|
5
|
Moniot A, Guermeur Y, de Vries SJ, Chauvot de Beauchene I. ProtNAff: protein-bound Nucleic Acid filters and fragment libraries. Bioinformatics 2022; 38:3911-3917. [PMID: 35775902 DOI: 10.1093/bioinformatics/btac430] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 04/25/2022] [Accepted: 06/28/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Atomistic models of nucleic acids (NA) fragments can be used to model the 3D structures of specific protein-NA interactions and address the problem of great NA flexibility, especially in their single-stranded regions. One way to obtain relevant NA fragments is to extract them from existing 3D structures corresponding to the targeted context (e.g. specific 2D structures, protein families, sequences) and to learn from them. Several databases exist for specific NA 3D motifs, especially in RNA, but none can handle the variety of possible contexts. RESULTS This article presents protNAff (protein-bound Nucleic Acids filters and fragments), a new pipeline for the conception of searchable databases on the 2D and 3D structures of protein-bound NA, the selection of context-specific (regions of) NA structures by combinations of filters, and the creation of context-specific NA fragment libraries. The strength of this pipeline is its modularity, allowing users to adapt it to many specific modeling problems. As examples, the pipeline is applied to the quantitative analysis of (i) the sequence-specificity of trinucleotide conformations, (ii) the conformational diversity of RNA at several levels of resolution, (iii) the effect of protein binding on RNA local conformations and (iv) the protein-binding propensity of RNA hairpin loops of various lengths. AVAILABILITY AND IMPLEMENTATION The source code is freely available for download at URL https://github.com/isaureCdB/protNAff. The database and the trinucleotide fragment library are downloadable at URL https://zenodo.org/record/6483823#.YmbVhFxByV4. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Antoine Moniot
- LORIA (CNRS - INRIA - Université de Lorraine), Nancy 54000, France
| | - Yann Guermeur
- LORIA (CNRS - INRIA - Université de Lorraine), Nancy 54000, France
| | - Sjoerd Jacob de Vries
- Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris 75013, France.,BFA, CNRS UMR 8251, INSERM ERL U1133, Paris 75013, France
| | | |
Collapse
|
6
|
Zhao Q, Zhao Z, Fan X, Yuan Z, Mao Q, Yao Y. Review of machine learning methods for RNA secondary structure prediction. PLoS Comput Biol 2021; 17:e1009291. [PMID: 34437528 PMCID: PMC8389396 DOI: 10.1371/journal.pcbi.1009291] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Secondary structure plays an important role in determining the function of noncoding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagnated in the last decade. Recently, with the increasing availability of RNA structure data, new methods based on machine learning (ML) technologies, especially deep learning, have alleviated the issue. In this review, we provide a comprehensive overview of RNA secondary structure prediction methods based on ML technologies and a tabularized summary of the most important methods in this field. The current pending challenges in the field of RNA secondary structure prediction and future trends are also discussed.
Collapse
Affiliation(s)
- Qi Zhao
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, Liaoning, China
| | - Zheng Zhao
- School of Information Science and Technology, Dalian Maritime University, Dalian, Liaoning, China
| | - Xiaoya Fan
- School of Software, Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, Dalian University of Technology, Dalian, Liaoning, China
| | - Zhengwei Yuan
- Key Laboratory of Health Ministry for Congenital Malformation, Shengjing Hospital of China Medical University, Shenyang, Liaoning, China
| | - Qian Mao
- College of Light Industry, Liaoning University, Shenyang, Liaoning, China
- Key Laboratory of Agroproducts Processing Technology, Changchun University, Changchun, Jilin, China
| | - Yudong Yao
- Department of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, New Jersey, United States of America
| |
Collapse
|
7
|
Richardson KE, Adams MS, Kirkpatrick CC, Gohara DW, Znosko BM. Identification and Characterization of New RNA Tetraloop Sequence Families. Biochemistry 2019; 58:4809-4820. [PMID: 31714066 DOI: 10.1021/acs.biochem.9b00535] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
There is an abundance of RNA sequence information available due to the efforts of sequencing projects. However, current techniques implemented to solve the tertiary structures of RNA, such as NMR and X-ray crystallography, are difficult and time-consuming. Therefore, biophysical techniques are not able to keep pace with the abundance of sequence information available. Because of this, there is a need to develop quick and efficient ways to predict RNA tertiary structure from sequence. One promising approach is to identify structural patterns within previously solved 3D structures and apply these patterns to new sequences. RNA tetraloops are one of the most common naturally occurring secondary structure motifs. Here, we use RNA Characterization of Secondary Structure Motifs (CoSSMos), Dissecting the Spatial Structure of RNA (DSSR), and a bioinformatic approach to search for and characterize tertiary structure patterns among tetraloops. Not surprising, we identified the well-known GNRA and UNCG tetraloops, as well as the previously identified RNYA tetraloop. However, some previously identified characteristics of these families were not observed in this data set, and some new characteristics were identified. In addition, we also identified and characterized three new tetraloop sequence families: YGAR, UGGU, and RMSA. This new structural information sheds light on the tertiary structure of tetraloops and contributes to the efforts of RNA tertiary structure prediction from sequence.
Collapse
Affiliation(s)
- Katherine E Richardson
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Miranda S Adams
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Charles C Kirkpatrick
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - David W Gohara
- Department of Biochemistry and Molecular Biology , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Brent M Znosko
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| |
Collapse
|