101
|
Frezza E, Courban A, Allouche D, Sargueil B, Pasquali S. The interplay between molecular flexibility and RNA chemical probing reactivities analyzed at the nucleotide level via an extensive molecular dynamics study. Methods 2019; 162-163:108-127. [PMID: 31145972 DOI: 10.1016/j.ymeth.2019.05.021] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 05/22/2019] [Accepted: 05/22/2019] [Indexed: 12/20/2022] Open
Abstract
Determination of the tridimensional structure of ribonucleic acid molecules is fundamental for understanding their function in the cell. A common method to investigate RNA structures of large molecules is the use of chemical probes such as SHAPE (2'-hydroxyl acylation analyzed by primer extension) reagents, DMS (dimethyl sulfate) and CMCT (1-cyclohexyl-3-(2-morpholinoethyl) carbodiimide metho-p-toluene sulfate), the reaction of which is dependent on the local structural properties of each nucleotide. In order to understand the interplay between local flexibility, sugar pucker, canonical pairing and chemical reactivity of the probes, we performed all-atom molecular dynamics simulations on a set of RNA molecules for which both tridimensional structure and chemical probing data are available and we analyzed the correlations between geometrical parameters and the chemical reactivity. Our study confirms that SHAPE reactivity is guided by the local flexibility of the different chemical moieties but suggests that a combination of multiple parameters is needed to better understand the implications of the reactivity at the molecular level. This is also the case for DMS and CMCT for which the reactivity appears to be more complex than commonly accepted.
Collapse
Affiliation(s)
- Elisa Frezza
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France.
| | - Antoine Courban
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France
| | - Delphine Allouche
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France
| | - Bruno Sargueil
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France.
| | - Samuela Pasquali
- Faculté de Pharmacie de Paris, Laboratoire de Cristallographie et RMN Biologiques, UMR 8015 - CNRS, Université Paris Descartes, 4 Avenue de l'Observatoire 75270 PARIS CEDEX 06, France.
| |
Collapse
|
102
|
Olson WK, Li S, Kaukonen T, Colasanti AV, Xin Y, Lu XJ. Effects of Noncanonical Base Pairing on RNA Folding: Structural Context and Spatial Arrangements of G·A Pairs. Biochemistry 2019; 58:2474-2487. [PMID: 31008589 PMCID: PMC6729125 DOI: 10.1021/acs.biochem.9b00122] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Noncanonical base pairs play important roles in assembling the three-dimensional structures critical to the diverse functions of RNA. These associations contribute to the looped segments that intersperse the canonical double-helical elements within folded, globular RNA molecules. They stitch together various structural elements, serve as recognition elements for other molecules, and act as sites of intrinsic stiffness or deformability. This work takes advantage of new software (DSSR) designed to streamline the analysis and annotation of RNA three-dimensional structures. The multiscale structural information gathered for individual molecules, combined with the growing number of unique, well-resolved RNA structures, makes it possible to examine the collective features deeply and to uncover previously unrecognized patterns of chain organization. Here we focus on a subset of noncanonical base pairs involving guanine and adenine and the links between their modes of association, secondary structural context, and contributions to tertiary folding. The rigorous descriptions of base-pair geometry that we employ facilitate characterization of recurrent geometric motifs and the structural settings in which these arrangements occur. Moreover, the numerical parameters hint at the natural motions of the interacting bases and the pathways likely to connect different spatial forms. We draw attention to higher-order multiplexes involving two or more G·A pairs and the roles these associations appear to play in bridging different secondary structural units. The collective data reveal pairing propensities in base organization, secondary structural context, and deformability and serve as a starting point for further multiscale investigations and/or simulations of RNA folding.
Collapse
Affiliation(s)
- Wilma K. Olson
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Shuxiang Li
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Thomas Kaukonen
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Andrew V. Colasanti
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Yurong Xin
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Xiang-Jun Lu
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| |
Collapse
|
103
|
Sun S, Wu Q, Peng Z, Yang J. Enhanced prediction of RNA solvent accessibility with long short-term memory neural networks and improved sequence profiles. Bioinformatics 2019; 35:1686-1691. [PMID: 30321300 DOI: 10.1093/bioinformatics/bty876] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Revised: 09/11/2018] [Accepted: 10/13/2018] [Indexed: 02/21/2025] Open
Abstract
MOTIVATION The de novo prediction of RNA tertiary structure remains a grand challenge. Predicted RNA solvent accessibility provides an opportunity to address this challenge. To the best of our knowledge, there is only one method (RNAsnap) available for RNA solvent accessibility prediction. However, its performance is unsatisfactory for protein-free RNAs. RESULTS We developed RNAsol, a new algorithm to predict RNA solvent accessibility. RNAsol was built based on improved sequence profiles from the covariance models and trained with the long short-term memory (LSTM) neural networks. Independent tests on the same datasets from RNAsnap show that RNAsol achieves the mean Pearson's correlation coefficient (PCC) of 0.43/0.26 for the protein-bound/protein-free RNA molecules, which is 26.5%/136.4% higher than that of RNAsnap. When the training set is enlarged to include both types of RNAs, the PCCs increase to 0.49 and 0.46 for protein-bound and protein-free RNAs, respectively. The success of RNAsol is attributed to two aspects, including the improved sequence profiles constructed by the sequence-profile alignment and the enhanced training by the LSTM neural networks. AVAILABILITY AND IMPLEMENTATION http://yanglab.nankai.edu.cn/RNAsol/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Saisai Sun
- School of Mathematical Sciences, Nankai University, Tianjin, China
| | - Qi Wu
- School of Mathematical Sciences, Nankai University, Tianjin, China
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, China
| | - Jianyi Yang
- School of Mathematical Sciences, Nankai University, Tianjin, China
| |
Collapse
|
104
|
Wamhoff EC, Banal JL, Bricker WP, Shepherd TR, Parsons MF, Veneziano R, Stone MB, Jun H, Wang X, Bathe M. Programming Structured DNA Assemblies to Probe Biophysical Processes. Annu Rev Biophys 2019; 48:395-419. [PMID: 31084582 PMCID: PMC7035826 DOI: 10.1146/annurev-biophys-052118-115259] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Structural DNA nanotechnology is beginning to emerge as a widely accessible research tool to mechanistically study diverse biophysical processes. Enabled by scaffolded DNA origami in which a long single strand of DNA is weaved throughout an entire target nucleic acid assembly to ensure its proper folding, assemblies of nearly any geometric shape can now be programmed in a fully automatic manner to interface with biology on the 1-100-nm scale. Here, we review the major design and synthesis principles that have enabled the fabrication of a specific subclass of scaffolded DNA origami objects called wireframe assemblies. These objects offer unprecedented control over the nanoscale organization of biomolecules, including biomolecular copy numbers, presentation on convex or concave geometries, and internal versus external functionalization, in addition to stability in physiological buffer. To highlight the power and versatility of this synthetic structural biology approach to probing molecular and cellular biophysics, we feature its application to three leading areas of investigation: light harvesting and nanoscale energy transport, RNA structural biology, and immune receptor signaling, with an outlook toward unique mechanistic insight that may be gained in these areas in the coming decade.
Collapse
Affiliation(s)
- Eike-Christian Wamhoff
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - James L Banal
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - William P Bricker
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Tyson R Shepherd
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Molly F Parsons
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Rémi Veneziano
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Matthew B Stone
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Hyungmin Jun
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Xiao Wang
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| | - Mark Bathe
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
| |
Collapse
|
105
|
Abstract
Interactions between RNA and proteins are pervasive in biology, driving fundamental processes such as protein translation and participating in the regulation of gene expression. Modeling the energies of RNA-protein interactions is therefore critical for understanding and repurposing living systems but has been hindered by complexities unique to RNA-protein binding. Here, we bring together several advances to complete a calculation framework for RNA-protein binding affinities, including a unified free energy function for bound complexes, automated Rosetta modeling of mutations, and use of secondary structure-based energetic calculations to model unbound RNA states. The resulting Rosetta-Vienna RNP-ΔΔG method achieves root-mean-squared errors (RMSEs) of 1.3 kcal/mol on high-throughput MS2 coat protein-RNA measurements and 1.5 kcal/mol on an independent test set involving the signal recognition particle, human U1A, PUM1, and FOX-1. As a stringent test, the method achieves RMSE accuracy of 1.4 kcal/mol in blind predictions of hundreds of human PUM2-RNA relative binding affinities. Overall, these RMSE accuracies are significantly better than those attained by prior structure-based approaches applied to the same systems. Importantly, Rosetta-Vienna RNP-ΔΔG establishes a framework for further improvements in modeling RNA-protein binding that can be tested by prospective high-throughput measurements on new systems.
Collapse
|
106
|
Stasiewicz J, Mukherjee S, Nithin C, Bujnicki JM. QRNAS: software tool for refinement of nucleic acid structures. BMC STRUCTURAL BIOLOGY 2019; 19:5. [PMID: 30898165 PMCID: PMC6429776 DOI: 10.1186/s12900-019-0103-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 03/05/2019] [Indexed: 01/03/2023]
Abstract
BACKGROUND Computational models of RNA 3D structure often present various inaccuracies caused by simplifications used in structure prediction methods, such as template-based modeling or coarse-grained simulations. To obtain a high-quality model, the preliminary RNA structural model needs to be refined, taking into account atomic interactions. The goal of the refinement is not only to improve the local quality of the model but to bring it globally closer to the true structure. RESULTS We present QRNAS, a software tool for fine-grained refinement of nucleic acid structures, which is an extension of the AMBER simulation method with additional restraints. QRNAS is capable of handling RNA, DNA, chimeras, and hybrids thereof, and enables modeling of nucleic acids containing modified residues. CONCLUSIONS We demonstrate the ability of QRNAS to improve the quality of models generated with different methods. QRNAS was able to improve MolProbity scores of NMR structures, as well as of computational models generated in the course of the RNA-Puzzles experiment. The overall geometry improvement may be associated with increased model accuracy, especially on the level of correctly modeled base-pairs, but the systematic improvement of root mean square deviation to the reference structure should not be expected. The method has been integrated into a computational modeling workflow, enabling improved RNA 3D structure prediction.
Collapse
Affiliation(s)
- Juliusz Stasiewicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, 02-109 Warsaw, Poland
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, 02-109 Warsaw, Poland
| | - Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, 02-109 Warsaw, Poland
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, 02-109 Warsaw, Poland
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznań, Poland
| |
Collapse
|
107
|
Ponce-Salvatierra A, Astha, Merdas K, Nithin C, Ghosh P, Mukherjee S, Bujnicki JM. Computational modeling of RNA 3D structure based on experimental data. Biosci Rep 2019; 39:BSR20180430. [PMID: 30670629 PMCID: PMC6367127 DOI: 10.1042/bsr20180430] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Revised: 01/19/2019] [Accepted: 01/21/2019] [Indexed: 01/02/2023] Open
Abstract
RNA molecules are master regulators of cells. They are involved in a variety of molecular processes: they transmit genetic information, sense cellular signals and communicate responses, and even catalyze chemical reactions. As in the case of proteins, RNA function is dictated by its structure and by its ability to adopt different conformations, which in turn is encoded in the sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore the majority of known RNAs remain structurally uncharacterized. To address this problem, predictive computational methods were developed based on the accumulated knowledge of RNA structures determined so far, the physical basis of the RNA folding, and taking into account evolutionary considerations, such as conservation of functionally important motifs. However, all theoretical methods suffer from various limitations, and they are generally unable to accurately predict structures for RNA sequences longer than 100-nt residues unless aided by additional experimental data. In this article, we review experimental methods that can generate data usable by computational methods, as well as computational approaches for RNA structure prediction that can utilize data from experimental analyses. We outline methods and data types that can be potentially useful for RNA 3D structure modeling but are not commonly used by the existing software, suggesting directions for future development.
Collapse
Affiliation(s)
- Almudena Ponce-Salvatierra
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Astha
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Katarzyna Merdas
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Pritha Ghosh
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, Poznan PL-61-614, Poland
| |
Collapse
|
108
|
Giambasu GM, Case DA, York DM. Predicting Site-Binding Modes of Ions and Water to Nucleic Acids Using Molecular Solvation Theory. J Am Chem Soc 2019; 141:2435-2445. [PMID: 30632365 PMCID: PMC6574206 DOI: 10.1021/jacs.8b11474] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Site binding of ions and water shapes nucleic acids folding, dynamics, and biological function, complementing the more diffuse, nonspecific "territorial" ion binding. Unlike territorial binding, prediction of site-specific binding to nucleic acids remains an unsolved challenge in computational biophysics. This work presents a new toolset based on the 3D-RISM molecular solvation theory and topological analysis that predicts cation and water site binding to nucleic acids. 3D-RISM is shown to accurately capture alkali cations and water binding to the central channel, transversal loops, and grooves of the Oxytricha nova's telomeres' G-quadruplex ( Oxy-GQ), in agreement with high-resolution crystallographic data. To improve the computed cation occupancy along the Oxy-GQ central channel, it was necessary to refine and validate new cation-oxygen parameters using structural and thermodynamic data available for crown ethers and ion channels. This single set of parameters that describes both localized and delocalized binding to various biological systems is used to gain insight into cation occupancy along the Oxy-GQ channel under various salt conditions. The paper concludes with prospects for extending the method to predict divalent cation binding to nucleic acids. This work advances the forefront of theoretical methods able to provide predictive insight into ion atmosphere effects on nucleic acids function.
Collapse
Affiliation(s)
- George M. Giambasu
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, New Brunswick, New Jersey 08901, United States
| | - David A. Case
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, New Brunswick, New Jersey 08901, United States
| | - Darrin M. York
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, New Brunswick, New Jersey 08901, United States
- Laboratory for Biomolecular Simulation Research, Rutgers, The State University of New Jersey, New Brunswick, New Jersey 08901, United States
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, New Brunswick, New Jersey 08901, United States
| |
Collapse
|
109
|
Bottaro S, Bussi G, Pinamonti G, Reißer S, Boomsma W, Lindorff-Larsen K. Barnaba: software for analysis of nucleic acid structures and trajectories. RNA (NEW YORK, N.Y.) 2019; 25:219-231. [PMID: 30420522 PMCID: PMC6348988 DOI: 10.1261/rna.067678.118] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Accepted: 11/06/2018] [Indexed: 06/09/2023]
Abstract
RNA molecules are highly dynamic systems characterized by a complex interplay between sequence, structure, dynamics, and function. Molecular simulations can potentially provide powerful insights into the nature of these relationships. The analysis of structures and molecular trajectories of nucleic acids can be nontrivial because it requires processing very high-dimensional data that are not easy to visualize and interpret. Here we introduce Barnaba, a Python library aimed at facilitating the analysis of nucleic acid structures and molecular simulations. The software consists of a variety of analysis tools that allow the user to (i) calculate distances between three-dimensional structures using different metrics, (ii) back-calculate experimental data from three-dimensional structures, (iii) perform cluster analysis and dimensionality reductions, (iv) search three-dimensional motifs in PDB structures and trajectories, and (v) construct elastic network models for nucleic acids and nucleic acids-protein complexes. In addition, Barnaba makes it possible to calculate torsion angles, pucker conformations, and to detect base-pairing/base-stacking interactions. Barnaba produces graphics that conveniently visualize both extended secondary structure and dynamics for a set of molecular conformations. The software is available as a command-line tool as well as a library, and supports a variety of file formats such as PDB, dcd, and xtc files. Source code, documentation, and examples are freely available at https://github.com/srnas/barnaba under GNU GPLv3 license.
Collapse
Affiliation(s)
- Sandro Bottaro
- Structural Biology and NMR Laboratory and Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
- International School for Advanced Studies, 34136 Trieste, Italy
| | - Giovanni Bussi
- International School for Advanced Studies, 34136 Trieste, Italy
| | - Giovanni Pinamonti
- International School for Advanced Studies, 34136 Trieste, Italy
- Department of Mathematics and Computer Science, Freie Universität, 14195 Berlin, Germany
| | - Sabine Reißer
- International School for Advanced Studies, 34136 Trieste, Italy
| | - Wouter Boomsma
- Department of Computer Science, University of Copenhagen, Copenhagen 2200, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory and Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| |
Collapse
|
110
|
López-Blanco JR, Chacón P. KORP: knowledge-based 6D potential for fast protein and loop modeling. Bioinformatics 2019; 35:3013-3019. [DOI: 10.1093/bioinformatics/btz026] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 01/03/2019] [Accepted: 01/08/2019] [Indexed: 12/18/2022] Open
Abstract
Abstract
Motivation
Knowledge-based statistical potentials constitute a simpler and easier alternative to physics-based potentials in many applications, including folding, docking and protein modeling. Here, to improve the effectiveness of the current approximations, we attempt to capture the six-dimensional nature of residue–residue interactions from known protein structures using a simple backbone-based representation.
Results
We have developed KORP, a knowledge-based pairwise potential for proteins that depends on the relative position and orientation between residues. Using a minimalist representation of only three backbone atoms per residue, KORP utilizes a six-dimensional joint probability distribution to outperform state-of-the-art statistical potentials for native structure recognition and best model selection in recent critical assessment of protein structure prediction and loop-modeling benchmarks. Compared with the existing methods, our side-chain independent potential has a lower complexity and better efficiency. The superior accuracy and robustness of KORP represent a promising advance for protein modeling and refinement applications that require a fast but highly discriminative energy function.
Availability and implementation
http://chaconlab.org/modeling/korp.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Chemical Physics, Rocasolano Institute of Physical Chemistry C.S.I.C, Madrid, Spain
| | - Pablo Chacón
- Department of Biological Chemical Physics, Rocasolano Institute of Physical Chemistry C.S.I.C, Madrid, Spain
| |
Collapse
|
111
|
Dans PD, Gallego D, Balaceanu A, Darré L, Gómez H, Orozco M. Modeling, Simulations, and Bioinformatics at the Service of RNA Structure. Chem 2019. [DOI: 10.1016/j.chempr.2018.09.015] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
112
|
Schroeder SJ. Challenges and approaches to predicting RNA with multiple functional structures. RNA (NEW YORK, N.Y.) 2018; 24:1615-1624. [PMID: 30143552 PMCID: PMC6239171 DOI: 10.1261/rna.067827.118] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The revolution in sequencing technology demands new tools to interpret the genetic code. As in vivo transcriptome-wide chemical probing techniques advance, new challenges emerge in the RNA folding problem. The emphasis on one sequence folding into a single minimum free energy structure is fading as a new focus develops on generating RNA structural ensembles and identifying functional structural features in ensembles. This review describes an efficient combinatorially complete method and three free energy minimization approaches to predicting RNA structures with more than one functional fold, as well as two methods for analysis of a thermodynamics-based Boltzmann ensemble of structures. The review then highlights two examples of viral RNA 3'-UTR regions that fold into more than one conformation and have been characterized by single molecule fluorescence energy resonance transfer or NMR spectroscopy. These examples highlight the different approaches and challenges in predicting structure and function from sequence for RNA with multiple biological roles and folds. More well-defined examples and new metrics for measuring differences in RNA structures will guide future improvements in prediction of RNA structure and function from sequence.
Collapse
Affiliation(s)
- Susan J Schroeder
- Department of Chemistry and Biochemistry, Department of Microbiology and Plant Biology, University of Oklahoma, Norman, Oklahoma 73019, USA
| |
Collapse
|
113
|
Mailler E, Paillart JC, Marquet R, Smyth RP, Vivet-Boudou V. The evolution of RNA structural probing methods: From gels to next-generation sequencing. WILEY INTERDISCIPLINARY REVIEWS-RNA 2018; 10:e1518. [PMID: 30485688 DOI: 10.1002/wrna.1518] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 09/13/2018] [Accepted: 10/17/2018] [Indexed: 01/09/2023]
Abstract
RNA molecules are important players in all domains of life and the study of the relationship between their multiple flexible states and the associated biological roles has increased in recent years. For several decades, chemical and enzymatic structural probing experiments have been used to determine RNA structure. During this time, there has been a steady improvement in probing reagents and experimental methods, and today the structural biologist community has a large range of tools at its disposal to probe the secondary structure of RNAs in vitro and in cells. Early experiments used radioactive labeling and polyacrylamide gel electrophoresis as read-out methods. This was superseded by capillary electrophoresis, and more recently by next-generation sequencing. Today, powerful structural probing methods can characterize RNA structure on a genome-wide scale. In this review, we will provide an overview of RNA structural probing methodologies from a historical and technical perspective. This article is categorized under: RNA Structure and Dynamics > RNA Structure, Dynamics, and Chemistry RNA Methods > RNA Analyses in vitro and In Silico RNA Methods > RNA Analyses in Cells.
Collapse
Affiliation(s)
- Elodie Mailler
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| | | | - Roland Marquet
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| | - Redmond P Smyth
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| | - Valerie Vivet-Boudou
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| |
Collapse
|
114
|
Kappel K, Das R. Sampling Native-like Structures of RNA-Protein Complexes through Rosetta Folding and Docking. Structure 2018; 27:140-151.e5. [PMID: 30416038 DOI: 10.1016/j.str.2018.10.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 08/27/2018] [Accepted: 10/05/2018] [Indexed: 10/27/2022]
Abstract
RNA-protein complexes underlie numerous cellular processes including translation, splicing, and posttranscriptional regulation of gene expression. The structures of these complexes are crucial to their functions but often elude high-resolution structure determination. Computational methods are needed that can integrate low-resolution data for RNA-protein complexes while modeling de novo the large conformational changes of RNA components upon complex formation. To address this challenge, we describe RNP-denovo, a Rosetta method to simultaneously fold-and-dock RNA to a protein surface. On a benchmark set of diverse RNA-protein complexes not solvable with prior strategies, RNP-denovo consistently sampled native-like structures with better than nucleotide resolution. We revisited three past blind modeling challenges involving the spliceosome, telomerase, and a methyltransferase-ribosomal RNA complex in which previous methods gave poor results. When coupled with the same sparse FRET, crosslinking, and functional data used previously, RNP-denovo gave models with significantly improved accuracy. These results open a route to modeling global folds of RNA-protein complexes from low-resolution data.
Collapse
Affiliation(s)
- Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Rhiju Das
- Biophysics Program, Stanford University, Stanford, CA 94305, USA; Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Physics, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
115
|
Li J, Zhu W, Wang J, Li W, Gong S, Zhang J, Wang W. RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks. PLoS Comput Biol 2018; 14:e1006514. [PMID: 30481171 PMCID: PMC6258470 DOI: 10.1371/journal.pcbi.1006514] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 09/14/2018] [Indexed: 11/18/2022] Open
Abstract
Quality assessment is essential for the computational prediction and design of RNA tertiary structures. To date, several knowledge-based statistical potentials have been proposed and proved to be effective in identifying native and near-native RNA structures. All these potentials are based on the inverse Boltzmann formula, while differing in the choice of the geometrical descriptor, reference state, and training dataset. Via an approach that diverges completely from the conventional statistical potentials, our work explored the power of a 3D convolutional neural network (CNN)-based approach as a quality evaluator for RNA 3D structures, which used a 3D grid representation of the structure as input without extracting features manually. The RNA structures were evaluated by examining each nucleotide, so our method can also provide local quality assessment. Two sets of training samples were built. The first one included 1 million samples generated by high-temperature molecular dynamics (MD) simulations and the second one included 1 million samples generated by Monte Carlo (MC) structure prediction. Both MD and MC procedures were performed for a non-redundant set of 414 RNAs. For two training datasets (one including only MD training samples and the other including both MD and MC training samples), we trained two neural networks, named RNA3DCNN_MD and RNA3DCNN_MDMC, respectively. The former is suitable for assessing near-native structures, while the latter is suitable for assessing structures covering large structural space. We tested the performance of our method and made comparisons with four other traditional scoring functions. On two of three test datasets, our method performed similarly to the state-of-the-art traditional scoring function, and on the third test dataset, our method was far superior to other scoring functions. Our method can be downloaded from https://github.com/lijunRNA/RNA3DCNN.
Collapse
Affiliation(s)
- Jun Li
- National Laboratory of Solid State Microstructures, School of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, China
| | - Wei Zhu
- National Laboratory of Solid State Microstructures, School of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, China
| | - Jun Wang
- National Laboratory of Solid State Microstructures, School of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, China
- State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
| | - Wenfei Li
- National Laboratory of Solid State Microstructures, School of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, China
| | - Sheng Gong
- Department of Pharmaceutics, Nanjing General Hospital, Nanjing University Medical School, Nanjing, China
| | - Jian Zhang
- National Laboratory of Solid State Microstructures, School of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, China
- State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
| | - Wei Wang
- National Laboratory of Solid State Microstructures, School of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, China
| |
Collapse
|
116
|
Abstract
The past decades have witnessed tremendous developments in our understanding of RNA biology. At the core of these advances have been studies aimed at discerning RNA structure and at understanding the forces that influence the RNA folding process. It is easy to take the present state of understanding for granted, but there is much to be learned by considering the path to our current understanding, which has been tortuous, with the birth and death of models, the adaptation of experimental tools originally developed for characterization of protein structure and catalysis, and the development of novel tools for probing RNA. In this review we tour the stages of RNA folding studies, considering them as "epochs" that can be generalized across scientific disciplines. These epochs span from the discovery of catalytic RNA, through biophysical insights into the putative primordial RNA World, to characterization of structured RNAs, the building and testing of models, and, finally, to the development of models with the potential to yield generalizable predictive and quantitative models for RNA conformational, thermodynamic, and kinetic behavior. We hope that this accounting will aid others as they navigate the many fascinating questions about RNA and its roles in biology, in the past, present, and future.
Collapse
Affiliation(s)
- Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, California 94305
- Department of Chemical Engineering, Stanford University, Stanford, California 94305
- Department of Chemistry, Stanford University, Stanford, California 94305
- Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health), Stanford, California 94305
| | - Steve Bonilla
- Department of Biochemistry, Stanford University, Stanford, California 94305
- Department of Chemical Engineering, Stanford University, Stanford, California 94305
| | - Namita Bisaria
- Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02142
| |
Collapse
|
117
|
Bevilacqua PC, Assmann SM. Technique Development for Probing RNA Structure In Vivo and Genome-Wide. Cold Spring Harb Perspect Biol 2018; 10:a032250. [PMID: 30275275 PMCID: PMC6169808 DOI: 10.1101/cshperspect.a032250] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
How organisms perceive and respond to their surroundings is one of the great questions in biology. It is clear that RNA plays key roles in sensing. Cellular and environmental cues that RNA responds to include temperature, ions, metabolites, and biopolymers. Recent advances in next-generation sequencing and in vivo chemical probing have provided unprecedented insights into RNA folding in vivo and genome-wide. Patterns of chemical reactivity have implicated control of gene expression by RNA and aided prediction of RNA structure. Central to these advances has been development of molecular biological and chemical techniques. Key advances are improvements in the quality, cost, and throughput of library preparation; availability of a wider array of chemicals for probing RNA structure in vivo; and robustness and user friendliness of data analysis. Insights from probing transcriptomes and future directions are provided.
Collapse
Affiliation(s)
- Philip C Bevilacqua
- Departments of Chemistry and Biochemistry & Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802
| |
Collapse
|
118
|
Nithin C, Ghosh P, Bujnicki JM. Bioinformatics Tools and Benchmarks for Computational Docking and 3D Structure Prediction of RNA-Protein Complexes. Genes (Basel) 2018; 9:genes9090432. [PMID: 30149645 PMCID: PMC6162694 DOI: 10.3390/genes9090432] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Revised: 07/26/2018] [Accepted: 08/21/2018] [Indexed: 12/29/2022] Open
Abstract
RNA-protein (RNP) interactions play essential roles in many biological processes, such as regulation of co-transcriptional and post-transcriptional gene expression, RNA splicing, transport, storage and stabilization, as well as protein synthesis. An increasing number of RNP structures would aid in a better understanding of these processes. However, due to the technical difficulties associated with experimental determination of macromolecular structures by high-resolution methods, studies on RNP recognition and complex formation present significant challenges. As an alternative, computational prediction of RNP interactions can be carried out. Structural models obtained by theoretical predictive methods are, in general, less reliable compared to models based on experimental measurements but they can be sufficiently accurate to be used as a basis for to formulating functional hypotheses. In this article, we present an overview of computational methods for 3D structure prediction of RNP complexes. We discuss currently available methods for macromolecular docking and for scoring 3D structural models of RNP complexes in particular. Additionally, we also review benchmarks that have been developed to assess the accuracy of these methods.
Collapse
Affiliation(s)
- Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
| | - Pritha Ghosh
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, PL-61-614 Poznan, Poland.
| |
Collapse
|
119
|
Antczak M, Zok T, Osowiecki M, Popenda M, Adamiak RW, Szachniuk M. RNAfitme: a webserver for modeling nucleobase and nucleoside residue conformation in fixed-backbone RNA structures. BMC Bioinformatics 2018; 19:304. [PMID: 30134831 PMCID: PMC6106928 DOI: 10.1186/s12859-018-2317-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Accepted: 08/16/2018] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Computational RNA 3D structure prediction and modeling are rising as complementary approaches to high-resolution experimental techniques for structure determination. They often apply to substitute or complement them. Recently, researchers' interests have directed towards in silico methods to fit, remodel and refine RNA tertiary structure models. Their power lies in a problem-specific exploration of RNA conformational space and efficient optimization procedures. The aim is to improve the accuracy of models obtained either computationally or experimentally. RESULTS Here, we present RNAfitme, a versatile webserver tool for remodeling of nucleobase- and nucleoside residue conformations in the fixed-backbone RNA 3D structures. Our approach makes use of dedicated libraries that define RNA conformational space. They have been built upon torsional angle characteristics of PDB-deposited RNA structures. RNAfitme can be applied to reconstruct full-atom model of RNA from its backbone; remodel user-selected nucleobase/nucleoside residues in a given RNA structure; predict RNA 3D structure based on the sequence and the template of a homologous molecule of the same size; refine RNA 3D model by reducing steric clashes indicated during structure quality assessment. RNAfitme is a publicly available tool with an intuitive interface. It is freely accessible at http://rnafitme.cs.put.poznan.pl/ CONCLUSIONS: RNAfitme has been applied in various RNA 3D remodeling scenarios for several types of input data. Computational experiments proved its efficiency, accuracy, and usefulness in the processing of RNAs of any size. Fidelity of RNAfitme predictions has been thoroughly tested for RNA 3D structures determined experimentally and modeled in silico.
Collapse
Affiliation(s)
- Maciej Antczak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland.,Poznan Supercomputing and Networking Center, Jana Pawla II 10, 61-139, Poznan, Poland
| | - Maciej Osowiecki
- Department of Biology, Adam Mickiewicz University, Umultowska 89, 61-614, Poznan, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Ryszard W Adamiak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland. .,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland.
| |
Collapse
|
120
|
Sieradzan AK, Golon Ł, Liwo A. Prediction of DNA and RNA structure with the NARES-2P force field and conformational space annealing. Phys Chem Chem Phys 2018; 20:19656-19663. [PMID: 30014063 DOI: 10.1039/c8cp03018a] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
A physics-based method for the prediction of the structures of nucleic acids, which is based on the physics-based 2-bead NARES-2P model of polynucleotides and global-optimization Conformational Space Annealing (CSA) algorithm has been proposed. The target structure is sought as the global-energy-minimum structure, which ignores the entropy component of the free energy but spares expensive multicanonical simulations necessary to find the conformational ensemble with the lowest free energy. The CSA algorithm has been modified to optimize its performance when treating both single and multi-chain nucleic acids. It was shown that the method finds the native fold for simple RNA molecules and DNA duplexes and with limited distance restraints, which can easily be obtained from the secondary-structure-prediction servers, complex RNA folds can be treated with using moderate computer resources.
Collapse
Affiliation(s)
- Adam K Sieradzan
- Faculty of Chemistry, University of Gdańsk, 80-308 Gdańsk, Poland.
| | | | | |
Collapse
|
121
|
Shi YZ, Jin L, Feng CJ, Tan YL, Tan ZJ. Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions. PLoS Comput Biol 2018; 14:e1006222. [PMID: 29879103 PMCID: PMC6007934 DOI: 10.1371/journal.pcbi.1006222] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 06/19/2018] [Accepted: 05/22/2018] [Indexed: 01/30/2023] Open
Abstract
RNA pseudoknots are a kind of minimal RNA tertiary structural motifs, and their three-dimensional (3D) structures and stability play essential roles in a variety of biological functions. Therefore, to predict 3D structures and stability of RNA pseudoknots is essential for understanding their functions. In the work, we employed our previously developed coarse-grained model with implicit salt to make extensive predictions and comprehensive analyses on the 3D structures and stability for RNA pseudoknots in monovalent/divalent ion solutions. The comparisons with available experimental data show that our model can successfully predict the 3D structures of RNA pseudoknots from their sequences, and can also make reliable predictions for the stability of RNA pseudoknots with different lengths and sequences over a wide range of monovalent/divalent ion concentrations. Furthermore, we made comprehensive analyses on the unfolding pathway for various RNA pseudoknots in ion solutions. Our analyses for extensive pseudokonts and the wide range of monovalent/divalent ion concentrations verify that the unfolding pathway of RNA pseudoknots is mainly dependent on the relative stability of unfolded intermediate states, and show that the unfolding pathway of RNA pseudoknots can be significantly modulated by their sequences and solution ion conditions.
Collapse
Affiliation(s)
- Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Lei Jin
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Chen-Jie Feng
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| |
Collapse
|
122
|
Spasic A, Kennedy SD, Needham L, Manoharan M, Kierzek R, Turner DH, Mathews DH. Molecular dynamics correctly models the unusual major conformation of the GAGU RNA internal loop and with NMR reveals an unusual minor conformation. RNA (NEW YORK, N.Y.) 2018; 24:656-672. [PMID: 29434035 PMCID: PMC5900564 DOI: 10.1261/rna.064527.117] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 01/19/2018] [Indexed: 05/08/2023]
Abstract
The RNA "GAGU" duplex, (5'GACGAGUGUCA)2, contains the internal loop (5'-GAGU-3')2 , which has two conformations in solution as determined by NMR spectroscopy. The major conformation has a loop structure consisting of trans-Watson-Crick/Hoogsteen GG pairs, A residues stacked on each other, U residues bulged outside the helix, and all sugars with a C2'-endo conformation. This differs markedly from the internal loops, (5'-GAGC-3')2, (5'-AAGU-3')2, and (5'-UAGG-3')2, which all have cis-Watson-Crick/Watson-Crick AG "imino" pairs flanked by cis-Watson-Crick/Watson-Crick canonical pairs resulting in maximal hydrogen bonding. Here, molecular dynamics was used to test whether the Amber force field (ff99 + bsc0 + OL3) approximates molecular interactions well enough to keep stable the unexpected conformation of the GAGU major duplex structure and the NMR structures of the duplexes containing (5'-GAGC-3')2, (5'-AAGU-3')2, and (5'-UAGG-3')2 internal loops. One-microsecond simulations were repeated four times for each of the duplexes starting in their NMR conformations. With the exception of (5'-UAGG-3')2, equivalent simulations were also run starting with alternative conformations. Results indicate that the Amber force field keeps the NMR conformations of the duplexes stable for at least 1 µsec. They also demonstrate an unexpected minor conformation for the (5'-GAGU-3')2 loop that is consistent with newly measured NMR spectra of duplexes with natural and modified nucleotides. Thus, unrestrained simulations led to the determination of the previously unknown minor conformation. The stability of the native (5'-GAGU-3')2 internal loop as compared to other loops can be explained by changes in hydrogen bonding and stacking as the flanking bases are changed.
Collapse
Affiliation(s)
- Aleksandar Spasic
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Scott D Kennedy
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Laura Needham
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Muthiah Manoharan
- Department of Discovery, Alnylam Pharmaceuticals, Cambridge, Massachusetts 02142, USA
| | - Ryszard Kierzek
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan Noskowskiego, Poland
| | - Douglas H Turner
- Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
- Department of Chemistry, University of Rochester, Rochester, New York 14627-0216, USA
| | - David H Mathews
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| |
Collapse
|
123
|
Watkins AM, Geniesse C, Kladwang W, Zakrevsky P, Jaeger L, Das R. Blind prediction of noncanonical RNA structure at atomic accuracy. SCIENCE ADVANCES 2018; 4:eaar5316. [PMID: 29806027 PMCID: PMC5969821 DOI: 10.1126/sciadv.aar5316] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Accepted: 04/17/2018] [Indexed: 05/26/2023]
Abstract
Prediction of RNA structure from nucleotide sequence remains an unsolved grand challenge of biochemistry and requires distinct concepts from protein structure prediction. Despite extensive algorithmic development in recent years, modeling of noncanonical base pairs of new RNA structural motifs has not been achieved in blind challenges. We report a stepwise Monte Carlo (SWM) method with a unique add-and-delete move set that enables predictions of noncanonical base pairs of complex RNA structures. A benchmark of 82 diverse motifs establishes the method's general ability to recover noncanonical pairs ab initio, including multistrand motifs that have been refractory to prior approaches. In a blind challenge, SWM models predicted nucleotide-resolution chemical mapping and compensatory mutagenesis experiments for three in vitro selected tetraloop/receptors with previously unsolved structures (C7.2, C7.10, and R1). As a final test, SWM blindly and correctly predicted all noncanonical pairs of a Zika virus double pseudoknot during a recent community-wide RNA-Puzzle. Stepwise structure formation, as encoded in the SWM method, enables modeling of noncanonical RNA structure in a variety of previously intractable problems.
Collapse
Affiliation(s)
- Andrew M. Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Paul Zakrevsky
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California at Santa Barbara, Santa Barbara, CA 93106, USA
| | - Luc Jaeger
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California at Santa Barbara, Santa Barbara, CA 93106, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
- Department of Physics, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
124
|
Baronti L, Karlsson H, Marušič M, Petzold K. A guide to large-scale RNA sample preparation. Anal Bioanal Chem 2018; 410:3239-3252. [PMID: 29546546 PMCID: PMC5937877 DOI: 10.1007/s00216-018-0943-8] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Revised: 01/25/2018] [Accepted: 02/05/2018] [Indexed: 12/30/2022]
Abstract
RNA is becoming more important as an increasing number of functions, both regulatory and enzymatic, are being discovered on a daily basis. As the RNA boom has just begun, most techniques are still in development and changes occur frequently. To understand RNA functions, revealing the structure of RNA is of utmost importance, which requires sample preparation. We review the latest methods to produce and purify a variation of RNA molecules for different purposes with the main focus on structural biology and biophysics. We present a guide aimed at identifying the most suitable method for your RNA and your biological question and highlighting the advantages of different methods. Graphical abstract In this review we present different methods for large-scale production and purification of RNAs for structural and biophysical studies.
Collapse
Affiliation(s)
- Lorenzo Baronti
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Scheeles Väg 2, 17177, Stockholm, Sweden
| | - Hampus Karlsson
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Scheeles Väg 2, 17177, Stockholm, Sweden
| | - Maja Marušič
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Scheeles Väg 2, 17177, Stockholm, Sweden
| | - Katja Petzold
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Scheeles Väg 2, 17177, Stockholm, Sweden.
| |
Collapse
|
125
|
Šponer J, Bussi G, Krepl M, Banáš P, Bottaro S, Cunha RA, Gil-Ley A, Pinamonti G, Poblete S, Jurečka P, Walter NG, Otyepka M. RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview. Chem Rev 2018; 118:4177-4338. [PMID: 29297679 PMCID: PMC5920944 DOI: 10.1021/acs.chemrev.7b00427] [Citation(s) in RCA: 386] [Impact Index Per Article: 55.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Indexed: 12/14/2022]
Abstract
With both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules. We begin with an in-depth, evaluatory coverage of the most fundamental methodological challenges that set the basis for the future development of the field, in particular, the current developments and inherent physical limitations of the atomistic force fields and the recent advances in a broad spectrum of enhanced sampling methods. We also survey the closely related field of coarse-grained modeling of RNA systems. After dealing with the methodological aspects, we provide an exhaustive overview of the available RNA simulation literature, ranging from studies of the smallest RNA oligonucleotides to investigations of the entire ribosome. Our review encompasses tetranucleotides, tetraloops, a number of small RNA motifs, A-helix RNA, kissing-loop complexes, the TAR RNA element, the decoding center and other important regions of the ribosome, as well as assorted others systems. Extended sections are devoted to RNA-ion interactions, ribozymes, riboswitches, and protein/RNA complexes. Our overview is written for as broad of an audience as possible, aiming to provide a much-needed interdisciplinary bridge between computation and experiment, together with a perspective on the future of the field.
Collapse
Affiliation(s)
- Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences , Kralovopolska 135 , Brno 612 65 , Czech Republic
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences , Kralovopolska 135 , Brno 612 65 , Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Pavel Banáš
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Sandro Bottaro
- Structural Biology and NMR Laboratory, Department of Biology , University of Copenhagen , Copenhagen 2200 , Denmark
| | - Richard A Cunha
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Alejandro Gil-Ley
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Giovanni Pinamonti
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Simón Poblete
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Petr Jurečka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Nils G Walter
- Single Molecule Analysis Group and Center for RNA Biomedicine, Department of Chemistry , University of Michigan , Ann Arbor , Michigan 48109 , United States
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| |
Collapse
|
126
|
Antczak M, Popenda M, Zok T, Zurkowski M, Adamiak RW, Szachniuk M. New algorithms to represent complex pseudoknotted RNA structures in dot-bracket notation. Bioinformatics 2018; 34:1304-1312. [PMID: 29236971 PMCID: PMC5905660 DOI: 10.1093/bioinformatics/btx783] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Revised: 10/23/2017] [Accepted: 12/08/2017] [Indexed: 11/12/2022] Open
Abstract
Motivation Understanding the formation, architecture and roles of pseudoknots in RNA structures are one of the most difficult challenges in RNA computational biology and structural bioinformatics. Methods predicting pseudoknots typically perform this with poor accuracy, often despite experimental data incorporation. Existing bioinformatic approaches differ in terms of pseudoknots' recognition and revealing their nature. A few ways of pseudoknot classification exist, most common ones refer to a genus or order. Following the latter one, we propose new algorithms that identify pseudoknots in RNA structure provided in BPSEQ format, determine their order and encode in dot-bracket-letter notation. The proposed encoding aims to illustrate the hierarchy of RNA folding. Results New algorithms are based on dynamic programming and hybrid (combining exhaustive search and random walk) approaches. They evolved from elementary algorithm implemented within the workflow of RNA FRABASE 1.0, our database of RNA structure fragments. They use different scoring functions to rank dissimilar dot-bracket representations of RNA structure. Computational experiments show an advantage of new methods over the others, especially for large RNA structures. Availability and implementation Presented algorithms have been implemented as new functionality of RNApdbee webserver and are ready to use at http://rnapdbee.cs.put.poznan.pl. Contact mszachniuk@cs.put.poznan.pl. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Maciej Antczak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Poznan Supercomputing and Networking Center, Poznan, Poland
| | - Michal Zurkowski
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Ryszard W Adamiak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| |
Collapse
|
127
|
Berger KD, Kennedy SD, Schroeder SJ, Znosko BM, Sun H, Mathews DH, Turner DH. Surprising Sequence Effects on GU Closure of Symmetric 2 × 2 Nucleotide RNA Internal Loops. Biochemistry 2018; 57:2121-2131. [PMID: 29570276 PMCID: PMC5963885 DOI: 10.1021/acs.biochem.7b01306] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
GU base pairs are important RNA structural motifs and often close loops. Accurate prediction of RNA structures relies upon understanding the interactions determining structure. The thermodynamics of some 2 × 2 nucleotide internal loops closed by GU pairs are not well understood. Here, several self-complementary oligonucleotide sequences expected to form duplexes with 2 × 2 nucleotide internal loops closed by GU pairs were investigated. Surprisingly, nuclear magnetic resonance revealed that many of the sequences exist in equilibrium between hairpin and duplex conformations. This equilibrium is not observed with loops closed by Watson-Crick pairs. To measure the thermodynamics of some 2 × 2 nucleotide internal loops closed by GU pairs, non-self-complementary sequences that preclude formation of hairpins were designed. The measured thermodynamics indicate that some internal loops closed by GU pairs are unusually unstable. This instability accounts for the observed equilibria between duplex and hairpin conformations. Moreover, it suggests that future three-dimensional structures of loops closed by GU pairs may reveal interactions that unexpectedly destabilize folding.
Collapse
Affiliation(s)
- Kyle D. Berger
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
- Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
| | - Scott D. Kennedy
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
- Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
| | | | - Brent M. Znosko
- Department of Chemistry, Saint Louis University, St. Louis MO 63103
| | - Hongying Sun
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
- Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
| | - David H. Mathews
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
- Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
| | - Douglas H. Turner
- Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642
- Department of Chemistry, University of Rochester, Rochester, NY 14627
| |
Collapse
|
128
|
Leppek K, Das R, Barna M. Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them. Nat Rev Mol Cell Biol 2018; 19:158-174. [PMID: 29165424 PMCID: PMC5820134 DOI: 10.1038/nrm.2017.103] [Citation(s) in RCA: 578] [Impact Index Per Article: 82.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.
Collapse
Affiliation(s)
- Kathrin Leppek
- Department of Developmental Biology, Stanford University, Stanford, California 94305, USA
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Departments of Biochemistry and Physics, Stanford University, Stanford, California 94305, USA
| | - Maria Barna
- Department of Developmental Biology, Stanford University, Stanford, California 94305, USA
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
129
|
Poblete S, Bottaro S, Bussi G. A nucleobase-centered coarse-grained representation for structure prediction of RNA motifs. Nucleic Acids Res 2018; 46:1674-1683. [PMID: 29272539 PMCID: PMC5829650 DOI: 10.1093/nar/gkx1269] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Revised: 12/05/2017] [Accepted: 12/07/2017] [Indexed: 01/30/2023] Open
Abstract
We introduce the SPlit-and-conQueR (SPQR) model, a coarse-grained (CG) representation of RNA designed for structure prediction and refinement. In our approach, the representation of a nucleotide consists of a point particle for the phosphate group and an anisotropic particle for the nucleoside. The interactions are, in principle, knowledge-based potentials inspired by the $\mathcal {E}$SCORE function, a base-centered scoring function. However, a special treatment is given to base-pairing interactions and certain geometrical conformations which are lost in a raw knowledge-based model. This results in a representation able to describe planar canonical and non-canonical base pairs and base-phosphate interactions and to distinguish sugar puckers and glycosidic torsion conformations. The model is applied to the folding of several structures, including duplexes with internal loops of non-canonical base pairs, tetraloops, junctions and a pseudoknot. For the majority of these systems, experimental structures are correctly predicted at the level of individual contacts. We also propose a method for efficiently reintroducing atomistic detail from the CG representation.
Collapse
Affiliation(s)
- Simón Poblete
- Scuola Internazionale Superiore di Studi Avanzati, 265, Via Bonomea I-34136 Trieste, Italy
| | - Sandro Bottaro
- Scuola Internazionale Superiore di Studi Avanzati, 265, Via Bonomea I-34136 Trieste, Italy
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, 265, Via Bonomea I-34136 Trieste, Italy
| |
Collapse
|
130
|
Antunes D, Jorge NAN, Caffarena ER, Passetti F. Using RNA Sequence and Structure for the Prediction of Riboswitch Aptamer: A Comprehensive Review of Available Software and Tools. Front Genet 2018; 8:231. [PMID: 29403526 PMCID: PMC5780412 DOI: 10.3389/fgene.2017.00231] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 12/21/2017] [Indexed: 12/14/2022] Open
Abstract
RNA molecules are essential players in many fundamental biological processes. Prokaryotes and eukaryotes have distinct RNA classes with specific structural features and functional roles. Computational prediction of protein structures is a research field in which high confidence three-dimensional protein models can be proposed based on the sequence alignment between target and templates. However, to date, only a few approaches have been developed for the computational prediction of RNA structures. Similar to proteins, RNA structures may be altered due to the interaction with various ligands, including proteins, other RNAs, and metabolites. A riboswitch is a molecular mechanism, found in the three kingdoms of life, in which the RNA structure is modified by the binding of a metabolite. It can regulate multiple gene expression mechanisms, such as transcription, translation initiation, and mRNA splicing and processing. Due to their nature, these entities also act on the regulation of gene expression and detection of small metabolites and have the potential to helping in the discovery of new classes of antimicrobial agents. In this review, we describe software and web servers currently available for riboswitch aptamer identification and secondary and tertiary structure prediction, including applications.
Collapse
Affiliation(s)
- Deborah Antunes
- Scientific Computing Program (PROCC), Computational Biophysics and Molecular Modeling Group, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Natasha A N Jorge
- Laboratory of Functional Genomics and Bioinformatics, Oswaldo Cruz Institute, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil.,Laboratory of Gene Expression Regulation, Carlos Chagas Institute, Fundação Oswaldo Cruz, Curitiba, Brazil
| | - Ernesto R Caffarena
- Scientific Computing Program (PROCC), Computational Biophysics and Molecular Modeling Group, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Fabio Passetti
- Laboratory of Functional Genomics and Bioinformatics, Oswaldo Cruz Institute, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil.,Laboratory of Gene Expression Regulation, Carlos Chagas Institute, Fundação Oswaldo Cruz, Curitiba, Brazil
| |
Collapse
|
131
|
Williams B, Zhao B, Tandon A, Ding F, Weeks KM, Zhang Q, Dokholyan NV. Structure modeling of RNA using sparse NMR constraints. Nucleic Acids Res 2018; 45:12638-12647. [PMID: 29165648 PMCID: PMC5728392 DOI: 10.1093/nar/gkx1058] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 10/18/2017] [Indexed: 01/04/2023] Open
Abstract
RNAs fold into distinct molecular conformations that are often essential for their functions. Accurate structure modeling of complex RNA motifs, including ubiquitous non-canonical base pairs and pseudoknots, remains a challenge. Here, we present an NMR-guided all-atom discrete molecular dynamics (DMD) platform, iFoldNMR, for rapid and accurate structure modeling of complex RNAs. We show that sparse distance constraints from imino resonances, which can be readily obtained from routine NMR experiments and easier to compile than laborious assignments of non-solvent-exchangeable protons, are sufficient to direct a DMD search for low-energy RNA conformers. Benchmarking on a set of RNAs with complex folds spanning up to 56 nucleotides in length yields structural models that recapitulate experimentally determined structures with all-heavy-atom RMSDs ranging from 2.4 to 6.5 Å. This platform represents an efficient approach for high-throughput RNA structure modeling and will facilitate analysis of diverse, newly discovered functional RNAs.
Collapse
Affiliation(s)
- Benfeard Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Bo Zhao
- Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Arpit Tandon
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Qi Zhang
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
132
|
Xu X, Chen SJ. Hierarchical Assembly of RNA Three-Dimensional Structures Based on Loop Templates. J Phys Chem B 2018; 122:5327-5335. [PMID: 29258305 DOI: 10.1021/acs.jpcb.7b10102] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The current RNA structure prediction methods cannot keep up the pace of the rapidly increasing number of sequences and the emerging new functions of RNAs. Template-based RNA three-dimensional structure prediction methods are restricted by the limited number of known RNA structures, and traditional motif-based search for the templates does not always lead to successful results. Here we report a new template search and assembly algorithm, the hierarchical loop template-assembly method (VfoldLA). The method searches for templates for single strand loop/junctions instead of the whole motifs, which often renders no available templates, or short fragments (several nucleotides), which requires a long computational time to assemble and refine. The VfoldLA method has the advantage of accounting for local and nonlocal interloop interactions. Benchmark tests indicate that this new method can provide low-resolution predictions for RNA conformations at different levels of structural complexities. Furthermore, the VfoldLA-predicted conformations may also serve as reliable putative models for further structure prediction and refinements. VfoldLA is accessible at http://rna.physics.missouri.edu/vfoldLA .
Collapse
Affiliation(s)
- Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering , Jiangsu University of Technology , Changzhou , Jiangsu 213001 , China.,Department of Physics, Department of Biochemistry, and Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| |
Collapse
|
133
|
Lim CS, Brown CM. Know Your Enemy: Successful Bioinformatic Approaches to Predict Functional RNA Structures in Viral RNAs. Front Microbiol 2018; 8:2582. [PMID: 29354101 PMCID: PMC5758548 DOI: 10.3389/fmicb.2017.02582] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Accepted: 12/11/2017] [Indexed: 12/14/2022] Open
Abstract
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community.
Collapse
Affiliation(s)
- Chun Shen Lim
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Chris M Brown
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| |
Collapse
|
134
|
Jain S, Schlick T. F-RAG: Generating Atomic Coordinates from RNA Graphs by Fragment Assembly. J Mol Biol 2017; 429:3587-3605. [PMID: 28988954 PMCID: PMC5693719 DOI: 10.1016/j.jmb.2017.09.017] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 09/12/2017] [Accepted: 09/22/2017] [Indexed: 10/18/2022]
Abstract
Coarse-grained models represent attractive approaches to analyze and simulate ribonucleic acid (RNA) molecules, for example, for structure prediction and design, as they simplify the RNA structure to reduce the conformational search space. Our structure prediction protocol RAGTOP (RNA-As-Graphs Topology Prediction) represents RNA structures as tree graphs and samples graph topologies to produce candidate graphs. However, for a more detailed study and analysis, construction of atomic from coarse-grained models is required. Here we present our graph-based fragment assembly algorithm (F-RAG) to convert candidate three-dimensional (3D) tree graph models, produced by RAGTOP into atomic structures. We use our related RAG-3D utilities to partition graphs into subgraphs and search for structurally similar atomic fragments in a data set of RNA 3D structures. The fragments are edited and superimposed using common residues, full atomic models are scored using RAGTOP's knowledge-based potential, and geometries of top scoring models is optimized. To evaluate our models, we assess all-atom RMSDs and Interaction Network Fidelity (a measure of residue interactions) with respect to experimentally solved structures and compare our results to other fragment assembly programs. For a set of 50 RNA structures, we obtain atomic models with reasonable geometries and interactions, particularly good for RNAs containing junctions. Additional improvements to our protocol and databases are outlined. These results provide a good foundation for further work on RNA structure prediction and design applications.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; New York University-East China Normal University Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China.
| |
Collapse
|
135
|
Moretti R, Lyskov S, Das R, Meiler J, Gray JJ. Web-accessible molecular modeling with Rosetta: The Rosetta Online Server that Includes Everyone (ROSIE). Protein Sci 2017; 27:259-268. [PMID: 28960691 DOI: 10.1002/pro.3313] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Revised: 09/21/2017] [Accepted: 09/25/2017] [Indexed: 12/12/2022]
Abstract
The Rosetta molecular modeling software package provides a large number of experimentally validated tools for modeling and designing proteins, nucleic acids, and other biopolymers, with new protocols being added continually. While freely available to academic users, external usage is limited by the need for expertise in the Unix command line environment. To make Rosetta protocols available to a wider audience, we previously created a web server called Rosetta Online Server that Includes Everyone (ROSIE), which provides a common environment for hosting web-accessible Rosetta protocols. Here we describe a simplification of the ROSIE protocol specification format, one that permits easier implementation of Rosetta protocols. Whereas the previous format required creating multiple separate files in different locations, the new format allows specification of the protocol in a single file. This new, simplified protocol specification has more than doubled the number of Rosetta protocols available under ROSIE. These new applications include pKa determination, lipid accessibility calculation, ribonucleic acid redesign, protein-protein docking, protein-small molecule docking, symmetric docking, antibody docking, cyclic toxin docking, critical binding peptide determination, and mapping small molecule binding sites. ROSIE is freely available to academic users at http://rosie.rosettacommons.org.
Collapse
Affiliation(s)
- Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California.,Department of Physics, Stanford University, Stanford, California
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland.,Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, Maryland
| |
Collapse
|
136
|
Wiedemann J, Zok T, Milostan M, Szachniuk M. LCS-TA to identify similar fragments in RNA 3D structures. BMC Bioinformatics 2017; 18:456. [PMID: 29058576 PMCID: PMC5651598 DOI: 10.1186/s12859-017-1867-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 10/09/2017] [Indexed: 11/30/2022] Open
Abstract
Background In modern structural bioinformatics, comparison of molecular structures aimed to identify and assess similarities and differences between them is one of the most commonly performed procedures. It gives the basis for evaluation of in silico predicted models. It constitutes the preliminary step in searching for structural motifs. In particular, it supports tracing the molecular evolution. Faced with an ever-increasing amount of available structural data, researchers need a range of methods enabling comparative analysis of the structures from either global or local perspective. Results Herein, we present a new, superposition-independent method which processes pairs of RNA 3D structures to identify their local similarities. The similarity is considered in the context of structure bending and bonds’ rotation which are described by torsion angles. In the analyzed RNA structures, the method finds the longest continuous segments that show similar torsion within a user-defined threshold. The length of the segment is provided as local similarity measure. The method has been implemented as LCS-TA algorithm (Longest Continuous Segments in Torsion Angle space) and is incorporated into our MCQ4Structures application, freely available for download from http://www.cs.put.poznan.pl/tzok/mcq/. Conclusions The presented approach ties torsion-angle-based method of structure analysis with the idea of local similarity identification by handling continuous 3D structure segments. The first method, implemented in MCQ4Structures, has been successfully utilized in RNA-Puzzles initiative. The second one, originally applied in Euclidean space, is a component of LGA (Local-Global Alignment) algorithm commonly used in assessing protein models submitted to CASP. This unique combination of concepts implemented in LCS-TA provides a new perspective on structure quality assessment in local and quantitative aspect. A series of computational experiments show the first results of applying our method to comparison of RNA 3D models. LCS-TA can be used for identifying strengths and weaknesses in the prediction of RNA tertiary structures. Electronic supplementary material The online version of this article (10.1186/s12859-017-1867-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jakub Wiedemann
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland.,Poznan Supercomputing and Networking Center, Jana Pawla II 10, 61-139, Poznan, Poland
| | - Maciej Milostan
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland.,Poznan Supercomputing and Networking Center, Jana Pawla II 10, 61-139, Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965, Poznan, Poland. .,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland.
| |
Collapse
|
137
|
RNA structure prediction: from 2D to 3D. Emerg Top Life Sci 2017; 1:275-285. [DOI: 10.1042/etls20160027] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2017] [Revised: 07/27/2017] [Accepted: 08/10/2017] [Indexed: 11/17/2022]
Abstract
We summarize different levels of RNA structure prediction, from classical 2D structure to extended secondary structure and motif-based research toward 3D structure prediction of RNA. We outline the importance of classical secondary structure during all those levels of structure prediction.
Collapse
|
138
|
RNA structure inference through chemical mapping after accidental or intentional mutations. Proc Natl Acad Sci U S A 2017; 114:9876-9881. [PMID: 28851837 DOI: 10.1073/pnas.1619897114] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Despite the critical roles RNA structures play in regulating gene expression, sequencing-based methods for experimentally determining RNA base pairs have remained inaccurate. Here, we describe a multidimensional chemical-mapping method called "mutate-and-map read out through next-generation sequencing" (M2-seq) that takes advantage of sparsely mutated nucleotides to induce structural perturbations at partner nucleotides and then detects these events through dimethyl sulfate (DMS) probing and mutational profiling. In special cases, fortuitous errors introduced during DNA template preparation and RNA transcription are sufficient to give M2-seq helix signatures; these signals were previously overlooked or mistaken for correlated double-DMS events. When mutations are enhanced through error-prone PCR, in vitro M2-seq experimentally resolves 33 of 68 helices in diverse structured RNAs including ribozyme domains, riboswitch aptamers, and viral RNA domains with a single false positive. These inferences do not require energy minimization algorithms and can be made by either direct visual inspection or by a neural-network-inspired algorithm called M2-net. Measurements on the P4-P6 domain of the Tetrahymena group I ribozyme embedded in Xenopus egg extract demonstrate the ability of M2-seq to detect RNA helices in a complex biological environment.
Collapse
|
139
|
Kauffmann AD, Kennedy SD, Zhao J, Turner DH. Nuclear Magnetic Resonance Structure of an 8 × 8 Nucleotide RNA Internal Loop Flanked on Each Side by Three Watson-Crick Pairs and Comparison to Three-Dimensional Predictions. Biochemistry 2017; 56:3733-3744. [PMID: 28700212 DOI: 10.1021/acs.biochem.7b00201] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The prediction of RNA three-dimensional structure from sequence alone has been a long-standing goal. High-resolution, experimentally determined structures of simple noncanonical pairings and motifs are critical to the development of prediction programs. Here, we present the nuclear magnetic resonance structure of the (5'CCAGAAACGGAUGGA)2 duplex, which contains an 8 × 8 nucleotide internal loop flanked by three Watson-Crick pairs on each side. The loop is comprised of a central 5'AC/3'CA nearest neighbor flanked by two 3RRs motifs, a known stable motif consisting of three consecutive sheared GA pairs. Hydrogen bonding patterns between base pairs in the loop, the all-atom root-mean-square deviation for the loop, and the deformation index were used to compare the structure to automated predictions by MC-sym, RNA FARFAR, and RNAComposer.
Collapse
Affiliation(s)
- Andrew D Kauffmann
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States.,Center for RNA Biology, University of Rochester , Rochester, New York 14627, United States
| | - Scott D Kennedy
- Department of Biochemistry and Biophysics, School of Medicine & Dentistry, University of Rochester , Rochester, New York 14642, United States.,Center for RNA Biology, University of Rochester , Rochester, New York 14627, United States
| | - Jianbo Zhao
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States.,Center for RNA Biology, University of Rochester , Rochester, New York 14627, United States
| | - Douglas H Turner
- Department of Chemistry, University of Rochester , Rochester, New York 14627, United States.,Center for RNA Biology, University of Rochester , Rochester, New York 14627, United States
| |
Collapse
|
140
|
Abstract
Inspired by the recent success of scientific-discovery games for predicting protein tertiary and RNA secondary structures, we have developed an open software for coarse-grained RNA folding simulations, guided by human intuition. To determine the extent to which interactive simulations can accurately predict 3D RNA structures of increasing complexity and lengths (four RNAs with 22-47 nucleotides), an interactive experiment was conducted with 141 participants who had very little knowledge of nucleic acids systems and computer simulations, and had received only a brief description of the important forces stabilizing RNA structures. Their structures and full trajectories have been analyzed statistically and compared to standard replica exchange molecular dynamics simulations. Our analyses show that participants gain easily chemical intelligence to fold simple and nontrivial topologies, with little computer time, and this result opens the door for the use of human-guided simulations to RNA folding. Our experiment shows that interactive simulations have better chances of success when the user widely explores the conformational space. Interestingly, providing on-the-fly feedback of the root mean square deviation with respect to the experimental structure did not improve the quality of the proposed models.
Collapse
|
141
|
Modelling the three-dimensional structure of the right-terminal domain of pospiviroids. Sci Rep 2017; 7:711. [PMID: 28386073 PMCID: PMC5429643 DOI: 10.1038/s41598-017-00764-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 03/13/2017] [Indexed: 12/20/2022] Open
Abstract
Viroids, the smallest know plant pathogens, consist solely of a circular, single-stranded, non-coding RNA. Thus for all of their biological functions, like replication, processing, and transport, they have to present sequence or structural features to exploit host proteins. Viroid binding protein 1 (Virp1) is indispensable for replication of pospiviroids, the largest genus of viroids, in a host plant as well as in protoplasts. Virp1 is known to bind at two sites in the terminal right (TR) domain of pospiviroids; each site consists of a purine- (R-) and a pyrimidine- (Y-)rich motif that are partially base-paired to each other. Here we model the important structural features of the domain and show that it contains an internal loop of two Y · Y cis Watson-Crick/Watson-Crick (cWW) pairs, an asymmetric internal loop including a cWW and a trans Watson/Hoogsteen pair, and a thermodynamically quite stable hairpin loop with several stacking interactions. These features are discussed in connection to the known biological functions of the TR domain.
Collapse
|
142
|
Abstract
Biological functions of RNA molecules are dependent upon sustained specific three-dimensional (3D) structures of RNA, with or without the help of proteins. Understanding of RNA structure is frequently based on 2D structures, which describe only the Watson-Crick (WC) base pairs. Here, we hierarchically review the structural elements of RNA and how they contribute to RNA 3D structure. We focus our analysis on the non-WC base pairs and on RNA modules. Several computer programs have now been designed to predict RNA modules. We describe the RNA-Puzzles initiative, which is a community-wide, blind assessment of RNA 3D structure prediction programs to determine the capabilities and bottlenecks of current predictions. The assessment metrics used in RNA-Puzzles are briefly described. The detection of RNA 3D modules from sequence data and their automatic implementation belong to the current challenges in RNA 3D structure prediction.
Collapse
Affiliation(s)
- Zhichao Miao
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de Biologie Moléculaire et Cellulaire du CNRS, 67000 Strasbourg, France; ,
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Université de Strasbourg, Institut de Biologie Moléculaire et Cellulaire du CNRS, 67000 Strasbourg, France; ,
| |
Collapse
|