51
|
Zhang T, Singh J, Litfin T, Zhan J, Paliwal K, Zhou Y. RNAcmap: A Fully Automatic Pipeline for Predicting Contact Maps of RNAs by Evolutionary Coupling Analysis. Bioinformatics 2021; 37:3494-3500. [PMID: 34021744 DOI: 10.1093/bioinformatics/btab391] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 03/27/2021] [Accepted: 05/18/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The accuracy of RNA secondary and tertiary structure prediction can be significantly improved by using structural restraints derived from evolutionary coupling or direct coupling analysis. Currently, these coupling analyses relied on manually curated multiple sequence alignments collected in the Rfam database, which contains 3016 families. By comparison, millions of non-coding RNA sequences are known. Here, we established RNAcmap, a fully automatic pipeline that enables evolutionary coupling analysis for any RNA sequences. The homology search was based on the covariance model built by INFERNAL according to two secondary structure predictors: a folding-based algorithm RNAfold and the latest deep-learning method SPOT-RNA. RESULTS We showed that the performance of RNAcmap is less dependent on the specific evolutionary coupling tool but is more dependent on the accuracy of secondary structure predictor with the best performance given by RNAcmap (SPOT-RNA). The performance of RNAcmap (SPOT-RNA) is comparable to that based on Rfam-supplied alignment and consistent for those sequences that are not in Rfam collections. Further improvement can be made with a simple meta predictor RNAcmap (SPOT-RNA/RNAfold) depending on which secondary structure predictor can find more homologous sequences. Reliable base-pairing information generated from RNAcmap, for RNAs with high effective homologous sequences, in particular, will be useful for aiding RNA structure prediction. AVAILABILITY RNAcmap is available as a web server at https://sparks-lab.org/server/rnacmap/ and as a standalone application along with the datasets at https://github.com/sparks-lab-org/RNAcmap_standalone. A platform independent and fully configured docker image of RNAcmap is also provided at https://hub.docker.com/r/jaswindersingh2/rnacmap.
Collapse
Affiliation(s)
- Tongchuan Zhang
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Jaswinder Singh
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD 4111, Australia
| | - Thomas Litfin
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Jian Zhan
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD 4111, Australia
| | - Yaoqi Zhou
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr. Southport, QLD 4222, Australia.,Institute for Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518055, China
| |
Collapse
|
52
|
Pairing a high-resolution statistical potential with a nucleobase-centric sampling algorithm for improving RNA model refinement. Nat Commun 2021; 12:2777. [PMID: 33986288 PMCID: PMC8119458 DOI: 10.1038/s41467-021-23100-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 04/13/2021] [Indexed: 12/04/2022] Open
Abstract
Refining modelled structures to approach experimental accuracy is one of the most challenging problems in molecular biology. Despite many years’ efforts, the progress in protein or RNA structure refinement has been slow because the global minimum given by the energy scores is not at the experimentally determined “native” structure. Here, we propose a fully knowledge-based energy function that captures the full orientation dependence of base–base, base–oxygen and oxygen–oxygen interactions with the RNA backbone modelled by rotameric states and internal energies. A total of 4000 quantum-mechanical calculations were performed to reweight base–base statistical potentials for minimizing possible effects of indirect interactions. The resulting BRiQ knowledge-based potential, equipped with a nucleobase-centric sampling algorithm, provides a robust improvement in refining near-native RNA models generated by a wide variety of modelling techniques. Predicting RNA structure from sequence is challenging due to the relative sparsity of experimentally-determined RNA 3D structures for model training. Here, the authors propose a way to incorporate knowledge on interactions at the atomic and base–base level to refine the prediction of RNA structures.
Collapse
|
53
|
Schlick T, Portillo-Ledesma S, Myers CG, Beljak L, Chen J, Dakhel S, Darling D, Ghosh S, Hall J, Jan M, Liang E, Saju S, Vohr M, Wu C, Xu Y, Xue E. Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field. Annu Rev Biophys 2021; 50:267-301. [PMID: 33606945 PMCID: PMC8105287 DOI: 10.1146/annurev-biophys-091720-102019] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We reassess progress in the field of biomolecular modeling and simulation, following up on our perspective published in 2011. By reviewing metrics for the field's productivity and providing examples of success, we underscore the productive phase of the field, whose short-term expectations were overestimated and long-term effects underestimated. Such successes include prediction of structures and mechanisms; generation of new insights into biomolecular activity; and thriving collaborations between modeling and experimentation, including experiments driven by modeling. We also discuss the impact of field exercises and web games on the field's progress. Overall, we note tremendous success by the biomolecular modeling community in utilization of computer power; improvement in force fields; and development and application of new algorithms, notably machine learning and artificial intelligence. The combined advances are enhancing the accuracy andscope of modeling and simulation, establishing an exemplary discipline where experiment and theory or simulations are full partners.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, New York University, New York, New York 10003, USA;
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
- New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200122, China
| | | | - Christopher G Myers
- Department of Chemistry, New York University, New York, New York 10003, USA;
| | - Lauren Beljak
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Justin Chen
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sami Dakhel
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Daniel Darling
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sayak Ghosh
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Joseph Hall
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Mikaeel Jan
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Emily Liang
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sera Saju
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Mackenzie Vohr
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Chris Wu
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Yifan Xu
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Eva Xue
- College of Arts and Science, New York University, New York, New York 10003, USA
| |
Collapse
|
54
|
Thavarajah W, Hertz LM, Bushhouse DZ, Archuleta CM, Lucks JB. RNA Engineering for Public Health: Innovations in RNA-Based Diagnostics and Therapeutics. Annu Rev Chem Biomol Eng 2021; 12:263-286. [PMID: 33900805 PMCID: PMC9714562 DOI: 10.1146/annurev-chembioeng-101420-014055] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
RNA is essential for cellular function: From sensing intra- and extracellular signals to controlling gene expression, RNA mediates a diverse and expansive list of molecular processes. A long-standing goal of synthetic biology has been to develop RNA engineering principles that can be used to harness and reprogram these RNA-mediated processes to engineer biological systems to solve pressing global challenges. Recent advances in the field of RNA engineering are bringing this to fruition, enabling the creation of RNA-based tools to combat some of the most urgent public health crises. Specifically, new diagnostics using engineered RNAs are able to detect both pathogens and chemicals while generating an easily detectable fluorescent signal as an indicator. New classes of vaccines and therapeutics are also using engineered RNAs to target a wide range of genetic and pathogenic diseases. Here, we discuss the recent breakthroughs in RNA engineering enabling these innovations and examine how advances in RNA design promise to accelerate the impact of engineered RNA systems.
Collapse
Affiliation(s)
- Walter Thavarajah
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; .,Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA
| | - Laura M Hertz
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
| | - David Z Bushhouse
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
| | - Chloé M Archuleta
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; .,Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; .,Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA.,Center for Engineering Sustainability and Resilience, Northwestern University, Evanston, Illinois 60208, USA
| |
Collapse
|
55
|
Ma J, Saikia N, Godar S, Hamilton GL, Ding F, Alper J, Sanabria H. Ensemble Switching Unveils a Kinetic Rheostat Mechanism of the Eukaryotic Thiamine Pyrophosphate Riboswitch. RNA (NEW YORK, N.Y.) 2021; 27:rna.075937.120. [PMID: 33863818 PMCID: PMC8208051 DOI: 10.1261/rna.075937.120] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 04/13/2021] [Indexed: 05/05/2023]
Abstract
Thiamine pyrophosphate (TPP) riboswitches regulate thiamine metabolism by inhibiting the translation of enzymes essential to thiamine synthesis pathways upon binding to thiamine pyrophosphate in cells across all domains of life. Recent work on the Arabidopsis thaliana TPP riboswitch suggests a multi-step TPP binding process involving multiple riboswitch configurational ensembles and that Mg2+ dependence underlies the mechanism of TPP recognition and subsequent transition to the expression-inhibiting state of the aptamer domain followed by changes in the expression platform. However, details of the relationship between TPP riboswitch conformational changes and interactions with TPP and Mg2+ ¬¬in the aptamer domain constituting this mechanism are unknown. Therefore, we integrated single-molecule multiparameter fluorescence and force spectroscopy with atomistic molecular dynamics simulations and found that conformational transitions within the aptamer domain's sensor helices associated with TPP and Mg2+ ligand binding occurred between at least five different ensembles on timescales ranging from µs to ms. These dynamics are orders of magnitude faster than the 10 second-timescale folding kinetics associated with expression-state switching in the switch sequence. Together, our results show that a TPP and Mg2+ dependent mechanism determines dynamic configurational state ensemble switching of the aptamer domain's sensor helices that regulates the stability of the switch helix, which ultimately may lead to the expression-inhibiting state of the riboswitch. Additionally, we propose that two pathways exist for ligand recognition and that this mechanism underlies a kinetic rheostat-like behavior of the Arabidopsis thaliana TPP riboswitch.
Collapse
Affiliation(s)
- Junyan Ma
- Department of Chemistry, Clemson University
| | | | - Subash Godar
- Department of Physics and Astronomy, Clemson University
| | | | - Feng Ding
- Department of Physics and Astronomy, Clemson University
| | - Joshua Alper
- Department of Physics and Astronomy, Clemson University
| | - Hugo Sanabria
- Department of Physics and Astronomy, Clemson University
| |
Collapse
|
56
|
Rangan R, Watkins AM, Chacon J, Kretsch R, Kladwang W, Zheludev IN, Townley J, Rynge M, Thain G, Das R. De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures. Nucleic Acids Res 2021; 49:3092-3108. [PMID: 33693814 PMCID: PMC8034642 DOI: 10.1093/nar/gkab119] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 02/08/2021] [Accepted: 02/16/2021] [Indexed: 12/12/2022] Open
Abstract
The rapid spread of COVID-19 is motivating development of antivirals targeting conserved SARS-CoV-2 molecular machinery. The SARS-CoV-2 genome includes conserved RNA elements that offer potential small-molecule drug targets, but most of their 3D structures have not been experimentally characterized. Here, we provide a compilation of chemical mapping data from our and other labs, secondary structure models, and 3D model ensembles based on Rosetta's FARFAR2 algorithm for SARS-CoV-2 RNA regions including the individual stems SL1-8 in the extended 5' UTR; the reverse complement of the 5' UTR SL1-4; the frameshift stimulating element (FSE); and the extended pseudoknot, hypervariable region, and s2m of the 3' UTR. For eleven of these elements (the stems in SL1-8, reverse complement of SL1-4, FSE, s2m and 3' UTR pseudoknot), modeling convergence supports the accuracy of predicted low energy states; subsequent cryo-EM characterization of the FSE confirms modeling accuracy. To aid efforts to discover small molecule RNA binders guided by computational models, we provide a second set of similarly prepared models for RNA riboswitches that bind small molecules. Both datasets ('FARFAR2-SARS-CoV-2', https://github.com/DasLab/FARFAR2-SARS-CoV-2; and 'FARFAR2-Apo-Riboswitch', at https://github.com/DasLab/FARFAR2-Apo-Riboswitch') include up to 400 models for each RNA element, which may facilitate drug discovery approaches targeting dynamic ensembles of RNA molecules.
Collapse
Affiliation(s)
- Ramya Rangan
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
| | - Jose Chacon
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
| | - Rachael Kretsch
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
| | - Ivan N Zheludev
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
| | | | - Mats Rynge
- Information Sciences Institute, University of Southern California, Marina Del Rey, CA 90292, USA
| | - Gregory Thain
- Department of Computer Sciences, University of Wisconsin–Madison, Madison, WI 53706 USA
| | - Rhiju Das
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford CA 94305, USA
- Department of Physics, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
57
|
Le KH, Adolf-Bryfogle J, Klima JC, Lyskov S, Labonte J, Bertolani S, Burman SSR, Leaver-Fay A, Weitzner B, Maguire J, Rangan R, Adrianowycz MA, Alford RF, Adal A, Nance ML, Wu Y, Willis J, Kulp DW, Das R, Dunbrack RL, Schief W, Kuhlman B, Siegel JB, Gray JJ. PyRosetta Jupyter Notebooks Teach Biomolecular Structure Prediction and Design. BIOPHYSICIST (ROCKVILLE, MD.) 2021; 2:108-122. [PMID: 35128343 DOI: 10.35459/tbp.2019.000147] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Biomolecular structure drives function, and computational capabilities have progressed such that the prediction and computational design of biomolecular structures is increasingly feasible. Because computational biophysics attracts students from many different backgrounds and with different levels of resources, teaching the subject can be challenging. One strategy to teach diverse learners is with interactive multimedia material that promotes self-paced, active learning. We have created a hands-on education strategy with a set of sixteen modules that teach topics in biomolecular structure and design, from fundamentals of conformational sampling and energy evaluation to applications like protein docking, antibody design, and RNA structure prediction. Our modules are based on PyRosetta, a Python library that encapsulates all computational modules and methods in the Rosetta software package. The workshop-style modules are implemented as Jupyter Notebooks that can be executed in the Google Colaboratory, allowing learners access with just a web browser. The digital format of Jupyter Notebooks allows us to embed images, molecular visualization movies, and interactive coding exercises. This multimodal approach may better reach students from different disciplines and experience levels as well as attract more researchers from smaller labs and cognate backgrounds to leverage PyRosetta in their science and engineering research. All materials are freely available at https://github.com/RosettaCommons/PyRosetta.notebooks.
Collapse
Affiliation(s)
- Kathy H Le
- T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, California, United States
| | - Jason C Klima
- Institute for Protein Design, University of Washington, Seattle, Washington, United States.,Department of Biochemistry, University of Washington, Seattle, Washington, United States.,Lyell Immunopharma, Inc., Seattle, Washington, United States
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Jason Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States.,Department of Chemistry, Franklin & Marshall College, Lancaster, Pennsylvania, United States
| | - Steven Bertolani
- Department of Chemistry, Department of Biochemistry and Molecular Medicine, Genome Center, University of California, Davis, Davis, California, United States
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Andrew Leaver-Fay
- Department of Biochemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - Brian Weitzner
- Institute for Protein Design, University of Washington, Seattle, Washington, United States.,Department of Biochemistry, University of Washington, Seattle, Washington, United States.,Lyell Immunopharma, Inc., Seattle, Washington, United States
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - Ramya Rangan
- Program in Biophysics, Stanford University, Stanford, California, United States
| | - Matt A Adrianowycz
- Program in Biophysics, Stanford University, Stanford, California, United States
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Aleexsan Adal
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Morgan L Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| | - Yuanhan Wu
- Vaccine and Immunotherapy Center, Wistar Institute, Philadelphia, Pennsylvania, United States
| | - Jordan Willis
- RubrYc Therapeutics, San Ramon, California, United States
| | - Daniel W Kulp
- Vaccine and Immunotherapy Center, Wistar Institute, Philadelphia, Pennsylvania, United States
| | - Rhiju Das
- Program in Biophysics, Stanford University, Stanford, California, United States
| | | | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, California, United States
| | - Brian Kuhlman
- Department of Biochemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States.,Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - Justin B Siegel
- Department of Chemistry, Department of Biochemistry and Molecular Medicine, Genome Center, University of California, Davis, Davis, California, United States
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States.,Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| |
Collapse
|
58
|
An RNA-centric historical narrative around the Protein Data Bank. J Biol Chem 2021; 296:100555. [PMID: 33744291 PMCID: PMC8080527 DOI: 10.1016/j.jbc.2021.100555] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 02/17/2021] [Accepted: 03/16/2021] [Indexed: 01/06/2023] Open
Abstract
Some of the amazing contributions brought to the scientific community by the Protein Data Bank (PDB) are described. The focus is on nucleic acid structures with a bias toward RNA. The evolution and key roles in science of the PDB and other structural databases for nucleic acids illustrate how small initial ideas can become huge and indispensable resources with the unflinching willingness of scientists to cooperate globally. The progress in the understanding of the molecular interactions driving RNA architectures followed the rapid increase in RNA structures in the PDB. That increase was consecutive to improvements in chemical synthesis and purification of RNA molecules, as well as in biophysical methods for structure determination and computer technology. The RNA modeling efforts from the early beginnings are also described together with their links to the state of structural knowledge and technological development. Structures of RNA and of its assemblies are physical objects, which, together with genomic data, allow us to integrate present-day biological functions and the historical evolution in all living species on earth.
Collapse
|
59
|
Abstract
The molecules of the ribonucleic acid (RNA) perform a variety of vital roles in all living cells. Their biological function depends on their structure and dynamics, both of which are difficult to experimentally determine but can be theoretically inferred based on the RNA sequence. SimRNA is one of the computational methods for molecular simulations of RNA 3D structure formation. The method is based on a simplified (coarse-grained) representation of nucleotide chains, a statistically derived model of interactions (statistical potential), and the Monte Carlo method as a conformational sampling scheme.The current version of SimRNA (3.22) is able to predict basic topologies of RNA molecules with sizes up to about 50-70 nucleotides, based on their sequences only, and larger molecules if supplied with appropriate distance restraints. The user can specify various types of restraints, including secondary structure, pairwise atom-atom distances, and positions of atoms. SimRNA can be also used for studying systems composed of several chains of RNA. SimRNA is a folding simulations method, thus it allows for examining folding pathways, getting an approximate view of the energy landscapes.
Collapse
|
60
|
Calonaci N, Jones A, Cuturello F, Sattler M, Bussi G. Machine learning a model for RNA structure prediction. NAR Genom Bioinform 2021; 2:lqaa090. [PMID: 33575634 PMCID: PMC7671377 DOI: 10.1093/nargab/lqaa090] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 10/06/2020] [Accepted: 10/20/2020] [Indexed: 01/04/2023] Open
Abstract
RNA function crucially depends on its structure. Thermodynamic models currently used for secondary structure prediction rely on computing the partition function of folding ensembles, and can thus estimate minimum free-energy structures and ensemble populations. These models sometimes fail in identifying native structures unless complemented by auxiliary experimental data. Here, we build a set of models that combine thermodynamic parameters, chemical probing data (DMS and SHAPE) and co-evolutionary data (direct coupling analysis) through a network that outputs perturbations to the ensemble free energy. Perturbations are trained to increase the ensemble populations of a representative set of known native RNA structures. In the chemical probing nodes of the network, a convolutional window combines neighboring reactivities, enlightening their structural information content and the contribution of local conformational ensembles. Regularization is used to limit overfitting and improve transferability. The most transferable model is selected through a cross-validation strategy that estimates the performance of models on systems on which they are not trained. With the selected model we obtain increased ensemble populations for native structures and more accurate predictions in an independent validation set. The flexibility of the approach allows the model to be easily retrained and adapted to incorporate arbitrary experimental information.
Collapse
Affiliation(s)
- Nicola Calonaci
- International School for Advanced Studies, via Bonomea 265, 34136 Trieste, Italy
| | - Alisha Jones
- Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg 85764, Germany
| | - Francesca Cuturello
- International School for Advanced Studies, via Bonomea 265, 34136 Trieste, Italy
| | - Michael Sattler
- Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg 85764, Germany
| | - Giovanni Bussi
- International School for Advanced Studies, via Bonomea 265, 34136 Trieste, Italy
| |
Collapse
|
61
|
Zhang D, Li J, Chen SJ. IsRNA1: De Novo Prediction and Blind Screening of RNA 3D Structures. J Chem Theory Comput 2021; 17:1842-1857. [PMID: 33560836 DOI: 10.1021/acs.jctc.0c01148] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Modeling structures and functions of large ribonucleic acid (RNAs) especially with complicated topologies is highly challenging due to the inefficiency of large conformational sampling and the presence of complicated tertiary interactions. To address this problem, one highly promising approach is coarse-grained modeling. Here, following an iterative simulated reference state approach to decipher the correlations between different structural parameters, we developed a potent coarse-grained RNA model named as IsRNA1 for RNA studies. Molecular dynamics simulations in the IsRNA1 can predict the native structures of small RNAs from a sequence and fold medium-sized RNAs into near-native tertiary structures with the assistance of secondary structure constraints. A large-scale benchmark test on RNA 3D structure prediction shows that IsRNA1 exhibits improved performance for relatively large RNAs of complicated topologies, such as large stem-loop structures and structures containing long-range tertiary interactions. The advantages of IsRNA1 include the consideration of the correlations between the different structural variables, the appropriate characterization of canonical base-pairing and base-stacking interactions, and the better sampling for the backbone conformations. Moreover, a blind screening protocol was developed based on IsRNA1 to identify good structural models from a pool of candidates without prior knowledge of the native structures.
Collapse
Affiliation(s)
- Dong Zhang
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Jun Li
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| |
Collapse
|
62
|
Abstract
Novel RNA motif design is of great practical importance for technology and medicine. Increasingly, computational design plays an important role in such efforts. Our coarse-grained RAG (RNA-As-Graphs) framework offers strategies for enumerating the universe of RNA 2D folds, selecting "RNA-like" candidates for design, and determining sequences that fold onto these candidates. In RAG, RNA secondary structures are represented as tree or dual graphs. Graphs with known RNA structures are called "existing", and the others are labeled "hypothetical". By using simplified features for RNA graphs, we have clustered the hypothetical graphs into "RNA-like" and "non-RNA-like" groups and proposed RNA-like graphs as candidates for design. Here, we propose a new way of designing graph features by using Fiedler vectors. The new features reflect graph shapes better, and they lead to a more clustered organization of existing graphs. We show significant increases in K-means clustering accuracy by using the new features (e.g., up to 95% and 98% accuracy for tree and dual graphs, respectively). In addition, we propose a scoring model for top graph candidate selection. This scoring model allows users to set a threshold for candidates, and it incorporates weighing of existing graphs based on their corresponding number of known RNAs. We include a list of top scored RNA-like candidates, which we hope will stimulate future novel RNA design.
Collapse
Affiliation(s)
- Qiyao Zhu
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| | - Tamar Schlick
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
- Department of Chemistry, New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry, NYU Shanghai, Shanghai 200062, P. R. China
| |
Collapse
|
63
|
Arriola JT, Müller UF. A combinatorial method to isolate short ribozymes from complex ribozyme libraries. Nucleic Acids Res 2020; 48:e116. [PMID: 33035338 PMCID: PMC7672470 DOI: 10.1093/nar/gkaa834] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Revised: 08/28/2020] [Accepted: 10/01/2020] [Indexed: 11/13/2022] Open
Abstract
In vitro selections are the only known methods to generate catalytic RNAs (ribozymes) that do not exist in nature. Such new ribozymes are used as biochemical tools, or to address questions on early stages of life. In both cases, it is helpful to identify the shortest possible ribozymes since they are easier to deploy as a tool, and because they are more likely to have emerged in a prebiotic environment. One of our previous selection experiments led to a library containing hundreds of different ribozyme clusters that catalyze the triphosphorylation of their 5'-terminus. This selection showed that RNA systems can use the prebiotically plausible molecule cyclic trimetaphosphate as an energy source. From this selected ribozyme library, the shortest ribozyme that was previously identified had a length of 67 nucleotides. Here we describe a combinatorial method to identify short ribozymes from libraries containing many ribozymes. Using this protocol on the library of triphosphorylation ribozymes, we identified a 17-nucleotide sequence motif embedded in a 44-nucleotide pseudoknot structure. The described combinatorial approach can be used to analyze libraries obtained by different in vitro selection experiments.
Collapse
Affiliation(s)
- Joshua T Arriola
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Ulrich F Müller
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| |
Collapse
|
64
|
Martina Z, Christina H, Le L, Jesse DC, Liang YC, Christian SM, Monaghan AG, Kennedy AA, Yesselman JD, Gifford RR, Tai AW, Kutluay SB, Li ML, Brewer G, Tolbert BS, Hargrove AE. Amilorides inhibit SARS-CoV-2 replication in vitro by targeting RNA structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.12.05.409821. [PMID: 33299997 PMCID: PMC7724665 DOI: 10.1101/2020.12.05.409821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The SARS-CoV-2 pandemic, and the likelihood of future coronavirus pandemics, has rendered our understanding of coronavirus biology more essential than ever. Small molecule chemical probes offer to both reveal novel aspects of virus replication and to serve as leads for antiviral therapeutic development. The RNA-biased amiloride scaffold was recently tuned to target a viral RNA structure critical for translation in enterovirus 71, ultimately uncovering a novel mechanism to modulate positive-sense RNA viral translation and replication. Analysis of CoV RNA genomes reveal many conserved RNA structures in the 5'-UTR and proximal region critical for viral translation and replication, including several containing bulge-like secondary structures suitable for small molecule targeting. Following phylogenetic conservation analysis of this region, we screened an amiloride-based small molecule library against a less virulent human coronavirus, OC43, to identify lead ligands. Amilorides inhibited OC43 replication as seen in viral plaque assays. Select amilorides also potently inhibited replication competent SARS-CoV-2 as evident in the decreased levels of cell free virions in cell culture supernatants of treated cells. Reporter screens confirmed the importance of RNA structures in the 5'-end of the viral genome for small molecule activity. Finally, NMR chemical shift perturbation studies of the first six stem loops of the 5'-end revealed specific amiloride interactions with stem loops 4, 5a, and 6, all of which contain bulge like structures and were predicted to be strongly bound by the lead amilorides in retrospective docking studies. Taken together, the use of multiple orthogonal approaches allowed us to identify the first small molecules aimed at targeting RNA structures within the 5'-UTR and proximal region of the CoV genome. These molecules will serve as chemical probes to further understand CoV RNA biology and can pave the way for the development of specific CoV RNA-targeted antivirals.
Collapse
Affiliation(s)
- Zafferani Martina
- Chemistry Department, Duke University, 124 Science Drive; Durham, NC USA 27705
| | - Haddad Christina
- Department of Chemistry, Case Western Reserve University, Cleveland OH 441106
| | - Luo Le
- Department of Chemistry, Case Western Reserve University, Cleveland OH 441106
| | | | - Yuan-Chiu Liang
- Department of Chemistry, Case Western Reserve University, Cleveland OH 441106
| | - Shema Mugisha Christian
- Department of Molecular Microbiology, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Adeline G. Monaghan
- Chemistry Department, Duke University, 124 Science Drive; Durham, NC USA 27705
| | - Andrew A. Kennedy
- Department of Internal Medicine and Department of Microbiology & Immunology, University of Michigan, 1150 W Medical Center Dr, Ann Arbor MI 48109
| | - Joseph D. Yesselman
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, NE 68588 USA
| | - Robert R. Gifford
- MRC-University of Glasgow Centre for Virus Research, 464 Bearsden Rd, Bearsden, Glasgow, UK, G61 1QH
| | - Andrew W. Tai
- Department of Internal Medicine and Department of Microbiology & Immunology, University of Michigan, 1150 W Medical Center Dr, Ann Arbor MI 48109
| | - Sebla B. Kutluay
- Department of Molecular Microbiology, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Mei-Ling Li
- Department of Biochemistry and Molecular Biology, Rutgers Robert Wood Johnson Medical School, 675 Hoes Lane West, Piscataway, NJ USA 08854
| | - Gary Brewer
- Department of Biochemistry and Molecular Biology, Rutgers Robert Wood Johnson Medical School, 675 Hoes Lane West, Piscataway, NJ USA 08854
| | - Blanton S. Tolbert
- Department of Chemistry, Case Western Reserve University, Cleveland OH 441106
| | - Amanda E. Hargrove
- Chemistry Department, Duke University, 124 Science Drive; Durham, NC USA 27705
| |
Collapse
|
65
|
Rapid and accurate determination of atomistic RNA dynamic ensemble models using NMR and structure prediction. Nat Commun 2020; 11:5531. [PMID: 33139729 PMCID: PMC7608651 DOI: 10.1038/s41467-020-19371-y] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2020] [Accepted: 10/07/2020] [Indexed: 11/08/2022] Open
Abstract
Biomolecules form dynamic ensembles of many inter-converting conformations which are key for understanding how they fold and function. However, determining ensembles is challenging because the information required to specify atomic structures for thousands of conformations far exceeds that of experimental measurements. We addressed this data gap and dramatically simplified and accelerated RNA ensemble determination by using structure prediction tools that leverage the growing database of RNA structures to generate a conformation library. Refinement of this library with NMR residual dipolar couplings provided an atomistic ensemble model for HIV-1 TAR, and the model accuracy was independently supported by comparisons to quantum-mechanical calculations of NMR chemical shifts, comparison to a crystal structure of a substate, and through designed ensemble redistribution via atomic mutagenesis. Applications to TAR bulge variants and more complex tertiary RNAs support the generality of this approach and the potential to make the determination of atomic-resolution RNA ensembles routine. Determining dynamic ensembles of biomolecules is still challenging. Here the authors present an approach for rapid RNA ensemble determination that combines RNA structure prediction tools and NMR residual dipolar coupling data and use it to determine atomistic ensemble models for a variety of RNAs.
Collapse
|
66
|
Li B, Cao Y, Westhof E, Miao Z. Advances in RNA 3D Structure Modeling Using Experimental Data. Front Genet 2020; 11:574485. [PMID: 33193680 PMCID: PMC7649352 DOI: 10.3389/fgene.2020.574485] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/02/2020] [Indexed: 12/26/2022] Open
Abstract
RNA is a unique bio-macromolecule that can both record genetic information and perform biological functions in a variety of molecular processes, including transcription, splicing, translation, and even regulating protein function. RNAs adopt specific three-dimensional conformations to enable their functions. Experimental determination of high-resolution RNA structures using x-ray crystallography is both laborious and demands expertise, thus, hindering our comprehension of RNA structural biology. The computational modeling of RNA structure was a milestone in the birth of bioinformatics. Although computational modeling has been greatly improved over the last decade showing many successful cases, the accuracy of such computational modeling is not only length-dependent but also varies according to the complexity of the structure. To increase credibility, various experimental data were integrated into computational modeling. In this review, we summarize the experiments that can be integrated into RNA structure modeling as well as the computational methods based on these experimental data. We also demonstrate how computational modeling can help the experimental determination of RNA structure. We highlight the recent advances in computational modeling which can offer reliable structure models using high-throughput experimental data.
Collapse
Affiliation(s)
- Bing Li
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence, Department of Anesthesiology, Shanghai Fourth People’s Hospital Affiliated to Tongji University School of Medicine, Shanghai, China
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| |
Collapse
|
67
|
Šulc P. The Multiscale Future of RNA Modeling. Biophys J 2020; 119:1270-1272. [PMID: 32941784 PMCID: PMC7462927 DOI: 10.1016/j.bpj.2020.08.026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 08/14/2020] [Accepted: 08/25/2020] [Indexed: 11/28/2022] Open
Affiliation(s)
- Petr Šulc
- School of Molecular Sciences and Center for Molecular Design and Biomimetics, The Biodesign Institute, Arizona State University, Tempe, Arizona.
| |
Collapse
|
68
|
Xu X, Chen SJ. Topological constraints of RNA pseudoknotted and loop-kissing motifs: applications to three-dimensional structure prediction. Nucleic Acids Res 2020; 48:6503-6512. [PMID: 32491164 PMCID: PMC7337929 DOI: 10.1093/nar/gkaa463] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Accepted: 05/19/2020] [Indexed: 01/23/2023] Open
Abstract
An RNA global fold can be described at the level of helix orientations and relatively flexible loop conformations that connect the helices. The linkage between the helices plays an essential role in determining the structural topology, which restricts RNA local and global folds, especially for RNA tertiary structures involving cross-linked base pairs. We quantitatively analyze the topological constraints on RNA 3D conformational space, in particular, on the distribution of helix orientations, for pseudoknots and loop-loop kissing structures. The result shows that a viable conformational space is predominantly determined by the motif type, helix size, and loop size, indicating a strong topological coupling between helices and loops in RNA tertiary motifs. Moreover, the analysis indicates that (cross-linked) tertiary contacts can cause much stronger topological constraints on RNA global fold than non-cross-linked base pairs. Furthermore, based on the topological constraints encoded in the 2D structure and the 3D templates, we develop a 3D structure prediction approach. This approach can be further combined with structure probing methods to expand the capability of computational prediction for large RNA folds.
Collapse
Affiliation(s)
- Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Informatics Institute, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
69
|
Saaidi A, Allouche D, Regnier M, Sargueil B, Ponty Y. IPANEMAP: integrative probing analysis of nucleic acids empowered by multiple accessibility profiles. Nucleic Acids Res 2020; 48:8276-8289. [PMID: 32735675 PMCID: PMC7470984 DOI: 10.1093/nar/gkaa607] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 07/03/2020] [Accepted: 07/29/2020] [Indexed: 11/13/2022] Open
Abstract
The manual production of reliable RNA structure models from chemical probing experiments benefits from the integration of information derived from multiple protocols and reagents. However, the interpretation of multiple probing profiles remains a complex task, hindering the quality and reproducibility of modeling efforts. We introduce IPANEMAP, the first automated method for the modeling of RNA structure from multiple probing reactivity profiles. Input profiles can result from experiments based on diverse protocols, reagents, or collection of variants, and are jointly analyzed to predict the dominant conformations of an RNA. IPANEMAP combines sampling, clustering and multi-optimization, to produce secondary structure models that are both stable and well-supported by experimental evidences. The analysis of multiple reactivity profiles, both publicly available and produced in our study, demonstrates the good performances of IPANEMAP, even in a mono probing setting. It confirms the potential of integrating multiple sources of probing data, informing the design of informative probing assays.
Collapse
Affiliation(s)
- Afaf Saaidi
- CNRS UMR 7161, LIX, Ecole Polytechnique, Institut Polytechnique de Paris, 1 rue Estienne d'Orves, 91120 Palaiseau, France
| | - Delphine Allouche
- CNRS UMR 8038, CitCoM, Université de Paris, 4 avenue de l'observatoire, 75006 Paris, France
| | - Mireille Regnier
- CNRS UMR 7161, LIX, Ecole Polytechnique, Institut Polytechnique de Paris, 1 rue Estienne d'Orves, 91120 Palaiseau, France
| | - Bruno Sargueil
- CNRS UMR 8038, CitCoM, Université de Paris, 4 avenue de l'observatoire, 75006 Paris, France
| | - Yann Ponty
- CNRS UMR 7161, LIX, Ecole Polytechnique, Institut Polytechnique de Paris, 1 rue Estienne d'Orves, 91120 Palaiseau, France
| |
Collapse
|
70
|
3dRNA: Building RNA 3D structure with improved template library. Comput Struct Biotechnol J 2020; 18:2416-2423. [PMID: 33005304 PMCID: PMC7508704 DOI: 10.1016/j.csbj.2020.08.017] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 08/18/2020] [Accepted: 08/21/2020] [Indexed: 11/22/2022] Open
Abstract
Most of computational methods of building RNA tertiary structure are template-based. The template-based methods usually can give more accurate 3D structures due to the use of native 3D templates, but they cannot work if the 3D templates are not available. So, a more complete library of the native 3D templates is very important for this type of methods. 3dRNA is a template-based method for building RNA tertiary structure previously proposed by us. In this paper we report improved 3D template libraries of 3dRNA by using two different schemes that give two libraries 3dRNA_Lib1 and 3dRNA_Lib2. These libraries expand the original one by nearly ten times. Benchmark shows that they can significantly increase the accuracy of 3dRNA, especially in building complex and large RNA 3D structures.
Collapse
|
71
|
Yan S, Peck JM, Ilgu M, Nilsen-Hamilton M, Lamm MH. Sampling Performance of Multiple Independent Molecular Dynamics Simulations of an RNA Aptamer. ACS OMEGA 2020; 5:20187-20201. [PMID: 32832772 PMCID: PMC7439393 DOI: 10.1021/acsomega.0c01867] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 07/22/2020] [Indexed: 06/11/2023]
Abstract
Using multiple independent simulations instead of one long simulation has been shown to improve the sampling performance attained with the molecular dynamics (MD) simulation method. However, it is generally not known how long each independent simulation should be, how many independent simulations should be used, or to what extent either of these factors affects the overall sampling performance achieved for a given system. The goal of the present study was to assess the sampling performance of multiple independent MD simulations, where each independent simulation begins from a different initial molecular conformation. For this purpose, we used an RNA aptamer that is 25 nucleotides long as a case study. The initial conformations of the aptamer are derived from six de novo predicted 3D structures. Each of the six de novo predicted structures is energy minimized in solution and equilibrated with MD simulations at high temperature. Ten conformations from these six high-temperature equilibration runs are selected as initial conformations for further simulations at ambient temperature. In total, we conducted 60 independent MD simulations, each with a duration of 100 ns, to study the conformation and dynamics of the aptamer. For each group of 10 independent simulations that originated from a particular de novo predicted structure, we evaluated the potential energy distribution of the RNA and used recurrence quantification analysis to examine the sampling of RNA conformational transitions. To assess the impact of starting from different de novo predicted structures, we computed the density of structure projection on principal components to compare the regions sampled by the different groups of ten independent simulations. The recurrence rate and dependence of initial conformation among the groups were also compared. We stress the necessity of using different initial configurations as simulation starting points by showing long simulations from different initial structures suffer from being trapped in different states. Finally, we summarized the sampling efficiency for the complete set of 60 independent simulations and determined regions of under-sampling on the potential energy landscape. The results suggest that conducting multiple independent simulations using a diverse set of de novo predicted structures is a promising approach to achieve sufficient sampling. This approach avoids undesirable outcomes, such as the problem of the RNA aptamer being trapped in a local minimum. For others wishing to conduct multiple independent simulations, the analysis protocol presented in this study is a guide for examining overall sampling and determining if more simulations are necessary for sufficient sampling.
Collapse
Affiliation(s)
- Shuting Yan
- Department
of Chemical and Biological Engineering, Iowa State University, Ames, Iowa 50011, United States
| | - Jason M. Peck
- Department
of Chemical and Biological Engineering, Iowa State University, Ames, Iowa 50011, United States
| | - Muslum Ilgu
- Roy
J Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, United States
- Aptalogic
Inc., Ames, Iowa 50014, United States
- Department
of Biological Sciences, Middle East Technical
University, Ankara, Ankara 06800, Turkey
| | - Marit Nilsen-Hamilton
- Roy
J Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, United States
- Aptalogic
Inc., Ames, Iowa 50014, United States
| | - Monica H. Lamm
- Department
of Chemical and Biological Engineering, Iowa State University, Ames, Iowa 50011, United States
| |
Collapse
|
72
|
Watkins AM, Rangan R, Das R. FARFAR2: Improved De Novo Rosetta Prediction of Complex Global RNA Folds. Structure 2020; 28:963-976.e6. [PMID: 32531203 PMCID: PMC7415647 DOI: 10.1016/j.str.2020.05.011] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 04/27/2020] [Accepted: 05/20/2020] [Indexed: 01/01/2023]
Abstract
Predicting RNA three-dimensional structures from sequence could accelerate understanding of the growing number of RNA molecules being discovered across biology. Rosetta's Fragment Assembly of RNA with Full-Atom Refinement (FARFAR) has shown promise in community-wide blind RNA-Puzzle trials, but lack of a systematic and automated benchmark has left unclear what limits FARFAR performance. Here, we benchmark FARFAR2, an algorithm integrating RNA-Puzzle-inspired innovations with updated fragment libraries and helix modeling. In 16 of 21 RNA-Puzzles revisited without experimental data or expert intervention, FARFAR2 recovers native-like structures more accurate than models submitted during the RNA-Puzzles trials. Remaining bottlenecks include conformational sampling for >80-nucleotide problems and scoring function limitations more generally. Supporting these conclusions, preregistered blind models for adenovirus VA-I RNA and five riboswitch complexes predicted native-like folds with 3- to 14 Å root-mean-square deviation accuracies. We present a FARFAR2 webserver and three large model archives (FARFAR2-Classics, FARFAR2-Motifs, and FARFAR2-Puzzles) to guide future applications and advances.
Collapse
Affiliation(s)
- Andrew Martin Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Ramya Rangan
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA; Biophysics Program, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
73
|
Miao Z, Adamiak RW, Antczak M, Boniecki MJ, Bujnicki J, Chen SJ, Cheng CY, Cheng Y, Chou FC, Das R, Dokholyan NV, Ding F, Geniesse C, Jiang Y, Joshi A, Krokhotin A, Magnus M, Mailhot O, Major F, Mann TH, Piątkowski P, Pluta R, Popenda M, Sarzynska J, Sun L, Szachniuk M, Tian S, Wang J, Wang J, Watkins AM, Wiedemann J, Xiao Y, Xu X, Yesselman JD, Zhang D, Zhang Y, Zhang Z, Zhao C, Zhao P, Zhou Y, Zok T, Żyła A, Ren A, Batey RT, Golden BL, Huang L, Lilley DM, Liu Y, Patel DJ, Westhof E. RNA-Puzzles Round IV: 3D structure predictions of four ribozymes and two aptamers. RNA (NEW YORK, N.Y.) 2020; 26:982-995. [PMID: 32371455 PMCID: PMC7373991 DOI: 10.1261/rna.075341.120] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 04/03/2020] [Indexed: 05/21/2023]
Abstract
RNA-Puzzles is a collective endeavor dedicated to the advancement and improvement of RNA 3D structure prediction. With agreement from crystallographers, the RNA structures are predicted by various groups before the publication of the crystal structures. We now report the prediction of 3D structures for six RNA sequences: four nucleolytic ribozymes and two riboswitches. Systematic protocols for comparing models and crystal structures are described and analyzed. In these six puzzles, we discuss (i) the comparison between the automated web servers and human experts; (ii) the prediction of coaxial stacking; (iii) the prediction of structural details and ligand binding; (iv) the development of novel prediction methods; and (v) the potential improvements to be made. We show that correct prediction of coaxial stacking and tertiary contacts is essential for the prediction of RNA architecture, while ligand binding modes can only be predicted with low resolution and simultaneous prediction of RNA structure with accurate ligand binding still remains out of reach. All the predicted models are available for the future development of force field parameters and the improvement of comparison and assessment tools.
Collapse
Affiliation(s)
- Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence and Department of Anesthesiology, Shanghai Fourth People's Hospital Affiliated to Tongji University School of Medicine, Shanghai 200081, China
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, United Kingdom
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | - Ryszard W Adamiak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Maciej Antczak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Michał J Boniecki
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz Bujnicki
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Shi-Jie Chen
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Clarence Yu Cheng
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Yi Cheng
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Nikolay V Dokholyan
- Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania, 17033, USA
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, Pennsylvania, 17033, USA
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Yangwei Jiang
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Astha Joshi
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Andrey Krokhotin
- Department of Biochemistry and Biophysics and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
- Departments of Pathology, Genetics and Developmental Biology, Howard Hughes Medical Institute, Stanford Medical School, Palo Alto, California, 94305, USA
| | - Marcin Magnus
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Olivier Mailhot
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec, H3C 3J7, Canada
| | - Francois Major
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec, H3C 3J7, Canada
| | - Thomas H Mann
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Paweł Piątkowski
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Radoslaw Pluta
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Joanna Sarzynska
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Lizhen Sun
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Marta Szachniuk
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Siqi Tian
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Jian Wang
- Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania, 17033, USA
| | - Jun Wang
- School of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Jakub Wiedemann
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Yi Xiao
- School of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Xiaojun Xu
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Joseph D Yesselman
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Dong Zhang
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Yi Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Zhenzhen Zhang
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Chenhan Zhao
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Peinan Zhao
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Yuanzhe Zhou
- Department of Physics and Astronomy, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Missouri 65211, USA
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Adriana Żyła
- International Institute of Molecular and Cell Biology in Warsaw, Księcia Trojdena 4, 02-109 Warsaw, Poland
| | - Aiming Ren
- Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Robert T Batey
- Department of Biochemistry, University of Colorado at Boulder, Campus Box 596, Boulder, Colorado 80309-0596, USA
| | - Barbara L Golden
- Department of Biochemistry, Purdue University, West Lafayette, Indiana 47907, USA
| | - Lin Huang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, P. R. China
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, P. R. China
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dow Street, Dundee DD1 5EH, United Kingdom
| | - David M Lilley
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dow Street, Dundee DD1 5EH, United Kingdom
| | - Yijin Liu
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dow Street, Dundee DD1 5EH, United Kingdom
| | - Dinshaw J Patel
- Structural Biology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA
| | - Eric Westhof
- Arch et Reactivite de l'ARN, Univ de Strasbourg, Inst de Biol Mol et Cell du CNRS, 67084 Strasbourg, France
| |
Collapse
|
74
|
Pucci F, Zerihun MB, Peter EK, Schug A. Evaluating DCA-based method performances for RNA contact prediction by a well-curated data set. RNA (NEW YORK, N.Y.) 2020; 26:794-802. [PMID: 32276988 PMCID: PMC7297115 DOI: 10.1261/rna.073809.119] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Accepted: 03/31/2020] [Indexed: 06/11/2023]
Abstract
RNA molecules play many pivotal roles in a cell that are still not fully understood. Any detailed understanding of RNA function requires knowledge of its three-dimensional structure, yet experimental RNA structure resolution remains demanding. Recent advances in sequencing provide unprecedented amounts of sequence data that can be statistically analyzed by methods such as direct coupling analysis (DCA) to determine spatial proximity or contacts of specific nucleic acid pairs, which improve the quality of structure prediction. To quantify this structure prediction improvement, we here present a well curated data set of about 70 RNA structures of high resolution and compare different nucleotide-nucleotide contact prediction methods available in the literature. We observe only minor differences between the performances of the different methods. Moreover, we discuss how robust these predictions are for different contact definitions and how strongly they depend on procedures used to curate and align the families of homologous RNA sequences.
Collapse
Affiliation(s)
- Fabrizio Pucci
- John von Neumann Institute for Computing, Jülich Supercomputing Centre, Forschungszentrum Jülich, 52428 Jülich, Germany
| | - Mehari B Zerihun
- John von Neumann Institute for Computing, Jülich Supercomputing Centre, Forschungszentrum Jülich, 52428 Jülich, Germany
- Steinbuch Centre for Computing, Karlsruhe Institute of Technology, 76344 Eggenstein-Leopoldshafen, Germany
- Department of Physics, Karlsruhe Institute of Technology, 76344 Eggenstein-Leopoldshafen, Germany
| | - Emanuel K Peter
- John von Neumann Institute for Computing, Jülich Supercomputing Centre, Forschungszentrum Jülich, 52428 Jülich, Germany
| | - Alexander Schug
- John von Neumann Institute for Computing, Jülich Supercomputing Centre, Forschungszentrum Jülich, 52428 Jülich, Germany
| |
Collapse
|
75
|
Zhang H, Zhang L, Mathews DH, Huang L. LinearPartition: linear-time approximation of RNA folding partition function and base-pairing probabilities. Bioinformatics 2020; 36:i258-i267. [PMID: 32657379 PMCID: PMC7355276 DOI: 10.1093/bioinformatics/btaa460] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION RNA secondary structure prediction is widely used to understand RNA function. Recently, there has been a shift away from the classical minimum free energy methods to partition function-based methods that account for folding ensembles and can therefore estimate structure and base pair probabilities. However, the classical partition function algorithm scales cubically with sequence length, and is therefore prohibitively slow for long sequences. This slowness is even more severe than cubic-time free energy minimization due to a substantially larger constant factor in runtime. RESULTS Inspired by the success of our recent LinearFold algorithm that predicts the approximate minimum free energy structure in linear time, we design a similar linear-time heuristic algorithm, LinearPartition, to approximate the partition function and base-pairing probabilities, which is shown to be orders of magnitude faster than Vienna RNAfold and CONTRAfold (e.g. 2.5 days versus 1.3 min on a sequence with length 32 753 nt). More interestingly, the resulting base-pairing probabilities are even better correlated with the ground-truth structures. LinearPartition also leads to a small accuracy improvement when used for downstream structure prediction on families with the longest length sequences (16S and 23S rRNAs), as well as a substantial improvement on long-distance base pairs (500+ nt apart). AVAILABILITY AND IMPLEMENTATION Code: http://github.com/LinearFold/LinearPartition; Server: http://linearfold.org/partition. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- He Zhang
- Baidu Research, Sunnyvale, CA 94089, USA
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| | - Liang Zhang
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| | - David H Mathews
- Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 48306, USA
- Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 48306, USA
- Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 48306, USA
| | - Liang Huang
- Baidu Research, Sunnyvale, CA 94089, USA
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97330, USA
| |
Collapse
|
76
|
Kasprzak WK, Ahmed NA, Shapiro BA. Modeling ligand docking to RNA in the design of RNA-based nanostructures. Curr Opin Biotechnol 2020; 63:16-25. [DOI: 10.1016/j.copbio.2019.10.010] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Accepted: 10/30/2019] [Indexed: 12/30/2022]
|
77
|
Cuturello F, Tiana G, Bussi G. Assessing the accuracy of direct-coupling analysis for RNA contact prediction. RNA (NEW YORK, N.Y.) 2020; 26:637-647. [PMID: 32115426 PMCID: PMC7161351 DOI: 10.1261/rna.074179.119] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 02/26/2020] [Indexed: 05/31/2023]
Abstract
Many noncoding RNAs are known to play a role in the cell directly linked to their structure. Structure prediction based on the sole sequence is, however, a challenging task. On the other hand, thanks to the low cost of sequencing technologies, a very large number of homologous sequences are becoming available for many RNA families. In the protein community, the idea of exploiting the covariance of mutations within a family to predict the protein structure using the direct-coupling-analysis (DCA) method has emerged in the last decade. The application of DCA to RNA systems has been limited so far. We here perform an assessment of the DCA method on 17 riboswitch families, comparing it with the commonly used mutual information analysis and with state-of-the-art R-scape covariance method. We also compare different flavors of DCA, including mean-field, pseudolikelihood, and a proposed stochastic procedure (Boltzmann learning) for solving exactly the DCA inverse problem. Boltzmann learning outperforms the other methods in predicting contacts observed in high-resolution crystal structures.
Collapse
Affiliation(s)
- Francesca Cuturello
- Scuola Internazionale Superiore di Studi Avanzati, International School for Advanced Studies, 34136 Trieste, Italy
| | - Guido Tiana
- Center for Complexity and Biosystems and Department of Physics, Università degli Studi di Milano and INFN, 20133 Milano, Italy
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, International School for Advanced Studies, 34136 Trieste, Italy
| |
Collapse
|
78
|
Wright ES. RNAconTest: comparing tools for noncoding RNA multiple sequence alignment based on structural consistency. RNA (NEW YORK, N.Y.) 2020; 26:531-540. [PMID: 32005745 PMCID: PMC7161358 DOI: 10.1261/rna.073015.119] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 01/28/2020] [Indexed: 05/05/2023]
Abstract
The importance of noncoding RNA sequences has become increasingly clear over the past decade. New RNA families are often detected and analyzed using comparative methods based on multiple sequence alignments. Accordingly, a number of programs have been developed for aligning and deriving secondary structures from sets of RNA sequences. Yet, the best tools for these tasks remain unclear because existing benchmarks contain too few sequences belonging to only a small number of RNA families. RNAconTest (RNA consistency test) is a new benchmarking approach relying on the observation that secondary structure is often conserved across highly divergent RNA sequences from the same family. RNAconTest scores multiple sequence alignments based on the level of consistency among known secondary structures belonging to reference sequences in their output alignment. Similarly, consensus secondary structure predictions are scored according to their agreement with one or more known structures in a family. Comparing the performance of 10 popular alignment programs using RNAconTest revealed that DAFS, DECIPHER, LocARNA, and MAFFT created the most structurally consistent alignments. The best consensus secondary structure predictions were generated by DAFS and LocARNA (via RNAalifold). Many of the methods specific to noncoding RNAs exhibited poor scalability as the number or length of input sequences increased, and several programs displayed substantial declines in score as more sequences were aligned. Overall, RNAconTest provides a means of testing and improving tools for comparative RNA analysis, as well as highlighting the best available approaches. RNAconTest is available from the DECIPHER website (http://DECIPHER.codes/Downloads.html).
Collapse
Affiliation(s)
- Erik S Wright
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania 15219, USA
| |
Collapse
|
79
|
Genetic Biosensor Design for Natural Product Biosynthesis in Microorganisms. Trends Biotechnol 2020; 38:797-810. [PMID: 32359951 DOI: 10.1016/j.tibtech.2020.03.013] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Revised: 03/26/2020] [Accepted: 03/26/2020] [Indexed: 12/28/2022]
Abstract
Low yield and low titer of natural products are common issues in natural product biosynthesis through microbial cell factories. One effective way to resolve such bottlenecks is to design genetic biosensors to monitor and regulate the biosynthesis of target natural products. In this review, we evaluate the most recent advances in the design of genetic biosensors for natural product biosynthesis in microorganisms. In particular, we examine strategies for selection of genetic parts and construction principles for the design and evaluation of genetic biosensors. We also review the latest applications of transcription factor- and riboswitch-based genetic biosensors in natural product biosynthesis. Lastly, we discuss challenges and solutions in designing genetic biosensors for the biosynthesis of natural products in microorganisms.
Collapse
|
80
|
Orengo C, Velankar S, Wodak S, Zoete V, Bonvin AMJJ, Elofsson A, Feenstra KA, Gerloff DL, Hamelryck T, Hancock JM, Helmer-Citterich M, Hospital A, Orozco M, Perrakis A, Rarey M, Soares C, Sussman JL, Thornton JM, Tuffery P, Tusnady G, Wierenga R, Salminen T, Schneider B. A community proposal to integrate structural bioinformatics activities in ELIXIR (3D-Bioinfo Community). F1000Res 2020; 9. [PMID: 32566135 PMCID: PMC7284151 DOI: 10.12688/f1000research.20559.1] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/05/2020] [Indexed: 12/11/2022] Open
Abstract
Structural bioinformatics provides the scientific methods and tools to analyse, archive, validate, and present the biomolecular structure data generated by the structural biology community. It also provides an important link with the genomics community, as structural bioinformaticians also use the extensive sequence data to predict protein structures and their functional sites. A very broad and active community of structural bioinformaticians exists across Europe, and 3D-Bioinfo will establish formal platforms to address their needs and better integrate their activities and initiatives. Our mission will be to strengthen the ties with the structural biology research communities in Europe covering life sciences, as well as chemistry and physics and to bridge the gap between these researchers in order to fully realize the potential of structural bioinformatics. Our Community will also undertake dedicated educational, training and outreach efforts to facilitate this, bringing new insights and thus facilitating the development of much needed innovative applications e.g. for human health, drug and protein design. Our combined efforts will be of critical importance to keep the European research efforts competitive in this respect. Here we highlight the major European contributions to the field of structural bioinformatics, the most pressing challenges remaining and how Europe-wide interactions, enabled by ELIXIR and its platforms, will help in addressing these challenges and in coordinating structural bioinformatics resources across Europe. In particular, we present recent activities and future plans to consolidate an ELIXIR 3D-Bioinfo Community in structural bioinformatics and propose means to develop better links across the community. These include building new consortia, organising workshops to establish data standards and seeking community agreement on benchmark data sets and strategies. We also highlight existing and planned collaborations with other ELIXIR Communities and other European infrastructures, such as the structural biology community supported by Instruct-ERIC, with whom we have synergies and overlapping common interests.
Collapse
Affiliation(s)
- Christine Orengo
- Structural and Molecular Biology Department, University College, London, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, CB10 1SD, UK
| | - Shoshana Wodak
- VIB-VUB Center for Structural Biology, Brussels, Belgium
| | - Vincent Zoete
- Department of Oncology, Lausanne University, Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Alexandre M J J Bonvin
- Bijvoet Center, Faculty of Science - Chemistry, Utrecht University, Utrecht, 3584CH, The Netherlands
| | - Arne Elofsson
- Science for Life Laboratory, Stockholm University, Solna, S-17121, Sweden
| | - K Anton Feenstra
- Dept. Computer Science, Center for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit, Amsterdam, 1081 HV, The Netherlands
| | - Dietland L Gerloff
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
| | - Thomas Hamelryck
- Bioinformatics center, Department of Biology, University of Copenhagen, Copenhagen, DK-2200, Denmark
| | | | | | - Adam Hospital
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, 08028, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, 08028, Spain
| | | | - Matthias Rarey
- ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, D-20146, Germany
| | - Claudio Soares
- Instituto de Tecnologia Química e Biológica Antonio Xavier, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Joel L Sussman
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, CB10 1SD, UK
| | - Pierre Tuffery
- Ressource Parisienne en Bioinformatique Structurale, Université de Paris, Paris, F-75205, France
| | - Gabor Tusnady
- Membrane Bioinformatics Research Group, Institute of Enzymology, Budapest, H-1117, Hungary
| | | | - Tiina Salminen
- Structural Bioinformatics Laboratory, Åbo Akademi University, Turku, FI-20500, Finland
| | - Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, Vestec, CZ-25250, Czech Republic
| |
Collapse
|
81
|
Yan Y, Tao H, He J, Huang SY. The HDOCK server for integrated protein–protein docking. Nat Protoc 2020; 15:1829-1852. [DOI: 10.1038/s41596-020-0312-x] [Citation(s) in RCA: 288] [Impact Index Per Article: 57.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Accepted: 02/03/2020] [Indexed: 12/27/2022]
|
82
|
In silico design and validation of high-affinity RNA aptamers targeting epithelial cellular adhesion molecule dimers. Proc Natl Acad Sci U S A 2020; 117:8486-8493. [PMID: 32234785 DOI: 10.1073/pnas.1913242117] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Nucleic acid aptamers hold great promise for therapeutic applications due to their favorable intrinsic properties, as well as high-throughput experimental selection techniques. Despite the utility of the systematic evolution of ligands by the exponential enrichment (SELEX) method for aptamer determination, complementary in silico aptamer design is highly sought after to facilitate virtual screening and increased understanding of important nucleic acid-protein interactions. Here, with a combined experimental and theoretical approach, we have developed two optimal epithelial cellular adhesion molecule (EpCAM) aptamers. Our structure-based in silico method first predicts their binding modes and then optimizes them for EpCAM with molecular dynamics simulations, docking, and free energy calculations. Our isothermal titration calorimetry experiments further confirm that the EpCAM aptamers indeed exhibit enhanced affinity over a previously patented nanomolar aptamer, EP23. Moreover, our study suggests that EP23 and the de novo designed aptamers primarily bind to EpCAM dimers (and not monomers, as hypothesized in previous published works), suggesting a paradigm for developing EpCAM-targeted therapies.
Collapse
|
83
|
Alaidi O, Aboul‐ela F. Statistical mechanical prediction of ligand perturbation to RNA secondary structure and application to riboswitches. J Comput Chem 2020; 41:1521-1537. [DOI: 10.1002/jcc.26195] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 01/03/2020] [Accepted: 03/09/2020] [Indexed: 02/04/2023]
Affiliation(s)
- Osama Alaidi
- Biocomplexity for Research and Consulting Cairo Egypt
| | - Fareed Aboul‐ela
- Center for X‐Ray Determination of the Structure of MatterZewail City of Science and Technology Giza Egypt
| |
Collapse
|
84
|
Mitchell C, Polanco JA, DeWald L, Kress D, Jaeger L, Grabow WW. Responsive self-assembly of tectoRNAs with loop-receptor interactions from the tetrahydrofolate (THF) riboswitch. Nucleic Acids Res 2020; 47:6439-6451. [PMID: 31045210 PMCID: PMC6614920 DOI: 10.1093/nar/gkz304] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2018] [Revised: 03/22/2019] [Accepted: 04/16/2019] [Indexed: 12/17/2022] Open
Abstract
Naturally occurring RNAs are known to exhibit a high degree of modularity, whereby specific structural modules (or motifs) can be mixed and matched to create new molecular architectures. The modular nature of RNA also affords researchers the ability to characterize individual structural elements in controlled synthetic contexts in order to gain new and critical insights into their particular structural features and overall performance. Here, we characterized the binding affinity of a unique loop–receptor interaction found in the tetrahydrofolate (THF) riboswitch using rationally designed self-assembling tectoRNAs. Our work suggests that the THF loop–receptor interaction has been fine-tuned for its particular role as a riboswitch component. We also demonstrate that the thermodynamic stability of this interaction can be modulated by the presence of folinic acid, which induces a local structural change at the level of the loop–receptor. This corroborates the existence of a THF binding site within this tertiary module and paves the way for its potential use as a THF responsive module for RNA nanotechnology and synthetic biology.
Collapse
Affiliation(s)
- Charles Mitchell
- Department of Chemistry and Biochemistry, Seattle Pacific University, Seattle, WA 918119-1997, USA
| | - Julio A Polanco
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Laura DeWald
- Department of Chemistry and Biochemistry, Seattle Pacific University, Seattle, WA 918119-1997, USA
| | - Dustin Kress
- Department of Chemistry and Biochemistry, Seattle Pacific University, Seattle, WA 918119-1997, USA
| | - Luc Jaeger
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Wade W Grabow
- Department of Chemistry and Biochemistry, Seattle Pacific University, Seattle, WA 918119-1997, USA
| |
Collapse
|
85
|
Zhang Z, Xiong P, Zhang T, Wang J, Zhan J, Zhou Y. Accurate inference of the full base-pairing structure of RNA by deep mutational scanning and covariation-induced deviation of activity. Nucleic Acids Res 2020; 48:1451-1465. [PMID: 31872260 PMCID: PMC7026644 DOI: 10.1093/nar/gkz1192] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 12/10/2019] [Accepted: 12/11/2019] [Indexed: 11/12/2022] Open
Abstract
Despite the large number of noncoding RNAs in human genome and their roles in many diseases include cancer, we know very little about them due to lack of structural clues. The centerpiece of the structural clues is the full RNA base-pairing structure of secondary and tertiary contacts that can be precisely obtained only from costly and time-consuming 3D structure determination. Here, we performed deep mutational scanning of self-cleaving CPEB3 ribozyme by error-prone PCR and showed that a library of <5 × 104 single-to-triple mutants is sufficient to infer 25 of 26 base pairs including non-nested, nonhelical, and noncanonical base pairs with both sensitivity and precision at 96%. Such accurate inference was further confirmed by a twister ribozyme at 100% precision with only noncanonical base pairs as false negatives. The performance was resulted from analyzing covariation-induced deviation of activity by utilizing both functional and nonfunctional variants for unsupervised classification, followed by Monte Carlo (MC) simulated annealing with mutation-derived scores. Highly accurate inference can also be obtained by combining MC with evolution/direct coupling analysis, R-scape or epistasis analysis. The results highlight the usefulness of deep mutational scanning for high-accuracy structural inference of self-cleaving ribozymes with implications for other structured RNAs that permit high-throughput functional selections.
Collapse
Affiliation(s)
- Zhe Zhang
- High Magnetic Field Laboratory, Key Laboratory of High Magnetic Field and Ion Beam Physical Biology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, Anhui, P. R. China
- University of Chinese Academy of Sciences, Beijing 101408, P. R. China
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Peng Xiong
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Tongchuan Zhang
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Junfeng Wang
- High Magnetic Field Laboratory, Key Laboratory of High Magnetic Field and Ion Beam Physical Biology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, Anhui, P. R. China
- Institute of Physical Science and Information Technology, Anhui University, Hefei 230031, Anhui, P. R. China
| | - Jian Zhan
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Yaoqi Zhou
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| |
Collapse
|
86
|
Magnus M, Antczak M, Zok T, Wiedemann J, Lukasiak P, Cao Y, Bujnicki JM, Westhof E, Szachniuk M, Miao Z. RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools. Nucleic Acids Res 2020; 48:576-588. [PMID: 31799609 PMCID: PMC7145511 DOI: 10.1093/nar/gkz1108] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 11/06/2019] [Accepted: 11/15/2019] [Indexed: 12/12/2022] Open
Abstract
Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods during the succeeding challenges of RNA-Puzzles, a community-wide effort on the assessment of blind prediction of RNA tertiary structures. The RNA-Puzzles contest has shown, among others, that the development and validation of computational methods for RNA fold prediction strongly depend on the benchmark datasets and the structure comparison algorithms. Yet, there has been no systematic benchmark set or decoy structures available for the 3D structure prediction of RNA, hindering the standardization of comparative tests in the modeling of RNA structure. Furthermore, there has not been a unified set of tools that allows deep and complete RNA structure analysis, and at the same time, that is easy to use. Here, we present RNA-Puzzles toolkit, a computational resource including (i) decoy sets generated by different RNA 3D structure prediction methods (raw, for-evaluation and standardized datasets), (ii) 3D structure normalization, analysis, manipulation, visualization tools (RNA_format, RNA_normalizer, rna-tools) and (iii) 3D structure comparison metric tools (RNAQUA, MCQ4Structures). This resource provides a full list of computational tools as well as a standard RNA 3D structure prediction assessment protocol for the community.
Collapse
Affiliation(s)
- Marcin Magnus
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- ReMedy-International Research Agenda Unit, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Maciej Antczak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
| | - Jakub Wiedemann
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Piotr Lukasiak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, PR China
| | - Janusz M Bujnicki
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 12 allée Konrad Roentgen, 67084 Strasbourg, France
| | - Marta Szachniuk
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence and Department of Anesthesiology, Shanghai Fourth People's Hospital Affiliated to Tongji University School of Medicine, Shanghai 200081, China
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
87
|
Kim DN, Thiel BC, Mrozowich T, Hennelly SP, Hofacker IL, Patel TR, Sanbonmatsu KY. Zinc-finger protein CNBP alters the 3-D structure of lncRNA Braveheart in solution. Nat Commun 2020; 11:148. [PMID: 31919376 PMCID: PMC6952434 DOI: 10.1038/s41467-019-13942-4] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2019] [Accepted: 12/09/2019] [Indexed: 02/08/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) constitute a significant fraction of the transcriptome, playing important roles in development and disease. However, our understanding of structure-function relationships for this emerging class of RNAs has been limited to secondary structures. Here, we report the 3-D atomistic structural study of epigenetic lncRNA, Braveheart (Bvht), and its complex with CNBP (Cellular Nucleic acid Binding Protein). Using small angle X-ray scattering (SAXS), we elucidate the ensemble of Bvht RNA conformations in solution, revealing that Bvht lncRNA has a well-defined, albeit flexible 3-D structure that is remodeled upon CNBP binding. Our study suggests that CNBP binding requires multiple domains of Bvht and the RHT/AGIL RNA motif. We show that RHT/AGIL, previously shown to interact with CNBP, contains a highly flexible loop surrounded by more ordered helices. As one of the largest RNA-only 3-D studies, the work lays the foundation for future structural studies of lncRNA-protein complexes.
Collapse
Affiliation(s)
- Doo Nam Kim
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, USA
| | - Bernhard C Thiel
- Department of Theoretical Chemistry, University of Vienna, Vienna, Austria
| | - Tyler Mrozowich
- Alberta RNA Research & Training Institute, Department of Chemistry and Biochemistry, University of Lethbridge, Lethbridge, Alberta, Canada
| | - Scott P Hennelly
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, USA
- New Mexico Consortium, Los Alamos, New Mexico, USA
| | - Ivo L Hofacker
- Department of Theoretical Chemistry, University of Vienna, Vienna, Austria
- Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, Austria
| | - Trushar R Patel
- Alberta RNA Research & Training Institute, Department of Chemistry and Biochemistry, University of Lethbridge, Lethbridge, Alberta, Canada.
- Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, Alberta, Canada.
| | - Karissa Y Sanbonmatsu
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, USA.
- New Mexico Consortium, Los Alamos, New Mexico, USA.
| |
Collapse
|
88
|
Singh J, Hanson J, Paliwal K, Zhou Y. RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning. Nat Commun 2019; 10:5407. [PMID: 31776342 PMCID: PMC6881452 DOI: 10.1038/s41467-019-13395-9] [Citation(s) in RCA: 181] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 11/01/2019] [Indexed: 01/03/2023] Open
Abstract
The majority of our human genome transcribes into noncoding RNAs with unknown structures and functions. Obtaining functional clues for noncoding RNAs requires accurate base-pairing or secondary-structure prediction. However, the performance of such predictions by current folding-based algorithms has been stagnated for more than a decade. Here, we propose the use of deep contextual learning for base-pair prediction including those noncanonical and non-nested (pseudoknot) base pairs stabilized by tertiary interactions. Since only [Formula: see text]250 nonredundant, high-resolution RNA structures are available for model training, we utilize transfer learning from a model initially trained with a recent high-quality bpRNA dataset of [Formula: see text]10,000 nonredundant RNAs made available through comparative analysis. The resulting method achieves large, statistically significant improvement in predicting all base pairs, noncanonical and non-nested base pairs in particular. The proposed method (SPOT-RNA), with a freely available server and standalone software, should be useful for improving RNA structure modeling, sequence alignment, and functional annotations.
Collapse
Affiliation(s)
- Jaswinder Singh
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia
| | - Jack Hanson
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia.
| | - Yaoqi Zhou
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr., Southport, QLD, 4222, Australia.
| |
Collapse
|
89
|
Richardson KE, Adams MS, Kirkpatrick CC, Gohara DW, Znosko BM. Identification and Characterization of New RNA Tetraloop Sequence Families. Biochemistry 2019; 58:4809-4820. [PMID: 31714066 DOI: 10.1021/acs.biochem.9b00535] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
There is an abundance of RNA sequence information available due to the efforts of sequencing projects. However, current techniques implemented to solve the tertiary structures of RNA, such as NMR and X-ray crystallography, are difficult and time-consuming. Therefore, biophysical techniques are not able to keep pace with the abundance of sequence information available. Because of this, there is a need to develop quick and efficient ways to predict RNA tertiary structure from sequence. One promising approach is to identify structural patterns within previously solved 3D structures and apply these patterns to new sequences. RNA tetraloops are one of the most common naturally occurring secondary structure motifs. Here, we use RNA Characterization of Secondary Structure Motifs (CoSSMos), Dissecting the Spatial Structure of RNA (DSSR), and a bioinformatic approach to search for and characterize tertiary structure patterns among tetraloops. Not surprising, we identified the well-known GNRA and UNCG tetraloops, as well as the previously identified RNYA tetraloop. However, some previously identified characteristics of these families were not observed in this data set, and some new characteristics were identified. In addition, we also identified and characterized three new tetraloop sequence families: YGAR, UGGU, and RMSA. This new structural information sheds light on the tertiary structure of tetraloops and contributes to the efforts of RNA tertiary structure prediction from sequence.
Collapse
Affiliation(s)
- Katherine E Richardson
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Miranda S Adams
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Charles C Kirkpatrick
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - David W Gohara
- Department of Biochemistry and Molecular Biology , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Brent M Znosko
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| |
Collapse
|
90
|
Jin L, Tan YL, Wu Y, Wang X, Shi YZ, Tan ZJ. Structure folding of RNA kissing complexes in salt solutions: predicting 3D structure, stability, and folding pathway. RNA (NEW YORK, N.Y.) 2019; 25:1532-1548. [PMID: 31391217 PMCID: PMC6795135 DOI: 10.1261/rna.071662.119] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 08/02/2019] [Indexed: 05/08/2023]
Abstract
RNA kissing complexes are essential for genomic RNA dimerization and regulation of gene expression, and their structures and stability are critical to their biological functions. In this work, we used our previously developed coarse-grained model with an implicit structure-based electrostatic potential to predict three-dimensional (3D) structures and stability of RNA kissing complexes in salt solutions. For extensive RNA kissing complexes, our model shows great reliability in predicting 3D structures from their sequences, and our additional predictions indicate that the model can capture the dependence of 3D structures of RNA kissing complexes on monovalent/divalent ion concentrations. Moreover, the comparisons with extensive experimental data show that the model can make reliable predictions on the stability for various RNA kissing complexes over wide ranges of monovalent/divalent ion concentrations. Notably, for RNA kissing complexes, our further analyses show the important contribution of coaxial stacking to the 3D structures and stronger stability than the corresponding kissing-interface duplexes at high salts. Furthermore, our comprehensive analyses for RNA kissing complexes reveal that the thermally folding pathway for a complex sequence is mainly determined by the relative stability of two possible folded states of kissing complex and extended duplex, which can be significantly modulated by its sequence.
Collapse
Affiliation(s)
- Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Yao Wu
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Xunxun Wang
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan 430073, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
91
|
Zhang T, Hu G, Yang Y, Wang J, Zhou Y. All-Atom Knowledge-Based Potential for RNA Structure Discrimination Based on the Distance-Scaled Finite Ideal-Gas Reference State. J Comput Biol 2019; 27:856-867. [PMID: 31638408 DOI: 10.1089/cmb.2019.0251] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Noncoding RNAs are increasingly found to play a wide variety of roles in living organisms. Yet, their functional mechanisms are poorly understood because their structures are difficult to determine experimentally. As a result, developing more effective computational techniques to predict RNA structures becomes increasingly an urgent task. One key challenge in RNA structure prediction is the lack of an accurate free energy function to guide RNA folding and discriminate native and near-native structures from decoy conformations. In this study, we developed an all-atom distance-dependent knowledge-based energy function for RNA that is based on a reference state (distance-scaled finite ideal-gas reference state, DFIRE) proven successful for protein structure discrimination. Using four separate benchmarks including RNA puzzles, we found that this DFIRE-based RNA statistical energy function is able to discriminate native and near-native structures against decoys with performance comparable with or better than several existing scoring functions compared. The energy function is expected to be useful for improving the detection of RNA near-native structures.
Collapse
Affiliation(s)
- Tongchuan Zhang
- Institute for Glycomics, School of Informatics and Communication Technology, Griffith University, Southport, Australia
| | - Guodong Hu
- Shandong Provincial Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Yuedong Yang
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, China
| | - Jihua Wang
- Shandong Provincial Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Yaoqi Zhou
- Institute for Glycomics, School of Informatics and Communication Technology, Griffith University, Southport, Australia.,Shandong Provincial Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| |
Collapse
|
92
|
Magnus M, Kappel K, Das R, Bujnicki JM. RNA 3D structure prediction guided by independent folding of homologous sequences. BMC Bioinformatics 2019; 20:512. [PMID: 31640563 PMCID: PMC6806525 DOI: 10.1186/s12859-019-3120-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Accepted: 10/01/2019] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND The understanding of the importance of RNA has dramatically changed over recent years. As in the case of proteins, the function of an RNA molecule is encoded in its tertiary structure, which in turn is determined by the molecule's sequence. The prediction of tertiary structures of complex RNAs is still a challenging task. RESULTS Using the observation that RNA sequences from the same RNA family fold into conserved structure, we test herein whether parallel modeling of RNA homologs can improve ab initio RNA structure prediction. EvoClustRNA is a multi-step modeling process, in which homologous sequences for the target sequence are selected using the Rfam database. Subsequently, independent folding simulations using Rosetta FARFAR and SimRNA are carried out. The model of the target sequence is selected based on the most common structural arrangement of the common helical fragments. As a test, on two blind RNA-Puzzles challenges, EvoClustRNA predictions ranked as the first of all submissions for the L-glutamine riboswitch and as the second for the ZMP riboswitch. Moreover, through a benchmark of known structures, we discovered several cases in which particular homologs were unusually amenable to structure recovery in folding simulations compared to the single original target sequence. CONCLUSION This work, for the first time to our knowledge, demonstrates the importance of the selection of the target sequence from an alignment of an RNA family for the success of RNA 3D structure prediction. These observations prompt investigations into a new direction of research for checking 3D structure "foldability" or "predictability" of related RNA sequences to obtain accurate predictions. To support new research in this area, we provide all relevant scripts in a documented and ready-to-use form. By exploring new ideas and identifying limitations of the current RNA 3D structure prediction methods, this work is bringing us closer to the near-native computational RNA 3D models.
Collapse
Affiliation(s)
- Marcin Magnus
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA USA
| | - Rhiju Das
- Biophysics Program, Stanford University, Stanford, CA USA
- Department of Biochemistry, Stanford University, Stanford, CA USA
- Department of Physics, Stanford University, Stanford, CA USA
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Poznan, Poland
| |
Collapse
|
93
|
Wang K, Jian Y, Wang H, Zeng C, Zhao Y. RBind: computational network method to predict RNA binding sites. Bioinformatics 2019; 34:3131-3136. [PMID: 29718097 DOI: 10.1093/bioinformatics/bty345] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 04/24/2018] [Indexed: 12/21/2022] Open
Abstract
Motivation Non-coding RNA molecules play essential roles by interacting with other molecules to perform various biological functions. However, it is difficult to determine RNA structures due to their flexibility. At present, the number of experimentally solved RNA-ligand and RNA-protein structures is still insufficient. Therefore, binding sites prediction of non-coding RNA is required to understand their functions. Results Current RNA binding site prediction algorithms produce many false positive nucleotides that are distance away from the binding sites. Here, we present a network approach, RBind, to predict the RNA binding sites. We benchmarked RBind in RNA-ligand and RNA-protein datasets. The average accuracy of 0.82 in RNA-ligand and 0.63 in RNA-protein testing showed that this network strategy has a reliable accuracy for binding sites prediction. Availability and implementation The codes and datasets are available at https://zhaolab.com.cn/RBind. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kaili Wang
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, China
| | - Yiren Jian
- Department of Physics, The George Washington University, Washington, DC, USA
| | - Huiwen Wang
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, China
| | - Chen Zeng
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, China.,Department of Physics, The George Washington University, Washington, DC, USA
| | - Yunjie Zhao
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, China
| |
Collapse
|
94
|
Yesselman JD, Eiler D, Carlson ED, Gotrik MR, d'Aquino AE, Ooms AN, Kladwang W, Carlson PD, Shi X, Costantino DA, Herschlag D, Lucks JB, Jewett MC, Kieft JS, Das R. Computational design of three-dimensional RNA structure and function. NATURE NANOTECHNOLOGY 2019; 14:866-873. [PMID: 31427748 PMCID: PMC7324284 DOI: 10.1038/s41565-019-0517-8] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 06/24/2019] [Indexed: 05/30/2023]
Abstract
RNA nanotechnology seeks to create nanoscale machines by repurposing natural RNA modules. The field is slowed by the current need for human intuition during three-dimensional structural design. Here, we demonstrate that three distinct problems in RNA nanotechnology can be reduced to a pathfinding problem and automatically solved through an algorithm called RNAMake. First, RNAMake discovers highly stable single-chain solutions to the classic problem of aligning a tetraloop and its sequence-distal receptor, with experimental validation from chemical mapping, gel electrophoresis, solution X-ray scattering and crystallography with 2.55 Å resolution. Second, RNAMake automatically generates structured tethers that integrate 16S and 23S ribosomal RNAs into single-chain ribosomal RNAs that remain uncleaved by ribonucleases and assemble onto messenger RNA. Third, RNAMake enables the automated stabilization of small-molecule binding RNAs, with designed tertiary contacts that improve the binding affinity of the ATP aptamer and improve the fluorescence and stability of the Spinach RNA in cell extracts and in living Escherichia coli cells.
Collapse
Affiliation(s)
- Joseph D Yesselman
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Daniel Eiler
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver School of Medicine, Aurora, CO, USA
| | - Erik D Carlson
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
| | - Michael R Gotrik
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Anne E d'Aquino
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL, USA
| | - Alexandra N Ooms
- Department of Cancer Genetics & Genomics, Stanford University School of Medicine, Stanford, CA, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Paul D Carlson
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
| | - Xuesong Shi
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - David A Costantino
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver School of Medicine, Aurora, CO, USA
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
- Department of Chemistry, Stanford University School of Medicine, Stanford, CA, USA
- Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health), Stanford University, Stanford, CA, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL, USA
| | - Michael C Jewett
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL, USA
| | - Jeffrey S Kieft
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver School of Medicine, Aurora, CO, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Physics, Stanford University, Stanford, CA, USA.
| |
Collapse
|
95
|
Polaski JT, Kletzien OA, Drogalis LK, Batey RT. A functional genetic screen reveals sequence preferences within a key tertiary interaction in cobalamin riboswitches required for ligand selectivity. Nucleic Acids Res 2019; 46:9094-9105. [PMID: 29945209 PMCID: PMC6158498 DOI: 10.1093/nar/gky539] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 05/30/2018] [Indexed: 01/14/2023] Open
Abstract
Riboswitches are structured mRNA sequences that regulate gene expression by directly binding intracellular metabolites. Generating the appropriate regulatory response requires the RNA rapidly and stably acquire higher-order structure to form the binding pocket, bind the appropriate effector molecule and undergo a structural transition to inform the expression machinery. These requirements place riboswitches under strong kinetic constraints, likely restricting the sequence space accessible by recurrent structural modules such as the kink turn and the T-loop. Class-II cobalamin riboswitches contain two T-loop modules: one directing global folding of the RNA and another buttressing the ligand binding pocket. While the T-loop module directing folding is highly conserved, the T-loop associated with binding is substantially less so, with no clear consensus sequence. To further understand the functional role of the binding-associated module, a functional genetic screen of a library of riboswitches with the T-loop and its interacting nucleotides was used to build an experimental phylogeny comprised of sequences that possess a wide range of cobalamin-dependent regulatory activity. Our results reveal conservation patterns of the T-loop and its interaction with the binding core that allow for rapid tertiary structure formation and demonstrate its importance for generating strong ligand-dependent repression of mRNA expression.
Collapse
Affiliation(s)
- Jacob T Polaski
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309, USA
| | - Otto A Kletzien
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309, USA
| | - Lea K Drogalis
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309, USA
| | - Robert T Batey
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309, USA
| |
Collapse
|
96
|
Zok T, Antczak M, Zurkowski M, Popenda M, Blazewicz J, Adamiak RW, Szachniuk M. RNApdbee 2.0: multifunctional tool for RNA structure annotation. Nucleic Acids Res 2019; 46:W30-W35. [PMID: 29718468 PMCID: PMC6031003 DOI: 10.1093/nar/gky314] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 04/14/2018] [Indexed: 01/07/2023] Open
Abstract
In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures—including that of quadruplexes—with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/
Collapse
Affiliation(s)
- Tomasz Zok
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Poznan Supercomputing and Networking Center, Jana Pawla II 10, 61-139 Poznan, Poland
| | - Maciej Antczak
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Michal Zurkowski
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Jacek Blazewicz
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Ryszard W Adamiak
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| |
Collapse
|
97
|
Tan YL, Feng CJ, Jin L, Shi YZ, Zhang W, Tan ZJ. What is the best reference state for building statistical potentials in RNA 3D structure evaluation? RNA (NEW YORK, N.Y.) 2019; 25:793-812. [PMID: 30996105 PMCID: PMC6573789 DOI: 10.1261/rna.069872.118] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Accepted: 04/06/2019] [Indexed: 05/14/2023]
Abstract
Knowledge-based statistical potentials have been shown to be efficient in protein structure evaluation/prediction, and the core difference between various statistical potentials is attributed to the choice of reference states. However, for RNA 3D structure evaluation, a comprehensive examination on reference states is still lacking. In this work, we built six statistical potentials based on six reference states widely used in protein structure evaluation, including averaging, quasi-chemical approximation, atom-shuffled, finite-ideal-gas, spherical-noninteracting, and random-walk-chain reference states, and we examined the six reference states against three RNA test sets including six subsets. Our extensive examinations show that, overall, for identifying native structures and ranking decoy structures, the finite-ideal-gas and random-walk-chain reference states are slightly superior to others, while for identifying near-native structures, there is only a slight difference between these reference states. Our further analyses show that the performance of a statistical potential is apparently dependent on the quality of the training set. Furthermore, we found that the performance of a statistical potential is closely related to the origin of test sets, and for the three realistic test subsets, the six statistical potentials have overall unsatisfactory performance. This work presents a comprehensive examination on the existing reference states and statistical potentials for RNA 3D structure evaluation.
Collapse
Affiliation(s)
- Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Chen-Jie Feng
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan 430073, China
| | - Wenbing Zhang
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
98
|
Wang J, Williams B, Chirasani VR, Krokhotin A, Das R, Dokholyan NV. Limits in accuracy and a strategy of RNA structure prediction using experimental information. Nucleic Acids Res 2019; 47:5563-5572. [PMID: 31106330 PMCID: PMC6582333 DOI: 10.1093/nar/gkz427] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 05/03/2019] [Accepted: 05/08/2019] [Indexed: 01/22/2023] Open
Abstract
RNA structural complexity and flexibility present a challenge for computational modeling efforts. Experimental information and bioinformatics data can be used as restraints to improve the accuracy of RNA tertiary structure prediction. Regarding utilization of restraints, the fundamental questions are: (i) What is the limit in prediction accuracy that one can achieve with arbitrary number of restraints? (ii) Is there a strategy for selection of the minimal number of restraints that would result in the best structural model? We address the first question by testing the limits in prediction accuracy using native contacts as restraints. To address the second question, we develop an algorithm based on the distance variation allowed by secondary structure (DVASS), which ranks restraints according to their importance to RNA tertiary structure prediction. We find that due to kinetic traps, the greatest improvement in the structure prediction accuracy is achieved when we utilize only 40-60% of the total number of native contacts as restraints. When the restraints are sorted by DVASS algorithm, using only the first 20% ranked restraints can greatly improve the prediction accuracy. Our findings suggest that only a limited number of strategically selected distance restraints can significantly assist in RNA structure modeling.
Collapse
Affiliation(s)
- Jian Wang
- Department of Pharmacology, Penn State University College of Medicine, Hershey, PA 17033, USA
| | - Benfeard Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA
| | - Venkata R Chirasani
- Department of Pharmacology, Penn State University College of Medicine, Hershey, PA 17033, USA
| | - Andrey Krokhotin
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA
| | - Rajeshree Das
- Weinberg College of Arts and Sciences, Northwestern University, Evanston, IL 60208, USA
| | - Nikolay V Dokholyan
- Department of Pharmacology, Penn State University College of Medicine, Hershey, PA 17033, USA
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA
- Department of Biochemistry and Molecular Biology, Penn State University College of Medicine, Hershey, PA 17033, USA
- Department of Chemistry, Penn State University, University Park, PA 16802, USA
- Department of Biomedical Engineering, Penn State University, University Park, PA 16802, USA
| |
Collapse
|
99
|
Abstract
The three-dimensional structures of RNA molecules provide rich and often critical information for understanding their functions, including how they recognize small molecule and protein partners. Computational modeling of RNA 3D structure is becoming increasingly accurate, particularly with the availability of growing numbers of template structures already solved experimentally and the development of sequence alignment and 3D modeling tools to take advantage of this database. For several recent "RNA puzzle" blind modeling challenges, we have successfully identified useful template structures and achieved accurate structure predictions through homology modeling tools developed in the Rosetta software suite. We describe our semi-automated methodology here and walk through two illustrative examples: an adenine riboswitch aptamer, modeled from a template guanine riboswitch structure, and a SAM I/IV riboswitch aptamer, modeled from a template SAM I riboswitch structure.
Collapse
Affiliation(s)
- Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, United States
| | - Ramya Rangan
- Biophysics Program, Stanford University, Stanford, CA, United States
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, United States; Biophysics Program, Stanford University, Stanford, CA, United States.
| |
Collapse
|
100
|
Pucci F, Schug A. Shedding light on the dark matter of the biomolecular structural universe: Progress in RNA 3D structure prediction. Methods 2019; 162-163:68-73. [DOI: 10.1016/j.ymeth.2019.04.012] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Revised: 04/12/2019] [Accepted: 04/22/2019] [Indexed: 11/25/2022] Open
|