1
|
Bu F, Adam Y, Adamiak RW, Antczak M, de Aquino BRH, Badepally NG, Batey RT, Baulin EF, Boinski P, Boniecki MJ, Bujnicki JM, Carpenter KA, Chacon J, Chen SJ, Chiu W, Cordero P, Das NK, Das R, Dawson WK, DiMaio F, Ding F, Dock-Bregeon AC, Dokholyan NV, Dror RO, Dunin-Horkawicz S, Eismann S, Ennifar E, Esmaeeli R, Farsani MA, Ferré-D'Amaré AR, Geniesse C, Ghanim GE, Guzman HV, Hood IV, Huang L, Jain DS, Jaryani F, Jin L, Joshi A, Karelina M, Kieft JS, Kladwang W, Kmiecik S, Koirala D, Kollmann M, Kretsch RC, Kurciński M, Li J, Li S, Magnus M, Masquida B, Moafinejad SN, Mondal A, Mukherjee S, Nguyen THD, Nikolaev G, Nithin C, Nye G, Pandaranadar Jeyeram IPN, Perez A, Pham P, Piccirilli JA, Pilla SP, Pluta R, Poblete S, Ponce-Salvatierra A, Popenda M, Popenda L, Pucci F, Rangan R, Ray A, Ren A, Sarzynska J, Sha CM, Stefaniak F, Su Z, Suddala KC, Szachniuk M, Townshend R, Trachman RJ, Wang J, Wang W, Watkins A, Wirecki TK, Xiao Y, Xiong P, Xiong Y, Yang J, Yesselman JD, Zhang J, Zhang Y, Zhang Z, Zhou Y, Zok T, Zhang D, Zhang S, Żyła A, Westhof E, Miao Z. RNA-Puzzles Round V: blind predictions of 23 RNA structures. Nat Methods 2025; 22:399-411. [PMID: 39623050 PMCID: PMC11810798 DOI: 10.1038/s41592-024-02543-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 10/29/2024] [Indexed: 01/16/2025]
Abstract
RNA-Puzzles is a collective endeavor dedicated to the advancement and improvement of RNA three-dimensional structure prediction. With agreement from structural biologists, RNA structures are predicted by modeling groups before publication of the experimental structures. We report a large-scale set of predictions by 18 groups for 23 RNA-Puzzles: 4 RNA elements, 2 Aptamers, 4 Viral elements, 5 Ribozymes and 8 Riboswitches. We describe automatic assessment protocols for comparisons between prediction and experiment. Our analyses reveal some critical steps to be overcome to achieve good accuracy in modeling RNA structures: identification of helix-forming pairs and of non-Watson-Crick modules, correct coaxial stacking between helices and avoidance of entanglements. Three of the top four modeling groups in this round also ranked among the top four in the CASP15 contest.
Collapse
Grants
- T32 GM066706 NIGMS NIH HHS
- NSFC T2225007 National Natural Science Foundation of China (National Science Foundation of China)
- R35 GM134919 NIGMS NIH HHS
- R35GM145409 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R35 GM145409 NIGMS NIH HHS
- 32270707 National Natural Science Foundation of China (National Science Foundation of China)
- R35 GM122579 NIGMS NIH HHS
- R35 GM134864 NIGMS NIH HHS
- T32 grant GM066706 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- P20GM121342 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R21 CA219847 NCI NIH HHS
- 32171191 National Natural Science Foundation of China (National Science Foundation of China)
- P20 GM121342 NIGMS NIH HHS
- R35 GM152029 NIGMS NIH HHS
- R01 GM073850 NIGMS NIH HHS
- F32 GM112294 NIGMS NIH HHS
- ZIA DK075136 Intramural NIH HHS
- Z.M. is supported by Major Projects of Guangzhou National Laboratory, (Grant No. GZNL2023A01006, GZNL2024A01002, SRPG22-003, SRPG22-006, SRPG22-007, HWYQ23-003, YW-YFYJ0102), the National Key R&D Programs of China (2023YFF1204700, 2023YFF1204701, 2021YFF1200900, 2021YFF1200903). This work is part of the ITI 2021-2028 program and supported by IdEx Unistra (ANR-10-IDEX-0002 to E.W.), SFRI-STRAT’US project (ANR-20-SFRI-0012) and EUR IMCBio (IMCBio ANR-17-EURE-0023 to E.W.) under the framework of the French Investments for the Future Program.
- E.W. acknowledges also support from Wenzhou Institute, University of Chinese Academy of Sciences (WIUCASQD2024002).
- E.F.B. was additionally supported by European Molecular Biology Organization (EMBO) fellowship (ALTF 525-2022).
- Boniecki’s research was supported by the Polish National Science Center Poland (NCN) (grant 2016/23/B/ST6/03433 to Michal J. Boniecki). Predictions were performed using computational resources of the Interdisciplinary Centre for Mathematical and Computational Modelling of the University of Warsaw (ICM) (grant G66-9).
- J.M.B. is supported by the National Science Centre in Poland (NCN grants: 2017/26/A/NZ1/01083 to J.M.B., 2021/43/D/NZ1/03360 to S.M., 2020/39/B/NZ2/03127 to F.S., 2020/39/D/NZ2/02837 to T.K.W.). J.M.B. acknowledge Poland high-performance computing Infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH, PCSS, CI TASK, WCSS) for providing computer facilities and support within the computational grant PLG/2023/016080.
- S.J.C. is supported by the National Institutes of Health under Grant R35-GM134919.
- R.D. is supported by Stanford Bio-X (to R.D., R.O.D., R.C.K., and S.E.); Stanford Gerald J. Lieberman Fellowship (to R.R.); the National Institutes of Health (R21 CA219847 and R35 GM122579 to R.D.), the Howard Hughes Medical Institute (HHMI, to R.D.); Consejo Nacional de Ciencia y Tecnología CONACyT Fellowship 312765 (P.C.); the Ruth L. Kirschstein National Research Service Award Postdoctoral Fellowships GM112294 (to J.D.Y.); National Science Foundation Graduate Research Fellowships (R.J.L.T. and R.R.); the National Library of Medicine T15 Training Grant (NLM T15007033 to K.A.C.); the U.S. Department of Energy, Office of Science Graduate Student Research program (R.J.L.T.).
- The National Institutes of Health grants 1R35 GM134864 and the Passan Foundation.
- R.O.D. is supported by the U.S. Department of Energy, Office of Science, Scientific Discovery through Advanced Computing (SciDAC) program (R.O.D.); Intel (R.O.D.).
- A.F.D. is supported, in part, by the intramural program of the National Heart, Lung and Blood Institute, National Institutes of Health, USA.
- Guangdong Science and Technology Department (2022A1515010328, 2023B1212060013, 2020B1212030004), Fundamental Research Funds for the Central Universities, Sun Yat-sen University (23ptpy41).
- D.K. is supported by the NSF CAREER award MCB-2236996, and start-up, SURFF, and START awards from the University of Maryland Baltimore County to D.K.
- BM is supported by the Interdisciplinary Thematic Institute IMCBio, as part of the ITI 2021-2028 program at the University of Strasbourg, CNRS and Inserm, by IdEx Unistra (ANR-10-IDEX-0002), and EUR (IMCBio ANR-17-EUR-0023), under the framework of the French Investments Program for the Future.
- T.H.D.N. is supported by UKRI-Medical Research Council grant MC_UP_1201/19.
- C.N. and M.K. acknowledge funding from the National Science Centre, Poland [OPUS 2019/33/B/NZ2/02100]; S.P.P. acknowledges funding from the National Science Centre, Poland [OPUS 2020/39/B/NZ2/01301]; S.K. acknowledges funding from the National Science Centre, Poland [Sheng 2021/40/Q/NZ2/00078]; C.N. acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: PCSS, ACK Cyfronet AGH, CI TASK, WCSS) for providing computer facilities and support within the computational grants PLG/2022/016043, PLG/2022/015327 and PLG/2020/013424.
- AP is supported by an NSF-CAREER award CHE-2235785
- A.R. is supported by grants from the Natural Science Foundation of China (32325029, 32022039, 91940302, and 91640104), the National Key Research and Development Project of China (2021YFC2300300 and 2023YFC2604300).
- Marta Szachniuk are supported by the National Science Centre, Poland (2019/35/B/ST6/03074 to M.S.), the statutory funds of IBCH PAS and Poznan University of Technology.
- J.W. is supported by the Penn State College of Medicine’s Artificial Intelligence and Biomedical Informatics Program.
- J.Z. is supported by the Intramural Research Program of the NIH, the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) (ZIADK075136 to J.Z.), and an NIH Deputy Director for Intramural Research (DDIR) Challenge Award to J.Z.
Collapse
Affiliation(s)
- Fan Bu
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macao Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University, Guangzhou, China
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China
| | - Yagoub Adam
- Inter-institutional Graduate Program on Bioinformatics, Department of Computer Science and Mathematics, FFCLRP, University of São Paulo, Ribeirão Preto, Brazil
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Nigeria
| | - Ryszard W Adamiak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Maciej Antczak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Belisa Rebeca H de Aquino
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Nagendar Goud Badepally
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Robert T Batey
- Department of Biochemistry, University of Colorado at Boulder, Boulder, CO, USA
| | - Eugene F Baulin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Pawel Boinski
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Michal J Boniecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Kristy A Carpenter
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Jose Chacon
- Department of Biochemistry, Stanford University, Stanford, CA, USA
- Department of Cell and Developmental Biology, University of California San Diego, San Diego, CA, USA
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry and Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Wah Chiu
- Department of Bioengineering and James H. Clark Center, Stanford University, Stanford, CA, USA
| | - Pablo Cordero
- Department of Biochemistry, Stanford University, Stanford, CA, USA
- Stripe, South San Francisco, CA, USA
| | - Naba Krishna Das
- Department of Chemistry and Biochemistry, University of Maryland Baltimore County, Baltimore, MD, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
- Biophysics program, Stanford University, Stanford, CA, USA
| | - Wayne K Dawson
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, SC, USA
| | - Anne-Catherine Dock-Bregeon
- Laboratory of Integrative Biology of Marine Models (LBI2M), Sorbonne University-CNRS UMR8227, Roscoff, France
| | - Nikolay V Dokholyan
- Department of Pharmacology, Penn State College of Medicine, Hershey, PA, USA
| | - Ron O Dror
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Structural Biology, Stanford University, Stanford, CA, USA
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA, USA
| | - Stanisław Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Stephan Eismann
- Department of Applied Physics, Stanford University, Stanford, CA, USA
- Atomic AI, South San Francisco, CA, USA
| | - Eric Ennifar
- Architecture et Réactivité de l'ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France
| | - Reza Esmaeeli
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Masoud Amiri Farsani
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Adrian R Ferré-D'Amaré
- Laboratory of Nucleic Acids, National Heart, Lung and Blood Institute, Bethesda, MD, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University, Stanford, CA, USA
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - George E Ghanim
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Horacio V Guzman
- Instituto de Ciencia de Materials de Barcelona, ICMAB-CSIC, Bellaterra E-08193, Spain & Departamento de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, Madrid, Spain
| | - Iris V Hood
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, USA
| | - Lin Huang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University Guangzhou, Guangdong, China
| | - Dharm Skandh Jain
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Farhang Jaryani
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Lei Jin
- Department of Physics, Department of Biochemistry and Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Astha Joshi
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Masha Karelina
- Biophysics program, Stanford University, Stanford, CA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jeffrey S Kieft
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver School of Medicine, Aurora, CO, USA
- New York Structural Biology Center, New York, NY, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Sebastian Kmiecik
- Laboratory of Computational Biology, Biological and Chemical Research Center, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Deepak Koirala
- Department of Chemistry and Biochemistry, University of Maryland Baltimore County, Baltimore, MD, USA
| | - Markus Kollmann
- Department of Computer Science, Heinrich Heine University of Düsseldorf, Düsseldorf, Germany
| | | | - Mateusz Kurciński
- Laboratory of Computational Biology, Biological and Chemical Research Center, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Jun Li
- Department of Physics, Department of Biochemistry and Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Shuang Li
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, USA
| | - Marcin Magnus
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - BenoÎt Masquida
- UMR 7156, CNRS - Université de Strasbourg, IPCB, Strasbourg, France
| | - S Naeim Moafinejad
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Arup Mondal
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | | | - Grigory Nikolaev
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
- Laboratory of Computational Biology, Biological and Chemical Research Center, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Grace Nye
- Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Iswarya P N Pandaranadar Jeyeram
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Phillip Pham
- Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Joseph A Piccirilli
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
| | - Smita Priyadarshini Pilla
- Laboratory of Computational Biology, Biological and Chemical Research Center, University of Warsaw, Warsaw, Poland
| | - Radosław Pluta
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Simón Poblete
- Facultad de Ingeniería, Arquitectura y Diseño, Universidad San Sebastián, Santiago, Chile
- Centro BASAL Ciencia & Vida, Universidad San Sebastián, Santiago, Chile
| | - Almudena Ponce-Salvatierra
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Lukasz Popenda
- NanoBioMedical Centre, Adam Mickiewicz University, Poznan, Poland
| | - Fabrizio Pucci
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, Brussels, Belgium
| | - Ramya Rangan
- Biophysics program, Stanford University, Stanford, CA, USA
- Atomic AI, South San Francisco, CA, USA
| | - Angana Ray
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Aiming Ren
- Life Sciences Institute, Zhejiang University, Hangzhou, China
| | - Joanna Sarzynska
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Congzhou Mike Sha
- Department of Pharmacology, Penn State College of Medicine, Hershey, PA, USA
| | - Filip Stefaniak
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Zhaoming Su
- The State Key Laboratory of Biotherapy, West China Hospital, Chengdu, China
| | - Krishna C Suddala
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, USA
| | - Marta Szachniuk
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Raphael Townshend
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Atomic AI, South San Francisco, CA, USA
| | - Robert J Trachman
- Laboratory of Nucleic Acids, National Heart, Lung and Blood Institute, Bethesda, MD, USA
| | - Jian Wang
- Department of Pharmacology, Penn State College of Medicine, Hershey, PA, USA
| | - Wenkai Wang
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China
| | - Andrew Watkins
- Department of Biochemistry, Stanford University, Stanford, CA, USA
- Prescient Design, Genentech Research and Early Development, South San Francisco, CA, USA
| | - Tomasz K Wirecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Yi Xiao
- School of Physics and Key Laboratory of Molecular Biophysics of the Ministry of Education, Huazhong University of Science and Technology, Wuhan, China
| | - Peng Xiong
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China
- Department of Biomedical Engineering, Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou, China
| | - Yiduo Xiong
- School of Physics and Key Laboratory of Molecular Biophysics of the Ministry of Education, Huazhong University of Science and Technology, Wuhan, China
| | - Jianyi Yang
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China
| | - Joseph David Yesselman
- Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
- Department of Chemistry, University of Nebraska, Lincoln, NE, USA
| | - Jinwei Zhang
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, USA
| | - Yi Zhang
- School of Physics and Key Laboratory of Molecular Biophysics of the Ministry of Education, Huazhong University of Science and Technology, Wuhan, China
| | - Zhenzhen Zhang
- Department of Physics and Astronomy, Clemson University, Clemson, SC, USA
| | - Yuanzhe Zhou
- Department of Physics, Department of Biochemistry and Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Dong Zhang
- Department of Physics, Department of Biochemistry and Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Sicheng Zhang
- Department of Physics, Department of Biochemistry and Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Adriana Żyła
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France.
- Engineering Research Center of Clinical Functional Materials and Diagnosis & Treatment Devices of Zhejiang Province, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, China.
| | - Zhichao Miao
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macao Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University, Guangzhou, China.
- Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai, China.
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Genome Campus, Cambridge, UK.
| |
Collapse
|
2
|
Mukherjee S, Moafinejad SN, Badepally NG, Merdas K, Bujnicki JM. Advances in the field of RNA 3D structure prediction and modeling, with purely theoretical approaches, and with the use of experimental data. Structure 2024; 32:1860-1876. [PMID: 39321802 DOI: 10.1016/j.str.2024.08.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2024] [Revised: 08/08/2024] [Accepted: 08/22/2024] [Indexed: 09/27/2024]
Abstract
Recent advancements in RNA three-dimensional (3D) structure prediction have provided significant insights into RNA biology, highlighting the essential role of RNA in cellular functions and its therapeutic potential. This review summarizes the latest developments in computational methods, particularly the incorporation of artificial intelligence and machine learning, which have improved the efficiency and accuracy of RNA structure predictions. We also discuss the integration of new experimental data types, including cryoelectron microscopy (cryo-EM) techniques and high-throughput sequencing, which have transformed RNA structure modeling. The combination of experimental advances with computational methods represents a significant leap in RNA structure determination. We review the outcomes of RNA-Puzzles and critical assessment of structure prediction (CASP) challenges, which assess the state of the field and limitations of existing methods. Future perspectives are discussed, focusing on the impact of RNA 3D structure prediction on understanding RNA mechanisms and its implications for drug discovery and RNA-targeted therapies, opening new avenues in molecular biology.
Collapse
Affiliation(s)
- Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - S Naeim Moafinejad
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Nagendar Goud Badepally
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Katarzyna Merdas
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
| |
Collapse
|
3
|
Steffen FD, Cunha RA, Sigel RKO, Börner R. FRET-guided modeling of nucleic acids. Nucleic Acids Res 2024; 52:e59. [PMID: 38869063 PMCID: PMC11260485 DOI: 10.1093/nar/gkae496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 05/29/2024] [Indexed: 06/14/2024] Open
Abstract
The functional diversity of RNAs is encoded in their innate conformational heterogeneity. The combination of single-molecule spectroscopy and computational modeling offers new attractive opportunities to map structural transitions within nucleic acid ensembles. Here, we describe a framework to harmonize single-molecule Förster resonance energy transfer (FRET) measurements with molecular dynamics simulations and de novo structure prediction. Using either all-atom or implicit fluorophore modeling, we recreate FRET experiments in silico, visualize the underlying structural dynamics and quantify the reaction coordinates. Using multiple accessible-contact volumes as a post hoc scoring method for fragment assembly in Rosetta, we demonstrate that FRET can be used to filter a de novo RNA structure prediction ensemble by refuting models that are not compatible with in vitro FRET measurement. We benchmark our FRET-assisted modeling approach on double-labeled DNA strands and validate it against an intrinsically dynamic manganese(II)-binding riboswitch. We show that a FRET coordinate describing the assembly of a four-way junction allows our pipeline to recapitulate the global fold of the riboswitch displayed by the crystal structure. We conclude that computational fluorescence spectroscopy facilitates the interpretability of dynamic structural ensembles and improves the mechanistic understanding of nucleic acid interactions.
Collapse
Affiliation(s)
- Fabio D Steffen
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Richard A Cunha
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Roland K O Sigel
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Richard Börner
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| |
Collapse
|
4
|
Rinaldi S, Moroni E, Rozza R, Magistrato A. Frontiers and Challenges of Computing ncRNAs Biogenesis, Function and Modulation. J Chem Theory Comput 2024; 20:993-1018. [PMID: 38287883 DOI: 10.1021/acs.jctc.3c01239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]
Abstract
Non-coding RNAs (ncRNAs), generated from nonprotein coding DNA sequences, constitute 98-99% of the human genome. Non-coding RNAs encompass diverse functional classes, including microRNAs, small interfering RNAs, PIWI-interacting RNAs, small nuclear RNAs, small nucleolar RNAs, and long non-coding RNAs. With critical involvement in gene expression and regulation across various biological and physiopathological contexts, such as neuronal disorders, immune responses, cardiovascular diseases, and cancer, non-coding RNAs are emerging as disease biomarkers and therapeutic targets. In this review, after providing an overview of non-coding RNAs' role in cell homeostasis, we illustrate the potential and the challenges of state-of-the-art computational methods exploited to study non-coding RNAs biogenesis, function, and modulation. This can be done by directly targeting them with small molecules or by altering their expression by targeting the cellular engines underlying their biosynthesis. Drawing from applications, also taken from our work, we showcase the significance and role of computer simulations in uncovering fundamental facets of ncRNA mechanisms and modulation. This information may set the basis to advance gene modulation tools and therapeutic strategies to address unmet medical needs.
Collapse
Affiliation(s)
- Silvia Rinaldi
- National Research Council of Italy (CNR) - Institute of Chemistry of OrganoMetallic Compounds (ICCOM), c/o Area di Ricerca CNR di Firenze Via Madonna del Piano 10, 50019 Sesto Fiorentino, Florence, Italy
| | - Elisabetta Moroni
- National Research Council of Italy (CNR) - Institute of Chemical Sciences and Technologies (SCITEC), via Mario Bianco 9, 20131 Milano, Italy
| | - Riccardo Rozza
- National Research Council of Italy (CNR) - Institute of Material Foundry (IOM) c/o International School for Advanced Studies (SISSA), Via Bonomea, 265, 34136 Trieste, Italy
| | - Alessandra Magistrato
- National Research Council of Italy (CNR) - Institute of Material Foundry (IOM) c/o International School for Advanced Studies (SISSA), Via Bonomea, 265, 34136 Trieste, Italy
| |
Collapse
|
5
|
Kamga Youmbi FI, Kengne Tchendji V, Tayou Djamegni C. P-FARFAR2: A multithreaded greedy approach to sampling low-energy RNA structures in Rosetta FARFAR2. Comput Biol Chem 2023; 104:107878. [PMID: 37167861 DOI: 10.1016/j.compbiolchem.2023.107878] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 04/23/2023] [Accepted: 05/01/2023] [Indexed: 05/13/2023]
Abstract
RNA (ribonucleic acid) structure prediction finds many applications in health science and drug discovery due to its importance in several life regulatory processes. But despite significant advances in the close field of protein prediction, RNA 3D structure still poses a tremendous challenge to predict, especially for large sequences. In this regard, the approach unfolded by Rosetta FARFAR2 (Fragment Assembly of RNA with Full-Atom Refinement, version 2) has shown promising results, but the algorithm is non-deterministic by nature. In this paper, we develop P-FARFAR2: a parallel enhancement of FARFAR2 that increases its ability to assemble low-energy structures via multithreaded exploration of random configurations in a greedy manner. This strategy, appearing in the literature under the term "parallel mechanism", is made viable through two measures: first, the synchronization window is coarsened to several Monte Carlo cycles; second, all but one of the threads are differentiated as auxiliary and set to perform a weakened version of the problem. Following empirical analysis on a diverse range of RNA structures, we report achieving statistical significance in lowering the energy levels of ensuing samples. And consequently, despite the moderate-to-weak correlation between energy levels and prediction accuracy, this achievement happens to propagate to accuracy measurements.
Collapse
Affiliation(s)
| | - Vianney Kengne Tchendji
- Department of Mathematics and Computer Science, University of Dschang, PO Box 67, Dschang, Cameroon.
| | - Clémentin Tayou Djamegni
- Department of Mathematics and Computer Science, University of Dschang, PO Box 67, Dschang, Cameroon; Department of Computer Engineering, University of Dschang, PO Box 134, Bandjoun, Cameroon.
| |
Collapse
|
6
|
Bagnolini G, Luu TB, Hargrove AE. Recognizing the power of machine learning and other computational methods to accelerate progress in small molecule targeting of RNA. RNA (NEW YORK, N.Y.) 2023; 29:473-488. [PMID: 36693763 PMCID: PMC10019373 DOI: 10.1261/rna.079497.122] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
RNA structures regulate a wide range of processes in biology and disease, yet small molecule chemical probes or drugs that can modulate these functions are rare. Machine learning and other computational methods are well poised to fill gaps in knowledge and overcome the inherent challenges in RNA targeting, such as the dynamic nature of RNA and the difficulty of obtaining RNA high-resolution structures. Successful tools to date include principal component analysis, linear discriminate analysis, k-nearest neighbor, artificial neural networks, multiple linear regression, and many others. Employment of these tools has revealed critical factors for selective recognition in RNA:small molecule complexes, predictable differences in RNA- and protein-binding ligands, and quantitative structure activity relationships that allow the rational design of small molecules for a given RNA target. Herein we present our perspective on the value of using machine learning and other computation methods to advance RNA:small molecule targeting, including select examples and their validation as well as necessary and promising future directions that will be key to accelerate discoveries in this important field.
Collapse
Affiliation(s)
- Greta Bagnolini
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - TinTin B Luu
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Amanda E Hargrove
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
- Department of Biochemistry, Duke University School of Medicine, Durham, North Carolina 27710, USA
| |
Collapse
|
7
|
Watkins AM, Das R. RNA 3D Modeling with FARFAR2, Online. Methods Mol Biol 2023; 2586:233-249. [PMID: 36705908 DOI: 10.1007/978-1-0716-2768-6_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Understanding the three-dimensional structure of an RNA molecule is often essential to understanding its function. Sampling algorithms and energy functions for RNA structure prediction are improving, due to the increasing diversity of structural data available for training statistical potentials and testing structural data, along with a steady supply of blind challenges through the RNA-Puzzles initiative. The recent FARFAR2 algorithm enables near-native structure predictions on fairly complex RNA structures, including automated selection of final candidate models and estimation of model accuracy. Here, we describe the use of a publicly available webserver for RNA modeling for realistic scenarios using FARFAR2, available at https://rosie.rosettacommons.org/farfar2 . We walk through two cases in some detail: a simple model pseudoknot from the frameshifting element of beet western yellows virus modeled using the "basic interface" to the webserver and a replication of RNA-Puzzle 20, a metagenomic twister sister ribozyme, using the "advanced interface." We also describe example runs of FARFAR2 modeling including two kinds of experimental data: a c-di-GMP riboswitch modeled with low-resolution restraints from MOHCA-seq experiments and a tandem GA motif modeled with 1H NMR chemical shifts.
Collapse
Affiliation(s)
- Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
- Prescient Design, Genentech, South San Francisco, CA, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA.
- Biophysics Program, Stanford University, Stanford, CA, USA.
| |
Collapse
|
8
|
Kofman C, Watkins AM, Kim D, Willi JA, Wooldredge A, Karim A, Das R, Jewett MC. Computationally-guided design and selection of high performing ribosomal active site mutants. Nucleic Acids Res 2022; 50:13143-13154. [PMID: 36484094 PMCID: PMC9825160 DOI: 10.1093/nar/gkac1036] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 10/13/2022] [Accepted: 10/22/2022] [Indexed: 12/14/2022] Open
Abstract
Understanding how modifications to the ribosome affect function has implications for studying ribosome biogenesis, building minimal cells, and repurposing ribosomes for synthetic biology. However, efforts to design sequence-modified ribosomes have been limited because point mutations in the ribosomal RNA (rRNA), especially in the catalytic active site (peptidyl transferase center; PTC), are often functionally detrimental. Moreover, methods for directed evolution of rRNA are constrained by practical considerations (e.g. library size). Here, to address these limitations, we developed a computational rRNA design approach for screening guided libraries of mutant ribosomes. Our method includes in silico library design and selection using a Rosetta stepwise Monte Carlo method (SWM), library construction and in vitro testing of combined ribosomal assembly and translation activity, and functional characterization in vivo. As a model, we apply our method to making modified ribosomes with mutant PTCs. We engineer ribosomes with as many as 30 mutations in their PTCs, highlighting previously unidentified epistatic interactions, and show that SWM helps identify sequences with beneficial phenotypes as compared to random library sequences. We further demonstrate that some variants improve cell growth in vivo, relative to wild type ribosomes. We anticipate that SWM design and selection may serve as a powerful tool for rRNA engineering.
Collapse
Affiliation(s)
- Camila Kofman
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
- Prescient Design, Genentech, South San Francisco, CA 94080, USA
| | - Do Soon Kim
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
- Inceptive Nucleics, Inc., Palo Alto, CA 94304, USA
| | - Jessica A Willi
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
| | - Alexandra C Wooldredge
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
| | - Ashty S Karim
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
- Department of Physics, Stanford University, Stanford, CA 94305, USA
| | - Michael C Jewett
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
- Robert H. Lurie Comprehensive Cancer Center and Simpson Querrey Institute, Northwestern University, Chicago, IL 60611, USA
| |
Collapse
|
9
|
Lee J, Coronado JN, Cho N, Lim J, Hosford BM, Seo S, Kim DS, Kofman C, Moore JS, Ellington AD, Anslyn EV, Jewett MC. Ribosome-mediated biosynthesis of pyridazinone oligomers in vitro. Nat Commun 2022; 13:6322. [PMID: 36280685 PMCID: PMC9592601 DOI: 10.1038/s41467-022-33701-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 09/28/2022] [Indexed: 12/25/2022] Open
Abstract
The ribosome is a macromolecular machine that catalyzes the sequence-defined polymerization of L-α-amino acids into polypeptides. The catalysis of peptide bond formation between amino acid substrates is based on entropy trapping, wherein the adjacency of transfer RNA (tRNA)-coupled acyl bonds in the P-site and the α-amino groups in the A-site aligns the substrates for coupling. The plasticity of this catalytic mechanism has been observed in both remnants of the evolution of the genetic code and modern efforts to reprogram the genetic code (e.g., ribosomal incorporation of non-canonical amino acids, ribosomal ester formation). However, the limits of ribosome-mediated polymerization are underexplored. Here, rather than peptide bonds, we demonstrate ribosome-mediated polymerization of pyridazinone bonds via a cyclocondensation reaction between activated γ-keto and α-hydrazino ester monomers. In addition, we demonstrate the ribosome-catalyzed synthesis of peptide-hybrid oligomers composed of multiple sequence-defined alternating pyridazinone linkages. Our results highlight the plasticity of the ribosome's ancient bond-formation mechanism, expand the range of non-canonical polymeric backbones that can be synthesized by the ribosome, and open the door to new applications in synthetic biology.
Collapse
Affiliation(s)
- Joongoo Lee
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA.
- Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea.
| | - Jaime N Coronado
- Department of Chemistry, University of Texas at Austin, Austin, TX, 78712, USA
| | - Namjin Cho
- Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea
| | - Jongdoo Lim
- Department of Chemistry, University of Texas at Austin, Austin, TX, 78712, USA
| | - Brandon M Hosford
- Department of Chemistry, University of Texas at Austin, Austin, TX, 78712, USA
| | - Sangwon Seo
- Department of Chemistry, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
- Center for Catalytic Hydrocarbon Functionalizations, Institute for Basic Science (IBS), Daejeon, 34141, Republic of Korea
| | - Do Soon Kim
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA
| | - Camila Kofman
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA
| | - Jeffrey S Moore
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Andrew D Ellington
- Department of Chemistry and Biochemistry, Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX, 78712, USA
| | - Eric V Anslyn
- Department of Chemistry, University of Texas at Austin, Austin, TX, 78712, USA.
| | - Michael C Jewett
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA.
- Interdisplinary Biological Sciences Graduate Program, Evanston, IL, 60208, USA.
- Chemistry of Life Processes Institute, Evanston, IL, 60208, USA.
- Robert H. Lurie Comprehensive Cancer Center, Evanston, IL, 60208, USA.
- Simpson Querrey Institute, Evanston, IL, 60208, USA.
- Center for Synthetic Biology, Northwestern University and Biological Engineering, 2145 Sheridan Road, Evanston, IL, 60208, USA.
| |
Collapse
|
10
|
Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the workhorse of life. They are the building infrastructure of living systems; they are the most efficient molecular machines known, and their enzymatic activity is still unmatched in versatility by any artificial system. Perhaps proteins' most remarkable feature is their modularity. The large amount of information required to specify each protein's function is analogically encoded with an alphabet of just ∼20 letters. The protein folding problem is how to encode all such information in a sequence of 20 letters. In this review, we go through the last 30 years of research to summarize the state of the art and highlight some applications related to fundamental problems of protein evolution.
Collapse
Affiliation(s)
- Giulia Magi Meconi
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | - Ivan R Sasselli
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | | | - Jose N Onuchic
- Center for Theoretical Biological Physics, Department of Physics & Astronomy, Department of Chemistry, Department of Biosciences, Rice University, Houston, TX 77251, United States of America
| | - Ivan Coluzza
- BCMaterials, Basque Center for Materials, Applications and Nanostructures, Bld. Martina Casiano, UPV/EHU Science Park, Barrio Sarriena s/n, 48940 Leioa, Spain
- Basque Foundation for Science, Ikerbasque, 48009, Bilbao, Spain
| |
Collapse
|
11
|
Liu Z, Yang Y, Li D, Lv X, Chen X, Dai Q. Prediction of the RNA Tertiary Structure Based on a Random Sampling Strategy and Parallel Mechanism. Front Genet 2022; 12:813604. [PMID: 35069706 PMCID: PMC8769045 DOI: 10.3389/fgene.2021.813604] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 11/19/2021] [Indexed: 12/14/2022] Open
Abstract
Background: Macromolecule structure prediction remains a fundamental challenge of bioinformatics. Over the past several decades, the Rosetta framework has provided solutions to diverse challenges in computational biology. However, it is challenging to model RNA tertiary structures effectively when the de novo modeling of RNA involves solving a well-defined small puzzle. Methods: In this study, we introduce a stepwise Monte Carlo parallelization (SMCP) algorithm for RNA tertiary structure prediction. Millions of conformations were randomly searched using the Monte Carlo algorithm and stepwise ansatz hypothesis, and SMCP uses a parallel mechanism for efficient sampling. Moreover, to achieve better prediction accuracy and completeness, we judged and processed the modeling results. Results: A benchmark of nine single-stranded RNA loops drawn from riboswitches establishes the general ability of the algorithm to model RNA with high accuracy and integrity, including six motifs that cannot be solved by knowledge mining-based modeling algorithms. Experimental results show that the modeling accuracy of the SMCP algorithm is up to 0.14 Å, and the modeling integrity on this benchmark is extremely high. Conclusion: SMCP is an ab initio modeling algorithm that substantially outperforms previous algorithms in the Rosetta framework, especially in improving the accuracy and completeness of the model. It is expected that the work will provide new research ideas for macromolecular structure prediction in the future. In addition, this work will provide theoretical basis for the development of the biomedical field.
Collapse
Affiliation(s)
- Zhendong Liu
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Yurong Yang
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Dongyan Li
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Xinrong Lv
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Xi Chen
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Qionghai Dai
- Department of Automation, Tsinghua University, Beijing, China
| |
Collapse
|
12
|
Koehler Leman J, Lyskov S, Lewis SM, Adolf-Bryfogle J, Alford RF, Barlow K, Ben-Aharon Z, Farrell D, Fell J, Hansen WA, Harmalkar A, Jeliazkov J, Kuenze G, Krys JD, Ljubetič A, Loshbaugh AL, Maguire J, Moretti R, Mulligan VK, Nance ML, Nguyen PT, Ó Conchúir S, Roy Burman SS, Samanta R, Smith ST, Teets F, Tiemann JKS, Watkins A, Woods H, Yachnin BJ, Bahl CD, Bailey-Kellogg C, Baker D, Das R, DiMaio F, Khare SD, Kortemme T, Labonte JW, Lindorff-Larsen K, Meiler J, Schief W, Schueler-Furman O, Siegel JB, Stein A, Yarov-Yarovoy V, Kuhlman B, Leaver-Fay A, Gront D, Gray JJ, Bonneau R. Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks. Nat Commun 2021; 12:6947. [PMID: 34845212 PMCID: PMC8630030 DOI: 10.1038/s41467-021-27222-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 11/02/2021] [Indexed: 01/14/2023] Open
Abstract
Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Steven M Lewis
- Cyrus Biotechnology, 1201 Second Ave, Suite 900, Seattle, WA, 98101, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kyle Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Ziv Ben-Aharon
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Daniel Farrell
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Jason Fell
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - William A Hansen
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Jeliazko Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - Justyna D Krys
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Ajasja Ljubetič
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Amanda L Loshbaugh
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA
| | - Morgan L Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Phuong T Nguyen
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Shane Ó Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Rituparna Samanta
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Shannon T Smith
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Frank Teets
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Johanna K S Tiemann
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Hope Woods
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Brahm J Yachnin
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, MA, 02115, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, 02115, USA
| | | | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Sagar D Khare
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - William Schief
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Justin B Siegel
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - Amelie Stein
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Brian Kuhlman
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Andrew Leaver-Fay
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
- Department of Computer Science, New York University, New York, NY, 10003, USA.
| |
Collapse
|
13
|
De Bisschop G, Allouche D, Frezza E, Masquida B, Ponty Y, Will S, Sargueil B. Progress toward SHAPE Constrained Computational Prediction of Tertiary Interactions in RNA Structure. Noncoding RNA 2021; 7:71. [PMID: 34842779 PMCID: PMC8628965 DOI: 10.3390/ncrna7040071] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 10/29/2021] [Accepted: 11/02/2021] [Indexed: 01/04/2023] Open
Abstract
As more sequencing data accumulate and novel puzzling genetic regulations are discovered, the need for accurate automated modeling of RNA structure increases. RNA structure modeling from chemical probing experiments has made tremendous progress, however accurately predicting large RNA structures is still challenging for several reasons: RNA are inherently flexible and often adopt many energetically similar structures, which are not reliably distinguished by the available, incomplete thermodynamic model. Moreover, computationally, the problem is aggravated by the relevance of pseudoknots and non-canonical base pairs, which are hardly predicted efficiently. To identify nucleotides involved in pseudoknots and non-canonical interactions, we scrutinized the SHAPE reactivity of each nucleotide of the 188 nt long lariat-capping ribozyme under multiple conditions. Reactivities analyzed in the light of the X-ray structure were shown to report accurately the nucleotide status. Those that seemed paradoxical were rationalized by the nucleotide behavior along molecular dynamic simulations. We show that valuable information on intricate interactions can be deduced from probing with different reagents, and in the presence or absence of Mg2+. Furthermore, probing at increasing temperature was remarkably efficient at pointing to non-canonical interactions and pseudoknot pairings. The possibilities of following such strategies to inform structure modeling software are discussed.
Collapse
Affiliation(s)
- Grégoire De Bisschop
- Université de Paris, CNRS, UMR 8038/CiTCoM, F-75006 Paris, France; (G.D.B.); (D.A.); (E.F.)
- Institut de Recherches Cliniques de Montréal (IRCM), Montréal, QC H2W 1R7, Canada
| | - Delphine Allouche
- Université de Paris, CNRS, UMR 8038/CiTCoM, F-75006 Paris, France; (G.D.B.); (D.A.); (E.F.)
- Institut Necker-Enfants Malades (INEM), Inserm U1151, 156 rue de Vaugirard, CEDEX 15, 75015 Paris, France
| | - Elisa Frezza
- Université de Paris, CNRS, UMR 8038/CiTCoM, F-75006 Paris, France; (G.D.B.); (D.A.); (E.F.)
| | - Benoît Masquida
- Université de Strasbourg, CNRS UMR7156 GMGM, 67084 Strasbourg, France;
| | - Yann Ponty
- Ecole Polytechnique, CNRS UMR 7161, LIX, 91120 Palaiseau, France; (Y.P.); (S.W.)
| | - Sebastian Will
- Ecole Polytechnique, CNRS UMR 7161, LIX, 91120 Palaiseau, France; (Y.P.); (S.W.)
| | - Bruno Sargueil
- Université de Paris, CNRS, UMR 8038/CiTCoM, F-75006 Paris, France; (G.D.B.); (D.A.); (E.F.)
| |
Collapse
|
14
|
Zhang D, Chen SJ, Zhou R. Modeling Noncanonical RNA Base Pairs by a Coarse-Grained IsRNA2 Model. J Phys Chem B 2021; 125:11907-11915. [PMID: 34694128 DOI: 10.1021/acs.jpcb.1c07288] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Noncanonical base pairs contribute crucially to the three-dimensional architecture of large RNA molecules; however, how to accurately model them remains an open challenge in RNA 3D structure prediction. Here, we report a promising coarse-grained (CG) IsRNA2 model to predict noncanonical base pairs in large RNAs through molecular dynamics simulations. By introducing a five-bead per nucleotide CG representation to reserve the three interacting edges of nucleobases, IsRNA2 accurately models various base-pairing interactions, including both canonical and noncanonical base pairs. A benchmark test indicated that IsRNA2 achieves a comparable performance to the atomic model in de novo modeling of noncanonical RNA structures. In addition, IsRNA2 was able to refine the 3D structure predictions for large RNAs in RNA-puzzle challenges. Finally, the graphics processing unit acceleration was introduced to speed up the sampling efficiency in IsRNA2 for very large RNA molecules. Therefore, the CG IsRNA2 model reported here offers a reliable approach to predict the structures and dynamics of large RNAs.
Collapse
Affiliation(s)
- Dong Zhang
- College of Life Sciences and Institute of Quantitative Biology, Zhejiang University, Hangzhou 310058, China
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Ruhong Zhou
- College of Life Sciences and Institute of Quantitative Biology, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
15
|
Manigrasso J, Marcia M, De Vivo M. Computer-aided design of RNA-targeted small molecules: A growing need in drug discovery. Chem 2021. [DOI: 10.1016/j.chempr.2021.05.021] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
16
|
Mulligan VK. Current directions in combining simulation-based macromolecular modeling approaches with deep learning. Expert Opin Drug Discov 2021; 16:1025-1044. [PMID: 33993816 DOI: 10.1080/17460441.2021.1918097] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Introduction: Structure-guided drug discovery relies on accurate computational methods for modeling macromolecules. Simulations provide means of predicting macromolecular folds, of discovering function from structure, and of designing macromolecules to serve as drugs. Success rates are limited for any of these tasks, however. Recently, deep neural network-based methods have greatly enhanced the accuracy of predictions of protein structure from sequence, generating excitement about the potential impact of deep learning.Areas covered: This review introduces biologists to deep neural network architecture, surveys recent successes of deep learning in structure prediction, and discusses emerging deep learning-based approaches for structure-function analysis and design. Particular focus is given to the interplay between simulation-based and neural network-based approaches.Expert opinion: As deep learning grows integral to macromolecular modeling, simulation- and neural network-based approaches must grow more tightly interconnected. Modular software architecture must emerge allowing both types of tools to be combined with maximal versatility. Open sharing of code under permissive licenses will be essential. Although experiments will remain the gold standard for reliable information to guide drug discovery, we may soon see successful drug development projects based on high-accuracy predictions from algorithms that combine simulation with deep learning - the ultimate validation of this combination's power.
Collapse
|
17
|
Pairing a high-resolution statistical potential with a nucleobase-centric sampling algorithm for improving RNA model refinement. Nat Commun 2021; 12:2777. [PMID: 33986288 PMCID: PMC8119458 DOI: 10.1038/s41467-021-23100-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 04/13/2021] [Indexed: 12/04/2022] Open
Abstract
Refining modelled structures to approach experimental accuracy is one of the most challenging problems in molecular biology. Despite many years’ efforts, the progress in protein or RNA structure refinement has been slow because the global minimum given by the energy scores is not at the experimentally determined “native” structure. Here, we propose a fully knowledge-based energy function that captures the full orientation dependence of base–base, base–oxygen and oxygen–oxygen interactions with the RNA backbone modelled by rotameric states and internal energies. A total of 4000 quantum-mechanical calculations were performed to reweight base–base statistical potentials for minimizing possible effects of indirect interactions. The resulting BRiQ knowledge-based potential, equipped with a nucleobase-centric sampling algorithm, provides a robust improvement in refining near-native RNA models generated by a wide variety of modelling techniques. Predicting RNA structure from sequence is challenging due to the relative sparsity of experimentally-determined RNA 3D structures for model training. Here, the authors propose a way to incorporate knowledge on interactions at the atomic and base–base level to refine the prediction of RNA structures.
Collapse
|
18
|
Thavarajah W, Hertz LM, Bushhouse DZ, Archuleta CM, Lucks JB. RNA Engineering for Public Health: Innovations in RNA-Based Diagnostics and Therapeutics. Annu Rev Chem Biomol Eng 2021; 12:263-286. [PMID: 33900805 PMCID: PMC9714562 DOI: 10.1146/annurev-chembioeng-101420-014055] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
RNA is essential for cellular function: From sensing intra- and extracellular signals to controlling gene expression, RNA mediates a diverse and expansive list of molecular processes. A long-standing goal of synthetic biology has been to develop RNA engineering principles that can be used to harness and reprogram these RNA-mediated processes to engineer biological systems to solve pressing global challenges. Recent advances in the field of RNA engineering are bringing this to fruition, enabling the creation of RNA-based tools to combat some of the most urgent public health crises. Specifically, new diagnostics using engineered RNAs are able to detect both pathogens and chemicals while generating an easily detectable fluorescent signal as an indicator. New classes of vaccines and therapeutics are also using engineered RNAs to target a wide range of genetic and pathogenic diseases. Here, we discuss the recent breakthroughs in RNA engineering enabling these innovations and examine how advances in RNA design promise to accelerate the impact of engineered RNA systems.
Collapse
Affiliation(s)
- Walter Thavarajah
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; .,Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA
| | - Laura M Hertz
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
| | - David Z Bushhouse
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
| | - Chloé M Archuleta
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; .,Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; .,Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA.,Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA.,Center for Engineering Sustainability and Resilience, Northwestern University, Evanston, Illinois 60208, USA
| |
Collapse
|
19
|
Zakrevsky P, Calkins E, Kao YL, Singh G, Keleshian VL, Baudrey S, Jaeger L. In vitro selected GUAA tetraloop-binding receptors with structural plasticity and evolvability towards natural RNA structural modules. Nucleic Acids Res 2021; 49:2289-2305. [PMID: 33524109 PMCID: PMC7913685 DOI: 10.1093/nar/gkab021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/05/2021] [Accepted: 01/26/2021] [Indexed: 11/24/2022] Open
Abstract
GNRA tetraloop-binding receptor interactions are key components in the macromolecular assembly of a variety of functional RNAs. In nature, there is an apparent bias for GAAA/11nt receptor and GYRA/helix interactions, with the former interaction being thermodynamically more stable than the latter. While past in vitro selections allowed isolation of novel GGAA and GUGA receptors, we report herein an in vitro selection that revealed several novel classes of specific GUAA receptors with binding affinities comparable to those from natural GAAA/11nt interactions. These GUAA receptors have structural homology with double-locked bulge RNA modules naturally occurring in ribosomal RNAs. They display mutational robustness that enables exploration of the sequence/phenotypic space associated to GNRA/receptor interactions through epistasis. Their thermodynamic self-assembly fitness landscape is characterized by a rugged neutral network with possible evolutionary trajectories toward natural GNRA/receptor interactions. High throughput sequencing analysis revealed synergetic mutations located away from the tertiary interactions that positively contribute to assembly fitness. Our study suggests that the repertoire of GNRA/receptor interactions is much larger than initially thought from the analysis of natural stable RNA molecules and also provides clues for their evolution towards natural GNRA/receptors.
Collapse
Affiliation(s)
- Paul Zakrevsky
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Erin Calkins
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Yi-Ling Kao
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Gurkeerat Singh
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Vasken L Keleshian
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Stephanie Baudrey
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Luc Jaeger
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| |
Collapse
|
20
|
Zhang D, Li J, Chen SJ. IsRNA1: De Novo Prediction and Blind Screening of RNA 3D Structures. J Chem Theory Comput 2021; 17:1842-1857. [PMID: 33560836 DOI: 10.1021/acs.jctc.0c01148] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Modeling structures and functions of large ribonucleic acid (RNAs) especially with complicated topologies is highly challenging due to the inefficiency of large conformational sampling and the presence of complicated tertiary interactions. To address this problem, one highly promising approach is coarse-grained modeling. Here, following an iterative simulated reference state approach to decipher the correlations between different structural parameters, we developed a potent coarse-grained RNA model named as IsRNA1 for RNA studies. Molecular dynamics simulations in the IsRNA1 can predict the native structures of small RNAs from a sequence and fold medium-sized RNAs into near-native tertiary structures with the assistance of secondary structure constraints. A large-scale benchmark test on RNA 3D structure prediction shows that IsRNA1 exhibits improved performance for relatively large RNAs of complicated topologies, such as large stem-loop structures and structures containing long-range tertiary interactions. The advantages of IsRNA1 include the consideration of the correlations between the different structural variables, the appropriate characterization of canonical base-pairing and base-stacking interactions, and the better sampling for the backbone conformations. Moreover, a blind screening protocol was developed based on IsRNA1 to identify good structural models from a pool of candidates without prior knowledge of the native structures.
Collapse
Affiliation(s)
- Dong Zhang
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Jun Li
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| |
Collapse
|
21
|
Watkins AM, Rangan R, Das R. FARFAR2: Improved De Novo Rosetta Prediction of Complex Global RNA Folds. Structure 2020; 28:963-976.e6. [PMID: 32531203 PMCID: PMC7415647 DOI: 10.1016/j.str.2020.05.011] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 04/27/2020] [Accepted: 05/20/2020] [Indexed: 01/01/2023]
Abstract
Predicting RNA three-dimensional structures from sequence could accelerate understanding of the growing number of RNA molecules being discovered across biology. Rosetta's Fragment Assembly of RNA with Full-Atom Refinement (FARFAR) has shown promise in community-wide blind RNA-Puzzle trials, but lack of a systematic and automated benchmark has left unclear what limits FARFAR performance. Here, we benchmark FARFAR2, an algorithm integrating RNA-Puzzle-inspired innovations with updated fragment libraries and helix modeling. In 16 of 21 RNA-Puzzles revisited without experimental data or expert intervention, FARFAR2 recovers native-like structures more accurate than models submitted during the RNA-Puzzles trials. Remaining bottlenecks include conformational sampling for >80-nucleotide problems and scoring function limitations more generally. Supporting these conclusions, preregistered blind models for adenovirus VA-I RNA and five riboswitch complexes predicted native-like folds with 3- to 14 Å root-mean-square deviation accuracies. We present a FARFAR2 webserver and three large model archives (FARFAR2-Classics, FARFAR2-Motifs, and FARFAR2-Puzzles) to guide future applications and advances.
Collapse
Affiliation(s)
- Andrew Martin Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Ramya Rangan
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA; Biophysics Program, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
22
|
Leman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, et alLeman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, Stein A, Szegedy M, Teets FD, Thyme SB, Wang RYR, Watkins A, Zimmerman L, Bonneau R. Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat Methods 2020; 17:665-680. [PMID: 32483333 PMCID: PMC7603796 DOI: 10.1038/s41592-020-0848-2] [Show More Authors] [Citation(s) in RCA: 494] [Impact Index Per Article: 98.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 04/22/2020] [Indexed: 12/12/2022]
Abstract
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
| | - Brian D Weitzner
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Steven M Lewis
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Biochemistry, Duke University, Durham, NC, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Nawsad Alam
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Melanie Aprahamian
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Kyle A Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
| | - Patrick Barth
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Benjamin Basanta
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Biological Physics Structure and Design PhD Program, University of Washington, Seattle, WA, USA
| | - Brian J Bender
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Kristin Blacklock
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Jaume Bonet
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Scott E Boyken
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Phil Bradley
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Chris Bystroff
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, USA
| | - Patrick Conway
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Seth Cooper
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bruno E Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lorna Dsilva
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Roland Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Alexander S Ford
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Brandon Frenz
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Darwin Y Fu
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Sharon Guffy
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott Horowitz
- Department of Chemistry & Biochemistry, University of Denver, Denver, CO, USA
- The Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
| | - Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Thomas Huber
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Tim M Jacobs
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - David K Johnson
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - John Karanicolas
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Hamed Khakzad
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
| | - Karen R Khar
- Cyrus Biotechnology, Seattle, WA, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Sagar D Khare
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Firas Khatib
- Department of Computer and Information Science, University of Massachusetts Dartmouth, Dartmouth, MA, USA
| | - Alisa Khramushin
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Indigo C King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Robert Kleffner
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Brian Koepnick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Daisuke Kuroda
- Medical Device Development and Regulation Research Center, School of Engineering, University of Tokyo, Tokyo, Japan
- Department of Bioengineering, School of Engineering, University of Tokyo, Tokyo, Japan
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Chemistry, Franklin & Marshall College, Lancaster, PA, USA
| | - Jason K Lai
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Gideon Lapidoth
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Andrew Leaver-Fay
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - Thomas Linsky
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Nir London
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Joseph H Lubin
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Lars Malmström
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Enrique Marcos
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Research in Biomedicine Barcelona, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Orly Marcu
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nicholas A Marze
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Departments of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
- Institute for Chemical Biology, Vanderbilt University, Nashville, TN, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Santrupti Nerli
- Department of Computer Science, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Christoffer Norn
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Shane Ó'Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Noah Ollikainen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Michael S Pacella
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Hahnbeom Park
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ryan E Pavlovicz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Manasi Pethe
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Kala Bharath Pilla
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Barak Raveh
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - P Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Aliza Rubenstein
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Marion F Sauer
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Andreas Scheck
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yuval Sedan
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Alexander M Sevy
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Lei Shi
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justin B Siegel
- Department of Chemistry, University of California, Davis, Davis, CA, USA
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, California, USA
- Genome Center, University of California, Davis, Davis, CA, USA
| | | | - Shannon Smith
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Yifan Song
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Amelie Stein
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Maria Szegedy
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Frank D Teets
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ray Yu-Ruei Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Lior Zimmerman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
- Department of Computer Science, New York University, New York, NY, USA.
- Center for Data Science, New York University, New York, NY, USA.
| |
Collapse
|
23
|
Magnus M, Antczak M, Zok T, Wiedemann J, Lukasiak P, Cao Y, Bujnicki JM, Westhof E, Szachniuk M, Miao Z. RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools. Nucleic Acids Res 2020; 48:576-588. [PMID: 31799609 PMCID: PMC7145511 DOI: 10.1093/nar/gkz1108] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 11/06/2019] [Accepted: 11/15/2019] [Indexed: 12/12/2022] Open
Abstract
Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods during the succeeding challenges of RNA-Puzzles, a community-wide effort on the assessment of blind prediction of RNA tertiary structures. The RNA-Puzzles contest has shown, among others, that the development and validation of computational methods for RNA fold prediction strongly depend on the benchmark datasets and the structure comparison algorithms. Yet, there has been no systematic benchmark set or decoy structures available for the 3D structure prediction of RNA, hindering the standardization of comparative tests in the modeling of RNA structure. Furthermore, there has not been a unified set of tools that allows deep and complete RNA structure analysis, and at the same time, that is easy to use. Here, we present RNA-Puzzles toolkit, a computational resource including (i) decoy sets generated by different RNA 3D structure prediction methods (raw, for-evaluation and standardized datasets), (ii) 3D structure normalization, analysis, manipulation, visualization tools (RNA_format, RNA_normalizer, rna-tools) and (iii) 3D structure comparison metric tools (RNAQUA, MCQ4Structures). This resource provides a full list of computational tools as well as a standard RNA 3D structure prediction assessment protocol for the community.
Collapse
Affiliation(s)
- Marcin Magnus
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- ReMedy-International Research Agenda Unit, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Maciej Antczak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
| | - Jakub Wiedemann
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Piotr Lukasiak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, PR China
| | - Janusz M Bujnicki
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 12 allée Konrad Roentgen, 67084 Strasbourg, France
| | - Marta Szachniuk
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence and Department of Anesthesiology, Shanghai Fourth People's Hospital Affiliated to Tongji University School of Medicine, Shanghai 200081, China
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
24
|
Huang L, Wang J, Watkins AM, Das R, Lilley DMJ. Structure and ligand binding of the glutamine-II riboswitch. Nucleic Acids Res 2019; 47:7666-7675. [PMID: 31216023 PMCID: PMC6698751 DOI: 10.1093/nar/gkz539] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 05/31/2019] [Accepted: 06/06/2019] [Indexed: 12/14/2022] Open
Abstract
We have determined the structure of the glutamine-II riboswitch ligand binding domain using X-ray crystallography. The structure was solved using a novel combination of homology modeling and molecular replacement. The structure comprises three coaxial helical domains, the central one of which is a pseudoknot with partial triplex character. The major groove of this helix provides the binding site for L-glutamine, which is extensively hydrogen bonded to the RNA. Atomic mutation of the RNA at the ligand binding site leads to loss of binding shown by isothermal titration calorimetry, explaining the specificity of the riboswitch. A metal ion also plays an important role in ligand binding. This is directly bonded to a glutamine carboxylate oxygen atom, and its remaining inner-sphere water molecules make hydrogen bonding interactions with the RNA.
Collapse
Affiliation(s)
- Lin Huang
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dow Street, Dundee DD1 5EH, UK
| | - Jia Wang
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dow Street, Dundee DD1 5EH, UK
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - David M J Lilley
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dow Street, Dundee DD1 5EH, UK
| |
Collapse
|
25
|
Calkins ER, Zakrevsky P, Keleshian VL, Aguilar EG, Geary C, Jaeger L. Deducing putative ancestral forms of GNRA/receptor interactions from the ribosome. Nucleic Acids Res 2019; 47:480-494. [PMID: 30418638 PMCID: PMC6326782 DOI: 10.1093/nar/gky1111] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Accepted: 10/22/2018] [Indexed: 01/02/2023] Open
Abstract
Stable RNAs rely on a vast repertoire of long-range interactions to assist in the folding of complex cellular machineries such as the ribosome. The universally conserved L39/H89 interaction is a long-range GNRA-like/receptor interaction localized in proximity to the peptidyl transferase center of the large subunit of the ribosome. Because of its central location, L39/H89 likely originated at an early evolutionary stage of the ribosome and played a significant role in its early function. However, L39/H89 self-assembly is impaired outside the ribosomal context. Herein, we demonstrate that structural modularity principles can be used to re-engineer L39/H89 to self-assemble in vitro. The new versions of L39/H89 improve affinity and loop selectivity by several orders of magnitude and retain the structural and functional features of their natural counterparts. These versions of L39/H89 are proposed to be ancestral forms of L39/H89 that were capable of assembling and folding independently from proteins and post-transcriptional modifications. This work demonstrates that novel RNA modules can be rationally designed by taking advantage of the modular syntax of RNA. It offers the prospect of creating new biochemical models of the ancestral ribosome and increases the tool kit for RNA nanotechnology and synthetic biology.
Collapse
Affiliation(s)
- Erin R Calkins
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Paul Zakrevsky
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Vasken L Keleshian
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Eduardo G Aguilar
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Cody Geary
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| | - Luc Jaeger
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106-9510, USA
| |
Collapse
|
26
|
Koirala D, Shao Y, Koldobskaya Y, Fuller JR, Watkins AM, Shelke SA, Pilipenko EV, Das R, Rice PA, Piccirilli JA. A conserved RNA structural motif for organizing topology within picornaviral internal ribosome entry sites. Nat Commun 2019; 10:3629. [PMID: 31399592 PMCID: PMC6689051 DOI: 10.1038/s41467-019-11585-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Accepted: 07/09/2019] [Indexed: 12/30/2022] Open
Abstract
Picornaviral IRES elements are essential for initiating the cap-independent viral translation. However, three-dimensional structures of these elements remain elusive. Here, we report a 2.84-Å resolution crystal structure of hepatitis A virus IRES domain V (dV) in complex with a synthetic antibody fragment-a crystallization chaperone. The RNA adopts a three-way junction structure, topologically organized by an adenine-rich stem-loop motif. Despite no obvious sequence homology, the dV architecture shows a striking similarity to a circularly permuted form of encephalomyocarditis virus J-K domain, suggesting a conserved strategy for organizing the domain architecture. Recurrence of the motif led us to use homology modeling tools to compute a 3-dimensional structure of the corresponding domain of foot-and-mouth disease virus, revealing an analogous domain organizing motif. The topological conservation observed among these IRESs and other viral domains implicates a structured three-way junction as an architectural scaffold to pre-organize helical domains for recruiting the translation initiation machinery.
Collapse
Affiliation(s)
- Deepak Koirala
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Yaming Shao
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Yelena Koldobskaya
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - James R Fuller
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Sandip A Shelke
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Evgeny V Pilipenko
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Phoebe A Rice
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Joseph A Piccirilli
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA.
- Department of Chemistry, The University of Chicago, Chicago, IL, 60637, USA.
| |
Collapse
|
27
|
Sequence-dependent RNA helix conformational preferences predictably impact tertiary structure formation. Proc Natl Acad Sci U S A 2019; 116:16847-16855. [PMID: 31375637 DOI: 10.1073/pnas.1901530116] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Structured RNAs and RNA complexes underlie biological processes ranging from control of gene expression to protein translation. Approximately 50% of nucleotides within known structured RNAs are folded into Watson-Crick (WC) base pairs, and sequence changes that preserve these pairs are typically assumed to preserve higher-order RNA structure and binding of macromolecule partners. Here, we report that indirect effects of the helix sequence on RNA tertiary stability are, in fact, significant but are nevertheless predictable from a simple computational model called RNAMake-∆∆G. When tested through the RNA on a massively parallel array (RNA-MaP) experimental platform, blind predictions for >1500 variants of the tectoRNA heterodimer model system achieve high accuracy (rmsd 0.34 and 0.77 kcal/mol for sequence and length changes, respectively). Detailed comparison of predictions to experiments support a microscopic picture of how helix sequence changes subtly modulate conformational fluctuations at each base-pair step, which accumulate to impact RNA tertiary structure stability. Our study reveals a previously overlooked phenomenon in RNA structure formation and provides a framework of computation and experiment for understanding helix conformational preferences and their impact across biological RNA and RNA-protein assemblies.
Collapse
|
28
|
Abstract
The three-dimensional structures of RNA molecules provide rich and often critical information for understanding their functions, including how they recognize small molecule and protein partners. Computational modeling of RNA 3D structure is becoming increasingly accurate, particularly with the availability of growing numbers of template structures already solved experimentally and the development of sequence alignment and 3D modeling tools to take advantage of this database. For several recent "RNA puzzle" blind modeling challenges, we have successfully identified useful template structures and achieved accurate structure predictions through homology modeling tools developed in the Rosetta software suite. We describe our semi-automated methodology here and walk through two illustrative examples: an adenine riboswitch aptamer, modeled from a template guanine riboswitch structure, and a SAM I/IV riboswitch aptamer, modeled from a template SAM I riboswitch structure.
Collapse
Affiliation(s)
- Andrew M Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, United States
| | - Ramya Rangan
- Biophysics Program, Stanford University, Stanford, CA, United States
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, United States; Biophysics Program, Stanford University, Stanford, CA, United States.
| |
Collapse
|
29
|
Bendixsen DP, Collet J, Østman B, Hayden EJ. Genotype network intersections promote evolutionary innovation. PLoS Biol 2019; 17:e3000300. [PMID: 31136568 PMCID: PMC6555535 DOI: 10.1371/journal.pbio.3000300] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Revised: 06/07/2019] [Accepted: 05/15/2019] [Indexed: 12/27/2022] Open
Abstract
Evolutionary innovations are qualitatively novel traits that emerge through evolution and increase biodiversity. The genetic mechanisms of innovation remain poorly understood. A systems view of innovation requires the analysis of genotype networks—the vast networks of genetic variants that produce the same phenotype. Innovations can occur at the intersection of two different genotype networks. However, the experimental characterization of genotype networks has been hindered by the vast number of genetic variants that need to be functionally analyzed. Here, we use high-throughput sequencing to study the fitness landscape at the intersection of the genotype networks of two catalytic RNA molecules (ribozymes). We determined the ability of numerous neighboring RNA sequences to catalyze two different chemical reactions, and we use these data as a proxy for a genotype to fitness map where two functions come in close proximity. We find extensive functional overlap, and numerous genotypes can catalyze both functions. We demonstrate through evolutionary simulations that these numerous points of intersection facilitate the discovery of a new function. However, the rate of adaptation of the new function depends upon the local ruggedness around the starting location in the genotype network. As a consequence, one direction of adaptation is more rapid than the other. We find that periods of neutral evolution increase rates of adaptation to the new function by allowing populations to spread out in their genotype network. Our study reveals the properties of a fitness landscape where genotype networks intersect and the consequences for evolutionary innovations. Our results suggest that historic innovations in natural systems may have been facilitated by overlapping genotype networks. The determination of the empirical fitness landscape at the genotypic intersection between two different catalytic RNA (ribozyme) functions reveals details about how novel traits can emerge through evolutionary innovation.
Collapse
Affiliation(s)
- Devin P. Bendixsen
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, Idaho, United States of America
- * E-mail: (DPB); (EJH)
| | - James Collet
- Department of Biological Science, Boise State University, Boise, Idaho, United States of America
| | - Bjørn Østman
- Keck Graduate Institute, Claremont, California, United States of America
| | - Eric J. Hayden
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, Idaho, United States of America
- Department of Biological Science, Boise State University, Boise, Idaho, United States of America
- * E-mail: (DPB); (EJH)
| |
Collapse
|
30
|
Olson WK, Li S, Kaukonen T, Colasanti AV, Xin Y, Lu XJ. Effects of Noncanonical Base Pairing on RNA Folding: Structural Context and Spatial Arrangements of G·A Pairs. Biochemistry 2019; 58:2474-2487. [PMID: 31008589 PMCID: PMC6729125 DOI: 10.1021/acs.biochem.9b00122] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Noncanonical base pairs play important roles in assembling the three-dimensional structures critical to the diverse functions of RNA. These associations contribute to the looped segments that intersperse the canonical double-helical elements within folded, globular RNA molecules. They stitch together various structural elements, serve as recognition elements for other molecules, and act as sites of intrinsic stiffness or deformability. This work takes advantage of new software (DSSR) designed to streamline the analysis and annotation of RNA three-dimensional structures. The multiscale structural information gathered for individual molecules, combined with the growing number of unique, well-resolved RNA structures, makes it possible to examine the collective features deeply and to uncover previously unrecognized patterns of chain organization. Here we focus on a subset of noncanonical base pairs involving guanine and adenine and the links between their modes of association, secondary structural context, and contributions to tertiary folding. The rigorous descriptions of base-pair geometry that we employ facilitate characterization of recurrent geometric motifs and the structural settings in which these arrangements occur. Moreover, the numerical parameters hint at the natural motions of the interacting bases and the pathways likely to connect different spatial forms. We draw attention to higher-order multiplexes involving two or more G·A pairs and the roles these associations appear to play in bridging different secondary structural units. The collective data reveal pairing propensities in base organization, secondary structural context, and deformability and serve as a starting point for further multiscale investigations and/or simulations of RNA folding.
Collapse
Affiliation(s)
- Wilma K. Olson
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Shuxiang Li
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Thomas Kaukonen
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Andrew V. Colasanti
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Yurong Xin
- Department of Chemistry & Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Xiang-Jun Lu
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| |
Collapse
|
31
|
Denny SK, Bisaria N, Yesselman JD, Das R, Herschlag D, Greenleaf WJ. High-Throughput Investigation of Diverse Junction Elements in RNA Tertiary Folding. Cell 2018; 174:377-390.e20. [PMID: 29961580 PMCID: PMC6053692 DOI: 10.1016/j.cell.2018.05.038] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 03/07/2018] [Accepted: 05/15/2018] [Indexed: 12/21/2022]
Abstract
RNAs fold into defined tertiary structures to function in critical biological processes. While quantitative models can predict RNA secondary structure stability, we are still unable to predict the thermodynamic stability of RNA tertiary structure. Here, we probe conformational preferences of diverse RNA two-way junctions to develop a predictive model for the formation of RNA tertiary structure. We quantitatively measured tertiary assembly energetics of >1,000 of RNA junctions inserted in multiple structural scaffolds to generate a "thermodynamic fingerprint" for each junction. Thermodynamic fingerprints enabled comparison of junction conformational preferences, revealing principles for how sequence influences 3-dimensional conformations. Utilizing fingerprints of junctions with known crystal structures, we generated ensembles for related junctions that predicted their thermodynamic effects on assembly formation. This work reveals sequence-structure-energetic relationships in RNA, demonstrates the capacity for diverse compensation strategies within tertiary structures, and provides a path to quantitative modeling of RNA folding energetics based on "ensemble modularity."
Collapse
Affiliation(s)
| | - Namita Bisaria
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Joseph David Yesselman
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Applied Physics, Stanford University, Stanford, CA 94305, USA
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Chemistry, Stanford University, Stanford, CA 94305, USA; Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA; ChEM-H Institute, Stanford University, Stanford, CA 94305, USA.
| | - William James Greenleaf
- Program in Biophysics, Stanford University, Stanford, CA 94305, USA; Department of Applied Physics, Stanford University, Stanford, CA 94305, USA; Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| |
Collapse
|