1
|
Lee S, Yan S, Dey A, Laederach A, Schlick T. A Cascade of Conformational Switches in SARS-CoV-2 Frameshifting: Coregulation by Upstream and Downstream Elements. Biochemistry 2025; 64:953-966. [PMID: 39907285 PMCID: PMC11840926 DOI: 10.1021/acs.biochem.4c00641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 01/13/2025] [Accepted: 01/17/2025] [Indexed: 02/06/2025]
Abstract
Targeting ribosomal frameshifting has emerged as a potential therapeutic intervention strategy against COVID-19. In this process, a -1 shift in the ribosomal reading frame encodes alternative viral proteins. Any interference with this process profoundly affects viral replication and propagation. For SARS-CoV-2, two RNA sites associated with ribosomal frameshifting are positioned on the 5' and 3' of the frameshifting residues. Although much attention has been focused on the 3' frameshift element (FSE), the 5' stem-loop (attenuator hairpin, AH) can play a role. Yet the relationship between the two regions is unknown. In addition, multiple folds of the FSE and FSE-containing RNA regions have been discovered. To gain more insight into these RNA folds in the larger sequence context that includes AH, we apply our graph-theory-based modeling tools to represent RNA secondary structures, "RAG" (RNA-As-Graphs), to generate conformational landscapes that suggest length-dependent conformational distributions. We show that the AH region can coexist as a stem-loop with main and alternative 3-stem pseudoknots of the FSE (dual graphs 3_6 and 3_3 in our notation) but that an alternative stem 1 (AS1) can disrupt the FSE pseudoknots and trigger other folds. A critical length for AS1 of 10-bp regulates key folding transitions. Together with designed mutants and available experimental data, we present a sequential view of length-dependent folds during frameshifting and suggest their mechanistic roles. These structural and mutational insights into both ends of the FSE advance our understanding of the SARS-CoV-2 frameshifting mechanism by suggesting how alternative folds play a role in frameshifting and defining potential therapeutic intervention techniques that target specific folds.
Collapse
Affiliation(s)
- Samuel Lee
- Department
of Chemistry, New York University, New York, New York 10003, United States
| | - Shuting Yan
- Department
of Chemistry, New York University, New York, New York 10003, United States
| | - Abhishek Dey
- Department
of Biotechnology, National Institute of
Pharmaceutical Education and Research-Raebareli (NIPER-R), Lucknow, Uttar Pradesh 226002, India
| | - Alain Laederach
- Department
of Biology, University of North Carolina
at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Tamar Schlick
- Department
of Chemistry, New York University, New York, New York 10003, United States
- Courant
Institute of Mathematical Sciences, New
York University, New York, New York 10012, United States
- NYU-ECNU
Center for Computational Chemistry, NYU
Shanghai, Shanghai 200062, PR China
- NYU Simons
Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
| |
Collapse
|
2
|
Lee S, Yan S, Dey A, Laederach A, Schlick T. An intricate balancing act: Upstream and downstream frameshift co-regulatory elements. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.599960. [PMID: 38979256 PMCID: PMC11230384 DOI: 10.1101/2024.06.27.599960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Targeting ribosomal frameshifting has emerged as a potential therapeutic intervention strategy against Covid-19. During ribosomal translation, a fraction of elongating ribosomes slips by one base in the 5' direction and enters a new reading frame for viral protein synthesis. Any interference with this process profoundly affects viral replication and propagation. For Covid-19, two RNA sites associated with ribosomal frameshifting for SARS-CoV-2 are positioned on the 5' and 3' of the frameshifting residues. Although much attention has been on the 3' frameshift element (FSE), the 5' stem-loop (attenuator hairpin, AH) can play a role. The formation of AH has been suggested to occur as refolding of the 3' RNA structure is triggered by ribosomal unwinding. However, the attenuation activity and the relationship between the two regions are unknown. To gain more insight into these two related viral RNAs and to further enrich our understanding of ribosomal frameshifting for SARS-CoV-2, we explore the RNA folding of both 5' and 3' regions associated with frameshifting. Using our graph-theory-based modeling tools to represent RNA secondary structures, "RAG" (RNA- As-Graphs), and conformational landscapes to analyze length-dependent conformational distributions, we show that AH coexists with the 3-stem pseudoknot of the 3' FSE (graph 3_6 in our dual graph notation) and alternative pseudoknot (graph 3_3) but less likely with other 3' FSE alternative folds (such as 3-way junction 3_5). This is because an alternative length-dependent Stem 1 (AS1) can disrupt the FSE pseudoknots and trigger other folds. In addition, we design four mutants for long lengths that stabilize or disrupt AH, AS1 or FSE pseudoknot to illustrate the deduced AH/AS1 roles and favor the 3_5, 3_6 or stem-loop. These mutants further show how a strengthened pseudoknot can result from a weakened AS1, while a dominant stem-loop occurs with a strengthened AS1. These structural and mutational insights into both ends of the FSE in SARS-CoV-2 advance our understanding of the SARS-CoV-2 frameshifting mechanism by suggesting a sequence of length-dependent folds, which in turn define potential therapeutic intervention techniques involving both elements. Our work also highlights the complexity of viral landscapes with length-dependent folds, and challenges in analyzing these multiple conformations.
Collapse
Affiliation(s)
- Samuel Lee
- Department of Chemistry, New York University, New York, 10003, NY, U.S.A
| | - Shuting Yan
- Department of Chemistry, New York University, New York, 10003, NY, U.S.A
| | - Abhishek Dey
- Department of Biotechnology, National Institute of Pharmaceutical Education and Research-Raebareli (NIPER-R), Lucknow, 226002, Uttar Pradesh, India
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, 27599, NC, U.S.A
| | - Tamar Schlick
- Department of Chemistry, New York University, New York, 10003, NY, U.S.A
- Courant Institute of Mathematical Sciences, New York University, New York, 10012, NY, U.S.A
- NYU-ECNU Center for Computational Chemistry, NYU Shanghai, Shanghai, 200062, P.R.China
- NYU Simons Center for Computational Physical Chemistry, New York University, New York, 10003, NY, U.S.A
| |
Collapse
|
3
|
RNA-As-Graphs Motif Atlas—Dual Graph Library of RNA Modules and Viral Frameshifting-Element Applications. Int J Mol Sci 2022; 23:ijms23169249. [PMID: 36012512 PMCID: PMC9408923 DOI: 10.3390/ijms23169249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 08/13/2022] [Accepted: 08/14/2022] [Indexed: 11/25/2022] Open
Abstract
RNA motif classification is important for understanding structure/function connections and building phylogenetic relationships. Using our coarse-grained RNA-As-Graphs (RAG) representations, we identify recurrent dual graph motifs in experimentally solved RNA structures based on an improved search algorithm that finds and ranks independent RNA substructures. Our expanded list of 183 existing dual graph motifs reveals five common motifs found in transfer RNA, riboswitch, and ribosomal 5S RNA components. Moreover, we identify three motifs for available viral frameshifting RNA elements, suggesting a correlation between viral structural complexity and frameshifting efficiency. We further partition the RNA substructures into 1844 distinct submotifs, with pseudoknots and junctions retained intact. Common modules are internal loops and three-way junctions, and three submotifs are associated with riboswitches that bind nucleotides, ions, and signaling molecules. Together, our library of existing RNA motifs and submotifs adds to the growing universe of RNA modules, and provides a resource of structures and substructures for novel RNA design.
Collapse
|
4
|
Length-dependent motions of SARS-CoV-2 frameshifting RNA pseudoknot and alternative conformations suggest avenues for frameshifting suppression. Nat Commun 2022; 13:4284. [PMID: 35879278 PMCID: PMC9310368 DOI: 10.1038/s41467-022-31353-w] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 06/10/2022] [Indexed: 12/16/2022] Open
Abstract
The SARS-CoV-2 frameshifting element (FSE), a highly conserved mRNA region required for correct translation of viral polyproteins, defines an excellent therapeutic target against Covid-19. As discovered by our prior graph-theory analysis with SHAPE experiments, the FSE adopts a heterogeneous, length-dependent conformational landscape consisting of an assumed 3-stem H-type pseudoknot (graph motif 3_6), and two alternative motifs (3_3 and 3_5). Here, for the first time, we build and simulate, by microsecond molecular dynamics, 30 models for all three motifs plus motif-stabilizing mutants at different lengths. Our 3_6 pseudoknot systems, which agree with experimental structures, reveal interconvertible L and linear conformations likely related to ribosomal pausing and frameshifting. The 3_6 mutant inhibits this transformation and could hamper frameshifting. Our 3_3 systems exhibit length-dependent stem interactions that point to a potential transition pathway connecting the three motifs during ribosomal elongation. Together, our observations provide new insights into frameshifting mechanisms and anti-viral strategies.
Collapse
|
5
|
Yan S, Zhu Q, Jain S, Schlick T. Length-dependent motions of SARS-CoV-2 frameshifting RNA pseudoknot and alternative conformations suggest avenues for frameshifting suppression. RESEARCH SQUARE 2022:rs.3.rs-1160075. [PMID: 35018371 PMCID: PMC8750709 DOI: 10.21203/rs.3.rs-1160075/v1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Conserved SARS-CoV-2 RNA regions of critical biological functions define excellent targets for anti-viral therapeutics against Covid-19 variants. One such region is the frameshifting element (FSE), responsible for correct translation of viral polyproteins. Here, we analyze molecular-dynamics motions of three FSE conformations, discovered by graph-theory analysis, and associated mutants designed by graph-based inverse folding: two distinct 3-stem H-type pseudoknots and a 3-way junction. We find that the prevalent H-type pseudoknot in literature adopts ring-like conformations, which in combination with 5' end threading could promote ribosomal pausing. An inherent shape switch from "L" to linear that may help trigger the frameshifting is suppressed in our designed mutant. The alternative conformation trajectories suggest a stable intermediate structure with mixed stem interactions of all three conformations, pointing to a possible transition pathway during ribosomal translation. These observations provide new insights into anti-viral strategies and frameshifting mechanisms.
Collapse
Affiliation(s)
- Shuting Yan
- Department of Chemistry, New York University, New York, NY 10003 U.S.A
| | - Qiyao Zhu
- Courant Institute of Mathematical Sciences, New York University, New York, NY 10012 U.S.A
| | - Swati Jain
- Department of Chemistry, New York University, New York, NY 10003 U.S.A
| | - Tamar Schlick
- Department of Chemistry, New York University, New York, NY 10003 U.S.A
- Courant Institute of Mathematical Sciences, New York University, New York, NY 10012 U.S.A
- NYU-ECNU Center for Computational Chemistry, NYU Shanghai, Shanghai 200062, P.R. China
| |
Collapse
|
6
|
Abstract
Novel RNA motif design is of great practical importance for technology and medicine. Increasingly, computational design plays an important role in such efforts. Our coarse-grained RAG (RNA-As-Graphs) framework offers strategies for enumerating the universe of RNA 2D folds, selecting "RNA-like" candidates for design, and determining sequences that fold onto these candidates. In RAG, RNA secondary structures are represented as tree or dual graphs. Graphs with known RNA structures are called "existing", and the others are labeled "hypothetical". By using simplified features for RNA graphs, we have clustered the hypothetical graphs into "RNA-like" and "non-RNA-like" groups and proposed RNA-like graphs as candidates for design. Here, we propose a new way of designing graph features by using Fiedler vectors. The new features reflect graph shapes better, and they lead to a more clustered organization of existing graphs. We show significant increases in K-means clustering accuracy by using the new features (e.g., up to 95% and 98% accuracy for tree and dual graphs, respectively). In addition, we propose a scoring model for top graph candidate selection. This scoring model allows users to set a threshold for candidates, and it incorporates weighing of existing graphs based on their corresponding number of known RNAs. We include a list of top scored RNA-like candidates, which we hope will stimulate future novel RNA design.
Collapse
Affiliation(s)
- Qiyao Zhu
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| | - Tamar Schlick
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
- Department of Chemistry, New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry, NYU Shanghai, Shanghai 200062, P. R. China
| |
Collapse
|
7
|
Jain S, Zhu Q, Paz ASP, Schlick T. Identification of novel RNA design candidates by clustering the extended RNA-As-Graphs library. Biochim Biophys Acta Gen Subj 2020; 1864:129534. [PMID: 31954797 DOI: 10.1016/j.bbagen.2020.129534] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 01/10/2020] [Accepted: 01/14/2020] [Indexed: 12/31/2022]
Abstract
BACKGROUND We re-evaluate our RNA-As-Graphs clustering approach, using our expanded graph library and new RNA structures, to identify potential RNA-like topologies for design. Our coarse-grained approach represents RNA secondary structures as tree and dual graphs, with vertices and edges corresponding to RNA helices and loops. The graph theoretical framework facilitates graph enumeration, partitioning, and clustering approaches to study RNA structure and its applications. METHODS Clustering graph topologies based on features derived from graph Laplacian matrices and known RNA structures allows us to classify topologies into 'existing' or hypothetical, and the latter into, 'RNA-like' or 'non RNA-like' topologies. Here we update our list of existing tree graph topologies and RAG-3D database of atomic fragments to include newly determined RNA structures. We then use linear and quadratic regression, optionally with dimensionality reduction, to derive graph features and apply several clustering algorithms on our tree-graph library and recently expanded dual-graph library to classify them into the three groups. RESULTS The unsupervised PAM and K-means clustering approaches correctly classify 72-77% of all existing graph topologies and 75-82% of newly added ones as RNA-like. For supervised k-NN clustering, the cross-validation accuracy ranges from 57 to 81%. CONCLUSIONS Using linear regression with unsupervised clustering, or quadratic regression with supervised clustering, provides better accuracies than supervised/linear clustering. All accuracies are better than random, especially for newly added existing topologies, thus lending credibility to our approach. GENERAL SIGNIFICANCE Our updated RAG-3D database and motif classification by clustering present new RNA substructures and RNA-like motifs as novel design candidates.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Qiyao Zhu
- Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| | - Amiel S P Paz
- NYU Shanghai, 1555 Century Avenue, Shanghai 200135, China; NYU-ECNU Center for Computational Chemistry, NYU Shanghai, 3663 Zhongshang Road North, Shanghai 200062, China
| | - Tamar Schlick
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; NYU-ECNU Center for Computational Chemistry, NYU Shanghai, 3663 Zhongshang Road North, Shanghai 200062, China.
| |
Collapse
|
8
|
Inverse folding with RNA-As-Graphs produces a large pool of candidate sequences with target topologies. J Struct Biol 2019; 209:107438. [PMID: 31874236 DOI: 10.1016/j.jsb.2019.107438] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 12/18/2019] [Accepted: 12/19/2019] [Indexed: 02/07/2023]
Abstract
We present an RNA-As-Graphs (RAG) based inverse folding algorithm, RAG-IF, to design novel RNA sequences that fold onto target tree graph topologies. The algorithm can be used to enhance our recently reported computational design pipeline (Jain et al., NAR 2018). The RAG approach represents RNA secondary structures as tree and dual graphs, where RNA loops and helices are coarse-grained as vertices and edges, opening the usage of graph theory methods to study, predict, and design RNA structures. Our recently developed computational pipeline for design utilizes graph partitioning (RAG-3D) and atomic fragment assembly (F-RAG) to design sequences to fold onto RNA-like tree graph topologies; the atomic fragments are taken from existing RNA structures that correspond to tree subgraphs. Because F-RAG may not produce the target folds for all designs, automated mutations by RAG-IF algorithm enhance the candidate pool markedly. The crucial residues for mutation are identified by differences between the predicted and the target topology. A genetic algorithm then mutates the selected residues, and the successful sequences are optimized to retain only the minimal or essential mutations. Here we evaluate RAG-IF for 6 RNA-like topologies and generate a large pool of successful candidate sequences with a variety of minimal mutations. We find that RAG-IF adds robustness and efficiency to our RNA design pipeline, making inverse folding motivated by graph topology rather than secondary structure more productive.
Collapse
|
9
|
Jain S, Saju S, Petingi L, Schlick T. An extended dual graph library and partitioning algorithm applicable to pseudoknotted RNA structures. Methods 2019; 162-163:74-84. [PMID: 30928508 DOI: 10.1016/j.ymeth.2019.03.022] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Revised: 02/28/2019] [Accepted: 03/22/2019] [Indexed: 12/18/2022] Open
Abstract
Exploring novel RNA topologies is imperative for understanding RNA structure and pursuing its design. Our RNA-As-Graphs (RAG) approach exploits graph theory tools and uses coarse-grained tree and dual graphs to represent RNA helices and loops by vertices and edges. Only dual graphs represent pseudoknotted RNAs fully. Here we develop a dual graph enumeration algorithm to generate an expanded library of dual graph topologies for 2-9 vertices, and extend our dual graph partitioning algorithm to identify all possible RNA subgraphs. Our enumeration algorithm connects smaller-vertex graphs, using all possible edge combinations, to build larger-vertex graphs and retain all non-isomorphic graph topologies, thereby more than doubling the size of our prior library to a total of 110,667 dual graph topologies. We apply our dual graph partitioning algorithm, which keeps pseudoknots and junctions intact, to all existing RNA structures to identify all possible substructures up to 9 vertices. In addition, our expanded dual graph library assigns graph topologies to all RNA graphs and subgraphs, rectifying prior inconsistencies. We update our RAG-3Dual database of RNA atomic fragments with all newly identified substructures and their graph IDs, increasing its size by more than 50 times. The enlarged dual graph library and RAG-3Dual database provide a comprehensive repertoire of graph topologies and atomic fragments to study yet undiscovered RNA molecules and design RNA sequences with novel topologies, including a variety of pseudoknotted RNAs.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Sera Saju
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Louis Petingi
- Computer Science Department, College of Staten Island, City University of New York, Staten Island, New York, NY 10314, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; NYU-East China Normal University Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China.
| |
Collapse
|
10
|
Jain S, Laederach A, Ramos SBV, Schlick T. A pipeline for computational design of novel RNA-like topologies. Nucleic Acids Res 2018; 46:7040-7051. [PMID: 30137633 PMCID: PMC6101589 DOI: 10.1093/nar/gky524] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Revised: 05/22/2018] [Accepted: 05/24/2018] [Indexed: 12/11/2022] Open
Abstract
Designing novel RNA topologies is a challenge, with important therapeutic and industrial applications. We describe a computational pipeline for design of novel RNA topologies based on our coarse-grained RNA-As-Graphs (RAG) framework. RAG represents RNA structures as tree graphs and describes RNA secondary (2D) structure topologies (currently up to 13 vertices, ≈260 nucleotides). We have previously identified novel graph topologies that are RNA-like among these. Here we describe a systematic design pipeline and illustrate design for six broad design problems using recently developed tools for graph-partitioning and fragment assembly (F-RAG). Following partitioning of the target graph, corresponding atomic fragments from our RAG-3D database are combined using F-RAG, and the candidate atomic models are scored using a knowledge-based potential developed for 3D structure prediction. The sequences of the top scoring models are screened further using available tools for 2D structure prediction. The results indicate that our modular approach based on RNA-like topologies rather than specific 2D structures allows for greater flexibility in the design process, and generates a large number of candidate sequences quickly. Experimental structure probing using SHAPE-MaP for two sequences agree with our predictions and suggest that our combined tools yield excellent candidates for further sequence and experimental screening.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Silvia B V Ramos
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
- Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
- NYU-ECNU Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China
| |
Collapse
|
11
|
Jain S, Bayrak CS, Petingi L, Schlick T. Dual Graph Partitioning Highlights a Small Group of Pseudoknot-Containing RNA Submotifs. Genes (Basel) 2018; 9:E371. [PMID: 30044451 PMCID: PMC6115904 DOI: 10.3390/genes9080371] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Revised: 06/26/2018] [Accepted: 06/26/2018] [Indexed: 12/31/2022] Open
Abstract
RNA molecules are composed of modular architectural units that define their unique structural and functional properties. Characterization of these building blocks can help interpret RNA structure/function relationships. We present an RNA secondary structure motif and submotif library using dual graph representation and partitioning. Dual graphs represent RNA helices as vertices and loops as edges. Unlike tree graphs, dual graphs can represent RNA pseudoknots (intertwined base pairs). For a representative set of RNA structures, we construct dual graphs from their secondary structures, and apply our partitioning algorithm to identify non-separable subgraphs (or blocks) without breaking pseudoknots. We report 56 subgraph blocks up to nine vertices; among them, 22 are frequently occurring, 15 of which contain pseudoknots. We then catalog atomic fragments corresponding to the subgraph blocks to define a library of building blocks that can be used for RNA design, which we call RAG-3Dual, as we have done for tree graphs. As an application, we analyze the distribution of these subgraph blocks within ribosomal RNAs of various prokaryotic and eukaryotic species to identify common subgraphs and possible ancestry relationships. Other applications of dual graph partitioning and motif library can be envisioned for RNA structure analysis and design.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, New York, NY 10003, USA.
| | - Cigdem S Bayrak
- Department of Chemistry, New York University, New York, NY 10003, USA.
| | - Louis Petingi
- Computer Science Department, College of Staten Island, City University of New York, Staten Island, New York, NY 10314, USA.
| | - Tamar Schlick
- Department of Chemistry, New York University, New York, NY 10003, USA.
- Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA.
- NYU-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 3663, China.
| |
Collapse
|
12
|
Abstract
The structure of RNA has been a natural subject for mathematical modeling, inviting many innovative computational frameworks. This single-stranded polynucleotide chain can fold upon itself in numerous ways to form hydrogen-bonded segments, imperfect with single-stranded loops. Illustrating these paired and non-paired interaction networks, known as RNA's secondary (2D) structure, using mathematical graph objects has been illuminating for RNA structure analysis. Building upon such seminal work from the 1970s and 1980s, graph models are now used to study not only RNA structure but also describe RNA's recurring modular units, sample the conformational space accessible to RNAs, predict RNA's three-dimensional folds, and apply the combined aspects to novel RNA design. In this article, we outline the development of the RNA-As-Graphs (or RAG) approach and highlight current applications to RNA structure prediction and design.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, 100 Washington Square East, Silver Building, New York University, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer St., New York, NY 10012, USA; New York University ECNU - Center for Computational Chemistry at NYU Shanghai, 3663 North Zhongshan Road, Shanghai, 200062, China.
| |
Collapse
|
13
|
Jain S, Schlick T. F-RAG: Generating Atomic Coordinates from RNA Graphs by Fragment Assembly. J Mol Biol 2017; 429:3587-3605. [PMID: 28988954 PMCID: PMC5693719 DOI: 10.1016/j.jmb.2017.09.017] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 09/12/2017] [Accepted: 09/22/2017] [Indexed: 10/18/2022]
Abstract
Coarse-grained models represent attractive approaches to analyze and simulate ribonucleic acid (RNA) molecules, for example, for structure prediction and design, as they simplify the RNA structure to reduce the conformational search space. Our structure prediction protocol RAGTOP (RNA-As-Graphs Topology Prediction) represents RNA structures as tree graphs and samples graph topologies to produce candidate graphs. However, for a more detailed study and analysis, construction of atomic from coarse-grained models is required. Here we present our graph-based fragment assembly algorithm (F-RAG) to convert candidate three-dimensional (3D) tree graph models, produced by RAGTOP into atomic structures. We use our related RAG-3D utilities to partition graphs into subgraphs and search for structurally similar atomic fragments in a data set of RNA 3D structures. The fragments are edited and superimposed using common residues, full atomic models are scored using RAGTOP's knowledge-based potential, and geometries of top scoring models is optimized. To evaluate our models, we assess all-atom RMSDs and Interaction Network Fidelity (a measure of residue interactions) with respect to experimentally solved structures and compare our results to other fragment assembly programs. For a set of 50 RNA structures, we obtain atomic models with reasonable geometries and interactions, particularly good for RNAs containing junctions. Additional improvements to our protocol and databases are outlined. These results provide a good foundation for further work on RNA structure prediction and design applications.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; New York University-East China Normal University Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China.
| |
Collapse
|
14
|
Schlick T, Pyle AM. Opportunities and Challenges in RNA Structural Modeling and Design. Biophys J 2017; 113:225-234. [PMID: 28162235 PMCID: PMC5529161 DOI: 10.1016/j.bpj.2016.12.037] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Revised: 12/08/2016] [Accepted: 12/19/2016] [Indexed: 01/27/2023] Open
Abstract
We describe opportunities and challenges in RNA structural modeling and design, as recently discussed during the second Telluride Science Research Center workshop organized in June 2016. Topics include fundamental processes of RNA, such as structural assemblies (hierarchical folding, multiple conformational states and their clustering), RNA motifs, and chemical reactivity of RNA, as used for structural prediction and functional inference. We also highlight the software and database issues associated with RNA structures, such as the multiple approaches for motif annotation, the need for frequent database updating, and the importance of quality control of RNA structures. We discuss various modeling approaches for structure prediction, mechanistic analysis of RNA reactions, and RNA design, and the complementary roles that both atomistic and coarse-grained approaches play in such simulations. Collectively, as scientists from varied disciplines become familiar and drawn into these unique challenges, new approaches and collaborative efforts will undoubtedly be catalyzed.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, New York University, New York, New York; Courant Institute of Mathematical Sciences, New York University, New York, New York.
| | - Anna Marie Pyle
- Department of Molecular and Cellular and Developmental Biology and Department of Chemistry, Yale University; Howard Hughes Medical Institute, New Haven, Connecticut.
| |
Collapse
|
15
|
|