1
|
Majumder S, Pal D. rCGMM: A Coarse-Grained Force Field Embedding Elastic Network for Studying Small Noncoding RNA Dynamics. J Phys Chem B 2025; 129:3159-3170. [PMID: 40101117 DOI: 10.1021/acs.jpcb.4c07286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2025]
Abstract
Short noncoding RNA molecules play significant roles in catalysis, biological regulation, and disease pathways. Their assessment through sequence-based approaches has been a challenge, compounded by the significant structural flexibility accrued from six free backbone torsions per nucleotide. To efficiently study the structure and dynamics of an extensive repertoire of these molecules in a high throughput mode, we have built a coarse-grained force field using one, two, three, and four pseudoatoms to represent the phosphate, sugar, pyrimidines, and purines, respectively. The Boltzmann inversion method was applied to structures of 5 piRNA, 8 miRNA, and 13 siRNA from the Nucleic Acid Database (NDB) to estimate the initial force field parameters and iteratively optimized through 1 μs molecular dynamics run by comparing against an equivalent all-atom simulation using the CHARMM36 force field. We applied an elastic net to model the hydrogen bond network stabilizing the local structure for double-stranded cases. A spine using pseudoatoms was calculated for the same from the coarse-grain beads, and all beads within a threshold radial distance were constrained using soft distance potentials. Lennard-Jones and Coulomb's potential function modeled the nonbonded interaction. Benchmarks on 26 molecules compared through root-mean-square deviation graphs against all-atom simulation show close concurrence for single- and double-stranded small noncoding RNA molecules. The rCGMM force field is available for download at https://github.com/majumderS/rCGMM.
Collapse
Affiliation(s)
- Subhasree Majumder
- Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru 560 012, India
| | - Debnath Pal
- Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru 560 012, India
| |
Collapse
|
2
|
Leitão AL, Enguita FJ. The Unpaved Road of Non-Coding RNA Structure-Function Relationships: Current Knowledge, Available Methodologies, and Future Trends. Noncoding RNA 2025; 11:20. [PMID: 40126344 PMCID: PMC11932211 DOI: 10.3390/ncrna11020020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2025] [Revised: 01/31/2025] [Accepted: 02/26/2025] [Indexed: 03/25/2025] Open
Abstract
The genomes from complex eukaryotes are enriched in non-coding genes whose transcription products (non-coding RNAs) are involved in the regulation of genomic output at different levels. Non-coding RNA action is predominantly driven by sequence and structural motifs that interact with specific functional partners. Despite the exponential growth in primary RNA sequence data facilitated by next-generation sequencing studies, the availability of tridimensional RNA data is comparatively more limited. The subjacent reasons for this relative lack of information regarding RNA structure are related to the specific chemical nature of RNA molecules and the limitations of the currently available methods for structural characterization of biomolecules. In this review, we describe and analyze the different structural motifs involved in non-coding RNA function and the wet-lab and computational methods used to characterize their structure-function relationships, highlighting the current need for detailed structural studies to explore the molecular determinants of non-coding RNA function.
Collapse
Affiliation(s)
- Ana Lúcia Leitão
- Departamento de Química, Faculdade de Ciências e Tecologia, Universidade NOVA de Lisboa, Campus da Caparica, 2829-516 Caparica, Portugal;
| | - Francisco J. Enguita
- Faculdade de Medicina, Universidade de Lisboa, Av. Prof. Egas Moniz, 1649-028 Lisboa, Portugal
| |
Collapse
|
3
|
Imani S, Li X, Chen K, Maghsoudloo M, Jabbarzadeh Kaboli P, Hashemi M, Khoushab S, Li X. Computational biology and artificial intelligence in mRNA vaccine design for cancer immunotherapy. Front Cell Infect Microbiol 2025; 14:1501010. [PMID: 39902185 PMCID: PMC11788159 DOI: 10.3389/fcimb.2024.1501010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Accepted: 12/16/2024] [Indexed: 02/05/2025] Open
Abstract
Messenger RNA (mRNA) vaccines offer an adaptable and scalable platform for cancer immunotherapy, requiring optimal design to elicit a robust and targeted immune response. Recent advancements in bioinformatics and artificial intelligence (AI) have significantly enhanced the design, prediction, and optimization of mRNA vaccines. This paper reviews technologies that streamline mRNA vaccine development, from genomic sequencing to lipid nanoparticle (LNP) formulation. We discuss how accurate predictions of neoantigen structures guide the design of mRNA sequences that effectively target immune and cancer cells. Furthermore, we examine AI-driven approaches that optimize mRNA-LNP formulations, enhancing delivery and stability. These technological innovations not only improve vaccine design but also enhance pharmacokinetics and pharmacodynamics, offering promising avenues for personalized cancer immunotherapy.
Collapse
Affiliation(s)
- Saber Imani
- Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| | - Xiaoyan Li
- Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| | - Keyi Chen
- Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| | - Mazaher Maghsoudloo
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, Sichuan, China
| | | | - Mehrdad Hashemi
- Department of Genetics, Faculty of Advanced Science and Technology, Tehran Medical Sciences, Islamic Azad University, Tehran, Iran
- Farhikhtegan Medical Convergence sciences Research Center, Farhikhtegan Hospital Tehran Medical sciences, Islamic Azad University, Tehran, Iran
| | - Saloomeh Khoushab
- Department of Genetics, Faculty of Advanced Science and Technology, Tehran Medical Sciences, Islamic Azad University, Tehran, Iran
- Farhikhtegan Medical Convergence sciences Research Center, Farhikhtegan Hospital Tehran Medical sciences, Islamic Azad University, Tehran, Iran
| | - Xiaoping Li
- Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| |
Collapse
|
4
|
Chakrabortty A, Mondal S, Bandyopadhyay S. Conformational Properties of Poly(A)-Binding Protein Complexed with Poly(A) RNA. J Phys Chem B 2024; 128:6449-6462. [PMID: 38941243 DOI: 10.1021/acs.jpcb.4c00704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Microscopic understanding of protein-RNA interactions is important for different biological activities, such as RNA transport, translation, splicing, silencing, etc. Polyadenine (Poly(A)) binding proteins (PABPs) make up a class of regulatory proteins that play critical roles in protecting the poly(A) tails of cellular mRNAs from nuclease degradation. In this work, we performed molecular dynamics simulations to investigate the conformational modifications of human PABP protein and poly(A) RNA that occur during complexation. It is demonstrated that the intermediate linker domain of the protein transforms from a disordered coil-like structure to a helical form during the recognition process, leading to the formation of the complex. On the other hand, disordered collapsed coil-like RNA on complexation has been found to transform into a rigid extended conformation. Importantly, the binding free energy calculation showed that the thermodynamic stability of the complex is primarily guided by favorable hydrophobic interactions between the protein and the RNA.
Collapse
Affiliation(s)
- Arun Chakrabortty
- Centre for Computational and Data Sciences, Indian Institute of Technology Kharagpur, Kharagpur - 721302, India
| | - Sandip Mondal
- Molecular Modeling Laboratory, Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur - 721302, India
| | - Sanjoy Bandyopadhyay
- Molecular Modeling Laboratory, Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur - 721302, India
| |
Collapse
|
5
|
Wang F, Xia R, Su Y, Cai P, Xu X. Quantifying RNA structures and interactions with a unified reduced chain representation model. Int J Biol Macromol 2023; 253:127181. [PMID: 37793523 DOI: 10.1016/j.ijbiomac.2023.127181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 08/30/2023] [Accepted: 09/25/2023] [Indexed: 10/06/2023]
Abstract
RNA is a pivotal molecule that plays critical roles in various cellular processes. Quantifying RNA structures and interactions is essential to understanding RNA function and developing RNA-based therapeutics. Using a unified five-bead model and a non-redundant database, this paper investigates the structural features and interactions of five commonly occurring RNA motifs, i.e., double-stranded helices, hairpin loops, internal/bulge loops, multi-branched junctions, and single-stranded terminal tails. Analyzing detailed distributions of RNA local structural features and base-base interactions reveals a preference for helical structures in both local backbone structures and base orientations. The interactions between adjacent bases exhibit motif-specific and sequence-dependent characteristics, reflecting the distinct topological constraints imposed by different loop-helix connection modes and the varying pairing and stacking interactions among different sequences. These findings shed light on the stability of RNA helices, emphasizing their significance in providing dominant base pairing and stacking interactions for RNA structures and stability. The four non-helix motifs encompass unpaired nucleotide loops and exhibit diverse base-base interactions, contributing to the structural diversity observed in RNA. Overall, the complexity of RNA structure arises from the intricate interplay of base-base interactions.
Collapse
Affiliation(s)
- Fengfei Wang
- Institute of Bioinformatics and Medical Engineering, School of Mathematics and Physics, Jiangsu University of Technology, Changzhou 213001, China
| | - Renjie Xia
- Institute of Bioinformatics and Medical Engineering, School of Mathematics and Physics, Jiangsu University of Technology, Changzhou 213001, China
| | - Yangyang Su
- Institute of Bioinformatics and Medical Engineering, School of Mathematics and Physics, Jiangsu University of Technology, Changzhou 213001, China
| | - Pinggen Cai
- Department of Applied Physics, Zhejiang University of Technology, Hangzhou 310023, China.
| | - Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering, School of Mathematics and Physics, Jiangsu University of Technology, Changzhou 213001, China.
| |
Collapse
|
6
|
Pandey P, Panday SK, Rimal P, Ancona N, Alexov E. Predicting the Effect of Single Mutations on Protein Stability and Binding with Respect to Types of Mutations. Int J Mol Sci 2023; 24:12073. [PMID: 37569449 PMCID: PMC10418460 DOI: 10.3390/ijms241512073] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/13/2023] Open
Abstract
The development of methods and algorithms to predict the effect of mutations on protein stability, protein-protein interaction, and protein-DNA/RNA binding is necessitated by the needs of protein engineering and for understanding the molecular mechanism of disease-causing variants. The vast majority of the leading methods require a database of experimentally measured folding and binding free energy changes for training. These databases are collections of experimental data taken from scientific investigations typically aimed at probing the role of particular residues on the above-mentioned thermodynamic characteristics, i.e., the mutations are not introduced at random and do not necessarily represent mutations originating from single nucleotide variants (SNV). Thus, the reported performance of the leading algorithms assessed on these databases or other limited cases may not be applicable for predicting the effect of SNVs seen in the human population. Indeed, we demonstrate that the SNVs and non-SNVs are not equally presented in the corresponding databases, and the distribution of the free energy changes is not the same. It is shown that the Pearson correlation coefficients (PCCs) of folding and binding free energy changes obtained in cases involving SNVs are smaller than for non-SNVs, indicating that caution should be used in applying them to reveal the effect of human SNVs. Furthermore, it is demonstrated that some methods are sensitive to the chemical nature of the mutations, resulting in PCCs that differ by a factor of four across chemically different mutations. All methods are found to underestimate the energy changes by roughly a factor of 2.
Collapse
Affiliation(s)
- Preeti Pandey
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| | - Shailesh Kumar Panday
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| | - Prawin Rimal
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| | - Nicolas Ancona
- Department of Biological Sciences, Clemson University, Clemson, SC 29634, USA;
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| |
Collapse
|
7
|
Wang X, Tan YL, Yu S, Shi YZ, Tan ZJ. Predicting 3D structures and stabilities for complex RNA pseudoknots in ion solutions. Biophys J 2023; 122:1503-1516. [PMID: 36924021 PMCID: PMC10147842 DOI: 10.1016/j.bpj.2023.03.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 03/05/2023] [Accepted: 03/10/2023] [Indexed: 03/17/2023] Open
Abstract
RNA pseudoknots are a kind of important tertiary motif, and the structures and stabilities of pseudoknots are generally critical to the biological functions of RNAs with the motifs. In this work, we have carefully refined our previously developed coarse-grained model with salt effect through involving a new coarse-grained force field and a replica-exchange Monte Carlo algorithm, and employed the model to predict structures and stabilities of complex RNA pseudoknots in ion solutions beyond minimal H-type pseudoknots. Compared with available experimental data, the newly refined model can successfully predict 3D structures from sequences for the complex RNA pseudoknots including SARS-CoV-2 programming-1 ribosomal frameshifting element and Zika virus xrRNA, and can reliably predict the thermal stabilities of RNA pseudoknots with various sequences and lengths over broad ranges of monovalent/divalent salts. In addition, for complex pseudoknots including SARS-CoV-2 frameshifting element, our analyses show that their thermally unfolding pathways are mainly dependent on the relative stabilities of unfolded intermediate states, in analogy to those of minimal H-type pseudoknots.
Collapse
Affiliation(s)
- Xunxun Wang
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science and School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Shixiong Yu
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science and School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China.
| |
Collapse
|
8
|
Bagnolini G, Luu TB, Hargrove AE. Recognizing the power of machine learning and other computational methods to accelerate progress in small molecule targeting of RNA. RNA (NEW YORK, N.Y.) 2023; 29:473-488. [PMID: 36693763 PMCID: PMC10019373 DOI: 10.1261/rna.079497.122] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
RNA structures regulate a wide range of processes in biology and disease, yet small molecule chemical probes or drugs that can modulate these functions are rare. Machine learning and other computational methods are well poised to fill gaps in knowledge and overcome the inherent challenges in RNA targeting, such as the dynamic nature of RNA and the difficulty of obtaining RNA high-resolution structures. Successful tools to date include principal component analysis, linear discriminate analysis, k-nearest neighbor, artificial neural networks, multiple linear regression, and many others. Employment of these tools has revealed critical factors for selective recognition in RNA:small molecule complexes, predictable differences in RNA- and protein-binding ligands, and quantitative structure activity relationships that allow the rational design of small molecules for a given RNA target. Herein we present our perspective on the value of using machine learning and other computation methods to advance RNA:small molecule targeting, including select examples and their validation as well as necessary and promising future directions that will be key to accelerate discoveries in this important field.
Collapse
Affiliation(s)
- Greta Bagnolini
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - TinTin B Luu
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Amanda E Hargrove
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
- Department of Biochemistry, Duke University School of Medicine, Durham, North Carolina 27710, USA
| |
Collapse
|
9
|
Tan YL, Wang X, Yu S, Zhang B, Tan ZJ. cgRNASP: coarse-grained statistical potentials with residue separation for RNA structure evaluation. NAR Genom Bioinform 2023; 5:lqad016. [PMID: 36879898 PMCID: PMC9985339 DOI: 10.1093/nargab/lqad016] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 01/21/2023] [Accepted: 02/03/2023] [Indexed: 03/07/2023] Open
Abstract
Knowledge-based statistical potentials are very important for RNA 3-dimensional (3D) structure prediction and evaluation. In recent years, various coarse-grained (CG) and all-atom models have been developed for predicting RNA 3D structures, while there is still lack of reliable CG statistical potentials not only for CG structure evaluation but also for all-atom structure evaluation at high efficiency. In this work, we have developed a series of residue-separation-based CG statistical potentials at different CG levels for RNA 3D structure evaluation, namely cgRNASP, which is composed of long-ranged and short-ranged interactions by residue separation. Compared with the newly developed all-atom rsRNASP, the short-ranged interaction in cgRNASP was involved more subtly and completely. Our examinations show that, the performance of cgRNASP varies with CG levels and compared with rsRNASP, cgRNASP has similarly good performance for extensive types of test datasets and can have slightly better performance for the realistic dataset-RNA-Puzzles dataset. Furthermore, cgRNASP is strikingly more efficient than all-atom statistical potentials/scoring functions, and can be apparently superior to other all-atom statistical potentials and scoring functions trained from neural networks for the RNA-Puzzles dataset. cgRNASP is available at https://github.com/Tan-group/cgRNASP.
Collapse
Affiliation(s)
- Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan 430073, China.,Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Xunxun Wang
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Shixiong Yu
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Bengong Zhang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan 430073, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
10
|
Kumar S, Reddy G. TPP Riboswitch Populates Holo-Form-like Structure Even in the Absence of Cognate Ligand at High Mg 2+ Concentration. J Phys Chem B 2022; 126:2369-2381. [PMID: 35298161 DOI: 10.1021/acs.jpcb.1c10794] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Riboswitches are noncoding RNA that regulate gene expression by folding into specific three-dimensional structures (holo-form) upon binding by their cognate ligand in the presence of Mg2+. Riboswitch functioning is also hypothesized to be under kinetic control requiring large cognate ligand concentrations. We ask the question under thermodynamic conditions, can the riboswitches populate structures similar to the holo-form only in the presence of Mg2+ and absence of cognate ligand binding. We addressed this question using thiamine pyrophosphate (TPP) riboswitch as a model system and computer simulations using a coarse-grained model for RNA. The folding free energy surface (FES) shows that with the initial increase in Mg2+ concentration ([Mg2+]), the aptamer domain (AD) of TPP riboswitch undergoes a barrierless collapse in its dimensions. On further increase in [Mg2+], intermediates separated by barriers appear on the FES, and one of the intermediates has a TPP ligand-binding competent structure. We show that site-specific binding of the Mg2+ aids in the formation of tertiary contacts. For [Mg2+] greater than physiological concentration, AD folds into a structure similar to the crystal structure of the TPP holo-form even in the absence of the TPP ligand. The folding kinetics shows that TPP AD populates an intermediate due to the misalignment of two arms present in the structure, which acts as a kinetic trap, leading to larger folding timescales. The predictions of the intermediate structures from the simulations are amenable for experimental verification.
Collapse
Affiliation(s)
- Sunil Kumar
- Solid State and Structural Chemistry Unit, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Govardhan Reddy
- Solid State and Structural Chemistry Unit, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| |
Collapse
|
11
|
rsRNASP: A residue-separation-based statistical potential for RNA 3D structure evaluation. Biophys J 2022; 121:142-156. [PMID: 34798137 PMCID: PMC8758408 DOI: 10.1016/j.bpj.2021.11.016] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 10/23/2021] [Accepted: 11/10/2021] [Indexed: 01/07/2023] Open
Abstract
Knowledge-based statistical potentials have been shown to be rather effective in protein 3-dimensional (3D) structure evaluation and prediction. Recently, several statistical potentials have been developed for RNA 3D structure evaluation, while their performances are either still at a low level for the test datasets from structure prediction models or dependent on the "black-box" process through neural networks. In this work, we have developed an all-atom distance-dependent statistical potential based on residue separation for RNA 3D structure evaluation, namely rsRNASP, which is composed of short- and long-ranged potentials distinguished by residue separation. The extensive examinations against available RNA test datasets show that rsRNASP has apparently higher performance than the existing statistical potentials for the realistic test datasets with large RNAs from structure prediction models, including the newly released RNA-Puzzles dataset, and is comparable to the existing top statistical potentials for the test datasets with small RNAs or near-native decoys. In addition, rsRNASP is superior to RNA3DCNN, a recently developed scoring function through 3D convolutional neural networks. rsRNASP and the relevant databases are available to the public.
Collapse
|
12
|
Poblete S, Božič A, Kanduč M, Podgornik R, Guzman HV. RNA Secondary Structures Regulate Adsorption of Fragments onto Flat Substrates. ACS OMEGA 2021; 6:32823-32831. [PMID: 34901632 PMCID: PMC8655909 DOI: 10.1021/acsomega.1c04774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 11/04/2021] [Indexed: 06/14/2023]
Abstract
RNA is a functionally rich molecule with multilevel, hierarchical structures whose role in the adsorption to molecular substrates is only beginning to be elucidated. Here, we introduce a multiscale simulation approach that combines a tractable coarse-grained RNA structural model with an interaction potential of a structureless flat adsorbing substrate. Within this approach, we study the specific role of stem-hairpin and multibranch RNA secondary structure motifs on its adsorption phenomenology. Our findings identify a dual regime of adsorption for short RNA fragments with and without the secondary structure and underline the adsorption efficiency in both cases as a function of the surface interaction strength. The observed behavior results from an interplay between the number of contacts formed at the surface and the conformational entropy of the RNA molecule. The adsorption phenomenology of RNA seems to persist also for much longer RNAs as qualitatively observed by comparing the trends of our simulations with a theoretical approach based on an ideal semiflexible polymer chain.
Collapse
Affiliation(s)
- Simón Poblete
- Instituto
de Ciencias Físicas y Matemáticas, Universidad Austral de Chile, Valdivia 5091000, Chile
- Computational
Biology Lab, Fundación Ciencia &
Vida, Santiago 7780272, Chile
| | - Anže Božič
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| | - Matej Kanduč
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| | - Rudolf Podgornik
- School
of Physical Sciences and Kavli Institute for Theoretical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Institute
of Physics, Chinese Academy of Sciences, Beijing 100190, China
- Wenzhou
Institute of the University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325000, China
- Department
of Physics, Faculty of Mathematics and Physics, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Horacio V. Guzman
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| |
Collapse
|
13
|
Mazzanti L, Alferkh L, Frezza E, Pasquali S. Biasing RNA Coarse-Grained Folding Simulations with Small-Angle X-ray Scattering Data. J Chem Theory Comput 2021; 17:6509-6521. [PMID: 34506136 DOI: 10.1021/acs.jctc.1c00441] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
RNA molecules can easily adopt alternative structures in response to different environmental conditions. As a result, a molecule's energy landscape is rough and can exhibit a multitude of deep basins. In the absence of a high-resolution structure, small-angle X-ray scattering data (SAXS) can narrow down the conformational space available to the molecule and be used in conjunction with physical modeling to obtain high-resolution putative structures to be further tested by experiments. Because of the low resolution of these data, it is natural to implement the integration of SAXS data into simulations using a coarse-grained representation of the molecule, allowing for much wider searches and faster evaluation of SAXS theoretical intensity curves than with atomistic models. We present here the theoretical framework and the implementation of a simulation approach based on our coarse-grained model HiRE-RNA combined with SAXS evaluations "on-the-fly" leading the simulation toward conformations agreeing with the scattering data, starting from partially folded structures as the ones that can easily be obtained from secondary structure prediction-based tools. We show on three benchmark systems how our approach can successfully achieve high-resolution structures with remarkable similarity with the native structure recovering not only the overall shape, as imposed by SAXS data, but also the details of initially missing base pairs.
Collapse
Affiliation(s)
- Liuba Mazzanti
- Laboratoire CiTCoM, CNRS UMR 8038, Université de Paris, 4 Avenue de l'observatoire, 75006 Paris, France
| | - Lina Alferkh
- Laboratoire CiTCoM, CNRS UMR 8038, Université de Paris, 4 Avenue de l'observatoire, 75006 Paris, France
| | - Elisa Frezza
- Laboratoire CiTCoM, CNRS UMR 8038, Université de Paris, 4 Avenue de l'observatoire, 75006 Paris, France
| | - Samuela Pasquali
- Laboratoire CiTCoM, CNRS UMR 8038, Université de Paris, 4 Avenue de l'observatoire, 75006 Paris, France
| |
Collapse
|
14
|
Li J, Chen SJ. RNA 3D Structure Prediction Using Coarse-Grained Models. Front Mol Biosci 2021; 8:720937. [PMID: 34277713 PMCID: PMC8283274 DOI: 10.3389/fmolb.2021.720937] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Accepted: 06/17/2021] [Indexed: 12/12/2022] Open
Abstract
The three-dimensional (3D) structures of Ribonucleic acid (RNA) molecules are essential to understanding their various and important biological functions. However, experimental determination of the atomic structures is laborious and technically difficult. The large gap between the number of sequences and the experimentally determined structures enables the thriving development of computational approaches to modeling RNAs. However, computational methods based on all-atom simulations are intractable for large RNA systems, which demand long time simulations. Facing such a challenge, many coarse-grained (CG) models have been developed. Here, we provide a review of CG models for modeling RNA 3D structures, compare the performance of the different models, and offer insights into potential future developments.
Collapse
Affiliation(s)
| | - Shi-Jie Chen
- Departments of Physics and Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, MO, United States
| |
Collapse
|
15
|
Zhang D, Li J, Chen SJ. IsRNA1: De Novo Prediction and Blind Screening of RNA 3D Structures. J Chem Theory Comput 2021; 17:1842-1857. [PMID: 33560836 DOI: 10.1021/acs.jctc.0c01148] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Modeling structures and functions of large ribonucleic acid (RNAs) especially with complicated topologies is highly challenging due to the inefficiency of large conformational sampling and the presence of complicated tertiary interactions. To address this problem, one highly promising approach is coarse-grained modeling. Here, following an iterative simulated reference state approach to decipher the correlations between different structural parameters, we developed a potent coarse-grained RNA model named as IsRNA1 for RNA studies. Molecular dynamics simulations in the IsRNA1 can predict the native structures of small RNAs from a sequence and fold medium-sized RNAs into near-native tertiary structures with the assistance of secondary structure constraints. A large-scale benchmark test on RNA 3D structure prediction shows that IsRNA1 exhibits improved performance for relatively large RNAs of complicated topologies, such as large stem-loop structures and structures containing long-range tertiary interactions. The advantages of IsRNA1 include the consideration of the correlations between the different structural variables, the appropriate characterization of canonical base-pairing and base-stacking interactions, and the better sampling for the backbone conformations. Moreover, a blind screening protocol was developed based on IsRNA1 to identify good structural models from a pool of candidates without prior knowledge of the native structures.
Collapse
Affiliation(s)
- Dong Zhang
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Jun Li
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri 65211, United States
| |
Collapse
|
16
|
Hurst T, Zhang D, Zhou Y, Chen SJ. A Bayes-inspired theory for optimally building an efficient coarse-grained folding force field. COMMUNICATIONS IN INFORMATION AND SYSTEMS 2021; 21:65-83. [PMID: 34354546 PMCID: PMC8336718 DOI: 10.4310/cis.2021.v21.n1.a4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Because of their potential utility in predicting conformational changes and assessing folding dynamics, coarse-grained (CG) RNA folding models are appealing for rapid characterization of RNA molecules. Previously, we reported the iterative simulated RNA reference state (IsRNA) method for parameterizing a CG force field for RNA folding, which consecutively updates the simulation force field to reflect marginal distributions of folding coordinates in the structure database and extract various energy terms. While the IsRNA model was validated by showing close agreement between the IsRNA-simulated and experimentally observed distributions, here, we expand our theoretical understanding of the model and, in doing so, improve the parameterization process to optimize the subset of included folding coordinates, which leads to accelerated simulations. Using statistical mechanical theory, we analyze the underlying, Bayesian concept that drives parameterization of the energy function, providing a general method for developing predictive, knowledge-based, polymer force fields on the basis of limited data. Furthermore, we propose an optimal parameterization procedure, based on the principal of maximum entropy.
Collapse
Affiliation(s)
- Travis Hurst
- Department of Physics, University of Missouri-Columbia, Columbia, MO 65211, USA
| | - Dong Zhang
- Department of Physics, University of Missouri-Columbia
| | - Yuanzhe Zhou
- Department of Physics, University of Missouri-Columbia, Columbia, MO 65211, USA
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO 65211, USA
| |
Collapse
|
17
|
Halder A, Kumar S, Valsson O, Reddy G. Mg 2+ Sensing by an RNA Fragment: Role of Mg 2+-Coordinated Water Molecules. J Chem Theory Comput 2020; 16:6702-6715. [PMID: 32941038 DOI: 10.1021/acs.jctc.0c00589] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
RNA molecules selectively bind to specific metal ions to populate their functional active states, making it important to understand their source of ion selectivity. In large RNA systems, metal ions interact with the RNA at multiple locations, making it difficult to decipher the precise role of ions in folding. To overcome this complexity, we studied the role of different metal ions (Mg2+, Ca2+, and K+) in the folding of a small RNA hairpin motif (5'-ucCAAAga-3') using unbiased all-atom molecular dynamics simulations. The advantage of studying this system is that it requires specific binding of a single metal ion to fold to its native state. We find that even for this small RNA, the folding free energy surface (FES) is multidimensional as different metal ions present in the solution can simultaneously facilitate folding. The FES shows that specific binding of a metal ion is indispensable for its folding. We further show that in addition to the negatively charged phosphate groups, the spatial organization of electronegative nucleobase atoms drives the site-specific binding of the metal ions. Even though the binding site cannot discriminate between different metal ions, RNA folds efficiently only in a Mg2+ solution. We show that the rigid network of Mg2+-coordinated water molecules facilitates the formation of important interactions in the transition state. The other metal ions such as K+ and Ca2+ cannot facilitate the formation of such interactions. These results allow us to hypothesize possible metal-sensing mechanisms in large metalloriboswitches and also provide useful insights into the design of appropriate collective variables for studying large RNA molecules using enhanced sampling methods.
Collapse
Affiliation(s)
- Antarip Halder
- Solid State and Structural Chemistry Unit, Indian Institute of Science, Bengaluru 560012, Karnataka, India
| | - Sunil Kumar
- Solid State and Structural Chemistry Unit, Indian Institute of Science, Bengaluru 560012, Karnataka, India
| | - Omar Valsson
- Max Planck Institute for Polymer Research, Ackermannweg 10, D-55128 Mainz, Germany
| | - Govardhan Reddy
- Solid State and Structural Chemistry Unit, Indian Institute of Science, Bengaluru 560012, Karnataka, India
| |
Collapse
|
18
|
Bag S, Aggarwal A, Maiti PK. Machine Learning Prediction of Electronic Coupling between the Guanine Bases of DNA. J Phys Chem A 2020; 124:7658-7664. [DOI: 10.1021/acs.jpca.0c04368] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Saientan Bag
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India
| | - Abhishek Aggarwal
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India
| | - Prabal K. Maiti
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India
| |
Collapse
|
19
|
Badu S, Melnik R, Singh S. Mathematical and computational models of RNA nanoclusters and their applications in data-driven environments. MOLECULAR SIMULATION 2020. [DOI: 10.1080/08927022.2020.1804564] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Shyam Badu
- MS2Discovery Interdisciplinary Research Institute, Wilfrid Laurier University, Waterloo, Ontario, Canada
| | - Roderick Melnik
- MS2Discovery Interdisciplinary Research Institute, Wilfrid Laurier University, Waterloo, Ontario, Canada
- BCAM-Basque Center for Applied Mathematics, Bilbao, Spain
| | - Sundeep Singh
- MS2Discovery Interdisciplinary Research Institute, Wilfrid Laurier University, Waterloo, Ontario, Canada
| |
Collapse
|
20
|
Palomino‐Hernandez O, Margreiter MA, Rossetti G. Challenges in RNA Regulation in Huntington's Disease: Insights from Computational Studies. Isr J Chem 2020. [DOI: 10.1002/ijch.202000021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Oscar Palomino‐Hernandez
- Computational Biomedicine, Institute of Neuroscience and Medicine (INM-9)/Instute for advanced simulations (IAS-5)Forschungszentrum Juelich 52425 Jülich Germany
- Faculty 1RWTH Aachen 52425 Aachen Germany
- Computation-based Science and Technology Research CenterThe Cyprus Institute Nicosia 2121 Cyprus
- Institute of Life ScienceThe Hebrew University of Jerusalem Jerusalem 91904 Israel
| | - Michael A. Margreiter
- Computational Biomedicine, Institute of Neuroscience and Medicine (INM-9)/Instute for advanced simulations (IAS-5)Forschungszentrum Juelich 52425 Jülich Germany
- Faculty 1RWTH Aachen 52425 Aachen Germany
| | - Giulia Rossetti
- Computational Biomedicine, Institute of Neuroscience and Medicine (INM-9)/Instute for advanced simulations (IAS-5)Forschungszentrum Juelich 52425 Jülich Germany
- Jülich Supercomputing Centre (JSC)Forschungszentrum Jülich 52425 Jülich Germany
- Department of Hematology, Oncology, Hemostaseology and Stem Cell Transplantation University Hospital AachenRWTH Aachen University Pauwelsstraße 30 52074 Aachen Germany
| |
Collapse
|
21
|
In silico design and validation of high-affinity RNA aptamers targeting epithelial cellular adhesion molecule dimers. Proc Natl Acad Sci U S A 2020; 117:8486-8493. [PMID: 32234785 DOI: 10.1073/pnas.1913242117] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Nucleic acid aptamers hold great promise for therapeutic applications due to their favorable intrinsic properties, as well as high-throughput experimental selection techniques. Despite the utility of the systematic evolution of ligands by the exponential enrichment (SELEX) method for aptamer determination, complementary in silico aptamer design is highly sought after to facilitate virtual screening and increased understanding of important nucleic acid-protein interactions. Here, with a combined experimental and theoretical approach, we have developed two optimal epithelial cellular adhesion molecule (EpCAM) aptamers. Our structure-based in silico method first predicts their binding modes and then optimizes them for EpCAM with molecular dynamics simulations, docking, and free energy calculations. Our isothermal titration calorimetry experiments further confirm that the EpCAM aptamers indeed exhibit enhanced affinity over a previously patented nanomolar aptamer, EP23. Moreover, our study suggests that EP23 and the de novo designed aptamers primarily bind to EpCAM dimers (and not monomers, as hypothesized in previous published works), suggesting a paradigm for developing EpCAM-targeted therapies.
Collapse
|
22
|
Larkey NE, Phillips JL, Jang HS, Kolluri SK, Burrows SM. Small RNA Biosensor Design Strategy To Mitigate Off-Analyte Response. ACS Sens 2020; 5:377-384. [PMID: 31942801 DOI: 10.1021/acssensors.9b01968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Several bottlenecks in the design of current sensor technologies for small noncoding RNA must be addressed. The small size of the sensors and the large number of other nucleotides that may have sequence similarity makes selectivity a real concern. Many of the current sensors have one strand with an exposed region called a toehold. The toehold serves as a place for the analyte nucleic acid strand to bind and initiate competitive displacement of sensors' secondary strands. Since the toehold region is not protected, any endogenous oligonucleotide sequences that are similar or only different by a few nucleic acids will interact with the toehold and cause false signals. To address sensor selectivity, we investigated how the toehold location in the sensor impacts the sensitivity and selectivity for the analyte of interest. We will discuss the differences in sensitivity and selectivity for a miR-146a-5p biosensor in the presence of different naturally occurring mismatch sequences. We found that altering the toehold location lowered the rate of the false signal from off-analyte microRNA by upward of 20 percentage points. Detection limits as low as 56 pM were observed when the sensor concentration was 5 nM. The findings herein are broadly applicable to other small and large RNAs as well as other types of sensing platforms.
Collapse
Affiliation(s)
- Nicholas E. Larkey
- Department of Chemistry, Oregon State University, 153 Gilbert Hall, Corvallis, Oregon 97331, United States
| | - Jessica L. Phillips
- Department of Environmental and Molecular Toxicology, Cancer Research Laboratory, Oregon State University, Corvallis, Oregon 97331, United States
| | - Hyo Sang Jang
- Department of Environmental and Molecular Toxicology, Cancer Research Laboratory, Oregon State University, Corvallis, Oregon 97331, United States
| | - Siva K. Kolluri
- Department of Environmental and Molecular Toxicology, Cancer Research Laboratory, Oregon State University, Corvallis, Oregon 97331, United States
| | - Sean M. Burrows
- Department of Chemistry, Oregon State University, 153 Gilbert Hall, Corvallis, Oregon 97331, United States
| |
Collapse
|
23
|
Poblete S, Pérez-Acle T. Brief Comparison between Experimental and Computationally Generated Ensembles of RNA Dinucleotides. J Chem Inf Model 2020; 60:989-994. [PMID: 31891267 DOI: 10.1021/acs.jcim.9b00913] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The computational modeling of RNA and its interactions is of crucial importance for the understanding of the wide variety of biological functions it performs. Among these approaches, several coarse-grained models employ structural databases to derive their energy functions or to define scoring functions for structure prediction purposes. In many cases, the parametrization is done by using as a reference a set of experimentally determined structures or data obtained from Molecular Dynamics simulations. Since the two choices are clearly different, we present here a brief comparison of the essential spaces of a set of conformations of two topologically connected nucleotides generated by these means. We find that when the nucleotides are embedded into a duplex, the essential spaces of both ensembles are quite restricted and do not differ substantially. Nevertheless, when the conformational space of a free dinucleoside monophosphate simulation is compared against a similar set obtained from the structural database, the differences of the essential spaces are considerable. In addition, we show a brief comparison of a specific distance between the nucleotides which correlates with the sugar pucker of the first nucleotide and analyze its distribution under similar conditions.
Collapse
Affiliation(s)
- Simón Poblete
- Instituto de Ciencias Físicas y Matemáticas , Universidad Austral de Chile , Casilla 567 , Valdivia 5090000 , Chile.,Computational Biology Lab , Fundación Ciencia & Vida , Avenida Zañartu 1482 , Ñuñoa, Santiago 7780272 , Chile
| | - Tomás Pérez-Acle
- Computational Biology Lab , Fundación Ciencia & Vida , Avenida Zañartu 1482 , Ñuñoa, Santiago 7780272 , Chile.,Centro Interdisciplinario de Neurociencia de Valparaı́so , Universidad de Valparaı́so , Pasaje Harrington 287 , Playa Ancha , Valparaı́so , Chile.,Universidad San Sebastian , Carmen Sylva 2444 , Santiago 7510156 , Chile
| |
Collapse
|
24
|
Theory and simulations for RNA folding in mixtures of monovalent and divalent cations. Proc Natl Acad Sci U S A 2019; 116:21022-21030. [PMID: 31570624 DOI: 10.1073/pnas.1911632116] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
RNA molecules cannot fold in the absence of counterions. Experiments are typically performed in the presence of monovalent and divalent cations. How to treat the impact of a solution containing a mixture of both ion types on RNA folding has remained a challenging problem for decades. By exploiting the large concentration difference between divalent and monovalent ions used in experiments, we develop a theory based on the reference interaction site model (RISM), which allows us to treat divalent cations explicitly while keeping the implicit screening effect due to monovalent ions. Our theory captures both the inner shell and outer shell coordination of divalent cations to phosphate groups, which we demonstrate is crucial for an accurate calculation of RNA folding thermodynamics. The RISM theory for ion-phosphate interactions when combined with simulations based on a transferable coarse-grained model allows us to predict accurately the folding of several RNA molecules in a mixture containing monovalent and divalent ions. The calculated folding free energies and ion-preferential coefficients for RNA molecules (pseudoknots, a fragment of the rRNA, and the aptamer domain of the adenine riboswitch) are in excellent agreement with experiments over a wide range of monovalent and divalent ion concentrations. Because the theory is general, it can be readily used to investigate ion and sequence effects on DNA properties.
Collapse
|
25
|
Tan YL, Feng CJ, Jin L, Shi YZ, Zhang W, Tan ZJ. What is the best reference state for building statistical potentials in RNA 3D structure evaluation? RNA (NEW YORK, N.Y.) 2019; 25:793-812. [PMID: 30996105 PMCID: PMC6573789 DOI: 10.1261/rna.069872.118] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Accepted: 04/06/2019] [Indexed: 05/14/2023]
Abstract
Knowledge-based statistical potentials have been shown to be efficient in protein structure evaluation/prediction, and the core difference between various statistical potentials is attributed to the choice of reference states. However, for RNA 3D structure evaluation, a comprehensive examination on reference states is still lacking. In this work, we built six statistical potentials based on six reference states widely used in protein structure evaluation, including averaging, quasi-chemical approximation, atom-shuffled, finite-ideal-gas, spherical-noninteracting, and random-walk-chain reference states, and we examined the six reference states against three RNA test sets including six subsets. Our extensive examinations show that, overall, for identifying native structures and ranking decoy structures, the finite-ideal-gas and random-walk-chain reference states are slightly superior to others, while for identifying near-native structures, there is only a slight difference between these reference states. Our further analyses show that the performance of a statistical potential is apparently dependent on the quality of the training set. Furthermore, we found that the performance of a statistical potential is closely related to the origin of test sets, and for the three realistic test subsets, the six statistical potentials have overall unsatisfactory performance. This work presents a comprehensive examination on the existing reference states and statistical potentials for RNA 3D structure evaluation.
Collapse
Affiliation(s)
- Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Chen-Jie Feng
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan 430073, China
| | - Wenbing Zhang
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
26
|
Pasquali S, Frezza E, Barroso da Silva FL. Coarse-grained dynamic RNA titration simulations. Interface Focus 2019; 9:20180066. [PMID: 31065339 DOI: 10.1098/rsfs.2018.0066] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/13/2019] [Indexed: 12/11/2022] Open
Abstract
Electrostatic interactions play a pivotal role in many biomolecular processes. The molecular organization and function in biological systems are largely determined by these interactions. Owing to the highly negative charge of RNA, the effect is expected to be more pronounced in this system. Moreover, RNA base pairing is dependent on the charge of the base, giving rise to alternative secondary and tertiary structures. The equilibrium between uncharged and charged bases is regulated by the solution pH, which is therefore a key environmental condition influencing the molecule's structure and behaviour. By means of constant-pH Monte Carlo simulations based on a fast proton titration scheme, coupled with the coarse-grained model HiRE-RNA, molecular dynamic simulations of RNA molecules at constant pH enable us to explore the RNA conformational plasticity at different pH values as well as to compute electrostatic properties as local pK a values for each nucleotide.
Collapse
Affiliation(s)
- S Pasquali
- Laboratoire de Cristallographie et RMN Biologiques, CNRS UMR 8015, Université Paris Descartes, Paris 75006, France
| | - E Frezza
- Laboratoire de Cristallographie et RMN Biologiques, CNRS UMR 8015, Université Paris Descartes, Paris 75006, France
| | - F L Barroso da Silva
- Departamento de Física e Química, Faculdade de Ciência s Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av. do café, s/no, Ribeirão Preto, SP BR-14040-903, Brazil.,Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
27
|
Ponce-Salvatierra A, Astha, Merdas K, Nithin C, Ghosh P, Mukherjee S, Bujnicki JM. Computational modeling of RNA 3D structure based on experimental data. Biosci Rep 2019; 39:BSR20180430. [PMID: 30670629 PMCID: PMC6367127 DOI: 10.1042/bsr20180430] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Revised: 01/19/2019] [Accepted: 01/21/2019] [Indexed: 01/02/2023] Open
Abstract
RNA molecules are master regulators of cells. They are involved in a variety of molecular processes: they transmit genetic information, sense cellular signals and communicate responses, and even catalyze chemical reactions. As in the case of proteins, RNA function is dictated by its structure and by its ability to adopt different conformations, which in turn is encoded in the sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore the majority of known RNAs remain structurally uncharacterized. To address this problem, predictive computational methods were developed based on the accumulated knowledge of RNA structures determined so far, the physical basis of the RNA folding, and taking into account evolutionary considerations, such as conservation of functionally important motifs. However, all theoretical methods suffer from various limitations, and they are generally unable to accurately predict structures for RNA sequences longer than 100-nt residues unless aided by additional experimental data. In this article, we review experimental methods that can generate data usable by computational methods, as well as computational approaches for RNA structure prediction that can utilize data from experimental analyses. We outline methods and data types that can be potentially useful for RNA 3D structure modeling but are not commonly used by the existing software, suggesting directions for future development.
Collapse
Affiliation(s)
- Almudena Ponce-Salvatierra
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Astha
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Katarzyna Merdas
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Pritha Ghosh
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, Poznan PL-61-614, Poland
| |
Collapse
|
28
|
Abstract
The necessity for accurate and computationally efficient representations of water in atomistic simulations that can span biologically relevant timescales has born the necessity of coarse-grained (CG) modeling. Despite numerous advances, CG water models rely mostly on a-priori specified assumptions. How these assumptions affect the model accuracy, efficiency, and in particular transferability, has not been systematically investigated. Here we propose a data driven comparison and selection for CG water models through a Hierarchical Bayesian framework. We examine CG water models that differ in their level of coarse-graining, structure, and number of interaction sites. We find that the importance of electrostatic interactions for the physical system under consideration is a dominant criterion for the model selection. Multi-site models are favored, unless the effects of water in electrostatic screening are not relevant, in which case the single site model is preferred due to its computational savings. The charge distribution is found to play an important role in the multi-site model’s accuracy while the flexibility of the bonds/angles may only slightly improve the models. Furthermore, we find significant variations in the computational cost of these models. We present a data informed rationale for the selection of CG water models and provide guidance for future water model designs.
Collapse
|
29
|
Dans PD, Gallego D, Balaceanu A, Darré L, Gómez H, Orozco M. Modeling, Simulations, and Bioinformatics at the Service of RNA Structure. Chem 2019. [DOI: 10.1016/j.chempr.2018.09.015] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
30
|
Smith LG, Tan Z, Spasic A, Dutta D, Salas-Estrada LA, Grossfield A, Mathews DH. Chemically Accurate Relative Folding Stability of RNA Hairpins from Molecular Simulations. J Chem Theory Comput 2018; 14:6598-6612. [PMID: 30375860 DOI: 10.1021/acs.jctc.8b00633] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
To benchmark RNA force fields, we compared the folding stabilities of three 12-nucleotide hairpin stem loops estimated by simulation to stabilities determined by experiment. We used umbrella sampling and a reaction coordinate of end-to-end (5' to 3' hydroxyl oxygen) distance to estimate the free energy change of the transition from the native conformation to a fully extended conformation with no hydrogen bonds between non-neighboring bases. Each simulation was performed four times using the AMBER FF99+bsc0+χOL3 force field, and each window, spaced at 1 Å intervals, was sampled for 1 μs, for a total of 552 μs of simulation. We compared differences in the simulated free energy changes to analogous differences in free energies from optical melting experiments using thermodynamic cycles where the free energy change between stretched and random coil sequences is assumed to be sequence-independent. The differences between experimental and simulated ΔΔ G° are, on average, 0.98 ± 0.66 kcal/mol, which is chemically accurate and suggests that analogous simulations could be used predictively. We also report a novel method to identify where replica free energies diverge along a reaction coordinate, thus indicating where additional sampling would most improve convergence. We conclude by discussing methods to more economically perform these simulations.
Collapse
Affiliation(s)
- Louis G Smith
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States.,Center for RNA Biology , University of Rochester , Rochester , New York 14642 , United States
| | - Zhen Tan
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States.,Center for RNA Biology , University of Rochester , Rochester , New York 14642 , United States
| | - Aleksandar Spasic
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States.,Center for RNA Biology , University of Rochester , Rochester , New York 14642 , United States
| | - Debapratim Dutta
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States.,Center for RNA Biology , University of Rochester , Rochester , New York 14642 , United States
| | - Leslie A Salas-Estrada
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States
| | - Alan Grossfield
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States
| | - David H Mathews
- Department of Biochemistry & Biophysics , University of Rochester , Rochester , New York 14642 , United States.,Department of Biostatistics and Computational Biology , University of Rochester , Rochester , New York 14642 , United States.,Center for RNA Biology , University of Rochester , Rochester , New York 14642 , United States
| |
Collapse
|
31
|
Jin L, Shi YZ, Feng CJ, Tan YL, Tan ZJ. Modeling Structure, Stability, and Flexibility of Double-Stranded RNAs in Salt Solutions. Biophys J 2018; 115:1403-1416. [PMID: 30236782 DOI: 10.1016/j.bpj.2018.08.030] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 08/10/2018] [Accepted: 08/24/2018] [Indexed: 11/16/2022] Open
Abstract
Double-stranded (ds) RNAs play essential roles in many processes of cell metabolism. The knowledge of three-dimensional (3D) structure, stability, and flexibility of dsRNAs in salt solutions is important for understanding their biological functions. In this work, we further developed our previously proposed coarse-grained model to predict 3D structure, stability, and flexibility for dsRNAs in monovalent and divalent ion solutions through involving an implicit structure-based electrostatic potential. The model can make reliable predictions for 3D structures of extensive dsRNAs with/without bulge/internal loops from their sequences, and the involvement of the structure-based electrostatic potential and corresponding ion condition can improve the predictions for 3D structures of dsRNAs in ion solutions. Furthermore, the model can make good predictions for thermal stability for extensive dsRNAs over the wide range of monovalent/divalent ion concentrations, and our analyses show that the thermally unfolding pathway of dsRNA is generally dependent on its length as well as its sequence. In addition, the model was employed to examine the salt-dependent flexibility of a dsRNA helix, and the calculated salt-dependent persistence lengths are in good accordance with experiments.
Collapse
Affiliation(s)
- Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Chen-Jie Feng
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China.
| |
Collapse
|
32
|
Shi YZ, Jin L, Feng CJ, Tan YL, Tan ZJ. Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions. PLoS Comput Biol 2018; 14:e1006222. [PMID: 29879103 PMCID: PMC6007934 DOI: 10.1371/journal.pcbi.1006222] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 06/19/2018] [Accepted: 05/22/2018] [Indexed: 01/30/2023] Open
Abstract
RNA pseudoknots are a kind of minimal RNA tertiary structural motifs, and their three-dimensional (3D) structures and stability play essential roles in a variety of biological functions. Therefore, to predict 3D structures and stability of RNA pseudoknots is essential for understanding their functions. In the work, we employed our previously developed coarse-grained model with implicit salt to make extensive predictions and comprehensive analyses on the 3D structures and stability for RNA pseudoknots in monovalent/divalent ion solutions. The comparisons with available experimental data show that our model can successfully predict the 3D structures of RNA pseudoknots from their sequences, and can also make reliable predictions for the stability of RNA pseudoknots with different lengths and sequences over a wide range of monovalent/divalent ion concentrations. Furthermore, we made comprehensive analyses on the unfolding pathway for various RNA pseudoknots in ion solutions. Our analyses for extensive pseudokonts and the wide range of monovalent/divalent ion concentrations verify that the unfolding pathway of RNA pseudoknots is mainly dependent on the relative stability of unfolded intermediate states, and show that the unfolding pathway of RNA pseudoknots can be significantly modulated by their sequences and solution ion conditions.
Collapse
Affiliation(s)
- Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Lei Jin
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Chen-Jie Feng
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| |
Collapse
|
33
|
Šponer J, Bussi G, Krepl M, Banáš P, Bottaro S, Cunha RA, Gil-Ley A, Pinamonti G, Poblete S, Jurečka P, Walter NG, Otyepka M. RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview. Chem Rev 2018; 118:4177-4338. [PMID: 29297679 PMCID: PMC5920944 DOI: 10.1021/acs.chemrev.7b00427] [Citation(s) in RCA: 386] [Impact Index Per Article: 55.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Indexed: 12/14/2022]
Abstract
With both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules. We begin with an in-depth, evaluatory coverage of the most fundamental methodological challenges that set the basis for the future development of the field, in particular, the current developments and inherent physical limitations of the atomistic force fields and the recent advances in a broad spectrum of enhanced sampling methods. We also survey the closely related field of coarse-grained modeling of RNA systems. After dealing with the methodological aspects, we provide an exhaustive overview of the available RNA simulation literature, ranging from studies of the smallest RNA oligonucleotides to investigations of the entire ribosome. Our review encompasses tetranucleotides, tetraloops, a number of small RNA motifs, A-helix RNA, kissing-loop complexes, the TAR RNA element, the decoding center and other important regions of the ribosome, as well as assorted others systems. Extended sections are devoted to RNA-ion interactions, ribozymes, riboswitches, and protein/RNA complexes. Our overview is written for as broad of an audience as possible, aiming to provide a much-needed interdisciplinary bridge between computation and experiment, together with a perspective on the future of the field.
Collapse
Affiliation(s)
- Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences , Kralovopolska 135 , Brno 612 65 , Czech Republic
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences , Kralovopolska 135 , Brno 612 65 , Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Pavel Banáš
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Sandro Bottaro
- Structural Biology and NMR Laboratory, Department of Biology , University of Copenhagen , Copenhagen 2200 , Denmark
| | - Richard A Cunha
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Alejandro Gil-Ley
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Giovanni Pinamonti
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Simón Poblete
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Petr Jurečka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Nils G Walter
- Single Molecule Analysis Group and Center for RNA Biomedicine, Department of Chemistry , University of Michigan , Ann Arbor , Michigan 48109 , United States
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| |
Collapse
|
34
|
Zhang D, Chen SJ. IsRNA: An Iterative Simulated Reference State Approach to Modeling Correlated Interactions in RNA Folding. J Chem Theory Comput 2018; 14:2230-2239. [PMID: 29499114 DOI: 10.1021/acs.jctc.7b01228] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Coarse-grained RNA folding models promise great potential for RNA structure prediction. A key component in a coarse-grained folding model is the force field. One of the challenges in the coarse-grained force field calculation is how to treat the correlation between the different degrees of freedoms. Here, we describe a new approach (IsRNA) to extract the correlated energy functions from the known structures. Through iterative molecular dynamics simulations, we build the correlation effects into the reference states, from which we extract the energy functions. The validity of IsRNA is supported by the close agreement between the simulated Boltzmann-like probability distributions for all the structure parameters and those observed from the experimentally determined structures. The correlated energy functions derived here may provide a new tool for RNA 3D structure prediction.
Collapse
Affiliation(s)
- Dong Zhang
- Department of Physics, Department of Biochemistry, and MU Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and MU Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| |
Collapse
|