1
|
Khan N, Rahaman M, Zhang S. GINClus: RNA structural motif clustering using graph isomorphism network. NAR Genom Bioinform 2025; 7:lqaf050. [PMID: 40290315 PMCID: PMC12034103 DOI: 10.1093/nargab/lqaf050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Revised: 04/02/2025] [Accepted: 04/15/2025] [Indexed: 04/30/2025] Open
Abstract
Ribonucleic acid (RNA) structural motif identification is a crucial step for understanding RNA structure and functionality. Due to the complexity and variations of RNA 3D structures, identifying RNA structural motifs is challenging and time-consuming. Particularly, discovering new RNA structural motif families is a hard problem and still largely depends on manual analysis. In this paper, we proposed an RNA structural motif clustering tool, named GINClus, which uses a semi-supervised deep learning model to cluster RNA motif candidates (RNA loop regions) based on both base interaction and 3D structure similarities. GINClus converts base interactions and 3D structures of RNA motif candidates into graph representations and using graph isomorphism network (GIN) model in combination with K-means and hierarchical agglomerative clustering, GINClus clusters the RNA motif candidates based on their structural similarities. GINClus has a clustering accuracy of 87.88% for known internal loop motifs and 97.69% for known hairpin loop motifs. Using GINClus, we successfully clustered the motifs of the same families together and were able to find 927 new instances of Sarcin-ricin, Kink-turn, Tandem-shear, Hook-turn, E-loop, C-loop, T-loop, and GNRA loop motif families. We also identified 12 new RNA structural motif families with unique structure and base-pair interactions.
Collapse
Affiliation(s)
- Nabila Shahnaz Khan
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, United States
| | - Md Mahfuzur Rahaman
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, United States
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, United States
| |
Collapse
|
2
|
Aja PM, Agu PC, Ogbu C, Alum EU, Fasogbon IV, Musyoka AM, Ngwueche W, Egwu CO, Tusubira D, Ross K. RNA research for drug discovery: Recent advances and critical insight. Gene 2025; 947:149342. [PMID: 39983851 DOI: 10.1016/j.gene.2025.149342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 02/12/2025] [Accepted: 02/16/2025] [Indexed: 02/23/2025]
Abstract
The field of RNA research has experienced significant changes and is now at the forefront of contemporary drug development. This narrative overview explores the scientific developments and historical turning points in RNA research, emphasising the field's critical significance in the development of novel therapeutics. Important discoveries like antisense oligonucleotides (ASOs), mRNA therapies, and RNA interference (RNAi) have created novel treatment options that can be targeted, such as the ground-breaking mRNA vaccinations against COVID-19. Advances in high-throughput sequencing, single-cell RNA sequencing, and epitranscriptomics have further unravelled the complexity of RNA biology, shedding light on the intricacies of gene regulation and cellular diversity. The integration of computational tools and bioinformatics has propelled the identification of RNA-based biomarkers and the development of RNA therapeutics. Despite significant progress, challenges such as RNA stability, delivery, and off-target effects persist, necessitating continuous innovation and ethical considerations. This review provides a critical insight into the current state and prospects of RNA research, emphasising its transformative potential in drug discovery. By examining the interplay between technological advancements and therapeutic applications, we underscore the promising horizon for RNA-based interventions in treating a myriad of diseases, marking a new era in precision medicine.
Collapse
Affiliation(s)
- Patrick Maduabuchi Aja
- Biochemistry Department, Biomedical Sciences Faculty, Kampala International University, P.O. Box Ishaka, Bushenyi, Uganda; Biochemistry Department, Faculty of Science, Ebonyi State University, P.M.B. 053 Abakaliki, Ebonyi State, Nigeria.
| | - Peter Chinedu Agu
- Biochemistry Department, Faculty of Science, Ebonyi State University, P.M.B. 053 Abakaliki, Ebonyi State, Nigeria; Department of Biochemistry, Faculty of Science, Evangel University, Nigeria
| | - Celestine Ogbu
- Department of Biochemistry, Faculty of Basic Medical Sciences, Federal University of Health Sciences, Otukpo, Nigeria
| | - Esther Ugo Alum
- Publications and Extension Department, Kampala International University, P. O. Box 20000, Uganda; Biochemistry Department, Faculty of Science, Ebonyi State University, P.M.B. 053 Abakaliki, Ebonyi State, Nigeria
| | - Ilemobayo Victor Fasogbon
- Biochemistry Department, Biomedical Sciences Faculty, Kampala International University, P.O. Box Ishaka, Bushenyi, Uganda
| | - Angela Mumbua Musyoka
- Biochemistry Department, Biomedical Sciences Faculty, Kampala International University, P.O. Box Ishaka, Bushenyi, Uganda
| | - Wisdom Ngwueche
- Department of Biochemistry, Faculty of Biological Sciences, University of Nigeria, Nsukka, Enugu State, Nigeria
| | - Chinedu Ogbonia Egwu
- Department of Biochemistry, Faculty of Basic Medical Sciences, Alex Ekwueme Federal University, Ndufu-Alike, Ikwo, Ebonyi State, Nigeria
| | - Deusdedit Tusubira
- Department of Biochemistry, Faculty of Medicine, Mbarara University of Science and Technology, Mbarara, Uganda
| | - Kehinde Ross
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, United Kingdom; Institute for Health Research, Liverpool John Moores University, Liverpool, United Kingdom
| |
Collapse
|
3
|
Zhang H, Ding Y. RNA Structure: Function and Application in Plant Biology. ANNUAL REVIEW OF PLANT BIOLOGY 2025; 76:115-141. [PMID: 40101225 DOI: 10.1146/annurev-arplant-083123-055521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/20/2025]
Abstract
RNA orchestrates intricate structures that influence gene expression and protein production in all living organisms, with implications for fundamental biology, medicine, and agriculture. Although extensive research has been conducted on RNA biology, many regulatory mechanisms remain elusive due to the complex and dynamic nature of RNA structures and past technological limitations. Recent advancements in RNA structure technology have revolutionized plant RNA biology research. Here, we review cutting-edge technologies for studying RNA structures in plants and their functional significance in diverse biological processes. Additionally, we highlight the pivotal role of RNA structure in influencing plant growth, development, and responses to environmental stresses. We also discuss the potential evolutionary significance of RNA structure in natural adaptation and crop domestication. Finally, we propose leveraging RNA structure-mediated gene regulation as an innovative strategy to bolster plant resilience against climate change.
Collapse
Affiliation(s)
- Huakun Zhang
- Key Laboratory of Molecular Epigenetics of the Ministry of Education, Northeast Normal University, Changchun, China;
| | - Yiliang Ding
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich, United Kingdom;
| |
Collapse
|
4
|
Baburaj A, Jayadevan S, Aliyana AK, SK NK, Stylios GK. AI-Driven TENGs for Self-Powered Smart Sensors and Intelligent Devices. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2417414. [PMID: 40277838 PMCID: PMC12120734 DOI: 10.1002/advs.202417414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2025] [Revised: 03/20/2025] [Indexed: 04/26/2025]
Abstract
Triboelectric nanogenerators (TENGs) are emerging as transformative technologies for sustainable energy harvesting and precision sensing, offering eco-friendly power generation from mechanical motion. They harness mechanical energy while enabling self-sustaining sensing for self-powered devices. However, challenges such as material optimization, fabrication techniques, design strategies, and output stability must be addressed to fully realize their practical potential. Artificial intelligence (AI), with its capabilities in advanced data analysis, pattern recognition, and adaptive responses, is revolutionizing fields like healthcare, industrial automation, and smart infrastructure. When integrated with TENGs, AI can overcome current limitations by enhancing output, stability, and adaptability. This review explores the synergistic potential of AI-driven TENG systems, from optimizing materials and fabrication to embedding machine learning and deep learning algorithms for intelligent real-time sensing. These advancements enable improved energy harvesting, predictive maintenance, and dynamic performance optimization, making TENGs more practical across industries. The review also identifies key challenges and future research directions, including the development of low-power AI algorithms, sustainable materials, hybrid energy systems, and robust security protocols for AI-enhanced TENG solutions.
Collapse
Affiliation(s)
- Aiswarya Baburaj
- Department of ElectronicsMangalore UniversityMangalore574199India
| | - Syamini Jayadevan
- Research Institute for Flexible MaterialsSchool of Textiles and DesignHeriot‐Watt UniversityNetherdaleGalashielsTD1 3HFUnited Kingdom of Great Britain and Northern Ireland
| | - Akshaya Kumar Aliyana
- Research Institute for Flexible MaterialsSchool of Textiles and DesignHeriot‐Watt UniversityNetherdaleGalashielsTD1 3HFUnited Kingdom of Great Britain and Northern Ireland
| | - Naveen Kumar SK
- Department of ElectronicsMangalore UniversityMangalore574199India
| | - George K Stylios
- Research Institute for Flexible MaterialsSchool of Textiles and DesignHeriot‐Watt UniversityNetherdaleGalashielsTD1 3HFUnited Kingdom of Great Britain and Northern Ireland
| |
Collapse
|
5
|
Chaturvedi M, Rashid MA, Paliwal KK. RNA structure prediction using deep learning - A comprehensive review. Comput Biol Med 2025; 188:109845. [PMID: 39983363 DOI: 10.1016/j.compbiomed.2025.109845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 02/09/2025] [Accepted: 02/10/2025] [Indexed: 02/23/2025]
Abstract
In computational biology, accurate RNA structure prediction offers several benefits, including facilitating a better understanding of RNA functions and RNA-based drug design. Implementing deep learning techniques for RNA structure prediction has led tremendous progress in this field, resulting in significant improvements in prediction accuracy. This comprehensive review aims to provide an overview of the diverse strategies employed in predicting RNA secondary structures, emphasizing deep learning methods. The article categorizes the discussion into three main dimensions: feature extraction methods, existing state-of-the-art learning model architectures, and prediction approaches. We present a comparative analysis of various techniques and models highlighting their strengths and weaknesses. Finally, we identify gaps in the literature, discuss current challenges, and suggest future approaches to enhance model performance and applicability in RNA structure prediction tasks. This review provides a deeper insight into the subject and paves the way for further progress in this dynamic intersection of life sciences and artificial intelligence.
Collapse
Affiliation(s)
- Mayank Chaturvedi
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia.
| | - Mahmood A Rashid
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia.
| | - Kuldip K Paliwal
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia.
| |
Collapse
|
6
|
Wang R, Schlick T. How Large is the Universe of RNA-Like Motifs? A Clustering Analysis of RNA Graph Motifs Using Topological Descriptors. ARXIV 2025:arXiv:2501.04258v1. [PMID: 39867422 PMCID: PMC11760235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/28/2025]
Abstract
Identifying novel and functional RNA structures remains a significant challenge in RNA motif design and is crucial for developing RNA-based therapeutics. Here we introduce a computational topology-based approach with unsupervised machine-learning algorithms to estimate the database size and content of RNA-like graph topologies. Specifically, we apply graph theory enumeration to generate all 110,667 possible 2D dual graphs for vertex numbers ranging from 2 to 9. Among them, only 0.11% (121 dual graphs) correspond to approximately 200,000 known RNA atomic fragments/substructures (collected in 2021) using the RNA-as-Graphs (RAG) mapping method. The remaining 99.89% of the dual graphs may be RNA-like or non-RNA-like. To determine which dual graphs in the 99.89% hypothetical set are more likely to be associated with RNA structures, we apply computational topology descriptors using the Persistent Spectral Graphs (PSG) method to characterize each graph using 19 PSG-based features and use clustering algorithms that partition all possible dual graphs into two clusters. The cluster with the higher percentage of known dual graphs for RNA is defined as the "RNA-like" cluster, while the other is considered as "non-RNA-like". The distance of each dual graph to the center of the RNA-like cluster represents the likelihood of it belonging to RNA structures. From validation, our PSG-based RNA-like cluster includes 97.3% of the 121 known RNA dual graphs, suggesting good performance. Furthermore, 46.017% of the hypothetical RNAs are predicted to be RNA-like. Among the top 15 graphs identified as high-likelihood candidates for novel RNA motifs, 4 were confirmed from the RNA dataset collected in 2022. Significantly, we observe that all the top 15 RNA-like dual graphs can be separated into multiple subgraphs, whereas the top 15 non-RNA-like dual graphs tend not to have any subgraphs (subgraphs preserve pseudoknots and junctions). Moreover, a significant topological difference between top RNA-like and non-RNA-like graphs is evident when comparing their topological features (e.g. Betti-0 and Betti-1 numbers). These findings provide valuable insights into the size of the RNA motif universe and RNA design strategies, offering a novel framework for predicting RNA graph topologies and guiding the discovery of novel RNA motifs, perhaps anti-viral therapeutics by subgraph assembly.
Collapse
Affiliation(s)
- Rui Wang
- Simons Center for Computational Physical Chemistry, New York University, New York, NY 10003, USA
| | - Tamar Schlick
- Simons Center for Computational Physical Chemistry, New York University, New York, NY 10003, USA
- Department of Chemistry, New York University, New York, NY 10003, USA
- Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA
- New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200122, China
| |
Collapse
|
7
|
Kumar A, Dixit S, Srinivasan K, M D, Vincent PMDR. Personalized cancer vaccine design using AI-powered technologies. Front Immunol 2024; 15:1357217. [PMID: 39582860 PMCID: PMC11581883 DOI: 10.3389/fimmu.2024.1357217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Accepted: 09/24/2024] [Indexed: 11/26/2024] Open
Abstract
Immunotherapy has ushered in a new era of cancer treatment, yet cancer remains a leading cause of global mortality. Among various therapeutic strategies, cancer vaccines have shown promise by activating the immune system to specifically target cancer cells. While current cancer vaccines are primarily prophylactic, advancements in targeting tumor-associated antigens (TAAs) and neoantigens have paved the way for therapeutic vaccines. The integration of artificial intelligence (AI) into cancer vaccine development is revolutionizing the field by enhancing various aspect of design and delivery. This review explores how AI facilitates precise epitope design, optimizes mRNA and DNA vaccine instructions, and enables personalized vaccine strategies by predicting patient responses. By utilizing AI technologies, researchers can navigate complex biological datasets and uncover novel therapeutic targets, thereby improving the precision and efficacy of cancer vaccines. Despite the promise of AI-powered cancer vaccines, significant challenges remain, such as tumor heterogeneity and genetic variability, which can limit the effectiveness of neoantigen prediction. Moreover, ethical and regulatory concerns surrounding data privacy and algorithmic bias must be addressed to ensure responsible AI deployment. The future of cancer vaccine development lies in the seamless integration of AI to create personalized immunotherapies that offer targeted and effective cancer treatments. This review underscores the importance of interdisciplinary collaboration and innovation in overcoming these challenges and advancing cancer vaccine development.
Collapse
Affiliation(s)
- Anant Kumar
- School of Bioscience and Technology, Vellore Institute of Technology, Vellore, India
| | - Shriniket Dixit
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India
| | - Kathiravan Srinivasan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India
| | - Dinakaran M
- School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - P. M. Durai Raj Vincent
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, India
| |
Collapse
|
8
|
Cao X, Zhang Y, Ding Y, Wan Y. Identification of RNA structures and their roles in RNA functions. Nat Rev Mol Cell Biol 2024; 25:784-801. [PMID: 38926530 DOI: 10.1038/s41580-024-00748-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/28/2024] [Indexed: 06/28/2024]
Abstract
The development of high-throughput RNA structure profiling methods in the past decade has greatly facilitated our ability to map and characterize different aspects of RNA structures transcriptome-wide in cell populations, single cells and single molecules. The resulting high-resolution data have provided insights into the static and dynamic nature of RNA structures, revealing their complexity as they perform their respective functions in the cell. In this Review, we discuss recent technical advances in the determination of RNA structures, and the roles of RNA structures in RNA biogenesis and functions, including in transcription, processing, translation, degradation, localization and RNA structure-dependent condensates. We also discuss the current understanding of how RNA structures could guide drug design for treating genetic diseases and battling pathogenic viruses, and highlight existing challenges and future directions in RNA structure research.
Collapse
Affiliation(s)
- Xinang Cao
- Stem Cell and Regenerative Biology, Genome Institute of Singapore, Singapore, Singapore
| | - Yueying Zhang
- Department of Cell and Developmental Biology, John Innes Centre, Norwich, UK
| | - Yiliang Ding
- Department of Cell and Developmental Biology, John Innes Centre, Norwich, UK.
| | - Yue Wan
- Stem Cell and Regenerative Biology, Genome Institute of Singapore, Singapore, Singapore.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
9
|
Nedorezova DD, Dubovichenko MV, Kalnin AJ, Nour MAY, Eldeeb AA, Ashmarova AI, Kurbanov GF, Kolpashchikov DM. Cleaving Folded RNA with DNAzyme Agents. Chembiochem 2024; 25:e202300637. [PMID: 37870555 DOI: 10.1002/cbic.202300637] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 10/17/2023] [Accepted: 10/23/2023] [Indexed: 10/24/2023]
Abstract
Cleavage of biological mRNA by DNAzymes (Dz) has been proposed as a variation of oligonucleotide gene therapy (OGT). The design of Dz-based OGT agents includes computational prediction of two RNA-binding arms with low affinity (melting temperatures (Tm ) close to the reaction temperature of 37 °C) to avoid product inhibition and maintain high specificity. However, RNA cleavage might be limited by the RNA binding step especially if the RNA is folded in secondary structures. This calls for the need for two high-affinity RNA-binding arms. In this study, we optimized 10-23 Dz-based OGT agents for cleavage of three RNA targets with different folding energies under multiple turnover conditions in 2 mM Mg2+ at 37 °C. Unexpectedly, one optimized Dz had each RNA-binding arm with a Tm ≥60 °C, without suffering from product inhibition or low selectivity. This phenomenon was explained by the folding of the RNA cleavage products into stable secondary structures. This result suggests that Dz with long (high affinity) RNA-binding arms should not be excluded from the candidate pool for OGT agents. Rather, analysis of the cleavage products' folding should be included in Dz selection algorithms. The Dz optimization workflow should include testing with folded rather than linear RNA substrates.
Collapse
Affiliation(s)
- Daria D Nedorezova
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Mikhail V Dubovichenko
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Arseniy J Kalnin
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Moustapha A Y Nour
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Ahmed A Eldeeb
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Anna I Ashmarova
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Gabdulla F Kurbanov
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
| | - Dmitry M Kolpashchikov
- Laboratory of molecular robotics and biosensor systems, Laboratory of Frontier nucleic acid technologies in gene therapy of cancer, SCAMT Institute, ITMO University, St. Petersburg, 191002, Russian Federation
- Chemistry Department, University of Central Florida, Orlando, FL 32816-2366, USA
- Burnett School of Biomedical Sciences, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
10
|
Metkar M, Pepin CS, Moore MJ. Tailor made: the art of therapeutic mRNA design. Nat Rev Drug Discov 2024; 23:67-83. [PMID: 38030688 DOI: 10.1038/s41573-023-00827-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/09/2023] [Indexed: 12/01/2023]
Abstract
mRNA medicine is a new and rapidly developing field in which the delivery of genetic information in the form of mRNA is used to direct therapeutic protein production in humans. This approach, which allows for the quick and efficient identification and optimization of drug candidates for both large populations and individual patients, has the potential to revolutionize the way we prevent and treat disease. A key feature of mRNA medicines is their high degree of designability, although the design choices involved are complex. Maximizing the production of therapeutic proteins from mRNA medicines requires a thorough understanding of how nucleotide sequence, nucleotide modification and RNA structure interplay to affect translational efficiency and mRNA stability. In this Review, we describe the principles that underlie the physical stability and biological activity of mRNA and emphasize their relevance to the myriad considerations that factor into therapeutic mRNA design.
Collapse
|
11
|
Tieng FYF, Abdullah-Zawawi MR, Md Shahri NAA, Mohamed-Hussein ZA, Lee LH, Mutalib NSA. A Hitchhiker's guide to RNA-RNA structure and interaction prediction tools. Brief Bioinform 2023; 25:bbad421. [PMID: 38040490 PMCID: PMC10753535 DOI: 10.1093/bib/bbad421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/16/2023] [Accepted: 10/26/2023] [Indexed: 12/03/2023] Open
Abstract
RNA biology has risen to prominence after a remarkable discovery of diverse functions of noncoding RNA (ncRNA). Most untranslated transcripts often exert their regulatory functions into RNA-RNA complexes via base pairing with complementary sequences in other RNAs. An interplay between RNAs is essential, as it possesses various functional roles in human cells, including genetic translation, RNA splicing, editing, ribosomal RNA maturation, RNA degradation and the regulation of metabolic pathways/riboswitches. Moreover, the pervasive transcription of the human genome allows for the discovery of novel genomic functions via RNA interactome investigation. The advancement of experimental procedures has resulted in an explosion of documented data, necessitating the development of efficient and precise computational tools and algorithms. This review provides an extensive update on RNA-RNA interaction (RRI) analysis via thermodynamic- and comparative-based RNA secondary structure prediction (RSP) and RNA-RNA interaction prediction (RIP) tools and their general functions. We also highlighted the current knowledge of RRIs and the limitations of RNA interactome mapping via experimental data. Then, the gap between RSP and RIP, the importance of RNA homologues, the relationship between pseudoknots, and RNA folding thermodynamics are discussed. It is hoped that these emerging prediction tools will deepen the understanding of RNA-associated interactions in human diseases and hasten treatment processes.
Collapse
Affiliation(s)
- Francis Yew Fu Tieng
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur 56000, Malaysia
| | | | - Nur Alyaa Afifah Md Shahri
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur 56000, Malaysia
| | - Zeti-Azura Mohamed-Hussein
- Institute of Systems Biology (INBIOSIS), UKM, Selangor 43600, Malaysia
- Department of Applied Physics, Faculty of Science and Technology, UKM, Selangor 43600, Malaysia
| | - Learn-Han Lee
- Sunway Microbiomics Centre, School of Medical and Life Sciences, Sunway University, Sunway City 47500, Malaysia
- Novel Bacteria and Drug Discovery Research Group, Microbiome and Bioresource Research Strength, Jeffrey Cheah School of Medicine and Health Sciences, Monash University of Malaysia, Selangor 47500, Malaysia
| | - Nurul-Syakima Ab Mutalib
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur 56000, Malaysia
- Novel Bacteria and Drug Discovery Research Group, Microbiome and Bioresource Research Strength, Jeffrey Cheah School of Medicine and Health Sciences, Monash University of Malaysia, Selangor 47500, Malaysia
- Faculty of Health Sciences, UKM, Kuala Lumpur 50300, Malaysia
| |
Collapse
|
12
|
Wang F, Cao H, Xia Q, Liu Z, Wang M, Gao F, Xu D, Deng B, Diao Y, Kapranov P. Lessons from discovery of true ADAR RNA editing sites in a human cell line. BMC Biol 2023; 21:160. [PMID: 37468903 PMCID: PMC10357658 DOI: 10.1186/s12915-023-01651-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 06/20/2023] [Indexed: 07/21/2023] Open
Abstract
BACKGROUND Conversion or editing of adenosine (A) into inosine (I) catalyzed by specialized cellular enzymes represents one of the most common post-transcriptional RNA modifications with emerging connection to disease. A-to-I conversions can happen at specific sites and lead to increase in proteome diversity and changes in RNA stability, splicing, and regulation. Such sites can be detected as adenine-to-guanine sequence changes by next-generation RNA sequencing which resulted in millions reported sites from multiple genome-wide surveys. Nonetheless, the lack of extensive independent validation in such endeavors, which is critical considering the relatively high error rate of next-generation sequencing, leads to lingering questions about the validity of the current compendiums of the editing sites and conclusions based on them. RESULTS Strikingly, we found that the current analytical methods suffer from very high false positive rates and that a significant fraction of sites in the public databases cannot be validated. In this work, we present potential solutions to these problems and provide a comprehensive and extensively validated list of A-to-I editing sites in a human cancer cell line. Our findings demonstrate that most of true A-to-I editing sites in a human cancer cell line are located in the non-coding transcripts, the so-called RNA 'dark matter'. On the other hand, many ADAR editing events occurring in exons of human protein-coding mRNAs, including those that can recode the transcriptome, represent false positives and need to be interpreted with caution. Nonetheless, yet undiscovered authentic ADAR sites that increase the diversity of human proteome exist and warrant further identification. CONCLUSIONS Accurate identification of human ADAR sites remains a challenging problem, particularly for the sites in exons of protein-coding mRNAs. As a result, genome-wide surveys of ADAR editome must still be accompanied by extensive Sanger validation efforts. However, given the vast number of unknown human ADAR sites, there is a need for further developments of the analytical techniques, potentially those that are based on deep learning solutions, in order to provide a quick and reliable identification of the editome in any sample.
Collapse
Affiliation(s)
- Fang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Huifen Cao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
| | - Qiu Xia
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Ziheng Liu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Ming Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Fan Gao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Dongyang Xu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Bolin Deng
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yong Diao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.
| |
Collapse
|
13
|
Khorkova O, Stahl J, Joji A, Volmar CH, Zeier Z, Wahlestedt C. Long non-coding RNA-targeting therapeutics: discovery and development update. Expert Opin Drug Discov 2023; 18:1011-1029. [PMID: 37466388 DOI: 10.1080/17460441.2023.2236552] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 07/03/2023] [Accepted: 07/11/2023] [Indexed: 07/20/2023]
Abstract
INTRODUCTION lncRNAs are major players in regulatory networks orchestrating multiple cellular functions, such as 3D chromosomal interactions, epigenetic modifications, gene expression and others. Due to progress in the development of nucleic acid-based therapeutics, lncRNAs potentially represent easily accessible therapeutic targets. AREAS COVERED Currently, significant efforts are directed at studies that can tap the enormous therapeutic potential of lncRNAs. This review describes recent developments in this field, particularly focusing on clinical applications. EXPERT OPINION Extensive druggable target range of lncRNA combined with high specificity and accelerated development process of nucleic acid-based therapeutics open new prospects for treatment in areas of extreme unmet medical need, such as genetic diseases, aggressive cancers, protein deficiencies, and subsets of common diseases caused by known mutations. Although currently wide acceptance of lncRNA-targeting nucleic acid-based therapeutics is impeded by the need for parenteral or direct-to-CNS administration, development of less invasive techniques and orally available/BBB-penetrant nucleic acid-based therapeutics is showing early successes. Recently, mRNA-based COVID-19 vaccines have demonstrated clinical safety of all aspects of nucleic acid-based therapeutic technology, including multiple chemical modifications of nucleic acids and nanoparticle delivery. These trends position lncRNA-targeting drugs as significant players in the future of drug development, especially in the area of personalized medicine.
Collapse
Affiliation(s)
- Olga Khorkova
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami, Miami, FL, USA
| | - Jack Stahl
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami, Miami, FL, USA
| | - Aswathy Joji
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami, Miami, FL, USA
| | - Claude-Henry Volmar
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami, Miami, FL, USA
| | - Zane Zeier
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami, Miami, FL, USA
| | - Claes Wahlestedt
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami, Miami, FL, USA
| |
Collapse
|
14
|
Marklund E, Ke Y, Greenleaf WJ. High-throughput biochemistry in RNA sequence space: predicting structure and function. Nat Rev Genet 2023; 24:401-414. [PMID: 36635406 DOI: 10.1038/s41576-022-00567-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/08/2022] [Indexed: 01/14/2023]
Abstract
RNAs are central to fundamental biological processes in all known organisms. The set of possible intramolecular interactions of RNA nucleotides defines the range of alternative structural conformations of a specific RNA that can coexist, and these structures enable functional catalytic properties of RNAs and/or their productive intermolecular interactions with other RNAs or proteins. However, the immense combinatorial space of potential RNA sequences has precluded predictive mapping between RNA sequence and molecular structure and function. Recent advances in high-throughput approaches in vitro have enabled quantitative thermodynamic and kinetic measurements of RNA-RNA and RNA-protein interactions, across hundreds of thousands of sequence variations. In this Review, we explore these techniques, how they can be used to understand RNA function and how they might form the foundations of an accurate model to predict the structure and function of an RNA directly from its nucleotide sequence. The experimental techniques and modelling frameworks discussed here are also highly relevant for the sampling of sequence-structure-function space of DNAs and proteins.
Collapse
Affiliation(s)
- Emil Marklund
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Yuxi Ke
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - William J Greenleaf
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
15
|
Yu H, Qi Y, Yang B, Yang X, Ding Y. G4Atlas: a comprehensive transcriptome-wide G-quadruplex database. Nucleic Acids Res 2023; 51:D126-D134. [PMID: 36243987 PMCID: PMC9825586 DOI: 10.1093/nar/gkac896] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/16/2022] [Accepted: 10/03/2022] [Indexed: 01/29/2023] Open
Abstract
RNA G-quadruplex (rG4) is a vital RNA tertiary structure motif that involves the base pairs on both Hoogsteen and Watson-Crick faces of guanines. rG4 is of great importance in the post-transcriptional regulation of gene expression. Experimental technologies have advanced to identify in vitro and in vivo rG4s across diverse transcriptomes. Building on these recent advances, here we present G4Atlas, the first transcriptome-wide G-quadruplex database, in which we have collated, classified, and visualized transcriptome rG4 experimental data, generated from rG4-seq, chemical profiling and ligand-binding methods. Our comprehensive database includes transcriptome-wide rG4s generated from 82 experimental treatments and 238 samples across ten species. In addition, we have also included RNA secondary structure prediction information across both experimentally identified and unidentified rG4s to enable users to display any potential competitive folding between rG4 and RNA secondary structures. As such, G4Atlas will enable users to explore the general functions of rG4s in diverse biological processes. In addition, G4Atlas lays the foundation for further data-driven deep learning algorithms to examine rG4 structural features.
Collapse
Affiliation(s)
- Haopeng Yu
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Yiman Qi
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Bibo Yang
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Xiaofei Yang
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
- CAS-JIC Center of Excellence for Plant and Microbial Sciences (CEPAMS), Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
| | - Yiliang Ding
- Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| |
Collapse
|
16
|
Gumna J, Antczak M, Adamiak RW, Bujnicki JM, Chen SJ, Ding F, Ghosh P, Li J, Mukherjee S, Nithin C, Pachulska-Wieczorek K, Ponce-Salvatierra A, Popenda M, Sarzynska J, Wirecki T, Zhang D, Zhang S, Zok T, Westhof E, Miao Z, Szachniuk M, Rybarczyk A. Computational Pipeline for Reference-Free Comparative Analysis of RNA 3D Structures Applied to SARS-CoV-2 UTR Models. Int J Mol Sci 2022; 23:ijms23179630. [PMID: 36077037 PMCID: PMC9455975 DOI: 10.3390/ijms23179630] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 08/17/2022] [Accepted: 08/20/2022] [Indexed: 01/19/2023] Open
Abstract
RNA is a unique biomolecule that is involved in a variety of fundamental biological functions, all of which depend solely on its structure and dynamics. Since the experimental determination of crystal RNA structures is laborious, computational 3D structure prediction methods are experiencing an ongoing and thriving development. Such methods can lead to many models; thus, it is necessary to build comparisons and extract common structural motifs for further medical or biological studies. Here, we introduce a computational pipeline dedicated to reference-free high-throughput comparative analysis of 3D RNA structures. We show its application in the RNA-Puzzles challenge, in which five participating groups attempted to predict the three-dimensional structures of 5'- and 3'-untranslated regions (UTRs) of the SARS-CoV-2 genome. We report the results of this puzzle and discuss the structural motifs obtained from the analysis. All simulated models and tools incorporated into the pipeline are open to scientific and academic use.
Collapse
Affiliation(s)
- Julita Gumna
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Maciej Antczak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Ryszard W. Adamiak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA
| | - Pritha Ghosh
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Jun Li
- Department of Physics, Department of Biochemistry, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Laboratory of Computational Biology, Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, 02-089 Warsaw, Poland
| | | | - Almudena Ponce-Salvatierra
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Joanna Sarzynska
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Tomasz Wirecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Dong Zhang
- Department of Physics, Department of Biochemistry, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Sicheng Zhang
- Department of Physics, Department of Biochemistry, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Université de Strasbourg, Institut de Biologie Moléculaire et Cellulaire du CNRS, 67084 Strasbourg, France
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence, Department of Anesthesiology, Shanghai Fourth People’s Hospital Affiliated to Tongji University School of Medicine, Shanghai 200081, China
- Correspondence: (Z.M.); (A.R.)
| | - Marta Szachniuk
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Agnieszka Rybarczyk
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
- Correspondence: (Z.M.); (A.R.)
| |
Collapse
|