1
|
Krishnan S, Roy A, Wong L, Gromiha M. DRLiPS: a novel method for prediction of druggable RNA-small molecule binding pockets using machine learning. Nucleic Acids Res 2025; 53:gkaf239. [PMID: 40173014 PMCID: PMC11963762 DOI: 10.1093/nar/gkaf239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Revised: 02/16/2025] [Accepted: 03/14/2025] [Indexed: 04/04/2025] Open
Abstract
Ribonucleic Acid (RNA) is the central conduit for information transfer in the cell. Identifying potential RNA targets in disease conditions is a challenging task, given the vast repertoire of functional non-coding RNAs in a human cell. A potential druggable target must satisfy several criteria, including disease association, cellular accessibility, binding pockets for drug-like molecules, and minimal cross-reactivity. While several methods exist for prediction of druggable proteins, they cannot be repurposed for RNAs due to fundamental differences in their binding modality. Taking all these constraints into account, a new structure-based model, Druggable RNA-Ligand binding Pocket Selector (DRLiPS), is developed here to predict binding site-level druggability of any given RNA target. A novel strategy for sampling negative binding sites in RNA structures using three parallel approaches is demonstrated here to improve model specificity: backbone motif search, exhaustive pocket prediction, and blind docking. An external blind test dataset has also been curated to showcase the model's generalizability to both experimental and modelled apo state RNA structures. DRLiPS has achieved an F1-score of 0.70, precision of 0.61, specificity of 0.89, and recall of 0.73 on this external test dataset, outperforming two existing methods, DrugPred_RNA and RNACavityMiner. Further analysis indicates that the features selected for model-building generalize well to both apo and holo states with a backbone RMSD tolerance of 3 Å. It can also predict the effect of binding site single point mutations on druggability, which can aid in optimizing synthetic RNA aptamers for small molecule recognition. The DRLiPS model is freely accessible at https://web.iitm.ac.in/bioinfo2/DRLiPS/.
Collapse
Affiliation(s)
- Sowmya Ramaswamy Krishnan
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
- TCS Research (Life Sciences division), Tata Consultancy Services, Hyderabad 500081, India
| | - Arijit Roy
- TCS Research (Life Sciences division), Tata Consultancy Services, Hyderabad 500081, India
| | - Limsoon Wong
- Department of Computer Science, National University of Singapore, 117417, Singapore
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
- Department of Computer Science, National University of Singapore, 117417, Singapore
| |
Collapse
|
2
|
Zhou Y, Chen SJ. Advances in machine-learning approaches to RNA-targeted drug design. ARTIFICIAL INTELLIGENCE CHEMISTRY 2024; 2:100053. [PMID: 38434217 PMCID: PMC10904028 DOI: 10.1016/j.aichem.2024.100053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2024]
Abstract
RNA molecules play multifaceted functional and regulatory roles within cells and have garnered significant attention in recent years as promising therapeutic targets. With remarkable successes achieved by artificial intelligence (AI) in different fields such as computer vision and natural language processing, there is a growing imperative to harness AI's potential in computer-aided drug design (CADD) to discover novel drug compounds that target RNA. Although machine-learning (ML) approaches have been widely adopted in the discovery of small molecules targeting proteins, the application of ML approaches to model interactions between RNA and small molecule is still in its infancy. Compared to protein-targeted drug discovery, the major challenges in ML-based RNA-targeted drug discovery stem from the scarcity of available data resources. With the growing interest and the development of curated databases focusing on interactions between RNA and small molecule, the field anticipates a rapid growth and the opening of a new avenue for disease treatment. In this review, we aim to provide an overview of recent advancements in computationally modeling RNA-small molecule interactions within the context of RNA-targeted drug discovery, with a particular emphasis on methodologies employing ML techniques.
Collapse
Affiliation(s)
- Yuanzhe Zhou
- Department of Physics and Astronomy, University of Missouri, Columbia, MO 65211-7010, USA
| | - Shi-Jie Chen
- Department of Physics and Astronomy, Department of Biochemistry, Institute of Data Sciences and Informatics, University of Missouri, Columbia, MO 65211-7010, USA
| |
Collapse
|
3
|
Temml V, Kutil Z. Structure-based molecular modeling in SAR analysis and lead optimization. Comput Struct Biotechnol J 2021; 19:1431-1444. [PMID: 33777339 PMCID: PMC7979990 DOI: 10.1016/j.csbj.2021.02.018] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Revised: 02/21/2021] [Accepted: 02/23/2021] [Indexed: 12/13/2022] Open
Abstract
In silico methods like molecular docking and pharmacophore modeling are established strategies in lead identification. Their successful application for finding new active molecules for a target is reported by a plethora of studies. However, once a potential lead is identified, lead optimization, with the focus on improving potency, selectivity, or pharmacokinetic parameters of a parent compound, is a much more complex task. Even though in silico molecular modeling methods could contribute a lot of time and cost-saving by rationally filtering synthetic optimization options, they are employed less widely in this stage of research. In this review, we highlight studies that have successfully used computer-aided SAR analysis in lead optimization and want to showcase sound methodology and easily accessible in silico tools for this purpose.
Collapse
Affiliation(s)
- Veronika Temml
- Institute of Pharmacy, Department of Pharmaceutical and Medicinal Chemistry, Paracelsus Medical University Salzburg, Strubergasse 21, 5020 Salzburg, Austria
| | - Zsofia Kutil
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Vestec, Czech Republic
| |
Collapse
|
4
|
Stefaniak F, Bujnicki JM. AnnapuRNA: A scoring function for predicting RNA-small molecule binding poses. PLoS Comput Biol 2021; 17:e1008309. [PMID: 33524009 PMCID: PMC7877745 DOI: 10.1371/journal.pcbi.1008309] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Revised: 02/11/2021] [Accepted: 12/16/2020] [Indexed: 11/22/2022] Open
Abstract
RNA is considered as an attractive target for new small molecule drugs. Designing active compounds can be facilitated by computational modeling. Most of the available tools developed for these prediction purposes, such as molecular docking or scoring functions, are parametrized for protein targets. The performance of these methods, when applied to RNA-ligand systems, is insufficient. To overcome these problems, we developed AnnapuRNA, a new knowledge-based scoring function designed to evaluate RNA-ligand complex structures, generated by any computational docking method. We also evaluated three main factors that may influence the structure prediction, i.e., the starting conformer of a ligand, the docking program, and the scoring function used. We applied the AnnapuRNA method for a post-hoc study of the recently published structures of the FMN riboswitch. Software is available at https://github.com/filipspl/AnnapuRNA. Drug development is a lengthy and complicated process, which requires costly experiments on a very large number of chemical compounds. The identification of chemical molecules with desired properties can be facilitated by computational methods. Several methods were developed for computer-aided design of drugs that target protein molecules. However, recently the ribonucleic acid (RNA) emerged as an attractive target for the development of new drugs. Unfortunately, the portfolio of the computer methods that can be applied to study RNA and its interactions with small chemical molecules is very limited. This situation motivated us to develop a new computational method, with which to predict RNA-small molecule interactions. To this end, we collected the information on the statistics of interactions in experimentally determined structures of complexes formed by RNA with small molecules. We then used the statistical data to train machine learning methods aiming to distinguish between RNA-ligand interactions observed experimentally and other interactions that can be observed in theoretical analyses, but are not observed in nature. The resulting method called AnnapuRNA is superior to other similar tools and can be used to predict preferred ligands of RNA molecules and how RNA and small molecules interact with each other.
Collapse
Affiliation(s)
- Filip Stefaniak
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
- * E-mail: (FS); (JMB)
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Poznan, Poland
- * E-mail: (FS); (JMB)
| |
Collapse
|
5
|
Feng Y, Huang SY. ITScore-NL: An Iterative Knowledge-Based Scoring Function for Nucleic Acid-Ligand Interactions. J Chem Inf Model 2020; 60:6698-6708. [PMID: 33291885 DOI: 10.1021/acs.jcim.0c00974] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Nucleic acid-ligand complexes underlie numerous cellular processes, such as gene function expression and regulation, in which their three-dimensional structures are important to understand their functions and thus to develop therapeutic interventions. Given the high cost and technical difficulties in experimental methods, computational methods such as molecular docking have been actively used to investigate nucleic acid-ligand interactions in which an accurate scoring function is crucial. However, because of the limited number of experimental nucleic acid-ligand binding data and structures, the scoring function development for nucleic acid-ligand interactions falls far behind that for protein-protein and protein-ligand interactions. Here, based on our statistical mechanics-based iterative approach, we have developed an iterative knowledge-based scoring function for nucleic acid-ligand interactions, named as ITScore-NL, by explicitly including stacking and electrostatic potentials. Our ITScore-NL scoring function was extensively evaluated for its ability in the binding mode and binding affinity predictions on three diverse test sets and compared with state-of-the-art scoring functions. Overall, ITScore-NL obtained significantly better performance than the other 12 scoring functions and predicted near-native poses with rmsd ≤ 1.5 Å for 71.43% of the cases when the top three binding modes were considered and a good correlation of R = 0.64 in binding affinity prediction on the large test set of 77 nucleic acid-ligand complexes. These results suggested the accuracy of ITScore-NL and the necessity of explicitly including stacking and electrostatic potentials.
Collapse
Affiliation(s)
- Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
6
|
Badu S, Melnik R, Singh S. Mathematical and computational models of RNA nanoclusters and their applications in data-driven environments. MOLECULAR SIMULATION 2020. [DOI: 10.1080/08927022.2020.1804564] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Shyam Badu
- MS2Discovery Interdisciplinary Research Institute, Wilfrid Laurier University, Waterloo, Ontario, Canada
| | - Roderick Melnik
- MS2Discovery Interdisciplinary Research Institute, Wilfrid Laurier University, Waterloo, Ontario, Canada
- BCAM-Basque Center for Applied Mathematics, Bilbao, Spain
| | - Sundeep Singh
- MS2Discovery Interdisciplinary Research Institute, Wilfrid Laurier University, Waterloo, Ontario, Canada
| |
Collapse
|
7
|
Bezerra KS, Fulco UL, Esmaile SC, Lima Neto JX, Machado LD, Freire VN, Albuquerque EL, Oliveira JIN. Ribosomal RNA-Aminoglycoside Hygromycin B Interaction Energy Calculation within a Density Functional Theory Framework. J Phys Chem B 2019; 123:6421-6429. [PMID: 31283875 DOI: 10.1021/acs.jpcb.9b04468] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
We intend to investigate the drug-binding energy of each nucleotide inside the aminoglycoside hygromycin B (hygB) binding site of 30S ribosomal RNA (rRNA) subunit by using the molecular fractionation with conjugate caps (MFCC) strategy based on the density functional theory (DFT), considering the functional LDA/PWC, OBS, and the dielectric constant parametrization. Aminoglycosides are bactericidal antibiotics that have high affinity to the prokaryotic rRNA, inhibiting the synthesis of proteins by acting on the main stages of the translation mechanism, whereas binding to rRNA 16S, a component of the 30S ribosomal subunit in prokaryotes. The identification of the nucleotides presenting the most negative binding energies allows us to stabilize hygB in a suitable binding pocket of the 30S ribosomal subunit. In addition, it should be highlighted that mutations in these residues may probably lead to resistance to ribosome-targeting antibiotics. Quantum calculations of aminoglycoside hygromycin B-ribosome complex might contribute to further quantum studies with antibiotics like macrolides and other aminoglycosides.
Collapse
Affiliation(s)
- Katyanna S Bezerra
- Departamento de Biofísica e Farmacologia , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| | - Umberto L Fulco
- Departamento de Biofísica e Farmacologia , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| | - Stephany C Esmaile
- Departamento de Biofísica e Farmacologia , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| | - José X Lima Neto
- Departamento de Biofísica e Farmacologia , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| | - Leonardo D Machado
- Departamento de Física Teórica e Experimental , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| | - Valder N Freire
- Departamento de Física , Universidade Federal do Ceará , 60455-760 Fortaleza-CE , Brazil
| | - Eudenilson L Albuquerque
- Departamento de Biofísica e Farmacologia , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| | - Jonas I N Oliveira
- Departamento de Biofísica e Farmacologia , Universidade Federal do Rio Grande do Norte , 59072-970 Natal-RN , Brazil
| |
Collapse
|
8
|
Challenges and current status of computational methods for docking small molecules to nucleic acids. Eur J Med Chem 2019; 168:414-425. [PMID: 30831409 DOI: 10.1016/j.ejmech.2019.02.046] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 02/12/2019] [Accepted: 02/12/2019] [Indexed: 01/29/2023]
Abstract
Since the development of the first docking program in 1982, the use of docking-based in silico screening for potentially bioactive molecule discovery has become a common strategy in academia and pharmaceutical industry. Up until recently, application of docking programs has largely focused on drugs binding to proteins. However, with the discovery of promising drug targets in nucleic acids, including RNA riboswitches, DNA G-quadruplexes, and extended repeats in RNA, there has been greater interests in developing drugs for nucleic acids. However, due to major biochemical and physical differences in charges, binding pockets, and solvation, existing docking programs, developed for proteins, face difficulties when adopted directly for nucleic acids. In this review, we cover the current field of in silico docking to nucleic acids, available programs, as well as challenges faced in the field.
Collapse
|
9
|
Jedrzejczyk D, Gendaszewska-Darmach E, Pawlowska R, Chworos A. Designing synthetic RNA for delivery by nanoparticles. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2017; 29:123001. [PMID: 28004640 DOI: 10.1088/1361-648x/aa5561] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
The rapid development of synthetic biology and nanobiotechnology has led to the construction of various synthetic RNA nanoparticles of different functionalities and potential applications. As they occur naturally, nucleic acids are an attractive construction material for biocompatible nanoscaffold and nanomachine design. In this review, we provide an overview of the types of RNA and nucleic acid's nanoparticle design, with the focus on relevant nanostructures utilized for gene-expression regulation in cellular models. Structural analysis and modeling is addressed along with the tools available for RNA structural prediction. The functionalization of RNA-based nanoparticles leading to prospective applications of such constructs in potential therapies is shown. The route from the nanoparticle design and modeling through synthesis and functionalization to cellular application is also described. For a better understanding of the fate of targeted RNA after delivery, an overview of RNA processing inside the cell is also provided.
Collapse
Affiliation(s)
- Dominika Jedrzejczyk
- Centre of Molecular and Macromolecular Studies, Polish Academy of Sciences, Sienkiewicza 112, 90-363 Lodz, Poland
| | | | | | | |
Collapse
|
10
|
Structure-Based Discovery of Small Molecules Binding to RNA. TOPICS IN MEDICINAL CHEMISTRY 2017. [DOI: 10.1007/7355_2016_29] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
11
|
Dawson WK, Bujnicki JM. Computational modeling of RNA 3D structures and interactions. Curr Opin Struct Biol 2015; 37:22-8. [PMID: 26689764 DOI: 10.1016/j.sbi.2015.11.007] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Revised: 11/11/2015] [Accepted: 11/12/2015] [Indexed: 11/25/2022]
Abstract
RNA molecules have key functions in cellular processes beyond being carriers of protein-coding information. These functions are often dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is difficult, which has prompted the development of computational methods for structure prediction from sequence. Recent progress in 3D structure modeling of RNA and emerging approaches for predicting RNA interactions with ions, ligands and proteins have been stimulated by successes in protein 3D structure modeling.
Collapse
Affiliation(s)
- Wayne K Dawson
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, 02-109 Warsaw, Poland; Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, 61-614 Poznan, Poland.
| |
Collapse
|