1
Taghavi A, Springer NA, Zanon PRA, Li Y, Li C, Childs-Disney JL, Disney MD. The evolution and application of RNA-focused small molecule libraries. RSC Chem Biol 2025; 6:510-527. [PMID: 39957993 PMCID: PMC11824871 DOI: 10.1039/d4cb00272e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/11/2024] [Accepted: 02/06/2025] [Indexed: 02/18/2025] Open
Abstract
RNA structure plays a role in nearly every disease. Therefore, approaches that identify tractable small molecule chemical matter that targets RNA and affects its function would transform drug discovery. Despite this potential, discovery of RNA-targeted small molecule chemical probes and medicines remains in its infancy. Advances in RNA-focused libraries are key to enable more successful primary screens and to define structure-activity relationships amongst hit molecules. In this review, we describe how RNA-focused small molecule libraries have been used and evolved over time and provide underlying principles for their application to develop bioactive small molecules. We also describe areas that need further investigation to advance the field, including generation of larger data sets to inform machine learning approaches.
Affiliation(s)
- Amirhossein Taghavi
  - Department of Chemistry, The Herbert Wertheim UF Scripps Institute for Biomedical Innovation and Technology, 130 Scripps Way, Jupiter, FL 33458, USA
- Noah A Springer
  - Department of Chemistry, The Herbert Wertheim UF Scripps Institute for Biomedical Innovation and Technology, 130 Scripps Way, Jupiter, FL 33458, USA
  - Department of Chemistry, The Scripps Research Institute, 130 Scripps Way, Jupiter, FL 33458, USA
- Patrick R A Zanon
  - Department of Chemistry, The Herbert Wertheim UF Scripps Institute for Biomedical Innovation and Technology, 130 Scripps Way, Jupiter, FL 33458, USA
- Yanjun Li
  - Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, The University of Florida, Gainesville, FL 32610, USA
  - Department of Computer & Information Science & Engineering, University of Florida, Gainesville, FL 32611, USA
- Chenglong Li
  - Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, The University of Florida, Gainesville, FL 32610, USA
- Jessica L Childs-Disney
  - Department of Chemistry, The Herbert Wertheim UF Scripps Institute for Biomedical Innovation and Technology, 130 Scripps Way, Jupiter, FL 33458, USA
- Matthew D Disney
  - Department of Chemistry, The Herbert Wertheim UF Scripps Institute for Biomedical Innovation and Technology, 130 Scripps Way, Jupiter, FL 33458, USA
  - Department of Chemistry, The Scripps Research Institute, 130 Scripps Way, Jupiter, FL 33458, USA
2
Daroch A, Purohit R. MDbDMRP: A novel molecular descriptor-based computational model to identify drug-miRNA relationships. Int J Biol Macromol 2025; 287:138580. [PMID: 39657879 DOI: 10.1016/j.ijbiomac.2024.138580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/09/2024] [Revised: 11/20/2024] [Accepted: 12/07/2024] [Indexed: 12/12/2024]
Abstract
MicroRNAs (miRNAs) are important in gene expression regulation and many other biological processes and have emerged as promising therapeutic targets. Identifying potential drug-miRNA relationships is helpful in disease therapy and pharmaceutical engineering in medical research. However, accurately predicting these relationships remains a significant computational challenge. This study introduces MDbDMRP, a novel molecular descriptors-based drug-miRNA relationship prediction computational model designed to address this challenge. MDbDMRP leverages the power of machine learning to predict new drug-miRNA associations and non-associations. The model achieves exceptional performance, exceeding an average score of 0.92 across various evaluation metrics, including accuracy, precision, recall, and F1-score. Furthermore, it demonstrates a remarkable ability to distinguish between positive and negative interactions, as evidenced by an outstanding AUC-ROC score of 0.9864. The results obtained from MDbDMRP were further validated through molecular docking, reinforcing its performance. These results position MDbDMRP as a valuable tool for researchers aiming to unlock the potential of miRNAs in drug discovery.
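For readers unfamiliar with the metrics named in this abstract, the following is a minimal sketch of how accuracy, precision, recall, F1-score, and AUC-ROC are computed for a binary association/non-association classifier. The function names and the toy labels and scores are illustrative assumptions; they are not taken from MDbDMRP or its data.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1 for binary labels (1 = association)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


def auc_roc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical drug-miRNA pairs: true labels and predicted scores.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]
predicted = [int(s >= 0.5) for s in scores]  # 0.5 threshold is an assumption

print(classification_metrics(labels, predicted))
print(auc_roc(labels, scores))
```

The pairwise AUC formulation is quadratic in the number of examples; it is chosen here for clarity rather than speed.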
Affiliation(s)
- Amit Daroch
  - Structural Bioinformatics Lab, Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology, Palampur, HP 176061, India
  - The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Palampur, HP 176061, India
- Rituraj Purohit
  - Academy of Scientific and Innovative Research, Ghaziabad 201002, India
3
Liu W, Lan Z, Li Z, Sun X, Lu X. Dual-neighbourhood information aggregation and feature fusion for prediction of miRNA-disease association. Comput Biol Med 2024; 181:109068. [PMID: 39208505 DOI: 10.1016/j.compbiomed.2024.109068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/03/2024] [Revised: 06/23/2024] [Accepted: 08/21/2024] [Indexed: 09/04/2024]
Abstract
Studying the intricate relationship between miRNAs and diseases is crucial to prevent and treat miRNA-related disorders. Existing computational methods often overlook the importance of features of different nodes and the propagation of features among heterogeneous nodes. Many prediction models focus only on the feature coding of miRNA and diseases and ignore the importance of feature aggregation. We propose a prediction method via dual-neighbourhood feature aggregation and feature fusion, which uses multiple sources of information, aggregates information on homogeneous and heterogeneous nodes and fuses learned features to predict multiple representations of disease nodes. We constructed similarity networks of multiple homogeneous nodes based on different similarity computation methods respectively, and fused the attention mechanism by using graph convolutional networks to obtain information of different levels of importance. To alleviate the problem of sparse connectivity in the dataset, we built a two-neighbourhood heterogeneous graph neural network model to integrate the homogeneous similarity network into a miRNA-disease heterogeneous network by using known miRNA-disease association information. We used the neighbourhood information associated with the nodes in the network to perform feature aggregation. In addition, we used a feature fusion module to learn the importance of different types of nodes to predict miRNA-disease associations. Our experimental results on the Human microRNA Disease Database (HMDD v3.2) show that the model demonstrates superior performance. This work demonstrates the capability of our model to identify potential miRNAs associated with diseases through a case study of two common cancers.
Affiliation(s)
- Wei Liu
  - School of Computer Science, Xiangtan University, Xiangtan, 411105, China
- Zixin Lan
  - School of Computer Science, Xiangtan University, Xiangtan, 411105, China
- Zejun Li
  - School of Computer Science and Engineering, Hunan Institute of Technology, Hengyang, 421002, China
- Xingen Sun
  - School of Mathematics and Computational Science, Xiangtan University, Xiangtan, 411105, China
- Xu Lu
  - School of Computer Science, Guangdong Polytechnic Normal University, Guangdong Provincial Key Laboratory of Intellectual Property Big Data, Guangzhou 510665, China
4
Xiao H, Zhang Y, Yang X, Yu S, Chen Z, Lu A, Zhang Z, Zhang G, Zhang BT. SMTRI: A deep learning-based web service for predicting small molecules that target miRNA-mRNA interactions. Mol Ther Nucleic Acids 2024; 35:102303. [PMID: 39281703 PMCID: PMC11401195 DOI: 10.1016/j.omtn.2024.102303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/12/2023] [Accepted: 08/12/2024] [Indexed: 09/18/2024]
Abstract
Mature microRNAs (miRNAs) are short, single-stranded RNAs that bind to target mRNAs and induce translational repression and gene silencing. Many miRNAs discovered in animals have been implicated in diseases and have recently been pursued as therapeutic targets. However, conventional pharmacological screening for candidate small-molecule drugs can be time-consuming and labor-intensive. Therefore, developing a computational program to assist mature miRNA-targeted drug discovery in silico is desirable. Our previous work (https://doi.org/10.1002/advs.201903451) revealed that the unique functional loops formed during Argonaute-mediated miRNA-mRNA interactions have stable structural characteristics and may serve as potential targets for small-molecule drug discovery. Developing drugs specifically targeting disease-related mature miRNAs and their target mRNAs would avoid affecting unrelated ones. Here, we present SMTRI, a convolutional neural network-based approach for efficiently predicting small molecules that target RNA secondary structural motifs formed by interactions between miRNAs and their target mRNAs. Measured on three additional testing sets, SMTRI outperformed state-of-the-art algorithms by 12.9%-30.3% in AUC and 2.0%-18.4% in accuracy. Moreover, four case studies on the published experimentally validated RNA-targeted small molecules also revealed the reliability of SMTRI.
Affiliation(s)
- Huan Xiao
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
- Yihao Zhang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
- Xin Yang
  - Law Sau Fai Institute for Advancing Translational Medicine in Bone & Joint Diseases, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Integrated Bioinformedicine and Translational Science, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Precision Medicine and Innovative Drug Discovery, Hong Kong Baptist University, Hong Kong SAR 999077, China
- Sifan Yu
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
- Ziqi Chen
  - Law Sau Fai Institute for Advancing Translational Medicine in Bone & Joint Diseases, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Integrated Bioinformedicine and Translational Science, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Precision Medicine and Innovative Drug Discovery, Hong Kong Baptist University, Hong Kong SAR 999077, China
- Aiping Lu
  - Law Sau Fai Institute for Advancing Translational Medicine in Bone & Joint Diseases, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Integrated Bioinformedicine and Translational Science, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Precision Medicine and Innovative Drug Discovery, Hong Kong Baptist University, Hong Kong SAR 999077, China
- Zongkang Zhang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
- Ge Zhang
  - Law Sau Fai Institute for Advancing Translational Medicine in Bone & Joint Diseases, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Integrated Bioinformedicine and Translational Science, Hong Kong Baptist University, Hong Kong SAR 999077, China
  - Institute of Precision Medicine and Innovative Drug Discovery, Hong Kong Baptist University, Hong Kong SAR 999077, China
- Bao-Ting Zhang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
5
Sun XY, Hou ZJ, Zhang WG, Chen Y, Yao HB. HTFSMMA: Higher-Order Topological Guided Small Molecule-MicroRNA Associations Prediction. J Comput Biol 2024; 31:886-906. [PMID: 39109562 DOI: 10.1089/cmb.2024.0587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/10/2024] Open
Abstract
Small molecules (SMs) play a pivotal role in regulating microRNAs (miRNAs). Existing prediction methods for associations between SM-miRNA have overlooked crucial aspects: the incorporation of local topological features between nodes, which represent either SMs or miRNAs, and the effective fusion of node features with topological features. This study introduces a novel approach, termed high-order topological features for SM-miRNA association prediction (HTFSMMA), which specifically addresses these limitations. Initially, an association graph is formed by integrating SM-miRNA association data, SM similarity, and miRNA similarity. Subsequently, we focus on the local information of links and propose target neighborhood graph convolutional network for extracting local topological features. Then, HTFSMMA employs graph attention networks to amalgamate these local features, thereby establishing a platform for the acquisition of high-order features through random walks. Finally, the extracted features are integrated into the multilayer perceptron to derive the association prediction scores. To demonstrate the performance of HTFSMMA, we conducted comprehensive evaluations including five-fold cross-validation, leave-one-out cross-validation (LOOCV), SM-fixed local LOOCV, and miRNA-fixed local LOOCV. The area under receiver operating characteristic curve values were 0.9958 ± 0.0024 (0.8722 ± 0.0021), 0.9986 (0.9504), 0.9974 (0.9111), and 0.9977 (0.9074), respectively. Our findings demonstrate the superior performance of HTFSMMA over existing approaches. In addition, three case studies and the DeLong test have confirmed the effectiveness of the proposed method. These results collectively underscore the significance of HTFSMMA in facilitating the inference of associations between SMs and miRNAs.
Affiliation(s)
- Xiao-Yan Sun
  - School of Computer Science and Artificial Intelligence & Aliyun Big Data, Changzhou University, Changzhou, China
- Zhen-Jie Hou
  - School of Computer Science and Artificial Intelligence & Aliyun Big Data, Changzhou University, Changzhou, China
- Wen-Guang Zhang
  - School of Life Sciences, Inner Mongolia Agricultural University, Hohhot, China
- Yan Chen
  - School of Computer Science and Artificial Intelligence & Aliyun Big Data, Changzhou University, Changzhou, China
- Hai-Bin Yao
  - School of Computer Science and Artificial Intelligence & Aliyun Big Data, Changzhou University, Changzhou, China
6
Krishnan SR, Roy A, Gromiha MM. Reliable method for predicting the binding affinity of RNA-small molecule interactions using machine learning. Brief Bioinform 2024; 25:bbae002. [PMID: 38261341 PMCID: PMC10805179 DOI: 10.1093/bib/bbae002] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Received: 05/24/2023] [Revised: 12/21/2023] [Accepted: 12/24/2023] [Indexed: 01/24/2024] Open
Abstract
Ribonucleic acids (RNAs) play important roles in cellular regulation. Consequently, dysregulation of both coding and non-coding RNAs has been implicated in several disease conditions in the human body. In this regard, a growing interest has been observed to probe into the potential of RNAs to act as drug targets in disease conditions. To accelerate this search for disease-associated novel RNA targets and their small molecular inhibitors, machine learning models for binding affinity prediction were developed specific to six RNA subtypes namely, aptamers, miRNAs, repeats, ribosomal RNAs, riboswitches and viral RNAs. We found that differences in RNA sequence composition, flexibility and polar nature of RNA-binding ligands are important for predicting the binding affinity. Our method showed an average Pearson correlation (r) of 0.83 and a mean absolute error of 0.66 upon evaluation using the jack-knife test, indicating their reliability despite the low amount of data available for several RNA subtypes. Further, the models were validated with external blind test datasets, which outperform other existing quantitative structure-activity relationship (QSAR) models. We have developed a web server to host the models, RNA-Small molecule binding Affinity Predictor, which is freely available at: https://web.iitm.ac.in/bioinfo2/RSAPred/.
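The evaluation protocol mentioned in this abstract (a jack-knife test scored by Pearson correlation and mean absolute error) can be sketched as follows. This is an illustration only: the single-descriptor least-squares model, the function names, and the toy numbers are assumptions standing in for the paper's actual regression models and features, which the abstract does not specify.

```python
from math import sqrt


def fit_line(xs, ys):
    """Least-squares slope and intercept for a single descriptor."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def jackknife_predictions(xs, ys):
    """Predict each sample from a model trained on all the other samples."""
    preds = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        preds.append(slope * xs[i] + intercept)
    return preds


def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def mean_absolute_error(a, b):
    """Average absolute difference between observed and predicted values."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


# Hypothetical ligand descriptor values and measured binding affinities.
descriptor = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
affinity = [4.1, 4.9, 6.2, 6.8, 8.1, 8.9]

predicted = jackknife_predictions(descriptor, affinity)
print(pearson_r(affinity, predicted), mean_absolute_error(affinity, predicted))
```

Because every sample is held out exactly once, the jack-knife makes full use of small datasets, which matches the abstract's remark about the low amount of data for several RNA subtypes.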
Affiliation(s)
- Sowmya R Krishnan
  - Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
  - TCS Research (Life Sciences division), Tata Consultancy Services, Hyderabad 500081, India
- Arijit Roy
  - TCS Research (Life Sciences division), Tata Consultancy Services, Hyderabad 500081, India
- M Michael Gromiha
  - Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
  - International Research Frontiers Initiative, School of Computing, Tokyo Institute of Technology, Yokohama 226-8501, Japan
  - Department of Computer Science, National University of Singapore, Singapore 117543
7
Zhong Y, Shen C, Xi X, Luo Y, Ding P, Luo L. Multitask joint learning with graph autoencoders for predicting potential MiRNA-drug associations. Artif Intell Med 2023; 145:102665. [PMID: 37925217 DOI: 10.1016/j.artmed.2023.102665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/11/2023] [Revised: 06/14/2023] [Accepted: 09/14/2023] [Indexed: 11/06/2023]
Abstract
The occurrence of many diseases is associated with miRNA abnormalities. Predicting potential drug-miRNA associations is of great importance for both disease treatment and new drug discovery. Most computation-based approaches learn one task at a time, ignoring the information contained in other tasks in the same domain. Multitask learning can effectively enhance the prediction performance of a single task by extending the valid information of related tasks. In this paper, we presented a multitask joint learning framework (MTJL) with a graph autoencoder for predicting the associations between drugs and miRNAs. First, we combined multiple pieces of information to construct a high-quality similarity network of both drugs and miRNAs and then used a graph autoencoder (GAE) to learn their embedding representations separately. Second, to further improve the embedding quality of drugs, we added an auxiliary task to classify drugs using the learned representations. Finally, the embedding representations of drugs and miRNAs were linearly transformed to obtain the predictive association scores between them. A comparison with other state-of-the-art models shows that MTJL has the best prediction performance, and ablation experiments show that the auxiliary task can enhance the embedding quality and improve the robustness of the model. In addition, we show that MTJL has high utility in predicting potential associations between drugs and miRNAs by conducting two case studies.
Affiliation(s)
- Yichen Zhong
  - School of Computer Science, University of South China, Hengyang 421001, China
- Cong Shen
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
- Xiaoting Xi
  - School of Computer Science, University of South China, Hengyang 421001, China
- Yuxun Luo
  - School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan 411105, China
- Pingjian Ding
  - School of Computer Science, University of South China, Hengyang 421001, China
- Lingyun Luo
  - School of Computer Science, University of South China, Hengyang 421001, China
8
Sun J, Xu M, Ru J, James-Bott A, Xiong D, Wang X, Cribbs AP. Small molecule-mediated targeting of microRNAs for drug discovery: Experiments, computational techniques, and disease implications. Eur J Med Chem 2023; 257:115500. [PMID: 37262996 PMCID: PMC11554572 DOI: 10.1016/j.ejmech.2023.115500] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 03/28/2023] [Revised: 05/05/2023] [Accepted: 05/15/2023] [Indexed: 06/03/2023]
Abstract
Small molecules have been providing medical breakthroughs for human diseases for more than a century. Recently, identifying small molecule inhibitors that target microRNAs (miRNAs) has gained importance, despite the challenges posed by labour-intensive screening experiments and the significant efforts required for medicinal chemistry optimization. Numerous experimentally-verified cases have demonstrated the potential of miRNA-targeted small molecule inhibitors for disease treatment. This new approach is grounded in their posttranscriptional regulation of the expression of disease-associated genes. Reversing dysregulated gene expression using this mechanism may help control dysfunctional pathways. Furthermore, the ongoing improvement of algorithms has allowed for the integration of computational strategies built on top of laboratory-based data, facilitating a more precise and rational design and discovery of lead compounds. To complement the use of extensive pharmacogenomics data in prioritising potential drugs, our previous work introduced a computational approach based on only molecular sequences. Moreover, various computational tools for predicting molecular interactions in biological networks using similarity-based inference techniques have been accumulated in established studies. However, there are a limited number of comprehensive reviews covering both computational and experimental drug discovery processes. In this review, we outline a cohesive overview of both biological and computational applications in miRNA-targeted drug discovery, along with their disease implications and clinical significance. Finally, utilizing drug-target interaction (DTIs) data from DrugBank, we showcase the effectiveness of deep learning for obtaining the physicochemical characterization of DTIs.
Affiliation(s)
- Jianfeng Sun
  - Botnar Research Centre, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
- Miaoer Xu
  - Department of Biology, Emory University, Atlanta, GA, 30322, USA
- Jinlong Ru
  - Chair of Prevention of Microbial Diseases, School of Life Sciences Weihenstephan, Technical University of Munich, Freising, 85354, Germany
- Anna James-Bott
  - Botnar Research Centre, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
- Dapeng Xiong
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Xia Wang
  - College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
- Adam P Cribbs
  - Botnar Research Centre, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
9
Ding P, Zeng M, Yin R. Editorial: Computational methods to analyze RNA data for human diseases. Front Genet 2023; 14:1270334. [PMID: 37674479 PMCID: PMC10478215 DOI: 10.3389/fgene.2023.1270334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Accepted: 08/14/2023] [Indexed: 09/08/2023] Open
Affiliation(s)
- Pingjian Ding
  - Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, United States
- Min Zeng
  - School of Computer Science and Engineering, Central South University, Changsha, China
- Rui Yin
  - Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, United States
10
Luo Y, Peng L, Shan W, Sun M, Luo L, Liang W. Machine learning in the development of targeting microRNAs in human disease. Front Genet 2023; 13:1088189. [PMID: 36685965 PMCID: PMC9845262 DOI: 10.3389/fgene.2022.1088189] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 11/03/2022] [Accepted: 12/12/2022] [Indexed: 01/05/2023] Open
Abstract
A microRNA is a small, single-stranded, non-coding ribonucleic acid that plays a crucial role in RNA silencing and can regulate gene expression. With the in-depth study of miRNA in development and disease, miRNA has become an attractive target for novel therapeutic strategies. Exploring miRNA targeting therapy only through experiments is expensive and laborious, so it is essential to develop novel and efficient computational methods to narrow down the search. Recent advances in machine learning applied in biomedical informatics provide opportunities to explore miRNA-targeting drugs, thus promoting miRNA therapeutics. This review provides an overview of recent advancements in miRNA targeting therapeutic using machine learning. First, we mainly describe the basics of predicting miRNA targeting drugs, including pharmacogenomic data resources and data preprocessing. Then we present primary machine learning algorithms and elaborate their application in discovering relationships among miRNAs, drugs, and diseases. Along with the progress of miRNA targeting therapeutics, we finally analyze and discuss the current challenges and opportunities that machine learning confronts.
Affiliation(s)
- Yuxun Luo
  - School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, China
  - Hunan Key Laboratory for Service computing and Novel Software Technology, Xiangtan, China
- Li Peng
  - School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, China
  - Hunan Key Laboratory for Service computing and Novel Software Technology, Xiangtan, China
- Wenyu Shan
  - School of Computer Science, University of South China, Hengyang, China
- Mengyue Sun
  - School of Polymer Science and Polymer Engineering, The University of Akron, Akron, OH, United States
- Lingyun Luo
  - School of Computer Science, University of South China, Hengyang, China
- Wei Liang
  - School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, China
  - Hunan Key Laboratory for Service computing and Novel Software Technology, Xiangtan, China
  - Correspondence: Wei Liang
11
Xiao H, Yang X, Zhang Y, Zhang Z, Zhang G, Zhang BT. RNA-targeted small-molecule drug discoveries: a machine-learning perspective. RNA Biol 2023; 20:384-397. [PMID: 37337437 PMCID: PMC10283424 DOI: 10.1080/15476286.2023.2223498] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Accepted: 06/06/2023] [Indexed: 06/21/2023] Open
Abstract
In the past two decades, machine learning (ML) has been extensively adopted in protein-targeted small molecule (SM) discovery. Once trained, ML models can apply their predictive abilities to large volumes of molecules within a short time. However, applying ML approaches to discover RNA-targeted SMs is still in its early stages. This is primarily because the intrinsic structural instability of RNA molecules impedes the structure-based screening or design of RNA-targeted SMs. Recently, with more studies revealing RNA structures and a growing number of RNA-targeted ligands being identified, interest in the field of drugging RNA has increased. Undeniably, intracellular RNA is much more abundant than protein and, if successfully targeted, will be a major alternative target for therapeutics. Therefore, in this context, and provided that RNA-related research data are available, ML-based methods can help speed up traditional experimental processes.
Affiliation(s)
- Huan Xiao
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
- Xin Yang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
- Yihao Zhang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
- Zongkang Zhang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
- Ge Zhang
  - Law Sau Fai Institute for Advancing Translational Medicine in Bone & Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, China
  - Institute of Integrated Bioinformedicine and Translational Science, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, China
  - Institute of Precision Medicine and Innovative Drug Discovery, HKBU Institute for Research and Continuing Education, Shenzhen, China
- Bao-Ting Zhang
  - School of Chinese Medicine, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
12
Xie W, Zheng Z, Zhang W, Huang L, Lin Q, Wong KC. SRG-vote: Predicting miRNA-gene relationships via embedding and LSTM ensemble. IEEE J Biomed Health Inform 2022; 26:4335-4344. [PMID: 35471879 DOI: 10.1109/jbhi.2022.3169542] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 11/10/2022]
Abstract
Targeted therapy for one or a set of genes has made it possible to apply precision medicine to different patients due to the existence of tumor heterogeneity. However, how to regulate those genes is still problematic. One of the natural regulators of genes is microRNAs. Thus, a better understanding of the miRNA-gene interaction mechanism might contribute to future diagnosis, prevention, and cancer therapy. The interactions between microRNAs and genes play an essential role in molecular genetics. The in-vivo experiments validating the relationships between them are time-consuming, costly, and labor-intensive. With the development of high-throughput technology, large volumes of biological data have become available. However, extracting features from such raw data and building a mathematical model is still a challenging topic. Machine learning and deep learning algorithms have become powerful tools in dealing with biological data. Inspired by this, in this paper, we propose a model that combines feature/embedding extraction methods, deep learning algorithms, and a voting system. We leverage doc2vec to generate sequential embeddings from molecular sequences. Geometrical embeddings were generated with role2vec, GCN, and GMM from the complex network built from similarity and pair-wise datasets. For the deep learning algorithms, we leveraged LSTM and Bi-LSTM according to the different embeddings and features. Finally, we adopted a voting system to balance results from different data sources. The results have shown that our voting system could achieve a higher AUC than the existing benchmark. The case studies demonstrate that our model could reveal potential relationships between miRNAs and genes. The source code, features, and predictive results can be downloaded at https://github.com/Xshelton/SRG-vote.
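The abstract does not say how the voting system combines its models, so the following is only a generic sketch of weighted soft voting, one common way to balance scores from several predictors. The model names, weights, scores, and the 0.5 decision threshold are all hypothetical and are not taken from SRG-vote.

```python
def soft_vote(scores_by_model, weights=None):
    """Combine per-pair scores from several models by weighted averaging.

    scores_by_model maps a model name to a list of scores, one per
    miRNA-gene pair, with all lists in the same pair order.
    """
    names = list(scores_by_model)
    if weights is None:
        # Equal weighting is an assumption, not the paper's scheme.
        weights = {name: 1.0 for name in names}
    total = sum(weights[name] for name in names)
    n_pairs = len(scores_by_model[names[0]])
    return [
        sum(weights[name] * scores_by_model[name][i] for name in names) / total
        for i in range(n_pairs)
    ]


# Hypothetical scores for three miRNA-gene pairs from two models.
scores = {
    "lstm_sequence_embedding": [0.9, 0.2, 0.6],
    "bilstm_network_embedding": [0.7, 0.4, 0.3],
}

combined = soft_vote(scores)
predicted_labels = [int(s >= 0.5) for s in combined]
print(combined, predicted_labels)
```

Passing a `weights` dictionary lets a more reliable data source count for more, which is one way a voting step can balance predictors of uneven quality.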
|
13
|
Luo J, Bao Y, Chen X, Shen C. Metapath-Based Deep Convolutional Neural Network for Predicting miRNA-Target Association on Heterogeneous Network. Interdiscip Sci 2021; 13:547-558. [PMID: 34170473 DOI: 10.1007/s12539-021-00454-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/03/2021] [Revised: 06/17/2021] [Accepted: 06/17/2021] [Indexed: 06/13/2023]
Abstract
Predicting the interactions between microRNAs (miRNAs) and target genes is of great significance for understanding the regulatory mechanism of miRNAs and treating complex diseases. The emergence of large-scale, heterogeneous biological networks has offered unprecedented opportunities for revealing miRNA-associated target genes. However, existing methods are still limited in automatically learning the feature information of the network. Since network representation learning can self-adaptively capture the structure information of the network, we propose a framework based on heterogeneous network representation, MDCNN (Metapath-Based Deep Convolutional Neural Network), to predict the associations between miRNAs and target genes. MDCNN samples the paths between node pairs in the form of meta-paths based on the heterogeneous information network (HIN) of miRNAs and target genes. Then the node feature and the path feature, which is learned by the Deep Convolutional Neural Network (DCNN), are spliced together as the representation of the miRNA-target gene pair to predict miRNA-target gene interactions. The experimental results indicate that MDCNN outperforms other methods on multiple validation metrics under fivefold cross-validation. We conducted an ablation study to assess the necessity of miRNA similarity and target gene similarity for improving the prediction ability of MDCNN. The case studies on hsa-miR-26b-5p and CDKN1A further demonstrate that MDCNN can successfully predict potential miRNA-target gene interactions.
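Meta-path sampling between a node pair can be sketched as follows. This is an illustrative enumeration on a toy graph, not the MDCNN implementation; the node names, edge set, and the two meta-path patterns are hypothetical.

```python
from typing import Dict, List, Sequence

Graph = Dict[str, List[str]]


def metapath_instances(graph: Graph, node_type: Dict[str, str],
                       start: str, end: str,
                       metapath: Sequence[str]) -> List[List[str]]:
    """Enumerate simple paths from start to end whose node types follow metapath."""
    if node_type[start] != metapath[0]:
        return []
    found: List[List[str]] = []

    def extend(path: List[str]) -> None:
        depth = len(path)
        if depth == len(metapath):
            if path[-1] == end:
                found.append(list(path))
            return
        for nxt in graph.get(path[-1], []):
            # Follow an edge only if the neighbor has the type the
            # meta-path expects at this position and is not yet visited.
            if node_type[nxt] == metapath[depth] and nxt not in path:
                path.append(nxt)
                extend(path)
                path.pop()

    extend([start])
    return found


# Toy heterogeneous network: known miRNA-gene links plus similarity links.
node_type = {"m1": "miRNA", "m2": "miRNA", "g1": "gene", "g2": "gene"}
graph = {
    "m1": ["g1", "m2"],
    "m2": ["m1", "g2"],
    "g1": ["m1", "g2"],
    "g2": ["m2", "g1"],
}
via_similar_mirna = metapath_instances(graph, node_type, "m1", "g2",
                                       ["miRNA", "miRNA", "gene"])
via_similar_gene = metapath_instances(graph, node_type, "m1", "g2",
                                      ["miRNA", "gene", "gene"])
```

Each returned path is one instance that a downstream model could encode as a path feature for the candidate pair (m1, g2).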
Affiliation(s)
- Jiawei Luo
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
- Yaoting Bao
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
- Xiangtao Chen
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
- Cong Shen
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
|
14
|
Kulichenko M, Smith JS, Nebgen B, Li YW, Fedik N, Boldyrev AI, Lubbers N, Barros K, Tretiak S. The Rise of Neural Networks for Materials and Chemical Dynamics. J Phys Chem Lett 2021; 12:6227-6243. [PMID: 34196559 DOI: 10.1021/acs.jpclett.1c01357] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Indexed: 06/13/2023]
Abstract
Machine learning (ML) is quickly becoming a premier tool for modeling chemical processes and materials. ML-based force fields, trained on large data sets of high-quality electronic structure calculations, are particularly attractive due to their unique combination of computational efficiency and physical accuracy. This Perspective summarizes some recent advances in the development of neural network-based interatomic potentials. Designing high-quality training data sets is crucial to overall model accuracy. One strategy is active learning, in which new data are automatically collected for atomic configurations that produce large ML uncertainties. Another strategy is to use the highest levels of quantum theory possible. Transfer learning allows training to a data set of mixed fidelity. A model initially trained to a large data set of density functional theory calculations can be significantly improved by retraining to a relatively small data set of expensive coupled cluster theory calculations. These advances are exemplified by applications to molecules and materials.
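The active-learning selection described above can be sketched in a few lines. The abstract does not say how uncertainty is measured, so this sketch assumes ensemble disagreement (the spread of predictions across several independently trained models); the energies and the threshold are hypothetical.

```python
from statistics import pstdev
from typing import List, Sequence


def select_uncertain(ensemble_energies: Sequence[Sequence[float]],
                     threshold: float) -> List[int]:
    """Return indices of configurations whose ensemble spread exceeds threshold.

    ensemble_energies[k][i] is model k's predicted energy for configuration i.
    """
    n_configs = len(ensemble_energies[0])
    selected = []
    for i in range(n_configs):
        # Population standard deviation across models for configuration i.
        spread = pstdev([model[i] for model in ensemble_energies])
        if spread > threshold:
            selected.append(i)
    return selected


# Hypothetical predictions from three models for four configurations.
predictions = [
    [-1.00, -2.00, -3.00, -0.50],
    [-1.01, -2.30, -3.02, -0.49],
    [-0.99, -1.70, -2.98, -0.51],
]
to_label = select_uncertain(predictions, threshold=0.05)
```

Only the selected configurations would be sent for new reference calculations and added to the training set before retraining.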
Affiliation(s)
- Maksim Kulichenko
  - Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
  - Department of Chemistry and Biochemistry, Utah State University, Logan, Utah 84322, United States
- Justin S Smith
  - Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
  - Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Benjamin Nebgen
  - Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Ying Wai Li
  - Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Nikita Fedik
  - Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
  - Department of Chemistry and Biochemistry, Utah State University, Logan, Utah 84322, United States
- Alexander I Boldyrev
  - Department of Chemistry and Biochemistry, Utah State University, Logan, Utah 84322, United States
- Nicholas Lubbers
  - Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Kipton Barros
  - Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
  - Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Sergei Tretiak
  - Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
  - Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
  - Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
|
15
|
Li J, Peng D, Xie Y, Dai Z, Zou X, Li Z. Novel Potential Small Molecule-MiRNA-Cancer Associations Prediction Model Based on Fingerprint, Sequence, and Clinical Symptoms. J Chem Inf Model 2021; 61:2208-2219. [PMID: 33899462 DOI: 10.1021/acs.jcim.0c01458] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 01/23/2023]
Abstract
As an important biomarker in organisms, miRNA is closely related to various small molecules and diseases. Research on small molecule-miRNA-cancer associations is helpful for the development of cancer treatment drugs and the discovery of pathogenesis. There is an urgent need to develop theoretical methods for identifying potential small molecule-miRNA-cancer associations, because experimental approaches are usually time-consuming, laborious, and expensive. To overcome this problem, we developed a new computational method in which features derived from structure, sequence, and symptoms were utilized to characterize small molecules, miRNAs, and cancers, respectively. A feature vector was constructed to characterize each small molecule-miRNA-cancer association by concatenating these features, and a random forest algorithm was utilized to construct a model for recognizing potential associations. Based on 5-fold cross-validation and a benchmark data set, the model achieved an accuracy of 93.20 ± 0.52%, a precision of 93.22 ± 0.51%, a recall of 93.20 ± 0.53%, and an F1-measure of 93.20 ± 0.52%. The areas under the receiver operating characteristic curve and precision-recall curve were 0.9873 and 0.9870, respectively. The real prediction ability and application performance of the developed method have also been further evaluated and verified through an independent data set test and a case study. Some potential small molecules and miRNAs related to cancer have been identified and are worthy of further experimental research. It is anticipated that our model could be regarded as a useful high-throughput virtual screening tool for drug research and development. All source codes can be downloaded from https://github.com/LeeKamlong/Multi-class-SMMCA.
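For readers less familiar with the reported metrics, the sketch below computes accuracy, precision, recall, and F1 from predicted labels. It covers only the binary case with hypothetical labels; the abstract does not state how the scores were averaged across classes or folds, so this is not a reproduction of the paper's evaluation.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall, and F1 for labels in {0, 1} (1 = association)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label sequences must be non-empty and equally long")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    # Guard against empty denominators when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical labels for eight candidate associations.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
```

In this toy case there are three true positives, three true negatives, one false positive, and one false negative, so all four metrics equal 0.75.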
Affiliation(s)
- Jinlong Li
  - School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, People's Republic of China
- Dongdong Peng
  - School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, People's Republic of China
- Yun Xie
  - School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, People's Republic of China
- Zong Dai
  - School of Biomedical Engineering, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China
- Xiaoyong Zou
  - School of Chemistry, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China
- Zhanchao Li
  - School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, People's Republic of China
  - Key Laboratory of Digital Quality Evaluation of Chinese Materia Medica of State Administration of Traditional Chinese Medicine, Guangzhou 510006, People's Republic of China
|
16
|
Tang X, Luo J, Shen C, Lai Z. Multi-view Multichannel Attention Graph Convolutional Network for miRNA-disease association prediction. Brief Bioinform 2021; 22:6271996. [PMID: 33963829 DOI: 10.1093/bib/bbab174] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Received: 03/01/2021] [Revised: 04/08/2021] [Accepted: 04/09/2021] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION: In recent years, a growing number of studies have shown that microRNAs (miRNAs) play significant roles in the development of complex human diseases. Discovering the associations between miRNAs and diseases has become an important part of the discovery and treatment of disease. Since uncovering associations via traditional experimental methods is complicated and time-consuming, many computational methods have been proposed to identify potential associations. However, there are still challenges in accurately determining potential associations between miRNAs and diseases by using multisource data. RESULTS: In this study, we develop a Multi-view Multichannel Attention Graph Convolutional Network (MMGCN) to predict potential miRNA-disease associations. Unlike simple multisource information integration, MMGCN employs a GCN encoder to obtain the features of miRNAs and diseases in different similarity views, respectively. Moreover, MMGCN can enhance the learned latent representations for association prediction by utilizing multichannel attention, which adaptively learns the importance of different features. Empirical results on two datasets demonstrate that the MMGCN model achieves superior performance compared with nine state-of-the-art methods on most of the metrics. Furthermore, we demonstrate the effectiveness of the multichannel attention mechanism and the validity of multisource data in miRNA-disease association prediction. Case studies also indicate the ability of the method to discover new associations.
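The idea of weighting several similarity views by learned importance can be sketched with a softmax-weighted sum. This is a simplified stand-in and not the MMGCN attention module: the abstract does not describe how the attention scores are computed, so the per-view scores here are hypothetical inputs, as are the embeddings.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_over_views(view_embeddings: Sequence[Sequence[float]],
                      view_scores: Sequence[float]) -> List[float]:
    """Combine one node's per-view embeddings using softmax attention weights."""
    if len(view_embeddings) != len(view_scores):
        raise ValueError("need exactly one score per view")
    weights = softmax(view_scores)
    dim = len(view_embeddings[0])
    return [
        sum(w * emb[d] for w, emb in zip(weights, view_embeddings))
        for d in range(dim)
    ]


# Hypothetical embeddings of one miRNA in two similarity views.
views = [[1.0, 0.0], [0.0, 1.0]]
balanced = attend_over_views(views, [0.0, 0.0])  # equal scores, equal weights
skewed = attend_over_views(views, [2.0, 0.0])    # first view dominates
```

In a trained model the scores would come from learned parameters, so the weighting adapts to whichever view is most informative.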
Affiliation(s)
- Xinru Tang
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
- Jiawei Luo
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
- Cong Shen
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
- Zihan Lai
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
|
17
|
Shen C, Luo J, Ouyang W, Ding P, Chen X. IDDkin: Network-based influence deep diffusion model for enhancing prediction of kinase inhibitors. Bioinformatics 2020; 36:5481-5491. [PMID: 33367525 DOI: 10.1093/bioinformatics/btaa1058] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 07/09/2020] [Revised: 11/09/2020] [Accepted: 12/10/2020] [Indexed: 01/01/2023] Open
Abstract
MOTIVATION: Protein kinases have been the focus of drug discovery research for many years because they play a causal role in many human diseases. Understanding the binding profile of kinase inhibitors is a prerequisite for drug discovery, and traditional methods of predicting kinase inhibitors are time-consuming and inefficient. Calculation-based predictive methods provide a relatively low-cost and high-efficiency approach to the rapid development and effective understanding of the binding profile of kinase inhibitors. In particular, the continuous improvement of network pharmacology methods provides unprecedented opportunities for drug discovery: network-based computational methods can aggregate effective information from heterogeneous sources and have become a new way of predicting the binding profile of kinase inhibitors. RESULTS: In this study, we proposed a network-based influence deep diffusion model, named IDDkin, for enhancing the prediction of kinase inhibitors. IDDkin uses deep graph convolutional networks, graph attention networks, and adaptive weighting methods to diffuse the effective information of heterogeneous networks. The updated kinase and compound representations are used to predict potential compound-kinase pairs. The experimental results show that the performance of IDDkin is superior to that of the comparison methods, including the state-of-the-art kinase inhibitor prediction method and the classic model widely used in relationship prediction. In experiments conducted to verify its generalizability and in case studies, the IDDkin model also shows excellent performance. All of these results demonstrate the powerful predictive ability of the IDDkin model in the field of kinase inhibitors. AVAILABILITY AND IMPLEMENTATION: Source code and data can be downloaded from https://github.com/CS-BIO/IDDkin. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
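How a graph convolution diffuses information between neighboring nodes can be shown with one propagation step. The sketch below uses the widely used symmetric-normalized form, ReLU(D^-1/2 (A + I) D^-1/2 H W), as an assumption; the abstract does not give IDDkin's exact layer, and the two-node compound-kinase graph, features, and weights are hypothetical.

```python
import math
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of a (n x k) and b (k x m)."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def gcn_layer(adj: Matrix, features: Matrix, weight: Matrix) -> Matrix:
    """One graph convolution step with self-loops and symmetric normalization."""
    n = len(adj)
    # Add self-loops so each node keeps part of its own features.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    norm = [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]
    out = matmul(matmul(norm, features), weight)
    return [[max(0.0, v) for v in row] for row in out]  # ReLU


# Node 0 is a compound, node 1 is a kinase, joined by one known interaction.
adj = [[0.0, 1.0], [1.0, 0.0]]
features = [[1.0, 0.0], [0.0, 1.0]]
weight = [[1.0, -1.0], [1.0, 1.0]]
updated = gcn_layer(adj, features, weight)
```

After one step both nodes carry the same representation because each has averaged its own features with its only neighbor's; stacking layers spreads information over longer paths.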
Affiliation(s)
- Cong Shen
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
- Jiawei Luo
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
- Wenjue Ouyang
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
- Pingjian Ding
  - School of Computer Science, University of South China, Hengyang, 421001, China
- Xiangtao Chen
  - College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, China
|