1
|
Kumar AA, Bhandary S, Hegde SG, Chatterjee J. Knowledge graph applications and multi-relation learning for drug repurposing: A scoping review. Comput Biol Chem 2025; 115:108364. [PMID: 39914071 DOI: 10.1016/j.compbiolchem.2025.108364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 01/17/2025] [Accepted: 01/23/2025] [Indexed: 02/26/2025]
Abstract
OBJECTIVE Development of novel drug solutions has always been an expensive endeavour, hence drug repurposing as an approach has gained popularity in recent years. In this review we intend to examine one of the most unique computational methods for drug repurposing, that being knowledge graphs. METHOD Through literature review we looked at the application of knowledge graphs in medicine, specifically at its use in drug repurposing. We also looked at literature embedding methods, integration of machine learning models and approaches to completion of knowledge graphs. RESULT After filtering 43 papers were used for analysis. Timeline, country distribution, application areas of knowledge graph was highlighted. General trends in the use of knowledge graphs for drug repurposing and any shortcomings of the approach was discussed. CONCLUSION This approach has gained popularity only very recently; hence it is in a nascent phase.
Collapse
Affiliation(s)
- A Arun Kumar
- Department of Biotechnology, PES University, Bangalore 560085, India
| | - Samarth Bhandary
- Department of Biotechnology, PES University, Bangalore 560085, India
| | | | - Jhinuk Chatterjee
- Department of Biotechnology, PES University, Bangalore 560085, India.
| |
Collapse
|
2
|
Selote R, Makhijani R. A knowledge graph approach to drug repurposing for Alzheimer's, Parkinson's and Glioma using drug-disease-gene associations. Comput Biol Chem 2025; 115:108302. [PMID: 39693851 DOI: 10.1016/j.compbiolchem.2024.108302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 11/06/2024] [Accepted: 11/26/2024] [Indexed: 12/20/2024]
Abstract
Drug Repurposing gives us facility to find the new uses of previously developed drugs rather than developing new drugs from start. Particularly during pandemic, drug repurposing caught much attention to provide new applications of the previously approved drugs. In our research, we provide a novel method for drug repurposing based on feature learning process from drug-disease-gene network. In our research, we aimed at finding drug candidates which can be repurposed under neurodegenerative diseases and glioma. We collected association data between drugs, diseases and genes from public resources and primarily examined the data related to Alzheimer's, Parkinson's and Glioma diseases. We created a Knowledge Graph using neo4j by integrating all these datasets and applied scalable feature learning algorithm known as node2vec to create node embeddings. These embeddings were later used to predict the unknown associations between disease and their candidate drugs by finding cosine similarity between disease and drug nodes embedding. We obtained a definitive set of candidate drugs for repurposing. These results were validated from the literature and CodReS online tool to rank the candidate drugs. Additionally, we verified the status of candidate drugs from pharmaceutical knowledge databases to confirm their significance.
Collapse
Affiliation(s)
- Ruchira Selote
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Nagpur, India.
| | - Richa Makhijani
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Nagpur, India.
| |
Collapse
|
3
|
Galluzzo Y. A comprehensive review of the data and knowledge graphs approaches in bioinformatics. COMPUTER SCIENCE AND INFORMATION SYSTEMS 2024; 21:1055-1075. [DOI: 10.2298/csis230530027g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
The scientific community is currently showing strong interest in constructing knowledge graphs from heterogeneous domains (genomic, pharmaceutical, clinical etc.). The main goal here is to support researchers in gaining an immediate overview of the biomedical and clinical data that can be utilized to construct and extend KGs. A in-depth overview of the available biomedical data and the latest applications of knowledge graphs, from the biological to the clinical context, is provided showing the most recent methods of representing biomedical knowledge with embeddings (KGEs). Furthermore, this review, differentiates biomedical databases based on their construction process (whether manually curated by experts or not), aiming to offer a detailed overview and guide researchers in selecting the appropriate database for their research considering to the specific project needs, available resources, and data complexity. In conclusion, the review highlights current challenges: integration of different knowledge graphs and the interpretability of predictions of new relations.
Collapse
|
4
|
Quan Y, Xiong ZK, Zhang KX, Zhang QY, Zhang W, Zhang HY. Evolution-strengthened knowledge graph enables predicting the targetability and druggability of genes. PNAS NEXUS 2023; 2:pgad147. [PMID: 37188275 PMCID: PMC10178923 DOI: 10.1093/pnasnexus/pgad147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Accepted: 04/21/2023] [Indexed: 05/17/2023]
Abstract
Identifying promising targets is a critical step in modern drug discovery, with causative genes of diseases that are an important source of successful targets. Previous studies have found that the pathogeneses of various diseases are closely related to the evolutionary events of organisms. Accordingly, evolutionary knowledge can facilitate the prediction of causative genes and further accelerate target identification. With the development of modern biotechnology, massive biomedical data have been accumulated, and knowledge graphs (KGs) have emerged as a powerful approach for integrating and utilizing vast amounts of data. In this study, we constructed an evolution-strengthened knowledge graph (ESKG) and validated applications of ESKG in the identification of causative genes. More importantly, we developed an ESKG-based machine learning model named GraphEvo, which can effectively predict the targetability and the druggability of genes. We further investigated the explainability of the ESKG in druggability prediction by dissecting the evolutionary hallmarks of successful targets. Our study highlights the importance of evolutionary knowledge in biomedical research and demonstrates the potential power of ESKG in promising target identification. The data set of ESKG and the code of GraphEvo can be downloaded from https://github.com/Zhankun-Xiong/GraphEvo.
Collapse
Affiliation(s)
| | | | - Ke-Xin Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
| | - Qing-Ye Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
| | - Wen Zhang
- To whom correspondence should be addressed: ;
| | | |
Collapse
|
5
|
Zhao C, Wang H, Qi W, Liu S. Toward drug-miRNA resistance association prediction by positional encoding graph neural network and multi-channel neural network. Methods 2022; 207:81-89. [PMID: 36167292 DOI: 10.1016/j.ymeth.2022.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 09/01/2022] [Accepted: 09/18/2022] [Indexed: 10/31/2022] Open
Abstract
Drug discovery is a costly and time-consuming process, and most drugs exert therapeutic efficacy by targeting specific proteins. However, there are a large number of proteins that are not targeted by any drug. Recently, miRNA-based therapeutics are becoming increasingly important, since miRNA can regulate the expressions of specific genes and affect a variety of human diseases. Therefore, it is of great significance to study the associations between miRNAs and drugs to enable drug discovery and disease treatment. In this work, we propose a novel method named DMR-PEG, which facilitates drug-miRNA resistance association (DMRA) prediction by leveraging positional encoding graph neural network with layer attention (LAPEG) and multi-channel neural network (MNN). LAPEG considers both the potential information in the miRNA-drug resistance heterogeneous network and the specific characteristics of entities (i.e., drugs and miRNAs) to learn favorable representations of drugs and miRNAs. And MNN models various sophisticated relations and synthesizes the predictions from different perspectives effectively. In the comprehensive experiments, DMR-PEG achieves the area under the precision-recall curve (AUPR) score of 0.2793 and the area under the receiver-operating characteristic curve (AUC) score of 0.9475, which outperforms the most state-of-the-art methods. Further experimental results show that our proposed method has good robustness and stability. The ablation study demonstrates each component in DMR-PEG is essential for drug-miRNA drug resistance association prediction. And real-world case study presents that DMR-PEG is promising for DMRA inference.
Collapse
Affiliation(s)
- Chengshuai Zhao
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Haorui Wang
- School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Weiwei Qi
- Hubei Bailianhe Pumped-storage Power Station, Wuhan 430074, China
| | - Shichao Liu
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
6
|
Pavel A, Saarimäki LA, Möbus L, Federico A, Serra A, Greco D. The potential of a data centred approach & knowledge graph data representation in chemical safety and drug design. Comput Struct Biotechnol J 2022; 20:4837-4849. [PMID: 36147662 PMCID: PMC9464643 DOI: 10.1016/j.csbj.2022.08.061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 08/26/2022] [Accepted: 08/26/2022] [Indexed: 11/20/2022] Open
Abstract
Big Data pervades nearly all areas of life sciences, yet the analysis of large integrated data sets remains a major challenge. Moreover, the field of life sciences is highly fragmented and, consequently, so is its data, knowledge, and standards. This, in turn, makes integrated data analysis and knowledge gathering across sub-fields a demanding task. At the same time, the integration of various research angles and data types is crucial for modelling the complexity of organisms and biological processes in a holistic manner. This is especially valid in the context of drug development and chemical safety assessment where computational methods can provide solutions for the urgent need of fast, effective, and sustainable approaches. At the same time, such computational methods require the development of methodologies suitable for an integrated and data centred Big Data view. Here we discuss Knowledge Graphs (KG) as a solution to a data centred analysis approach for drug and chemical development and safety assessment. KGs are knowledge bases, data analysis engines, and knowledge discovery systems all in one, allowing them to be used from simple data retrieval, over meta-analysis to complex predictive and knowledge discovery systems. Therefore, KGs have immense potential to advance the data centred approach, the re-usability, and informativity of data. Furthermore, they can improve the power of analysis, and the complexity of modelled processes, all while providing knowledge in a natively human understandable network data model.
Collapse
Affiliation(s)
- Alisa Pavel
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.,BioMediTech Institute, Tampere University, Tampere, Finland.,Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere, Finland
| | - Laura A Saarimäki
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.,BioMediTech Institute, Tampere University, Tampere, Finland.,Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere, Finland
| | - Lena Möbus
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.,BioMediTech Institute, Tampere University, Tampere, Finland.,Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere, Finland
| | - Antonio Federico
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.,BioMediTech Institute, Tampere University, Tampere, Finland.,Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere, Finland
| | - Angela Serra
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.,BioMediTech Institute, Tampere University, Tampere, Finland.,Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere, Finland
| | - Dario Greco
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.,BioMediTech Institute, Tampere University, Tampere, Finland.,Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere, Finland.,Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| |
Collapse
|
7
|
Lou Z, Cheng Z, Li H, Teng Z, Liu Y, Tian Z. Predicting miRNA-disease associations via learning multimodal networks and fusing mixed neighborhood information. Brief Bioinform 2022; 23:6582005. [PMID: 35524503 DOI: 10.1093/bib/bbac159] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/29/2022] [Accepted: 04/10/2022] [Indexed: 12/13/2022] Open
Abstract
MOTIVATION In recent years, a large number of biological experiments have strongly shown that miRNAs play an important role in understanding disease pathogenesis. The discovery of miRNA-disease associations is beneficial for disease diagnosis and treatment. Since inferring these associations through biological experiments is time-consuming and expensive, researchers have sought to identify the associations utilizing computational approaches. Graph Convolutional Networks (GCNs), which exhibit excellent performance in link prediction problems, have been successfully used in miRNA-disease association prediction. However, GCNs only consider 1st-order neighborhood information at one layer but fail to capture information from high-order neighbors to learn miRNA and disease representations through information propagation. Therefore, how to aggregate information from high-order neighborhood effectively in an explicit way is still challenging. RESULTS To address such a challenge, we propose a novel method called mixed neighborhood information for miRNA-disease association (MINIMDA), which could fuse mixed high-order neighborhood information of miRNAs and diseases in multimodal networks. First, MINIMDA constructs the integrated miRNA similarity network and integrated disease similarity network respectively with their multisource information. Then, the embedding representations of miRNAs and diseases are obtained by fusing mixed high-order neighborhood information from multimodal network which are the integrated miRNA similarity network, integrated disease similarity network and the miRNA-disease association networks. Finally, we concentrate the multimodal embedding representations of miRNAs and diseases and feed them into the multilayer perceptron (MLP) to predict their underlying associations. Extensive experimental results show that MINIMDA is superior to other state-of-the-art methods overall. Moreover, the outstanding performance on case studies for esophageal cancer, colon tumor and lung cancer further demonstrates the effectiveness of MINIMDA. AVAILABILITY AND IMPLEMENTATION https://github.com/chengxu123/MINIMDA and http://120.79.173.96/.
Collapse
Affiliation(s)
- Zhengzheng Lou
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| | - Zhaoxu Cheng
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| | - Hui Li
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| | - Zhixia Teng
- College of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, China
| | - Yang Liu
- Departments of Cerebrovascular Diseases, The Second Affiliated Hospital of Zhengzhou University, Zhengzhou 450000, China
| | - Zhen Tian
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| |
Collapse
|