1
Jia X, Sun X, Wang K, Li M. DRGCL: Drug Repositioning via Semantic-Enriched Graph Contrastive Learning. IEEE J Biomed Health Inform 2025; 29:1656-1667. [PMID: 38437145 DOI: 10.1109/jbhi.2024.3372527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/06/2024]
Abstract
Drug repositioning greatly reduces drug development costs and time by discovering new indications for existing drugs. With the development of technology and large-scale biological databases, computational drug repositioning, which can narrow down repositioning candidates, has attracted increasing attention. Recently, graph neural networks (GNNs) have been widely used and have achieved promising results in drug repositioning. However, existing GNN-based methods usually focus on modeling the complex drug-disease association graph but ignore the semantic information on the graph, which may leave the learned features lacking consistency between global topology information and local semantic information. To alleviate this problem, we propose a novel drug repositioning model based on graph contrastive learning, termed DRGCL. First, we treat the known drug-disease associations as the topology graph. Second, we select the top-k most similar neighbors from drug/disease similarity information to construct the semantic graph rather than using a traditional data augmentation strategy, thereby retaining as much semantic information as possible. Finally, we use graph contrastive learning to pull the embeddings from the different embedding spaces toward consistency, enhancing the topology and semantic features on the graph. We evaluated DRGCL on four benchmark datasets, and the experimental results show that DRGCL is superior to state-of-the-art methods. In particular, the average result of DRGCL is 11.92% higher than that of the second-best method in terms of AUPRC. The case studies further demonstrate the reliability of DRGCL.
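The semantic-graph step described in this abstract (linking each drug or disease to its most similar neighbors) could look roughly like the sketch below. It is not the DRGCL implementation: the function name `top_k_semantic_graph`, the toy similarity matrix, the choice of `k`, and the decision to symmetrize the graph are all assumptions made for illustration.

```python
def top_k_semantic_graph(similarity, k):
    """Build a 0/1 adjacency matrix linking each node to its k most similar neighbors.

    `similarity` is a square list-of-lists of pairwise similarity scores.
    Symmetrizing the result is an assumption of this sketch, not a detail
    taken from the paper.
    """
    n = len(similarity)
    adjacency = [[0] * n for _ in range(n)]
    for i in range(n):
        # Rank every other node by its similarity to node i (self excluded).
        neighbors = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: similarity[i][j],
            reverse=True,
        )
        for j in neighbors[:k]:
            adjacency[i][j] = 1
            adjacency[j][i] = 1  # keep the graph undirected
    return adjacency


# Hypothetical drug-drug similarity matrix for four drugs.
drug_similarity = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.3, 0.4],
    [0.2, 0.3, 1.0, 0.8],
    [0.1, 0.4, 0.8, 1.0],
]
semantic_graph = top_k_semantic_graph(drug_similarity, k=1)
```

With `k=1`, drugs 0 and 1 are linked to each other, as are drugs 2 and 3. A graph built this way comes from the similarity data itself, in contrast to randomly perturbing the association graph as in augmentation-based contrastive setups.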
2
Xia Y, Xiong A, Zhang Z, Zou Q, Cui F. A comprehensive review of deep learning-based approaches for drug-drug interaction prediction. Brief Funct Genomics 2025; 24:elae052. [PMID: 39987494 PMCID: PMC11847217 DOI: 10.1093/bfgp/elae052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/30/2024] [Revised: 07/29/2024] [Accepted: 02/21/2025] [Indexed: 02/25/2025] Open
Abstract
Deep learning models have made significant progress in the biomedical field, particularly in the prediction of drug-drug interactions (DDIs). DDIs are pharmacodynamic reactions between two or more drugs in the body, which may lead to adverse effects and are of great significance for drug development and clinical research. However, predicting DDI through traditional clinical trials and experiments is not only costly but also time-consuming. When utilizing advanced Artificial Intelligence (AI) and deep learning techniques, both developers and users face multiple challenges, including the problem of acquiring and encoding data, as well as the difficulty of designing computational methods. In this paper, we review a variety of DDI prediction methods, including similarity-based, network-based, and integration-based approaches, to provide an up-to-date and easy-to-understand guide for researchers in different fields. Additionally, we provide an in-depth analysis of widely used molecular representations and a systematic exposition of the theoretical framework of models used to extract features from graph data.
Affiliation(s)
- Yan Xia
  - School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
- An Xiong
  - School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
- Zilong Zhang
  - School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
- Quan Zou
  - Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, No. 4, Section 2, Jianshe North Road, Chengdu, Sichuan Province, 610054, China
  - Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, No. 1, Chengdian Road, Kecheng District, Quzhou, Zhejiang Province, 324000, China
- Feifei Cui
  - School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
3
Du X, Sun X, Li M. Knowledge Graph Convolutional Network with Heuristic Search for Drug Repositioning. J Chem Inf Model 2024; 64:4928-4937. [PMID: 38837744 DOI: 10.1021/acs.jcim.4c00737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/07/2024]
Abstract
Drug repositioning is a strategy of repurposing approved drugs for treating new indications, which can accelerate the drug discovery process, reduce development costs, and lower the safety risk. The advancement of biotechnology has significantly accelerated the speed and scale of biological data generation, offering significant potential for drug repositioning through biomedical knowledge graphs that integrate diverse entities and relations from various biomedical sources. To fully learn the semantic information and topological structure information from the biological knowledge graph, we propose a knowledge graph convolutional network with a heuristic search, named KGCNH, which can effectively utilize the diversity of entities and relationships in biological knowledge graphs, as well as topological structure information, to predict the associations between drugs and diseases. Specifically, we design a relation-aware attention mechanism to compute the attention scores for each neighboring entity of a given entity under different relations. Because the randomness of the initial attention scores may affect model performance, and to expand the search scope of the model, we design a heuristic search module based on Gumbel-Softmax, which uses attention scores as heuristic information and introduces randomness to help the model explore better embeddings of drugs and diseases. Following this module, we derive the relation weights, obtain the embeddings of drugs and diseases through neighborhood aggregation, and then predict drug-disease associations. Additionally, we employ feature-based augmented views to enhance model robustness and mitigate overfitting. We have implemented our method and conducted experiments on two data sets. The results demonstrate that KGCNH outperforms competing methods. In particular, case studies on lithium and quetiapine confirm that KGCNH can retrieve more actual drug-disease associations in the top prediction results.
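The abstract describes a module that treats attention scores as heuristic information and adds randomness through Gumbel-Softmax. A minimal sketch of the standard Gumbel-Softmax relaxation follows; it is not the KGCNH code. The function name, the example scores, the temperature value, and the decision to treat attention scores directly as logits are assumptions for illustration.

```python
import math
import random


def gumbel_softmax(scores, temperature=1.0, rng=random):
    """Perturb scores with Gumbel(0, 1) noise, then apply a temperature-scaled softmax.

    `scores` are treated as unnormalized logits (an assumption of this sketch).
    `rng` may be the `random` module or a `random.Random` instance.
    """
    perturbed = []
    for score in scores:
        # Clamp the uniform sample away from 0 and 1 so both logs are defined.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel_noise = -math.log(-math.log(u))
        perturbed.append((score + gumbel_noise) / temperature)
    # Numerically stable softmax over the perturbed scores.
    peak = max(perturbed)
    exps = [math.exp(p - peak) for p in perturbed]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical attention scores for three neighboring entities.
attention_scores = [2.0, 0.5, -1.0]
weights = gumbel_softmax(attention_scores, temperature=0.5, rng=random.Random(0))
```

Lower temperatures push the output toward a one-hot choice, while higher temperatures keep it closer to a smooth distribution. The noise means a neighbor with a lower score is occasionally favored, which is the exploratory behavior the abstract attributes to the heuristic search.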
Affiliation(s)
- Xiang Du
  - School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
  - School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi 341000, China
- Xinliang Sun
  - School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
- Min Li
  - School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
4
Wen JW, Zhang HL, Du PF. Vislocas: Vision transformers for identifying protein subcellular mis-localization signatures of different cancer subtypes from immunohistochemistry images. Comput Biol Med 2024; 174:108392. [PMID: 38608321 DOI: 10.1016/j.compbiomed.2024.108392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2024] [Revised: 03/22/2024] [Accepted: 04/01/2024] [Indexed: 04/14/2024]
Abstract
Proteins must be sorted to specific subcellular compartments to perform their functions. Abnormal protein subcellular localizations are related to many diseases. Although many efforts have been made to predict protein subcellular localization from various kinds of static information, including sequences, structures and interactions, such static information cannot predict protein mis-localization events in diseases. In contrast, immunohistochemistry (IHC) images, which have been widely applied in clinical diagnosis, contain information that can be used to find protein mis-localization events in disease states. In this study, we create the Vislocas method, which is capable of finding mis-localized proteins from IHC images as markers of cancer subtypes. By combining CNNs and vision transformer encoders, Vislocas can automatically extract image features at both the global and local levels. Vislocas can be trained with full-sized IHC images from scratch. It is the first attempt to create an end-to-end IHC image-based protein subcellular location predictor. Vislocas achieved comparable or better performance than state-of-the-art methods. We applied Vislocas to find significant protein mis-localization events in different subtypes of glioma, melanoma and skin cancer. The mis-localized proteins, which were found purely from IHC images by Vislocas, are consistent with clinical or experimental results in the literature. All code for Vislocas has been deposited in a GitHub repository (https://github.com/JingwenWen99/Vislocas). All datasets for Vislocas have been deposited in Zenodo (https://zenodo.org/records/10632698).
Affiliation(s)
- Jing-Wen Wen
  - College of Intelligence and Computing, Tianjin University, Tianjin, 300350, China.
- Han-Lin Zhang
  - College of Intelligence and Computing, Tianjin University, Tianjin, 300350, China.
- Pu-Feng Du
  - College of Intelligence and Computing, Tianjin University, Tianjin, 300350, China.
5
Wang Y, Zhang Z, Piao C, Huang Y, Zhang Y, Zhang C, Lu YJ, Liu D. LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening. Health Inf Sci Syst 2023; 11:42. [PMID: 37667773 PMCID: PMC10475000 DOI: 10.1007/s13755-023-00243-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 05/26/2023] [Accepted: 08/14/2023] [Indexed: 09/06/2023] Open
Abstract
Background: Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types make such studies difficult because their formats are incompatible and heterogeneous. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.
Method: Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which uses unified encoding to process the different data formats in an integrated model to realize feature abstraction and potential object prediction.
Result: On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.
Conclusion: In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for processing large-scale DTI data in different formats with deep learning methods and offers suggestions for future research.
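For readers unfamiliar with the metrics reported in this abstract, the sketch below shows one common way to compute AUC (as the probability that a randomly chosen positive pair is scored above a randomly chosen negative pair) and accuracy at a fixed threshold. The labels, scores, and the 0.5 threshold are invented for illustration and are not data from the paper. The quadratic pairwise loop is chosen for clarity and would not scale to datasets of the size the abstract mentions.

```python
def pairwise_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count as half."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def accuracy(labels, scores, threshold=0.5):
    """Fraction of examples whose thresholded score matches the label."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


# Hypothetical interaction labels (1 = interacts) and model scores.
toy_labels = [1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.1]
toy_auc = pairwise_auc(toy_labels, toy_scores)
toy_accuracy = accuracy(toy_labels, toy_scores)
```

On this toy data, three of the four positive-negative pairs are ranked correctly, and three of the four examples are classified correctly at the 0.5 threshold.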
Affiliation(s)
- Yang Wang
  - School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, 510006 China
- Zuxian Zhang
  - School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, Guangzhou, 510006 China
- Chenghong Piao
  - The First Affiliated Hospital of Ningbo University, Ningbo, 315010 China
- Ying Huang
  - School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, Guangzhou, 510006 China
- Yihan Zhang
  - School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, Guangzhou, 510006 China
- Chi Zhang
  - Shanghai Institute of Biological Products, Shanghai, 201403 China
- Yu-Jing Lu
  - School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, Guangzhou, 510006 China
  - Smart Medical Innovation Technology Center, Guangdong University of Technology, Guangzhou, 510006 China
- Dongning Liu
  - School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, 510006 China