1
|
Guo H, Zhang C, Shang J, Zhang D, Guo Y, Gao K, Yang K, Gao X, Yao D, Chen W, Yan M, Wu G. Drug-Target Affinity Prediction Based on Topological Enhanced Graph Neural Networks. J Chem Inf Model 2025; 65:3749-3760. [PMID: 40159647 DOI: 10.1021/acs.jcim.4c01335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Graph neural networks (GNNs) have achieved remarkable success in drug-target affinity (DTA) analysis, reducing the cost of drug development. Unlike traditional one-dimensional (1D) sequence-based methods, GNNs leverage graph structures to capture richer protein and drug features, leading to improved DTA prediction performance. However, existing methods often neglect to incorporate valuable protein cavity information, a key aspect of protein physical chemistry. This study addresses this gap by proposing a novel topology-enhanced GNN for DTA prediction that integrates protein pocket data. Additionally, we optimize training and message-passing strategies to enhance the model's feature representation capabilities. Our model's effectiveness is validated on the Davis and KIBA data sets, demonstrating its ability to capture the intricate interplay between drugs and targets. The source code is publicly available on https://github.com/ZZDXgangwu/DTA.
Collapse
Affiliation(s)
- Hengliang Guo
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Congxiang Zhang
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Jiandong Shang
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Dujuan Zhang
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Yang Guo
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Kang Gao
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Kecheng Yang
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Xu Gao
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Dezhong Yao
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Technology, Huazhong University of Science and Technology, Wuhan 420100, China
| | - Wanting Chen
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Mengfan Yan
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Gang Wu
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| |
Collapse
|
2
|
Wang G, Zhang H, Shao M, Sun S, Cao C. MMPD-DTA: Integrating Multi-Modal Deep Learning with Pocket-Drug Graphs for Drug-Target Binding Affinity Prediction. J Chem Inf Model 2025; 65:1615-1630. [PMID: 39833138 DOI: 10.1021/acs.jcim.4c01528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Predicting drug-target binding affinity (DTA) is a crucial task in drug discovery research. Recent studies have demonstrated that pocket features and interactions between targets and drugs significantly improve the understanding of DTA. However, challenges remain, particularly in the detailed consideration of both global and local information and the further modeling of pocket features. In this paper, we propose a novel multimodal deep learning model named MMPD-DTA for predicting drug-target binding affinity to address these challenges. The MMPD-DTA model integrates graph and sequence modalities of targets, pockets, and drugs to capture both global and local target and drug information. The model introduces a novel pocket-drug graph (PD graph) that simultaneously models atomic interactions within the target, within the drug, and between the target and drug. We employ GraphSAGE for graph representation learning from the PD graph, complemented by sequence representation learning via transformers for the target sequence and graph representation learning via a graph isomorphism network for the drug molecular graph. These multimodal representations are then concatenated, and a multilayer perceptron generates the final binding affinity predictions. Experimental results on three real-world test sets demonstrate that the MMPD-DTA model outperforms baseline methods. Ablation studies further confirm the effectiveness of each module within the MMPD-DTA model. Our code is available at https://github.com/zhc-moushang/MMPD-DTA.
Collapse
Affiliation(s)
- Guishen Wang
- College of Computer Science and Engineering, Changchun University of Technology, North Yuanda Street No. 3000, Jilin 130012, China
| | - Hangchen Zhang
- College of Computer Science and Engineering, Changchun University of Technology, North Yuanda Street No. 3000, Jilin 130012, China
| | - Mengting Shao
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Longmian Avenue No. 101, Jiangsu 211166, China
| | - Shisen Sun
- College of Computer Science and Engineering, Changchun University of Technology, North Yuanda Street No. 3000, Jilin 130012, China
| | - Chen Cao
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Longmian Avenue No. 101, Jiangsu 211166, China
| |
Collapse
|
3
|
Lavecchia A. Advancing drug discovery with deep attention neural networks. Drug Discov Today 2024; 29:104067. [PMID: 38925473 DOI: 10.1016/j.drudis.2024.104067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 06/10/2024] [Accepted: 06/19/2024] [Indexed: 06/28/2024]
Abstract
In the dynamic field of drug discovery, deep attention neural networks are revolutionizing our approach to complex data. This review explores the attention mechanism and its extended architectures, including graph attention networks (GATs), transformers, bidirectional encoder representations from transformers (BERT), generative pre-trained transformers (GPTs) and bidirectional and auto-regressive transformers (BART). Delving into their core principles and multifaceted applications, we uncover their pivotal roles in catalyzing de novo drug design, predicting intricate molecular properties and deciphering elusive drug-target interactions. Despite challenges, these attention-based architectures hold unparalleled promise to drive transformative breakthroughs and accelerate progress in pharmaceutical research.
Collapse
Affiliation(s)
- Antonio Lavecchia
- Drug Discovery Laboratory, Department of Pharmacy, University of Napoli Federico II, I-80131 Naples, Italy.
| |
Collapse
|
4
|
Wang Z, Feng B, Gao Q, Wang Y, Yang Y, Luo B, Zhang Q, Wang F, Li B. A prediction method of interaction based on Bilinear Attention Networks for designing polyphenol-protein complexes delivery systems. Int J Biol Macromol 2024; 269:131959. [PMID: 38692548 DOI: 10.1016/j.ijbiomac.2024.131959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 04/20/2024] [Accepted: 04/27/2024] [Indexed: 05/03/2024]
Abstract
Polyphenol-protein complexes delivery systems are gaining attention for their potential health benefits and food industry development. However, creating an ideal delivery system requires extensive wet-lab experimentation. To address this, we collected 525 ligand-protein interaction data pairs and established an interaction prediction model using Bilinear Attention Networks. We utilized 10-fold cross validation to address potential overfitting issues in the model, resulting in showed higher average AUROC (0.8443), AUPRC (0.7872), and F1 (0.8164). The optimal threshold (0.3739) was selected for the model to be used for subsequent analysis. Based on the model prediction results and optimal threshold, by verifying experimental analysis, the interaction of paeonol with the following proteins was obtained, including bovine serum albumin (lgKa = 6.2759), bovine β-lactoglobulin (lgKa = 6.7479), egg ovalbumin (lgKa = 5.1806), zein (lgKa = 6.0122), bovine α-lactalbumin (lgKa = 3.9170), bovine lactoferrin (lgKa = 4.5380), the first four proteins are consistent with the predicted results of the model, with lgKa >5. The established model can accurately and rapidly predict the interaction of polyphenol-protein complexes. This study is the first to combine open ligand-protein interaction experiments with Deep Learning algorithms in the food industry, greatly improving research efficiency and providing a novel perspective for future complex delivery system construction.
Collapse
Affiliation(s)
- Zhipeng Wang
- Department of Food Science, Northeast Agricultural University, Harbin 150030, PR China
| | - Baolong Feng
- Center for Education Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Qizhou Gao
- Department of Food Science, Northeast Agricultural University, Harbin 150030, PR China
| | - Yutang Wang
- Institute of Agro-Products Processing Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-Products Processing, Ministry of Agriculture, Beijing 100193, PR China.
| | - Yan Yang
- Department of Food Science, Northeast Agricultural University, Harbin 150030, PR China
| | - Bowen Luo
- Department of Food Science, Northeast Agricultural University, Harbin 150030, PR China
| | - Qi Zhang
- Department of Food Science, Northeast Agricultural University, Harbin 150030, PR China
| | - Fengzhong Wang
- Institute of Agro-Products Processing Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-Products Processing, Ministry of Agriculture, Beijing 100193, PR China.
| | - Bailiang Li
- Department of Food Science, Northeast Agricultural University, Harbin 150030, PR China; Key Laboratory of Dairy Science, Ministry of Education, Northeast Agricultural University, Harbin 150030, PR China.
| |
Collapse
|
5
|
Zeng X, Li SJ, Lv SQ, Wen ML, Li Y. A comprehensive review of the recent advances on predicting drug-target affinity based on deep learning. Front Pharmacol 2024; 15:1375522. [PMID: 38628639 PMCID: PMC11019008 DOI: 10.3389/fphar.2024.1375522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 03/21/2024] [Indexed: 04/19/2024] Open
Abstract
Accurate calculation of drug-target affinity (DTA) is crucial for various applications in the pharmaceutical industry, including drug screening, design, and repurposing. However, traditional machine learning methods for calculating DTA often lack accuracy, posing a significant challenge in accurately predicting DTA. Fortunately, deep learning has emerged as a promising approach in computational biology, leading to the development of various deep learning-based methods for DTA prediction. To support researchers in developing novel and highly precision methods, we have provided a comprehensive review of recent advances in predicting DTA using deep learning. We firstly conducted a statistical analysis of commonly used public datasets, providing essential information and introducing the used fields of these datasets. We further explored the common representations of sequences and structures of drugs and targets. These analyses served as the foundation for constructing DTA prediction methods based on deep learning. Next, we focused on explaining how deep learning models, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Transformer, and Graph Neural Networks (GNNs), were effectively employed in specific DTA prediction methods. We highlighted the unique advantages and applications of these models in the context of DTA prediction. Finally, we conducted a performance analysis of multiple state-of-the-art methods for predicting DTA based on deep learning. The comprehensive review aimed to help researchers understand the shortcomings and advantages of existing methods, and further develop high-precision DTA prediction tool to promote the development of drug discovery.
Collapse
Affiliation(s)
- Xin Zeng
- College of Mathematics and Computer Science, Dali University, Dali, China
| | - Shu-Juan Li
- Yunnan Institute of Endemic Diseases Control and Prevention, Dali, China
| | - Shuang-Qing Lv
- Institute of Surveying and Information Engineering West Yunnan University of Applied Science, Dali, China
| | - Meng-Liang Wen
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China
| | - Yi Li
- College of Mathematics and Computer Science, Dali University, Dali, China
| |
Collapse
|
6
|
Nitulescu GM. Techniques and Strategies in Drug Design and Discovery. Int J Mol Sci 2024; 25:1364. [PMID: 38338643 PMCID: PMC10855429 DOI: 10.3390/ijms25031364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 02/12/2024] Open
Abstract
The process of drug discovery constitutes a highly intricate and formidable undertaking, encompassing the identification and advancement of novel therapeutic entities [...].
Collapse
Affiliation(s)
- George Mihai Nitulescu
- Faculty of Pharmacy, "Carol Davila" University of Medicine and Pharmacy, 6 Traian Vuia Street, 020956 Bucharest, Romania
| |
Collapse
|
7
|
Zhang Y, Liu C, Liu M, Liu T, Lin H, Huang CB, Ning L. Attention is all you need: utilizing attention in AI-enabled drug discovery. Brief Bioinform 2023; 25:bbad467. [PMID: 38189543 PMCID: PMC10772984 DOI: 10.1093/bib/bbad467] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 11/03/2023] [Accepted: 11/25/2023] [Indexed: 01/09/2024] Open
Abstract
Recently, attention mechanism and derived models have gained significant traction in drug development due to their outstanding performance and interpretability in handling complex data structures. This review offers an in-depth exploration of the principles underlying attention-based models and their advantages in drug discovery. We further elaborate on their applications in various aspects of drug development, from molecular screening and target binding to property prediction and molecule generation. Finally, we discuss the current challenges faced in the application of attention mechanisms and Artificial Intelligence technologies, including data quality, model interpretability and computational resource constraints, along with future directions for research. Given the accelerating pace of technological advancement, we believe that attention-based models will have an increasingly prominent role in future drug discovery. We anticipate that these models will usher in revolutionary breakthroughs in the pharmaceutical domain, significantly accelerating the pace of drug development.
Collapse
Affiliation(s)
- Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Caiqi Liu
- Department of Gastrointestinal Medical Oncology, Harbin Medical University Cancer Hospital, No.150 Haping Road, Nangang District, Harbin, Heilongjiang 150081, China
- Key Laboratory of Molecular Oncology of Heilongjiang Province, No.150 Haping Road, Nangang District, Harbin, Heilongjiang 150081, China
| | - Mujiexin Liu
- Chongqing Key Laboratory of Sichuan-Chongqing Co-construction for Diagnosis and Treatment of Infectious Diseases Integrated Traditional Chinese and Western Medicine, College of Medical Technology, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Tianyuan Liu
- Graduate School of Science and Technology, University of Tsukuba, Tsukuba, Japan
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Cheng-Bing Huang
- School of Computer Science and Technology, Aba Teachers University, Aba, China
| | - Lin Ning
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| |
Collapse
|
8
|
Zulfiqar H, Ahmed Z, Kissanga Grace-Mercure B, Hassan F, Zhang ZY, Liu F. Computational prediction of promotors in Agrobacterium tumefaciens strain C58 by using the machine learning technique. Front Microbiol 2023; 14:1170785. [PMID: 37125199 PMCID: PMC10133480 DOI: 10.3389/fmicb.2023.1170785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 03/17/2023] [Indexed: 05/02/2023] Open
Abstract
Promotors are those genomic regions on the upstream of genes, which are bound by RNA polymerase for starting gene transcription. Because it is the most critical element of gene expression, the recognition of promoters is crucial to understand the regulation of gene expression. This study aimed to develop a machine learning-based model to predict promotors in Agrobacterium tumefaciens (A. tumefaciens) strain C58. In the model, promotor sequences were encoded by three different kinds of feature descriptors, namely, accumulated nucleotide frequency, k-mer nucleotide composition, and binary encodings. The obtained features were optimized by using correlation and the mRMR-based algorithm. These optimized features were inputted into a random forest (RF) classifier to discriminate promotor sequences from non-promotor sequences in A. tumefaciens strain C58. The examination of 10-fold cross-validation showed that the proposed model could yield an overall accuracy of 0.837. This model will provide help for the study of promoters in A. tumefaciens C58 strain.
Collapse
Affiliation(s)
- Hasan Zulfiqar
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zahoor Ahmed
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
| | - Bakanina Kissanga Grace-Mercure
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Farwa Hassan
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhao-Yue Zhang
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Fen Liu
- Department of Radiation Oncology, Peking University Cancer Hospital (Inner Mongolia Campus), Affiliated Cancer Hospital of Inner Mongolia Medical University, Inner Mongolia Cancer Hospital, Hohhot, China
| |
Collapse
|