1
|
Yu D, Yang X, Shang Y, Yuan S, Liu Y, Liu Y. Drug-target interaction prediction based on metapaths and simplified neighbor aggregation. Methods 2025; 240:154-164. [PMID: 40288620 DOI: 10.1016/j.ymeth.2025.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2025] [Revised: 04/02/2025] [Accepted: 04/20/2025] [Indexed: 04/29/2025] Open
Abstract
Drug-target interaction (DTI) prediction is critical in drug repositioning and discovery. In current metapath-based prediction methods, attention mechanisms are often used to differentiate the importance of various neighbors, enhancing the model's expressiveness. However, in biological networks with small-scale imbalanced data, attention mechanisms are prone to interference from noise and missing data, leading to instability in weight learning, reduced efficiency, and an increased risk of overfitting. To address these issues, we propose the use of average aggregation to mitigate noise, simplify model complexity, and improve stability. Specifically, we introduce a simplified mean aggregation method for DTI prediction. This approach uses average aggregation, effectively reducing noise interference, lowering model complexity, and preventing overfitting, making it especially suitable for current biological networks. Extensive testing on three heterogeneous biological datasets shows that SNADTI outperforms 12 leading methods across two evaluation metrics, significantly reducing training time and validating its effectiveness in DTI prediction. Complexity analysis reveals that our method offers a substantial computational speed advantage over other methods on the same dataset, highlighting its enhanced efficiency. Experimental results demonstrate that SNADTI excels in prediction accuracy, stability, and reproducibility, confirming its practicality and effectiveness in DTI prediction.
Collapse
Affiliation(s)
- Di Yu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Xinyu Yang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Yifan Shang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China; Department of Biomedical Engineering, The Chinese University of Hong Kong, 999077, Hong Kong, China.
| | - Sisi Yuan
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, 28223, NC, USA
| | - Yuansheng Liu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Yiping Liu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| |
Collapse
|
2
|
Cao C, Li M, Wang C, Xu L, Zou Q, Wang Y, Han W. DGCLCMI: a deep graph collaboration learning method to predict circRNA-miRNA interactions. BMC Biol 2025; 23:104. [PMID: 40264118 PMCID: PMC12016396 DOI: 10.1186/s12915-025-02197-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2025] [Accepted: 03/25/2025] [Indexed: 04/24/2025] Open
Abstract
BACKGROUND Numerous studies have shown that circRNA can act as a miRNA sponge, competitively binding to miRNAs, thereby regulating gene expression and disease progression. Due to the high cost and time-consuming nature of traditional wet lab experiments, analyzing circRNA-miRNA associations is often inefficient and labor-intensive. Although some computational models have been developed to identify these associations, they fail to capture the deep collaborative features between circRNA and miRNA interactions and do not guide the training of feature extraction networks based on these high-order relationships, leading to poor prediction performance. RESULTS To address these issues, we innovatively propose a novel deep graph collaboration learning method for circRNA-miRNA interaction, called DGCLCMI. First, it uses word2vec to encode sequences into word embeddings. Next, we present a joint model that combines an improved neural graph collaborative filtering method with a feature extraction network for optimization. Deep interaction information is embedded as informative features within the sequence representations for prediction. Comprehensive experiments on three well-established datasets across seven metrics demonstrate that our algorithm significantly outperforms previous models, achieving an average AUC of 0.960. In addition, a case study reveals that 18 out of 20 predicted unknown CMI data points are accurate. CONCLUSIONS The DGCLCMI improves circRNA and miRNA feature representation by capturing deep collaborative information, achieving superior performance compared to prior methods. It facilitates the discovery of unknown associations and sheds light on their roles in physiological processes.
Collapse
Affiliation(s)
- Chao Cao
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, 324003, China
| | - Mengli Li
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, 324003, China
| | - Chunyu Wang
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China
| | - Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic University, Shenzhen, Guangdong, 518055, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, 324003, China
| | - Yansu Wang
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, China
| | - Wu Han
- Department of Statistics, Stanford University, Stanford, CA, 94043, USA.
| |
Collapse
|
3
|
Chen L, Li Y, Ma Y, Gao L, Yu L. Multiscale graph equivariant diffusion model for 3D molecule design. SCIENCE ADVANCES 2025; 11:eadv0778. [PMID: 40238892 DOI: 10.1126/sciadv.adv0778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Accepted: 03/07/2025] [Indexed: 04/18/2025]
Abstract
Three-dimensional molecular generation is critical in drug design. However, current methods often rely on point clouds or oversimplified interaction models, limiting their ability to accurately represent molecular structures. To address these challenges, this paper proposes the multiscale graph equivariant diffusion model for 3D molecule design (MD3MD). MD3MD partitions molecular conformations into multiscale graphs, assigning different weights to capture atomic interactions across scales. This framework guides the diffusion process, enabling high-quality 3D molecular generation. Experimental results demonstrate that MD3MD excels in both unconditional and conditional generation tasks, producing diverse, stable, and innovative molecules that meet specified conditions. Visualization highlights MD3MD's ability to learn domain-specific patterns and generate molecules distinct from existing datasets while maintaining distributional consistency. By effectively exploring chemical space, MD3MD surpasses previous methods in generating innovative and chemically diverse molecules, offering a notable advancement in the field of molecular design.
Collapse
Affiliation(s)
- Lu Chen
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China
| | - Yan Li
- School of Management, Xi'an Polytechnic University, Xi'an 710000, Shaanxi, China
| | - Yanjie Ma
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China
| | - Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China
| |
Collapse
|
4
|
Shang Y, Wang Z, Chen Y, Yang X, Ren Z, Zeng X, Xu L. HNF-DDA: subgraph contrastive-driven transformer-style heterogeneous network embedding for drug-disease association prediction. BMC Biol 2025; 23:101. [PMID: 40241152 PMCID: PMC12004644 DOI: 10.1186/s12915-025-02206-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2024] [Accepted: 04/03/2025] [Indexed: 04/18/2025] Open
Abstract
BACKGROUND Drug-disease association (DDA) prediction aims to identify potential links between drugs and diseases, facilitating the discovery of new therapeutic potentials and reducing the cost and time associated with traditional drug development. However, existing DDA prediction methods often overlook the global relational information provided by other biological entities, and the complex association structure between drug diseases, limiting the potential correlations of drug and disease embeddings. RESULTS In this study, we propose HNF-DDA, a subgraph contrastive-driven transformer-style heterogeneous network embedding model for DDA prediction. Specifically, HNF-DDA adopts all-pairs message passing strategy to capture the global structure of the network, fully integrating multi-omics information. HNF-DDA also proposes the concept of subgraph contrastive learning to capture the local structure of drug-disease subgraphs, learning the high-order semantic information of nodes. Experimental results on two benchmark datasets demonstrate that HNF-DDA outperforms several state-of-the-art methods. Additionally, it shows superior performance across different dataset splitting schemes, indicating HNF-DDA's capability to generalize to novel drug and disease categories. Case studies for breast cancer and prostate cancer reveal that 9 out of the top 10 predicted candidate drugs for breast cancer and 8 out of the top 10 for prostate cancer have documented therapeutic effects. CONCLUSIONS HNF-DDA incorporates all-pairs message passing and subgraph capture strategies into heterogeneous network embedding, enabling effective learning of drug and disease representations enriched with heterogeneous information, while also demonstrating significant potential for applications in drug repositioning.
Collapse
Affiliation(s)
- Yifan Shang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Zixu Wang
- Department of Computer Science, University of Tsukuba, Tsukuba, 305-8577, Japan
| | - Yangyang Chen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Xinyu Yang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Zhonghao Ren
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Xiangxiang Zeng
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic University, Shenzhen, 518055, China.
| |
Collapse
|
5
|
Alqahtani S, Alqahtani T, Venkatesan K, Sivadasan D, Ahmed R, Elfadil H, Paulsamy P, Periannan K. Unveiling Pharmacogenomics Insights into Circular RNAs: Toward Precision Medicine in Cancer Therapy. Biomolecules 2025; 15:535. [PMID: 40305280 PMCID: PMC12024797 DOI: 10.3390/biom15040535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2025] [Revised: 03/27/2025] [Accepted: 04/01/2025] [Indexed: 05/02/2025] Open
Abstract
Pharmacogenomics is revolutionizing precision medicine by enabling tailored therapeutic strategies based on an individual genetic and molecular profile. Circular RNAs (circRNAs), a distinct subclass of endogenous non-coding RNAs, have recently emerged as key regulators of drug resistance, tumor progression, and therapeutic responses. Their covalently closed circular structure provides exceptional stability and resistance to exonuclease degradation, positioning them as reliable biomarkers and novel therapeutic targets in cancer management. This review provides a comprehensive analysis of the interplay between circRNAs and pharmacogenomics, focusing on their role in modulating drug metabolism, therapeutic efficacy, and toxicity profiles. We examine how circRNA-mediated regulatory networks influence chemotherapy resistance, alter targeted therapy responses, and impact immunotherapy outcomes. Additionally, we discuss emerging experimental tools and bioinformatics techniques for studying circRNAs, including multi-omics integration, machine learning-driven biomarker discovery, and high-throughput sequencing technologies. Beyond their diagnostic potential, circRNAs are being actively explored as therapeutic agents and drug delivery vehicles. Recent advancements in circRNA-based vaccines, engineered CAR-T cells, and synthetic circRNA therapeutics highlight their transformative potential in oncology. Furthermore, we address the challenges of standardization, reproducibility, and clinical translation, emphasizing the need for rigorous biomarker validation and regulatory frameworks to facilitate their integration into clinical practice. By incorporating circRNA profiling into pharmacogenomic strategies, this review underscores a paradigm shift toward highly personalized cancer therapies. circRNAs hold immense potential to overcome drug resistance, enhance treatment efficacy, and optimize patient outcomes, marking a significant advancement in precision oncology.
Collapse
Affiliation(s)
- Saud Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62521, Saudi Arabia; (S.A.); (T.A.)
| | - Taha Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62521, Saudi Arabia; (S.A.); (T.A.)
| | - Krishnaraju Venkatesan
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62521, Saudi Arabia; (S.A.); (T.A.)
| | - Durgaramani Sivadasan
- Department of Pharmaceutics, College of Pharmacy, Jazan University, P.O. Box 114, Jazan 45142, Saudi Arabia;
| | - Rehab Ahmed
- Division of Microbiology, Immunology and Biotechnology, Department of Natural Products and Alternative Medicine, Faculty of Pharmacy, University of Tabuk, Tabuk 71491, Saudi Arabia; (R.A.); (H.E.)
| | - Hassabelrasoul Elfadil
- Division of Microbiology, Immunology and Biotechnology, Department of Natural Products and Alternative Medicine, Faculty of Pharmacy, University of Tabuk, Tabuk 71491, Saudi Arabia; (R.A.); (H.E.)
| | - Premalatha Paulsamy
- College of Nursing, Mahalah Branch for Girls, King Khalid University, Abha 62521, Saudi Arabia;
| | - Kalaiselvi Periannan
- Department of Mental Health Nursing, Oxford School of Nursing & Midwifery, Faculty of Health and Life Sciences, Oxford Brookes University, Oxford OX3 0FL, UK;
| |
Collapse
|
6
|
Geng A, Luo Z, Li A, Zhang Z, Zou Q, Wei L, Cui F. ACP-CLB: An Anticancer Peptide Prediction Model Based on Multichannel Discriminative Processing and Integration of Large Pretrained Protein Language Models. J Chem Inf Model 2025; 65:2336-2349. [PMID: 39969847 DOI: 10.1021/acs.jcim.4c02072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2025]
Abstract
MOTIVATION Cancer affects millions globally, and as research advances, our understanding and treatment of cancer evolve. Compared to conventional treatments with significant side effects, anticancer peptides (ACPs) have gained considerable attention. Validating ACPs through wet-lab experiments is time-consuming and costly. However, numerous artificial intelligence methods are now used for ACP identification and classification. These methods typically apply a uniform strategy to all feature types, overlooking the potential benefits of more specialized processing for different feature types. INNOVATION In this paper, we propose a framework based on multichannel discriminative processing, where different neural networks are applied to process various feature types, optimizing their respective feature vectors. Additionally, we leverage Large Pretrained Protein Language Models to capture deeper sequence features, further enhancing the model's performance. Contributions: To better validate the overall performance and generalization ability of the model, we compared it with state-of-the-art models using four different data sets (AntiCp2Main, AntiCp2 Alternate, ACP740, cACP-DeepGram). The results show significant improvements across most metrics. Additionally, our proposed framework better assists researchers in distinguishing and identifying ACPs and further validates the need for distinct processing methods for different feature types.
Collapse
Affiliation(s)
- Aoyun Geng
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| | - Zhenjie Luo
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| | - Aohan Li
- Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo 182-8585, Japan
| | - Zilong Zhang
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Leyi Wei
- Centre for Artificial Intelligence driven Drug Discovery, Faculty of Applied Science, Macao Polytechnic University, Macao SAR 999078, China
- School of Informatics, Xiamen University, Xiamen 361000, China
| | - Feifei Cui
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| |
Collapse
|
7
|
Liang SZ, Wang L, You ZH, Yu CQ, Wei MM, Wei Y, Shi TL, Jiang C. Predicting circRNA-Disease Associations through Multisource Domain-Aware Embeddings and Feature Projection Networks. J Chem Inf Model 2025; 65:1666-1676. [PMID: 39829001 DOI: 10.1021/acs.jcim.4c02250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Recent studies have highlighted the significant role of circular RNAs (circRNAs) in various diseases. Accurately predicting circRNA-disease associations is crucial for understanding their biological functions and disease mechanisms. This work introduces the MNDCDA method, designed to address the challenges posed by the limited number of known circRNA-disease associations and the high cost of biological experiments. MNDCDA integrates multiple biological data sources with neighborhood-aware embedding models and deep feature projection networks to predict potential pathways linking circRNAs to diseases. Initially, comprehensive biometric data are used to construct four similarity networks, forming a diverse circRNA-disease interaction framework. Next, a neighborhood-aware embedding model captures structural information about circRNAs and diseases, while deep feature projection networks learn high-order feature interactions and nonlinear connections. Finally, a bilinear decoder identifies novel associations between circRNAs and diseases. The MNDCDA model achieved an AUC of 0.9070 on a constructed benchmark dataset. In case studies, 25 out of 30 predicted circRNA-disease pairs were validated through wet lab experiments and published literature. These extensive experimental results demonstrate that MNDCDA is a robust computational tool for predicting circRNA-disease associations, providing valuable insights while helping to reduce research costs.
Collapse
Affiliation(s)
- Si-Zhe Liang
- School of Information Engineering, Xijing Univerity, Xi'an 710123, China
| | - Lei Wang
- Guangxi Key Lab of Human-Machine Interaction and Intelligent Decision, Guangxi Academy of Sciences, Nanning 530007, China
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China
| | - Zhu-Hong You
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China
| | - Chang-Qing Yu
- School of Information Engineering, Xijing Univerity, Xi'an 710123, China
| | - Meng-Meng Wei
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China
| | - Yu Wei
- School of Information Engineering, Xijing Univerity, Xi'an 710123, China
| | - Tai-Long Shi
- School of Information Engineering, Xijing Univerity, Xi'an 710123, China
| | - Chen Jiang
- School of Information Engineering, Xijing Univerity, Xi'an 710123, China
| |
Collapse
|
8
|
Tian Z, Zhang Z, Zhou W, Teng Z, Song W, Zou Q. DSANIB: Drug-Target Interaction Predictions With Dual-View Synergistic Attention Network and Information Bottleneck Strategy. IEEE J Biomed Health Inform 2025; 29:1484-1493. [PMID: 40030194 DOI: 10.1109/jbhi.2024.3497591] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/06/2025]
Abstract
Prediction of drug-target interactions (DTIs) is one of the crucial steps for drug repositioning. Identifying DTIs through bio-experimental manners is always expensive and time-consuming. Recently, deep learning-based approaches have shown promising advancements in DTI prediction, but they face two notable challenges: (i) how to explicitly capture local interactions between drug-target pairs and learn their higher-order substructure embeddings; (ii) How to filter out redundant information to obtain effective embeddings for drugs and targets. Results: In this study, we propose a novel approach, termed DSANIB, to infer potential interactions between drugs and targets. DSANIB comprises two primary components: (1) DSAN component: The Inter-view Attention Network Module explicitly learns the local interactions between drugs and targets, while the Intra-view Attention Network Module aggregates information from local interaction features to obtain their higher-order substructure embeddings. (2) Information Bottleneck (IB) component: DSANIB adopts the IB strategy, which could retain relevant information while minimizing the redundant features to obtain their discriminative representations. Extensive experimental results demonstrate that DSANIB outperforms other SOTA prediction models. In addition, visualization of drug and target embeddings learned through DSANIB could provide interpretable insights for the prediction results.
Collapse
|
9
|
Xia Y, Xiong A, Zhang Z, Zou Q, Cui F. A comprehensive review of deep learning-based approaches for drug-drug interaction prediction. Brief Funct Genomics 2025; 24:elae052. [PMID: 39987494 PMCID: PMC11847217 DOI: 10.1093/bfgp/elae052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 07/29/2024] [Accepted: 02/21/2025] [Indexed: 02/25/2025] Open
Abstract
Deep learning models have made significant progress in the biomedical field, particularly in the prediction of drug-drug interactions (DDIs). DDIs are pharmacodynamic reactions between two or more drugs in the body, which may lead to adverse effects and are of great significance for drug development and clinical research. However, predicting DDI through traditional clinical trials and experiments is not only costly but also time-consuming. When utilizing advanced Artificial Intelligence (AI) and deep learning techniques, both developers and users face multiple challenges, including the problem of acquiring and encoding data, as well as the difficulty of designing computational methods. In this paper, we review a variety of DDI prediction methods, including similarity-based, network-based, and integration-based approaches, to provide an up-to-date and easy-to-understand guide for researchers in different fields. Additionally, we provide an in-depth analysis of widely used molecular representations and a systematic exposition of the theoretical framework of models used to extract features from graph data.
Collapse
Affiliation(s)
- Yan Xia
- School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
| | - An Xiong
- School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
| | - Zilong Zhang
- School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, No. 4, Section 2, Jianshe North Road, Chengdu, Sichuan Province, 610054, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, No. 1, Chengdian Road, Kecheng District, Quzhou, Zhejiang Province, 324000, China
| | - Feifei Cui
- School of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haidian Island, Haikou, Hainan Province, 570228, China
| |
Collapse
|
10
|
Cao C, Wang C, Dai Q, Zou Q, Wang T. CRBPSA: CircRNA-RBP interaction sites identification using sequence structural attention model. BMC Biol 2024; 22:260. [PMID: 39543602 PMCID: PMC11566611 DOI: 10.1186/s12915-024-02055-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Accepted: 10/30/2024] [Indexed: 11/17/2024] Open
Abstract
BACKGROUND Due to the ability of circRNA to bind with corresponding RBPs and play a critical role in gene regulation and disease prevention, numerous identification algorithms have been developed. Nevertheless, most of the current mainstream methods primarily capture one-dimensional sequence features through various descriptors, while neglecting the effective extraction of secondary structure features. Moreover, as the number of introduced descriptors increases, the issues of sparsity and ineffective representation also rise, causing a significant burden on computational models and leaving room for improvement in predictive performance. RESULTS Based on this, we focused on capturing the features of secondary structure in sequences and developed a new architecture called CRBPSA, which is based on a sequence-structure attention mechanism. Firstly, a base-pairing matrix is generated by calculating the matching probability between each base, with a Gaussian function introduced as a weight to construct the secondary structure. Then, a Structure_Transformer is employed to extract base-pairing information and spatial positional dependencies, enabling the identification of binding sites through deeper feature extraction. Experimental results using the same set of hyperparameters on 37 circRNA datasets, totaling 671,952 samples, show that the CRBPSA algorithm achieves an average AUC of 99.93%, surpassing all existing prediction methods. CONCLUSIONS CRBPSA is a lightweight and efficient prediction tool for circRNA-RBP, which can capture structural features of sequences with minimal computational resources and accurately predict protein-binding sites. This tool facilitates a deeper understanding of the biological processes and mechanisms underlying circRNA and protein interactions.
Collapse
Affiliation(s)
- Chao Cao
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
| | - Chunyu Wang
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Qi Dai
- College of Life Science and Medicine, Zhejiang Sci-Tech University, Hangzhou, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
| | - Tao Wang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China.
| |
Collapse
|
11
|
Liu YX, Song JL, Li XM, Lin H, Cao YN. Identification of target genes co-regulated by four key histone modifications of five key regions in hepatocellular carcinoma. Methods 2024; 231:165-177. [PMID: 39349287 DOI: 10.1016/j.ymeth.2024.09.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 08/27/2024] [Accepted: 09/27/2024] [Indexed: 10/02/2024] Open
Abstract
Hepatocellular carcinoma (HCC) is a cancer with high morbidity and mortality. Studies have shown that histone modification plays an important regulatory role in the occurrence and development of HCC. However, the specific regulatory effects of histone modifications on gene expression in HCC are still unclear. This study focuses on HepG2 cell lines and hepatocyte cell lines. First, the distribution of histone modification signals in the two cell lines was calculated and analyzed. Then, using the random forest algorithm, we analyzed the effects of different histone modifications and their modified regions on gene expression in the two cell lines, four key histone modifications (H3K36me3, H3K4me3, H3K79me2, and H3K9ac) and five key regions that co-regulate gene expression were obtained. Subsequently, target genes regulated by key histone modifications in key regions were screened. Combined with clinical data, Cox regression analysis and Kaplan-Meier survival analysis were performed on the target genes, and four key target genes (CBX2, CEBPZOS, LDHA, and UMPS) related to prognosis were identified. Finally, through immune infiltration analysis and drug sensitivity analysis of key target genes, the potential role of key target genes in HCC was confirmed. Our results provide a theoretical basis for exploring the occurrence of HCC and propose potential biomarkers associated with histone modifications, which may be potential drug targets for the clinical treatment of HCC.
Collapse
Affiliation(s)
- Yu-Xian Liu
- School of Artificial Intelligence, Anhui University of Science and Technology, Huainan 232001, China.
| | - Jia-Le Song
- School of Artificial Intelligence, Anhui University of Science and Technology, Huainan 232001, China
| | - Xiao-Ming Li
- School of Artificial Intelligence, Anhui University of Science and Technology, Huainan 232001, China
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, Center for Informational Biology, School of Life Sciences and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Yan-Ni Cao
- School of Artificial Intelligence, Anhui University of Science and Technology, Huainan 232001, China.
| |
Collapse
|
12
|
Zhang X, Zhao S, Su X, Xu L. From docking to dynamics: Unveiling the potential non-peptide and non-covalent inhibitors of M pro from natural products. Comput Biol Med 2024; 181:108963. [PMID: 39216402 DOI: 10.1016/j.compbiomed.2024.108963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 07/05/2024] [Accepted: 07/26/2024] [Indexed: 09/04/2024]
Abstract
MOTIVATION This study aims to investigate non-covalent and non-peptide inhibitors of Mpro, a crucial protein target, by employing a comprehensive approach that integrates molecular docking, molecular dynamics simulations, and top-hits activity predictions. The focus is on elucidating the non-covalent and non-peptide binding modes of potential inhibitors with Mpro. METHODS We employed a semi-flexible molecular docking methodology, binding score and ADME screening, which are based on structure, to screen compounds from CMNPD and HERB in silico. These methodologies allowed us to find potential candidates depending on their binding values and interactions with the binding site of main protease. To further evaluate the stability of these interactions, we conducted molecular dynamics simulations and calculated binding energies. Ultimately, a top-hits activity prediction method was employed to prioritize compounds based on their predicted inhibitory potential. RESULTS Through a combination of binding energy calculations and activity predictions, we identified six potential inhibitor molecules exhibiting promising activity against Mpro. These compounds demonstrated favorable binding interactions and stability profiles, making them attractive candidates for further experimental validation and drug development efforts targeting Mpro.
Collapse
Affiliation(s)
- Xin Zhang
- The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, Quzhou, China; Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Shulin Zhao
- The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, Quzhou, China
| | - Xi Su
- Foshan Women and Children Hospital, Foshan, China
| | - Lifeng Xu
- The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, Quzhou, China.
| |
Collapse
|
13
|
Yuan L, Zhao L, Lai J, Jiang Y, Zhang Q, Shen Z, Zheng CH, Huang DS. iCRBP-LKHA: Large convolutional kernel and hybrid channel-spatial attention for identifying circRNA-RBP interaction sites. PLoS Comput Biol 2024; 20:e1012399. [PMID: 39173070 PMCID: PMC11373821 DOI: 10.1371/journal.pcbi.1012399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 09/04/2024] [Accepted: 08/08/2024] [Indexed: 08/24/2024] Open
Abstract
Circular RNAs (circRNAs) play vital roles in transcription and translation. Identification of circRNA-RBP (RNA-binding protein) interaction sites has become a fundamental step in molecular and cell biology. Deep learning (DL)-based methods have been proposed to predict circRNA-RBP interaction sites and achieved impressive identification performance. However, those methods cannot effectively capture long-distance dependencies, and cannot effectively utilize the interaction information of multiple features. To overcome those limitations, we propose a DL-based model iCRBP-LKHA using deep hybrid networks for identifying circRNA-RBP interaction sites. iCRBP-LKHA adopts five encoding schemes. Meanwhile, the neural network architecture, which consists of large kernel convolutional neural network (LKCNN), convolutional block attention module with one-dimensional convolution (CBAM-1D) and bidirectional gating recurrent unit (BiGRU), can explore local information, global context information and multiple features interaction information automatically. To verify the effectiveness of iCRBP-LKHA, we compared its performance with shallow learning algorithms on 37 circRNAs datasets and 37 circRNAs stringent datasets. And we compared its performance with state-of-the-art DL-based methods on 37 circRNAs datasets, 37 circRNAs stringent datasets and 31 linear RNAs datasets. The experimental results not only show that iCRBP-LKHA outperforms other competing methods, but also demonstrate the potential of this model in identifying other RNA-RBP interaction sites.
Collapse
Affiliation(s)
- Lin Yuan
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Ling Zhao
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Jinling Lai
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Yufeng Jiang
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Qinhu Zhang
- Eastern Institute for Advanced Study, Eastern Institute of Technology, Ningbo, China
| | - Zhen Shen
- School of Computer and Software, Nanyang Institute of Technology, Nanyang, China
| | - Chun-Hou Zheng
- Key Lab of Intelligent Computing and Signal Processing of Ministry of Education, School of Artificial Intelligence, Anhui University, Hefei, China
| | - De-Shuang Huang
- Eastern Institute for Advanced Study, Eastern Institute of Technology, Ningbo, China
| |
Collapse
|
14
|
Wei H, Gao L, Wu S, Jiang Y, Liu B. DiSMVC: a multi-view graph collaborative learning framework for measuring disease similarity. Bioinformatics 2024; 40:btae306. [PMID: 38715444 PMCID: PMC11256965 DOI: 10.1093/bioinformatics/btae306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 04/19/2024] [Accepted: 05/05/2024] [Indexed: 05/30/2024] Open
Abstract
MOTIVATION Exploring potential associations between diseases can help in understanding pathological mechanisms of diseases and facilitating the discovery of candidate biomarkers and drug targets, thereby promoting disease diagnosis and treatment. Some computational methods have been proposed for measuring disease similarity. However, these methods describe diseases without considering their latent multi-molecule regulation and valuable supervision signal, resulting in limited biological interpretability and efficiency to capture association patterns. RESULTS In this study, we propose a new computational method named DiSMVC. Different from existing predictors, DiSMVC designs a supervised graph collaborative framework to measure disease similarity. Multiple bio-entity associations related to genes and miRNAs are integrated via cross-view graph contrastive learning to extract informative disease representation, and then association pattern joint learning is implemented to compute disease similarity by incorporating phenotype-annotated disease associations. The experimental results show that DiSMVC can draw discriminative characteristics for disease pairs, and outperform other state-of-the-art methods. As a result, DiSMVC is a promising method for predicting disease associations with molecular interpretability. AVAILABILITY AND IMPLEMENTATION Datasets and source codes are available at https://github.com/Biohang/DiSMVC.
Collapse
Affiliation(s)
- Hang Wei
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710126, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710126, China
| | - Shuai Wu
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710126, China
| | - Yina Jiang
- Department of Basic Medicine, Shaanxi University of Chinese Medicine, Xianyang, Shaanxi 712046, China
| | - Bin Liu
- Faculty of Engineering, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, China
| |
Collapse
|
15
|
Tian Z, Han C, Xu L, Teng Z, Song W. MGCNSS: miRNA-disease association prediction with multi-layer graph convolution and distance-based negative sample selection strategy. Brief Bioinform 2024; 25:bbae168. [PMID: 38622356 PMCID: PMC11018511 DOI: 10.1093/bib/bbae168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 03/14/2024] [Accepted: 03/31/2024] [Indexed: 04/17/2024] Open
Abstract
Identifying disease-associated microRNAs (miRNAs) could help understand the deep mechanism of diseases, which promotes the development of new medicine. Recently, network-based approaches have been widely proposed for inferring the potential associations between miRNAs and diseases. However, these approaches ignore the importance of different relations in meta-paths when learning the embeddings of miRNAs and diseases. Besides, they pay little attention to screening out reliable negative samples which is crucial for improving the prediction accuracy. In this study, we propose a novel approach named MGCNSS with the multi-layer graph convolution and high-quality negative sample selection strategy. Specifically, MGCNSS first constructs a comprehensive heterogeneous network by integrating miRNA and disease similarity networks coupled with their known association relationships. Then, we employ the multi-layer graph convolution to automatically capture the meta-path relations with different lengths in the heterogeneous network and learn the discriminative representations of miRNAs and diseases. After that, MGCNSS establishes a highly reliable negative sample set from the unlabeled sample set with the negative distance-based sample selection strategy. Finally, we train MGCNSS under an unsupervised learning manner and predict the potential associations between miRNAs and diseases. The experimental results fully demonstrate that MGCNSS outperforms all baseline methods on both balanced and imbalanced datasets. More importantly, we conduct case studies on colon neoplasms and esophageal neoplasms, further confirming the ability of MGCNSS to detect potential candidate miRNAs. The source code is publicly available on GitHub https://github.com/15136943622/MGCNSS/tree/master.
Collapse
Affiliation(s)
- Zhen Tian
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Chenguang Han
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| | - Lewen Xu
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| | - Zhixia Teng
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Wei Song
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
| |
Collapse
|