1
|
Wang Z, Meng J, Li H, Dai Q, Lin X, Luan Y. Attention-augmented multi-domain cooperative graph representation learning for molecular interaction prediction. Neural Netw 2025; 186:107265. [PMID: 39987715 DOI: 10.1016/j.neunet.2025.107265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Revised: 01/23/2025] [Accepted: 02/07/2025] [Indexed: 02/25/2025]
Abstract
Accurate identification of molecular interactions is crucial for biological network analysis, which can provide valuable insights into fundamental regulatory mechanisms. Despite considerable progress driven by computational advancements, existing methods often rely on task-specific prior knowledge or inherent structural properties of molecules, which limits their generalizability and applicability. Recently, graph-based methods have emerged as a promising approach for predicting links in molecular networks. However, most of these methods focus primarily on aggregating topological information within individual domains, leading to an inadequate characterization of molecular interactions. To mitigate these challenges, we propose AMCGRL, a generalized multi-domain cooperative graph representation learning framework for multifarious molecular interaction prediction tasks. Concretely, AMCGRL incorporates multiple graph encoders to simultaneously learn molecular representations from both intra-domain and inter-domain graphs in a comprehensive manner. Then, the cross-domain decoder is employed to bridge these graph encoders to facilitate the extraction of task-relevant information across different domains. Furthermore, a hierarchical mutual attention mechanism is developed to capture complex pairwise interaction patterns between distinct types of molecules through inter-molecule communicative learning. Extensive experiments conducted on the various datasets demonstrate the superior representation learning capability of AMCGRL compared to the state-of-the-art methods, proving its effectiveness in advancing the prediction of molecular interactions.
Collapse
Affiliation(s)
- Zhaowei Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China.
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China.
| | - Haibin Li
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China.
| | - Qiguo Dai
- School of Computer Science and Engineering, Dalian Minzu University, Dalian 116600, China.
| | - Xiaohui Lin
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China.
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China.
| |
Collapse
|
2
|
Mao Y, Wu J, Weng J, Li M, Xiong Y, Gu W, Jiang R, Pang R, Lin X, Tang D. Inter-view contrastive learning and miRNA fusion for lncRNA-protein interaction prediction in heterogeneous graphs. Brief Bioinform 2025; 26:bbaf148. [PMID: 40194558 PMCID: PMC11975365 DOI: 10.1093/bib/bbaf148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2024] [Revised: 02/19/2025] [Accepted: 03/16/2025] [Indexed: 04/09/2025] Open
Abstract
Predicting long non-coding RNA (lncRNA)-protein interactions is essential for understanding biological processes and discovering new therapeutic targets. In this study, we propose a novel model based on inter-view contrastive learning and miRNA fusion for lncRNA-protein interaction (LPI) prediction, called ICMF-LPI, which utilizes a heterogeneous information network to enhance LPI prediction. The model integrates miRNA as a mediator, constructing an lncRNA-miRNA-protein network, and employs metapath to extract diverse relationships from heterogeneous graphs. By fusing miRNA-related information and leveraging contrastive learning across inter-views, ICMF-LPI effectively captures potential interactions. Experimental results, including five-fold cross-validation, demonstrate the model's superior performance compared to several state-of-the-art methods, with significant improvements in the area under the receiver operating characteristic curve and the area under the precision-recall curve metrics. Notably, even when direct LPI connections are excluded, ICMF-LPI still achieves competitive predictive accuracy, performing comparably or better than some existing models. This demonstrates that the proposed model is effective in scenarios where direct interaction data are unavailable. This approach offers a promising direction for developing predictive models in bioinformatics, particularly in challenging conditions.
Collapse
Affiliation(s)
- Yijun Mao
- College of Mathematics and Informatics, South China Agricultural University, 483 Wushan Road, Tianhe District, GuangZhou 510642, China
- National Key Laboratory of Data Space Technology and System, 3 Minzhuang Road, Haidian District, Beijing 100195, China
| | - Jiale Wu
- College of Mathematics and Informatics, South China Agricultural University, 483 Wushan Road, Tianhe District, GuangZhou 510642, China
| | - Jian Weng
- College of Cyber Security, Jinan University, 601 West Huangpu Avenue, Tianhe District, GuangZhou 510632, China
| | - Ming Li
- College of Cyber Security, Jinan University, 601 West Huangpu Avenue, Tianhe District, GuangZhou 510632, China
| | - Yunyan Xiong
- School of Computer and Infomation Engineering, Guangdong Polytechnic of Industry and Commerce, 1098 North Guangzhou Avenue, Tianhe District, GuangZhou 510510, China
| | - Wanrong Gu
- College of Mathematics and Informatics, South China Agricultural University, 483 Wushan Road, Tianhe District, GuangZhou 510642, China
| | - Rongjin Jiang
- Department of Digital process, Wens Foodstuff Group Co., Ltd, 9 Dongdi North Road, Xinxing County, YunFu 527400, China
| | - Rui Pang
- State Key Laboratory of Green Pesticide, College of Plant Protection, South China Agricultural University, 483 Wushan Road, Tianhe District, GuangZhou 510642, China
| | - Xudong Lin
- College of Mathematics and Informatics, South China Agricultural University, 483 Wushan Road, Tianhe District, GuangZhou 510642, China
| | - Deyu Tang
- College of Mathematics and Informatics, South China Agricultural University, 483 Wushan Road, Tianhe District, GuangZhou 510642, China
| |
Collapse
|
3
|
Kazemifard N, Farmani M, Baradaran Ghavami S, Kazemi M, Shahrokh S, Asadzadeh Aghdaei H, Zali M. A prediction of the CRNDE role by modulating NF-κB pathway in inflammatory bowel disease (IBD). Biochem Biophys Rep 2024; 38:101731. [PMID: 38766384 PMCID: PMC11101873 DOI: 10.1016/j.bbrep.2024.101731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 04/28/2024] [Accepted: 05/07/2024] [Indexed: 05/22/2024] Open
Abstract
Long non-coding RNAs (lncRNAs) regulate multiple pathways and cellular mechanisms. Recent research has emphasized their involvement in the pathogenesis of complex diseases, such as Inflammatory Bowel Disease (IBD) which is characterized by chronic inflammation of the intestines. The two most common types of IBD are ulcerative colitis and Crohn's disease. CRNDE lncRNA was initially detected in colorectal cancer (CRC) and found to be involved in the tumorigenesis pathways. Further studies revealed the role of CRNDE in activating inflammation and promoting the release of inflammatory cytokines. This study utilizes the RNA-seq data analysis and bioinformatics tools to clarify the role of CRNDE in the IBD pathogenesis and confirms its expression in inflamed HT-29 and Caco-2 cell lines and also colonic and blood samples of UC patients and controls ex vivo. Based on our results, CRNDE was significantly upregulated in IBD samples compared to controls in RNA-seq data analysis and Real-time PCR of inflamed HT-29 cell line and colonic biopsies from UC patients. Additionally, predicted that its expression is positively correlated with the pro-inflammatory cytokines production. CRNDE interactions was investigated with several inflammation-related miRNAs and regulatory proteins computationally. Thus, CRNDE upregulation in the colon of IBD patients could be involved in IBD pathogenesis by promoting inflammatory pathways and targeting anti-inflammatory miRNAs.
Collapse
Affiliation(s)
- Nesa Kazemifard
- Department of Genetics and Molecular Biology, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
- Basic and Molecular Epidemiology of Gastrointestinal Disorders Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Maryam Farmani
- Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Shaghayegh Baradaran Ghavami
- Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammad Kazemi
- Department of Genetics and Molecular Biology, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
- Reproductive Sciences and Sexual Health Research Center, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Shabnam Shahrokh
- Basic and Molecular Epidemiology of Gastrointestinal Disorders Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hamid Asadzadeh Aghdaei
- Basic and Molecular Epidemiology of Gastrointestinal Disorders Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammadreza Zali
- Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
4
|
Wei MM, Yu CQ, Li LP, You ZH, Ren ZH, Guan YJ, Wang XF, Li YC. LPIH2V: LncRNA-protein interactions prediction using HIN2Vec based on heterogeneous networks model. Front Genet 2023; 14:1122909. [PMID: 36845392 PMCID: PMC9950107 DOI: 10.3389/fgene.2023.1122909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 01/30/2023] [Indexed: 02/12/2023] Open
Abstract
LncRNA-protein interaction plays an important role in the development and treatment of many human diseases. As the experimental approaches to determine lncRNA-protein interactions are expensive and time-consuming, considering that there are few calculation methods, therefore, it is urgent to develop efficient and accurate methods to predict lncRNA-protein interactions. In this work, a model for heterogeneous network embedding based on meta-path, namely LPIH2V, is proposed. The heterogeneous network is composed of lncRNA similarity networks, protein similarity networks, and known lncRNA-protein interaction networks. The behavioral features are extracted in a heterogeneous network using the HIN2Vec method of network embedding. The results showed that LPIH2V obtains an AUC of 0.97 and ACC of 0.95 in the 5-fold cross-validation test. The model successfully showed superiority and good generalization ability. Compared to other models, LPIH2V not only extracts attribute characteristics by similarity, but also acquires behavior properties by meta-path wandering in heterogeneous networks. LPIH2V would be beneficial in forecasting interactions between lncRNA and protein.
Collapse
Affiliation(s)
- Meng-Meng Wei
- School of Information Engineering, Xijing University, Xi’an, China
| | - Chang-Qing Yu
- School of Information Engineering, Xijing University, Xi’an, China,*Correspondence: Chang-Qing Yu, ; Li-Ping Li,
| | - Li-Ping Li
- School of Information Engineering, Xijing University, Xi’an, China,College of Grassland and Environment Sciences, Xinjiang Agricultural University, Urumqi, China,*Correspondence: Chang-Qing Yu, ; Li-Ping Li,
| | - Zhu-Hong You
- School of Computer Science, Northwestern Polytechnical University, Xi’an, China
| | - Zhong-Hao Ren
- School of Information Engineering, Xijing University, Xi’an, China
| | - Yong-Jian Guan
- School of Information Engineering, Xijing University, Xi’an, China
| | | | | |
Collapse
|
5
|
Ma Y, Zhang H, Jin C, Kang C. Predicting lncRNA-protein interactions with bipartite graph embedding and deep graph neural networks. Front Genet 2023; 14:1136672. [PMID: 36845380 PMCID: PMC9948011 DOI: 10.3389/fgene.2023.1136672] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 01/30/2023] [Indexed: 02/11/2023] Open
Abstract
Background: Long non-coding RNAs (lncRNAs) play crucial roles in numerous biological processes. Investigation of the lncRNA-protein interaction contributes to discovering the undetected molecular functions of lncRNAs. In recent years, increasingly computational approaches have substituted the traditional time-consuming experiments utilized to crack the possible unknown associations. However, significant explorations of the heterogeneity in association prediction between lncRNA and protein are inadequate. It remains challenging to integrate the heterogeneity of lncRNA-protein interactions with graph neural network algorithms. Methods: In this paper, we constructed a deep architecture based on GNN called BiHo-GNN, which is the first to integrate the properties of homogeneous with heterogeneous networks through bipartite graph embedding. Different from previous research, BiHo-GNN can capture the mechanism of molecular association by the data encoder of heterogeneous networks. Meanwhile, we design the process of mutual optimization between homogeneous and heterogeneous networks, which can promote the robustness of BiHo-GNN. Results: We collected four datasets for predicting lncRNA-protein interaction and compared the performance of current prediction models on benchmarking dataset. In comparison with the performance of other models, BiHo-GNN outperforms existing bipartite graph-based methods. Conclusion: Our BiHo-GNN integrates the bipartite graph with homogeneous graph networks. Based on this model structure, the lncRNA-protein interactions and potential associations can be predicted and discovered accurately.
Collapse
Affiliation(s)
- Yuzhou Ma
- College of Artificial Intelligence, Nankai University, Tianjin, China
| | - Han Zhang
- College of Artificial Intelligence, Nankai University, Tianjin, China,*Correspondence: Han Zhang,
| | - Chen Jin
- College of Computer Science, Nankai University, Tianjin, China
| | - Chuanze Kang
- College of Artificial Intelligence, Nankai University, Tianjin, China
| |
Collapse
|
6
|
Pepe G, Appierdo R, Carrino C, Ballesio F, Helmer-Citterich M, Gherardini PF. Artificial intelligence methods enhance the discovery of RNA interactions. Front Mol Biosci 2022; 9:1000205. [PMID: 36275611 PMCID: PMC9585310 DOI: 10.3389/fmolb.2022.1000205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 09/20/2022] [Indexed: 11/13/2022] Open
Abstract
Understanding how RNAs interact with proteins, RNAs, or other molecules remains a challenge of main interest in biology, given the importance of these complexes in both normal and pathological cellular processes. Since experimental datasets are starting to be available for hundreds of functional interactions between RNAs and other biomolecules, several machine learning and deep learning algorithms have been proposed for predicting RNA-RNA or RNA-protein interactions. However, most of these approaches were evaluated on a single dataset, making performance comparisons difficult. With this review, we aim to summarize recent computational methods, developed in this broad research area, highlighting feature encoding and machine learning strategies adopted. Given the magnitude of the effect that dataset size and quality have on performance, we explored the characteristics of these datasets. Additionally, we discuss multiple approaches to generate datasets of negative examples for training. Finally, we describe the best-performing methods to predict interactions between proteins and specific classes of RNA molecules, such as circular RNAs (circRNAs) and long non-coding RNAs (lncRNAs), and methods to predict RNA-RNA or RNA-RBP interactions independently of the RNA type.
Collapse
Affiliation(s)
- G Pepe
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
- *Correspondence: G Pepe, ; M Helmer-Citterich,
| | - R Appierdo
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - C Carrino
- PhD Program in Cellular and Molecular Biology, Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - F Ballesio
- PhD Program in Cellular and Molecular Biology, Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - M Helmer-Citterich
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
- *Correspondence: G Pepe, ; M Helmer-Citterich,
| | - PF Gherardini
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| |
Collapse
|