1
|
Ni J, Li B, Miao S, Zhang X, Yan D, Jing S, Lu S, Xie Z, Zhang X, Liu Y. MethPriorGCN: a deep learning tool for inferring DNA methylation prior knowledge and guiding personalized medicine. Brief Bioinform 2025; 26:bbaf131. [PMID: 40131311 PMCID: PMC11934576 DOI: 10.1093/bib/bbaf131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2025] [Revised: 02/27/2025] [Accepted: 03/05/2025] [Indexed: 03/26/2025] Open
Abstract
DNA methylation plays a crucial role in human diseases pathogenesis. Substantial experimental evidence from clinical and biological studies has confirmed numerous methylation-disease associations, which provide valuable prior knowledge for advancing precision medicine through biomarker discovery and disease subtyping. To systematically mine reliable methylation prior knowledge from known DNA methylation-disease associations and develop robust computational methods for precision medicine applications, we propose MethPriorGCN. By integrating layer attention mechanisms and feature weighting mechanisms, MethPriorGCN not only identified reliable methylation digital biomarkers but also achieved superior disease subtype classification accuracy.
Collapse
Affiliation(s)
- Jie Ni
- Institute for Molecular Medical Technology, State Key Laboratory of Digital Medical Engineering, School of Biological Science and Medical Engineering, Southeast University, No. 2, Southeast University Road, Jiangning District, Nanjing, Jiangsu 211102, China
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, No. 101, Longmian Avenue, Jiangning District, Nanjing, Jiangsu 211166, China
- Institute of Biomedical Devices (Suzhou), Southeast University, No. 8, Jinfeng Road, Suzhou New District, Suzhou, Jiangsu 215163, China
| | - Bin Li
- Institute for Molecular Medical Technology, State Key Laboratory of Digital Medical Engineering, School of Biological Science and Medical Engineering, Southeast University, No. 2, Southeast University Road, Jiangning District, Nanjing, Jiangsu 211102, China
- Institute of Biomedical Devices (Suzhou), Southeast University, No. 8, Jinfeng Road, Suzhou New District, Suzhou, Jiangsu 215163, China
| | - Shumei Miao
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, No. 300, Guangzhou Road, Gulou District, Nanjing, Jiangsu 210029, China
| | - Xinting Zhang
- Institute for Molecular Medical Technology, State Key Laboratory of Digital Medical Engineering, School of Biological Science and Medical Engineering, Southeast University, No. 2, Southeast University Road, Jiangning District, Nanjing, Jiangsu 211102, China
- Institute of Biomedical Devices (Suzhou), Southeast University, No. 8, Jinfeng Road, Suzhou New District, Suzhou, Jiangsu 215163, China
| | - Donghui Yan
- Institute for Molecular Medical Technology, State Key Laboratory of Digital Medical Engineering, School of Biological Science and Medical Engineering, Southeast University, No. 2, Southeast University Road, Jiangning District, Nanjing, Jiangsu 211102, China
- Institute of Biomedical Devices (Suzhou), Southeast University, No. 8, Jinfeng Road, Suzhou New District, Suzhou, Jiangsu 215163, China
| | - Shengqi Jing
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, No. 101, Longmian Avenue, Jiangning District, Nanjing, Jiangsu 211166, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, No. 300, Guangzhou Road, Gulou District, Nanjing, Jiangsu 210029, China
- Center for Data Management, The First Affiliated Hospital, Nanjing Medical University, No. 300, Guangzhou Road, Gulou District, Nanjing, Jiangsu 210029, China
| | - Shan Lu
- Women and Children Department, The First Affiliated Hospital, Nanjing Medical University, No. 300, Guangzhou Road, Gulou District, Nanjing, Jiangsu 210029, China
| | - Zhuoying Xie
- Institute for Molecular Medical Technology, State Key Laboratory of Digital Medical Engineering, School of Biological Science and Medical Engineering, Southeast University, No. 2, Southeast University Road, Jiangning District, Nanjing, Jiangsu 211102, China
- Institute of Biomedical Devices (Suzhou), Southeast University, No. 8, Jinfeng Road, Suzhou New District, Suzhou, Jiangsu 215163, China
| | - Xin Zhang
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, No. 300, Guangzhou Road, Gulou District, Nanjing, Jiangsu 210029, China
| | - Yun Liu
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, No. 300, Guangzhou Road, Gulou District, Nanjing, Jiangsu 210029, China
| |
Collapse
|
2
|
Zhang C, Li Y, Dong Y, Chen W, Yu C. Prediction of miRNA-disease associations based on PCA and cascade forest. BMC Bioinformatics 2024; 25:386. [PMID: 39701957 DOI: 10.1186/s12859-024-05999-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 11/26/2024] [Indexed: 12/21/2024] Open
Abstract
BACKGROUND As a key non-coding RNA molecule, miRNA profoundly affects gene expression regulation and connects to the pathological processes of several kinds of human diseases. However, conventional experimental methods for validating miRNA-disease associations are laborious. Consequently, the development of efficient and reliable computational prediction models is crucial for the identification and validation of these associations. RESULTS In this research, we developed the PCACFMDA method to predict the potential associations between miRNAs and diseases. To construct a multidimensional feature matrix, we consider the fusion similarities of miRNA and disease and miRNA-disease pairs. We then use principal component analysis(PCA) to reduce data complexity and extract low-dimensional features. Subsequently, a tuned cascade forest is used to mine the features and output prediction scores deeply. The results of the 5-fold cross-validation using the HMDD v2.0 database indicate that the PCACFMDA algorithm achieved an AUC of 98.56%. Additionally, we perform case studies on breast, esophageal and lung neoplasms. The findings revealed that the top 50 miRNAs most strongly linked to each disease have been validated. CONCLUSIONS Based on PCA and optimized cascade forests, we propose the PCACFMDA model for predicting undiscovered miRNA-disease associations. The experimental results demonstrate superior prediction performance and commendable stability. Consequently, the PCACFMDA is a potent instrument for in-depth exploration of miRNA-disease associations.
Collapse
Affiliation(s)
- Chuanlei Zhang
- Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Yubo Li
- Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Yinglun Dong
- Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Wei Chen
- Computer Science, China University of Mining and Technology, Xuzhou, 221116, China
| | - Changqing Yu
- Electronic Information, Xijing University, Xi'an, 710123, China.
| |
Collapse
|
3
|
Wang Y, Yin Z. Prediction of miRNA-disease association based on multisource inductive matrix completion. Sci Rep 2024; 14:27503. [PMID: 39528650 PMCID: PMC11555322 DOI: 10.1038/s41598-024-78212-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024] Open
Abstract
MicroRNAs (miRNAs) are endogenous non-coding RNAs approximately 23 nucleotides in length, playing significant roles in various cellular processes. Numerous studies have shown that miRNAs are involved in the regulation of many human diseases. Accurate prediction of miRNA-disease associations is crucial for early diagnosis, treatment, and prognosis assessment of diseases. In this paper, we propose the Autoencoder Inductive Matrix Completion (AEIMC) model to identify potential miRNA-disease associations. The model captures interaction features from multiple similarity networks, including miRNA functional similarity, miRNA sequence similarity, disease semantic similarity, disease ontology similarity, and Gaussian interaction kernel similarity between miRNAs and diseases. Autoencoders are used to extract more complex and abstract data representations, which are then input into the inductive matrix completion model for association prediction. The effectiveness of the model is validated through cross-validation, stratified threshold evaluation, and case studies, while ablation experiments further confirm the necessity of introducing sequence and ontology similarities for the first time.
Collapse
Affiliation(s)
- YaWei Wang
- School of Mathematics, Physics and Statistics, Institute for Frontier Medical Technology, Center of Intelligent Computing and Applied Statistics, Shanghai University of Enginneering Science, Shanghai, 201620, China
| | - ZhiXiang Yin
- School of Mathematics, Physics and Statistics, Institute for Frontier Medical Technology, Center of Intelligent Computing and Applied Statistics, Shanghai University of Enginneering Science, Shanghai, 201620, China.
| |
Collapse
|
4
|
Zhang Y, Chu Y, Lin S, Xiong Y, Wei DQ. ReHoGCNES-MDA: prediction of miRNA-disease associations using homogenous graph convolutional networks based on regular graph with random edge sampler. Brief Bioinform 2024; 25:bbae103. [PMID: 38517693 PMCID: PMC10959163 DOI: 10.1093/bib/bbae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 02/04/2024] [Accepted: 02/23/2024] [Indexed: 03/24/2024] Open
Abstract
Numerous investigations increasingly indicate the significance of microRNA (miRNA) in human diseases. Hence, unearthing associations between miRNA and diseases can contribute to precise diagnosis and efficacious remediation of medical conditions. The detection of miRNA-disease linkages via computational techniques utilizing biological information has emerged as a cost-effective and highly efficient approach. Here, we introduced a computational framework named ReHoGCNES, designed for prospective miRNA-disease association prediction (ReHoGCNES-MDA). This method constructs homogenous graph convolutional network with regular graph structure (ReHoGCN) encompassing disease similarity network, miRNA similarity network and known MDA network and then was tested on four experimental tasks. A random edge sampler strategy was utilized to expedite processes and diminish training complexity. Experimental results demonstrate that the proposed ReHoGCNES-MDA method outperforms both homogenous graph convolutional network and heterogeneous graph convolutional network with non-regular graph structure in all four tasks, which implicitly reveals steadily degree distribution of a graph does play an important role in enhancement of model performance. Besides, ReHoGCNES-MDA is superior to several machine learning algorithms and state-of-the-art methods on the MDA prediction. Furthermore, three case studies were conducted to further demonstrate the predictive ability of ReHoGCNES. Consequently, 93.3% (breast neoplasms), 90% (prostate neoplasms) and 93.3% (prostate neoplasms) of the top 30 forecasted miRNAs were validated by public databases. Hence, ReHoGCNES-MDA might serve as a dependable and beneficial model for predicting possible MDAs.
Collapse
Affiliation(s)
- Yufang Zhang
- School of Mathematical Sciences and SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University, Shanghai 200240, China
- Peng Cheng Laboratory, Shenzhen, Guangdong 518055, China
- Zhongjing Research and Industrialization Institute of Chinese Medicine, Zhongguancun Scientific Park, Meixi, Nanyang, Henan, 473006, China
| | - Yanyi Chu
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Shenggeng Lin
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200232, China
| | - Dong-Qing Wei
- Peng Cheng Laboratory, Shenzhen, Guangdong 518055, China
- Zhongjing Research and Industrialization Institute of Chinese Medicine, Zhongguancun Scientific Park, Meixi, Nanyang, Henan, 473006, China
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
5
|
Wang S, Wang F, Qiao S, Zhuang Y, Zhang K, Pang S, Nowak R, Lv Z. MSHGANMDA: Meta-Subgraphs Heterogeneous Graph Attention Network for miRNA-Disease Association Prediction. IEEE J Biomed Health Inform 2023; 27:4639-4648. [PMID: 35759606 DOI: 10.1109/jbhi.2022.3186534] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
MicroRNAs (miRNAs) influence several biological processes involved in human disease. Biological experiments for verifying the association between miRNA and disease are always costly in terms of both money and time. Although numerous biological experiments have identified multi-types of associations between miRNAs and diseases, existing computational methods are unable to sufficiently mine the knowledge in these associations to predict unknown associations. In this study, we innovatively propose a heterogeneous graph attention network model based on meta-subgraphs (MSHGANMDA) to predict the potential miRNA-disease associations. Firstly, we define five types of meta-subgraph from the known miRNA-disease associations. Then, we use meta-subgraph attention and meta-subgraph semantic attention to extract features of miRNA-disease pairs within and between these five meta-subgraphs, respectively. Finally, we apply a fully-connected layer (FCL) to predict the scores of unknown miRNA-disease associations and cross-entropy loss to train our model end-to-end. To evaluate the effectiveness of MSHGANMDA, we apply five-fold cross-validation to calculate the mean values of evaluation metrics Accuracy, Precision, Recall, and F1-score as 0.8595, 0.8601, 0.8596, and 0.8595, respectively. Experiments show that our model, which primarily utilizes multi-types of miRNA-disease association data, gets the greatest ROC-AUC value of 0.934 when compared to other state-of-the-art approaches. Furthermore, through case studies, we further confirm the effectiveness of MSHGANMDA in predicting unknown diseases.
Collapse
|
6
|
SGAEMDA: Predicting miRNA-Disease Associations Based on Stacked Graph Autoencoder. Cells 2022; 11:cells11243984. [PMID: 36552748 PMCID: PMC9776508 DOI: 10.3390/cells11243984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 11/30/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open
Abstract
MicroRNA (miRNA)-disease association (MDA) prediction is critical for disease prevention, diagnosis, and treatment. Traditional MDA wet experiments, on the other hand, are inefficient and costly.Therefore, we proposed a multi-layer collaborative unsupervised training base model called SGAEMDA (Stacked Graph Autoencoder-Based Prediction of Potential miRNA-Disease Associations). First, from the original miRNA and disease data, we defined two types of initial features: similarity features and association features. Second, stacked graph autoencoder is then used to learn unsupervised low-dimensional representations of meaningful higher-order similarity features, and we concatenate the association features with the learned low-dimensional representations to obtain the final miRNA-disease pair features. Finally, we used a multilayer perceptron (MLP) to predict scores for unknown miRNA-disease associations. SGAEMDA achieved a mean area under the ROC curve of 0.9585 and 0.9516 in 5-fold and 10-fold cross-validation, which is significantly higher than the other baseline methods. Furthermore, case studies have shown that SGAEMDA can accurately predict candidate miRNAs for brain, breast, colon, and kidney neoplasms.
Collapse
|