1
|
Ramani R, Edwin Raja S, Dhinakaran D, Jagan S, Prabaharan G. MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction. MethodsX 2025; 14:103210. [PMID: 40026593 PMCID: PMC11870225 DOI: 10.1016/j.mex.2025.103210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2024] [Accepted: 02/05/2025] [Indexed: 03/05/2025] Open
Abstract
Recent trendy applications of Artificial Intelligence are Machine Learning (ML) algorithms, which have been extensively utilized for processes like pattern recognition, object classification, effective prediction of disease etc. However, ML techniques are reasonable solutions to computation methods and modeling, especially when the data size is enormous. These facts are established due to the reason that big data field has received considerable attention from both the industrial experts and academicians. The computation process must be accelerated to achieve early disease prediction in order to accomplish the prospects of ML for big data applications. In this paper, a method named "Associative Kruskal Wallis and MapReduce Poly Kernel (AKW-MRPK)" is presented for early disease prediction. Initially, significant attributes are selected by applying Associative Kruskal Wallis Feature Selection model. This study parallelizes polynomial kernel vector using MapReduce based on the significant qualities gained, which will become a significant computing model to facilitate the early prognosis of disease. The proposed AKW-MRPK framework achieves up to 92 % accuracy, reduces computational time to as low as 0.875 ms for 25 patients, and demonstrates superior speedup efficiency with a value of 1.9 ms using two computational nodes, consistently outperforming supervised machine learning algorithms and Hadoop-based clusters across these critical metrics.•The AKW-MRPK method selects attributes and accelerates computations for predictions.•Parallelizing polynomial kernels improves accuracy and speed in healthcare data analysis.
Collapse
Affiliation(s)
- R. Ramani
- Department of Artificial Intelligence and Data Science, P.S.R Engineering College, Sivakasi, India
| | - S. Edwin Raja
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| | - D. Dhinakaran
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| | - S. Jagan
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| | - G. Prabaharan
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| |
Collapse
|
2
|
Dhinakaran D, Srinivasan L, Edwin Raja S, Valarmathi K, Gomathy Nayagam M. Synergistic feature selection and distributed classification framework for high-dimensional medical data analysis. MethodsX 2025; 14:103219. [PMID: 40083656 PMCID: PMC11904559 DOI: 10.1016/j.mex.2025.103219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2024] [Accepted: 02/12/2025] [Indexed: 03/16/2025] Open
Abstract
Feature selection and classification efficiency and accuracy are key to improving decision-making regarding medical data analysis. Since the medical datasets are large and complex, they give rise to certain problematic issues such as computational complexity, limited memory space, and a lesser number of correct classifications. In order to overcome these drawbacks, the new integrated algorithm is presented here: Synergistic Kruskal-RFE Selector and Distributed Multi-Kernel Classification Framework (SKR-DMKCF). The innovative architecture of SKR-DMKCF results in the reduction of dimensionality while preserving useful characteristics of the image utilizing recursive feature elimination and multi-kernel classification in a distributed environment. Detailed evaluations were performed on four broad medical datasets and established our performance advantage. The average feature reduction ratio was 89 % for the proposed method, SKR-DMKCF, which can outperform all the methods by achieving the best classification average accuracy of 85.3 %, precision of 81.5 %, and recall 84.7 %. On the efficiency calculations, it was seen that the memory usage is a 25 % reduction compared to the existing methods and the speed-up time was a significant improvement as well to assure scalability for resource-limited environments.•Innovative Synergistic Kruskal-RFE Selector for efficient feature selection in medical datasets.•Distributed Multi-Kernel Classification Framework achieving superior accuracy and computational efficiency.
Collapse
Affiliation(s)
- D. Dhinakaran
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| | - L. Srinivasan
- Department of Computer Science and Engineering, Dr. N.G.P. Institute of Technology, Coimbatore, India
| | - S. Edwin Raja
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| | - K. Valarmathi
- Department of Electronics and Communication Engineering, P.S.R Engineering College, Sivakasi, India
| | - M. Gomathy Nayagam
- Department of Computer Science and Business Systems, Ramco Institute of Technology, Rajapalayam, India
| |
Collapse
|
3
|
Zhu R, Wang Y, Dai LY. CLHGNNMDA: Hypergraph Neural Network Model Enhanced by Contrastive Learning for miRNA-Disease Association Prediction. J Comput Biol 2025; 32:47-63. [PMID: 39602201 DOI: 10.1089/cmb.2024.0720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2024] Open
Abstract
Numerous biological experiments have demonstrated that microRNA (miRNA) is involved in gene regulation within cells, and mutations and abnormal expression of miRNA can cause a myriad of intricate diseases. Forecasting the association between miRNA and diseases can enhance disease prevention and treatment and accelerate drug research, which holds considerable importance for the development of clinical medicine and drug research. This investigation introduces a contrastive learning-augmented hypergraph neural network model, termed CLHGNNMDA, aimed at predicting associations between miRNAs and diseases. Initially, CLHGNNMDA constructs multiple hypergraphs by leveraging diverse similarity metrics related to miRNAs and diseases. Subsequently, hypergraph convolution is applied to each hypergraph to extract feature representations for nodes and hyperedges. Following this, autoencoders are employed to reconstruct information regarding the feature representations of nodes and hyperedges and to integrate various features of miRNAs and diseases extracted from each hypergraph. Finally, a joint contrastive loss function is utilized to refine the model and optimize its parameters. The CLHGNNMDA framework employs multi-hypergraph contrastive learning for the construction of a contrastive loss function. This approach takes into account inter-view interactions and upholds the principle of consistency, thereby augmenting the model's representational efficacy. The results obtained from fivefold cross-validation substantiate that the CLHGNNMDA algorithm achieves a mean area under the receiver operating characteristic curve of 0.9635 and a mean area under the precision-recall curve of 0.9656. These metrics are notably superior to those attained by contemporary state-of-the-art methodologies.
Collapse
Affiliation(s)
- Rong Zhu
- School of Computer Science, Qufu Normal University, Rizhao, China
| | - Yong Wang
- Laboratory Experimental Teaching and Equipment Management Center, Qufu Normal University, Rizhao, China
| | - Ling-Yun Dai
- School of Computer Science, Qufu Normal University, Rizhao, China
| |
Collapse
|
4
|
Liu Y, Wu Q, Zhou L, Liu Y, Li C, Wei Z, Peng W, Yue Y, Zhu X. Disentangled similarity graph attention heterogeneous biological memory network for predicting disease-associated miRNAs. BMC Genomics 2024; 25:1161. [PMID: 39623332 PMCID: PMC11610307 DOI: 10.1186/s12864-024-11078-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Accepted: 11/21/2024] [Indexed: 12/06/2024] Open
Abstract
BACKGROUND The association between MicroRNAs (miRNAs) and diseases is crucial in treating and exploring many diseases or cancers. Although wet-lab methods for predicting miRNA-disease associations (MDAs) are effective, they are often expensive and time-consuming. Significant advancements have been made using Graph Neural Network-based methods (GNN-MDAs) to address these challenges. However, these methods still face limitations, such as not considering nodes' deep-level similarity associations and hierarchical learning patterns. Additionally, current models do not retain the memory of previously learned heterogeneous historical information about miRNAs or diseases, only focusing on parameter learning without the capability to remember heterogeneous associations. RESULTS This study introduces the K-means disentangled high-level biological similarity to utilize potential hierarchical relationships fully and proposes a Graph Attention Heterogeneous Biological Memory Network architecture (DiGAMN) with memory capabilities. Extensive experiments were conducted across four datasets, comparing the DiGAMN model and its disentangling method against ten state-of-the-art non-disentangled methods and six traditional GNNs. DiGAMN excelled, achieving AUC scores of 96.35%, 96.10%, 96.01%, and 95.89% on the Data1 to Data4 datasets, respectively, surpassing all other models. These results confirm the superior performance of DiGAMN and its disentangling method. Additionally, various ablation studies were conducted to validate the contributions of different modules within the framework, and's encoding statuses and memory units of DiGAMN were visualized to explore the utility and functionality of its modules. Case studies confirmed the effectiveness of DiGAMN's predictions, identifying several new disease-associated miRNAs. CONCLUSIONS DiGAMN introduces the use of a disentangled biological similarity approach for the first time and successfully constructs a Disentangled Graph Attention Heterogeneous Biological Memory Network model. This network can learn disentangled representations of similarity information and effectively store the potential biological entanglement information of miRNAs and diseases. By integrating disentangled similarity information with a heterogeneous attention memory network, DiGAMN enhances the model's ability to capture and utilize complex underlying biological data, significantly outperforming many existing models. The concepts used in this method also provide new perspectives for predicting miRNAs associated with diseases.
Collapse
Affiliation(s)
- Yinbo Liu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
| | - Qi Wu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China
| | - Le Zhou
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
| | - Yuchen Liu
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
- China University of Petroleum, Beijing, Beijing, 102249, China
| | - Chao Li
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
| | - Zhuoyu Wei
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China
| | - Wei Peng
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China
| | - Yi Yue
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China.
| | - Xiaolei Zhu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China.
| |
Collapse
|
5
|
Yang Y, Sun Y, Li F, Guan B, Liu JX, Shang J. MGCNRF: Prediction of Disease-Related miRNAs Based on Multiple Graph Convolutional Networks and Random Forest. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:15701-15709. [PMID: 37459265 DOI: 10.1109/tnnls.2023.3289182] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/01/2024]
Abstract
Increasing microRNAs (miRNAs) have been confirmed to be inextricably linked to various diseases, and the discovery of their associations has become a routine way of treating diseases. To overcome the time-consuming and laborious shortcoming of traditional experiments in verifying the associations of miRNAs and diseases (MDAs), a variety of computational methods have emerged. However, these methods still have many shortcomings in terms of predictive performance and accuracy. In this study, a model based on multiple graph convolutional networks and random forest (MGCNRF) was proposed for the prediction MDAs. Specifically, MGCNRF first mapped miRNA functional similarity and sequence similarity, disease semantic similarity and target similarity, and the known MDAs into four different two-layer heterogeneous networks. Second, MGCNRF applied four heterogeneous networks into four different layered attention graph convolutional networks (GCNs), respectively, to extract MDA embeddings. Finally, MGCNRF integrated the embeddings of every MDA into the features of the miRNA-disease pair and predicted potential MDAs through the random forest (RF). Fivefold cross-validation was applied to verify the prediction performance of MGCNRF, which outperforms the other seven state-of-the-art methods by area under curve. Furthermore, the accuracy and the case studies of different diseases further demonstrate the scientific rationale of MGCNRF. In conclusion, MGCNRF can serve as a scientific tool for predicting potential MDAs.
Collapse
|
6
|
Wen S, Liu Y, Yang G, Chen W, Wu H, Zhu X, Wang Y. A method for miRNA diffusion association prediction using machine learning decoding of multi-level heterogeneous graph Transformer encoded representations. Sci Rep 2024; 14:20490. [PMID: 39227405 PMCID: PMC11371806 DOI: 10.1038/s41598-024-68897-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Accepted: 07/29/2024] [Indexed: 09/05/2024] Open
Abstract
MicroRNAs (miRNAs) are a key class of endogenous non-coding RNAs that play a pivotal role in regulating diseases. Accurately predicting the intricate relationships between miRNAs and diseases carries profound implications for disease diagnosis, treatment, and prevention. However, these prediction tasks are highly challenging due to the complexity of the underlying relationships. While numerous effective prediction models exist for validating these associations, they often encounter information distortion due to limitations in efficiently retaining information during the encoding-decoding process. Inspired by Multi-layer Heterogeneous Graph Transformer and Machine Learning XGboost classifier algorithm, this study introduces a novel computational approach based on multi-layer heterogeneous encoder-machine learning decoder structure for miRNA-disease association prediction (MHXGMDA). First, we employ the multi-view similarity matrices as the input coding for MHXGMDA. Subsequently, we utilize the multi-layer heterogeneous encoder to capture the embeddings of miRNAs and diseases, aiming to capture the maximum amount of relevant features. Finally, the information from all layers is concatenated to serve as input to the machine learning classifier, ensuring maximal preservation of encoding details. We conducted a comprehensive comparison of seven different classifier models and ultimately selected the XGBoost algorithm as the decoder. This algorithm leverages miRNA embedding features and disease embedding features to decode and predict the association scores between miRNAs and diseases. We applied MHXGMDA to predict human miRNA-disease associations on two benchmark datasets. Experimental findings demonstrate that our approach surpasses several leading methods in terms of both the area under the receiver operating characteristic curve and the area under the precision-recall curve.
Collapse
Affiliation(s)
- SiJian Wen
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - YinBo Liu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - Guang Yang
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - WenXi Chen
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - HaiTao Wu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - XiaoLei Zhu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China.
| | - YongMei Wang
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China.
- Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Hefei, 230036, China.
| |
Collapse
|
7
|
Xuan P, Xiu J, Cui H, Zhang X, Nakaguchi T, Zhang T. Complementary feature learning across multiple heterogeneous networks and multimodal attribute learning for predicting disease-related miRNAs. iScience 2024; 27:108639. [PMID: 38303724 PMCID: PMC10831890 DOI: 10.1016/j.isci.2023.108639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 11/02/2023] [Accepted: 12/01/2023] [Indexed: 02/03/2024] Open
Abstract
Inferring the latent disease-related miRNAs is helpful for providing a deep insight into observing the disease pathogenesis. We propose a method, CMMDA, to encode and integrate the context relationship among multiple heterogeneous networks, the complementary information across these networks, and the pairwise multimodal attributes. We first established multiple heterogeneous networks according to the diverse disease similarities. The feature representation embedding the context relationship is formulated for each miRNA (disease) node based on transformer. We designed a co-attention fusion mechanism to encode the complementary information among multiple networks. In terms of a pair of miRNA and disease nodes, the pairwise attributes from multiple networks form a multimodal attribute embedding. A module based on depthwise separable convolution is constructed to enhance the encoding of the specific features from each modality. The experimental results and the ablation studies show that CMMDA's superior performance and the effectiveness of its major innovations.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- Department of Computer Science, Shantou University, Shantou 515063, China
| | - Jinshan Xiu
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3083, Australia
| | - Xiaowen Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba 2638522, Japan
| | - Tiangang Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- School of Mathematical Science, Heilongjiang University, Harbin 150080, China
| |
Collapse
|
8
|
Yu S, Wang H, Li J, Zhao J, Liang C, Sun Y. A Multi-Relational Graph Encoder Network for Fine-Grained Prediction of MiRNA-Disease Associations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:45-56. [PMID: 38015672 DOI: 10.1109/tcbb.2023.3335007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
MicroRNAs (miRNAs) are critical in diagnosing and treating various diseases. Automatically demystifying the interdependent relationships between miRNAs and diseases has recently made remarkable progress, but their fine-grained interactive relationships still need to be explored. We propose a multi-relational graph encoder network for fine-grained prediction of miRNA-disease associations (MRFGMDA), which uses practical and current datasets to construct a multi-relational graph encoder network to predict disease-related miRNAs and their specific relationship types (upregulation, downregulation, or dysregulation). We evaluated MRFGMDA and found that it accurately predicted miRNA-disease associations, which could have far-reaching implications for clinical medical analysis, early diagnosis, prevention, and treatment. Case analyses, Kaplan-Meier survival analysis, expression difference analysis, and immune infiltration analysis further demonstrated the effectiveness and feasibility of MRFGMDA in uncovering potential disease-related miRNAs. Overall, our work represents a significant step toward improving the prediction of miRNA-disease associations using a fine-grained approach could lead to more accurate diagnosis and treatment of diseases.
Collapse
|
9
|
Zhong Y, Shen C, Xi X, Luo Y, Ding P, Luo L. Multitask joint learning with graph autoencoders for predicting potential MiRNA-drug associations. Artif Intell Med 2023; 145:102665. [PMID: 37925217 DOI: 10.1016/j.artmed.2023.102665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 06/14/2023] [Accepted: 09/14/2023] [Indexed: 11/06/2023]
Abstract
The occurrence of many diseases is associated with miRNA abnormalities. Predicting potential drug-miRNA associations is of great importance for both disease treatment and new drug discovery. Most computation-based approaches learn one task at a time, ignoring the information contained in other tasks in the same domain. Multitask learning can effectively enhance the prediction performance of a single task by extending the valid information of related tasks. In this paper, we presented a multitask joint learning framework (MTJL) with a graph autoencoder for predicting the associations between drugs and miRNAs. First, we combined multiple pieces of information to construct a high-quality similarity network of both drugs and miRNAs and then used a graph autoencoder (GAE) to learn their embedding representations separately. Second, to further improve the embedding quality of drugs, we added an auxiliary task to classify drugs using the learned representations. Finally, the embedding representations of drugs and miRNAs were linearly transformed to obtain the predictive association scores between them. A comparison with other state-of-the-art models shows that MTJL has the best prediction performance, and ablation experiments show that the auxiliary task can enhance the embedding quality and improve the robustness of the model. In addition, we show that MTJL has high utility in predicting potential associations between drugs and miRNAs by conducting two case studies.
Collapse
Affiliation(s)
- Yichen Zhong
- School of Computer Science, University of South China, Hengyang 421001, China
| | - Cong Shen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Xiaoting Xi
- School of Computer Science, University of South China, Hengyang 421001, China
| | - Yuxun Luo
- School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan 411105, China
| | - Pingjian Ding
- School of Computer Science, University of South China, Hengyang 421001, China
| | - Lingyun Luo
- School of Computer Science, University of South China, Hengyang 421001, China.
| |
Collapse
|
10
|
Li Y, Zhang M, Shang J, Li F, Ren Q, Liu JX. iLncDA-RSN: identification of lncRNA-disease associations based on reliable similarity networks. Front Genet 2023; 14:1249171. [PMID: 37614816 PMCID: PMC10442839 DOI: 10.3389/fgene.2023.1249171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 07/27/2023] [Indexed: 08/25/2023] Open
Abstract
Identification of disease-associated long non-coding RNAs (lncRNAs) is crucial for unveiling the underlying genetic mechanisms of complex diseases. Multiple types of similarity networks of lncRNAs (or diseases) can complementary and comprehensively characterize their similarities. Hence, in this study, we presented a computational model iLncDA-RSN based on reliable similarity networks for identifying potential lncRNA-disease associations (LDAs). Specifically, for constructing reliable similarity networks of lncRNAs and diseases, miRNA heuristic information with lncRNAs and diseases is firstly introduced to construct their respective Jaccard similarity networks; then Gaussian interaction profile (GIP) kernel similarity networks and Jaccard similarity networks of lncRNAs and diseases are provided based on the lncRNA-disease association network; a random walk with restart strategy is finally applied on Jaccard similarity networks, GIP kernel similarity networks, as well as lncRNA functional similarity network and disease semantic similarity network to construct reliable similarity networks. Depending on the lncRNA-disease association network and the reliable similarity networks, feature vectors of lncRNA-disease pairs are integrated from lncRNA and disease perspectives respectively, and then dimensionality reduced by the elastic net. Two random forests are at last used together on different lncRNA-disease association feature sets to identify potential LDAs. The iLncDA-RSN is evaluated by five-fold cross-validation to analyse its prediction performance, results of which show that the iLncDA-RSN outperforms the compared models. Furthermore, case studies of different complex diseases demonstrate the effectiveness of the iLncDA-RSN in identifying potential LDAs.
Collapse
Affiliation(s)
| | | | - Junliang Shang
- School of Computer Science, Qufu Normal University, Rizhao, China
| | | | | | | |
Collapse
|
11
|
Wang Y, Liu JX, Wang J, Shang J, Gao YL. A Graph Representation Approach Based on Light Gradient Boosting Machine for Predicting Drug-Disease Associations. J Comput Biol 2023; 30:937-947. [PMID: 37486669 DOI: 10.1089/cmb.2023.0078] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/25/2023] Open
Abstract
Determining the association between drug and disease is important in drug development. However, existing approaches for drug-disease associations (DDAs) prediction are too homogeneous in terms of feature extraction. Here, a novel graph representation approach based on light gradient boosting machine (GRLGB) is proposed for prediction of DDAs. After the introduction of the protein into a heterogeneous network, nodes features were extracted from two perspectives: network topology and biological knowledge. Finally, the GRLGB classifier was applied to predict potential DDAs. GRLGB achieved satisfactory results on Bdataset and Fdataset through 10-fold cross-validation. To further prove the reliability of the GRLGB, case studies involving anxiety disorders and clozapine were conducted. The results suggest that GRLGB can identify novel DDAs.
Collapse
Affiliation(s)
- Ying Wang
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Jin-Xing Liu
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Juan Wang
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Junliang Shang
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Ying-Lian Gao
- Qufu Normal University Library, Qufu Normal University, Rizhao, Shandong, China
| |
Collapse
|
12
|
Wang Y, Gao YL, Wang J, Li F, Liu JX. MSGCA: Drug-Disease Associations Prediction Based on Multi-Similarities Graph Convolutional Autoencoder. IEEE J Biomed Health Inform 2023; 27:3686-3694. [PMID: 37163398 DOI: 10.1109/jbhi.2023.3272154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Identifying drug-disease associations (DDAs) is critical to the development of drugs. Traditional methods to determine DDAs are expensive and inefficient. Therefore, it is imperative to develop more accurate and effective methods for DDAs prediction. Most current DDAs prediction methods utilize original DDAs matrix directly. However, the original DDAs matrix is sparse, which greatly affects the prediction consequences. Hence, a prediction method based on multi-similarities graph convolutional autoencoder (MSGCA) is proposed for DDAs prediction. First, MSGCA integrates multiple drug similarities and disease similarities using centered kernel alignment-based multiple kernel learning (CKA-MKL) algorithm to form new drug similarity and disease similarity, respectively. Second, the new drug and disease similarities are improved by linear neighborhood, and the DDAs matrix is reconstructed by weighted K nearest neighbor profiles. Next, the reconstructed DDAs and the improved drug and disease similarities are integrated into a heterogeneous network. Finally, the graph convolutional autoencoder with attention mechanism is utilized to predict DDAs. Compared with extant methods, MSGCA shows superior results on three datasets. Furthermore, case studies further demonstrate the reliability of MSGCA.
Collapse
|
13
|
Shen Y, Liu JX, Yin MM, Zheng CH, Gao YL. BMPMDA: Prediction of MiRNA-Disease Associations Using a Space Projection Model Based on Block Matrix. Interdiscip Sci 2023; 15:88-99. [PMID: 36335274 DOI: 10.1007/s12539-022-00542-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 11/07/2022]
Abstract
With the high-quality development of bioinformatics technology, miRNA-disease associations (MDAs) are gradually being uncovered. At present, convenient and efficient prediction methods, which solve the problem of resource-consuming in traditional wet experiments, need to be further put forward. In this study, a space projection model based on block matrix is presented for predicting MDAs (BMPMDA). Specifically, two block matrices are first composed of the known association matrix and similarity to increase comprehensiveness. For the integrity of information in the heterogeneous network, matrix completion (MC) is utilized to mine potential MDAs. Considering the neighborhood information of data points, linear neighborhood similarity (LNS) is regarded as a measure of similarity. Next, LNS is projected onto the corresponding completed association matrix to derive the projection score. Finally, the AUC and AUPR values for BMPMDA reach 0.9691 and 0.6231, respectively. Additionally, the majority of novel MDAs in three disease cases are identified in existing databases and literature. It suggests that BMPMDA can serve as a reliable prediction model for biological research.
Collapse
Affiliation(s)
- Yi Shen
- Qufu Normal University, Rizhao, 276800, China
| | | | | | - Chun-Hou Zheng
- Co-Innovation Center for Information Supply and Assurance Technology, Anhui University, Hefei, 230000, China
| | - Ying-Lian Gao
- Library of Qufu Normal University, Qufu Normal University, Rizhao, 276800, China.
| |
Collapse
|
14
|
Dornaika F, Bi J, Zhang C. A unified deep semi-supervised graph learning scheme based on nodes re-weighting and manifold regularization. Neural Netw 2023; 158:188-196. [PMID: 36462365 DOI: 10.1016/j.neunet.2022.11.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 10/01/2022] [Accepted: 11/10/2022] [Indexed: 11/21/2022]
Abstract
In recent years, semi-supervised learning on graphs has gained importance in many fields and applications. The goal is to use both partially labeled data (labeled examples) and a large amount of unlabeled data to build more effective predictive models. Deep Graph Neural Networks (GNNs) are very useful in both unsupervised and semi-supervised learning problems. As a special class of GNNs, Graph Convolutional Networks (GCNs) aim to obtain data representation through graph-based node smoothing and layer-wise neural network transformations. However, GCNs have some weaknesses when applied to semi-supervised graph learning: (1) it ignores the manifold structure implicitly encoded by the graph; (2) it uses a fixed neighborhood graph and focuses only on the convolution of a graph, but pays little attention to graph construction; (3) it rarely considers the problem of topological imbalance. To overcome the above shortcomings, in this paper, we propose a novel semi-supervised learning method called Re-weight Nodes and Graph Learning Convolutional Network with Manifold Regularization (ReNode-GLCNMR). Our proposed method simultaneously integrates graph learning and graph convolution into a unified network architecture, which also enforces label smoothing through an unsupervised loss term. At the same time, it addresses the problem of imbalance in graph topology by adaptively reweighting the influence of labeled nodes based on their distances to the class boundaries. Experiments on 8 benchmark datasets show that ReNode-GLCNMR significantly outperforms the state-of-the-art semi-supervised GNN methods.1.
Collapse
Affiliation(s)
- Fadi Dornaika
- Henan University, Henan Key Lab of Big Data Analysis and Processing, Kaifeng, China; University of the Basque Country, UPV/EHU, San Sebastian, Spain; IKERBASQUE, Basque Foundation for Science, Bilbao, Spain.
| | - Jingjun Bi
- University of the Basque Country, UPV/EHU, San Sebastian, Spain.
| | - Chongsheng Zhang
- Henan University, Henan Key Lab of Big Data Analysis and Processing, Kaifeng, China.
| |
Collapse
|
15
|
SGAEMDA: Predicting miRNA-Disease Associations Based on Stacked Graph Autoencoder. Cells 2022; 11:cells11243984. [PMID: 36552748 PMCID: PMC9776508 DOI: 10.3390/cells11243984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 11/30/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open
Abstract
MicroRNA (miRNA)-disease association (MDA) prediction is critical for disease prevention, diagnosis, and treatment. Traditional MDA wet experiments, on the other hand, are inefficient and costly.Therefore, we proposed a multi-layer collaborative unsupervised training base model called SGAEMDA (Stacked Graph Autoencoder-Based Prediction of Potential miRNA-Disease Associations). First, from the original miRNA and disease data, we defined two types of initial features: similarity features and association features. Second, stacked graph autoencoder is then used to learn unsupervised low-dimensional representations of meaningful higher-order similarity features, and we concatenate the association features with the learned low-dimensional representations to obtain the final miRNA-disease pair features. Finally, we used a multilayer perceptron (MLP) to predict scores for unknown miRNA-disease associations. SGAEMDA achieved a mean area under the ROC curve of 0.9585 and 0.9516 in 5-fold and 10-fold cross-validation, which is significantly higher than the other baseline methods. Furthermore, case studies have shown that SGAEMDA can accurately predict candidate miRNAs for brain, breast, colon, and kidney neoplasms.
Collapse
|
16
|
Yang X, Zheng Y, Xing X, Sui X, Jia W, Pan H. Immune subtype identification and multi-layer perceptron classifier construction for breast cancer. Front Oncol 2022; 12:943874. [PMID: 36568197 PMCID: PMC9780074 DOI: 10.3389/fonc.2022.943874] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Accepted: 11/17/2022] [Indexed: 12/13/2022] Open
Abstract
Introduction Breast cancer is a heterogeneous tumor. Tumor microenvironment (TME) has an important effect on the proliferation, metastasis, treatment, and prognosis of breast cancer. Methods In this study, we calculated the relative proportion of tumor infiltrating immune cells (TIICs) in the breast cancer TME, and used the consensus clustering algorithm to cluster the breast cancer subtypes. We also developed a multi-layer perceptron (MLP) classifier based on a deep learning framework to detect breast cancer subtypes, which 70% of the breast cancer research cohort was used for the model training and 30% for validation. Results By performing the K-means clustering algorithm, the research cohort was clustered into two subtypes. The Kaplan-Meier survival estimate analysis showed significant differences in the overall survival (OS) between the two identified subtypes. Estimating the difference in the relative proportion of TIICs showed that the two subtypes had significant differences in multiple immune cells, such as CD8, CD4, and regulatory T cells. Further, the expression level of immune checkpoint molecules (PDL1, CTLA4, LAG3, TIGIT, CD27, IDO1, ICOS) and tumor mutational burden (TMB) also showed significant differences between the two subtypes, indicating the clinical value of the two subtypes. Finally, we identified a 38-gene signature and developed a multilayer perceptron (MLP) classifier that combined multi-gene signature to identify breast cancer subtypes. The results showed that the classifier had an accuracy rate of 93.56% and can be robustly used for the breast cancer subtype diagnosis. Conclusion Identification of breast cancer subtypes based on the immune signature in the tumor microenvironment can assist clinicians to effectively and accurately assess the progression of breast cancer and formulate different treatment strategies for different subtypes.
Collapse
Affiliation(s)
- Xinbo Yang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Yuanjie Zheng
- School of Information Science and Engineering, Shandong Normal University, Jinan, China,*Correspondence: Yuanjie Zheng, ; Huali Pan,
| | - Xianrong Xing
- Department of Pharmacy, Shandong Medical College, Jinan, China
| | - Xiaodan Sui
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Weikuan Jia
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Huali Pan
- School of Information Science and Engineering, Shandong Normal University, Jinan, China,Business School, Shandong Normal University, Jinan, China,*Correspondence: Yuanjie Zheng, ; Huali Pan,
| |
Collapse
|
17
|
Lu S, Liang Y, Li L, Liao S, Ouyang D. Inferring human miRNA–disease associations via multiple kernel fusion on GCNII. Front Genet 2022; 13:980497. [PMID: 36134032 PMCID: PMC9483142 DOI: 10.3389/fgene.2022.980497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 07/20/2022] [Indexed: 11/16/2022] Open
Abstract
Increasing evidence shows that the occurrence of human complex diseases is closely related to the mutation and abnormal expression of microRNAs(miRNAs). MiRNAs have complex and fine regulatory mechanisms, which makes it a promising target for drug discovery and disease diagnosis. Therefore, predicting the potential miRNA-disease associations has practical significance. In this paper, we proposed an miRNA–disease association predicting method based on multiple kernel fusion on Graph Convolutional Network via Initial residual and Identity mapping (GCNII), called MKFGCNII. Firstly, we built a heterogeneous network of miRNAs and diseases to extract multi-layer features via GCNII. Secondly, multiple kernel fusion method was applied to weight fusion of embeddings at each layer. Finally, Dual Laplacian Regularized Least Squares was used to predict new miRNA–disease associations by the combined kernel in miRNA and disease spaces. Compared with the other methods, MKFGCNII obtained the highest AUC value of 0.9631. Code is available at https://github.com/cuntjx/bioInfo.
Collapse
Affiliation(s)
- Shanghui Lu
- School of Computer Science and Engineering, Macau University of Science and Technology, Taipa, China
- School of Mathematics and Physics, Hechi University, Hechi, China
| | - Yong Liang
- School of Computer Science and Engineering, Macau University of Science and Technology, Taipa, China
- Peng Cheng Laboratory, Shenzhen, China
- *Correspondence: Yong Liang,
| | - Le Li
- School of Computer Science and Engineering, Macau University of Science and Technology, Taipa, China
| | - Shuilin Liao
- School of Computer Science and Engineering, Macau University of Science and Technology, Taipa, China
| | - Dong Ouyang
- School of Computer Science and Engineering, Macau University of Science and Technology, Taipa, China
| |
Collapse
|
18
|
Rao Y, Xie M, Wang H. Predict potential miRNA-disease associations based on bounded nuclear norm regularization. Front Genet 2022; 13:978975. [PMID: 36072658 PMCID: PMC9441603 DOI: 10.3389/fgene.2022.978975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open
Abstract
Increasing evidences show that the abnormal microRNA (miRNA) expression is related to a variety of complex human diseases. However, the current biological experiments to determine miRNA-disease associations are time consuming and expensive. Therefore, computational models to predict potential miRNA-disease associations are in urgent need. Though many miRNA-disease association prediction methods have been proposed, there is still a room to improve the prediction accuracy. In this paper, we propose a matrix completion model with bounded nuclear norm regularization to predict potential miRNA-disease associations, which is called BNNRMDA. BNNRMDA at first constructs a heterogeneous miRNA-disease network integrating the information of miRNA self-similarity, disease self-similarity, and the known miRNA-disease associations, which is represented by an adjacent matrix. Then, it models the miRNA-disease prediction as a relaxed matrix completion with error tolerance, value boundary and nuclear norm minimization. Finally it implements the alternating direction method to solve the matrix completion problem. BNNRMDA makes full use of available information of miRNAs and diseases, and can deals with the data containing noise. Compared with four state-of-the-art methods, the experimental results show BNNRMDA achieved the best performance in five-fold cross-validation and leave-one-out cross-validation. The case studies on two complex human diseases showed that 47 of the top 50 prediction results of BNNRMDA have been verified in the latest HMDD database.
Collapse
|
19
|
Wang XF, Yu CQ, Li LP, You ZH, Huang WZ, Li YC, Ren ZH, Guan YJ. KGDCMI: A New Approach for Predicting circRNA–miRNA Interactions From Multi-Source Information Extraction and Deep Learning. Front Genet 2022; 13:958096. [PMID: 36051691 PMCID: PMC9426772 DOI: 10.3389/fgene.2022.958096] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 06/23/2022] [Indexed: 11/13/2022] Open
Abstract
Emerging evidence has revealed that circular RNA (circRNA) is widely distributed in mammalian cells and functions as microRNA (miRNA) sponges involved in transcriptional and posttranscriptional regulation of gene expression. Recognizing the circRNA–miRNA interaction provides a new perspective for the detection and treatment of human complex diseases. Compared with the traditional biological experimental methods used to predict the association of molecules, which are limited to the small-scale and are time-consuming and laborious, computing models can provide a basis for biological experiments at low cost. Considering that the proposed calculation model is limited, it is necessary to develop an effective computational method to predict the circRNA–miRNA interaction. This study thus proposed a novel computing method, named KGDCMI, to predict the interactions between circRNA and miRNA based on multi-source information extraction and fusion. The KGDCMI obtains RNA attribute information from sequence and similarity, capturing the behavior information in RNA association through a graph-embedding algorithm. Then, the obtained feature vector is extracted further by principal component analysis and sent to the deep neural network for information fusion and prediction. At last, KGDCMI obtains the prediction accuracy (area under the curve [AUC] = 89.30% and area under the precision–recall curve [AUPR] = 87.67%). Meanwhile, with the same dataset, KGDCMI is 2.37% and 3.08%, respectively, higher than the only existing model, and we conducted three groups of comparative experiments, obtaining the best classification strategy, feature extraction parameters, and dimensions. In addition, in the performed case study, 7 of the top 10 interaction pairs were confirmed in PubMed. These results suggest that KGDCMI is a feasible and useful method to predict the circRNA–miRNA interaction and can act as a reliable candidate for related RNA biological experiments.
Collapse
Affiliation(s)
- Xin-Fei Wang
- School of Information Engineering, Xijing University, Xi’an, China
| | - Chang-Qing Yu
- School of Information Engineering, Xijing University, Xi’an, China
- *Correspondence: Chang-Qing Yu, ; Li-Ping Li,
| | - Li-Ping Li
- School of Information Engineering, Xijing University, Xi’an, China
- College of Grassland and Environment Sciences, Xinjiang Agricultural University, Urumqi, China
- *Correspondence: Chang-Qing Yu, ; Li-Ping Li,
| | - Zhu-Hong You
- School of Computer Science, Northwestern Polytechnical University, Xi’an, China
| | - Wen-Zhun Huang
- School of Information Engineering, Xijing University, Xi’an, China
| | - Yue-Chao Li
- School of Information Engineering, Xijing University, Xi’an, China
| | - Zhong-Hao Ren
- School of Information Engineering, Xijing University, Xi’an, China
| | - Yong-Jian Guan
- School of Information Engineering, Xijing University, Xi’an, China
| |
Collapse
|
20
|
Xu L, Li X, Yang Q, Tan L, Liu Q, Liu Y. Application of Bidirectional Generative Adversarial Networks to Predict Potential miRNAs Associated With Diseases. Front Genet 2022; 13:936823. [PMID: 35903359 PMCID: PMC9314862 DOI: 10.3389/fgene.2022.936823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 06/08/2022] [Indexed: 11/18/2022] Open
Abstract
Substantial evidence has shown that microRNAs are crucial for biological processes within complex human diseases. Identifying the association of miRNA–disease pairs will contribute to accelerating the discovery of potential biomarkers and pathogenesis. Researchers began to focus on constructing computational models to facilitate the progress of disease pathology and clinical medicine by identifying the potential disease-related miRNAs. However, most existing computational methods are expensive, and their use is limited to unobserved relationships for unknown miRNAs (diseases) without association information. In this manuscript, we proposed a creatively semi-supervised model named bidirectional generative adversarial network for miRNA-disease association prediction (BGANMDA). First, we constructed a microRNA similarity network, a disease similarity network, and Gaussian interaction profile kernel similarity based on the known miRNA–disease association and comprehensive similarity of miRNAs (diseases). Next, an integrated similarity feature network with the full underlying relationships of miRNA–disease pairwise was obtained. Then, the similarity feature network was fed into the BGANMDA model to learn advanced traits in latent space. Finally, we ranked an association score list and predicted the associations between miRNA and disease. In our experiment, a five-fold cross validation was applied to estimate BGANMDA’s performance, and an area under the curve (AUC) of 0.9319 and a standard deviation of 0.00021 were obtained. At the same time, in the global and local leave-one-out cross validation (LOOCV), the AUC value and standard deviation of BGANMDA were 0.9116 ± 0.0025 and 0.8928 ± 0.0022, respectively. Furthermore, BGANMDA was employed in three different case studies to validate its prediction capability and accuracy. The experimental results of the case studies showed that 46, 46, and 48 of the top 50 prediction lists had been identified in previous studies.
Collapse
Affiliation(s)
- Long Xu
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Xiaokun Li
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Heilongjiang University, Harbin, China
- *Correspondence: Xiaokun Li, ; Yong Liu,
| | - Qiang Yang
- School of Electronic Engineering, Heilongjiang University, Harbin, China
| | - Long Tan
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Qingyuan Liu
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Heilongjiang University, Harbin, China
| | - Yong Liu
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
- *Correspondence: Xiaokun Li, ; Yong Liu,
| |
Collapse
|
21
|
Ouyang D, Miao R, Wang J, Liu X, Xie S, Ai N, Dang Q, Liang Y. Predicting Multiple Types of Associations Between miRNAs and Diseases Based on Graph Regularized Weighted Tensor Decomposition. Front Bioeng Biotechnol 2022; 10:911769. [PMID: 35910021 PMCID: PMC9335924 DOI: 10.3389/fbioe.2022.911769] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 05/04/2022] [Indexed: 11/23/2022] Open
Abstract
Many studies have indicated miRNAs lead to the occurrence and development of diseases through a variety of underlying mechanisms. Meanwhile, computational models can save time, minimize cost, and discover potential associations on a large scale. However, most existing computational models based on a matrix or tensor decomposition cannot recover positive samples well. Moreover, the high noise of biological similarity networks and how to preserve these similarity relationships in low-dimensional space are also challenges. To this end, we propose a novel computational framework, called WeightTDAIGN, to identify potential multiple types of miRNA–disease associations. WeightTDAIGN can recover positive samples well and improve prediction performance by weighting positive samples. WeightTDAIGN integrates more auxiliary information related to miRNAs and diseases into the tensor decomposition framework, focuses on learning low-rank tensor space, and constrains projection matrices by using the L2,1 norm to reduce the impact of redundant information on the model. In addition, WeightTDAIGN can preserve the local structure information in the biological similarity network by introducing graph Laplacian regularization. Our experimental results show that the sparser datasets, the more satisfactory performance of WeightTDAIGN can be obtained. Also, the results of case studies further illustrate that WeightTDAIGN can accurately predict the associations of miRNA–disease-type.
Collapse
Affiliation(s)
- Dong Ouyang
- Faculty of Information Technology, Macau University of Science and Technology, Macau, China
| | - Rui Miao
- Faculty of Information Technology, Macau University of Science and Technology, Macau, China
| | - Jianjun Wang
- School of Mathematics and Statistics, Southwest University, Chongqing, China
| | - Xiaoying Liu
- Computer Engineering Technical College, Guangdong Polytechnic of Science and Technology, Zhuhai, China
| | - Shengli Xie
- Institute of Intelligent Information Processing, Guangdong University of Technology, Guangzhou, China
| | - Ning Ai
- Faculty of Information Technology, Macau University of Science and Technology, Macau, China
| | - Qi Dang
- Faculty of Information Technology, Macau University of Science and Technology, Macau, China
| | - Yong Liang
- Peng Cheng Laboratory, Shenzhen, China
- *Correspondence: Yong Liang,
| |
Collapse
|