151
|
Zhong L, Ming Z, Xie G, Fan C, Piao X. Recent Advances on the Semi-Supervised Learning for Long Non-Coding RNA-Protein Interactions Prediction: A Review. Protein Pept Lett 2019; 27:385-391. [PMID: 31654509 DOI: 10.2174/0929866526666191025104043] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 05/30/2019] [Accepted: 09/24/2019] [Indexed: 12/24/2022]
Abstract
In recent years, more and more evidence indicates that long non-coding RNA (lncRNA) plays a significant role in the development of complex biological processes, especially in RNA progressing, chromatin modification, and cell differentiation, as well as many other processes. Surprisingly, lncRNA has an inseparable relationship with human diseases such as cancer. Therefore, only by knowing more about the function of lncRNA can we better solve the problems of human diseases. However, lncRNAs need to bind to proteins to perform their biomedical functions. So we can reveal the lncRNA function by studying the relationship between lncRNA and protein. But due to the limitations of traditional experiments, researchers often use computational prediction models to predict lncRNA protein interactions. In this review, we summarize several computational models of the lncRNA protein interactions prediction base on semi-supervised learning during the past two years, and introduce their advantages and shortcomings briefly. Finally, the future research directions of lncRNA protein interaction prediction are pointed out.
Collapse
Affiliation(s)
- Lin Zhong
- School of Mathematics, Liaoning University, Shenyang, 110036, China
| | - Zhong Ming
- National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, 518060, China.,College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
| | - Guobo Xie
- School of Computer Science, Guangdong University of Technology, Guangzhou, 510006, China
| | - Chunlong Fan
- College of Computer Science, Shenyang Aerospace University, Shenyang, 110136, China
| | - Xue Piao
- School of Medical Informatics, Xuzhou Medical University, Xuzhou, 221004, China
| |
Collapse
|
152
|
Pan X, Shen HB. Inferring Disease-Associated MicroRNAs Using Semi-supervised Multi-Label Graph Convolutional Networks. iScience 2019; 20:265-277. [PMID: 31605942 PMCID: PMC6817654 DOI: 10.1016/j.isci.2019.09.013] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 09/05/2019] [Accepted: 09/11/2019] [Indexed: 01/22/2023] Open
Abstract
MicroRNAs (miRNAs) play crucial roles in biological processes involved in diseases. The associations between diseases and protein-coding genes (PCGs) have been well investigated, and miRNAs interact with PCGs to trigger them to be functional. We present a computational method, DimiG, to infer miRNA-associated diseases using a semi-supervised Graph Convolutional Network model (GCN). DimiG uses a multi-label framework to integrate PCG-PCG interactions, PCG-miRNA interactions, PCG-disease associations, and tissue expression profiles. DimiG is trained on disease-PCG associations and an interaction network using a GCN, which is further used to score associations between diseases and miRNAs. We evaluate DimiG on a benchmark set from verified disease-miRNA associations. Our results demonstrate that DimiG outperforms the best unsupervised method and is comparable to two supervised methods. Three case studies of prostate cancer, lung cancer, and inflammatory bowel disease further demonstrate the efficacy of DimiG, where top miRNAs predicted by DimiG are supported by literature.
Collapse
Affiliation(s)
- Xiaoyong Pan
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240 Shanghai, China; Department of Medical informatics, Erasmus Medical Center, 3015 CE Rotterdam, the Netherlands.
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240 Shanghai, China.
| |
Collapse
|
153
|
Zhang H, Ming Z, Fan C, Zhao Q, Liu H. A path-based computational model for long non-coding RNA-protein interaction prediction. Genomics 2019; 112:1754-1760. [PMID: 31639442 DOI: 10.1016/j.ygeno.2019.09.018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 09/20/2019] [Accepted: 09/24/2019] [Indexed: 10/25/2022]
Abstract
Recently, lncRNAs have attracted accumulating attentions because more and more experimental researches have shown lncRNA can play critical roles in many biological processes. Predicting potential interactions between lncRNAs and proteins are key to understand the lncRNAs biological functions. But traditional biological experiments are expensive and time-consuming, network similarity methods provide a powerful solution to computationally predict lncRNA-protein interactions. In this work, a novel path-based lncRNA-protein interaction (PBLPI) prediction model is proposed by integrating protein semantic similarity, lncRNA functional similarity, known human lncRNA-protein interactions, and Gaussian interaction profile kernel similarity. PBLPI model utilizes three interlinked sub-graphs to construct a heterogeneous graph, and then infers potential lncRNA-protein interactions through depth-first search algorithm. Consequently, PBLPI achieves reliable performance in the frameworks of 5-fold cross validation (average AUC is 0.9244 and AUPR is 0.6478). In the case study, we use "Mus musculus" data to further validate the reliability of PBLPI method. It is anticipated that PBLPI would become a useful tool to identify potential lncRNA-protein interactions.
Collapse
Affiliation(s)
- Hui Zhang
- School of Life Science, Liaoning University, Shenyang 110036, China
| | - Zhong Ming
- National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen 518060, China; College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| | - Chunlong Fan
- College of Computer Science, Shenyang Aerospace University, Shenyang 110136, China
| | - Qi Zhao
- College of Computer Science, Shenyang Aerospace University, Shenyang 110136, China.
| | - Hongsheng Liu
- School of Life Science, Liaoning University, Shenyang 110036, China; Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Liaoning Province, Shenyang 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Shenyang 110036, China.
| |
Collapse
|
154
|
Huang Z, Liu L, Gao Y, Shi J, Cui Q, Li J, Zhou Y. Benchmark of computational methods for predicting microRNA-disease associations. Genome Biol 2019; 20:202. [PMID: 31594544 PMCID: PMC6781296 DOI: 10.1186/s13059-019-1811-3] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 09/03/2019] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND A series of miRNA-disease association prediction methods have been proposed to prioritize potential disease-associated miRNAs. Independent benchmarking of these methods is warranted to assess their effectiveness and robustness. RESULTS Based on more than 8000 novel miRNA-disease associations from the latest HMDD v3.1 database, we perform systematic comparison among 36 readily available prediction methods. Their overall performances are evaluated with rigorous precision-recall curve analysis, where 13 methods show acceptable accuracy (AUPRC > 0.200) while the top two methods achieve a promising AUPRC over 0.300, and most of these methods are also highly ranked when considering only the causal miRNA-disease associations as the positive samples. The potential of performance improvement is demonstrated by combining different predictors or adopting a more updated miRNA similarity matrix, which would result in up to 16% and 46% of AUPRC augmentations compared to the best single predictor and the predictors using the previous similarity matrix, respectively. Our analysis suggests a common issue of the available methods, which is that the prediction results are severely biased toward well-annotated diseases with many associated miRNAs known and cannot further stratify the positive samples by discriminating the causal miRNA-disease associations from the general miRNA-disease associations. CONCLUSION Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate miRNA-disease association predictors for their purpose, but also suggest the future directions for the development of more robust miRNA-disease association predictors.
Collapse
Affiliation(s)
- Zhou Huang
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
| | - Leibo Liu
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, 300401, China
| | - Yuanxu Gao
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
| | - Jiangcheng Shi
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
| | - Qinghua Cui
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
- Center of Bioinformatics, Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Jianwei Li
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, 300401, China.
| | - Yuan Zhou
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China.
| |
Collapse
|
155
|
Gao Y, Jia K, Shi J, Zhou Y, Cui Q. A Computational Model to Predict the Causal miRNAs for Diseases. Front Genet 2019; 10:935. [PMID: 31632446 PMCID: PMC6786093 DOI: 10.3389/fgene.2019.00935] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 09/05/2019] [Indexed: 01/30/2023] Open
Abstract
MicroRNAs (miRNAs) are one class of important noncoding RNA molecules, and their dysfunction is associated with a number of diseases. Currently, a series of databases and algorithms have been developed for dissecting human miRNA-disease associations. However, these tools only presented the associations between miRNAs and disease but did not address whether the associations are causal or not, a key biomedical issue that is critical for understanding the roles of candidate miRNAs in the mechanisms of specific diseases. Here we first manually curated causal miRNA-disease association information and updated the human miRNA disease database (HMDD) accordingly. Then we built a computational model, MDCAP (MiRNA-Disease Causal Association Predictor), to predict novel causal miRNA-disease associations. As a result, we collected 6,667 causal miRNA-disease associations between 616 miRNAs and 440 diseases, which accounts for ∼20% of the total data in HMDD. The MDCAP model achieved an area under the receiver operating characteristic (ROC) curve of 0.928 for ROC analysis by independent test and an area under the ROC curve of 0.925 for ROC analysis by 10-fold cross-validation. Finally, case studies conducted on myocardial infarction and hsa-mir-498 further suggested the biomedical significance of the predictions.
Collapse
Affiliation(s)
- Yuanxu Gao
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Kaiwen Jia
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Jiangcheng Shi
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Yuan Zhou
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Qinghua Cui
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, Beijing, China.,Center of Bioinformatics, Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
156
|
Wu HY, Wei Y, Pan SL. Down-regulation and clinical significance of miR-7-2-3p in papillary thyroid carcinoma with multiple detecting methods. IET Syst Biol 2019; 13:225-233. [PMID: 31538956 PMCID: PMC8687168 DOI: 10.1049/iet-syb.2019.0025] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 05/30/2019] [Accepted: 06/10/2019] [Indexed: 04/05/2024] Open
Abstract
Altered miRNA expression participates in the biological progress of thyroid carcinoma and functions as a diagnostic marker or therapeutic agent. However, the role of miR-7-2-3p is currently unclear. The authors' study was the first investigation of miR-7-2-3p expression level and diagnostic ability in several public databases. Potential target genes were obtained from DIANA Tools, and function enrichment analysis was then performed. Furthermore, the authors examined expression levels of potential targets in the Human Protein Atlas (HPA) and the Cancer Genome Atlas (TCGA). Finally, the potential transcription factors (TFs) were predicted by JASPAR. TCGA, GSE62054, GSE73182, GSE40807, and GSE55780 revealed that miR-7-2-3p expression in papillary thyroid carcinoma (PTC) tissues was notably lower compared with non-tumour tissues, while its expression in E-MATB-736 showed no remarkable difference. Function enrichment analysis showed that 698 genes were enriched in pathways, including pathways in cancer, and glioma. CCND1, GSK3B, and ITGAV of pathways in cancer were inverse correlations with miR-7-2-3p in both post-transcription and protein levels. According to the TF prediction, the prospective upstream TFs of miR-7-2-3p were ISX, SPI1, PRRX1, and BARX1. MiR-7-2-3p was significantly down-regulated and may act on PTC progression by crucial pathways. However, the mechanisms of miR-7-2-3p need further investigation.
Collapse
Affiliation(s)
- Hua-Yu Wu
- Department of Cell Biology and Genetics, School of Pre-clinical Medicine, Guangxi Medical University, Nanning, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
| | - Yi Wei
- Department of Pathophysiology, School of Pre-clinical Medicine, Guangxi Medical University, Nanning, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
| | - Shang-Ling Pan
- Department of Pathophysiology, School of Pre-clinical Medicine, Guangxi Medical University, Nanning, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China.
| |
Collapse
|
157
|
Cheng L, Zhao H, Wang P, Zhou W, Luo M, Li T, Han J, Liu S, Jiang Q. Computational Methods for Identifying Similar Diseases. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 18:590-604. [PMID: 31678735 PMCID: PMC6838934 DOI: 10.1016/j.omtn.2019.09.019] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 09/11/2019] [Accepted: 09/12/2019] [Indexed: 02/01/2023]
Abstract
Although our knowledge of human diseases has increased dramatically, the molecular basis, phenotypic traits, and therapeutic targets of most diseases still remain unclear. An increasing number of studies have observed that similar diseases often are caused by similar molecules, can be diagnosed by similar markers or phenotypes, or can be cured by similar drugs. Thus, the identification of diseases similar to known ones has attracted considerable attention worldwide. To this end, the associations between diseases at the molecular, phenotypic, and taxonomic levels were used to measure the pairwise similarity in diseases. The corresponding performance assessment strategies for these methods involving the terms “category-based,” “simulated-patient-based,” and “benchmark-data-based” were thus further emphasized. Then, frequently used methods were evaluated using a benchmark-data-based strategy. To facilitate the assessment of disease similarity scores, researchers have designed dozens of tools that implement these methods for calculating disease similarity. Currently, disease similarity has been advantageous in predicting noncoding RNA (ncRNA) function and therapeutic drugs for diseases. In this article, we review disease similarity methods, evaluation strategies, tools, and their applications in the biomedical community. We further evaluate the performance of these methods and discuss the current limitations and future trends for calculating disease similarity.
Collapse
Affiliation(s)
- Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Hengqiang Zhao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Pingping Wang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Wenyang Zhou
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Meng Luo
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Tianxin Li
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Junwei Han
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
| | - Shulin Liu
- Systemomics Center, College of Pharmacy, and Genomics Research Center (State-Province Key Laboratories of Biomedicine-Pharmaceutics of China), Harbin Medical University, Harbin, Heilongjiang, China; Department of Microbiology, Immunology and Infectious Diseases, University of Calgary, Calgary, AB, Canada.
| | - Qinghua Jiang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China.
| |
Collapse
|
158
|
Xie G, Fan Z, Sun Y, Wu C, Ma L. WBNPMD: weighted bipartite network projection for microRNA-disease association prediction. J Transl Med 2019; 17:322. [PMID: 31547811 PMCID: PMC6757419 DOI: 10.1186/s12967-019-2063-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 09/06/2019] [Indexed: 01/21/2023] Open
Abstract
Background Recently, numerous biological experiments have indicated that microRNAs (miRNAs) play critical roles in exploring the pathogenesis of various human diseases. Since traditional experimental methods for miRNA-disease associations detection are costly and time-consuming, it becomes urgent to design efficient and robust computational techniques for identifying undiscovered interactions. Methods In this paper, we proposed a computation framework named weighted bipartite network projection for miRNA-disease association prediction (WBNPMD). In this method, transfer weights were constructed by combining the known miRNA and disease similarities, and the initial information was properly configured. Then the two-step bipartite network algorithm was implemented to infer potential miRNA-disease associations. Results The proposed WBNPMD was applied to the known miRNA-disease association data, and leave-one-out cross-validation (LOOCV) and fivefold cross-validation were implemented to evaluate the performance of WBNPMD. As a result, our method achieved the AUCs of 0.9321 and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$0.9173 \pm 0.0005$$\end{document}0.9173±0.0005 in LOOCV and fivefold cross-validation, and outperformed other four state-of-the-art methods. We also carried out two kinds of case studies on prostate neoplasm, colorectal neoplasm, and lung neoplasm, and most of the top 50 predicted miRNAs were confirmed to have an association with the corresponding diseases based on dbDeMC, miR2Disease, and HMDD V3.0 databases. Conclusions The experimental results demonstrate that WBNPMD can accurately infer potential miRNA-disease associations. We anticipated that the proposed WBNPMD could serve as a powerful tool for potential miRNA-disease associations excavation. Electronic supplementary material The online version of this article (10.1186/s12967-019-2063-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Guobo Xie
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Zhiliang Fan
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Yuping Sun
- School of Computer Science, Guangdong University of Technology, Guangzhou, China.
| | - Cuiming Wu
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Lei Ma
- Institute of Automation, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
159
|
Shi C, Chen J, Kang X, Zhao G, Lao X, Zheng H. Deep Learning in the Study of Protein-Related Interactions. Protein Pept Lett 2019; 27:359-369. [PMID: 31538879 DOI: 10.2174/0929866526666190723114142] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2019] [Revised: 03/13/2019] [Accepted: 04/05/2019] [Indexed: 11/22/2022]
Abstract
Protein-related interaction prediction is critical to understanding life processes, biological functions, and mechanisms of drug action. Experimental methods used to determine proteinrelated interactions have always been costly and inefficient. In recent years, advances in biological and medical technology have provided us with explosive biological and physiological data, and deep learning-based algorithms have shown great promise in extracting features and learning patterns from complex data. At present, deep learning in protein research has emerged. In this review, we provide an introductory overview of the deep neural network theory and its unique properties. Mainly focused on the application of this technology in protein-related interactions prediction over the past five years, including protein-protein interactions prediction, protein-RNA\DNA, Protein- drug interactions prediction, and others. Finally, we discuss some of the challenges that deep learning currently faces.
Collapse
Affiliation(s)
- Cheng Shi
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China
| | - Jiaxing Chen
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China
| | - Xinyue Kang
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China
| | - Guiling Zhao
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China
| | - Xingzhen Lao
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China
| | - Heng Zheng
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China
| |
Collapse
|
160
|
Zhang Y, Chen M, Cheng X, Chen Z. LSGSP: a novel miRNA-disease association prediction model using a Laplacian score of the graphs and space projection federated method. RSC Adv 2019; 9:29747-29759. [PMID: 35531537 PMCID: PMC9071959 DOI: 10.1039/c9ra05554a] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 09/09/2019] [Indexed: 12/31/2022] Open
Abstract
Lots of research findings have indicated that miRNAs (microRNAs) are involved in many important biological processes; their mutations and disorders are closely related to diseases, therefore, determining the associations between human diseases and miRNAs is key to understand pathogenic mechanisms. Existing biological experimental methods for identifying miRNA-disease associations are usually expensive and time consuming. Therefore, the development of efficient and reliable computational methods for identifying disease-related miRNAs has become an important topic in the field of biological research in recent years. In this study, we developed a novel miRNA-disease association prediction model using a Laplacian score of the graphs and space projection federated method (LSGSP). This integrates experimentally validated miRNA-disease associations, disease semantic similarity scores, miRNA functional scores, and miRNA family information to build a new disease similarity network and miRNA similarity network, and then obtains the global similarities of these networks through calculating the Laplacian score of the graphs, based on which the miRNA-disease weighted network can be constructed through combination with the miRNA-disease Boolean network. Finally, the miRNA-disease score was obtained via projecting the miRNA space and disease space onto the miRNA-disease weighted network. Compared with several other state-of-the-art methods, using leave-one-out cross validation (LOOCV) to evaluate the accuracy of LSGSP with respect to a benchmark dataset, prediction dataset and compare dataset, LSGSP showed excellent predictive performance with high AUC values of 0.9221, 0.9745 and 0.9194, respectively. In addition, for prostate neoplasms and lung neoplasms, the consistencies between the top 50 predicted miRNAs (obtained from LSGSP) and the results (confirmed from the updated HMDD, miR2Disease, and dbDEMC databases) reached 96% and 100%, respectively. Similarly, for isolated diseases (diseases not associated with any miRNAs), the consistencies between the top 50 predicted miRNAs (obtained from LSGSP) and the results (confirmed from the above-mentioned three databases) reached 98% and 100%, respectively. These results further indicate that LSGSP can effectively predict potential associations between miRNAs and diseases.
Collapse
Affiliation(s)
- Yi Zhang
- School of Information Science and Engineering, Guilin University of Technology 541004 Guilin China
| | - Min Chen
- School of Computer Science and Technology, Hunan Institute of Technology 421002 Hengyang China
| | - Xiaohui Cheng
- School of Information Science and Engineering, Guilin University of Technology 541004 Guilin China
| | - Zheng Chen
- School of Computer Science and Technology, Hunan Institute of Technology 421002 Hengyang China
| |
Collapse
|
161
|
Gong Y, Niu Y, Zhang W, Li X. A network embedding-based multiple information integration method for the MiRNA-disease association prediction. BMC Bioinformatics 2019; 20:468. [PMID: 31510919 PMCID: PMC6740005 DOI: 10.1186/s12859-019-3063-3] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 08/29/2019] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND MiRNAs play significant roles in many fundamental and important biological processes, and predicting potential miRNA-disease associations makes contributions to understanding the molecular mechanism of human diseases. Existing state-of-the-art methods make use of miRNA-target associations, miRNA-family associations, miRNA functional similarity, disease semantic similarity and known miRNA-disease associations, but the known miRNA-disease associations are not well exploited. RESULTS In this paper, a network embedding-based multiple information integration method (NEMII) is proposed for the miRNA-disease association prediction. First, known miRNA-disease associations are formulated as a bipartite network, and the network embedding method Structural Deep Network Embedding (SDNE) is adopted to learn embeddings of nodes in the bipartite network. Second, the embedding representations of miRNAs and diseases are combined with biological features about miRNAs and diseases (miRNA-family associations and disease semantic similarities) to represent miRNA-disease pairs. Third, the prediction models are constructed based on the miRNA-disease pairs by using the random forest. In computational experiments, NEMII achieves high-accuracy performances and outperforms other state-of-the-art methods: GRNMF, NTSMDA and PBMDA. The usefulness of NEMII is further validated by case studies. The studies demonstrate the great potential of network embedding method for the miRNA-disease association prediction, and SDNE outperforms other popular network embedding methods: DeepWalk, High-Order Proximity preserved Embedding (HOPE) and Laplacian Eigenmaps (LE). CONCLUSION We propose a new method, named NEMII, for predicting miRNA-disease associations, which has great potential to benefit the field of miRNA-disease association prediction.
Collapse
Affiliation(s)
- Yuchong Gong
- School of Computer Science, Wuhan University, Wuhan, 430072 China
| | - Yanqing Niu
- School of Mathematics and Statistics, South-Central University for Nationalities, Wuhan, 430074 China
| | - Wen Zhang
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070 China
| | - Xiaohong Li
- School of Computer Science, Wuhan University, Wuhan, 430072 China
| |
Collapse
|
162
|
Zhang L, Chen X, Yin J. Prediction of Potential miRNA-Disease Associations Through a Novel Unsupervised Deep Learning Framework with Variational Autoencoder. Cells 2019; 8:cells8091040. [PMID: 31489920 PMCID: PMC6770222 DOI: 10.3390/cells8091040] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Revised: 08/31/2019] [Accepted: 09/02/2019] [Indexed: 12/22/2022] Open
Abstract
The important role of microRNAs (miRNAs) in the formation, development, diagnosis, and treatment of diseases has attracted much attention among researchers recently. In this study, we present an unsupervised deep learning model of the variational autoencoder for MiRNA–disease association prediction (VAEMDA). Through combining the integrated miRNA similarity and the integrated disease similarity with known miRNA–disease associations, respectively, we constructed two spliced matrices. These matrices were applied to train the variational autoencoder (VAE), respectively. The final predicted association scores between miRNAs and diseases were obtained by integrating the scores from the two trained VAE models. Unlike previous models, VAEMDA can avoid noise introduced by the random selection of negative samples and reveal associations between miRNAs and diseases from the perspective of data distribution. Compared with previous methods, VAEMDA obtained higher area under the receiver operating characteristics curves (AUCs) of 0.9118, 0.8652, and 0.9091 ± 0.0065 in global leave-one-out cross validation (LOOCV), local LOOCV, and five-fold cross validation, respectively. Further, the AUCs of VAEMDA were 0.8250 and 0.8237 in global leave-one-disease-out cross validation (LODOCV), and local LODOCV, respectively. In three different types of case studies on three important diseases, the results showed that most of the top 50 potentially associated miRNAs were verified by databases and the literature.
Collapse
Affiliation(s)
- Li Zhang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China.
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China.
| | - Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China.
| |
Collapse
|
163
|
Predicting miRNA-Disease Associations by Incorporating Projections in Low-Dimensional Space and Local Topological Information. Genes (Basel) 2019; 10:genes10090685. [PMID: 31500152 PMCID: PMC6770973 DOI: 10.3390/genes10090685] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Revised: 08/31/2019] [Accepted: 09/03/2019] [Indexed: 12/14/2022] Open
Abstract
Predicting the potential microRNA (miRNA) candidates associated with a disease helps in exploring the mechanisms of disease development. Most recent approaches have utilized heterogeneous information about miRNAs and diseases, including miRNA similarities, disease similarities, and miRNA-disease associations. However, these methods do not utilize the projections of miRNAs and diseases in a low-dimensional space. Thus, it is necessary to develop a method that can utilize the effective information in the low-dimensional space to predict potential disease-related miRNA candidates. We proposed a method based on non-negative matrix factorization, named DMAPred, to predict potential miRNA-disease associations. DMAPred exploits the similarities and associations of diseases and miRNAs, and it integrates local topological information of the miRNA network. The likelihood that a miRNA is associated with a disease also depends on their projections in low-dimensional space. Therefore, we project miRNAs and diseases into low-dimensional feature space to yield their low-dimensional and dense feature representations. Moreover, the sparse characteristic of miRNA-disease associations was introduced to make our predictive model more credible. DMAPred achieved superior performance for 15 well-characterized diseases with AUCs (area under the receiver operating characteristic curve) ranging from 0.860 to 0.973 and AUPRs (area under the precision-recall curve) ranging from 0.118 to 0.761. In addition, case studies on breast, prostatic, and lung neoplasms demonstrated the ability of DMAPred to discover potential disease-related miRNAs.
Collapse
|
164
|
Pant N, Rakshit S, Paul S, Saha I. Genome-wide analysis of multi-view data of miRNA-seq to identify miRNA biomarkers for stomach cancer. J Biomed Inform 2019; 97:103254. [DOI: 10.1016/j.jbi.2019.103254] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2019] [Revised: 07/03/2019] [Accepted: 07/17/2019] [Indexed: 12/30/2022]
|
165
|
Cui Z, Liu JX, Gao YL, Zhu R, Yuan SS. LncRNA-Disease Associations Prediction Using Bipartite Local Model With Nearest Profile-Based Association Inferring. IEEE J Biomed Health Inform 2019; 24:1519-1527. [PMID: 31478878 DOI: 10.1109/jbhi.2019.2937827] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
There is much evidence that long non-coding RNA (lncRNA) is associated with many diseases. However, it is time-consuming and expensive to identify meaningful lncRNA-disease associations (LDAs) through medical or biological experiments. Therefore, investigating how to identify more meaningful LDAs is necessary, and at the same time it is conducive to the prevention, diagnosis and treatment of complex diseases. Considering the limitations of some current prediction models, a novel model based on bipartite local model with nearest profile-based association inferring, BLM-NPAI, is developed for predicting LDAs. This model predicts novel LDAs from the lncRNA side and the disease side, respectively. More importantly, for some lncRNAs and diseases without any association, the model can also be predicted by their nearest neighbors. Leave-one-out cross validation (LOOCV) and 5-fold cross validation are implemented for BLM-NPAI to evaluate the performance of this model. Our model is superior to current advanced methods in most cases. In addition, to verify the validity and reliability of BLM-NPAI, three disease cases and three lncRNA cases are analyzed to further evaluate BLM-NPAI. Finally, these predicted novel LDAs are confirmed by using the LncRNA-disease database.
Collapse
|
166
|
Li C, Liu H, Hu Q, Que J, Yao J. A Novel Computational Model for Predicting microRNA-Disease Associations Based on Heterogeneous Graph Convolutional Networks. Cells 2019; 8:cells8090977. [PMID: 31455028 PMCID: PMC6769654 DOI: 10.3390/cells8090977] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 08/22/2019] [Accepted: 08/23/2019] [Indexed: 01/13/2023] Open
Abstract
Identifying the interactions between disease and microRNA (miRNA) can accelerate drugs development, individualized diagnosis, and treatment for various human diseases. However, experimental methods are time-consuming and costly. So computational approaches to predict latent miRNA-disease interactions are eliciting increased attention. But most previous studies have mainly focused on designing complicated similarity-based methods to predict latent interactions between miRNAs and diseases. In this study, we propose a novel computational model, termed heterogeneous graph convolutional network for miRNA-disease associations (HGCNMDA), which is based on known human protein-protein interaction (PPI) and integrates four biological networks: miRNA-disease, miRNA-gene, disease-gene, and PPI network. HGCNMDA achieved reliable performance using leave-one-out cross-validation (LOOCV). HGCNMDA is then compared to three state-of-the-art algorithms based on five-fold cross-validation. HGCNMDA achieves an AUC of 0.9626 and an average precision of 0.9660, respectively, which is ahead of other competitive algorithms. We further analyze the top-10 unknown interactions between miRNA and disease. In summary, HGCNMDA is a useful computational model for predicting miRNA-disease interactions.
Collapse
Affiliation(s)
- Chunyan Li
- School of Informatics, Xiamen University, Xiamen 361005, China
- Graduate School, Yunnan Minzu University, Kunming 650504, China
| | - Hongju Liu
- College of Information Technology and Computer Science, University of the Cordilleras, Baguio 2600, Philippines
| | - Qian Hu
- School of Informatics, Xiamen University, Xiamen 361005, China
| | - Jinlong Que
- School of Informatics, Xiamen University, Xiamen 361005, China
| | - Junfeng Yao
- School of Informatics, Xiamen University, Xiamen 361005, China.
| |
Collapse
|
167
|
Prediction of Disease-related microRNAs through Integrating Attributes of microRNA Nodes and Multiple Kinds of Connecting Edges. Molecules 2019; 24:molecules24173099. [PMID: 31455026 PMCID: PMC6749327 DOI: 10.3390/molecules24173099] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Revised: 08/09/2019] [Accepted: 08/14/2019] [Indexed: 11/17/2022] Open
Abstract
Identifying disease-associated microRNAs (disease miRNAs) contributes to the understanding of disease pathogenesis. Most previous computational biology studies focused on multiple kinds of connecting edges of miRNAs and diseases, including miRNA-miRNA similarities, disease-disease similarities, and miRNA-disease associations. Few methods exploited the node attribute information related to miRNA family and cluster. The previous methods do not completely consider the sparsity of node attributes. Additionally, it is challenging to deeply integrate the node attributes of miRNAs and the similarities and associations related to miRNAs and diseases. In the present study, we propose a novel method, known as MDAPred, based on nonnegative matrix factorization to predict candidate disease miRNAs. MDAPred integrates the node attributes of miRNAs and the related similarities and associations of miRNAs and diseases. Since a miRNA is typically subordinate to a family or a cluster, the node attributes of miRNAs are sparse. Similarly, the data for miRNA and disease similarities are sparse. Projecting the miRNA and disease similarities and miRNA node attributes into a common low-dimensional space contributes to estimating miRNA-disease associations. Simultaneously, the possibility that a miRNA is associated with a disease depends on the miRNA's neighbour information. Therefore, MDAPred deeply integrates projections of multiple kinds of connecting edges, projections of miRNAs node attributes, and neighbour information of miRNAs. The cross-validation results showed that MDAPred achieved superior performance compared to other state-of-the-art methods for predicting disease-miRNA associations. MDAPred can also retrieve more actual miRNA-disease associations at the top of prediction results, which is very important for biologists. Additionally, case studies of breast, lung, and pancreatic cancers further confirmed the ability of MDAPred to discover potential miRNA-disease associations.
Collapse
|
168
|
Zhou Y, Zheng X, Xu B, Hu W, Huang T, Jiang J. The Identification and Analysis of mRNA-lncRNA-miRNA Cliques From the Integrative Network of Ovarian Cancer. Front Genet 2019; 10:751. [PMID: 31497032 PMCID: PMC6712160 DOI: 10.3389/fgene.2019.00751] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Accepted: 07/17/2019] [Indexed: 12/11/2022] Open
Abstract
Ovarian cancer is one of the leading causes of cancer mortality in women. Since little clinical symptoms were shown in the early period of ovarian cancer, most patients were found in phases III-IV or with abdominal metastasis when diagnosed. The lack of effective early diagnosis biomarkers makes ovarian cancer difficult to screen. However, in essence, the fundamental problem is we know very little about the regulatory mechanisms during tumorigenesis of ovarian cancer. There are emerging regulatory factors, such as long noncoding RNAs (lncRNAs) and microRNAs (miRNAs), which have played important roles in cancers. Therefore, we analyzed the RNA-seq profiles of 407 ovarian cancer patients. An integrative network of 20,424 coding RNAs (mRNAs), 10,412 lncRNAs, and 742 miRNAs were construed with variance inflation factor (VIF) regression method. The mRNA-lncRNA-miRNA cliques were identified from the network and analyzed. Such promising cliques showed significant correlations with survival and stage of ovarian cancer and characterized the complex sponge regulatory mechanism, suggesting their contributions to tumorigenicity. Our results provided novel insights of the regulatory mechanisms among mRNAs, lncRNAs, and miRNAs and highlighted several promising regulators for ovarian cancer detection and treatment.
Collapse
Affiliation(s)
- You Zhou
- Department of Tumor Biological Treatment, The Third Affiliated Hospital of Soochow University, Changzhou, China
- Jiangsu Engineering Research Center for Tumor Immunotherapy, Changzhou, China
- Institute of Cell Therapy, Soochow University, Changzhou, China
| | - Xiao Zheng
- Department of Tumor Biological Treatment, The Third Affiliated Hospital of Soochow University, Changzhou, China
- Jiangsu Engineering Research Center for Tumor Immunotherapy, Changzhou, China
- Institute of Cell Therapy, Soochow University, Changzhou, China
| | - Bin Xu
- Department of Tumor Biological Treatment, The Third Affiliated Hospital of Soochow University, Changzhou, China
- Jiangsu Engineering Research Center for Tumor Immunotherapy, Changzhou, China
- Institute of Cell Therapy, Soochow University, Changzhou, China
| | - Wenwei Hu
- Department of Tumor Biological Treatment, The Third Affiliated Hospital of Soochow University, Changzhou, China
| | - Tao Huang
- Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences (CAS), Shanghai, China
| | - Jingting Jiang
- Department of Tumor Biological Treatment, The Third Affiliated Hospital of Soochow University, Changzhou, China
- Jiangsu Engineering Research Center for Tumor Immunotherapy, Changzhou, China
- Institute of Cell Therapy, Soochow University, Changzhou, China
| |
Collapse
|
169
|
Xie G, Meng T, Luo Y, Liu Z. SKF-LDA: Similarity Kernel Fusion for Predicting lncRNA-Disease Association. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 18:45-55. [PMID: 31514111 PMCID: PMC6742806 DOI: 10.1016/j.omtn.2019.07.022] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Revised: 07/13/2019] [Accepted: 07/24/2019] [Indexed: 01/24/2023]
Abstract
Recently, prediction of lncRNA-disease associations has attracted more and more attentions. Various computational models have been proposed; however, there is still room to improve the prediction accuracy. In this paper, we propose a kernel fusion method with different types of similarities for the lncRNAs and diseases. The expression similarity and cosine similarity are used for lncRNAs, and the semantic similarity and cosine similarity are used for the diseases. To eliminate the noise effect, a neighbor constraint is enforced to refine all the similarity matrices before fusion. Experimental results show that the proposed similarity kernel fusion (SKF)-LDA method has the superiority performance in terms of AUC values and other measurements. In the schemes of LOOCV and 5-fold CV, AUC values of SKF-LDA achieve 0.9049 and 0.8743±0.0050 respectively. In addition, the conducted case studies of three diseases (hepatocellular carcinoma, lung cancer, and prostate cancer) show that SKF-LDA can predict related lncRNAs accurately.
Collapse
Affiliation(s)
- Guobo Xie
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Tengfei Meng
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Yu Luo
- School of Computer Science, Guangdong University of Technology, Guangzhou, China.
| | - Zhenguo Liu
- Department of Thoracic Surgery, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
170
|
Guo ZH, Yi HC, You ZH. Construction and Comprehensive Analysis of a Molecular Association Network via lncRNA-miRNA -Disease-Drug-Protein Graph. Cells 2019; 8:E866. [PMID: 31405040 PMCID: PMC6721720 DOI: 10.3390/cells8080866] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 07/20/2019] [Accepted: 07/31/2019] [Indexed: 12/14/2022] Open
Abstract
One key issue in the post-genomic era is how to systematically describe the associations between small molecule transcripts or translations inside cells. With the rapid development of high-throughput "omics" technologies, the achieved ability to detect and characterize molecules with other molecule targets opens the possibility of investigating the relationships between different molecules from a global perspective. In this article, a molecular association network (MAN) is constructed and comprehensively analyzed by integrating the associations among miRNA, lncRNA, protein, drug, and disease, in which any kind of potential associations can be predicted. More specifically, each node in MAN can be represented as a vector by combining two kinds of information including the attribute of the node itself (e.g., sequences of ncRNAs and proteins, semantics of diseases and molecular fingerprints of drugs) and the behavior of the node in the complex network (associations with other nodes). A random forest classifier is trained to classify and predict new interactions or associations between biomolecules. In the experiment, the proposed method achieved a superb performance with an area under curve (AUC) of 0.9735 under a five-fold cross-validation, which showed that the proposed method could provide new insight for exploration of the molecular mechanisms of disease and valuable clues for disease treatment.
Collapse
Affiliation(s)
- Zhen-Hao Guo
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Hai-Cheng Yi
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhu-Hong You
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.
- University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
171
|
Zheng K, You ZH, Wang L, Zhou Y, Li LP, Li ZW. MLMDA: a machine learning approach to predict and validate MicroRNA-disease associations by integrating of heterogenous information sources. J Transl Med 2019; 17:260. [PMID: 31395072 PMCID: PMC6688360 DOI: 10.1186/s12967-019-2009-x] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 07/31/2019] [Indexed: 02/01/2023] Open
Abstract
Background Emerging evidences show that microRNA (miRNA) plays an important role in many human complex diseases. However, considering the inherent time-consuming and expensive of traditional in vitro experiments, more and more attention has been paid to the development of efficient and feasible computational methods to predict the potential associations between miRNA and disease. Methods In this work, we present a machine learning-based model called MLMDA for predicting the association of miRNAs and diseases. More specifically, we first use the k-mer sparse matrix to extract miRNA sequence information, and combine it with miRNA functional similarity, disease semantic similarity and Gaussian interaction profile kernel similarity information. Then, more representative features are extracted from them through deep auto-encoder neural network (AE). Finally, the random forest classifier is used to effectively predict potential miRNA–disease associations. Results The experimental results show that the MLMDA model achieves promising performance under fivefold cross validations with AUC values of 0.9172, which is higher than the methods using different classifiers or different feature combination methods mentioned in this paper. In addition, to further evaluate the prediction performance of MLMDA model, case studies are carried out with three Human complex diseases including Lymphoma, Lung Neoplasm, and Esophageal Neoplasms. As a result, 39, 37 and 36 out of the top 40 predicted miRNAs are confirmed by other miRNA–disease association databases. Conclusions These prominent experimental results suggest that the MLMDA model could serve as a useful tool guiding the future experimental validation for those promising miRNA biomarker candidates. The source code and datasets explored in this work are available at http://220.171.34.3:81/.
Collapse
Affiliation(s)
- Kai Zheng
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China.
| | - Zhu-Hong You
- Xinjiang Technical Institutes of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, 830011, China.
| | - Lei Wang
- Xinjiang Technical Institutes of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, 830011, China. .,College of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277100, China.
| | - Yong Zhou
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China
| | - Li-Ping Li
- Xinjiang Technical Institutes of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, 830011, China
| | - Zheng-Wei Li
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China
| |
Collapse
|
172
|
Predicting human disease-associated circRNAs based on locality-constrained linear coding. Genomics 2019; 112:1335-1342. [PMID: 31394170 DOI: 10.1016/j.ygeno.2019.08.001] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 08/01/2019] [Accepted: 08/02/2019] [Indexed: 12/12/2022]
Abstract
Circular RNAs (circRNAs) are a new kind of endogenous non-coding RNAs, which have been discovered continuously. More and more studies have shown that circRNAs are related to the occurrence and development of human diseases. Identification of circRNAs associated with diseases can contribute to understand the pathogenesis, diagnosis and treatment of diseases. However, experimental methods of circRNA prediction remain expensive and time-consuming. Therefore, it is urgent to propose novel computational methods for the prediction of circRNA-disease associations. In this study, we develop a computational method called LLCDC that integrates the known circRNA-disease associations, circRNA semantic similarity network, disease semantic similarity network, reconstructed circRNA similarity network, and reconstructed disease similarity network to predict circRNAs related to human diseases. Specifically, the reconstructed similarity networks are obtained by using Locality-Constrained Linear Coding (LLC) on the known association matrix, cosine similarities of circRNAs and diseases. Then, the label propagation method is applied to the similarity networks, and four relevant score matrices are respectively obtained. Finally, we use 5-fold cross validation (5-fold CV) to evaluate the performance of LLCDC, and the AUC value of the method is 0.9177, indicating that our method performs better than the other three methods. In addition, case studies on gastric cancer, breast cancer and papillary thyroid carcinoma further verify the reliability of our method in predicting disease-associated circRNAs.
Collapse
|
173
|
Liu K, Kang M, Zhou Z, Qin W, Wang R. Bioinformatics analysis identifies hub genes and pathways in nasopharyngeal carcinoma. Oncol Lett 2019; 18:3637-3645. [PMID: 31516577 PMCID: PMC6732963 DOI: 10.3892/ol.2019.10707] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Accepted: 05/03/2019] [Indexed: 12/14/2022] Open
Abstract
The aim of the present study was to identify genes associated with and the underlying mechanisms in nasopharyngeal carcinoma (NPC) using microarray data. GSE12452 and GSE34573 gene expression profiles were obtained from the Gene Expression Omnibus (GEO) database. GEO2R was utilized to obtain differentially expressed genes (DEGs). In addition, the Database for Annotation, Visualization and Integrated Discovery was used to perform pathway enrichment analyses for DEGs using the Gene Ontology (GO) annotation along with the Kyoto Encyclopedia of Genes and Genomes (KEGG). Furthermore, Cytoscape was used to perform module analysis of the protein-protein interaction (PPI) network and pathways of the hub genes were studied. A total of 298 genes were ascertained as DEGs in the two datasets. To functionally categorize these DEGs, we obtained 82 supplemented GO terms along with 7 KEGG pathways. Subsequently, a PPI network consisting of 10 hub genes with high degrees of interaction was constructed. These hub genes included cyclin-dependent kinase (CDK) 1, structural maintenance of chromosomes (SMC) 4, kinetochore-associated (KNTC) 1, kinesin family member (KIF) 23, aurora kinase A (AURKA), ATAD (ATPase family AAA domain containing) 2, NDC80 kinetochore complex component, enhancer of zeste 2 polycomb repressive complex 2 subunit, BUB1 mitotic checkpoint serine/threonine kinase and protein regulator of cytokinesis 1. CDK1, SMC4, KNTC1, KIF23, AURKA and ATAD2 presented with high areas under the curve in receiver operator curves, suggesting that these genes may be diagnostic markers for nasopharyngeal carcinoma. In conclusion, it was proposed that CDK1, SMC4, KNTC1, KIF23, AURKA and ATAD2 may be involved in the tumorigenesis of NPC. Furthermore, they may be utilized as molecular biomarkers in early diagnosis of NPC.
Collapse
Affiliation(s)
- Kang Liu
- Department of Radiation Oncology, The First Affiliated Hospital of Guangxi Medical University, Guangxi, Nanning 530021, P.R. China
| | - Min Kang
- Department of Radiation Oncology, The First Affiliated Hospital of Guangxi Medical University, Guangxi, Nanning 530021, P.R. China
| | - Ziyan Zhou
- Department of Radiation Oncology, The First Affiliated Hospital of Guangxi Medical University, Guangxi, Nanning 530021, P.R. China
| | - Wen Qin
- Department of Radiation Oncology, The First Affiliated Hospital of Guangxi Medical University, Guangxi, Nanning 530021, P.R. China
| | - Rensheng Wang
- Department of Radiation Oncology, The First Affiliated Hospital of Guangxi Medical University, Guangxi, Nanning 530021, P.R. China
| |
Collapse
|
174
|
Inferring the Disease-Associated miRNAs Based on Network Representation Learning and Convolutional Neural Networks. Int J Mol Sci 2019; 20:ijms20153648. [PMID: 31349729 PMCID: PMC6696449 DOI: 10.3390/ijms20153648] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 07/17/2019] [Accepted: 07/18/2019] [Indexed: 02/06/2023] Open
Abstract
Identification of disease-associated miRNAs (disease miRNAs) are critical for understanding etiology and pathogenesis. Most previous methods focus on integrating similarities and associating information contained in heterogeneous miRNA-disease networks. However, these methods establish only shallow prediction models that fail to capture complex relationships among miRNA similarities, disease similarities, and miRNA-disease associations. We propose a prediction method on the basis of network representation learning and convolutional neural networks to predict disease miRNAs, called CNNMDA. CNNMDA deeply integrates the similarity information of miRNAs and diseases, miRNA-disease associations, and representations of miRNAs and diseases in low-dimensional feature space. The new framework based on deep learning was built to learn the original and global representation of a miRNA-disease pair. First, diverse biological premises about miRNAs and diseases were combined to construct the embedding layer in the left part of the framework, from a biological perspective. Second, the various connection edges in the miRNA-disease network, such as similarity and association connections, were dependent on each other. Therefore, it was necessary to learn the low-dimensional representations of the miRNA and disease nodes based on the entire network. The right part of the framework learnt the low-dimensional representation of each miRNA and disease node based on non-negative matrix factorization, and these representations were used to establish the corresponding embedding layer. Finally, the left and right embedding layers went through convolutional modules to deeply learn the complex and non-linear relationships among the similarities and associations between miRNAs and diseases. Experimental results based on cross validation indicated that CNNMDA yields superior performance compared to several state-of-the-art methods. Furthermore, case studies on lung, breast, and pancreatic neoplasms demonstrated the powerful ability of CNNMDA to discover potential disease miRNAs.
Collapse
|
175
|
Li Z, Nie R, You Z, Zhao Y, Ge X, Wang Y. LRMDA: Using Logistic Regression and Random Walk with Restart for MiRNA-Disease Association Prediction. ACTA ACUST UNITED AC 2019. [DOI: 10.1007/978-3-030-26969-2_27] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
|
176
|
Ensemble of decision tree reveals potential miRNA-disease associations. PLoS Comput Biol 2019; 15:e1007209. [PMID: 31329575 PMCID: PMC6675125 DOI: 10.1371/journal.pcbi.1007209] [Citation(s) in RCA: 149] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 08/01/2019] [Accepted: 06/24/2019] [Indexed: 12/14/2022] Open
Abstract
In recent years, increasing associations between microRNAs (miRNAs) and human diseases have been identified. Based on accumulating biological data, many computational models for potential miRNA-disease associations inference have been developed, which saves time and expenditure on experimental studies, making great contributions to researching molecular mechanism of human diseases and developing new drugs for disease treatment. In this paper, we proposed a novel computational method named Ensemble of Decision Tree based MiRNA-Disease Association prediction (EDTMDA), which innovatively built a computational framework integrating ensemble learning and dimensionality reduction. For each miRNA-disease pair, the feature vector was extracted by calculating the statistical measures, graph theoretical measures, and matrix factorization results for the miRNA and disease, respectively. Then multiple base learnings were built to yield many decision trees (DTs) based on random selection of negative samples and miRNA/disease features. Particularly, Principal Components Analysis was applied to each base learning to reduce feature dimensionality and hence remove the noise or redundancy. Average strategy was adopted for these DTs to get final association scores between miRNAs and diseases. In model performance evaluation, EDTMDA showed AUC of 0.9309 in global leave-one-out cross validation (LOOCV) and AUC of 0.8524 in local LOOCV. Additionally, AUC of 0.9192+/-0.0009 in 5-fold cross validation proved the model's reliability and stability. Furthermore, three types of case studies for four human diseases were implemented. As a result, 94% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 96% (Breast Neoplasms) and 88% (Carcinoma Hepatocellular) of top 50 predicted miRNAs were confirmed by experimental evidences in literature.
Collapse
|
177
|
Yu J, Xuan Z, Feng X, Zou Q, Wang L. A novel collaborative filtering model for LncRNA-disease association prediction based on the Naïve Bayesian classifier. BMC Bioinformatics 2019; 20:396. [PMID: 31315558 PMCID: PMC6637631 DOI: 10.1186/s12859-019-2985-0] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Accepted: 07/03/2019] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Since the number of known lncRNA-disease associations verified by biological experiments is quite limited, it has been a challenging task to uncover human disease-related lncRNAs in recent years. Moreover, considering the fact that biological experiments are very expensive and time-consuming, it is important to develop efficient computational models to discover potential lncRNA-disease associations. RESULTS In this manuscript, a novel Collaborative Filtering model called CFNBC for inferring potential lncRNA-disease associations is proposed based on Naïve Bayesian Classifier. In CFNBC, an original lncRNA-miRNA-disease tripartite network is constructed first by integrating known miRNA-lncRNA associations, miRNA-disease associations and lncRNA-disease associations, and then, an updated lncRNA-miRNA-disease tripartite network is further constructed through applying the item-based collaborative filtering algorithm on the original tripartite network. Finally, based on the updated tripartite network, a novel approach based on the Naïve Bayesian Classifier is proposed to predict potential associations between lncRNAs and diseases. The novelty of CFNBC lies in the construction of the updated lncRNA-miRNA-disease tripartite network and the introduction of the item-based collaborative filtering algorithm and Naïve Bayesian Classifier, which guarantee that CFNBC can be applied to predict potential lncRNA-disease associations efficiently without entirely relying on known miRNA-disease associations. Simulation results show that CFNBC can achieve a reliable AUC of 0.8576 in the Leave-One-Out Cross Validation (LOOCV), which is considerably better than previous state-of-the-art results. Moreover, case studies of glioma, colorectal cancer and gastric cancer demonstrate the excellent prediction performance of CFNBC as well. CONCLUSIONS According to simulation results, due to the satisfactory prediction performance, CFNBC may be an excellent addition to biomedical researches in the future.
Collapse
Affiliation(s)
- Jingwen Yu
- grid.448798.eCollege of Computer Engineering & Applied Mathematics, Changsha University, Changsha, Hunan People’s Republic of China
- 0000 0000 8633 7608grid.412982.4Key Laboratory of Intelligent Computing & Information Processing, Xiangtan University, XiangTan, People’s Republic of China
| | - Zhanwei Xuan
- grid.448798.eCollege of Computer Engineering & Applied Mathematics, Changsha University, Changsha, Hunan People’s Republic of China
- 0000 0000 8633 7608grid.412982.4Key Laboratory of Intelligent Computing & Information Processing, Xiangtan University, XiangTan, People’s Republic of China
| | - Xiang Feng
- grid.448798.eCollege of Computer Engineering & Applied Mathematics, Changsha University, Changsha, Hunan People’s Republic of China
- 0000 0000 8633 7608grid.412982.4Key Laboratory of Intelligent Computing & Information Processing, Xiangtan University, XiangTan, People’s Republic of China
| | - Quan Zou
- 0000 0004 0369 4060grid.54549.39Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, People’s Republic of China
- 0000 0004 1761 2484grid.33763.32School of Computer Science and Technology, Tianjin University, Tianjin, People’s Republic of China
| | - Lei Wang
- grid.448798.eCollege of Computer Engineering & Applied Mathematics, Changsha University, Changsha, Hunan People’s Republic of China
- 0000 0000 8633 7608grid.412982.4Key Laboratory of Intelligent Computing & Information Processing, Xiangtan University, XiangTan, People’s Republic of China
| |
Collapse
|
178
|
Cheng S, Zhang L, Tan J, Gong W, Li C, Zhang X. DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy. Comput Biol Chem 2019; 83:107088. [PMID: 31330489 DOI: 10.1016/j.compbiolchem.2019.107088] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Revised: 06/30/2019] [Accepted: 07/02/2019] [Indexed: 01/10/2023]
Abstract
ncRNA-protein interactions (ncRPIs) play an important role in a number of cellular processes, such as post-transcriptional modification, transcriptional regulation, disease progression and development. Since experimental methods are expensive and time-consuming to identify the ncRPIs, we proposed a computational method, Deep Mining ncRNA-Protein Interactions (DM-RPIs), for identifying the ncRPIs. In order to descending dimension and excavating hidden information from k-mer frequency of RNA and protein sequences, using the Deep Stacking Auto-encoders Networks (DSANs) model refined the raw data. Three common machine learning algorithms, Support Vector Machine (SVM), Random Forest (RF), and Convolution Neural Network (CNN), were separately trained as individual predictors and then the three individual predictors were integrated together using stacked ensembling strategy. Based on the RPI2241 dataset, DM-RPI obtains an accuracy of 0.851, precision of 0.852, sensitivity of 0.873, specificity of 0.826, and MCC of 0.701, which is promising and pioneering for the prediction of ncRPIs.
Collapse
Affiliation(s)
- Shuping Cheng
- College of Life Science and Bioengineering, Beijing University of Technology, Intelligent Physiological Measurement and Clinical Translation, Beijing International Base for Scientific and Technological Cooperation, Beijing, 100124, China
| | - Lu Zhang
- College of Life Science and Bioengineering, Beijing University of Technology, Intelligent Physiological Measurement and Clinical Translation, Beijing International Base for Scientific and Technological Cooperation, Beijing, 100124, China
| | - Jianjun Tan
- College of Life Science and Bioengineering, Beijing University of Technology, Intelligent Physiological Measurement and Clinical Translation, Beijing International Base for Scientific and Technological Cooperation, Beijing, 100124, China.
| | - Weikang Gong
- College of Life Science and Bioengineering, Beijing University of Technology, Intelligent Physiological Measurement and Clinical Translation, Beijing International Base for Scientific and Technological Cooperation, Beijing, 100124, China
| | - Chunhua Li
- College of Life Science and Bioengineering, Beijing University of Technology, Intelligent Physiological Measurement and Clinical Translation, Beijing International Base for Scientific and Technological Cooperation, Beijing, 100124, China
| | - Xiaoyi Zhang
- College of Life Science and Bioengineering, Beijing University of Technology, Intelligent Physiological Measurement and Clinical Translation, Beijing International Base for Scientific and Technological Cooperation, Beijing, 100124, China
| |
Collapse
|
179
|
Yan F, Zheng Y, Jia W, Hou S, Xiao R. MAMDA: Inferring microRNA-Disease associations with manifold alignment. Comput Biol Med 2019; 110:156-163. [DOI: 10.1016/j.compbiomed.2019.05.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Revised: 05/17/2019] [Accepted: 05/17/2019] [Indexed: 01/13/2023]
|
180
|
Gao YL, Cui Z, Liu JX, Wang J, Zheng CH. NPCMF: Nearest Profile-based Collaborative Matrix Factorization method for predicting miRNA-disease associations. BMC Bioinformatics 2019; 20:353. [PMID: 31234797 PMCID: PMC6591872 DOI: 10.1186/s12859-019-2956-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 06/17/2019] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Predicting meaningful miRNA-disease associations (MDAs) is costly. Therefore, an increasing number of researchers are beginning to focus on methods to predict potential MDAs. Thus, prediction methods with improved accuracy are under development. An efficient computational method is proposed to be crucial for predicting novel MDAs. For improved experimental productivity, large biological datasets are used by researchers. Although there are many effective and feasible methods to predict potential MDAs, the possibility remains that these methods are flawed. RESULTS A simple and effective method, known as Nearest Profile-based Collaborative Matrix Factorization (NPCMF), is proposed to identify novel MDAs. The nearest profile is introduced to our method to achieve the highest AUC value compared with other advanced methods. For some miRNAs and diseases without any association, we use the nearest neighbour information to complete the prediction. CONCLUSIONS To evaluate the performance of our method, five-fold cross-validation is used to calculate the AUC value. At the same time, three disease cases, gastric neoplasms, rectal neoplasms and colonic neoplasms, are used to predict novel MDAs on a gold-standard dataset. We predict the vast majority of known MDAs and some novel MDAs. Finally, the prediction accuracy of our method is determined to be better than that of other existing methods. Thus, the proposed prediction model can obtain reliable experimental results.
Collapse
Affiliation(s)
- Ying-Lian Gao
- Library of Qufu Normal University, Qufu Normal University, Rizhao, China
| | - Zhen Cui
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China
| | - Jin-Xing Liu
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China. .,Co-Innovation Center for Information Supply and Assurance Technology, Anhui University, Hefei, China.
| | - Juan Wang
- School of Information Science and Engineering, Qufu Normal University, Rizhao, China
| | - Chun-Hou Zheng
- Co-Innovation Center for Information Supply and Assurance Technology, Anhui University, Hefei, China
| |
Collapse
|
181
|
Yuan C, Zhang Y, Tu W, Guo Y. Integrated miRNA profiling and bioinformatics analyses reveal upregulated miRNAs in gastric cancer. Oncol Lett 2019; 18:1979-1988. [PMID: 31423268 DOI: 10.3892/ol.2019.10495] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2018] [Accepted: 04/15/2019] [Indexed: 12/20/2022] Open
Abstract
Gastric cancer is one of the most common malignancies in China and exhibits a poor prognosis. The most significant challenge for gastric cancer treatment is the absence of early diagnostic biomarkers. MicroRNAs (miRNAs) are small non-coding RNAs, which possess clinical value in a number of different types of cancer. The current study identified 13 miRNAs (hsa-miR-22, hsa-miR-545, hsa-let-7i, hsa-miR-15b, hsa-miR-221, hsa-miR-196a, hsa-miR-20a, hsa-miR-196b, hsa-miR-93, hsa-miR-19a, hsa-miR-503, hsa-miR-106b and hsa-miR-18a) that were significantly overexpressed in GC, by analyzing 1,000 GC samples included in four public datasets, including GSE23739, GSE78091, GSE30070 and The Cancer Genome Atlas. Furthermore, it was revealed that the expression levels of these 13 miRNAs were significantly higher in gastric cancer tissues of grades I, II and III compared with normal controls. Gene ontology analysis and Kyoto Encyclopedia of Genes and Genomes analysis demonstrated that the differentially expressed miRNAs were involved in regulating transcription, protein amino acid phosphorylation, signal transduction, protein binding, zinc ion binding, the mitogen-activated protein kinase signaling pathway and focal adhesion. In summary, the present study may provide potential new therapeutic and prognostic targets for gastric cancer.
Collapse
Affiliation(s)
- Chen Yuan
- Department of Emergency, Huashan Hospital, Fudan University, Shanghai 200040, P.R. China
| | - Yue Zhang
- Department of Gerontology, Shanghai Jiaotong University Affiliated to Sixth People's Hospital, Shanghai 201499, P.R. China
| | - Wenwen Tu
- Department of Cardiology, Jingan District Zhabei Central Hospital, Shanghai 200040, P.R. China
| | - Yusheng Guo
- Department of Emergency, Huashan Hospital, Fudan University, Shanghai 200040, P.R. China
| |
Collapse
|
182
|
Zhao Q, Yang Y, Ren G, Ge E, Fan C. Integrating Bipartite Network Projection and KATZ Measure to Identify Novel CircRNA-Disease Associations. IEEE Trans Nanobioscience 2019; 18:578-584. [PMID: 31199265 DOI: 10.1109/tnb.2019.2922214] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Accumulating biological experiments have shown that circRNAs are closely related to the occurrence and development of many complex human diseases. During recent years, the associations of circRNA with disease have caused more and more researchers to pay attention and to analyze their correlation mechanisms. However, experimental methods for determining the associations of circRNA with a particular disease are still expensive, difficult, and time consuming. Moreover, the available databases related to circRNA-disease correlations have only recently been updated, and only a few computational methods are constructed to predict potential circRNA-disease correlations. Taking into account the limitations of experimental studies, we develop a novel computational method, named IBNPKATZ, for predicting potential circRNA-disease associations, which integrates the bipartite network projection algorithm and KATZ measure. This model is based on the known circRNA-disease associations, combining circRNA similarity and disease similarity. Specifically, the circRNA similarity is derived from the average of the semantic similarity and the Gaussian interaction profile (GIP) kernel similarity of circRNA. Similarly, disease similarity is the mean of the semantic similarity and the GIP kernel similarity of disease. Furthermore, it is semi-supervised and does not require negative samples. Finally, IBNPKATZ achieves reliable AUC of 0.9352 in the leave-one-out cross validation, and case studies show that the circRNA-disease correlations predicted by our method can be successfully demonstrated by relevant experiments. The IBNPKATZ is expected to be a useful biomedical research tool for predicting potential circRNA-disease associations.
Collapse
|
183
|
Yu C, Chen F, Jiang J, Zhang H, Zhou M. Screening key genes and signaling pathways in colorectal cancer by integrated bioinformatics analysis. Mol Med Rep 2019; 20:1259-1269. [PMID: 31173250 PMCID: PMC6625394 DOI: 10.3892/mmr.2019.10336] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Accepted: 04/24/2019] [Indexed: 01/14/2023] Open
Abstract
The aim of the present study was to identify potential key genes associated with the progression and prognosis of colorectal cancer (CRC). Differentially expressed genes (DEGs) between CRC and normal samples were screened by integrated analysis of gene expression profile datasets, including the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis was conducted to identify the biological role of DEGs. In addition, a protein‑protein interaction network and survival analysis were used to identify the key genes. The profiles of GSE9348, GSE22598 and GSE113513 were downloaded from the GEO database. A total of 405 common DEGs were identified, including 236 down‑ and 169 upregulated. GO analysis revealed that the downregulated DEGs were mainly enriched in 'detoxification of copper ion' [biological process, (BP)], 'oxidoreductase activity, acting on CH‑OH group of donors, NAD or NADP as acceptor' [molecular function, (MF)] and 'brush border' [cellular component, (CC)]. Upregulated DEGs were mainly involved in 'nuclear division' (BP), 'snoRNA binding' (MF) and 'nucleolar part' (CC). KEGG pathway analysis revealed that DEGs were mainly involved in 'mineral absorption', 'nitrogen metabolism', 'cell cycle' and 'caffeine metabolism'. A PPI network was constructed with 268 nodes and 1,027 edges. The top one module was selected, and it was revealed that module‑related genes were mainly enriched in the GO terms 'sister chromatid segregation' (BP), 'chemokine activity' (MF), and 'condensed chromosome (CC)'. The KEGG pathway was mainly enriched in 'cell cycle', 'progesterone‑mediated oocyte maturation', 'chemokine signaling pathway', 'IL‑17 signaling pathway', 'legionellosis', and 'rheumatoid arthritis'. DNA topoisomerase II‑α (TOP2A), mitotic arrest deficient 2 like 1 (MAD2L1), cyclin B1 (CCNB1), checkpoint kinase 1 (CHEK1), cell division cycle 6 (CDC6) and ubiquitin conjugating enzyme E2 C (UBE2C) were indicated as hub genes. Furthermore, survival analysis revealed that TOP2A, MAD2L1, CDC6 and CHEK1 may serve as prognostic biomarkers in CRC. The present study provided insights into the molecular mechanism of CRC that may be useful in further investigations.
Collapse
Affiliation(s)
- Chang Yu
- Department of Radiation Medicine, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, Guangdong 510515, P.R. China
| | - Fuqiang Chen
- Department of Radiation Medicine, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, Guangdong 510515, P.R. China
| | - Jianjun Jiang
- Department of Thoracic Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong 510515, P.R. China
| | - Hong Zhang
- The First Affiliated Hospital, University of Science and Technology of China, Hefei, Anhui 230026, P.R. China
| | - Meijuan Zhou
- Department of Radiation Medicine, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, Guangdong 510515, P.R. China
| |
Collapse
|
184
|
Yin J, Chen X, Wang CC, Zhao Y, Sun YZ. Prediction of Small Molecule–MicroRNA Associations by Sparse Learning and Heterogeneous Graph Inference. Mol Pharm 2019; 16:3157-3166. [DOI: 10.1021/acs.molpharmaceut.9b00384] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| |
Collapse
|
185
|
A Novel Method for Predicting Disease-Associated LncRNA-MiRNA Pairs Based on the Higher-Order Orthogonal Iteration. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019; 2019:7614850. [PMID: 31191710 PMCID: PMC6525924 DOI: 10.1155/2019/7614850] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 01/25/2019] [Accepted: 02/10/2019] [Indexed: 12/30/2022]
Abstract
A lot of research studies have shown that many complex human diseases are associated not only with microRNAs (miRNAs) but also with long noncoding RNAs (lncRNAs). However, most of the current existing studies focus on the prediction of disease-related miRNAs or lncRNAs, and to our knowledge, until now, there are few literature studies reported to pay attention to the study of impact of miRNA-lncRNA pairs on diseases, although more and more studies have shown that both lncRNAs and miRNAs play important roles in cell proliferation and differentiation during the recent years. The identification of disease-related genes provides great insight into the underlying pathogenesis of diseases at a system level. In this study, a novel model called PADLMHOOI was proposed to predict potential associations between diseases and lncRNA-miRNA pairs based on the higher-order orthogonal iteration, and in order to evaluate its prediction performance, the global and local LOOCV were implemented, respectively, and simulation results demonstrated that PADLMHOOI could achieve reliable AUCs of 0.9545 and 0.8874 in global and local LOOCV separately. Moreover, case studies further demonstrated the effectiveness of PADLMHOOI to infer unknown disease-related lncRNA-miRNA pairs.
Collapse
|
186
|
Tang C, Zhou H, Zheng X, Zhang Y, Sha X. Dual Laplacian regularized matrix completion for microRNA-disease associations prediction. RNA Biol 2019; 16:601-611. [PMID: 30676207 PMCID: PMC6546388 DOI: 10.1080/15476286.2019.1570811] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Revised: 11/30/2018] [Accepted: 01/03/2019] [Indexed: 01/21/2023] Open
Abstract
Since lots of miRNA-disease associations have been verified, it is meaningful to discover more miRNA-disease associations for serving disease diagnosis and prevention of human complex diseases. However, it is not practical to identify potential associations using traditional biological experimental methods since the process is expensive and time consuming. Therefore, it is necessary to develop efficient computational methods to accomplish this task. In this work, we introduced a matrix completion model with dual Laplacian regularization (DLRMC) to infer unknown miRNA-disease associations in heterogeneous omics data. Specifically, DLRMC transformed the task of miRNA-disease association prediction into a matrix completion problem, in which the potential missing entries of the miRNA-disease association matrix were calculated, the missing association can be obtained based on the prediction scores after the completion procedure. Meanwhile, the miRNA functional similarity and the disease semantic similarity were fully exploited to serve the miRNA-disease association matrix completion by using a dual Laplacian regularization term. In the experiments, we conducted global and local Leave-One-Out Cross Validation (LOOCV) and case studies to evaluate the efficacy of DLRMC on the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database. As a result, the AUCs of DLRMC is 0.9174 and 0.8289 in global LOOCV and local LOOCV, respectively, which significantly outperform a variety of previous methods. In addition, in the case studies on four significant diseases related to human health including Colon Neoplasms, Kidney neoplasms, Lymphoma and Prostate neoplasms, 90%, 92%, 92% and 94% out of the top 50 predicted miRNAs has been confirmed, respectively.
Collapse
Affiliation(s)
- Chang Tang
- School of Computer Science, China University of Geosciences, Wuhan, China
| | - Hua Zhou
- Department of Hematology, The Affiliated Huai’an Hospital of Xuzhou Medical University, Huai’an, China
| | - Xiao Zheng
- Wuhan University of Technology Hospital, Wuhan University of Technology, Wuhan, China
| | - Yanming Zhang
- Department of Hematology, The Affiliated Huai’an Hospital of Xuzhou Medical University, Huai’an, China
| | - Xiaofeng Sha
- Department of Oncology, Huai’an Hongze District People’s Hospital, Huai’an, China
| |
Collapse
|
187
|
Shi P, Chen C, Li X, Wei Z, Liu Z, Liu Y. MicroRNA‑124 suppresses cell proliferation and invasion of triple negative breast cancer cells by targeting STAT3. Mol Med Rep 2019; 19:3667-3675. [PMID: 30896795 PMCID: PMC6472193 DOI: 10.3892/mmr.2019.10044] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 03/06/2019] [Indexed: 12/21/2022] Open
Abstract
MicroRNAs (miRNAs) are pivotal regulators of the progression of carcinogenesis and negatively regulate the expression of tumour‑associated genes. Downregulation of miR‑124 expression has been demonstrated in various human cancer tissues, wherein miR‑124 serves as a tumour suppressor by targeting oncogenes. However, its function and underlying mechanism of action remain unclear in breast cancer. In the present study, the tissue‑specific expression of miR‑124 was detected in 10 paired triple‑negative breast cancer and normal tissues, and its inhibitory effects on cell growth and invasion were evaluated in vitro and in vivo. Bioinformatics analysis identified signal transducer and activator of transcription 3 (STAT3), a well‑known oncogene in breast cancer, as the potential target. Upregulation of miR‑124 expression decreased STAT3 mRNA and protein levels in breast cancer cells and the relative luciferase activity. Rescue experiments revealed that the transfection of a STAT3 expression plasmid reversed the inhibitory effect of miR‑124 on the proliferation and invasion of MDA‑MB‑468 cells. These data demonstrate that miR‑124 serves vital roles in the suppression of triple‑negative breast cancer via inhibition of cell proliferation and invasion through STAT3. These results highlight the potential role of miR‑124 as a diagnostic or therapeutic target in patients with breast cancer.
Collapse
Affiliation(s)
- Pengfei Shi
- Department of Thyroid and Breast Surgery, The Central Hospital of Wuhan, Wuhan, Hubei 430000, P.R. China
| | - Cheng Chen
- Department of General Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430000, P.R. China
| | - Xiang Li
- Department of General Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430000, P.R. China
| | - Zhanjie Wei
- Department of Thyroid and Breast Surgery, The Central Hospital of Wuhan, Wuhan, Hubei 430000, P.R. China
| | - Zhimin Liu
- Department of Thyroid and Breast Surgery, The Central Hospital of Wuhan, Wuhan, Hubei 430000, P.R. China
| | - Yongjun Liu
- Department of Thyroid and Breast Surgery, The Central Hospital of Wuhan, Wuhan, Hubei 430000, P.R. China
| |
Collapse
|
188
|
Shamsizadeh S, Goliaei S, Razaghi Moghadam Z. CAMIRADA: Cancer microRNA association discovery algorithm, a case study on breast cancer. J Biomed Inform 2019; 94:103180. [PMID: 31039404 DOI: 10.1016/j.jbi.2019.103180] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 04/04/2019] [Accepted: 04/17/2019] [Indexed: 12/18/2022]
Abstract
In recent studies, non-coding protein RNAs have been identified as microRNA that can be used as biomarkers for early diagnosis and treatment of cancer, that decrease mortality in cancer. A microRNA may target hundreds or thousands of genes and a gene may regulate several microRNAs, so determining which microRNA is associated with which cancer is a big challenge. Many computational methods have been performed to detect micoRNAs association with cancer, but more effort is needed with higher accuracy. Increasing research has shown that relationship between microRNAs and TFs play a significant role in the diagnosis of cancer. Therefore, we developed a new computational framework (CAMIRADA) to identify cancer-related microRNAs based on the relationship between microRNAs and disease genes (DG) in the protein network, the functional relationships between microRNAs and Transcription Factors (TF) on the co-expression network, and the relationship between microRNAs and the Differential Expression Gene (DEG) on co-expression network. The CAMIRADA was applied to assess breast cancer data from two HMDD and miR2Disease databases. In this study, the AUC for the 65 microRNAs of the top of the list was 0.95, which was more accurate than the similar methods used to detect microRNAs associated with the cancer artery.
Collapse
Affiliation(s)
- Sepideh Shamsizadeh
- Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
| | - Sama Goliaei
- Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
| | - Zahra Razaghi Moghadam
- Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran; Max Planck Institute of Molecular Plant Physiology, Posdam, Germany.
| |
Collapse
|
189
|
Chen M, Zhang Y, Li A, Li Z, Liu W, Chen Z. Bipartite Heterogeneous Network Method Based on Co-neighbor for MiRNA-Disease Association Prediction. Front Genet 2019; 10:385. [PMID: 31080459 PMCID: PMC6497741 DOI: 10.3389/fgene.2019.00385] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Accepted: 04/10/2019] [Indexed: 12/22/2022] Open
Abstract
In recent years, miRNA variation and dysregulation have been found to be closely related to human tumors, and identifying miRNA-disease associations is helpful for understanding the mechanisms of disease or tumor development and is greatly significant for the prognosis, diagnosis, and treatment of human diseases. This article proposes a Bipartite Heterogeneous network link prediction method based on co-neighbor to predict miRNA-disease association (BHCN). According to the structural characteristics of the bipartite network, the concept of bipartite network co-neighbors is proposed, and the co-neighbors were used to represent the probability of association between disease and miRNA. To predict the isolated diseases and the new miRNA based on the association probability expressed by co-neighbors, we utilized the similarity between disease nodes and the similarity between miRNA nodes in heterogeneous networks to represent the association probability between disease and miRNA. The model's predictive performance was evaluated by the leave-one-out cross validation (LOOCV) on different datasets. The AUC value of BHCN on the gold benchmark dataset was 0.7973, and the AUC obtained on the prediction dataset was 0.9349, which was better than that of the classic global algorithm. In this case study, we conducted predictive studies on breast neoplasms and colon neoplasms. Most of the top 50 predicted results were confirmed by three databases, namely, HMDD, miR2disease, and dbDEMC, with accuracy rates of 96 and 82%. In addition, BHCN can be used for predicting isolated diseases (without any known associated diseases) and new miRNAs (without any known associated miRNAs). In the isolated disease case study, the top 50 of breast neoplasm and colon neoplasm potentials associated with miRNAs predicted an accuracy of 100 and 96%, respectively, thereby demonstrating the favorable predictive power of BHCN for potentially relevant miRNAs.
Collapse
Affiliation(s)
- Min Chen
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China
| | - Yi Zhang
- School of Information Science and Engineering, Guilin University of Technology, Guilin, China
| | - Ang Li
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China
| | - Zejun Li
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China
| | - Wenhua Liu
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China
| | - Zheng Chen
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China
| |
Collapse
|
190
|
Ai D, Pan H, Li X, Gao Y, Liu G, Xia LC. Identifying Gut Microbiota Associated With Colorectal Cancer Using a Zero-Inflated Lognormal Model. Front Microbiol 2019; 10:826. [PMID: 31068913 PMCID: PMC6491826 DOI: 10.3389/fmicb.2019.00826] [Citation(s) in RCA: 117] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 04/01/2019] [Indexed: 12/26/2022] Open
Abstract
Colorectal cancer (CRC) is the third most common cancer worldwide. Its incidence is still increasing, and the mortality rate is high. New therapeutic and prognostic strategies are urgently needed. It became increasingly recognized that the gut microbiota composition differs significantly between healthy people and CRC patients. Thus, identifying the difference between gut microbiota of the healthy people and CRC patients is fundamental to understand these microbes' functional roles in the development of CRC. We studied the microbial community structure of a CRC metagenomic dataset of 156 patients and healthy controls, and analyzed the diversity, differentially abundant bacteria, and co-occurrence networks. We applied a modified zero-inflated lognormal (ZIL) model for estimating the relative abundance. We found that the abundance of genera: Anaerostipes, Bilophila, Catenibacterium, Coprococcus, Desulfovibrio, Flavonifractor, Porphyromonas, Pseudoflavonifractor, and Weissella was significantly different between the healthy and CRC groups. We also found that bacteria such as Streptococcus, Parvimonas, Collinsella, and Citrobacter were uniquely co-occurring within the CRC patients. In addition, we found that the microbial diversity of healthy controls is significantly higher than that of the CRC patients, which indicated a significant negative correlation between gut microbiota diversity and the stage of CRC. Collectively, our results strengthened the view that individual microbes as well as the overall structure of gut microbiota were co-evolving with CRC.
Collapse
Affiliation(s)
- Dongmei Ai
- Basic Experimental of Natural Science, University of Science and Technology Beijing, Beijing, China
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, China
| | - Hongfei Pan
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, China
| | - Xiaoxin Li
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, China
| | - Yingxin Gao
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, China
| | - Gang Liu
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, China
| | - Li C Xia
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, United States
| |
Collapse
|
191
|
Qu K, Guo F, Liu X, Lin Y, Zou Q. Application of Machine Learning in Microbiology. Front Microbiol 2019; 10:827. [PMID: 31057526 PMCID: PMC6482238 DOI: 10.3389/fmicb.2019.00827] [Citation(s) in RCA: 108] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 04/01/2019] [Indexed: 02/01/2023] Open
Abstract
Microorganisms are ubiquitous and closely related to people's daily lives. Since they were first discovered in the 19th century, researchers have shown great interest in microorganisms. People studied microorganisms through cultivation, but this method is expensive and time consuming. However, the cultivation method cannot keep a pace with the development of high-throughput sequencing technology. To deal with this problem, machine learning (ML) methods have been widely applied to the field of microbiology. Literature reviews have shown that ML can be used in many aspects of microbiology research, especially classification problems, and for exploring the interaction between microorganisms and the surrounding environment. In this study, we summarize the application of ML in microbiology.
Collapse
Affiliation(s)
- Kaiyang Qu
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Fei Guo
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Xiangrong Liu
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - Yuan Lin
- School of Information Science and Technology, Xiamen University, Xiamen, China
- Department of System Integration, Sparebanken Vest, Bergen, Norway
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
192
|
Zeng X, Wang W, Deng G, Bing J, Zou Q. Prediction of Potential Disease-Associated MicroRNAs by Using Neural Networks. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 16:566-575. [PMID: 31077936 PMCID: PMC6510966 DOI: 10.1016/j.omtn.2019.04.010] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Revised: 04/11/2019] [Accepted: 04/11/2019] [Indexed: 12/13/2022]
Abstract
Identifying disease-related microRNAs (miRNAs) is an essential but challenging task in bioinformatics research. Much effort has been devoted to discovering the underlying associations between miRNAs and diseases. However, most studies mainly focus on designing advanced methods to improve prediction accuracy while neglecting to investigate the link predictability of the relationships between miRNAs and diseases. In this work, we construct a heterogeneous network by integrating neighborhood information in the neural network to predict potential associations between miRNAs and diseases, which also consider the imbalance of datasets. We also employ a new computational method called a neural network model for miRNA-disease association prediction (NNMDA). This model predicts miRNA-disease associations by integrating multiple biological data resources. Comparison of our work with other algorithms reveals the reliable performance of NNMDA. Its average AUC score was 0.937 over 15 diseases in a 5-fold cross-validation and AUC of 0.8439 based on leave-one-out cross-validation. The results indicate that NNMDA could be used in evaluating the accuracy of miRNA-disease associations. Moreover, NNMDA was applied to two common human diseases in two types of case studies. In the first type, 26 out of the top 30 predicted miRNAs of lung neoplasms were confirmed by the experiments. In the second type of case study for new diseases without any known miRNAs related to it, we selected breast neoplasms as the test example by hiding the association information between the miRNAs and this disease. The results verified 50 out of the top 50 predicted breast-neoplasm-related miRNAs.
Collapse
Affiliation(s)
- Xiangxiang Zeng
- Shenzhen Research Institute of Xiamen University, Xiamen University, Shenzhen 518000, Guangdong, China; Department of Information Science and Technology, Xiamen University, Xiamen 361005, Fujian, China
| | - Wen Wang
- Shenzhen Research Institute of Xiamen University, Xiamen University, Shenzhen 518000, Guangdong, China
| | - Gaoshan Deng
- Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Jiaxin Bing
- Shenzhen Research Institute of Xiamen University, Xiamen University, Shenzhen 518000, Guangdong, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610000, China; Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610000, China.
| |
Collapse
|
193
|
Xie G, Wu C, Sun Y, Fan Z, Liu J. LPI-IBNRA: Long Non-coding RNA-Protein Interaction Prediction Based on Improved Bipartite Network Recommender Algorithm. Front Genet 2019; 10:343. [PMID: 31057602 PMCID: PMC6482170 DOI: 10.3389/fgene.2019.00343] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Accepted: 03/29/2019] [Indexed: 12/26/2022] Open
Abstract
According to the latest research, lncRNAs (long non-coding RNAs) play a broad and important role in various biological processes by interacting with proteins. However, identifying whether proteins interact with a specific lncRNA through biological experimental methods is difficult, costly, and time-consuming. Thus, many bioinformatics computational methods have been proposed to predict lncRNA-protein interactions. In this paper, we proposed a novel approach called Long non-coding RNA-Protein Interaction Prediction based on Improved Bipartite Network Recommender Algorithm (LPI-IBNRA). In the proposed method, we implemented a two-round resource allocation and eliminated the second-order correlations appropriately on the bipartite network. Experimental results illustrate that LPI-IBNRA outperforms five previous methods, with the AUC values of 0.8932 in leave-one-out cross validation (LOOCV) and 0.8819 ± 0.0052 in 10-fold cross validation, respectively. In addition, case studies on four lncRNAs were carried out to show the predictive power of LPI-IBNRA.
Collapse
Affiliation(s)
- Guobo Xie
- School of Computers, Guangdong University of Technology, Guangzhou, China
| | - Cuiming Wu
- School of Computers, Guangdong University of Technology, Guangzhou, China
| | - Yuping Sun
- School of Computers, Guangdong University of Technology, Guangzhou, China
| | - Zhiliang Fan
- School of Computers, Guangdong University of Technology, Guangzhou, China
| | - Jianghui Liu
- Department of Emergency, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
194
|
Li H, Wang Y, Jiang J, Zhao H, Feng X, Zhao B, Wang L. A Novel Human Microbe-Disease Association Prediction Method Based on the Bidirectional Weighted Network. Front Microbiol 2019; 10:676. [PMID: 31024478 PMCID: PMC6465552 DOI: 10.3389/fmicb.2019.00676] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 03/18/2019] [Indexed: 12/12/2022] Open
Abstract
The survival of human beings is inseparable from microbes. More and more studies have proved that microbes can affect human physiological processes in various aspects and are closely related to some human diseases. In this paper, based on known microbe-disease associations, a bidirectional weighted network was constructed by integrating the schemes of normalized Gaussian interactions and bidirectional recommendations firstly. And then, based on the newly constructed bidirectional network, a computational model called BWNMHMDA was developed to predict potential relationships between microbes and diseases. Finally, in order to evaluate the superiority of the new prediction model BWNMHMDA, the framework of LOOCV and 5-fold cross validation were implemented, and simulation results indicated that BWNMHMDA could achieve reliable AUCs of 0.9127 and 0.8967 ± 0.0027 in these two different frameworks respectively, which is outperformed some state-of-the-art methods. Moreover, case studies of asthma, colorectal carcinoma, and chronic obstructive pulmonary disease were implemented to further estimate the performance of BWNMHMDA. Experimental results showed that there are 10, 9, and 8 out of the top 10 predicted microbes having been confirmed by related literature in these three kinds of case studies separately, which also demonstrated that our new model BWNMHMDA could achieve satisfying prediction performance.
Collapse
Affiliation(s)
- Hao Li
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Yuqi Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Jingwu Jiang
- Clinical Lab, Yongcheng People's Hospital, Shangqiu, China
| | - Haochen Zhao
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Xiang Feng
- College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| | - Bihai Zhao
- College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| | - Lei Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
- College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| |
Collapse
|
195
|
Wang L, Wang Y, Li H, Feng X, Yuan D, Yang J. A Bidirectional Label Propagation Based Computational Model for Potential Microbe-Disease Association Prediction. Front Microbiol 2019; 10:684. [PMID: 31024481 PMCID: PMC6465563 DOI: 10.3389/fmicb.2019.00684] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 03/19/2019] [Indexed: 12/12/2022] Open
Abstract
A growing number of clinical observations have indicated that microbes are involved in a variety of important human diseases. It is obvious that in-depth investigation of correlations between microbes and diseases will benefit the prevention, early diagnosis, and prognosis of diseases greatly. Hence, in this paper, based on known microbe-disease associations, a prediction model called NBLPIHMDA was proposed to infer potential microbe-disease associations. Specifically, two kinds of networks including the disease similarity network and the microbe similarity network were first constructed based on the Gaussian interaction profile kernel similarity. The bidirectional label propagation was then applied on these two kinds of networks to predict potential microbe-disease associations. We applied NBLPIHMDA on Human Microbe-Disease Association database (HMDAD), and compared it with 3 other recent published methods including LRLSHMDA, BiRWMP, and KATZHMDA based on the leave-one-out cross validation and 5-fold cross validation, respectively. As a result, the area under the receiver operating characteristic curves (AUCs) achieved by NBLPIHMDA were 0.8777 and 0.8958 ± 0.0027, respectively, outperforming the compared methods. In addition, in case studies of asthma, colorectal carcinoma, and Chronic obstructive pulmonary disease, simulation results illustrated that there are 10, 10, and 8 out of the top 10 predicted microbes having been confirmed by published documentary evidences, which further demonstrated that NBLPIHMDA is promising in predicting novel associations between diseases and microbes as well.
Collapse
Affiliation(s)
- Lei Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Yuqi Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Hao Li
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Xiang Feng
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Dawei Yuan
- Geneis Beijing Co., Ltd., Beijing, China
| | | |
Collapse
|
196
|
Yerukala Sathipati S, Sahu D, Huang HC, Lin Y, Ho SY. Identification and characterization of the lncRNA signature associated with overall survival in patients with neuroblastoma. Sci Rep 2019; 9:5125. [PMID: 30914706 PMCID: PMC6435792 DOI: 10.1038/s41598-019-41553-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 03/05/2019] [Indexed: 01/16/2023] Open
Abstract
Neuroblastoma (NB) is a commonly occurring cancer among infants and young children. Recently, long non-coding RNAs (lncRNAs) have been using as prognostic biomarkers for therapeutics and interventions in various cancers. Considering the poor survival of NB, the lncRNA-based therapeutic strategies must be improved. This work proposes an overall survival time estimator called SVR-NB to identify the lncRNA signature that is associated with the overall survival of patients with NB. SVR-NB is an optimized support vector regression (SVR)-based method that uses an inheritable bi-objective combinatorial genetic algorithm for feature selection. The dataset of 231 NB patients that contains overall survival information and expression profiles of 783 lncRNAs was used to design and evaluate SVR-NB from the database of gene expression omnibus accession GSE62564. SVR-NB identified a signature of 35 lncRNAs and achieved a mean squared correlation coefficient of 0.85 and a mean absolute error of 0.56 year between the actual and estimated overall survival time using 10-fold cross-validation. Further, we ranked and characterized the 35 lncRNAs according to their contribution towards the estimation accuracy. Functional annotations and co-expression gene analysis of LOC440896, LINC00632, and IGF2-AS revealed the association of co-expressed genes in Kyoto Encyclopedia of Genes and Genomes pathways.
Collapse
Grants
- This work was funded by Ministry of Science and Technology ROC under the contract numbers MOST 106-2634-F-075-001-, 106-2218-E-009-031-, 107-2221-E-009-154-, 107-2218-E-029-001-, and 107-2314-B-039-025-. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
- This work was funded by Ministry of Science and Technology ROC under the contract numbers MOST 107-2221-E-009 -154 –, 107-2634-F-075 -001 –, 107-2218-E-009 -005 –, 107-2218-E-029 -001 –, and 107-2319-B-400 -001 –, and was financially supported by the “Center for Intelligent Drug Systems and Smart Bio-devices (IDS2B)” from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Collapse
Affiliation(s)
| | - Divya Sahu
- Institute of Biomedical Informatics, Center for Systems and Synthetic Biology, National Yang-Ming University, Taipei, Taiwan
| | - Hsuan-Cheng Huang
- Institute of Biomedical Informatics, Center for Systems and Synthetic Biology, National Yang-Ming University, Taipei, Taiwan
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Yenching Lin
- Interdisciplinary Neuroscience Ph.D. Program, National Chiao Tung University, Hsinchu, Taiwan
| | - Shinn-Ying Ho
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan.
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan.
- Interdisciplinary Neuroscience Ph.D. Program, National Chiao Tung University, Hsinchu, Taiwan.
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan.
- Center For Intelligent Drug Systems and Smart Bio-devices (IDSB), National Chiao Tung University, Hsinchu, Taiwan.
| |
Collapse
|
197
|
Ru X, Li L, Wang C. Identification of Phage Viral Proteins With Hybrid Sequence Features. Front Microbiol 2019; 10:507. [PMID: 30972038 PMCID: PMC6443926 DOI: 10.3389/fmicb.2019.00507] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2018] [Accepted: 02/27/2019] [Indexed: 02/01/2023] Open
Abstract
The uniqueness of bacteriophages plays an important role in bioinformatics research. In real applications, the function of the bacteriophage virion proteins is the main area of interest. Therefore, it is very important to classify bacteriophage virion proteins and non-phage virion proteins accurately. Extracting comprehensive and effective sequence features from proteins plays a vital role in protein classification. In order to more fully represent protein information, this paper is more comprehensive and effective by combining the features extracted by the feature information representation algorithm based on sequence information (CCPA) and the feature representation algorithm based on sequence and structure information. After extracting features, the Max-Relevance-Max-Distance (MRMD) algorithm is used to select the optimal feature set with the strongest correlation between class labels and low redundancy between features. Given the randomness of the samples selected by the random forest classification algorithm and the randomness features for producing each node variable, a random forest method is employed to perform 10-fold cross-validation on the bacteriophage protein classification. The accuracy of this model is as high as 93.5% in the classification of phage proteins in this study. This study also found that, among the eight physicochemical properties considered, the charge property has the greatest impact on the classification of bacteriophage proteins These results indicate that the model discussed in this paper is an important tool in bacteriophage protein research.
Collapse
Affiliation(s)
- Xiaoqing Ru
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
| | - Lihong Li
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
| | - Chunyu Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| |
Collapse
|
198
|
Chen SJ, Liao DL, Chen CH, Wang TY, Chen KC. Construction and Analysis of Protein-Protein Interaction Network of Heroin Use Disorder. Sci Rep 2019; 9:4980. [PMID: 30899073 PMCID: PMC6428805 DOI: 10.1038/s41598-019-41552-z] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 03/11/2019] [Indexed: 12/17/2022] Open
Abstract
Heroin use disorder (HUD) is a complex disease resulting from interactions among genetic and other factors (e.g., environmental factors). The mechanism of HUD development remains unknown. Newly developed network medicine tools provide a platform for exploring complex diseases at the system level. This study proposes that protein–protein interactions (PPIs), particularly those among proteins encoded by casual or susceptibility genes, are extremely crucial for HUD development. The giant component of our constructed PPI network comprised 111 nodes with 553 edges, including 16 proteins with large degree (k) or high betweenness centrality (BC), which were further identified as the backbone of the network. JUN with the largest degree was suggested to be central to the PPI network associated with HUD. Moreover, PCK1 with the highest BC and MAPK14 with the secondary largest degree and 9th highest BC might be involved in the development HUD and other substance diseases.
Collapse
Affiliation(s)
- Shaw-Ji Chen
- Department of Medicine, Mackay Medical College, New Taipei City, Taiwan.,Department of Psychiatry, Mackay Memorial Hospital, Taitung Branch, Taiwan
| | - Ding-Lieh Liao
- Bali Psychiatric Center, Department of Health, Executive Yuan, New Taipei, Taiwan
| | - Chia-Hsiang Chen
- Department of Psychiatry, Chang Gung Memorial Hospital at Linkou and Chang Gung University School of Medicine, Taoyuan, Taiwan
| | - Tse-Yi Wang
- Department of Medical Informatics, Tzu Chi University, Hualien, Taiwan
| | - Kuang-Chi Chen
- Department of Medical Informatics, Tzu Chi University, Hualien, Taiwan.
| |
Collapse
|
199
|
Wang CC, Chen X, Qu J, Sun YZ, Li JQ. RFSMMA: A New Computational Model to Identify and Prioritize Potential Small Molecule-MiRNA Associations. J Chem Inf Model 2019; 59:1668-1679. [PMID: 30840454 DOI: 10.1021/acs.jcim.9b00129] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
More and more studies found that many complex human diseases occur accompanied by aberrant expression of microRNAs (miRNAs). Small molecule (SM) drugs have been utilized to treat complex human diseases by affecting the expression of miRNAs. Several computational methods were proposed to infer underlying associations between SMs and miRNAs. In our study, we proposed a new calculation model of random forest based small molecule-miRNA association prediction (RFSMMA) which was based on the known SM-miRNA associations in the SM2miR database. RFSMMA utilized the similarity of SMs and miRNAs as features to represent SM-miRNA pairs and further implemented the machine learning algorithm of random forest to train training samples and obtain a prediction model. In RFSMMA, integrating multiple kinds of similarity can avoid the bias of single similarity and choosing more reliable features from original features can represent SM-miRNA pairs more accurately. We carried out cross validations to assess predictive accuracy of RFSMMA. As a result, RFSMMA acquired AUCs of 0.9854, 0.9839, 0.7052, and 0.9917 ± 0.0008 under global leave-one-out cross validation (LOOCV), miRNA-fixed local LOOCV, SM-fixed local LOOCV, and 5-fold cross validation, respectively, under data set 1. Based on data set 2, RFSMMA obtained AUCs of 0.8456, 0.8463, 0.6653, and 0.8389 ± 0.0033 under four cross validations according to the order mentioned above. In addition, we implemented a case study on three common SMs, namely, 5-fluorouracil, 17β-estradiol, and 5-aza-2'-deoxycytidine. Among the top 50 associated miRNAs of these three SMs predicted by RFSMMA, 31, 32, and 28 miRNAs were verified, respectively. Therefore, RFSMMA is shown to be an effective and reliable tool for identifying underlying SM-miRNA associations.
Collapse
Affiliation(s)
- Chun-Chun Wang
- School of Information and Control Engineering , China University of Mining and Technology , Xuzhou 221116 , China
| | - Xing Chen
- School of Information and Control Engineering , China University of Mining and Technology , Xuzhou 221116 , China
| | - Jia Qu
- School of Information and Control Engineering , China University of Mining and Technology , Xuzhou 221116 , China
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering , Shenzhen University , Shenzhen 518060 , China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering , Shenzhen University , Shenzhen 518060 , China
| |
Collapse
|
200
|
Long Noncoding RNA and Protein Interactions: From Experimental Results to Computational Models Based on Network Methods. Int J Mol Sci 2019; 20:ijms20061284. [PMID: 30875752 PMCID: PMC6471543 DOI: 10.3390/ijms20061284] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 03/09/2019] [Accepted: 03/11/2019] [Indexed: 01/13/2023] Open
Abstract
Non-coding RNAs with a length of more than 200 nucleotides are long non-coding RNAs (lncRNAs), which have gained tremendous attention in recent decades. Many studies have confirmed that lncRNAs have important influence in post-transcriptional gene regulation; for example, lncRNAs affect the stability and translation of splicing factor proteins. The mutations and malfunctions of lncRNAs are closely related to human disorders. As lncRNAs interact with a variety of proteins, predicting the interaction between lncRNAs and proteins is a significant way to depth exploration functions and enrich annotations of lncRNAs. Experimental approaches for lncRNA–protein interactions are expensive and time-consuming. Computational approaches to predict lncRNA–protein interactions can be grouped into two broad categories. The first category is based on sequence, structural information and physicochemical property. The second category is based on network method through fusing heterogeneous data to construct lncRNA related heterogeneous network. The network-based methods can capture the implicit feature information in the topological structure of related biological heterogeneous networks containing lncRNAs, which is often ignored by sequence-based methods. In this paper, we summarize and discuss the materials, interaction score calculation algorithms, advantages and disadvantages of state-of-the-art algorithms of lncRNA–protein interaction prediction based on network methods to assist researchers in selecting a suitable method for acquiring more dependable results. All the related different network data are also collected and processed in convenience of users, and are available at https://github.com/HAN-Siyu/APINet/.
Collapse
|