101
|
Qu Y, Zhang H, Lyu C, Liang C. LLCMDA: A Novel Method for Predicting miRNA Gene and Disease Relationship Based on Locality-Constrained Linear Coding. Front Genet 2018; 9:576. [PMID: 30555511 PMCID: PMC6282048 DOI: 10.3389/fgene.2018.00576] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 11/08/2018] [Indexed: 01/03/2023] Open
Abstract
MiRNAs are small non-coding regulatory RNAs which are associated with multiple diseases. Increasing evidence has shown that miRNAs play important roles in various biological and physiological processes. Therefore, the identification of potential miRNA-disease associations could provide new clues to understanding the mechanism of pathogenesis. Although many traditional methods have been successfully applied to discover part of the associations, they are in general time-consuming and expensive. Consequently, computational-based methods are urgently needed to predict the potential miRNA-disease associations in a more efficient and resources-saving way. In this paper, we propose a novel method to predict miRNA-disease associations based on Locality-constrained Linear Coding (LLC). Specifically, we first reconstruct similarity networks for both miRNAs and diseases using LLC and then apply label propagation on the similarity networks to get relevant scores. To comprehensively verify the performance of the proposed method, we compare our method with several state-of-the-art methods under different evaluation metrics. Moreover, two types of case studies conducted on two common diseases further demonstrate the validity and utility of our method. Extensive experimental results indicate that our method can effectively predict potential associations between miRNAs and diseases.
Collapse
Affiliation(s)
- Yu Qu
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Huaxiang Zhang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Chen Lyu
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Cheng Liang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| |
Collapse
|
102
|
Lan W, Wang J, Li M, Liu J, Wu FX, Pan Y. Predicting MicroRNA-Disease Associations Based on Improved MicroRNA and Disease Similarities. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1774-1782. [PMID: 27392365 DOI: 10.1109/tcbb.2016.2586190] [Citation(s) in RCA: 86] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
MicroRNAs (miRNAs) are a type of non-coding RNAs with about ∼22nt nucleotides. Increasing evidences have shown that miRNAs play critical roles in many human diseases. The identification of human disease-related miRNAs is helpful to explore the underlying pathogenesis of diseases. More and more experimental validated associations between miRNAs and diseases have been reported in the recent studies, which provide useful information for new miRNA-disease association discovery. In this study, we propose a computational framework, KBMF-MDI, to predict the associations between miRNAs and diseases based on their similarities. The sequence and function information of miRNAs are used to measure similarity among miRNAs while the semantic and function information of disease are used to measure similarity among diseases, respectively. In addition, the kernelized Bayesian matrix factorization method is employed to infer potential miRNA-disease associations by integrating these data sources. We applied this method to 6,084 known miRNA-disease associations and utilized 5-fold cross validation to evaluate the performance. The experimental results demonstrate that our method can effectively predict unknown miRNA-disease associations.
Collapse
|
103
|
Prediction of potential disease-associated microRNAs by composite network based inference. Sci Rep 2018; 8:15813. [PMID: 30361693 PMCID: PMC6202421 DOI: 10.1038/s41598-018-34180-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Accepted: 10/12/2018] [Indexed: 01/05/2023] Open
Abstract
MicroRNAs (miRNAs) act a significant role in multiple biological processes and their associations with the development of all kinds of complex diseases are much close. In the research area of biology, medicine, and bioinformatics, prediction of potential miRNA-disease associations (MDAs) on the base of a variety of heterogeneous biological datasets in a short time is an important subject. Therefore, we proposed the model of Composite Network based inference for MiRNA-Disease Association prediction (CNMDA) through applying random walk to a multi-level composite network constructed by heterogeneous dataset of disease, long noncoding RNA (lncRNA) and miRNA. The results showed that CNMDA achieved an AUC of 0.8547 in leave-one-out cross validation and an AUC of 0.8533+/−0.0009 in 5-fold cross validation. In addition, we employed CNMDA to infer novel miRNAs for kidney neoplasms, breast neoplasms and lung neoplasms on the base of HMDD v2.0. Also, we employed the approach for lung neoplasms on the base of HMDD v1.0 and for breast neoplasms that have no known related miRNAs. It was found that CNMDA could be seen as an applicable tool for potential MDAs prediction.
Collapse
|
104
|
Niu YW, Liu H, Wang GH, Yan GY. Maximal entropy random walk on heterogenous network for MIRNA-disease Association prediction. Math Biosci 2018; 306:1-9. [PMID: 30336146 DOI: 10.1016/j.mbs.2018.10.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Revised: 10/08/2018] [Accepted: 10/13/2018] [Indexed: 12/24/2022]
Abstract
The last few decades have verified the vital roles of microRNAs in the development of human diseases and witnessed the increasing interest in the prediction of potential disease-miRNA associations. Owning to the open access of many miRNA-related databases, up until recently, kinds of feasible in silico models have been proposed. In this work, we developed a computational model of Maximal Entropy Random Walk on heterogenous network for MiRNA-disease Association prediction (MERWMDA). MERWMDA integrated known disease-miRNA association, pair-wise functional relation of miRNAs and pair-wise semantic relation of diseases into a heterogenous network comprised of disease and miRNA nodes full of information. As a kind of widely-applied biased walk process with more randomness, MERW was then implemented on the heterogenous network to reveal potential disease-miRNA associations. Cross validation was further performed to evaluate the performance of MERWMDA. As a result, MERWMDA obtained AUCs of 0.8966 and 0.8491 respectively in the aspect of global and local leave-one-out cross validation. What' more, three different case study strategies on four human complex diseases were conducted to comprehensively assess the quality of the model. Specifically, one kind of case study on Esophageal cancer and Prostate cancer were conducted based on HMDD v2.0 database. 94% and 88% out of the top 50 ranked miRNAs were confirmed by recent literature, respectively. To simulate new disease without known related miRNAs, Lung cancer (confirmed ratio 94%) associated miRNAs were removed for case study. Lymphoma (verified ratio 88%) was adopted to assess the prediction robustness of MERWMDA based on HMDD v1.0 database. We anticipated that MERWMDA could offer valuable candidates for in vitro biomedical experiments in future.
Collapse
Affiliation(s)
- Ya-Wei Niu
- School of Mathematics, Shandong University, Jinan 250100, China
| | - Hua Liu
- School of Mathematics, Shandong University, Jinan 250100, China
| | - Guang-Hui Wang
- School of Mathematics, Shandong University, Jinan 250100, China.
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
| |
Collapse
|
105
|
Mishra S, Shah MI, Sarkar M, Asati N, Rout C. ILDgenDB: integrated genetic knowledge resource for interstitial lung diseases (ILDs). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:5035482. [PMID: 29897484 PMCID: PMC6007225 DOI: 10.1093/database/bay053] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2018] [Accepted: 05/17/2018] [Indexed: 12/31/2022]
Abstract
Interstitial lung diseases (ILDs) are a diverse group of ∼200 acute and chronic pulmonary disorders that are characterized by variable amounts of inflammation, fibrosis and architectural distortion with substantial morbidity and mortality. Inaccurate and delayed diagnoses increase the risk, especially in developing countries. Studies have indicated the significant roles of genetic elements in ILDs pathogenesis. Therefore, the first genetic knowledge resource, ILDgenDB, has been developed with an objective to provide ILDs genetic data and their integrated analyses for the better understanding of disease pathogenesis and identification of diagnostics-based biomarkers. This resource contains literature-curated disease candidate genes (DCGs) enriched with various regulatory elements that have been generated using an integrated bioinformatics workflow of databases searches, literature-mining and DCGs–microRNA (miRNAs)–single nucleotide polymorphisms (SNPs) association analyses. To provide statistical significance to disease-gene association, ILD-specificity index and hypergeomatric test scores were also incorporated. Association analyses of miRNAs, SNPs and pathways responsible for the pathogenesis of different sub-classes of ILDs were also incorporated. Manually verified 299 DCGs and their significant associations with 1932 SNPs, 2966 miRNAs and 9170 miR-polymorphisms were also provided. Furthermore, 216 literature-mined and proposed biomarkers were identified. The ILDgenDB resource provides user-friendly browsing and extensive query-based information retrieval systems. Additionally, this resource also facilitates graphical view of predicted DCGs–SNPs/miRNAs and literature associated DCGs–ILDs interactions for each ILD to facilitate efficient data interpretation. Outcomes of analyses suggested the significant involvement of immune system and defense mechanisms in ILDs pathogenesis. This resource may potentially facilitate genetic-based disease monitoring and diagnosis. Database URL: http://14.139.240.55/ildgendb/index.php
Collapse
Affiliation(s)
- Smriti Mishra
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology, Waknaghat, Solan, Himachal Pradesh 173234, India
| | - Mohammad I Shah
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology, Waknaghat, Solan, Himachal Pradesh 173234, India
| | - Malay Sarkar
- Department of Pulmonary Medicine, Indira Gandhi Medical College, Shimla, Himachal Pradesh 171001, India
| | - Nimisha Asati
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology, Waknaghat, Solan, Himachal Pradesh 173234, India
| | - Chittaranjan Rout
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology, Waknaghat, Solan, Himachal Pradesh 173234, India
| |
Collapse
|
106
|
Chen X, Cheng JY, Yin J. Predicting microRNA-disease associations using bipartite local models and hubness-aware regression. RNA Biol 2018; 15:1192-1205. [PMID: 30196756 DOI: 10.1080/15476286.2018.1517010] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The development and progression of numerous complex human diseases have been confirmed to be associated with microRNAs (miRNAs) by various experimental and clinical studies. Predicting potential miRNA-disease associations can help us understand the underlying molecular and cellular mechanisms of diseases and promote the development of disease treatment and diagnosis. Due to the high cost of conventional experimental verification, proposing a new computational method for miRNA-disease association prediction is an efficient and economical way. Since previous computational models ignored the hubness phenomenon, we presented a novel computational model of Bipartite Local models and Hubness-Aware Regression for MiRNA-Disease Association prediction (BLHARMDA). In this method, we first used known miRNA-disease associations to calculate the Jaccard similarity between miRNAs and between diseases, then utilized a modified kNNs model in the bipartite local model method. As a result, we effectively alleviated the detriments from 'bad' hubs. BLHARMDA obtained AUCs of 0.9141 and 0.8390 in the global and local leave-one-out cross validation, respectively, which outperformed most of the previous models and proved high prediction performance of BLHARMDA. Besides, the standard deviation of 0.0006 in 5-fold cross validation confirmed our model's prediction stability and the averaged prediction accuracy of 0.9120 showed the high precision of our model. In addition, to further evaluate our model's accuracy, we implemented BLHARMDA on three typical human diseases in three different types of case studies. As a result, 49 (Esophageal Neoplasms), 50 (Lung Neoplasms) and 50 (Carcinoma Hepatocellular) out of the top 50 related miRNAs were validated by recent experimental discoveries.
Collapse
Affiliation(s)
- Xing Chen
- a School of Information and Control Engineering , China University of Mining and Technology , Xuzhou , China
| | - Jun-Yan Cheng
- b College of Computer Science and Technology , Wuhan University of Science and Technology , Hubei , China
| | - Jun Yin
- a School of Information and Control Engineering , China University of Mining and Technology , Xuzhou , China
| |
Collapse
|
107
|
Pan X, Wenzel A, Jensen LJ, Gorodkin J. Genome-wide identification of clusters of predicted microRNA binding sites as microRNA sponge candidates. PLoS One 2018; 13:e0202369. [PMID: 30142196 PMCID: PMC6108476 DOI: 10.1371/journal.pone.0202369] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 08/01/2018] [Indexed: 12/21/2022] Open
Abstract
The number of discovered natural miRNA sponges in plants, viruses, and mammals is increasing steadily. Some sponges like ciRS-7 for miR-7 contain multiple nearby miRNA binding sites. We hypothesize that such clusters of miRNA binding sites on the genome can function together as a sponge. No systematic effort has been made in search for clusters of miRNA targets. Here, we, to our knowledge, make the first genome-wide target site predictions for clusters of mature human miRNAs. For each miRNA, we predict the target sites on a genome-wide scale, build a graph with edge weights based on the pairwise distances between sites, and apply Markov clustering to identify genomic regions with high binding site density. Significant clusters are then extracted based on cluster size difference between real and shuffled genomes preserving local properties such as the GC content. We then use conservation and binding energy to filter a final set of miRNA target site clusters or sponge candidates. Our pipeline predicts 3673 sponge candidates for 1250 miRNAs, including the experimentally verified miR-7 sponge ciRS-7. In addition, we point explicitly to 19 high-confidence candidates overlapping annotated genomic sequence. The full list of candidates is freely available at http://rth.dk/resources/mirnasponge, where detailed properties for individual candidates can be explored, such as alignment details, conservation, accessibility and target profiles, which facilitates selection of sponge candidates for further context specific analysis.
Collapse
Affiliation(s)
- Xiaoyong Pan
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, Frederiksberg, Denmark
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Anne Wenzel
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, Frederiksberg, Denmark
| | - Lars Juhl Jensen
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, Frederiksberg, Denmark
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
- * E-mail: (LJJ); (JG)
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, Frederiksberg, Denmark
- * E-mail: (LJJ); (JG)
| |
Collapse
|
108
|
Zhao Y, Chen X, Yin J. A Novel Computational Method for the Identification of Potential miRNA-Disease Association Based on Symmetric Non-negative Matrix Factorization and Kronecker Regularized Least Square. Front Genet 2018; 9:324. [PMID: 30186308 PMCID: PMC6111239 DOI: 10.3389/fgene.2018.00324] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Accepted: 07/30/2018] [Indexed: 12/31/2022] Open
Abstract
Increasing evidence has indicated that microRNAs (miRNAs) are associated with numerous human diseases. Studying the associations between miRNAs and diseases contributes to the exploration of effective diagnostic and treatment approaches for diseases. Unfortunately, the use of biological experiments to reveal the potential associations between miRNAs and diseases is time consuming and costly. Therefore, it is very necessary to use simple and efficient calculation models to predict potential disease-related miRNAs. Considering the limitations of other previous methods, we proposed a novel computational model of Symmetric Nonnegative Matrix Factorization for MiRNA-Disease Association prediction (SNMFMDA) to reveal the relation of miRNA-disease pairs. SNMFMDA could be applied to predict miRNAs associated with new diseases. Compared to the direct use of the integrated similarity in previous computational models, the integrated similarity need to be interpolated by symmetric non-negative matrix factorization (SymNMF) before application in SNMFMDA, and the relevant probability of disease-miRNA was obtained mainly through Kronecker regularized least square (KronRLS) method in our model. What's more, the AUC of global leave-one-out cross validation (LOOCV) reached 0.9007, and the AUC based on local LOOCV was 0.8426. Besides, the mean and the standard deviation of AUCs achieved 0.8830 and 0.0017 respectively in 5-fold cross validation. All of the above results demonstrated the superior prediction performance of SNMFMDA. We also conducted three different case studies on Esophageal Neoplasms, Breast Neoplasms and Lung Neoplasms, and 49, 49, and 48 of the top 50 of their predicted miRNAs respectively were confirmed by databases or related literatures. It could be expected that SNMFMDA would be a model with the ability to predict disease-related miRNAs efficiently and accurately.
Collapse
Affiliation(s)
- Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
109
|
He BS, Qu J, Zhao Q. Identifying and Exploiting Potential miRNA-Disease Associations With Neighborhood Regularized Logistic Matrix Factorization. Front Genet 2018; 9:303. [PMID: 30131824 PMCID: PMC6090164 DOI: 10.3389/fgene.2018.00303] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 07/18/2018] [Indexed: 12/12/2022] Open
Abstract
With the rapid development of biological research, microRNAs (miRNA) have become an attractive topic because lots of experimental studies have revealed the significant associations between miRNAs and diseases. However, considering that experiments are expensive and time-consuming, computational methods for predicting associations between miRNAs and diseases have become increasingly crucial. In this study, we proposed a neighborhood regularized logistic matrix factorization method for miRNA-disease association prediction (NRLMFMDA) by integrating miRNA functional similarity, disease semantic similarity, Gaussian interaction profile kernel similarity, and experimentally validation of disease-miRNA association. We used Gaussian interaction profile kernel similarity to cover the shortage of the traditional similarity to make it more reasonable and complete. Furthermore, NRLMFMDA also considered the important influences of the neighborhood information and took full advantage of them to improve the accuracy of the miRNA-disease association prediction. We also improved the accuracy by giving higher weights to the known association data in the process of calculating the potential association probabilities. In the global and the local leave-one-out cross validation, NRLMFMDA got the AUCs of 0.9068 and 0.8239, respectively. Moreover, the average AUC of NRLMFMDA in 5-fold cross validation was 0.8976 ± 0.0034. All the three kinds of cross validations have shown significant advantages to a number of previous models. In the case studies of breast neoplasms, esophageal neoplasms and lymphoma according to known miRNA-disease associations in the recent version of HMDD database, there were 78, 80, and 74% of top 50 predicted related miRNAs verified to have associations with these three diseases, respectively. In the further case studies for new disease without any known related miRNAs and the previous version of HMDD database, there were also high proportions of the predicted miRNAs verified by experimental reports. All the validation experiment results have demonstrated the effectiveness and practicability of NRLFMDA to predict the potential miRNA-disease associations.
Collapse
Affiliation(s)
- Bin-Sheng He
- The First Affiliated Hospital, Changsha Medical University, Changsha, China
| | - Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Qi Zhao
- School of Mathematics, Liaoning University, Shenyang, China.,Research Center for Computer Simulating and Information Processing of Bio-Macromolecules of Liaoning Province, Shenyang, China
| |
Collapse
|
110
|
Chen X, Yin J, Qu J, Huang L. MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction. PLoS Comput Biol 2018; 14:e1006418. [PMID: 30142158 PMCID: PMC6126877 DOI: 10.1371/journal.pcbi.1006418] [Citation(s) in RCA: 264] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 09/06/2018] [Accepted: 08/06/2018] [Indexed: 12/14/2022] Open
Abstract
Recently, a growing number of biological research and scientific experiments have demonstrated that microRNA (miRNA) affects the development of human complex diseases. Discovering miRNA-disease associations plays an increasingly vital role in devising diagnostic and therapeutic tools for diseases. However, since uncovering associations via experimental methods is expensive and time-consuming, novel and effective computational methods for association prediction are in demand. In this study, we developed a computational model of Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction (MDHGI) to discover new miRNA-disease associations by integrating the predicted association probability obtained from matrix decomposition through sparse learning method, the miRNA functional similarity, the disease semantic similarity, and the Gaussian interaction profile kernel similarity for diseases and miRNAs into a heterogeneous network. Compared with previous computational models based on heterogeneous networks, our model took full advantage of matrix decomposition before the construction of heterogeneous network, thereby improving the prediction accuracy. MDHGI obtained AUCs of 0.8945 and 0.8240 in the global and the local leave-one-out cross validation, respectively. Moreover, the AUC of 0.8794+/-0.0021 in 5-fold cross validation confirmed its stability of predictive performance. In addition, to further evaluate the model's accuracy, we applied MDHGI to four important human cancers in three different kinds of case studies. In the first type, 98% (Esophageal Neoplasms) and 98% (Lymphoma) of top 50 predicted miRNAs have been confirmed by at least one of the two databases (dbDEMC and miR2Disease) or at least one experimental literature in PubMed. In the second type of case study, what made a difference was that we removed all known associations between the miRNAs and Lung Neoplasms before implementing MDHGI on Lung Neoplasms. As a result, 100% (Lung Neoplasms) of top 50 related miRNAs have been indexed by at least one of the three databases (dbDEMC, miR2Disease and HMDD V2.0) or at least one experimental literature in PubMed. Furthermore, we also tested our prediction method on the HMDD V1.0 database to prove the applicability of MDHGI to different datasets. The results showed that 50 out of top 50 miRNAs related with the breast neoplasms were validated by at least one of the three databases (HMDD V2.0, dbDEMC, and miR2Disease) or at least one experimental literature.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Li Huang
- Business Analytics Centre, National University of Singapore, Singapore
| |
Collapse
|
111
|
Chen X, Qu J, Yin J. TLHNMDA: Triple Layer Heterogeneous Network Based Inference for MiRNA-Disease Association Prediction. Front Genet 2018; 9:234. [PMID: 30018632 PMCID: PMC6038677 DOI: 10.3389/fgene.2018.00234] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Accepted: 06/12/2018] [Indexed: 12/12/2022] Open
Abstract
In recent years, microRNAs (miRNAs) have been confirmed to be involved in many important biological processes and associated with various kinds of human complex diseases. Therefore, predicting potential associations between miRNAs and diseases with the huge number of verified heterogeneous biological datasets will provide a new perspective for disease therapy. In this article, we developed a novel computational model of Triple Layer Heterogeneous Network based inference for MiRNA-Disease Association prediction (TLHNMDA) by using the experimentally verified miRNA-disease associations, miRNA-long noncoding RNA (lncRNA) interactions, miRNA function similarity information, disease semantic similarity information and Gaussian interaction profile kernel similarity for lncRNAs into an triple layer heterogeneous network to predict new miRNA-disease associations. As a result, the AUCs of TLHNMDA are 0.8795 and 0.8795 ± 0.0010 based on leave-one-out cross validation (LOOCV) and 5-fold cross validation, respectively. Furthermore, TLHNMDA was implemented on three complex human diseases to evaluate predictive ability. As a result, 84% (kidney neoplasms), 78% (lymphoma) and 76% (prostate neoplasms) of top 50 predicted miRNAs for the three complex diseases can be verified by biological experiments. In addition, based on the HMDD v1.0 database, 98% of top 50 potential esophageal neoplasms-associated miRNAs were confirmed by experimental reports. It is expected that TLHNMDA could be a useful model to predict potential miRNA-disease associations with high prediction accuracy and stability.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
112
|
Chen L, Liu B, Yan C. DPFMDA: Distributed and privatized framework for miRNA-Disease association prediction. Pattern Recognit Lett 2018. [DOI: 10.1016/j.patrec.2017.07.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
113
|
Chen X, Zhou Z, Zhao Y. ELLPMDA: Ensemble learning and link prediction for miRNA-disease association prediction. RNA Biol 2018; 15:807-818. [PMID: 29619882 DOI: 10.1080/15476286.2018.1460016] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Recently, accumulating evidences have indicated miRNAs play critical roles in the progression and development of various human complex diseases, which pointed out that identifying miRNA-disease association could enable us to understand diseases at miRNA level. Thus, revealing more and more potential miRNA-disease associations is a vital topic in biomedical domain. However, it will be extremely expensive and time-consuming if we examine all the possible miRNA-disease pairs. Therefore, more accurate and efficient methods are being highly requested to detect potential miRNA-disease associations. In this study, we developed a computational model of Ensemble Learning and Link Prediction for miRNA-Disease Association prediction (ELLPMDA) to achieve this goal. By integrating miRNA functional similarity, disease semantic similarity, miRNA-disease association and Gaussian profile kernel similarity for miRNAs and diseases, we constructed a similarity network and utilized ensemble learning to combine rank results given by three classic similarity-based algorithms. To evaluate the performance of ELLPMDA, we exploited global and local Leave-One-Out Cross Validation (LOOCV), 5-fold Cross Validation (CV) and three kinds of case studies. As a result, the AUCs of ELLPMDA is 0.9181, 0.8181 and 0.9193+/-0.0002 in global LOOCV, local LOOCV and 5-fold CV, respectively, which significantly exceed almost all the previous methods. Moreover, in three distinct kinds of case studies for Kidney Neoplasms, Lymphoma, Prostate Neoplasms, Colon Neoplasms and Esophageal Neoplasms, 88%, 92%, 86%, 98% and 98% out of the top 50 predicted miRNAs has been confirmed, respectively. Besides, ELLPMDA is based on global similarity measure and applicable to new diseases without any known related miRNAs.
Collapse
Affiliation(s)
- Xing Chen
- a School of Information and Control Engineering, China University of Mining and Technology , Xuzhou , China
| | - Zhihan Zhou
- b School of Mathematical Science, Zhejiang University , Hangzhou , China
| | - Yan Zhao
- a School of Information and Control Engineering, China University of Mining and Technology , Xuzhou , China
| |
Collapse
|
114
|
Global Similarity Method Based on a Two-tier Random Walk for the Prediction of microRNA-Disease Association. Sci Rep 2018; 8:6481. [PMID: 29691434 PMCID: PMC5915491 DOI: 10.1038/s41598-018-24532-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 04/03/2018] [Indexed: 12/15/2022] Open
Abstract
microRNAs (miRNAs) mutation and maladjustment are related to the occurrence and development of human diseases. Studies on disease-associated miRNA have contributed to disease diagnosis and treatment. To address the problems, such as low prediction accuracy and failure to predict the relationship between new miRNAs and diseases and so on, we design a Laplacian score of graphs to calculate the global similarity of networks and propose a Global Similarity method based on a Two-tier Random Walk for the prediction of miRNA-disease association (GSTRW) to reveal the correlation between miRNAs and diseases. This method is a global approach that can simultaneously predict the correlation between all diseases and miRNAs in the absence of negative samples. Experimental results reveal that this method is better than existing approaches in terms of overall prediction accuracy and ability to predict orphan diseases and novel miRNAs. A case study on GSTRW for breast cancer and conlon cancer is also conducted, and the majority of miRNA-disease association can be verified by our experiment. This study indicates that this method is feasible and effective.
Collapse
|
115
|
Peng LH, Sun CN, Guan NN, Li JQ, Chen X. HNMDA: heterogeneous network-based miRNA–disease association prediction. Mol Genet Genomics 2018; 293:983-995. [DOI: 10.1007/s00438-018-1438-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 04/18/2018] [Indexed: 12/11/2022]
|
116
|
Abstract
In recent years, microRNAs (miRNAs) are attracting an increasing amount of researchers’ attention, as accumulating studies show that miRNAs play important roles in various basic biological processes and that dysregulation of miRNAs is connected with diverse human diseases, particularly cancers. However, the experimental methods to identify associations between miRNAs and diseases remain costly and laborious. In this study, we developed a computational method named Network Distance Analysis for MiRNA‐Disease Association prediction (NDAMDA) which could effectively predict potential miRNA‐disease associations. The highlight of this method was the use of not only the direct network distance between 2 miRNAs (diseases) but also their respective mean network distances to all other miRNAs (diseases) in the network. The model's reliable performance was certified by the AUC of 0.8920 in global leave‐one‐out cross‐validation (LOOCV), 0.8062 in local LOOCV and the average AUCs of 0.8935 ± 0.0009 in fivefold cross‐validation. Moreover, we applied NDAMDA to 3 different case studies to predict potential miRNAs related to breast neoplasms, lymphoma, oesophageal neoplasms, prostate neoplasms and hepatocellular carcinoma. Results showed that 86%, 72%, 86%, 86% and 84% of the top 50 predicted miRNAs were supported by experimental association evidence. Therefore, NDAMDA is a reliable method for predicting disease‐related miRNAs.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Le-Yi Wang
- School of Mathematics and Statistics, Wuhan University, Luojiashan, Wuchang, China
| | - Li Huang
- Business Analytics Centre, National University of Singapore, Singapore, Singapore
| |
Collapse
|
117
|
Zhang X, Yin J, Zhang X. A Semi-Supervised Learning Algorithm for Predicting Four Types MiRNA-Disease Associations by Mutual Information in a Heterogeneous Network. Genes (Basel) 2018; 9:genes9030139. [PMID: 29498680 PMCID: PMC5867860 DOI: 10.3390/genes9030139] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2017] [Revised: 02/20/2018] [Accepted: 02/22/2018] [Indexed: 01/05/2023] Open
Abstract
Increasing evidence suggests that dysregulation of microRNAs (miRNAs) may lead to a variety of diseases. Therefore, identifying disease-related miRNAs is a crucial problem. Currently, many computational approaches have been proposed to predict binary miRNA-disease associations. In this study, in order to predict underlying miRNA-disease association types, a semi-supervised model called the network-based label propagation algorithm is proposed to infer multiple types of miRNA-disease associations (NLPMMDA) by mutual information derived from the heterogeneous network. The NLPMMDA method integrates disease semantic similarity, miRNA functional similarity, and Gaussian interaction profile kernel similarity information of miRNAs and diseases to construct a heterogeneous network. NLPMMDA is a semi-supervised model which does not require verified negative samples. Leave-one-out cross validation (LOOCV) was implemented for four known types of miRNA-disease associations and demonstrated the reliable performance of our method. Moreover, case studies of lung cancer and breast cancer confirmed effective performance of NLPMMDA to predict novel miRNA-disease associations and their association types.
Collapse
Affiliation(s)
- Xiaotian Zhang
- School of Mechanical, Electrical and Information Engineering, Shandong University, Weihai 264209, China.
| | - Jian Yin
- School of Mechanical, Electrical and Information Engineering, Shandong University, Weihai 264209, China.
| | - Xu Zhang
- School of Mechanical, Electrical and Information Engineering, Shandong University, Weihai 264209, China.
| |
Collapse
|
118
|
Chen X, Yan CC, Zhang X, You ZH, Huang YA, Yan GY. HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction. Oncotarget 2018; 7:65257-65269. [PMID: 27533456 PMCID: PMC5323153 DOI: 10.18632/oncotarget.11251] [Citation(s) in RCA: 182] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2016] [Accepted: 07/28/2016] [Indexed: 12/20/2022] Open
Abstract
Recently, microRNAs (miRNAs) have drawn more and more attentions because accumulating experimental studies have indicated miRNA could play critical roles in multiple biological processes as well as the development and progression of human complex diseases. Using the huge number of known heterogeneous biological datasets to predict potential associations between miRNAs and diseases is an important topic in the field of biology, medicine, and bioinformatics. In this study, considering the limitations in the previous computational methods, we developed the computational model of Heterogeneous Graph Inference for MiRNA-Disease Association prediction (HGIMDA) to uncover potential miRNA-disease associations by integrating miRNA functional similarity, disease semantic similarity, Gaussian interaction profile kernel similarity, and experimentally verified miRNA-disease associations into a heterogeneous graph. HGIMDA obtained AUCs of 0.8781 and 0.8077 based on global and local leave-one-out cross validation, respectively. Furthermore, HGIMDA was applied to three important human cancers for performance evaluation. As a result, 90% (Colon Neoplasms), 88% (Esophageal Neoplasms) and 88% (Kidney Neoplasms) of top 50 predicted miRNAs are confirmed by recent experiment reports. Furthermore, HGIMDA could be effectively applied to new diseases and new miRNAs without any known associations, which overcome the important limitations of many previous computational models.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Electrical Engineering, China University of Mining and Technology, Xuzhou, China
| | | | - Xu Zhang
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China
| | - Zhu-Hong You
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
| | - Yu-An Huang
- Department of Computing, Hong Kong Polytechnic University, Hong Kong, China
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
119
|
Abstract
Nowadays, as more and more associations between microRNAs (miRNAs) and diseases have been discovered, miRNA has gradually become a hot topic in the biological field. Because of the high consumption of time and money on carrying out biological experiments, computational method which can help scientists choose the most likely associations between miRNAs and diseases for further experimental studies is desperately needed. In this study, we proposed a method of Graph Regression for MiRNA-Disease Association prediction (GRMDA) which combines known miRNA-disease associations, miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. We used Gaussian interaction profile kernel similarity to supplement the shortage of miRNA functional similarity and disease semantic similarity. Furthermore, the graph regression was synchronously performed in three latent spaces, including association space, miRNA similarity space, and disease similarity space, by using two matrix factorization approaches called Singular Value Decomposition and Partial Least-Squares to extract important related attributes and filter the noise. In the leave-one-out cross validation and five-fold cross validation, GRMDA obtained the AUCs of 0.8272 and 0.8080 ± 0.0024, respectively. Thus, its performance is better than some previous models. In the case study of Lymphoma using the recorded miRNA-disease associations in HMDD V2.0 database, 88% of top 50 predicted miRNAs were verified by experimental literatures. In order to test the performance of GRMDA on new diseases with no known related miRNAs, we took Breast Neoplasms as an example by regarding all the known related miRNAs as unknown ones. We found that 100% of top 50 predicted miRNAs were verified. Moreover, 84% of top 50 predicted miRNAs in case study for Esophageal Neoplasms based on HMDD V1.0 were verified to have known associations. In conclusion, GRMDA is an effective and practical method for miRNA-disease association prediction.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jing-Ru Yang
- School of Computer Science and Technology, Nankai University, Tianjin, China
| | - Na-Na Guan
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| |
Collapse
|
120
|
Westergaard D, Stærfeldt HH, Tønsberg C, Jensen LJ, Brunak S. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts. PLoS Comput Biol 2018; 14:e1005962. [PMID: 29447159 PMCID: PMC5831415 DOI: 10.1371/journal.pcbi.1005962] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Revised: 02/28/2018] [Accepted: 01/05/2018] [Indexed: 12/21/2022] Open
Abstract
Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823-2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein-protein, disease-gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only.
Collapse
Affiliation(s)
- David Westergaard
- Center for Biological Sequence Analysis, Department of Bio and Health Informatics, Technical University of Denmark, Lyngby, Denmark
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Hans-Henrik Stærfeldt
- Center for Biological Sequence Analysis, Department of Bio and Health Informatics, Technical University of Denmark, Lyngby, Denmark
| | - Christian Tønsberg
- Office for Innovation and Sector Services, Technical Information Center of Denmark, Technical University of Denmark, Lyngby, Denmark
| | - Lars Juhl Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- * E-mail: (LJJ); (SB)
| | - Søren Brunak
- Center for Biological Sequence Analysis, Department of Bio and Health Informatics, Technical University of Denmark, Lyngby, Denmark
- * E-mail: (LJJ); (SB)
| |
Collapse
|
121
|
Chen X, Huang L, Xie D, Zhao Q. EGBMMDA: Extreme Gradient Boosting Machine for MiRNA-Disease Association prediction. Cell Death Dis 2018; 9:3. [PMID: 29305594 PMCID: PMC5849212 DOI: 10.1038/s41419-017-0003-x] [Citation(s) in RCA: 190] [Impact Index Per Article: 27.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 09/09/2017] [Accepted: 09/29/2017] [Indexed: 12/14/2022]
Abstract
Associations between microRNAs (miRNAs) and human diseases have been identified by increasing studies and discovering new ones is an ongoing process in medical laboratories. To improve experiment productivity, researchers computationally infer potential associations from biological data, selecting the most promising candidates for experimental verification. Predicting potential miRNA–disease association has become a research area of growing importance. This paper presents a model of Extreme Gradient Boosting Machine for MiRNA-Disease Association (EGBMMDA) prediction by integrating the miRNA functional similarity, the disease semantic similarity, and known miRNA–disease associations. The statistical measures, graph theoretical measures, and matrix factorization results for each miRNA-disease pair were calculated and used to form an informative feature vector. The vector for known associated pairs obtained from the HMDD v2.0 database was used to train a regression tree under the gradient boosting framework. EGBMMDA was the first decision tree learning-based model used for predicting miRNA–disease associations. Respectively, AUCs of 0.9123 and 0.8221 in global and local leave-one-out cross-validation proved the model’s reliable performance. Moreover, the 0.9048 ± 0.0012 AUC in fivefold cross-validation confirmed its stability. We carried out three different types of case studies of predicting potential miRNAs related to Colon Neoplasms, Lymphoma, Prostate Neoplasms, Breast Neoplasms, and Esophageal Neoplasms. The results indicated that, respectively, 98%, 90%, 98%, 100%, and 98% of the top 50 predictions for the five diseases were confirmed by experiments. Therefore, EGBMMDA appears to be a useful computational resource for miRNA–disease association prediction.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China.
| | - Li Huang
- Business Analytics Centre, National University of Singapore, Singapore, 119613, Singapore
| | - Di Xie
- School of Mathematics, Liaoning University, Shenyang, 110036, China
| | - Qi Zhao
- School of Mathematics, Liaoning University, Shenyang, 110036, China.,Research Center for Computer Simulating and Information Processing of Bio-Macromolecules of Liaoning Province, Shenyang, 110036, China
| |
Collapse
|
122
|
Cui T, Zhang L, Huang Y, Yi Y, Tan P, Zhao Y, Hu Y, Xu L, Li E, Wang D. MNDR v2.0: an updated resource of ncRNA-disease associations in mammals. Nucleic Acids Res 2018; 46:D371-D374. [PMID: 29106639 PMCID: PMC5753235 DOI: 10.1093/nar/gkx1025] [Citation(s) in RCA: 91] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 10/15/2017] [Accepted: 10/19/2017] [Indexed: 02/05/2023] Open
Abstract
Accumulating evidence suggests that diverse non-coding RNAs (ncRNAs) are involved in the progression of a wide variety of diseases. In recent years, abundant ncRNA-disease associations have been found and predicted according to experiments and prediction algorithms. Diverse ncRNA-disease associations are scattered over many resources and mammals, whereas a global view of diverse ncRNA-disease associations is not available for any mammals. Hence, we have updated the MNDR v2.0 database (www.rna-society.org/mndr/) by integrating experimental and prediction associations from manual literature curation and other resources under one common framework. The new developments in MNDR v2.0 include (i) an over 220-fold increase in ncRNA-disease associations enhancement compared with the previous version (including lncRNA, miRNA, piRNA, snoRNA and more than 1400 diseases); (ii) integrating experimental and prediction evidence from 14 resources and prediction algorithms for each ncRNA-disease association; (iii) mapping disease names to the Disease Ontology and Medical Subject Headings (MeSH); (iv) providing a confidence score for each ncRNA-disease association and (v) an increase of species coverage to six mammals. Finally, MNDR v2.0 intends to provide the scientific community with a resource for efficient browsing and extraction of the associations between diverse ncRNAs and diseases, including >260 000 ncRNA-disease associations.
Collapse
Affiliation(s)
- Tianyu Cui
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area and Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
| | - Lin Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yan Huang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Ying Yi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Puwen Tan
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yue Zhao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yongfei Hu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Liyan Xu
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area and Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
- To whom correspondence should be addressed. Tel: +86 451 86699584; Fax: +86 451 86699584; . Correspondence may also be addressed to Enmin Li. Tel: +86 754 88900413; Fax: +86 754 88900847; . Correspondence may also be addressed to Liyan Xu. Tel: +86 754 88900460; Fax: +86 754 88900847;
| | - Enmin Li
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area and Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
- To whom correspondence should be addressed. Tel: +86 451 86699584; Fax: +86 451 86699584; . Correspondence may also be addressed to Enmin Li. Tel: +86 754 88900413; Fax: +86 754 88900847; . Correspondence may also be addressed to Liyan Xu. Tel: +86 754 88900460; Fax: +86 754 88900847;
| | - Dong Wang
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area and Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
- To whom correspondence should be addressed. Tel: +86 451 86699584; Fax: +86 451 86699584; . Correspondence may also be addressed to Enmin Li. Tel: +86 754 88900413; Fax: +86 754 88900847; . Correspondence may also be addressed to Liyan Xu. Tel: +86 754 88900460; Fax: +86 754 88900847;
| |
Collapse
|
123
|
Chen X, Guan NN, Li JQ, Yan GY. GIMDA: Graphlet interaction-based MiRNA-disease association prediction. J Cell Mol Med 2017; 22:1548-1561. [PMID: 29272076 PMCID: PMC5824414 DOI: 10.1111/jcmm.13429] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 09/22/2017] [Indexed: 01/19/2023] Open
Abstract
MicroRNAs (miRNAs) have been confirmed to be closely related to various human complex diseases by many experimental studies. It is necessary and valuable to develop powerful and effective computational models to predict potential associations between miRNAs and diseases. In this work, we presented a prediction model of Graphlet Interaction for MiRNA‐Disease Association prediction (GIMDA) by integrating the disease semantic similarity, miRNA functional similarity, Gaussian interaction profile kernel similarity and the experimentally confirmed miRNA‐disease associations. The related score of a miRNA to a disease was calculated by measuring the graphlet interactions between two miRNAs or two diseases. The novelty of GIMDA lies in that we used graphlet interaction to analyse the complex relationships between two nodes in a graph. The AUCs of GIMDA in global and local leave‐one‐out cross‐validation (LOOCV) turned out to be 0.9006 and 0.8455, respectively. The average result of five‐fold cross‐validation reached to 0.8927 ± 0.0012. In case study for colon neoplasms, kidney neoplasms and prostate neoplasms based on the database of HMDD V2.0, 45, 45, 41 of the top 50 potential miRNAs predicted by GIMDA were validated by dbDEMC and miR2Disease. Additionally, in the case study of new diseases without any known associated miRNAs and the case study of predicting potential miRNA‐disease associations using HMDD V1.0, there were also high percentages of top 50 miRNAs verified by the experimental literatures.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Na-Na Guan
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
124
|
LRSSLMDA: Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction. PLoS Comput Biol 2017; 13:e1005912. [PMID: 29253885 PMCID: PMC5749861 DOI: 10.1371/journal.pcbi.1005912] [Citation(s) in RCA: 186] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Revised: 01/02/2018] [Accepted: 12/01/2017] [Indexed: 12/17/2022] Open
Abstract
Predicting novel microRNA (miRNA)-disease associations is clinically significant due to miRNAs’ potential roles of diagnostic biomarkers and therapeutic targets for various human diseases. Previous studies have demonstrated the viability of utilizing different types of biological data to computationally infer new disease-related miRNAs. Yet researchers face the challenge of how to effectively integrate diverse datasets and make reliable predictions. In this study, we presented a computational model named Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction (LRSSLMDA), which projected miRNAs/diseases’ statistical feature profile and graph theoretical feature profile to a common subspace. It used Laplacian regularization to preserve the local structures of the training data and a L1-norm constraint to select important miRNA/disease features for prediction. The strength of dimensionality reduction enabled the model to be easily extended to much higher dimensional datasets than those exploited in this study. Experimental results showed that LRSSLMDA outperformed ten previous models: the AUC of 0.9178 in global leave-one-out cross validation (LOOCV) and the AUC of 0.8418 in local LOOCV indicated the model’s superior prediction accuracy; and the average AUC of 0.9181+/-0.0004 in 5-fold cross validation justified its accuracy and stability. In addition, three types of case studies further demonstrated its predictive power. Potential miRNAs related to Colon Neoplasms, Lymphoma, Kidney Neoplasms, Esophageal Neoplasms and Breast Neoplasms were predicted by LRSSLMDA. Respectively, 98%, 88%, 96%, 98% and 98% out of the top 50 predictions were validated by experimental evidences. Therefore, we conclude that LRSSLMDA would be a valuable computational tool for miRNA-disease association prediction. Discovering miRNA-disease associations promotes the understanding towards the molecular mechanisms of various human diseases at the miRNA level, and contributes to the development of diagnostic biomarkers and treatment tools for diseases. Computational models can make the discovery more efficient and experiments more productive. LRSSLMDA was proposed to computationally infer potential miRNA-disease associations via adopting sparse subspace learning with Laplacian regularization on the known miRNA-disease association network and the informative feature profiles extracted from the integrated miRNA/disease similarity networks. Experimental results in global and local leave-one-out cross validation and 5-fold cross validation showed a superior prediction performance of LRSSLMDA over previous models. Moreover, three types of case studies on five important human diseases were carried out to further demonstrate the model’s predictive power: respectively, 98%, 88%, 96%, 98% and 98% out of the top 50 predicted miRNAs were confirmed by experimental literatures. So, we believe that LRSSLMDA could make reliable predictions and might guide future experimental studies on miRNA-disease associations.
Collapse
|
125
|
Chen X, Niu YW, Wang GH, Yan GY. MKRMDA: multiple kernel learning-based Kronecker regularized least squares for MiRNA-disease association prediction. J Transl Med 2017; 15:251. [PMID: 29233191 PMCID: PMC5727873 DOI: 10.1186/s12967-017-1340-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 11/07/2017] [Indexed: 01/15/2023] Open
Abstract
Background Recently, as the research of microRNA (miRNA) continues, there are plenty of experimental evidences indicating that miRNA could be associated with various human complex diseases development and progression. Hence, it is necessary and urgent to pay more attentions to the relevant study of predicting diseases associated miRNAs, which may be helpful for effective prevention, diagnosis and treatment of human diseases. Especially, constructing computational methods to predict potential miRNA–disease associations is worthy of more studies because of the feasibility and effectivity. Methods In this work, we developed a novel computational model of multiple kernels learning-based Kronecker regularized least squares for MiRNA–disease association prediction (MKRMDA), which could reveal potential miRNA–disease associations by automatically optimizing the combination of multiple kernels for disease and miRNA. Results MKRMDA obtained AUCs of 0.9040 and 0.8446 in global and local leave-one-out cross validation, respectively. Meanwhile, MKRMDA achieved average AUCs of 0.8894 ± 0.0015 in fivefold cross validation. Furthermore, we conducted three different kinds of case studies on some important human cancers for further performance evaluation. In the case studies of colonic cancer, esophageal cancer and lymphoma based on known miRNA–disease associations in HMDDv2.0 database, 76, 94 and 88% of the corresponding top 50 predicted miRNAs were confirmed by experimental reports, respectively. In another two kinds of case studies for new diseases without any known associated miRNAs and diseases only with known associations in HMDDv1.0 database, the verified ratios of two different cancers were 88 and 94%, respectively. Conclusions All the results mentioned above adequately showed the reliable prediction ability of MKRMDA. We anticipated that MKRMDA could serve to facilitate further developments in the field and the follow-up investigations by biomedical researchers. Electronic supplementary material The online version of this article (10.1186/s12967-017-1340-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China.
| | - Ya-Wei Niu
- School of Mathematics, Shandong University, Jinan, 250100, China
| | - Guang-Hui Wang
- School of Mathematics, Shandong University, Jinan, 250100, China.
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
| |
Collapse
|
126
|
Chen X, Niu YW, Wang GH, Yan GY. HAMDA: Hybrid Approach for MiRNA-Disease Association prediction. J Biomed Inform 2017; 76:50-58. [DOI: 10.1016/j.jbi.2017.10.014] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Revised: 09/27/2017] [Accepted: 10/30/2017] [Indexed: 12/27/2022]
|
127
|
Zhao Q, Xie D, Liu H, Wang F, Yan GY, Chen X. SSCMDA: spy and super cluster strategy for MiRNA-disease association prediction. Oncotarget 2017; 9:1826-1842. [PMID: 29416734 PMCID: PMC5788602 DOI: 10.18632/oncotarget.22812] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Accepted: 10/30/2017] [Indexed: 12/23/2022] Open
Abstract
In the biological field, the identification of the associations between microRNAs (miRNAs) and diseases has been paid increasing attention as an extremely meaningful study for the clinical medicine. However, it is expensive and time-consuming to confirm miRNA-disease associations by experimental methods. Therefore, in recent years, several effective computational models for predicting the potential miRNA-disease associations have been developed. In this paper, we proposed the Spy and Super Cluster strategy for MiRNA-Disease Association prediction (SSCMDA) based on known miRNA-disease associations, integrated disease similarity and integrated miRNA similarity. For problems of mixed unknown miRNA-disease pairs containing both potential associations and real negative associations, which will lead to inaccurate prediction, spy strategy is adopted by SSCMDA to identify reliable negative samples from the unknown miRNA-disease pairs. Moreover, the super-cluster strategy could gather as many positive samples as possible to improve the accuracy of the prediction by overcoming the shortage of lacking sufficient positive training samples. As a result, the AUCs of global leave-one-out cross validation (LOOCV), local LOOCV and 5-fold cross validation were 0.9007, 0.8747 and 0.8806+/-0.0025, respectively. According to the AUC results, SSCMDA has shown a significant improvement compared with some previous models. We further carried out case studies based on various version of HMDD database to test the prediction performance robustness of SSCMDA. We also implemented case study to examine whether SSCMDA was effective for new diseases without any known associated miRNAs. As a result, a large proportion of the predicted miRNAs have been verified by experimental reports.
Collapse
Affiliation(s)
- Qi Zhao
- School of Mathematics, Liaoning University, Shenyang, China.,Research Center for Computer Simulating and Information Processing of Bio-Macromolecules of Liaoning Province, Shenyang, China
| | - Di Xie
- School of Mathematics, Liaoning University, Shenyang, China
| | - Hongsheng Liu
- Research Center for Computer Simulating and Information Processing of Bio-Macromolecules of Liaoning Province, Shenyang, China.,School of Life Science, Liaoning University, Shenyang, China
| | - Fan Wang
- School of Mechatronic Engineering, China University of Mining and Technology, Xuzhou, China.,Jiangsu Key Laboratory of Mine Mechanical and Electrical Equipment, China University of Mining and Technology, Xuzhou, China
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
128
|
Fu L, Peng Q. A deep ensemble model to predict miRNA-disease association. Sci Rep 2017; 7:14482. [PMID: 29101378 PMCID: PMC5670180 DOI: 10.1038/s41598-017-15235-6] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 10/23/2017] [Indexed: 02/08/2023] Open
Abstract
Cumulative evidence from biological experiments has confirmed that microRNAs (miRNAs) are related to many types of human diseases through different biological processes. It is anticipated that precise miRNA-disease association prediction could not only help infer potential disease-related miRNA but also boost human diagnosis and disease prevention. Considering the limitations of previous computational models, a more effective computational model needs to be implemented to predict miRNA-disease associations. In this work, we first constructed a human miRNA-miRNA similarity network utilizing miRNA-miRNA functional similarity data and heterogeneous miRNA Gaussian interaction profile kernel similarities based on the assumption that similar miRNAs with similar functions tend to be associated with similar diseases, and vice versa. Then, we constructed disease-disease similarity using disease semantic information and heterogeneous disease-related interaction data. We proposed a deep ensemble model called DeepMDA that extracts high-level features from similarity information using stacked autoencoders and then predicts miRNA-disease associations by adopting a 3-layer neural network. In addition to five-fold cross-validation, we also proposed another cross-validation method to evaluate the performance of the model. The results show that the proposed model is superior to previous methods with high robustness.
Collapse
Affiliation(s)
- Laiyi Fu
- Systems Engineering Institute, School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shannxi, 710049, China
| | - Qinke Peng
- Systems Engineering Institute, School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shannxi, 710049, China.
| |
Collapse
|
129
|
You ZH, Wang LP, Chen X, Zhang S, Li XF, Yan GY, Li ZW. PRMDA: personalized recommendation-based MiRNA-disease association prediction. Oncotarget 2017; 8:85568-85583. [PMID: 29156742 PMCID: PMC5689632 DOI: 10.18632/oncotarget.20996] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 08/29/2017] [Indexed: 12/23/2022] Open
Abstract
Recently, researchers have been increasingly focusing on microRNAs (miRNAs) with accumulating evidence indicating that miRNAs serve as a vital role in various biological processes and dysfunctions of miRNAs are closely related with human complex diseases. Predicting potential associations between miRNAs and diseases is attached considerable significance in the domains of biology, medicine, and bioinformatics. In this study, we developed a computational model of Personalized Recommendation-based MiRNA-Disease Association prediction (PRMDA) to predict potential related miRNA for all diseases by implementing personalized recommendation-based algorithm based on integrated similarity for diseases and miRNAs. PRMDA is a global method capable of prioritizing candidate miRNAs for all diseases simultaneously. Moreover, the model could be applied to diseases without any known associated miRNAs. PRMDA obtained AUC of 0.8315 based on leave-one-out cross validation, which demonstrated that PRMDA could be regarded as a reliable tool for miRNA-disease association prediction. Besides, we implemented PRMDA on the HMDD V1.0 and HMDD V2.0 databases for three kinds of case studies about five important human cancers in order to test the performance of the model from different perspectives. As a result, 92%, 94%, 88%, 96% and 88% out of the top 50 candidate miRNAs predicted by PRMDA for Colon Neoplasms, Esophageal Neoplasms, Lymphoma, Lung Neoplasms and Breast Neoplasms, respectively, were confirmed by experimental reports.
Collapse
Affiliation(s)
- Zhu-Hong You
- Department of Information Engineering, Xijing University, Xi’an, China
| | - Luo-Pin Wang
- International Software School, Wuhan University, Wuhan, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Shanwen Zhang
- Department of Information Engineering, Xijing University, Xi’an, China
| | - Xiao-Fang Li
- Department of Information Engineering, Xijing University, Xi’an, China
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Zheng-Wei Li
- School of Computer Science and Technology, Hefei, China
| |
Collapse
|
130
|
Li JQ, Rong ZH, Chen X, Yan GY, You ZH. MCMDA: Matrix completion for MiRNA-disease association prediction. Oncotarget 2017; 8:21187-21199. [PMID: 28177900 PMCID: PMC5400576 DOI: 10.18632/oncotarget.15061] [Citation(s) in RCA: 148] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2016] [Accepted: 01/09/2017] [Indexed: 12/31/2022] Open
Abstract
Nowadays, researchers have realized that microRNAs (miRNAs) are playing a significant role in many important biological processes and they are closely connected with various complex human diseases. However, since there are too many possible miRNA-disease associations to analyze, it remains difficult to predict the potential miRNAs related to human diseases without a systematic and effective method. In this study, we developed a Matrix Completion for MiRNA-Disease Association prediction model (MCMDA) based on the known miRNA-disease associations in HMDD database. MCMDA model utilized the matrix completion algorithm to update the adjacency matrix of known miRNA-disease associations and furthermore predict the potential associations. To evaluate the performance of MCMDA, we performed leave-one-out cross validation (LOOCV) and 5-fold cross validation to compare MCMDA with three previous classical computational models (RLSMDA, HDMP, and WBSMDA). As a result, MCMDA achieved AUCs of 0.8749 in global LOOCV, 0.7718 in local LOOCV and average AUC of 0.8767+/−0.0011 in 5-fold cross validation. Moreover, the prediction results associated with colon neoplasms, kidney neoplasms, lymphoma and prostate neoplasms were verified. As a consequence, 84%, 86%, 78% and 90% of the top 50 potential miRNAs for these four diseases were respectively confirmed by recent experimental discoveries. Therefore, MCMDA model is superior to the previous models in that it improves the prediction performance although it only depends on the known miRNA-disease associations.
Collapse
Affiliation(s)
- Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
| | - Zhi-Hao Rong
- School of Software, Beihang University, Beijing, 100191, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
| | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, ürümqi, 830011, China
| |
Collapse
|
131
|
Xiao Q, Luo J, Liang C, Cai J, Ding P. A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations. Bioinformatics 2017; 34:239-248. [PMID: 28968779 DOI: 10.1093/bioinformatics/btx545] [Citation(s) in RCA: 178] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Revised: 07/21/2017] [Accepted: 08/31/2017] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION MicroRNAs (miRNAs) play crucial roles in post-transcriptional regulations and various cellular processes. The identification of disease-related miRNAs provides great insights into the underlying pathogenesis of diseases at a system level. However, most existing computational approaches are biased towards known miRNA-disease associations, which is inappropriate for those new diseases or miRNAs without any known association information. RESULTS In this study, we propose a new method with graph regularized non-negative matrix factorization in heterogeneous omics data, called GRNMF, to discover potential associations between miRNAs and diseases, especially for new diseases and miRNAs or those diseases and miRNAs with sparse known associations. First, we integrate the disease semantic information and miRNA functional information to estimate disease similarity and miRNA similarity, respectively. Considering that there is no available interaction observed for new diseases or miRNAs, a preprocessing step is developed to construct the interaction score profiles that will assist in prediction. Next, a graph regularized non-negative matrix factorization framework is utilized to simultaneously identify potential associations for all diseases. The results indicated that our proposed method can effectively prioritize disease-associated miRNAs with higher accuracy compared with other recent approaches. Moreover, case studies also demonstrated the effectiveness of GRNMF to infer unknown miRNA-disease associations for those novel diseases and miRNAs. AVAILABILITY AND IMPLEMENTATION The code of GRNMF is freely available at https://github.com/XIAO-HN/GRNMF/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Qiu Xiao
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Cheng Liang
- College of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Jie Cai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Pingjian Ding
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| |
Collapse
|
132
|
Chen X, Gong Y, Zhang DH, You ZH, Li ZW. DRMDA: deep representations-based miRNA-disease association prediction. J Cell Mol Med 2017; 22:472-485. [PMID: 28857494 PMCID: PMC5742725 DOI: 10.1111/jcmm.13336] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 07/01/2017] [Indexed: 12/22/2022] Open
Abstract
Recently, microRNAs (miRNAs) are confirmed to be important molecules within many crucial biological processes and therefore related to various complex human diseases. However, previous methods of predicting miRNA–disease associations have their own deficiencies. Under this circumstance, we developed a prediction method called deep representations‐based miRNA–disease association (DRMDA) prediction. The original miRNA–disease association data were extracted from HDMM database. Meanwhile, stacked auto‐encoder, greedy layer‐wise unsupervised pre‐training algorithm and support vector machine were implemented to predict potential associations. We compared DRMDA with five previous classical prediction models (HGIMDA, RLSMDA, HDMP, WBSMDA and RWRMDA) in global leave‐one‐out cross‐validation (LOOCV), local LOOCV and fivefold cross‐validation, respectively. The AUCs achieved by DRMDA were 0.9177, 08339 and 0.9156 ± 0.0006 in the three tests above, respectively. In further case studies, we predicted the top 50 potential miRNAs for colon neoplasms, lymphoma and prostate neoplasms, and 88%, 90% and 86% of the predicted miRNA can be verified by experimental evidence, respectively. In conclusion, DRMDA is a promising prediction method which could identify potential and novel miRNA–disease associations.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Yao Gong
- School of Life Science, Peking University, Beijing, China
| | - De-Hong Zhang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Ürümqi, China
| | - Zheng-Wei Li
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
133
|
Peng L, Peng M, Liao B, Huang G, Liang W, Li K. Improved low-rank matrix recovery method for predicting miRNA-disease association. Sci Rep 2017; 7:6007. [PMID: 28729528 PMCID: PMC5519594 DOI: 10.1038/s41598-017-06201-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 06/23/2017] [Indexed: 02/06/2023] Open
Abstract
MicroRNAs (miRNAs) performs crucial roles in various human diseases, but miRNA-related pathogenic mechanisms remain incompletely understood. Revealing the potential relationship between miRNAs and diseases is a critical problem in biomedical research. Considering limitation of existing computational approaches, we develop improved low-rank matrix recovery (ILRMR) for miRNA-disease association prediction. ILRMR is a global method that can simultaneously prioritize potential association for all diseases and does not require negative samples. ILRMR can also identify promising miRNAs for investigating diseases without any known related miRNA. By integrating miRNA-miRNA similarity information, disease-disease similarity information, and miRNA family information to matrix recovery, ILRMR performs better than other methods in cross validation and case studies.
Collapse
Affiliation(s)
- Li Peng
- College of Information Science and Engineering, Hunan University, Changsha, Hunan, 410082, China.,College of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, Hunan, 411201, China
| | - Manman Peng
- College of Information Science and Engineering, Hunan University, Changsha, Hunan, 410082, China.
| | - Bo Liao
- College of Information Science and Engineering, Hunan University, Changsha, Hunan, 410082, China
| | - Guohua Huang
- College of Information Engineering, Shaoyang University, Shaoyang, Hunan, 422000, China
| | - Wei Liang
- College of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, Hunan, 411201, China
| | - Keqin Li
- Department of Computer Science, State University of New York, New Paltz, New York, 12561, USA
| |
Collapse
|
134
|
Liu Y, Zeng X, He Z, Zou Q. Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:905-915. [PMID: 27076459 DOI: 10.1109/tcbb.2016.2550432] [Citation(s) in RCA: 209] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Since the discovery of the regulatory function of microRNA (miRNA), increased attention has focused on identifying the relationship between miRNA and disease. It has been suggested that computational method are an efficient way to identify potential disease-related miRNAs for further confirmation using biological experiments. In this paper, we first highlighted three limitations commonly associated with previous computational methods. To resolve these limitations, we established disease similarity subnetwork and miRNA similarity subnetwork by integrating multiple data sources, where the disease similarity is composed of disease semantic similarity and disease functional similarity, and the miRNA similarity is calculated using the miRNA-target gene and miRNA-lncRNA (long non-coding RNA) associations. Then, a heterogeneous network was constructed by connecting the disease similarity subnetwork and the miRNA similarity subnetwork using the known miRNA-disease associations. We extended random walk with restart to predict miRNA-disease associations in the heterogeneous network. The leave-one-out cross-validation achieved an average area under the curve (AUC) of 0:8049 across 341 diseases and 476 miRNAs. For five-fold cross-validation, our method achieved an AUC from 0:7970 to 0:9249 for 15 human diseases. Case studies further demonstrated the feasibility of our method to discover potential miRNA-disease associations. An online service for prediction is freely available at http://ifmda.aliapp.com.
Collapse
|
135
|
Huang YA, You ZH, Li LP, Huang ZA, Xiang LX, Li XF, Lv LT. EPMDA: an expression-profile based computational model for microRNA-disease association prediction. Oncotarget 2017; 8:87033-87043. [PMID: 29152061 PMCID: PMC5675613 DOI: 10.18632/oncotarget.18788] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Accepted: 05/29/2017] [Indexed: 12/14/2022] Open
Abstract
MicroRNA has become a new star molecule for understanding multiple biological processes and the mechanism of various complex human diseases. Even though a number of computational models have been proposed for predicting the association between microRNAs and various human diseases, most of them are mainly based on microRNA functional similarity and heterogeneous biological networks which suffer from inevitable computational error and bias. In this work, considering the limitation of information resource used by existing methods, we proposed EPMDA model which is the first computational method using the expression profiles of microRNAs to predict the most potential microRNAs associated with various diseases. Based on the dataset constructed from HMDD v2.0 database, EPMDA obtained AUCs of 0.8945 and 0.8917 based on the leave-one-out and 5-fold cross validation, respectively. Furthermore, EPMDA was applied to two important human diseases. As a result, 80% and 88% microRNAs in the top-25 lists of Colon Neoplasms and Kidney Neoplasms were confirmed by other databases. The performance comparison of EPMDA with existing prediction models and classical algorithms also demonstrated the reliable prediction ability of EPMDA. It is anticipated that EPMDA can be used as an effective computational tool for future biomedical researches.
Collapse
Affiliation(s)
- Yu-An Huang
- College of Information Engineering, Xijing University, Xi'an 710123, China
| | - Zhu-Hong You
- College of Information Engineering, Xijing University, Xi'an 710123, China
| | - Li-Ping Li
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China
| | - Zhi-An Huang
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
| | - Lu-Xuan Xiang
- Institute of Information and Control, Hangzhou Dianzi University, Hangzhou, China
| | - Xiao-Fang Li
- College of Information Engineering, Xijing University, Xi'an 710123, China
| | - Lin-Tao Lv
- College of Information Engineering, Xijing University, Xi'an 710123, China
| |
Collapse
|
136
|
Alkan F, Wenzel A, Palasca O, Kerpedjiev P, Rudebeck A, Stadler PF, Hofacker IL, Gorodkin J. RIsearch2: suffix array-based large-scale prediction of RNA-RNA interactions and siRNA off-targets. Nucleic Acids Res 2017; 45:e60. [PMID: 28108657 PMCID: PMC5416843 DOI: 10.1093/nar/gkw1325] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 12/19/2016] [Indexed: 12/28/2022] Open
Abstract
Intermolecular interactions of ncRNAs are at the core of gene regulation events, and identifying the full map of these interactions bears crucial importance for ncRNA functional studies. It is known that RNA-RNA interactions are built up by complementary base pairings between interacting RNAs and high level of complementarity between two RNA sequences is a powerful predictor of such interactions. Here, we present RIsearch2, a large-scale RNA-RNA interaction prediction tool that enables quick localization of potential near-complementary RNA-RNA interactions between given query and target sequences. In contrast to previous heuristics which either search for exact matches while including G-U wobble pairs or employ simplified energy models, we present a novel approach using a single integrated seed-and-extend framework based on suffix arrays. RIsearch2 enables fast discovery of candidate RNA-RNA interactions on genome/transcriptome-wide scale. We furthermore present an siRNA off-target discovery pipeline that not only predicts the off-target transcripts but also computes the off-targeting potential of a given siRNA. This is achieved by combining genome-wide RIsearch2 predictions with target site accessibilities and transcript abundance estimates. We show that this pipeline accurately predicts siRNA off-target interactions and enables off-targeting potential comparisons between different siRNA designs. RIsearch2 and the siRNA off-target discovery pipeline are available as stand-alone software packages from http://rth.dk/resources/risearch.
Collapse
Affiliation(s)
- Ferhat Alkan
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | - Anne Wenzel
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | - Oana Palasca
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Peter Kerpedjiev
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Wien, Austria
| | - Anders Frost Rudebeck
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | - Peter F. Stadler
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Wien, Austria
- Bioinformatics Group, Department of Computer Science & IZBI-Interdisciplinary Center for Bioinformatics & LIFE-Leipzig Research Center for Civilization Diseases & Competence Center for Scalable Data Services and Solutions, University Leipzig, Härtelstraße 16–18, 04107 Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, 04103 Leipzig, Germany
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| | - Ivo L. Hofacker
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Wien, Austria
- Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Währingerstraße 17, 1090 Wien, Austria
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
- Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| |
Collapse
|
137
|
Abstract
Cumulative verified experimental studies have demonstrated that microRNAs (miRNAs) could be closely related with the development and progression of human complex diseases. Based on the assumption that functional similar miRNAs may have a strong correlation with phenotypically similar diseases and vice versa, researchers developed various effective computational models which combine heterogeneous biologic data sets including disease similarity network, miRNA similarity network, and known disease-miRNA association network to identify potential relationships between miRNAs and diseases in biomedical research. Considering the limitations in previous computational study, we introduced a novel computational method of Ranking-based KNN for miRNA-Disease Association prediction (RKNNMDA) to predict potential related miRNAs for diseases, and our method obtained an AUC of 0.8221 based on leave-one-out cross validation. In addition, RKNNMDA was applied to 3 kinds of important human cancers for further performance evaluation. The results showed that 96%, 80% and 94% of predicted top 50 potential related miRNAs for Colon Neoplasms, Esophageal Neoplasms, and Prostate Neoplasms have been confirmed by experimental literatures, respectively. Moreover, RKNNMDA could be used to predict potential miRNAs for diseases without any known miRNAs, and it is anticipated that RKNNMDA would be of great use for novel miRNA-disease association identification.
Collapse
Affiliation(s)
- Xing Chen
- a School of Information and Control Engineering , China University of Mining and Technology , Xuzhou , China
| | - Qiao-Feng Wu
- b College of Electrical Engineering , Zhejiang University , Hangzhou , China
| | - Gui-Ying Yan
- c Academy of Mathematics and Systems Science , Chinese Academy of Sciences , Beijing , China
| |
Collapse
|
138
|
Mugunga I, Ju Y, Liu X, Huang X. Computational prediction of human disease-related microRNAs by path-based random walk. Oncotarget 2017; 8:58526-58535. [PMID: 28938576 PMCID: PMC5601672 DOI: 10.18632/oncotarget.17226] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 03/22/2017] [Indexed: 01/09/2023] Open
Abstract
MicroRNAs (miRNAs) are a class of small, endogenous RNAs that are 21–25 nucleotides in length. In animals and plants, miRNAs target specific genes for degradation or translation repression. Discovering disease-related miRNA is fundamental for understanding the pathogenesis of diseases. The association between miRNA and a disease is mainly determined via biological investigation, which is complicated by increased biological information due to big data from different databases. Researchers have utilized different computational methods to harmonize experimental approaches to discover miRNA that articulates restrictively in specific environmental situations. In this work, we present a prediction model that is based on the theory of path features and random walk to obtain a relevancy score of miRNA-related disease. In this model, highly ranked scores are potential miRNA-disease associations. Features were extracted from positive and negative samples of miRNA-disease association. Then, we compared our method with other presented models using the five-fold cross-validation method, which obtained an area under the receiver operating characteristic curve of 88.6%. This indicated that our method has a better performance compared to previous methods and will help future biological investigations.
Collapse
Affiliation(s)
- Israel Mugunga
- Department of Computer Science, Xiamen University, Xiamen, 361005, China
| | - Ying Ju
- Department of Computer Science, Xiamen University, Xiamen, 361005, China
| | - Xiangrong Liu
- Department of Computer Science, Xiamen University, Xiamen, 361005, China
| | - Xiaoyang Huang
- Department of Computer Science, Xiamen University, Xiamen, 361005, China
| |
Collapse
|
139
|
You ZH, Huang ZA, Zhu Z, Yan GY, Li ZW, Wen Z, Chen X. PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction. PLoS Comput Biol 2017; 13:e1005455. [PMID: 28339468 PMCID: PMC5384769 DOI: 10.1371/journal.pcbi.1005455] [Citation(s) in RCA: 292] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2016] [Revised: 04/07/2017] [Accepted: 03/14/2017] [Indexed: 12/31/2022] Open
Abstract
In the recent few years, an increasing number of studies have shown that microRNAs (miRNAs) play critical roles in many fundamental and important biological processes. As one of pathogenetic factors, the molecular mechanisms underlying human complex diseases still have not been completely understood from the perspective of miRNA. Predicting potential miRNA-disease associations makes important contributions to understanding the pathogenesis of diseases, developing new drugs, and formulating individualized diagnosis and treatment for diverse human complex diseases. Instead of only depending on expensive and time-consuming biological experiments, computational prediction models are effective by predicting potential miRNA-disease associations, prioritizing candidate miRNAs for the investigated diseases, and selecting those miRNAs with higher association probabilities for further experimental validation. In this study, Path-Based MiRNA-Disease Association (PBMDA) prediction model was proposed by integrating known human miRNA-disease associations, miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity for miRNAs and diseases. This model constructed a heterogeneous graph consisting of three interlinked sub-graphs and further adopted depth-first search algorithm to infer potential miRNA-disease associations. As a result, PBMDA achieved reliable performance in the frameworks of both local and global LOOCV (AUCs of 0.8341 and 0.9169, respectively) and 5-fold cross validation (average AUC of 0.9172). In the cases studies of three important human diseases, 88% (Esophageal Neoplasms), 88% (Kidney Neoplasms) and 90% (Colon Neoplasms) of top-50 predicted miRNAs have been manually confirmed by previous experimental reports from literatures. Through the comparison performance between PBMDA and other previous models in case studies, the reliable performance also demonstrates that PBMDA could serve as a powerful computational tool to accelerate the identification of disease-miRNA associations. Identification of miRNA-disease associations is considered as a key way for the development of pathology, diagnose and therapy. Computational prediction models contribute to discovering the underlying disease-related miRNAs on a large scale. Based on the assumption that functionally related miRNAs tend to be involved in phenotypically similar disease and vice versa, the model of PBMDA was developed to prioritize the underlying miRNA-disease associations by adopting a special depth-first search algorithm in a heterogeneous graph, which was composed of known miRNA-disease association network, miRNA similarity network, and disease similarity network. Through leave-one-out cross validation and 5-fold cross validation, the promising results demonstrated the effectiveness of the proposed model. We further implemented the case studies of three important human complex diseases, 88%, 88% and 90% of top-50 predicted miRNA-disease associations have been manually confirmed based on recent experimental reports. It is anticipated that PBMDA could prioritize the most potential miRNA-disease associations on a large scale for advancing the progress of biological experiment validation in the future, which could further contribute to the understanding of complex disease mechanisms.
Collapse
Affiliation(s)
- Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, ürümqi, China
| | - Zhi-An Huang
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Zexuan Zhu
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
- * E-mail: (XC); (ZZ)
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Zheng-Wei Li
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
| | - Zhenkun Wen
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
- * E-mail: (XC); (ZZ)
| |
Collapse
|
140
|
Yu H, Chen X, Lu L. Large-scale prediction of microRNA-disease associations by combinatorial prioritization algorithm. Sci Rep 2017; 7:43792. [PMID: 28317855 PMCID: PMC5357838 DOI: 10.1038/srep43792] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Accepted: 01/30/2017] [Indexed: 12/12/2022] Open
Abstract
Identification of the associations between microRNA molecules and human diseases from large-scale heterogeneous biological data is an important step for understanding the pathogenesis of diseases in microRNA level. However, experimental verification of microRNA-disease associations is expensive and time-consuming. To overcome the drawbacks of conventional experimental methods, we presented a combinatorial prioritization algorithm to predict the microRNA-disease associations. Importantly, our method can be used to predict microRNAs (diseases) associated with the diseases (microRNAs) without the known associated microRNAs (diseases). The predictive performance of our proposed approach was evaluated and verified by the internal cross-validations and external independent validations based on standard association datasets. The results demonstrate that our proposed method achieves the impressive performance for predicting the microRNA-disease association with the Area Under receiver operation characteristic Curve (AUC), 86.93%, which is indeed outperform the previous prediction methods. Particularly, we observed that the ensemble-based method by integrating the predictions of multiple algorithms can give more reliable and robust prediction than the single algorithm, with the AUC score improved to 92.26%. We applied our combinatorial prioritization algorithm to lung neoplasms and breast neoplasms, and revealed their top 30 microRNA candidates, which are in consistent with the published literatures and databases.
Collapse
Affiliation(s)
- Hua Yu
- State Key Laboratory of Plant Genomics, Institute of Genetic and Developmental Biology, Chinese Academy of Sciences, No. 1 West Beichen Road, Chaoyang District, Beijing, 100101, China
| | - Xiaojun Chen
- Key Lab of Agricultural Biotechnology of Ningxia, Agricultural Biotechnology Center, Ningxia Academy of Agriculture and Forestry Sciences, 590 Huanghe East Road, Jinfeng District, Yinchuan, Ningxia, 750002, China.
| | - Lu Lu
- Beijing Computing Center, Beijing Academy of Science and Technology, Building 3 BeiKe Industrial park, Fengxian road 7, Haidian District, Beijing, 100094, China
| |
Collapse
|
141
|
Peng L, Chen Y, Ma N, Chen X. NARRMDA: negative-aware and rating-based recommendation algorithm for miRNA–disease association prediction. MOLECULAR BIOSYSTEMS 2017; 13:2650-2659. [DOI: 10.1039/c7mb00499k] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
An increasing amount of evidence indicates that microRNAs (miRNAs) are closely related to many important biological processes and play a significant role in various human diseases.
Collapse
Affiliation(s)
- Lihong Peng
- College of Information Engineering
- Changsha Medical University
- Changsha
- China
| | - Yeqing Chen
- College of Informatics
- Huazhong Agricultural University
- Wuhan
- China
| | - Ning Ma
- College of Pharmacy
- Changsha Medical University
- Changsha
- China
| | - Xing Chen
- School of Information and Control Engineering
- China University of Mining and Technology
- Xuzhou
- China
| |
Collapse
|
142
|
Chen X, Jiang ZC, Xie D, Huang DS, Zhao Q, Yan GY, You ZH. A novel computational model based on super-disease and miRNA for potential miRNA–disease association prediction. MOLECULAR BIOSYSTEMS 2017; 13:1202-1212. [DOI: 10.1039/c6mb00853d] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Considering the various disadvantages of previous computational models, we proposed a novel computational model based on super-disease and miRNA for potential miRNA–disease association prediction (SDMMDA) to predict potential miRNA–disease associations by integrating known associations, disease semantic similarity, miRNA functional similarity, and Gaussian interaction profile kernel similarity for diseases and miRNAs.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering
- China University of Mining and Technology
- Xuzhou
- China
| | - Zhi-Chao Jiang
- School of Electronics and Information Engineering
- Tongji University
- Shanghai
- China
| | - Di Xie
- School of Mathematics
- Liaoning University
- Shenyang
- China
| | - De-Shuang Huang
- School of Electronics and Information Engineering
- Tongji University
- Shanghai
- China
| | - Qi Zhao
- School of Mathematics
- Liaoning University
- Shenyang
- China
- Research Center for Computer Simulating and Information Processing of Bio-Macromolecules of Liaoning Province
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science
- Chinese Academy of Sciences
- Beijing
- China
| | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry
- Chinese Academy of Science
- ürümqi
- China
| |
Collapse
|
143
|
Wei ZS, Han K, Yang JY, Shen HB, Yu DJ. Protein–protein interaction sites prediction by ensembling SVM and sample-weighted random forests. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.02.022] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
144
|
Guo L, Liang T, Yu J, Zou Q. A Comprehensive Analysis of miRNA/isomiR Expression with Gender Difference. PLoS One 2016; 11:e0154955. [PMID: 27167065 PMCID: PMC4864079 DOI: 10.1371/journal.pone.0154955] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 04/21/2016] [Indexed: 01/09/2023] Open
Abstract
Although microRNAs (miRNAs) have been widely studied as epigenetic regulation molecules, fewer studies focus on the gender difference at the miRNA and isomiR expression levels. In this study, we aim to understand the potential relationships between gender difference and miRNA/isomiR expression through a comprehensive analysis of small RNA-sequencing datasets based on different human diseases and tissues. Based on specific samples from males and females, we determined that some miRNAs may be diversely expressed between different tissues and genders. Thus, these miRNAs may exhibit inconsistent and even opposite expression between males and females. According to deregulated miRNA expression profiles, some dominantly expressed miRNA loci were selected to analyze isomiR expression patterns using rates of dominant isomiRs. In some miRNA loci, isomiRs showed statistical significance between tumor and normal samples and between males and females samples, suggesting that isomiR expression patterns are not always invariable but may vary between males and females, as well as among different tissues, tumors, and normal samples. The divergence implicates the fluctuation in the expression of miRNA and its detailed expression at the isomiR levels. The divergence also indicates that gender difference may be an important factor that affects the screening of disease-associated miRNAs and isomiRs. This study suggests that miRNA/isomiR expression and gender difference may be more complex than previously assumed and should be further studied according to specific samples from males or females.
Collapse
Affiliation(s)
- Li Guo
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, China
- * E-mail: (LG); (QZ)
| | - Tingming Liang
- Jiangsu Key Laboratory for Molecular and Medical Biotechnology, College of Life Science, Nanjing Normal University, Nanjing, China
| | - Jiafeng Yu
- Shandong Provincial Key Laboratory of Functional Macromolecular Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Quan Zou
- School of Computer Science and Technology, Tianjin University, Tianjin, China
- State Key Laboratory of Medicinal Chemical Biology, NanKai University, Tianjin, China
- * E-mail: (LG); (QZ)
| |
Collapse
|
145
|
El Baroudi M, Cinti C, Capobianco E. Immunomediated Pan-cancer Regulation Networks are Dominant Fingerprints After Treatment of Cell Lines with Demethylation. Cancer Inform 2016; 15:45-64. [PMID: 27147816 PMCID: PMC4849425 DOI: 10.4137/cin.s31809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Revised: 02/09/2016] [Accepted: 02/17/2016] [Indexed: 11/11/2022] Open
Abstract
Pan-cancer studies are particularly relevant not only for addressing the complexity of the inherently observed heterogeneity but also for identifying clinically relevant features that may be common to the cancer types. Immune system regulations usually reveal synergistic modulation with other cancer mechanisms and in combination provide insights on possible advances in cancer immunotherapies. Network inference is a powerful approach to decipher pan-cancer systems dynamics. The methodology proposed in this study elucidates the impacts of epigenetic treatment on the drivers of complex pan-cancer regulation circuits involving cell lines of five cancer types. These patterns were observed from differential gene expression measurements following demethylation with 5-azacytidine. Networks were built to establish associations of phenotypes at molecular level with cancer hallmarks through both transcriptional and post-transcriptional regulation mechanisms. The most prominent feature that emerges from our integrative network maps, linking pathway landscapes to disease and drug-target associations, refers primarily to a mosaic of immune-system crosslinked influences. Therefore, characteristics initially evidenced in single cancer maps become motifs well summarized by network cores and fingerprints.
Collapse
Affiliation(s)
- Mariama El Baroudi
- Laboratory of Integrative Systems Medicine (LISM), Institute of Clinical Physiology, National Research Council of Italy (CNR), Pisa, Italy
- Medical Oncology Department, MIRO, Institut de Recherche Expérimentale et Clinique (IREC), Université Catholique de Louvain, Brussels, Belgium
| | - Caterina Cinti
- Cancer Therapy UOS, Institute of Clinical Phsyiology, National Research Council of Italy (CNR), Siena, Italy
| | - Enrico Capobianco
- Laboratory of Integrative Systems Medicine (LISM), Institute of Clinical Physiology, National Research Council of Italy (CNR), Pisa, Italy
- Center for Computational Science, Miller School of Medicine, University of Miami, Miami, FL, USA
| |
Collapse
|
146
|
Chen X, Yan CC, Zhang X, You ZH, Deng L, Liu Y, Zhang Y, Dai Q. WBSMDA: Within and Between Score for MiRNA-Disease Association prediction. Sci Rep 2016; 6:21106. [PMID: 26880032 PMCID: PMC4754743 DOI: 10.1038/srep21106] [Citation(s) in RCA: 247] [Impact Index Per Article: 27.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2015] [Accepted: 01/18/2016] [Indexed: 12/18/2022] Open
Abstract
Increasing evidences have indicated that microRNAs (miRNAs) are functionally associated with the development and progression of various complex human diseases. However, the roles of miRNAs in multiple biological processes or various diseases and their underlying molecular mechanisms still have not been fully understood yet. Predicting potential miRNA-disease associations by integrating various heterogeneous biological datasets is of great significance to the biomedical research. Computational methods could obtain potential miRNA-disease associations in a short time, which significantly reduce the experimental time and cost. Considering the limitations in previous computational methods, we developed the model of Within and Between Score for MiRNA-Disease Association prediction (WBSMDA) to predict potential miRNAs associated with various complex diseases. WBSMDA could be applied to the diseases without any known related miRNAs. The AUC of 0.8031 based on Leave-one-out cross validation has demonstrated its reliable performance. WBSMDA was further applied to Colon Neoplasms, Prostate Neoplasms, and Lymphoma for the identification of their potential related miRNAs. As a result, 90%, 84%, and 80% of predicted miRNA-disease pairs in the top 50 prediction list for these three diseases have been confirmed by recent experimental literatures, respectively. It is anticipated that WBSMDA would be a useful resource for potential miRNA-disease association identification.
Collapse
Affiliation(s)
- Xing Chen
- National Center for Mathematics and Interdisciplinary Sciences, Chinese Academy of Sciences, Beijing, 100190, China
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
| | - Chenggang Clarence Yan
- Institute of Information and Control, Hangzhou Dianzi University, Hangzhou, 310018, China
- Department of Automation, Tsinghua University, Beijing, 100084, China
| | - Xu Zhang
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264209, China
| | - Zhu-Hong You
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China
| | - Lixi Deng
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Ying Liu
- School of Economics and Management, Beihang University, Beijing, 100191, China
| | - Yongdong Zhang
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
| | - Qionghai Dai
- Department of Automation, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
147
|
Integration of Multiple Genomic and Phenotype Data to Infer Novel miRNA-Disease Associations. PLoS One 2016; 11:e0148521. [PMID: 26849207 PMCID: PMC4743935 DOI: 10.1371/journal.pone.0148521] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/19/2016] [Indexed: 01/07/2023] Open
Abstract
MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed. However, individual source of genomic data tends to be incomplete and noisy; therefore, the integration of various types of genomic data for inferring reliable miRNA-disease associations is urgently needed. In this study, we present a computational framework, CHNmiRD, for identifying miRNA-disease associations by integrating multiple genomic and phenotype data, including protein-protein interaction data, gene ontology data, experimentally verified miRNA-target relationships, disease phenotype information and known miRNA-disease connections. The performance of CHNmiRD was evaluated by experimentally verified miRNA-disease associations, which achieved an area under the ROC curve (AUC) of 0.834 for 5-fold cross-validation. In particular, CHNmiRD displayed excellent performance for diseases without any known related miRNAs. The results of case studies for three human diseases (glioblastoma, myocardial infarction and type 1 diabetes) showed that all of the top 10 ranked miRNAs having no known associations with these three diseases in existing miRNA-disease databases were directly or indirectly confirmed by our latest literature mining. All these results demonstrated the reliability and efficiency of CHNmiRD, and it is anticipated that CHNmiRD will serve as a powerful bioinformatics method for mining novel disease-related miRNAs and providing a new perspective into molecular mechanisms underlying human diseases at the post-transcriptional level. CHNmiRD is freely available at http://www.bio-bigdata.com/CHNmiRD.
Collapse
|
148
|
RBMMMDA: predicting multiple types of disease-microRNA associations. Sci Rep 2015; 5:13877. [PMID: 26347258 PMCID: PMC4561957 DOI: 10.1038/srep13877] [Citation(s) in RCA: 124] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Accepted: 08/07/2015] [Indexed: 12/20/2022] Open
Abstract
Accumulating evidences have shown that plenty of miRNAs play fundamental and important roles in various biological processes and the deregulations of miRNAs are associated with a broad range of human diseases. However, the mechanisms underlying the dysregulations of miRNAs still have not been fully understood yet. All the previous computational approaches can only predict binary associations between diseases and miRNAs. Predicting multiple types of disease-miRNA associations can further broaden our understanding about the molecular basis of diseases in the level of miRNAs. In this study, the model of Restricted Boltzmann machine for multiple types of miRNA-disease association prediction (RBMMMDA) was developed to predict four different types of miRNA-disease associations. Based on this model, we could obtain not only new miRNA-disease associations, but also corresponding association types. To our knowledge, RBMMMDA is the first model which could computationally infer association types of miRNA-disease pairs. Leave-one-out cross validation was implemented for RBMMMDA and the AUC of 0.8606 demonstrated the reliable and effective performance of RBMMMDA. In the case studies about lung cancer, breast cancer, and global prediction for all the diseases simultaneously, 50, 42, and 45 out of top 100 predicted miRNA-disease association types were confirmed by recent biological experimental literatures, respectively.
Collapse
|
149
|
Yuan D, Cui X, Wang Y, Zhao Y, Li H, Hu S, Chu X, Li Y, Li Q, Liu Q, Zhu W. Enrichment Analysis Identifies Functional MicroRNA-Disease Associations in Humans. PLoS One 2015; 10:e0136285. [PMID: 26296081 PMCID: PMC4546424 DOI: 10.1371/journal.pone.0136285] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 08/02/2015] [Indexed: 11/18/2022] Open
Abstract
Substantial evidence has shown that microRNAs (miRNAs) may be causally linked to the occurrence and progression of human diseases. Herein, we conducted an enrichment analysis to identify potential functional miRNA-disease associations (MDAs) in humans by integrating currently known biological data: miRNA-target interactions (MTIs), protein-protein interactions, and gene-disease associations. Two contributing factors to functional miRNA-disease associations were quantitatively considered: the direct effects of miRNA that target disease-related genes, and indirect effects triggered by protein-protein interactions. Ninety-nine miRNAs were scanned for possible functional association with 2223 MeSH-defined human diseases. Each miRNA was experimentally validated to target ≥ 10 mRNA genes. Putative MDAs were identified when at least one MTI was confidently validated for a disease. Overall, 19648 putative MDAs were found, of which 10.0% was experimentally validated. Further results suggest that filtering for miRNAs that target a greater number of disease-related genes (n ≥ 8) can significantly enrich for true MDAs from the set of putative associations (enrichment rate = 60.7%, adjusted hypergeometric p = 2.41×10-91). Considering the indirect effects of miRNAs further elevated the enrichment rate to 72.6%. By using this method, a novel MDA between miR-24 and ovarian cancer was found. Compared with scramble miRNA overexpression of miR-24 was validated to remarkably induce ovarian cancer cells apoptosis. Our study provides novel insight into factors contributing to functional MDAs by integrating large quantities of previously generated biological data, and establishes a feasible method to identify plausible associations with high confidence.
Collapse
Affiliation(s)
- Dandan Yuan
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Xiaomeng Cui
- School of Measurement and Control Technology & Communications Engineering, Harbin University of Science and Technology, Harbin, China
| | - Yang Wang
- Department of Pharmacy, the First Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Yilei Zhao
- Department of Pharmacy, the First Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Huiying Li
- Department of Obstetrics and Gynecology, Hongqi Hospital of Mudanjiang Medical University, Mudanjiang, China
| | - Suangjiu Hu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Xiaodan Chu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Yan Li
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Qiang Li
- Department of General Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Qian Liu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
- * E-mail: (QL); (WZ)
| | - Wenliang Zhu
- Institute of Clinical Pharmacology, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
- * E-mail: (QL); (WZ)
| |
Collapse
|
150
|
Zeng X, Zhang X, Zou Q. Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks. Brief Bioinform 2015; 17:193-203. [PMID: 26059461 DOI: 10.1093/bib/bbv033] [Citation(s) in RCA: 261] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Indexed: 12/21/2022] Open
Abstract
MicroRNAs (miRNA) play critical roles in regulating gene expressions at the posttranscriptional levels. The prediction of disease-related miRNA is vital to the further investigation of miRNA's involvement in the pathogenesis of disease. In previous years, biological experimentation is the main method used to identify whether miRNA was associated with a given disease. With increasing biological information and the appearance of new miRNAs every year, experimental identification of disease-related miRNAs poses considerable difficulties (e.g. time-consumption and high cost). Because of the limitations of experimental methods in determining the relationship between miRNAs and diseases, computational methods have been proposed. A key to predict potential disease-related miRNA based on networks is the calculation of similarity among diseases and miRNA over the networks. Different strategies lead to different results. In this review, we summarize the existing computational approaches and present the confronted difficulties that help understand the research status. We also discuss the principles, efficiency and differences among these methods. The comprehensive comparison and discussion elucidated in this work provide constructive insights into the matter.
Collapse
|