1
|
Chu Y, Wang X, Dai Q, Wang Y, Wang Q, Peng S, Wei X, Qiu J, Salahub DR, Xiong Y, Wei DQ. MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph. Brief Bioinform 2021; 22:6261915. [PMID: 34009265 DOI: 10.1093/bib/bbab165] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 04/02/2021] [Accepted: 04/08/2021] [Indexed: 11/13/2022] Open
Abstract
Accurate identification of the miRNA-disease associations (MDAs) helps to understand the etiology and mechanisms of various diseases. However, the experimental methods are costly and time-consuming. Thus, it is urgent to develop computational methods towards the prediction of MDAs. Based on the graph theory, the MDA prediction is regarded as a node classification task in the present study. To solve this task, we propose a novel method MDA-GCNFTG, which predicts MDAs based on Graph Convolutional Networks (GCNs) via graph sampling through the Feature and Topology Graph to improve the training efficiency and accuracy. This method models both the potential connections of feature space and the structural relationships of MDA data. The nodes of the graphs are represented by the disease semantic similarity, miRNA functional similarity and Gaussian interaction profile kernel similarity. Moreover, we considered six tasks simultaneously on the MDA prediction problem at the first time, which ensure that under both balanced and unbalanced sample distribution, MDA-GCNFTG can predict not only new MDAs but also new diseases without known related miRNAs and new miRNAs without known related diseases. The results of 5-fold cross-validation show that the MDA-GCNFTG method has achieved satisfactory performance on all six tasks and is significantly superior to the classic machine learning methods and the state-of-the-art MDA prediction methods. Moreover, the effectiveness of GCNs via the graph sampling strategy and the feature and topology graph in MDA-GCNFTG has also been demonstrated. More importantly, case studies for two diseases and three miRNAs are conducted and achieved satisfactory performance.
Collapse
Affiliation(s)
- Yanyi Chu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Xuhong Wang
- School of Electronic, Information and Electrical Engineering (SEIEE), Shanghai Jiao Tong University, China
| | - Qiuying Dai
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Yanjing Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Qiankun Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Shaoliang Peng
- College of Computer Science and Electronic Engineering, Hunan University, China
| | | | | | - Dennis Russell Salahub
- Department of Chemistry, University of Calgary, Fellow Royal Society of Canada and Fellow of the American Association for the Advancement of Science, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint International Research Laboratory of Metabolic & Developmental Sciences and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint International Research Laboratory of Metabolic & Developmental Sciences and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
| |
Collapse
|
2
|
Ding Y, Lei X, Liao B, Wu FX. Machine learning approaches for predicting biomolecule-disease associations. Brief Funct Genomics 2021; 20:273-287. [PMID: 33554238 DOI: 10.1093/bfgp/elab002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Biomolecules, such as microRNAs, circRNAs, lncRNAs and genes, are functionally interdependent in human cells, and all play critical roles in diverse fundamental and vital biological processes. The dysregulations of such biomolecules can cause diseases. Identifying the associations between biomolecules and diseases can uncover the mechanisms of complex diseases, which is conducive to their diagnosis, treatment, prognosis and prevention. Due to the time consumption and cost of biologically experimental methods, many computational association prediction methods have been proposed in the past few years. In this study, we provide a comprehensive review of machine learning-based approaches for predicting disease-biomolecule associations with multi-view data sources. Firstly, we introduce some databases and general strategies for integrating multi-view data sources in the prediction models. Then we discuss several feature representation methods for machine learning-based prediction models. Thirdly, we comprehensively review machine learning-based prediction approaches in three categories: basic machine learning methods, matrix completion-based methods and deep learning-based methods, while discussing their advantages and disadvantages. Finally, we provide some perspectives for further improving biomolecule-disease prediction methods.
Collapse
Affiliation(s)
- Yulian Ding
- Division of Biomedical Engineering at the University of Saskatchewan
| | - Xiujuan Lei
- School of Computer Science at Shaanxi Normal University
| | - Bo Liao
- School of Mathematics and Statistics at Hainan Normal University, Haikou, China
| | - Fang-Xiang Wu
- College of Engineering and the Department of Computer Science at University of Saskatchewan
| |
Collapse
|
3
|
Wallace DR, Taalab YM, Heinze S, Tariba Lovaković B, Pizent A, Renieri E, Tsatsakis A, Farooqi AA, Javorac D, Andjelkovic M, Bulat Z, Antonijević B, Buha Djordjevic A. Toxic-Metal-Induced Alteration in miRNA Expression Profile as a Proposed Mechanism for Disease Development. Cells 2020; 9:cells9040901. [PMID: 32272672 PMCID: PMC7226740 DOI: 10.3390/cells9040901] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 04/01/2020] [Accepted: 04/01/2020] [Indexed: 02/07/2023] Open
Abstract
Toxic metals are extensively found in the environment, households, and workplaces and contaminate food and drinking water. The crosstalk between environmental exposure to toxic metals and human diseases has been frequently described. The toxic mechanism of action was classically viewed as the ability to dysregulate the redox status, production of inflammatory mediators and alteration of mitochondrial function. Recently, growing evidence showed that heavy metals might exert their toxicity through microRNAs (miRNA)—short, single-stranded, noncoding molecules that function as positive/negative regulators of gene expression. Aberrant alteration of the endogenous miRNA has been directly implicated in various pathophysiological conditions and signaling pathways, consequently leading to different types of cancer and human diseases. Additionally, the gene-regulatory capacity of miRNAs is particularly valuable in the brain—a complex organ with neurons demonstrating a significant ability to adapt following environmental stimuli. Accordingly, dysregulated miRNAs identified in patients suffering from neurological diseases might serve as biomarkers for the earlier diagnosis and monitoring of disease progression. This review will greatly emphasize the effect of the toxic metals on human miRNA activities and how this contributes to progression of diseases such as cancer and neurodegenerative disorders (NDDs).
Collapse
Affiliation(s)
- David R. Wallace
- School of Biomedical Science, Oklahoma State University Center for Health Sciences, Tulsa, OK 74107, USA;
| | - Yasmeen M. Taalab
- Forensic Medicine and Clinical Toxicology Department, Faculty of Medicine, Mansoura University, Dakahlia Governate 35516, Egypt or
- Institute of Forensic and Traffic Medicine, University of Heidelberg, Voßstraße 2, 69115 Heidelberg, Germany;
| | - Sarah Heinze
- Institute of Forensic and Traffic Medicine, University of Heidelberg, Voßstraße 2, 69115 Heidelberg, Germany;
| | - Blanka Tariba Lovaković
- Analytical Toxicology and Mineral Metabolism Unit, Institute for Medical Research and Occupational Health, Ksaverska cesta 2, 10 000 Zagreb, Croatia; (B.T.L.); (A.P.)
| | - Alica Pizent
- Analytical Toxicology and Mineral Metabolism Unit, Institute for Medical Research and Occupational Health, Ksaverska cesta 2, 10 000 Zagreb, Croatia; (B.T.L.); (A.P.)
| | - Elisavet Renieri
- Centre of Toxicology Science and Research, University of Crete, School of Medicine, 71601 Heraklion, Greece; (E.R.); (A.T.)
| | - Aristidis Tsatsakis
- Centre of Toxicology Science and Research, University of Crete, School of Medicine, 71601 Heraklion, Greece; (E.R.); (A.T.)
| | | | - Dragana Javorac
- Department of Toxicology “Akademik Danilo Soldatović”, University of Belgrade-Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia; (D.J.); (M.A.); (Z.B.); (B.A.)
| | - Milena Andjelkovic
- Department of Toxicology “Akademik Danilo Soldatović”, University of Belgrade-Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia; (D.J.); (M.A.); (Z.B.); (B.A.)
| | - Zorica Bulat
- Department of Toxicology “Akademik Danilo Soldatović”, University of Belgrade-Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia; (D.J.); (M.A.); (Z.B.); (B.A.)
| | - Biljana Antonijević
- Department of Toxicology “Akademik Danilo Soldatović”, University of Belgrade-Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia; (D.J.); (M.A.); (Z.B.); (B.A.)
| | - Aleksandra Buha Djordjevic
- Department of Toxicology “Akademik Danilo Soldatović”, University of Belgrade-Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia; (D.J.); (M.A.); (Z.B.); (B.A.)
- Correspondence:
| |
Collapse
|
4
|
Wang X, Yan R. DDAPRED: a computational method for predicting drug repositioning using regularized logistic matrix factorization. J Mol Model 2020; 26:60. [PMID: 32062701 DOI: 10.1007/s00894-020-4315-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2019] [Accepted: 01/28/2020] [Indexed: 01/14/2023]
Abstract
Due to rising development costs and stagnant product outputs of traditional drug discovery methods, drug repositioning, which discovers new indications for existing drugs, has attracted increasing interest. Computational drug repositioning can integrate prioritization information and accelerate time lines even further. However, most existing methods for predicting drug repositioning have low precisions. The present article proposed a new method named DDAPRED (https://github.com/nongdaxiaofeng/DDAPRED) for drug repositioning prediction. The method integrated multiple sources of drug similarity and disease similarity information, and it used the regularized logistic matrix decomposition method to significantly improve the prediction performance. In 5-fold cross-validation, the areas under the receiver operating characteristic curve (AUROC) and the precision-recall curve (AUPRC) of DDAPRED reached 0.932 and 0.438, respectively, exceeding other methods. The present study also analyzed the parameters influencing the model performance and the effect of different drug similarity information in-depth, and it verified the treatment relationship of the top 50 predictions with unknown relationships in the training set, further demonstrating the practicability of our method.
Collapse
Affiliation(s)
- Xiaofeng Wang
- College of Mathematics and Computer Science, Shanxi Normal University, Linfen, 041004, China
| | - Renxiang Yan
- College of Biological Science and Engineering, Fuzhou University, Fuzhou, 350106, Fujian, China. .,Fujian Key Laboratory of Marine Enzyme Engineering, Fuzhou, 350116, Fujian, China.
| |
Collapse
|
5
|
Dhama K, Latheef SK, Dadar M, Samad HA, Munjal A, Khandia R, Karthik K, Tiwari R, Yatoo MI, Bhatt P, Chakraborty S, Singh KP, Iqbal HMN, Chaicumpa W, Joshi SK. Biomarkers in Stress Related Diseases/Disorders: Diagnostic, Prognostic, and Therapeutic Values. Front Mol Biosci 2019; 6:91. [PMID: 31750312 PMCID: PMC6843074 DOI: 10.3389/fmolb.2019.00091] [Citation(s) in RCA: 144] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 09/11/2019] [Indexed: 02/05/2023] Open
Abstract
Various internal and external factors negatively affect the homeostatic equilibrium of organisms at the molecular to the whole-body level, inducing the so-called state of stress. Stress affects an organism's welfare status and induces energy-consuming mechanisms to combat the subsequent ill effects; thus, the individual may be immunocompromised, making them vulnerable to pathogens. The information presented here has been extensively reviewed, compiled, and analyzed from authenticated published resources available on Medline, PubMed, PubMed Central, Science Direct, and other scientific databases. Stress levels can be monitored by the quantitative and qualitative measurement of biomarkers. Potential markers of stress include thermal stress markers, such as heat shock proteins (HSPs), innate immune markers, such as Acute Phase Proteins (APPs), oxidative stress markers, and chemical secretions in the saliva and urine. In addition, stress biomarkers also play critical roles in the prognosis of stress-related diseases and disorders, and therapy guidance. Moreover, different components have been identified as potent mediators of cardiovascular, central nervous system, hepatic, and nephrological disorders, which can also be employed to evaluate these conditions precisely, but with stringent validation and specificity. Considerable scientific advances have been made in the detection, quantitation, and application of these biomarkers. The present review describes the current progress of identifying biomarkers, their prognostic, and therapeutic values.
Collapse
Affiliation(s)
- Kuldeep Dhama
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Shyma K. Latheef
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Maryam Dadar
- Razi Vaccine and Serum Research Institute, Agricultural Research, Education and Extension Organization, Karaj, Iran
| | - Hari Abdul Samad
- Division of Physiology and Climatology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Ashok Munjal
- Department of Genetics, Barkatullah University, Bhopal, India
| | - Rekha Khandia
- Department of Genetics, Barkatullah University, Bhopal, India
| | - Kumaragurubaran Karthik
- Central University Laboratory, Tamil Nadu Veterinary and Animal Sciences University, Chennai, India
| | - Ruchi Tiwari
- Department of Veterinary Microbiology and Immunology, College of Veterinary Sciences, UP Pandit Deen Dayal Upadhayay Pashu Chikitsa Vigyan Vishwavidyalay Evum Go-Anusandhan Sansthan, Mathura, India
| | - Mohd. Iqbal Yatoo
- Division of Veterinary Clinical Complex, Sher-E-Kashmir University of Agricultural Sciences and Technology of Kashmir, Srinagar, India
| | - Prakash Bhatt
- Teaching Veterinary Clinical Complex, College of Veterinary and Animal Sciences, Govind Ballabh Pant University of Agriculture and Technology, Pantnagar, India
| | - Sandip Chakraborty
- Department of Veterinary Microbiology, College of Veterinary Sciences and Animal Husbandry, Agartala, India
| | - Karam Pal Singh
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Hafiz M. N. Iqbal
- Tecnologico de Monterrey, School of Engineering and Sciences, Monterrey, Mexico
| | - Wanpen Chaicumpa
- Department of Parasitology, Faculty of Medicine, Center of Research Excellence on Therapeutic Proteins and Antibody Engineering, Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Sunil Kumar Joshi
- Division of Hematology, Oncology and Bone Marrow Transplantation, Department of Microbiology & Immunology, Department of Pediatrics, University of Miami School of Medicine, Miami, FL, United States
| |
Collapse
|
6
|
Dhama K, Latheef SK, Dadar M, Samad HA, Munjal A, Khandia R, Karthik K, Tiwari R, Yatoo MI, Bhatt P, Chakraborty S, Singh KP, Iqbal HMN, Chaicumpa W, Joshi SK. Biomarkers in Stress Related Diseases/Disorders: Diagnostic, Prognostic, and Therapeutic Values. Front Mol Biosci 2019. [PMID: 31750312 DOI: 10.3389/fmolb.2019.0009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Various internal and external factors negatively affect the homeostatic equilibrium of organisms at the molecular to the whole-body level, inducing the so-called state of stress. Stress affects an organism's welfare status and induces energy-consuming mechanisms to combat the subsequent ill effects; thus, the individual may be immunocompromised, making them vulnerable to pathogens. The information presented here has been extensively reviewed, compiled, and analyzed from authenticated published resources available on Medline, PubMed, PubMed Central, Science Direct, and other scientific databases. Stress levels can be monitored by the quantitative and qualitative measurement of biomarkers. Potential markers of stress include thermal stress markers, such as heat shock proteins (HSPs), innate immune markers, such as Acute Phase Proteins (APPs), oxidative stress markers, and chemical secretions in the saliva and urine. In addition, stress biomarkers also play critical roles in the prognosis of stress-related diseases and disorders, and therapy guidance. Moreover, different components have been identified as potent mediators of cardiovascular, central nervous system, hepatic, and nephrological disorders, which can also be employed to evaluate these conditions precisely, but with stringent validation and specificity. Considerable scientific advances have been made in the detection, quantitation, and application of these biomarkers. The present review describes the current progress of identifying biomarkers, their prognostic, and therapeutic values.
Collapse
Affiliation(s)
- Kuldeep Dhama
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Shyma K Latheef
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Maryam Dadar
- Razi Vaccine and Serum Research Institute, Agricultural Research, Education and Extension Organization, Karaj, Iran
| | - Hari Abdul Samad
- Division of Physiology and Climatology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Ashok Munjal
- Department of Genetics, Barkatullah University, Bhopal, India
| | - Rekha Khandia
- Department of Genetics, Barkatullah University, Bhopal, India
| | - Kumaragurubaran Karthik
- Central University Laboratory, Tamil Nadu Veterinary and Animal Sciences University, Chennai, India
| | - Ruchi Tiwari
- Department of Veterinary Microbiology and Immunology, College of Veterinary Sciences, UP Pandit Deen Dayal Upadhayay Pashu Chikitsa Vigyan Vishwavidyalay Evum Go-Anusandhan Sansthan, Mathura, India
| | - Mohd Iqbal Yatoo
- Division of Veterinary Clinical Complex, Sher-E-Kashmir University of Agricultural Sciences and Technology of Kashmir, Srinagar, India
| | - Prakash Bhatt
- Teaching Veterinary Clinical Complex, College of Veterinary and Animal Sciences, Govind Ballabh Pant University of Agriculture and Technology, Pantnagar, India
| | - Sandip Chakraborty
- Department of Veterinary Microbiology, College of Veterinary Sciences and Animal Husbandry, Agartala, India
| | - Karam Pal Singh
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Bareilly, India
| | - Hafiz M N Iqbal
- Tecnologico de Monterrey, School of Engineering and Sciences, Monterrey, Mexico
| | - Wanpen Chaicumpa
- Department of Parasitology, Faculty of Medicine, Center of Research Excellence on Therapeutic Proteins and Antibody Engineering, Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Sunil Kumar Joshi
- Division of Hematology, Oncology and Bone Marrow Transplantation, Department of Microbiology & Immunology, Department of Pediatrics, University of Miami School of Medicine, Miami, FL, United States
| |
Collapse
|
7
|
Huang Z, Liu L, Gao Y, Shi J, Cui Q, Li J, Zhou Y. Benchmark of computational methods for predicting microRNA-disease associations. Genome Biol 2019; 20:202. [PMID: 31594544 PMCID: PMC6781296 DOI: 10.1186/s13059-019-1811-3] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 09/03/2019] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND A series of miRNA-disease association prediction methods have been proposed to prioritize potential disease-associated miRNAs. Independent benchmarking of these methods is warranted to assess their effectiveness and robustness. RESULTS Based on more than 8000 novel miRNA-disease associations from the latest HMDD v3.1 database, we perform systematic comparison among 36 readily available prediction methods. Their overall performances are evaluated with rigorous precision-recall curve analysis, where 13 methods show acceptable accuracy (AUPRC > 0.200) while the top two methods achieve a promising AUPRC over 0.300, and most of these methods are also highly ranked when considering only the causal miRNA-disease associations as the positive samples. The potential of performance improvement is demonstrated by combining different predictors or adopting a more updated miRNA similarity matrix, which would result in up to 16% and 46% of AUPRC augmentations compared to the best single predictor and the predictors using the previous similarity matrix, respectively. Our analysis suggests a common issue of the available methods, which is that the prediction results are severely biased toward well-annotated diseases with many associated miRNAs known and cannot further stratify the positive samples by discriminating the causal miRNA-disease associations from the general miRNA-disease associations. CONCLUSION Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate miRNA-disease association predictors for their purpose, but also suggest the future directions for the development of more robust miRNA-disease association predictors.
Collapse
Affiliation(s)
- Zhou Huang
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
| | - Leibo Liu
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, 300401, China
| | - Yuanxu Gao
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
| | - Jiangcheng Shi
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
| | - Qinghua Cui
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China
- Center of Bioinformatics, Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Jianwei Li
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, 300401, China.
| | - Yuan Zhou
- Department of Biomedical Informatics, Department of Physiology and Pathophysiology, Center for Noncoding RNA Medicine, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing, 100191, China.
| |
Collapse
|
8
|
Xie G, Fan Z, Sun Y, Wu C, Ma L. WBNPMD: weighted bipartite network projection for microRNA-disease association prediction. J Transl Med 2019; 17:322. [PMID: 31547811 PMCID: PMC6757419 DOI: 10.1186/s12967-019-2063-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 09/06/2019] [Indexed: 01/21/2023] Open
Abstract
Background Recently, numerous biological experiments have indicated that microRNAs (miRNAs) play critical roles in exploring the pathogenesis of various human diseases. Since traditional experimental methods for miRNA-disease associations detection are costly and time-consuming, it becomes urgent to design efficient and robust computational techniques for identifying undiscovered interactions. Methods In this paper, we proposed a computation framework named weighted bipartite network projection for miRNA-disease association prediction (WBNPMD). In this method, transfer weights were constructed by combining the known miRNA and disease similarities, and the initial information was properly configured. Then the two-step bipartite network algorithm was implemented to infer potential miRNA-disease associations. Results The proposed WBNPMD was applied to the known miRNA-disease association data, and leave-one-out cross-validation (LOOCV) and fivefold cross-validation were implemented to evaluate the performance of WBNPMD. As a result, our method achieved the AUCs of 0.9321 and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$0.9173 \pm 0.0005$$\end{document}0.9173±0.0005 in LOOCV and fivefold cross-validation, and outperformed other four state-of-the-art methods. We also carried out two kinds of case studies on prostate neoplasm, colorectal neoplasm, and lung neoplasm, and most of the top 50 predicted miRNAs were confirmed to have an association with the corresponding diseases based on dbDeMC, miR2Disease, and HMDD V3.0 databases. Conclusions The experimental results demonstrate that WBNPMD can accurately infer potential miRNA-disease associations. We anticipated that the proposed WBNPMD could serve as a powerful tool for potential miRNA-disease associations excavation. Electronic supplementary material The online version of this article (10.1186/s12967-019-2063-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Guobo Xie
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Zhiliang Fan
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Yuping Sun
- School of Computer Science, Guangdong University of Technology, Guangzhou, China.
| | - Cuiming Wu
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Lei Ma
- Institute of Automation, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
9
|
Shen C, Ding Y, Tang J, Guo F. Multivariate Information Fusion With Fast Kernel Learning to Kernel Ridge Regression in Predicting LncRNA-Protein Interactions. Front Genet 2019; 9:716. [PMID: 30697228 PMCID: PMC6340980 DOI: 10.3389/fgene.2018.00716] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Accepted: 12/21/2018] [Indexed: 12/31/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) constitute a large class of transcribed RNA molecules. They have a characteristic length of more than 200 nucleotides which do not encode proteins. They play an important role in regulating gene expression by interacting with the homologous RNA-binding proteins. Due to the laborious and time-consuming nature of wet experimental methods, more researchers should pay great attention to computational approaches for the prediction of lncRNA-protein interaction (LPI). An in-depth literature review in the state-of-the-art in silico investigations, leads to the conclusion that there is still room for improving the accuracy and velocity. This paper propose a novel method for identifying LPI by employing Kernel Ridge Regression, based on Fast Kernel Learning (LPI-FKLKRR). This approach, uses four distinct similarity measures for lncRNA and protein space, respectively. It is remarkable, that we extract Gene Ontology (GO) with proteins, in order to improve the quality of information in protein space. The process of heterogeneous kernels integration, applies Fast Kernel Learning (FastKL) to deal with weight optimization. The extrapolation model is obtained by gaining the ultimate prediction associations, after using Kernel Ridge Regression (KRR). Experimental outcomes show that the ability of modeling with LPI-FKLKRR has extraordinary performance compared with LPI prediction schemes. On benchmark dataset, it has been observed that the best Area Under Precision Recall Curve (AUPR) of 0.6950 is obtained by our proposed model LPI-FKLKRR, which outperforms the integrated LPLNP (AUPR: 0.4584), RWR (AUPR: 0.2827), CF (AUPR: 0.2357), LPIHN (AUPR: 0.2299), and LPBNI (AUPR: 0.3302). Also, combined with the experimental results of a case study on a novel dataset, it is anticipated that LPI-FKLKRR will be a useful tool for LPI prediction.
Collapse
Affiliation(s)
- Cong Shen
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Yijie Ding
- School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, China
| | - Jijun Tang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China.,Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, United States
| | - Fei Guo
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| |
Collapse
|
10
|
Xiong Y, Wang Q, Yang J, Zhu X, Wei DQ. PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method. Front Microbiol 2018; 9:2571. [PMID: 30416498 PMCID: PMC6212463 DOI: 10.3389/fmicb.2018.02571] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2018] [Accepted: 10/09/2018] [Indexed: 11/13/2022] Open
Abstract
Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental approaches to identify T4SEs are time- and resource-consuming. In the present study, we aim to develop an in silico stacked ensemble method to predict whether a protein is an effector of type IV secretion system or not based on its sequence information. The protein sequences were encoded by the feature of position specific scoring matrix (PSSM)-composition by summing rows that correspond to the same amino acid residues in PSSM profiles. Based on the PSSM-composition features, we develop a stacked ensemble model PredT4SE-Stack to predict T4SEs, which utilized an ensemble of base-classifiers implemented by various machine learning algorithms, such as support vector machine, gradient boosting machine, and extremely randomized trees, to generate outputs for the meta-classifier in the classification system. Our results demonstrated that the framework of PredT4SE-Stack was a feasible and effective way to accurately identify T4SEs based on protein sequence information. The datasets and source code of PredT4SE-Stack are freely available at http://xbioinfo.sjtu.edu.cn/PredT4SE_Stack/index.php.
Collapse
Affiliation(s)
- Yi Xiong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Qiankun Wang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Junchen Yang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Xiaolei Zhu
- School of Sciences, Anhui Agricultural University, Hefei, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|