1
|
Wang S, Liu T, Ren C, Zhao Y, Qiao S, Zhang Y, Pang S. Heterogeneous graph inference with range constrainted L 2,1-collaborative matrix factorization for small molecule-miRNA association prediction. Comput Biol Chem 2024; 110:108078. [PMID: 38677013 DOI: 10.1016/j.compbiolchem.2024.108078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 04/03/2024] [Accepted: 04/16/2024] [Indexed: 04/29/2024]
Abstract
MicroRNAs (miRNAs) play a vital role in regulating gene expression and various biological processes. As a result, they have been identified as effective targets for small molecule (SM) drugs in disease treatment. Heterogeneous graph inference stands as a classical approach for predicting SM-miRNA associations, showcasing commendable convergence accuracy and speed. However, most existing methods do not adequately address the inherent sparsity in SM-miRNA association networks, and imprecise SM/miRNA similarity metrics reduce the accuracy of predicting SM-miRNA associations. In this research, we proposed a heterogeneous graph inference with range constrained L2,1-collaborative matrix factorization (HGIRCLMF) method to predict potential SM-miRNA associations. First, we computed the multi-source similarities of SM/miRNA and integrated these similarity information into a comprehensive SM/miRNA similarity. This step improved the accuracy of SM and miRNA similarity, ensuring reliability for the subsequent inference of the heterogeneity map. Second, we used a range constrained L2,1-collaborative matrix factorization (RCLMF) model to pre-populate the SM-miRNA association matrix with missing values. In this step, we developed a novel matrix decomposition method that enhances the robustness and formative nature of SM-miRNA edges between SM networks and miRNA networks. Next, we built a well-established SM-miRNA heterogeneous network utilizing the processed biological information. Finally, HGIRCLMF used this network data to infer unknown association pair scores. We implemented four cross-validation experiments on two distinct datasets, and HGIRCLMF acquired the highest areas under the curve, surpassing six state-of-the-art computational approaches. Furthermore, we performed three case studies to validate the predictive power of our method in practical application.
Collapse
Affiliation(s)
- Shudong Wang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Tiyao Liu
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Chuanru Ren
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Yawu Zhao
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Sibo Qiao
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Yuanyuan Zhang
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266525, China.
| | - Shanchen Pang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| |
Collapse
|
2
|
Wang S, Ren C, Zhang Y, Li Y, Pang S, Song T. Identifying potential small molecule-miRNA associations via Robust PCA based on γ-norm regularization. Brief Bioinform 2023; 24:bbad312. [PMID: 37670501 DOI: 10.1093/bib/bbad312] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 07/18/2023] [Accepted: 08/10/2023] [Indexed: 09/07/2023] Open
Abstract
Dysregulation of microRNAs (miRNAs) is closely associated with refractory human diseases, and the identification of potential associations between small molecule (SM) drugs and miRNAs can provide valuable insights for clinical treatment. Existing computational techniques for inferring potential associations suffer from limitations in terms of accuracy and efficiency. To address these challenges, we devise a novel predictive model called RPCA$\Gamma $NR, in which we propose a new Robust principal component analysis (PCA) framework based on $\gamma $-norm and $l_{2,1}$-norm regularization and design an Augmented Lagrange Multiplier method to optimize it, thereby deriving the association scores. The Gaussian Interaction Profile Kernel Similarity is calculated to capture the similarity information of SMs and miRNAs in known associations. Through extensive evaluation, including Cross Validation Experiments, Independent Validation Experiment, Efficiency Analysis, Ablation Experiment, Matrix Sparsity Analysis, and Case Studies, RPCA$\Gamma $NR outperforms state-of-the-art models concerning accuracy, efficiency and robustness. In conclusion, RPCA$\Gamma $NR can significantly streamline the process of determining SM-miRNA associations, thus contributing to advancements in drug development and disease treatment.
Collapse
Affiliation(s)
- Shudong Wang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580 Shandong, China
| | - Chuanru Ren
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580 Shandong, China
| | - Yulin Zhang
- College of Mathematics and Systems Science, Shandong University of Science and Technology, Xin An Street, 266590 Shandong, China
| | - Yunyin Li
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580 Shandong, China
| | - Shanchen Pang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580 Shandong, China
| | - Tao Song
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580 Shandong, China
| |
Collapse
|
3
|
Qu J, Song Z, Cheng X, Jiang Z, Zhou J. Neighborhood-based inference and restricted Boltzmann machine for small molecule-miRNA associations prediction. PeerJ 2023; 11:e15889. [PMID: 37641598 PMCID: PMC10460564 DOI: 10.7717/peerj.15889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 07/21/2023] [Indexed: 08/31/2023] Open
Abstract
Background A growing number of experiments have shown that microRNAs (miRNAs) can be used as target of small molecules (SMs) to regulate gene expression for treating diseases. Therefore, identifying SM-related miRNAs is helpful for the treatment of diseases in the domain of medical investigation. Methods This article presents a new computational model, called NIRBMSMMA (neighborhood-based inference (NI) and restricted Boltzmann machine (RBM)), which we developed to identify potential small molecule-miRNA associations (NIRBMSMMA). First, grounded on known SM-miRNAs associations, SM similarity and miRNA similarity, NI was used to predict score of an unknown SM-miRNA pair by reckoning the sum of known associations between neighbors of the SM (miRNA) and the miRNA (SM). Second, utilizing a two-layered generative stochastic artificial neural network, RBM was used to predict SM-miRNA association by learning potential probability distribution from known SM-miRNA associations. At last, an ensemble learning model was conducted to combine NI and RBM for identifying potential SM-miRNA associations. Results Furthermore, we conducted global leave one out cross validation (LOOCV), miRNA-fixed LOOCV, SM-fixed LOOCV and five-fold cross validation to assess performance of NIRBMSMMA based on three datasets. Results showed that NIRBMSMMA obtained areas under the curve (AUC) of 0.9912, 0.9875, 0.8376 and 0.9898 ± 0.0009 under global LOOCV, miRNA-fixed LOOCV, SM-fixed LOOCV and five-fold cross validation based on dataset 1, respectively. For dataset 2, the AUCs are 0.8645, 0.8720, 0.7066 and 0.8547 ± 0.0046 in turn. For dataset 3, the AUCs are 0.9884, 0.9802, 0.8239 and 0.9870 ± 0.0015 in turn. Also, we conducted case studies to further assess the predictive performance of NIRBMSMMA. These results illustrated the proposed model is a useful tool in predicting potential SM-miRNA associations.
Collapse
Affiliation(s)
- Jia Qu
- School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, Jiangsu, China
| | - Zihao Song
- School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, Jiangsu, China
| | - Xiaolong Cheng
- School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, Jiangsu, China
| | - Zhibin Jiang
- Department of Computer Science and Engineering, Shaoxing University, Shaoxing, Zhejiang, China
| | - Jie Zhou
- Department of Computer Science and Engineering, Shaoxing University, Shaoxing, Zhejiang, China
| |
Collapse
|
4
|
Wang S, Liu T, Ren C, Wu W, Zhao Z, Pang S, Zhang Y. Predicting potential small molecule-miRNA associations utilizing truncated schatten p-norm. Brief Bioinform 2023; 24:bbad234. [PMID: 37366591 DOI: 10.1093/bib/bbad234] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 06/28/2023] Open
Abstract
MicroRNAs (miRNAs) have significant implications in diverse human diseases and have proven to be effectively targeted by small molecules (SMs) for therapeutic interventions. However, current SM-miRNA association prediction models do not adequately capture SM/miRNA similarity. Matrix completion is an effective method for association prediction, but existing models use nuclear norm instead of rank function, which has some drawbacks. Therefore, we proposed a new approach for predicting SM-miRNA associations by utilizing the truncated schatten p-norm (TSPN). First, the SM/miRNA similarity was preprocessed by incorporating the Gaussian interaction profile kernel similarity method. This identified more SM/miRNA similarities and significantly improved the SM-miRNA prediction accuracy. Next, we constructed a heterogeneous SM-miRNA network by combining biological information from three matrices and represented the network with its adjacency matrix. Finally, we constructed the prediction model by minimizing the truncated schatten p-norm of this adjacency matrix and we developed an efficient iterative algorithmic framework to solve the model. In this framework, we also used a weighted singular value shrinkage algorithm to avoid the problem of excessive singular value shrinkage. The truncated schatten p-norm approximates the rank function more closely than the nuclear norm, so the predictions are more accurate. We performed four different cross-validation experiments on two separate datasets, and TSPN outperformed various most advanced methods. In addition, public literature confirms a large number of predictive associations of TSPN in four case studies. Therefore, TSPN is a reliable model for SM-miRNA association prediction.
Collapse
Affiliation(s)
- Shudong Wang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Tiyao Liu
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Chuanru Ren
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Wenhao Wu
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Zhiyuan Zhao
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Shanchen Pang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Yuanyuan Zhang
- College of Information and Control Engineering, Qingdao University of Technology, Qingdao 266580, China
| |
Collapse
|
5
|
Wang S, Ren C, Zhang Y, Pang S, Qiao S, Wu W, Lin B. AMCSMMA: Predicting Small Molecule-miRNA Potential Associations Based on Accurate Matrix Completion. Cells 2023; 12:cells12081123. [PMID: 37190032 DOI: 10.3390/cells12081123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 04/02/2023] [Accepted: 04/04/2023] [Indexed: 05/17/2023] Open
Abstract
Exploring potential associations between small molecule drugs (SMs) and microRNAs (miRNAs) is significant for drug development and disease treatment. Since biological experiments are expensive and time-consuming, we propose a computational model based on accurate matrix completion for predicting potential SM-miRNA associations (AMCSMMA). Initially, a heterogeneous SM-miRNA network is constructed, and its adjacency matrix is taken as the target matrix. An optimization framework is then proposed to recover the target matrix with the missing values by minimizing its truncated nuclear norm, an accurate, robust, and efficient approximation to the rank function. Finally, we design an effective two-step iterative algorithm to solve the optimization problem and obtain the prediction scores. After determining the optimal parameters, we conduct four kinds of cross-validation experiments based on two datasets, and the results demonstrate that AMCSMMA is superior to the state-of-the-art methods. In addition, we implement another validation experiment, in which more evaluation metrics in addition to the AUC are introduced and finally achieve great results. In two types of case studies, a large number of SM-miRNA pairs with high predictive scores are confirmed by the published experimental literature. In summary, AMCSMMA has superior performance in predicting potential SM-miRNA associations, which can provide guidance for biological experiments and accelerate the discovery of new SM-miRNA associations.
Collapse
Affiliation(s)
- Shudong Wang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Chuanru Ren
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Yulin Zhang
- College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266580, China
| | - Shanchen Pang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Sibo Qiao
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Wenhao Wu
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| | - Boyang Lin
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
| |
Collapse
|
6
|
Wang MN, Li Y, Lei LL, Ding DW, Xie XJ. Combining non-negative matrix factorization with graph Laplacian regularization for predicting drug-miRNA associations based on multi-source information fusion. Front Pharmacol 2023; 14:1132012. [PMID: 36817132 PMCID: PMC9931722 DOI: 10.3389/fphar.2023.1132012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Accepted: 01/16/2023] [Indexed: 02/05/2023] Open
Abstract
Increasing evidences suggest that miRNAs play a key role in the occurrence and progression of many complex human diseases. Therefore, targeting dysregulated miRNAs with small molecule drugs in the clinical has become a new treatment. Nevertheless, it is high cost and time-consuming for identifying miRNAs-targeted with drugs by biological experiments. Thus, more reliable computational method for identification associations of drugs with miRNAs urgently need to be developed. In this study, we proposed an efficient method, called GNMFDMA, to predict potential associations of drug with miRNA by combining graph Laplacian regularization with non-negative matrix factorization. We first calculated the overall similarity matrices of drugs and miRNAs according to the collected different biological information. Subsequently, the new drug-miRNA association adjacency matrix was reformulated based on the K nearest neighbor profiles so as to put right the false negative associations. Finally, graph Laplacian regularization collaborative non-negative matrix factorization was used to calculate the association scores of drugs with miRNAs. In the cross validation, GNMFDMA obtains AUC of 0.9193, which outperformed the existing methods. In addition, case studies on three common drugs (i.e., 5-Aza-CdR, 5-FU and Gemcitabine), 30, 31 and 34 of the top-50 associations inferred by GNMFDMA were verified. These results reveal that GNMFDMA is a reliable and efficient computational approach for identifying the potential drug-miRNA associations.
Collapse
Affiliation(s)
- Mei-Neng Wang
- School of Mathematics and Computer Science, Yichun University, Yichun, China
| | - Yu Li
- School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, China,*Correspondence: Yu Li,
| | - Li-Lan Lei
- School of Mathematics and Computer Science, Yichun University, Yichun, China
| | - De-Wu Ding
- School of Mathematics and Computer Science, Yichun University, Yichun, China
| | - Xue-Jun Xie
- School of Mathematics and Computer Science, Yichun University, Yichun, China
| |
Collapse
|
7
|
Ni J, Cheng X, Ni T, Liang J. Identifying SM-miRNA associations based on layer attention graph convolutional network and matrix decomposition. Front Mol Biosci 2022; 9:1009099. [PMID: 36504714 PMCID: PMC9732030 DOI: 10.3389/fmolb.2022.1009099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 11/03/2022] [Indexed: 11/27/2022] Open
Abstract
The accurate prediction of potential associations between microRNAs (miRNAs) and small molecule (SM) drugs can enhance our knowledge of how SM cures endogenous miRNA-related diseases. Given that traditional methods for predicting SM-miRNA associations are time-consuming and arduous, a number of computational models have been proposed to anticipate the potential SM-miRNA associations. However, several of these strategies failed to eliminate noise from the known SM-miRNA association information or failed to prioritize the most significant known SM-miRNA associations. Therefore, we proposed a model of Graph Convolutional Network with Layer Attention mechanism for SM-MiRNA Association prediction (GCNLASMMA). Firstly, we obtained the new SM-miRNA associations by matrix decomposition. The new SM-miRNA associations, as well as the integrated SM similarity and miRNA similarity were subsequently incorporated into a heterogeneous network. Finally, a graph convolutional network with an attention mechanism was used to compute the reconstructed SM-miRNA association matrix. Furthermore, four types of cross validations and two types of case studies were performed to assess the performance of GCNLASMMA. In cross validation, global Leave-One-Out Cross Validation (LOOCV), miRNA-fixed LOOCV, SM-fixed LOOCV and 5-fold cross-validation achieved excellent performance. Numerous hypothesized associations in case studies were confirmed by experimental literatures. All of these results confirmed that GCNLASMMA is a trustworthy association inference method.
Collapse
|
8
|
Peng L, Tu Y, Huang L, Li Y, Fu X, Chen X. DAESTB: inferring associations of small molecule-miRNA via a scalable tree boosting model based on deep autoencoder. Brief Bioinform 2022; 23:6827720. [PMID: 36377749 DOI: 10.1093/bib/bbac478] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 09/28/2022] [Accepted: 10/08/2022] [Indexed: 11/16/2022] Open
Abstract
MicroRNAs (miRNAs) are closely related to a variety of human diseases, not only regulating gene expression, but also having an important role in human life activities and being viable targets of small molecule drugs for disease treatment. Current computational techniques to predict the potential associations between small molecule and miRNA are not that accurate. Here, we proposed a new computational method based on a deep autoencoder and a scalable tree boosting model (DAESTB), to predict associations between small molecule and miRNA. First, we constructed a high-dimensional feature matrix by integrating small molecule-small molecule similarity, miRNA-miRNA similarity and known small molecule-miRNA associations. Second, we reduced feature dimensionality on the integrated matrix using a deep autoencoder to obtain the potential feature representation of each small molecule-miRNA pair. Finally, a scalable tree boosting model is used to predict small molecule and miRNA potential associations. The experiments on two datasets demonstrated the superiority of DAESTB over various state-of-the-art methods. DAESTB achieved the best AUC value. Furthermore, in three case studies, a large number of predicted associations by DAESTB are confirmed with the public accessed literature. We envision that DAESTB could serve as a useful biological model for predicting potential small molecule-miRNA associations.
Collapse
Affiliation(s)
- Li Peng
- College of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, 411201, Hunan, China.,Hunan Key Laboratory for Service computing and Novel Software Technology
| | - Yuan Tu
- College of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, 411201, Hunan, China
| | - Li Huang
- Academy of Arts and Design, Tsinghua University, Beijing, 10084, China.,The Future Laboratory, Tsinghua University, Beijing, 10084, China
| | - Yang Li
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan, 411105, China
| | - Xiangzheng Fu
- College of Information Science and Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Xiang Chen
- College of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, 411201, Hunan, China
| |
Collapse
|
9
|
Dong TN, Schrader J, Mücke S, Khosla M. A message passing framework with multiple data integration for miRNA-disease association prediction. Sci Rep 2022; 12:16259. [PMID: 36171337 PMCID: PMC9519928 DOI: 10.1038/s41598-022-20529-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 09/14/2022] [Indexed: 11/08/2022] Open
Abstract
Micro RNA or miRNA is a highly conserved class of non-coding RNA that plays an important role in many diseases. Identifying miRNA-disease associations can pave the way for better clinical diagnosis and finding potential drug targets. We propose a biologically-motivated data-driven approach for the miRNA-disease association prediction, which overcomes the data scarcity problem by exploiting information from multiple data sources. The key idea is to enrich the existing miRNA/disease-protein-coding gene (PCG) associations via a message passing framework, followed by the use of disease ontology information for further feature filtering. The enriched and filtered PCG associations are then used to construct the inter-connected miRNA-PCG-disease network to train a structural deep network embedding (SDNE) model. Finally, the pre-trained embeddings and the biologically relevant features from the miRNA family and disease semantic similarity are concatenated to form the pair input representations to a Random Forest classifier whose task is to predict the miRNA-disease association probabilities. We present large-scale comparative experiments, ablation, and case studies to showcase our approach's superiority. Besides, we make the model prediction results for 1618 miRNAs and 3679 diseases, along with all related information, publicly available at http://software.mpm.leibniz-ai-lab.de/ to foster assessments and future adoption.
Collapse
Affiliation(s)
- Thi Ngan Dong
- L3S Research Center, Leibniz University of Hannover, Hannover, Germany.
| | - Johanna Schrader
- L3S Research Center, Leibniz University of Hannover, Hannover, Germany
| | - Stefanie Mücke
- Hannover Unified Biobank (HUB), Hannover Medical School, Hannover, Germany
| | - Megha Khosla
- Delft University of Technology (TU Delft), Delft, Netherlands
| |
Collapse
|
10
|
Wang SH, Wang CC, Huang L, Miao LY, Chen X. Dual-Network Collaborative Matrix Factorization for predicting small molecule-miRNA associations. Brief Bioinform 2021; 23:6447431. [PMID: 34864865 DOI: 10.1093/bib/bbab500] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 10/25/2021] [Accepted: 11/02/2021] [Indexed: 01/01/2023] Open
Abstract
MicroRNAs (miRNAs) play crucial roles in multiple biological processes and human diseases and can be considered as therapeutic targets of small molecules (SMs). Because biological experiments used to verify SM-miRNA associations are time-consuming and expensive, it is urgent to propose new computational models to predict new SM-miRNA associations. Here, we proposed a novel method called Dual-network Collaborative Matrix Factorization (DCMF) for predicting the potential SM-miRNA associations. Firstly, we utilized the Weighted K Nearest Known Neighbors (WKNKN) method to preprocess SM-miRNA association matrix. Then, we constructed matrix factorization model to obtain two feature matrices containing latent features of SM and miRNA, respectively. Finally, the predicted SM-miRNA association score matrix was obtained by calculating the inner product of two feature matrices. The main innovations of this method were that the use of WKNKN method can preprocess the missing values of association matrix and the introduction of dual network can integrate more diverse similarity information into DCMF. For evaluating the validity of DCMF, we implemented four different cross validations (CVs) based on two distinct datasets and two different case studies. Finally, based on dataset 1 (dataset 2), DCMF achieved Area Under receiver operating characteristic Curves (AUC) of 0.9868 (0.8770), 0.9833 (0.8836), 0.8377 (0.7591) and 0.9836 ± 0.0030 (0.8632 ± 0.0042) in global Leave-One-Out Cross Validation (LOOCV), miRNA-fixed local LOOCV, SM-fixed local LOOCV and 5-fold CV, respectively. For case studies, plenty of predicted associations have been confirmed by published experimental literature. Therefore, DCMF is an effective tool to predict potential SM-miRNA associations.
Collapse
Affiliation(s)
- Shu-Hao Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China.,Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou, 221116, China
| | - Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China.,Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou, 221116, China
| | - Li Huang
- Academy of Arts and Design, Tsinghua University, Beijing, 10084, China.,The Future Laboratory, Tsinghua University, Beijing, 10084, China
| | - Lian-Ying Miao
- School of Mathematics, China University of Mining and Technology, Xuzhou, 221116, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China.,Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou, 221116, China
| |
Collapse
|
11
|
Luo J, Shen C, Lai Z, Cai J, Ding P. Incorporating Clinical, Chemical and Biological Information for Predicting Small Molecule-microRNA Associations Based on Non-Negative Matrix Factorization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2535-2545. [PMID: 32092012 DOI: 10.1109/tcbb.2020.2975780] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Small molecule(SM) drugs can affect the expression of miRNAs, which plays crucial roles in many important biological processes. The chemical structure and clinical information of small molecule can simultaneously incorporate information such as anatomical distribution, therapeutic effects and structural characteristics. It is necessary to develop a novel model that incorporates small molecule chemical structure and clinical information to reveal the unknown small molecule-miRNA associations. In this study, we developed a new framework based on non-negative matrix factorization, called SMANMF, to discover the potential small molecules-miRNAs associations. First, the functional similarity of two miRNAs can be obtained by computing the overlap of the target gene sets in which the miRNAs interact together, and we integrated two types of small molecule similarities, including chemical similarity and clinical similarity. Then, we utilized a non-negative matrix factorization model to discover the unknown relationship between small molecules and miRNAs. The evaluation results indicate that our model can achieve superior prediction performance compared with previous approaches in 5-fold cross-validation. At the same time, the results of case studies also reveal that the SMANMF model has good predictive performance for predicting the potential association between small molecules and miRNAs.
Collapse
|
12
|
Wang CC, Zhu CC, Chen X. Ensemble of kernel ridge regression-based small molecule-miRNA association prediction in human disease. Brief Bioinform 2021; 23:6407727. [PMID: 34676393 DOI: 10.1093/bib/bbab431] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 09/06/2021] [Accepted: 09/18/2021] [Indexed: 12/12/2022] Open
Abstract
MicroRNAs (miRNAs) play crucial roles in human disease and can be targeted by small molecule (SM) drugs according to numerous studies, which shows that identifying SM-miRNA associations in human disease is important for drug development and disease treatment. We proposed the method of Ensemble of Kernel Ridge Regression-based Small Molecule-MiRNA Association prediction (EKRRSMMA) to uncover potential SM-miRNA associations by combing feature dimensionality reduction and ensemble learning. First, we constructed different feature subsets for both SMs and miRNAs. Then, we trained homogeneous base learners based on distinct feature subsets and took the average of scores obtained from these base learners as SM-miRNA association score. In EKRRSMMA, feature dimensionality reduction technology was employed in the process of construction of feature subsets to reduce the influence of noisy data. Besides, the base learner, namely KRR_avg, was the combination of two classifiers constructed under SM space and miRNA space, which could make full use of the information of SM and miRNA. To assess the prediction performance of EKRRSMMA, we conducted Leave-One-Out Cross-Validation (LOOCV), SM-fixed local LOOCV, miRNA-fixed local LOOCV and 5-fold CV based on two datasets. For Dataset 1 (Dataset 2), EKRRSMMA got the Area Under receiver operating characteristic Curves (AUCs) of 0.9793 (0.8871), 0.8071 (0.7705), 0.9732 (0.8586) and 0.9767 ± 0.0014 (0.8560 ± 0.0027). Besides, we conducted four case studies. As a result, 32 (5-Fluorouracil), 19 (17β-Estradiol), 26 (5-Aza-2'-deoxycytidine) and 11 (cyclophosphamide) out of top 50 predicted potentially associated miRNAs were confirmed by database or experimental literature. Above evaluation results demonstrated that EKRRSMMA is reliable for predicting SM-miRNA associations.
Collapse
Affiliation(s)
- Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Chi-Chi Zhu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Xing Chen
- Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou 221116, China
| |
Collapse
|
13
|
Chen X, Zhou C, Wang CC, Zhao Y. Predicting potential small molecule-miRNA associations based on bounded nuclear norm regularization. Brief Bioinform 2021; 22:6353837. [PMID: 34404088 DOI: 10.1093/bib/bbab328] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 07/24/2021] [Accepted: 07/26/2021] [Indexed: 12/14/2022] Open
Abstract
Mounting evidence has demonstrated the significance of taking microRNAs (miRNAs) as the target of small molecule (SM) drugs for disease treatment. Given the fact that exploring new SM-miRNA associations through biological experiments is extremely expensive, several computing models have been constructed to reveal the possible SM-miRNA associations. Here, we built a computing model of Bounded Nuclear Norm Regularization for SM-miRNA Associations prediction (BNNRSMMA). Specifically, we first constructed a heterogeneous SM-miRNA network utilizing miRNA similarity, SM similarity, confirmed SM-miRNA associations and defined a matrix to represent the heterogeneous network. Then, we constructed a model to complete this matrix by minimizing its nuclear norm. The Alternating Direction Method of Multipliers was adopted to minimize the nuclear norm and obtain predicted scores. The main innovation lies in two aspects. During completion, we limited all elements of the matrix within the interval of (0,1) to make sure they have practical significance. Besides, instead of strictly fitting all known elements, a regularization term was incorporated to tolerate the noise in integrated similarities. Furthermore, four kinds of cross-validations on two datasets and two types of case studies were performed to evaluate the predictive performance of BNNRSMMA. Finally, BNNRSMMA attained areas under the curve of 0.9822 (0.8433), 0.9793 (0.8852), 0.8253 (0.7350) and 0.9758 ± 0.0029 (0.8759 ± 0.0041) under global leave-one-out cross-validation (LOOCV), miRNA-fixed LOOCV, SM-fixed LOOCV and 5-fold cross-validation based on Dataset 1(Dataset 2), respectively. With regard to case studies, plenty of predicted associations have been verified by experimental literatures. All these results confirmed that BNNRSMMA is a reliable tool for inferring associations.
Collapse
Affiliation(s)
- Xing Chen
- Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou 221116, China
| | - Chi Zhou
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| |
Collapse
|
14
|
Drug repositioning based on the target microRNAs using bilateral-inductive matrix completion. Mol Genet Genomics 2020; 295:1305-1314. [DOI: 10.1007/s00438-020-01702-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 06/15/2020] [Indexed: 12/14/2022]
|
15
|
Liu F, Peng L, Tian G, Yang J, Chen H, Hu Q, Liu X, Zhou L. Identifying Small Molecule-miRNA Associations Based on Credible Negative Sample Selection and Random Walk. Front Bioeng Biotechnol 2020; 8:131. [PMID: 32258003 PMCID: PMC7090022 DOI: 10.3389/fbioe.2020.00131] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 02/10/2020] [Indexed: 12/05/2022] Open
Abstract
Recently, many studies have demonstrated that microRNAs (miRNAs) are new small molecule drug targets. Identifying small molecule-miRNA associations (SMiRs) plays an important role in finding new clues for various human disease therapy. Wet experiments can discover credible SMiR associations; however, this is a costly and time-consuming process. Computational models have therefore been developed to uncover possible SMiR associations. In this study, we designed a new SMiR association prediction model, RWNS. RWNS integrates various biological information, credible negative sample selections, and random walk on a triple-layer heterogeneous network into a unified framework. It includes three procedures: similarity computation, negative sample selection, and SMiR association prediction based on random walk on the constructed small molecule-disease-miRNA association network. To evaluate the performance of RWNS, we used leave-one-out cross-validation (LOOCV) and 5-fold cross validation to compare RWNS with two state-of-the-art SMiR association methods, namely, TLHNSMMA and SMiR-NBI. Experimental results showed that RWNS obtained an AUC value of 0.9829 under LOOCV and 0.9916 under 5-fold cross validation on the SM2miR1 dataset, and it obtained an AUC value of 0.8938 under LOOCV and 0.9899 under 5-fold cross validation on the SM2miR2 dataset. More importantly, RWNS successfully captured 9, 17, and 37 SMiR associations validated by experiments among the predicted top 10, 20, and 50 SMiR candidates with the highest scores, respectively. We inferred that enoxacin and decitabine are associated with mir-21 and mir-155, respectively. Therefore, RWNS can be a powerful tool for SMiR association prediction.
Collapse
Affiliation(s)
- Fuxing Liu
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Lihong Peng
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Geng Tian
- Geneis (Beijing) Co. Ltd., Beijing, China
| | | | - Hui Chen
- College of Chemical Engineering, Xiangtan University, Xiangtan, China
| | - Qi Hu
- Xiangya Second Hospital, Central South University, Changsha, Hunan, China
| | - Xiaojun Liu
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Liqian Zhou
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| |
Collapse
|
16
|
Zhao Y, Chen X, Yin J, Qu J. SNMFSMMA: using symmetric nonnegative matrix factorization and Kronecker regularized least squares to predict potential small molecule-microRNA association. RNA Biol 2019; 17:281-291. [PMID: 31739716 DOI: 10.1080/15476286.2019.1694732] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Accumulating studies have shown that microRNAs (miRNAs) could be used as targets of small-molecule (SM) drugs to treat diseases. In recent years, researchers have proposed many computational models to reveal miRNA-SM associations due to the huge cost of experimental methods. Considering the shortcomings of the previous models, such as the prediction accuracy of some models is low or some cannot be applied for new SMs (miRNAs), we developed a novel model named Symmetric Nonnegative Matrix Factorization for Small Molecule-MiRNA Association prediction (SNMFSMMA). Different from some models directly applying the integrated similarities, SNMFSMMA first performed matrix decomposition on the integrated similarity matrixes, and calculated the Kronecker product of the new integrated similarity matrixes to obtain the SM-miRNA pair similarity. Further, we applied regularized least square to obtain the mapping function of the SM-miRNA pairs to the associated probabilities by minimizing the objective function. On the basis of Dataset 1 and 2 extracted from SM2miR v1.0 database, we implemented global leave-one-out cross validation (LOOCV), miRNA-fixed local LOOCV, SM-fixed local LOOCV and 5-fold cross-validation to evaluate the prediction performance. Finally, the AUC values obtained by SNMFSMMA in these validation reached 0.9711 (0.8895), 0.9698 (0.8884), 0.8329 (0.7651) and 0.9644 ± 0.0035 (0.8814 ± 0.0033) based on Dataset 1 (Dataset 2), respectively. In the first case study, 5 of the top 10 associations predicted were confirmed. In the second, 7 and 8 of the top 10 predicted miRNAs related with 5-FU and 5-Aza-2'-deoxycytidine were confirmed. These results demonstrated the reliable predictive power of SNMFSMMA.
Collapse
Affiliation(s)
- Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
17
|
Wang CC, Chen X. A Unified Framework for the Prediction of Small Molecule–MicroRNA Association Based on Cross-Layer Dependency Inference on Multilayered Networks. J Chem Inf Model 2019; 59:5281-5293. [DOI: 10.1021/acs.jcim.9b00667] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| |
Collapse
|
18
|
Yin J, Chen X, Wang CC, Zhao Y, Sun YZ. Prediction of Small Molecule–MicroRNA Associations by Sparse Learning and Heterogeneous Graph Inference. Mol Pharm 2019; 16:3157-3166. [DOI: 10.1021/acs.molpharmaceut.9b00384] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Jun Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| |
Collapse
|
19
|
Wang CC, Chen X, Qu J, Sun YZ, Li JQ. RFSMMA: A New Computational Model to Identify and Prioritize Potential Small Molecule-MiRNA Associations. J Chem Inf Model 2019; 59:1668-1679. [PMID: 30840454 DOI: 10.1021/acs.jcim.9b00129] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
More and more studies found that many complex human diseases occur accompanied by aberrant expression of microRNAs (miRNAs). Small molecule (SM) drugs have been utilized to treat complex human diseases by affecting the expression of miRNAs. Several computational methods were proposed to infer underlying associations between SMs and miRNAs. In our study, we proposed a new calculation model of random forest based small molecule-miRNA association prediction (RFSMMA) which was based on the known SM-miRNA associations in the SM2miR database. RFSMMA utilized the similarity of SMs and miRNAs as features to represent SM-miRNA pairs and further implemented the machine learning algorithm of random forest to train training samples and obtain a prediction model. In RFSMMA, integrating multiple kinds of similarity can avoid the bias of single similarity and choosing more reliable features from original features can represent SM-miRNA pairs more accurately. We carried out cross validations to assess predictive accuracy of RFSMMA. As a result, RFSMMA acquired AUCs of 0.9854, 0.9839, 0.7052, and 0.9917 ± 0.0008 under global leave-one-out cross validation (LOOCV), miRNA-fixed local LOOCV, SM-fixed local LOOCV, and 5-fold cross validation, respectively, under data set 1. Based on data set 2, RFSMMA obtained AUCs of 0.8456, 0.8463, 0.6653, and 0.8389 ± 0.0033 under four cross validations according to the order mentioned above. In addition, we implemented a case study on three common SMs, namely, 5-fluorouracil, 17β-estradiol, and 5-aza-2'-deoxycytidine. Among the top 50 associated miRNAs of these three SMs predicted by RFSMMA, 31, 32, and 28 miRNAs were verified, respectively. Therefore, RFSMMA is shown to be an effective and reliable tool for identifying underlying SM-miRNA associations.
Collapse
Affiliation(s)
- Chun-Chun Wang
- School of Information and Control Engineering , China University of Mining and Technology , Xuzhou 221116 , China
| | - Xing Chen
- School of Information and Control Engineering , China University of Mining and Technology , Xuzhou 221116 , China
| | - Jia Qu
- School of Information and Control Engineering , China University of Mining and Technology , Xuzhou 221116 , China
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering , Shenzhen University , Shenzhen 518060 , China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering , Shenzhen University , Shenzhen 518060 , China
| |
Collapse
|
20
|
Qu J, Chen X, Sun YZ, Zhao Y, Cai SB, Ming Z, You ZH, Li JQ. In Silico Prediction of Small Molecule-miRNA Associations Based on the HeteSim Algorithm. MOLECULAR THERAPY-NUCLEIC ACIDS 2018; 14:274-286. [PMID: 30654189 PMCID: PMC6348698 DOI: 10.1016/j.omtn.2018.12.002] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Revised: 11/15/2018] [Accepted: 12/04/2018] [Indexed: 01/27/2023]
Abstract
Targeting microRNAs (miRNAs) with drug small molecules (SMs) is a new treatment method for many human complex diseases. Unsurprisingly, identification of potential miRNA-SM associations is helpful for pharmaceutical engineering and disease therapy in the field of medical research. In this paper, we developed a novel computational model of HeteSim-based inference for SM-miRNA Association prediction (HSSMMA) by implementing a path-based measurement method of HeteSim on a heterogeneous network combined with known miRNA-SM associations, integrated miRNA similarity, and integrated SM similarity. Through considering paths from an SM to a miRNA in the heterogeneous network, the model can capture the semantics information under each path and predict potential miRNA-SM associations based on all the considered paths. We performed global, miRNA-fixed local and SM-fixed local leave one out cross validation (LOOCV) as well as 5-fold cross validation based on the dataset of known miRNA-SM associations to evaluate the prediction performance of our approach. The results showed that HSSMMA gained the corresponding areas under the receiver operating characteristic (ROC) curve (AUCs) of 0.9913, 0.9902, 0.7989, and 0.9910 ± 0.0004 based on dataset 1 and AUCs of 0.7401, 0.8466, 0.6149, and 0.7451 ± 0.0054 based on dataset 2, respectively. In case studies, 2 of the top 10 and 13 of the top 50 predicted potential miRNA-SM associations were confirmed by published literature. We further implemented case studies to test whether HSSMMA was effective for new SMs without any known related miRNAs. The results from cross validation and case studies showed that HSSMMA could be a useful prediction tool for the identification of potential miRNA-SM associations.
Collapse
Affiliation(s)
- Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China.
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| | - Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Shu-Bin Cai
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| | - Zhong Ming
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China.
| | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Ürümqi 830011, China.
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| |
Collapse
|
21
|
Chen X, Guan NN, Sun YZ, Li JQ, Qu J. MicroRNA-small molecule association identification: from experimental results to computational models. Brief Bioinform 2018; 21:47-61. [PMID: 30325405 DOI: 10.1093/bib/bby098] [Citation(s) in RCA: 75] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2018] [Revised: 09/07/2018] [Accepted: 09/07/2018] [Indexed: 12/14/2022] Open
Abstract
Small molecule is a kind of low molecular weight organic compound with variety of biological functions. Studies have indicated that small molecules can inhibit a specific function of a multifunctional protein or disrupt protein-protein interactions and may have beneficial or detrimental effect against diseases. MicroRNAs (miRNAs) play crucial roles in cellular biology, which makes it possible to develop miRNA as diagnostics and therapeutic targets. Several drug-like compound libraries were screened successfully against different miRNAs in cellular assays further demonstrating the possibility of targeting miRNAs with small molecules. In this review, we summarized the concept and functions of small molecule and miRNAs. Especially, five aspects of miRNA functions were exhibited in detail with individual examples. In addition, four disease states that have been linked to miRNA alterations were summed up. Then, small molecules related to four important miRNAs miR-21, 122, 4644 and 27 were selected for introduction. Some important publicly accessible databases and web servers of the experimentally validated or potential small molecule-miRNA associations were discussed. Identifying small molecule targeting miRNAs has become an important goal of biomedical research. Thus, several experimental and computational models have been developed and implemented to identify novel small molecule-miRNA associations. Here, we reviewed four experimental techniques used in the past few years to search for small-molecule inhibitors of miRNAs, as well as three types of models of predicting small molecule-miRNA associations from different perspectives. Finally, we summarized the limitations of existing methods and discussed the future directions for further development of computational models.
Collapse
Affiliation(s)
- Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Na-Na Guan
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
22
|
Guan NN, Sun YZ, Ming Z, Li JQ, Chen X. Prediction of Potential Small Molecule-Associated MicroRNAs Using Graphlet Interaction. Front Pharmacol 2018; 9:1152. [PMID: 30374302 PMCID: PMC6196296 DOI: 10.3389/fphar.2018.01152] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 09/24/2018] [Indexed: 11/13/2022] Open
Abstract
MicroRNAs (miRNAs) have been proved to be targeted by the small molecules recently, which made using small molecules to target miRNAs become a possible therapy for human diseases. Therefore, it is very meaningful to investigate the relationships between small molecules and miRNAs, which is still yet in the newly-developing stage. In this paper, we presented a prediction model of Graphlet Interaction based inference for Small Molecule-MiRNA Association prediction (GISMMA) by combining small molecule similarity network, miRNA similarity network and known small molecule-miRNA association network. This model described the complex relationship between two small molecules or between two miRNAs using graphlet interaction which consists of 28 isomers. The association score between a small molecule and a miRNA was calculated based on counting the numbers of graphlet interaction throughout the small molecule similarity network and the miRNA similarity network, respectively. Global and two types of local leave-one-out cross validation (LOOCV) as well as five-fold cross validation were implemented in two datasets to evaluate GISMMA. For Dataset 1, the AUCs are 0.9291 for global LOOCV, 0.9505, and 0.7702 for two local LOOCVs, 0.9263 ± 0.0026 for five-fold cross validation; for Dataset 2, the AUCs are 0.8203, 0.8640, 0.6591, and 0.8554 ± 0.0063, in turn. In case study for small molecules, 5-Fluorouracil, 17β-Estradiol and 5-Aza-2'-deoxycytidine, the numbers of top 50 miRNAs predicted by GISMMA and validated to be related to these three small molecules by experimental literatures are in turn 30, 29, and 25. Based on the results from cross validations and case studies, it is easy to realize the excellent performance of GISMMA.
Collapse
Affiliation(s)
- Na-Na Guan
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Ya-Zhou Sun
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Zhong Ming
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China.,National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| |
Collapse
|
23
|
Yang XY, Gao L, Liang C. Inferring Disease–miRNA Associations by Self-Weighting with Multiple Data Source. Mol Biol 2018; 52:749-760. [DOI: 10.1134/s0026893318050151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 10/05/2017] [Indexed: 01/03/2025]
|
24
|
Qu J, Chen X, Sun YZ, Li JQ, Ming Z. Inferring potential small molecule-miRNA association based on triple layer heterogeneous network. J Cheminform 2018; 10:30. [PMID: 29943160 PMCID: PMC6020102 DOI: 10.1186/s13321-018-0284-9] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Accepted: 06/19/2018] [Indexed: 12/12/2022] Open
Abstract
Recently, many biological experiments have indicated that microRNAs (miRNAs) are a newly discovered small molecule (SM) drug targets that play an important role in the development and progression of human complex diseases. More and more computational models have been developed to identify potential associations between SMs and target miRNAs, which would be a great help for disease therapy and clinical applications for known drugs in the field of medical research. In this study, we proposed a computational model of triple layer heterogeneous network based small molecule–MiRNA association prediction (TLHNSMMA) to uncover potential SM–miRNA associations by integrating integrated SM similarity, integrated miRNA similarity, integrated disease similarity, experimentally verified SM–miRNA associations and miRNA–disease associations into a heterogeneous graph. To evaluate the performance of TLHNSMMA, we implemented global and two types of local leave-one-out cross validation as well as fivefold cross validation to compare TLHNSMMA with one previous classical computational model (SMiR-NBI). As a result, for Dataset 1, TLHNSMMA obtained the AUCs of 0.9859, 0.9845, 0.7645 and 0.9851 ± 0.0012, respectively; for Dataset 2, the AUCs are in turn 0.8149, 0.8244, 0.6057 and 0.8168 ± 0.0022. As the result of case studies shown, among the top 10, 20 and 50 potential SM-related miRNAs, there were 2, 7 and 14 SM–miRNA associations confirmed by experiments, respectively. Therefore, TLHNSMMA could be effectively applied to the prediction of SM–miRNA associations.
Collapse
Affiliation(s)
- Jia Qu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China.
| | - Ya-Zhou Sun
- National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, 518060, China.,College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
| | - Jian-Qiang Li
- National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, 518060, China.,College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
| | - Zhong Ming
- National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, 518060, China.,College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
| |
Collapse
|
25
|
Yang J, Lin Y, Jiang L, Xi J, Wang X, Guan J, Chen J, Pan Y, Luo J, Ye C, Sun Q. Comparison analysis of microRNAs in response to dengue virus type 2 infection between the Vero cell-adapted strain and its source, the clinical C6/36 isolated strain. Virus Res 2018; 250:65-74. [PMID: 29660363 DOI: 10.1016/j.virusres.2018.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 04/12/2018] [Accepted: 04/12/2018] [Indexed: 10/17/2022]
Abstract
To elucidate the differences in microRNAs during dengue virus infection between Vero cell-adapted strain (DENV-2-Vero) and its source, the clinical C6/36 isolated strain (DENV-2-C6/36), a comparison analysis was performed in Vero cells by high throughput sequencing. The results showed that the expression of 16 known and 3 novel miRNAs exhibited marked differences. 5 known miRNAs were up-regulated in DENV-2-C6/36 group, while 11 known microRNAs were down-regulated in DENV-2-Vero group. The GO enrichment and KEGG pathway analysis showed that there was a distinct difference in regulating viral replication between two strains. In DENV-2-Vero infection group, significantly enriched GO terms included virion attachment to host cells, viral structural protein/genome processing and packaging. Meanwhile, the regulation of cell death and apoptosis between two groups were different in the early stage of infection. KEGG enrichment analysis showed that DENV-2-C6/36 infection induced more intense regulation of immune-related pathways, including Fc gamma R-mediated phagocytosis, etc. DENV-2-Vero infection could partially alleviate the immune defense of Vero cells compared with DENV-2-C6/36. The results indicated that the distinct microRNA changes induced by two DENV-2 strains may be partly related to their infective abilities. Our data provide useful insights that help elucidate the host-pathogen interactions following DENV infection.
Collapse
Affiliation(s)
- Jiajia Yang
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Yao Lin
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Liming Jiang
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Juemin Xi
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Xiaodan Wang
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Jiaoqiong Guan
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Junying Chen
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Yue Pan
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China
| | - Jia Luo
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Kunming Medical University, Kunming 650500, PR China
| | - Chao Ye
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Kunming Medical University, Kunming 650500, PR China
| | - Qiangming Sun
- Institute of Medical Biology, Chinese Academy of Medical Sciences, and Peking Union Medical College, Kunming 650118, PR China; Yunnan Key Laboratory of Vaccine Research & Development on Severe Infectious Diseases, Kunming 650118, PR China; Yunnan Key Laboratory of Vector-Borne Infectious Disease, Kunming 650118, PR China.
| |
Collapse
|
26
|
Chen H, Zhang Z, Peng W. miRDDCR: a miRNA-based method to comprehensively infer drug-disease causal relationships. Sci Rep 2017; 7:15921. [PMID: 29162848 PMCID: PMC5698443 DOI: 10.1038/s41598-017-15716-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Accepted: 10/31/2017] [Indexed: 01/10/2023] Open
Abstract
Revealing the cause-and-effect mechanism behind drug-disease relationships remains a challenging task. Recent studies suggested that drugs can target microRNAs (miRNAs) and alter their expression levels. In the meanwhile, the inappropriate expression of miRNAs will lead to various diseases. Therefore, targeting specific miRNAs by small-molecule drugs to modulate their activities provides a promising approach to human disease treatment. However, few studies attempt to discover drug-disease causal relationships through the molecular level of miRNAs. Here, we developed a miRNA-based inference method miRDDCR to comprehensively predict drug-disease causal relationships. We first constructed a three-layer drug-miRNA-disease heterogeneous network by combining similarity measurements, existing drug-miRNA associations and miRNA-disease associations. Then, we extended the algorithm of Random Walk to the three-layer heterogeneous network and ranked the potential indications for drugs. Leave-one-out cross-validations and case studies demonstrated that our method miRDDCR can achieve excellent prediction power. Compared with related methods, our causality discovery-based algorithm showed superior prediction ability and highlighted the molecular basis miRNAs, which can be used to assist in the experimental design for drug development and disease treatment. Finally, comprehensively inferred drug-disease causal relationships were released for further studies.
Collapse
Affiliation(s)
- Hailin Chen
- School of Software, East China Jiaotong University, Nanchang, China.
| | - Zuping Zhang
- School of Information Science and Engineering, Central South University, Changsha, China
| | - Wei Peng
- Computer Center of Kunming University of Science and Technology, Kunming, China
| |
Collapse
|
27
|
Integration of Multiple Genomic and Phenotype Data to Infer Novel miRNA-Disease Associations. PLoS One 2016; 11:e0148521. [PMID: 26849207 PMCID: PMC4743935 DOI: 10.1371/journal.pone.0148521] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/19/2016] [Indexed: 01/07/2023] Open
Abstract
MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed. However, individual source of genomic data tends to be incomplete and noisy; therefore, the integration of various types of genomic data for inferring reliable miRNA-disease associations is urgently needed. In this study, we present a computational framework, CHNmiRD, for identifying miRNA-disease associations by integrating multiple genomic and phenotype data, including protein-protein interaction data, gene ontology data, experimentally verified miRNA-target relationships, disease phenotype information and known miRNA-disease connections. The performance of CHNmiRD was evaluated by experimentally verified miRNA-disease associations, which achieved an area under the ROC curve (AUC) of 0.834 for 5-fold cross-validation. In particular, CHNmiRD displayed excellent performance for diseases without any known related miRNAs. The results of case studies for three human diseases (glioblastoma, myocardial infarction and type 1 diabetes) showed that all of the top 10 ranked miRNAs having no known associations with these three diseases in existing miRNA-disease databases were directly or indirectly confirmed by our latest literature mining. All these results demonstrated the reliability and efficiency of CHNmiRD, and it is anticipated that CHNmiRD will serve as a powerful bioinformatics method for mining novel disease-related miRNAs and providing a new perspective into molecular mechanisms underlying human diseases at the post-transcriptional level. CHNmiRD is freely available at http://www.bio-bigdata.com/CHNmiRD.
Collapse
|
28
|
Shao T, Zhao Z, Wu A, Bai J, Li Y, Chen H, Jiang C, Wang Y, Li S, Wang L, Zhang F, Xu J, Li X. Functional dissection of virus-human crosstalk mediated by miRNAs based on the VmiReg database. MOLECULAR BIOSYSTEMS 2016; 11:1319-28. [PMID: 25787233 DOI: 10.1039/c5mb00095e] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Recently, a number of viruses have been shown to encode microRNAs (miRNAs), and they play important roles in several biological processes, enhancing the intricacies of the virus-host crosstalk. However, systematically deciphering the characteristics of crosstalk mediated by viral and human miRNAs has been hampered by the lack of high-confidence targets. Here, a user-friendly platform is developed to provide experimentally validated and predicted target genes of viral miRNAs as well as their functions, named VmiReg. To explore the virus-human crosstalk meditated by miRNAs, validated human cellular targets of viral and cellular miRNAs are analyzed. As a result, target genes of viral miRNAs are prone to be silenced by human miRNAs. Two kinds of targets have globally significantly high functional similarities and are more often found simultaneously in many important biological functions, even in disease genes, particularly cancer genes, and essential genes. In addition, viral and human miRNA targets are in close proximity within the protein-protein interaction network, indicating frequent communication via physical interactions to participate in the same functions. Finally, multiple dense modules intuitively exhibit crosstalk between viral and cellular miRNAs. Furthermore, most co-regulated genes tend to be in important locations of modules. The lymphoma-related module is one of the typical examples. Our study suggests that the functional importance of cellular genes targeted by viral miRNAs and the intricate virus-host crosstalk mediated by miRNAs may be performed via the sharing of target genes or physical interactions, providing a new direction in further researching the roles of miRNAs in infection.
Collapse
Affiliation(s)
- Tingting Shao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Lv Y, Wang S, Meng F, Yang L, Wang Z, Wang J, Chen X, Jiang W, Li Y, Li X. Identifying novel associations between small molecules and miRNAs based on integrated molecular networks. Bioinformatics 2015. [DOI: 10.1093/bioinformatics/btv417] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
30
|
How to learn about gene function: text-mining or ontologies? Methods 2014; 74:3-15. [PMID: 25088781 DOI: 10.1016/j.ymeth.2014.07.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2014] [Revised: 07/01/2014] [Accepted: 07/09/2014] [Indexed: 12/31/2022] Open
Abstract
As the amount of genome information increases rapidly, there is a correspondingly greater need for methods that provide accurate and automated annotation of gene function. For example, many high-throughput technologies--e.g., next-generation sequencing--are being used today to generate lists of genes associated with specific conditions. However, their functional interpretation remains a challenge and many tools exist trying to characterize the function of gene-lists. Such systems rely typically in enrichment analysis and aim to give a quick insight into the underlying biology by presenting it in a form of a summary-report. While the load of annotation may be alleviated by such computational approaches, the main challenge in modern annotation remains to develop a systems form of analysis in which a pipeline can effectively analyze gene-lists quickly and identify aggregated annotations through computerized resources. In this article we survey some of the many such tools and methods that have been developed to automatically interpret the biological functions underlying gene-lists. We overview current functional annotation aspects from the perspective of their epistemology (i.e., the underlying theories used to organize information about gene function into a body of verified and documented knowledge) and find that most of the currently used functional annotation methods fall broadly into one of two categories: they are based either on 'known' formally-structured ontology annotations created by 'experts' (e.g., the GO terms used to describe the function of Entrez Gene entries), or--perhaps more adventurously--on annotations inferred from literature (e.g., many text-mining methods use computer-aided reasoning to acquire knowledge represented in natural languages). Overall however, deriving detailed and accurate insight from such gene lists remains a challenging task, and improved methods are called for. In particular, future methods need to (1) provide more holistic insight into the underlying molecular systems; (2) provide better follow-up experimental testing and treatment options, and (3) better manage gene lists derived from organisms that are not well-studied. We discuss some promising approaches that may help achieve these advances, especially the use of extended dictionaries of biomedical concepts and molecular mechanisms, as well as greater use of annotation benchmarks.
Collapse
|
31
|
Zhan Y, Zhang R, Lv H, Song X, Xu X, Chai L, Lv W, Shang Z, Jiang Y, Zhang R. Prioritization of candidate genes for periodontitis using multiple computational tools. J Periodontol 2014; 85:1059-69. [PMID: 24476546 DOI: 10.1902/jop.2014.130523] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
BACKGROUND Both genetic and environmental factors contribute to the development of periodontitis. Genetic studies identified a variety of candidate genes for periodontitis. The aim of the present study is to identify the most promising candidate genes for periodontitis using an integrative gene ranking method. METHODS Seed genes that were confirmed to be associated with periodontitis were identified using text mining. Three types of candidate genes were then extracted from different resources (expression profiles, genome-wide association studies). Combining the seed genes, four freely available bioinformatics tools (ToppGene, DIR, Endeavour, and GPEC) were integrated for prioritization of candidate genes. Candidate genes that identified with at least three programs and ranked in the top 20 by each program were considered the most promising. RESULTS Prioritization analysis resulted in 21 promising genes involved or potentially involved in periodontitis. Among them, IL18 (interleukin 18), CD44 (CD44 molecule), CXCL1 (chemokine [CXC motif] ligand 1), IL6ST (interleukin 6 signal transducer), MMP3 (matrix metallopeptidase 3), MMP7, CCR1 (chemokine [C-C motif] receptor 1), MMP13, and TLR9 (Toll-like receptor 9) had been associated with periodontitis. However, the roles of other genes, such as CSF3 (colony stimulating factor 3 receptor), CD40, TNFSF14 (tumor necrosis factor receptor superfamily, member 14), IFNB1 (interferon-β1), TIRAP (toll-interleukin 1 receptor domain containing adaptor protein), IL2RA (interleukin 2 receptor α), ETS1 (v-ets avian erythroblastosis virus E26 oncogene homolog 1), GADD45B (growth arrest and DNA-damage-inducible 45 β), BIRC3 (baculoviral IAP repeat containing 3), VAV1 (vav 1 guanine nucleotide exchange factor), COL5A1 (collagen, type V, α1), and C3 (complement component 3), have not been investigated thoroughly in the process of periodontitis. These genes are mainly involved in bacterial infection, immune response, and inflammatory reaction, suggesting that further characterizing their roles in periodontitis will be important. CONCLUSIONS A combination of computational tools will be useful in mining candidate genes for periodontitis. These theoretical results provide new clues for experimental biologists to plan targeted experiments.
Collapse
Affiliation(s)
- Yuanbo Zhan
- Department of Periodontology and Oral Mucosa, Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Wang P, Ning S, Wang Q, Li R, Ye J, Zhao Z, Li Y, Huang T, Li X. mirTarPri: improved prioritization of microRNA targets through incorporation of functional genomics data. PLoS One 2013; 8:e53685. [PMID: 23326485 PMCID: PMC3541237 DOI: 10.1371/journal.pone.0053685] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Accepted: 12/03/2012] [Indexed: 11/18/2022] Open
Abstract
MicroRNAs (miRNAs) are a class of small (19-25 nt) non-coding RNAs. This important class of gene regulator downregulates gene expression through sequence-specific binding to the 3'untranslated regions (3'UTRs) of target mRNAs. Several computational target prediction approaches have been developed for predicting miRNA targets. However, the predicted target lists often have high false positive rates. To construct a workable target list for subsequent experimental studies, we need novel approaches to properly rank the candidate targets from traditional methods. We performed a systematic analysis of experimentally validated miRNA targets using functional genomics data, and found significant functional associations between genes that were targeted by the same miRNA. Based on this finding, we developed a miRNA target prioritization method named mirTarPri to rank the predicted target lists from commonly used target prediction methods. Leave-one-out cross validation has proved to be successful in identifying known targets, achieving an AUC score up to 0. 84. Validation in high-throughput data proved that mirTarPri was an unbiased method. Applying mirTarPri to prioritize results of six commonly used target prediction methods allowed us to find more positive targets at the top of the prioritized candidate list. In comparison with other methods, mirTarPri had an outstanding performance in gold standard and CLIP data. mirTarPri was a valuable method to improve the efficacy of current miRNA target prediction methods. We have also developed a web-based server for implementing mirTarPri method, which is freely accessible at http://bioinfo.hrbmu.edu.cn/mirTarPri.
Collapse
Affiliation(s)
- Peng Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Shangwei Ning
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Qianghu Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Ronghong Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Jingrun Ye
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Zuxianglan Zhao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yan Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Teng Huang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- * E-mail:
| |
Collapse
|
33
|
Prioritising risk pathways of complex human diseases based on functional profiling. Eur J Hum Genet 2012; 21:666-72. [PMID: 23047740 DOI: 10.1038/ejhg.2012.218] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Analysis of the biological pathways involved in complex human diseases is an important step in elucidating the pathogenesis and mechanism of diseases. Most pathway analysis approaches identify disease-related biological pathways using overlapping genes between pathways and diseases. However, these approaches ignore the functional biological association between pathways and diseases. In this paper, we designed a novel computational framework for prioritising disease-risk pathways based on functional profiling. The disease gene set and biological pathways were translated into functional profiles in the context of GO annotations. We then implemented a semantic similarity measurement for calculating the concordance score between a functional profile of disease genes and a functional profile of pathways (FPP); the concordance score was then used to prioritise and infer disease-risk pathways. A freely accessible web toolkit, 'Functional Profiling-based Pathway Prioritisation' (FPPP), was developed (http://bioinfo.hrbmu.edu.cn/FPPP). During validation, our method successfully identified known disease-pathway pairs with area under the ROC curve (AUC) values of 96.73 and 95.02% in tests using both pathway randomisation and disease randomisation. A robustness analysis showed that FPPP is reliable even when using data containing noise. A case study based on a dilated cardiomyopathy data set indicated that the high-ranking pathways from FPPP are well known to be linked with this disease. Furthermore, we predicted the risk pathways of 413 diseases by using FPPP to build a disease similarity landscape that systematically reveals the global modular organisation of disease associations.
Collapse
|
34
|
Chen X, Jiang W, Wang Q, Huang T, Wang P, Li Y, Chen X, Lv Y, Li X. Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network. BMC Med Genomics 2012; 5:43. [PMID: 23031817 PMCID: PMC3532125 DOI: 10.1186/1755-8794-5-43] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2011] [Accepted: 08/27/2012] [Indexed: 11/30/2022] Open
Abstract
Background The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs) have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN). Methods In this study, we proposed a method to identify CRGs based on Gene Ontology (GO) and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene) from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs’ GO and network characteristics. Lastly, we evaluated the performance of the proposed method. Results We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC) for our method is 65.2%, whereas that for the traditional method is 55.2%. Conclusions Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable database for pharmacogenomics research.
Collapse
Affiliation(s)
- Xin Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | | | | | | | | | | | | | | | | |
Collapse
|