1
|
Zhao Y, Ma Y, Zhang Q. Metabolite-disease interaction prediction based on logistic matrix factorization and local neighborhood constraints. Front Psychiatry 2023; 14:1149947. [PMID: 37342171 PMCID: PMC10277486 DOI: 10.3389/fpsyt.2023.1149947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 05/10/2023] [Indexed: 06/22/2023] Open
Abstract
Background Increasing evidence indicates that metabolites are closely related to human diseases. Identifying disease-related metabolites is especially important for the diagnosis and treatment of disease. Previous works have mainly focused on the global topological information of metabolite and disease similarity networks. However, the local tiny structure of metabolites and diseases may have been ignored, leading to insufficiency and inaccuracy in the latent metabolite-disease interaction mining. Methods To solve the aforementioned problem, we propose a novel metabolite-disease interaction prediction method with logical matrix factorization and local nearest neighbor constraints (LMFLNC). First, the algorithm constructs metabolite-metabolite and disease-disease similarity networks by integrating multi-source heterogeneous microbiome data. Then, the local spectral matrices based on these two networks are established and used as the input of the model, together with the known metabolite-disease interaction network. Finally, the probability of metabolite-disease interaction is calculated according to the learned latent representations of metabolites and diseases. Results Extensive experiments on the metabolite-disease interaction data were conducted. The results show that the proposed LMFLNC method outperformed the second-best algorithm by 5.28 and 5.61% in the AUPR and F1, respectively. The LMFLNC method also exhibited several potential metabolite-disease interactions, such as "Cortisol" (HMDB0000063), relating to "21-Hydroxylase deficiency," and "3-Hydroxybutyric acid" (HMDB0000011) and "Acetoacetic acid" (HMDB0000060), both relating to "3-Hydroxy-3-methylglutaryl-CoA lyase deficiency." Conclusion The proposed LMFLNC method can well preserve the geometrical structure of original data and can thus effectively predict the underlying associations between metabolites and diseases. The experimental results show its effectiveness in metabolite-disease interaction prediction.
Collapse
Affiliation(s)
- Yongbiao Zhao
- National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, Hubei, China
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, Hubei, China
| | - Yuanyuan Ma
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, Hubei, China
| | - Qilin Zhang
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, Hubei, China
| |
Collapse
|
2
|
Xu Z, Marchionni L, Wang S. MultiNEP: a multi-omics network enhancement framework for prioritizing disease genes and metabolites simultaneously. Bioinformatics 2023; 39:btad333. [PMID: 37216914 PMCID: PMC10250081 DOI: 10.1093/bioinformatics/btad333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 04/28/2023] [Accepted: 05/19/2023] [Indexed: 05/24/2023] Open
Abstract
MOTIVATION Many studies have successfully used network information to prioritize candidate omics profiles associated with diseases. The metabolome, as the link between genotypes and phenotypes, has accumulated growing attention. Using a "multi-omics" network constructed with a gene-gene network, a metabolite-metabolite network, and a gene-metabolite network to simultaneously prioritize candidate disease-associated metabolites and gene expressions could further utilize gene-metabolite interactions that are not used when prioritizing them separately. However, the number of metabolites is usually 100 times fewer than that of genes. Without accounting for this imbalance issue, we cannot effectively use gene-metabolite interactions when simultaneously prioritizing disease-associated metabolites and genes. RESULTS Here, we developed a Multi-omics Network Enhancement Prioritization (MultiNEP) framework with a weighting scheme to reweight contributions of different sub-networks in a multi-omics network to effectively prioritize candidate disease-associated metabolites and genes simultaneously. In simulation studies, MultiNEP outperforms competing methods that do not address network imbalances and identifies more true signal genes and metabolites simultaneously when we down-weight relative contributions of the gene-gene network and up-weight that of the metabolite-metabolite network to the gene-metabolite network. Applications to two human cancer cohorts show that MultiNEP prioritizes more cancer-related genes by effectively using both within- and between-omics interactions after handling network imbalance. AVAILABILITY AND IMPLEMENTATION The developed MultiNEP framework is implemented in an R package and available at: https://github.com/Karenxzr/MultiNep.
Collapse
Affiliation(s)
- Zhuoran Xu
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY 10065, United States
| | - Luigi Marchionni
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY 10065, United States
| | - Shuang Wang
- Department of Biostatistics, Columbia University, New York, NY 10032, United States
| |
Collapse
|
3
|
Khosla NK, Lesinski JM, Colombo M, Bezinge L, deMello AJ, Richards DA. Simplifying the complex: accessible microfluidic solutions for contemporary processes within in vitro diagnostics. LAB ON A CHIP 2022; 22:3340-3360. [PMID: 35984715 PMCID: PMC9469643 DOI: 10.1039/d2lc00609j] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 08/15/2022] [Indexed: 05/02/2023]
Abstract
In vitro diagnostics (IVDs) form the cornerstone of modern medicine. They are routinely employed throughout the entire treatment pathway, from initial diagnosis through to prognosis, treatment planning, and post-treatment surveillance. Given the proven links between high quality diagnostic testing and overall health, ensuring broad access to IVDs has long been a focus of both researchers and medical professionals. Unfortunately, the current diagnostic paradigm relies heavily on centralized laboratories, complex and expensive equipment, and highly trained personnel. It is commonly assumed that this level of complexity is required to achieve the performance necessary for sensitive and specific disease diagnosis, and that making something affordable and accessible entails significant compromises in test performance. However, recent work in the field of microfluidics is challenging this notion. By exploiting the unique features of microfluidic systems, researchers have been able to create progressively simple devices that can perform increasingly complex diagnostic assays. This review details how microfluidic technologies are disrupting the status quo, and facilitating the development of simple, affordable, and accessible integrated IVDs. Importantly, we discuss the advantages and limitations of various approaches, and highlight the remaining challenges within the field.
Collapse
Affiliation(s)
- Nathan K Khosla
- Institute for Chemical and Bioengineering, ETH Zürich, Vladimir Prelog Weg 1, Zürich, 8093, Switzerland.
| | - Jake M Lesinski
- Institute for Chemical and Bioengineering, ETH Zürich, Vladimir Prelog Weg 1, Zürich, 8093, Switzerland.
| | - Monika Colombo
- Institute for Chemical and Bioengineering, ETH Zürich, Vladimir Prelog Weg 1, Zürich, 8093, Switzerland.
| | - Léonard Bezinge
- Institute for Chemical and Bioengineering, ETH Zürich, Vladimir Prelog Weg 1, Zürich, 8093, Switzerland.
| | - Andrew J deMello
- Institute for Chemical and Bioengineering, ETH Zürich, Vladimir Prelog Weg 1, Zürich, 8093, Switzerland.
| | - Daniel A Richards
- Institute for Chemical and Bioengineering, ETH Zürich, Vladimir Prelog Weg 1, Zürich, 8093, Switzerland.
| |
Collapse
|
4
|
Lei X, Tie J, Pan Y. Inferring Metabolite-Disease Association Using Graph Convolutional Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:688-698. [PMID: 33705323 DOI: 10.1109/tcbb.2021.3065562] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
As is well known, biological experiments are time-consuming and laborious, so there is absolutely no doubt that developing an effective computational model will help solve these problems. Most of computational models rely on the biological similarity and network-based methods that cannot consider the topological structures of metabolite-disease association graphs. We proposed a novel method based on graph convolutional networks to infer potential metabolite-disease association, named MDAGCN. We first calculated three kinds of metabolite similarities and three kinds of disease similarities. The final similarity of disease and metabolite will be obtained by integrating three kinds' similarities of each and filtering out the noise similarity values. Then metabolite similarity network, disease similarity network and known metabolite-disease association network were used to construct a heterogenous network. Finally, heterogeneous network with rich information is fed into the graph convolutional networks to obtain new features of a node through aggregation of node information so as to infer the potential associations between metabolites and diseases. Experimental results show that MDAGCN achieves more reliable results in cross validation and case studies when compared with other existing methods.
Collapse
|
5
|
Ma Y, Ma Y. Hypergraph-based logistic matrix factorization for metabolite-disease interaction prediction. Bioinformatics 2022; 38:435-443. [PMID: 34499104 DOI: 10.1093/bioinformatics/btab652] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 08/08/2021] [Accepted: 09/06/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Function-related metabolites, the terminal products of the cell regulation, show a close association with complex diseases. The identification of disease-related metabolites is critical to the diagnosis, prevention and treatment of diseases. However, most existing computational approaches build networks by calculating pairwise relationships, which is inappropriate for mining higher-order relationships. RESULTS In this study, we presented a novel approach with hypergraph-based logistic matrix factorization, HGLMF, to predict the potential interactions between metabolites and disease. First, the molecular structures and gene associations of metabolites and the hierarchical structures and GO functional annotations of diseases were extracted to build various similarity measures of metabolites and diseases. Next, the kernel neighborhood similarity of metabolites (or diseases) was calculated according to the completed interactive network. Second, multiple networks of metabolites and diseases were fused, respectively, and the hypergraph structures of metabolites and diseases were built. Finally, a logistic matrix factorization based on hypergraph was proposed to predict potential metabolite-disease interactions. In computational experiments, HGLMF accurately predicted the metabolite-disease interaction, and performed better than other state-of-the-art methods. Moreover, HGLMF could be used to predict new metabolites (or diseases). As suggested from the case studies, the proposed method could discover novel disease-related metabolites, which has been confirmed in existing studies. AVAILABILITY AND IMPLEMENTATION The codes and dataset are available at: https://github.com/Mayingjun20179/HGLMF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yingjun Ma
- School of Applied Mathematics, Xiamen University of Technology, Xiamen 361024, China
| | - Yuanyuan Ma
- School of Computer & Information Engineering, Anyang Normal University, Anyang 455000, China
| |
Collapse
|
6
|
Chen J, Chen Y, Sun K, Wang Y, He H, Sun L, Ha S, Li X, Ou Y, Zhang X, Bi Y. Prediction of Ovarian Cancer-Related Metabolites Based on Graph Neural Network. Front Cell Dev Biol 2021; 9:753221. [PMID: 34676219 PMCID: PMC8525679 DOI: 10.3389/fcell.2021.753221] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 08/27/2021] [Indexed: 11/13/2022] Open
Abstract
Ovarian cancer is one of the three most malignant tumors of the female reproductive system. At present, researchers do not know its pathogenesis, which makes the treatment effect unsatisfactory. Metabolomics is closely related to drug efficacy, safety evaluation, mechanism of action, and rational drug use. Therefore, identifying ovarian cancer-related metabolites could greatly help researchers understand the pathogenesis and develop treatment plans. However, the measurement of metabolites is inaccurate and greatly affects the environment, and biological experiment is time-consuming and costly. Therefore, researchers tend to use computational methods to identify disease-related metabolites in large scale. Since the hypothesis that similar diseases are related to similar metabolites is widely accepted, in this paper, we built both disease similarity network and metabolite similarity network and used graph convolutional network (GCN) to encode these networks. Then, support vector machine (SVM) was used to identify whether a metabolite is related to ovarian cancer. The experiment results show that the AUC and AUPR of our method are 0.92 and 0.81, respectively. Finally, we proposed an effective method to prioritize ovarian cancer-related metabolites in large scale.
Collapse
Affiliation(s)
- Jingjing Chen
- Department of Obstetrics and Gynecology, First Affiliated Hospital, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Yingying Chen
- Department of Obstetrics and Gynecology, First Affiliated Hospital, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Kefeng Sun
- Department of Obstetrics and Gynecology, First Affiliated Hospital, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Yu Wang
- Department of Obstetrics and Gynecology, First Affiliated Hospital, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Hui He
- Department of Obstetrics and Gynecology, First Affiliated Hospital, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Lin Sun
- Department of Reproductive Medicine, Dalian Maternal and Children's Centre, Dalian, China
| | - Sifu Ha
- Department of Reproductive Medicine, Dalian Maternal and Children's Centre, Dalian, China
| | - Xiaoxiao Li
- Graduate School of Heilongjiang University of Chinese Medicine, Harbin, China
| | - Yifei Ou
- Graduate School of Heilongjiang University of Chinese Medicine, Harbin, China
| | - Xue Zhang
- Department of General Practice, Beijing Friendship Hospital, Capital Medical University, Beijing, China
| | - Yanli Bi
- Department of Reproductive Medicine, The First Affiliated Hospital, Henan University of Chinese Medicine, Zhengzhou, China
| |
Collapse
|
7
|
Zhao T, Hu Y, Cheng L. Deep-DRM: a computational method for identifying disease-related metabolites based on graph deep learning approaches. Brief Bioinform 2020; 22:5922326. [PMID: 33048110 DOI: 10.1093/bib/bbaa212] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Revised: 07/27/2020] [Accepted: 08/12/2020] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION The functional changes of the genes, RNAs and proteins will eventually be reflected in the metabolic level. Increasing number of researchers have researched mechanism, biomarkers and targeted drugs by metabolites. However, compared with our knowledge about genes, RNAs, and proteins, we still know few about diseases-related metabolites. All the few existed methods for identifying diseases-related metabolites ignore the chemical structure of metabolites, fail to recognize the association pattern between metabolites and diseases, and fail to apply to isolated diseases and metabolites. RESULTS In this study, we present a graph deep learning based method, named Deep-DRM, for identifying diseases-related metabolites. First, chemical structures of metabolites were used to calculate similarities of metabolites. The similarities of diseases were obtained based on their functional gene network and semantic associations. Therefore, both metabolites and diseases network could be built. Next, Graph Convolutional Network (GCN) was applied to encode the features of metabolites and diseases, respectively. Then, the dimension of these features was reduced by Principal components analysis (PCA) with retainment 99% information. Finally, Deep neural network was built for identifying true metabolite-disease pairs (MDPs) based on these features. The 10-cross validations on three testing setups showed outstanding AUC (0.952) and AUPR (0.939) of Deep-DRM compared with previous methods and similar approaches. Ten of top 15 predicted associations between diseases and metabolites got support by other studies, which suggests that Deep-DRM is an efficient method to identify MDPs. CONTACT liangcheng@hrbmu.edu.cn. AVAILABILITY AND IMPLEMENTATION https://github.com/zty2009/GPDNN-for-Identify-ing-Disease-related-Metabolites.
Collapse
Affiliation(s)
- Tianyi Zhao
- Department of Computer Science at the Harbin Institute of Technology
| | - Yang Hu
- Department of Life Science at the Harbin Institute of Technology
| | - Liang Cheng
- CAMS Key Laboratory of Molecular Probe and Targeted Theranostics, College of Bioinformatics Science and Technology at Harbin Medical University
| |
Collapse
|
8
|
Lei X, Tie J, Fujita H. Relational completion based non-negative matrix factorization for predicting metabolite-disease associations. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.106238] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|