1
|
Kong X, Diao L, Jiang P, Nie S, Guo S, Li D. DDK-Linker: a network-based strategy identifies disease signals by linking high-throughput omics datasets to disease knowledge. Brief Bioinform 2024; 25:bbae111. [PMID: 38517698 PMCID: PMC10959161 DOI: 10.1093/bib/bbae111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/26/2024] [Accepted: 02/27/2024] [Indexed: 03/24/2024] Open
Abstract
The high-throughput genomic and proteomic scanning approaches allow investigators to measure the quantification of genome-wide genes (or gene products) for certain disease conditions, which plays an essential role in promoting the discovery of disease mechanisms. The high-throughput approaches often generate a large gene list of interest (GOIs), such as differentially expressed genes/proteins. However, researchers have to perform manual triage and validation to explore the most promising, biologically plausible linkages between the known disease genes and GOIs (disease signals) for further study. Here, to address this challenge, we proposed a network-based strategy DDK-Linker to facilitate the exploration of disease signals hidden in omics data by linking GOIs to disease knowns genes. Specifically, it reconstructed gene distances in the protein-protein interaction (PPI) network through six network methods (random walk with restart, Deepwalk, Node2Vec, LINE, HOPE, Laplacian) to discover disease signals in omics data that have shorter distances to disease genes. Furthermore, benefiting from the establishment of knowledge base we established, the abundant bioinformatics annotations were provided for each candidate disease signal. To assist in omics data interpretation and facilitate the usage, we have developed this strategy into an application that users can access through a website or download the R package. We believe DDK-Linker will accelerate the exploring of disease genes and drug targets in a variety of omics data, such as genomics, transcriptomics and proteomics data, and provide clues for complex disease mechanism and pharmacological research. DDK-Linker is freely accessible at http://ddklinker.ncpsb.org.cn/.
Collapse
Affiliation(s)
- Xiangren Kong
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Lihong Diao
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Peng Jiang
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Shiyan Nie
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Shuzhen Guo
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Dong Li
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| |
Collapse
|
2
|
Han S, Hong J, Yun SJ, Koo HJ, Kim TY. PWN: enhanced random walk on a warped network for disease target prioritization. BMC Bioinformatics 2023; 24:105. [PMID: 36944912 PMCID: PMC10031933 DOI: 10.1186/s12859-023-05227-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 03/13/2023] [Indexed: 03/23/2023] Open
Abstract
BACKGROUND Extracting meaningful information from unbiased high-throughput data has been a challenge in diverse areas. Specifically, in the early stages of drug discovery, a considerable amount of data was generated to understand disease biology when identifying disease targets. Several random walk-based approaches have been applied to solve this problem, but they still have limitations. Therefore, we suggest a new method that enhances the effectiveness of high-throughput data analysis with random walks. RESULTS We developed a new random walk-based algorithm named prioritization with a warped network (PWN), which employs a warped network to achieve enhanced performance. Network warping is based on both internal and external features: graph curvature and prior knowledge. CONCLUSIONS We showed that these compositive features synergistically increased the resulting performance when applied to random walk algorithms, which led to PWN consistently achieving the best performance among several other known methods. Furthermore, we performed subsequent experiments to analyze the characteristics of PWN.
Collapse
Affiliation(s)
- Seokjin Han
- Standigm Inc., 70, Nonhyeon-ro 85-gil, Gangnam-gu, Seoul, 06234 Republic of Korea
| | - Jinhee Hong
- Standigm Inc., 70, Nonhyeon-ro 85-gil, Gangnam-gu, Seoul, 06234 Republic of Korea
| | - So Jeong Yun
- Standigm Inc., 70, Nonhyeon-ro 85-gil, Gangnam-gu, Seoul, 06234 Republic of Korea
| | - Hee Jung Koo
- Standigm UK Co., Ltd, 50-60 Station Road, Cambridge, CB1 2JH UK
| | - Tae Yong Kim
- Standigm Inc., 70, Nonhyeon-ro 85-gil, Gangnam-gu, Seoul, 06234 Republic of Korea
| |
Collapse
|
3
|
Robin V, Bodein A, Scott-Boyer MP, Leclercq M, Périn O, Droit A. Overview of methods for characterization and visualization of a protein-protein interaction network in a multi-omics integration context. Front Mol Biosci 2022; 9:962799. [PMID: 36158572 PMCID: PMC9494275 DOI: 10.3389/fmolb.2022.962799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 08/16/2022] [Indexed: 11/26/2022] Open
Abstract
At the heart of the cellular machinery through the regulation of cellular functions, protein-protein interactions (PPIs) have a significant role. PPIs can be analyzed with network approaches. Construction of a PPI network requires prediction of the interactions. All PPIs form a network. Different biases such as lack of data, recurrence of information, and false interactions make the network unstable. Integrated strategies allow solving these different challenges. These approaches have shown encouraging results for the understanding of molecular mechanisms, drug action mechanisms, and identification of target genes. In order to give more importance to an interaction, it is evaluated by different confidence scores. These scores allow the filtration of the network and thus facilitate the representation of the network, essential steps to the identification and understanding of molecular mechanisms. In this review, we will discuss the main computational methods for predicting PPI, including ones confirming an interaction as well as the integration of PPIs into a network, and we will discuss visualization of these complex data.
Collapse
Affiliation(s)
- Vivian Robin
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Mickaël Leclercq
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Périn
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| |
Collapse
|
4
|
Li D, Li L, Quan F, Wang T, Xu S, Li S, Tian K, Feng M, He N, Tian L, Chen B, Zhang H, Wang L, Wang J. Identification of circulating immune landscape in ischemic stroke based on bioinformatics methods. Front Genet 2022; 13:921582. [PMID: 35957686 PMCID: PMC9358692 DOI: 10.3389/fgene.2022.921582] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Accepted: 07/06/2022] [Indexed: 11/19/2022] Open
Abstract
Ischemic stroke (IS) is a high-incidence disease that seriously threatens human life and health. Neuroinflammation and immune responses are key players in the pathophysiological processes of IS. However, the underlying immune mechanisms are not fully understood. In this study, we attempted to identify several immune biomarkers associated with IS. We first retrospectively collected validated human IS immune-related genes (IS-IRGs) as seed genes. Afterward, potential IS-IRGs were discovered by applying random walk with restart on the PPI network and the permutation test as a screening strategy. Doing so, the validated and potential sets of IS-IRGs were merged together as an IS-IRG catalog. Two microarray profiles were subsequently used to explore the expression patterns of the IS-IRG catalog, and only IS-IRGs that were differentially expressed between IS patients and controls in both profiles were retained for biomarker selection by the Random Forest rankings. CLEC4D and CD163 were finally identified as immune biomarkers of IS, and a classification model was constructed and verified based on the weights of two biomarkers obtained from the Neural Network algorithm. Furthermore, the CIBERSORT algorithm helped us determine the proportions of circulating immune cells. Correlation analyses between IS immune biomarkers and immune cell proportions demonstrated that CLEC4D was strongly correlated with the proportion of neutrophils (r = 0.72). These results may provide potential targets for further studies on immuno-neuroprotection therapies against reperfusion injury.
Collapse
Affiliation(s)
- Danyang Li
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Lifang Li
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Fei Quan
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Tianfeng Wang
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Si Xu
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Shuang Li
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Kuo Tian
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Meng Feng
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Ni He
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Liting Tian
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Biying Chen
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Huixue Zhang
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
- *Correspondence: Huixue Zhang, ; Lihua Wang, ; Jianjian Wang,
| | - Lihua Wang
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
- *Correspondence: Huixue Zhang, ; Lihua Wang, ; Jianjian Wang,
| | - Jianjian Wang
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
- *Correspondence: Huixue Zhang, ; Lihua Wang, ; Jianjian Wang,
| |
Collapse
|
5
|
Sheng M, Cai H, Yang Q, Li J, Zhang J, Liu L. A Random Walk-Based Method to Identify Candidate Genes Associated With Lymphoma. Front Genet 2021; 12:792754. [PMID: 34899868 PMCID: PMC8655984 DOI: 10.3389/fgene.2021.792754] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Accepted: 11/02/2021] [Indexed: 11/16/2022] Open
Abstract
Lymphoma is a serious type of cancer, especially for adolescents and elder adults, although this malignancy is quite rare compared with other types of cancer. The cause of this malignancy remains ambiguous. Genetic factor is deemed to be highly associated with the initiation and progression of lymphoma, and several genes have been related to this disease. Determining the pathogeny of lymphoma by identifying the related genes is important. In this study, we presented a random walk-based method to infer the novel lymphoma-associated genes. From the reported 1,458 lymphoma-associated genes and protein–protein interaction network, raw candidate genes were mined by using the random walk with restart algorithm. The determined raw genes were further filtered by using three screening tests (i.e., permutation, linkage, and enrichment tests). These tests could control false-positive genes and screen out essential candidate genes with strong linkages to validate the lymphoma-associated genes. A total of 108 inferred genes were obtained. Analytical results indicated that some inferred genes, such as RAC3, TEC, IRAK2/3/4, PRKCE, SMAD3, BLK, TXK, PRKCQ, were associated with the initiation and progression of lymphoma.
Collapse
Affiliation(s)
- Minjie Sheng
- Department of Ophthalmology, Yangpu Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Haiying Cai
- Department of Ophthalmology, Yangpu Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Qin Yang
- Department of Ophthalmology, Yangpu Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Jing Li
- Department of Ophthalmology, Yangpu Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Jian Zhang
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.,Shanghai Key Laboratory of Ocular Fundus Diseases, Shanghai, China.,Shanghai Engineering Center for Visual Science and Photomedicine, Shanghai, China.,National Clinical Research Center for Eye Diseases, Shanghai, China.,Shanghai Engineering Center for Precise Diagnosis and Treatment of Eye Diseases, Shanghai, China
| | - Lihua Liu
- Department of Ophthalmology, Yangpu Hospital, School of Medicine, Tongji University, Shanghai, China
| |
Collapse
|
6
|
Yang L, Qin Y, Jian C. Screening for Core Genes Related to Pathogenesis of Alzheimer's Disease. Front Cell Dev Biol 2021; 9:668738. [PMID: 33968940 PMCID: PMC8101499 DOI: 10.3389/fcell.2021.668738] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 04/01/2021] [Indexed: 12/18/2022] Open
Abstract
Alzheimer’s disease (AD), a nervous system disease, lacks effective therapies at present. RNA expression is the basic way to regulate life activities, and identifying related characteristics in AD patients may aid the exploration of AD pathogenesis and treatment. This study developed a classifier that could accurately classify AD patients and healthy people, and then obtained 3 core genes that may be related to the pathogenesis of AD. To this end, RNA expression data of the middle temporal gyrus of AD patients were firstly downloaded from GEO database, and the data were then normalized using limma package following a supplementation of missing data by k-Nearest Neighbor (KNN) algorithm. Afterwards, the top 500 genes of the most feature importance were obtained through Max-Relevance and Min-Redundancy (mRMR) analysis, and based on these genes, a series of AD classifiers were constructed through Support Vector Machine (SVM), Random Forest (RF), and KNN algorithms. Then, the KNN classifier with the highest Matthews correlation coefficient (MCC) value composed of 14 genes in incremental feature selection (IFS) analysis was identified as the best AD classifier. As analyzed, the 14 genes played a pivotal role in determination of AD and may be core genes associated with the pathogenesis of AD. Finally, protein-protein interaction (PPI) network and Random Walk with Restart (RWR) analysis were applied to obtain core gene-associated genes, and key pathways related to AD were further analyzed. Overall, this study contributed to a deeper understanding of AD pathogenesis and provided theoretical guidance for related research and experiments.
Collapse
Affiliation(s)
- Longxiu Yang
- Department of Neurology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Yuan Qin
- Department of Neurology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Chongdong Jian
- Department of Neurology, The Affiliated Hospital of Youjiang Medical University for Nationalities, Baise, China
| |
Collapse
|
7
|
Han H, Yang Y, Wu Z, Liu B, Dong L, Deng H, Tian J, Lei H. Capilliposide B blocks VEGF-induced angiogenesis in vitro in primary human retinal microvascular endothelial cells. Biomed Pharmacother 2021; 133:110999. [PMID: 33227710 DOI: 10.1016/j.biopha.2020.110999] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 11/06/2020] [Accepted: 11/08/2020] [Indexed: 12/21/2022] Open
Abstract
Abnormal angiogenesis is associated with intraocular diseases such as proliferative diabetic retinopathy and neovascular age-related macular degeneration, and current therapies for these eye diseases are not satisfactory. The purpose of this study was to determine whether capilliposide B (CPS-B), a novel oleanane triterpenoid saponin derived from Lysimachia capillipes Hemsl, can inhibit vascular endothelial growth factor (VEGF)-induced angiogenesis signaling events and cellular responses in primary human retinal microvascular endothelial cells (HRECs). Our study revealed that the capilliposide B IC50 for HRECs was 8.5 μM at 72 h and that 1 μM capilliposide B specifically inhibited VEGF-induced activation of VEGFR2 and its downstream signaling enzymes Akt and Erk. In addition, we discovered that this chemical effectively blocked VEGF-stimulated proliferation, migration and tube formation of the HRECs, suggesting that capilliposide B is a promising prophylactic for angiogenesis-associated diseases such as proliferative diabetic retinopathy.
Collapse
Affiliation(s)
- Haote Han
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, 310027, PR China; Zhejiang-Malaysia Joint Research Center for Traditional Medicine, Zhejiang University, Hangzhou, 310027, PR China; Schepens Eye Research Institute of Massachusetts Eye and Ear, Boston, MA, USA; Department of Ophthalmology, Harvard Medical School, Boston, MA, USA
| | - Yanhui Yang
- School of Basic Medical Sciences, Ningxia Medical University, Yinchuan, Ningxia, PR China
| | - Zhipan Wu
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, 310027, PR China; Zhejiang-Malaysia Joint Research Center for Traditional Medicine, Zhejiang University, Hangzhou, 310027, PR China
| | - Bing Liu
- Schepens Eye Research Institute of Massachusetts Eye and Ear, Boston, MA, USA; Department of Ophthalmology, Harvard Medical School, Boston, MA, USA; Guangzhou Women and Children's Medical Center, Guangzhou Medical University, 510005, Guangzhou, PR China
| | - Lijun Dong
- Shenzhen Eye Hospital, Shenzhen Eye Institute, Jinan University, Shenzhen, Guangdong Province, PR China
| | - Hongwei Deng
- Shenzhen Eye Hospital, Shenzhen Eye Institute, Jinan University, Shenzhen, Guangdong Province, PR China.
| | - Jingkui Tian
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, 310027, PR China; Zhejiang-Malaysia Joint Research Center for Traditional Medicine, Zhejiang University, Hangzhou, 310027, PR China.
| | - Hetian Lei
- Shenzhen Eye Hospital, Shenzhen Eye Institute, Jinan University, Shenzhen, Guangdong Province, PR China.
| |
Collapse
|
8
|
Identification of Latent Oncogenes with a Network Embedding Method and Random Forest. BIOMED RESEARCH INTERNATIONAL 2020; 2020:5160396. [PMID: 33029511 PMCID: PMC7530476 DOI: 10.1155/2020/5160396] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 09/09/2020] [Accepted: 09/14/2020] [Indexed: 12/29/2022]
Abstract
Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years, such as high cost and long time. The newly proposed computational methods provide an alternative way to study oncogenes, which can provide useful clues for further investigations on candidate genes. Considering the limitations of some previous computational methods, such as lack of learning procedures and terming genes as individual subjects, a novel computational method was proposed in this study. The method adopted the features derived from multiple protein networks, viewing proteins in a system level. A classic machine learning algorithm, random forest, was applied on these features to capture the essential characteristic of oncogenes, thereby building the prediction model. All genes except validated oncogenes were ranked with a measurement yielded by the prediction model. Top genes were quite different from potential oncogenes discovered by previous methods, and they can be confirmed to become novel oncogenes. It was indicated that the newly identified genes can be essential supplements for previous results.
Collapse
|
9
|
RWR-algorithm-based dissection of microRNA-506-3p and microRNA-140-5p as radiosensitive biomarkers in colorectal cancer. Aging (Albany NY) 2020; 12:20512-20522. [PMID: 33033230 PMCID: PMC7655152 DOI: 10.18632/aging.103907] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2020] [Accepted: 07/21/2020] [Indexed: 12/12/2022]
Abstract
Radiotherapy resistance is one of the main causes for treatment failure in colorectal cancer (CRC), and it is associated with the deregulation of certain microRNAs. In this study, we constructed the microRNA-mRNA network consisting of 2275 microRNAs and 7045 target genes, collected the known microRNAs related to CRC-radiosensitivity (CRCR) (n=18) as the seed nodes, and applied the algorithm of random walk with restart (RWR) to the network to identify novel CRCR-related microRNAs (n=263). In functional analysis, 263 novel microRNAs shared a high proportion of the same biological processes and pathways with the known microRNAs. In topological analysis of the sub-network of the 263 microRNAs and their targets, hsa-mir-506-3p and hsa-mir-140-5p were identified as network hub nodes. In plasma, radiosensitive patients had a higher expression level of hsa-mir-506-3p and hsa-mir-140-5p than radioresistant patients. In experimental validation, both hsa-mir-506-3p and hsa-mir-140-5p over-expression could obviously decrease the cell proliferation, survival rate and colonality in CRC cells after radiation. In conclusion, this study combined the novel network-based method with experimental validation, and identified two novel radiosensitive biomarkers of hsa-mir-506-3p and hsa-mir-140-5p in CRC.
Collapse
|
10
|
Zhang J, Zhang M, Zhao H, Xu X. Identification of proliferative diabetic retinopathy-associated genes on the protein–protein interaction network by using heat diffusion algorithm. Biochim Biophys Acta Mol Basis Dis 2020; 1866:165794. [DOI: 10.1016/j.bbadis.2020.165794] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 03/25/2020] [Accepted: 04/04/2020] [Indexed: 12/11/2022]
|
11
|
Zhang X, Chen L. Prediction of membrane protein types by fusing protein-protein interaction and protein sequence information. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2020; 1868:140524. [PMID: 32858174 DOI: 10.1016/j.bbapap.2020.140524] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 07/17/2020] [Accepted: 07/30/2020] [Indexed: 11/30/2022]
Abstract
Membrane proteins are gatekeepers to the cell and essential for determination of the function of cells. Identification of the types of membrane proteins is an essential problem in cell biology. It is time-consuming and expensive to identify the type of membrane proteins with traditional experimental methods. The alternative way is to design effective computational methods, which can provide quick and reliable predictions. To date, several computational methods have been proposed in this regard. Several of them used the features extracted from the sequence information of individual proteins. Recently, networks are more and more popular to tackle different protein-related problems, which can organize proteins in a system level and give an overview of all proteins. However, such form weakens the essential properties of proteins, such as their sequence information. In this study, a novel feature fusion scheme was proposed, which integrated the information of protein sequences and protein-protein interaction network. The fused features of a protein were defined as the linear combination of sequence features of all proteins in the network, where the combination coefficients were the probabilities yielded by the random walk with restart algorithm with the protein as the seed node. Several models with such fused features and different classification algorithms were built and evaluated. Their performance for predicting the type of membrane proteins was improved compared with the models only with the sequence features or network information.
Collapse
Affiliation(s)
- Xiaolin Zhang
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, People's Republic of China
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, People's Republic of China.
| |
Collapse
|
12
|
Identification of COVID-19 Infection-Related Human Genes Based on a Random Walk Model in a Virus-Human Protein Interaction Network. BIOMED RESEARCH INTERNATIONAL 2020; 2020:4256301. [PMID: 32685484 PMCID: PMC7345912 DOI: 10.1155/2020/4256301] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Accepted: 06/26/2020] [Indexed: 12/15/2022]
Abstract
Coronaviruses are specific crown-shaped viruses that were first identified in the 1960s, and three typical examples of the most recent coronavirus disease outbreaks include severe acute respiratory syndrome (SARS), Middle East respiratory syndrome (MERS), and COVID-19. Particularly, COVID-19 is currently causing a worldwide pandemic, threatening the health of human beings globally. The identification of viral pathogenic mechanisms is important for further developing effective drugs and targeted clinical treatment methods. The delayed revelation of viral infectious mechanisms is currently one of the technical obstacles in the prevention and treatment of infectious diseases. In this study, we proposed a random walk model to identify the potential pathological mechanisms of COVID-19 on a virus–human protein interaction network, and we effectively identified a group of proteins that have already been determined to be potentially important for COVID-19 infection and for similar SARS infections, which help further developing drugs and targeted therapeutic methods against COVID-19. Moreover, we constructed a standard computational workflow for predicting the pathological biomarkers and related pharmacological targets of infectious diseases.
Collapse
|
13
|
Wang K, Yi D, Yu Z, Zhu B, Li S, Liu X. Identification of the Hub Genes Related to Nerve Injury-Induced Neuropathic Pain. Front Neurosci 2020; 14:488. [PMID: 32508579 PMCID: PMC7251260 DOI: 10.3389/fnins.2020.00488] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 04/20/2020] [Indexed: 11/13/2022] Open
Abstract
Background The reactivity enhancement of pain sensitive neurons in the nervous system is a feature of the pathogenesis for neuropathic pain (NP), yet the underlying mechanisms need to be fully understood. In this study, we made an attempt to clarify the NP-related hub genes and signaling pathways so as to provide effective diagnostic and therapeutic methods toward NP. Methods Microarray expression profile GSE30691 including the mRNA-seq data of the spared nerve injury (SNI)-induced NP rats was accessed from the GEO database. Then, genes associated with NP development were screened using differential analysis along with random walk with restart (RWR). GO annotation and KEGG pathway analyses were performed to explore the biological functions and signaling pathways where the genes were activated. Afterward, protein-protein interaction (PPI) analysis and GO analysis were conducted to further identify the hub genes which showed an intimate correlation with NP development. Results Totally 94 genes associated with NP development were screened by differential analysis and RWR analysis, and they were observed to be predominantly enriched in hormone secretion and transport, cAMP signaling pathway and other NP occurrence associated functions and pathways. Thereafter, the 94 genes were subjected to PPI analysis to find the genes much more associated with NP and a functional module composed of 48 genes were obtained. 8 hub genes including C3, C1qb, Ccl2, Cxcl13, Timp1, Fcgr2b, Gal, and Lyz2 were eventually identified after further association and functional enrichment analyses, and the expression of these 8 genes were all higher in SNI rats by comparison with those in Sham rats. Conclusion Based on the data collected from GEO database, this study discovered 8 hub genes that were closely related to NP occurrence and development, which help to provide potent theoretical basis for NP treatment.
Collapse
Affiliation(s)
- Kai Wang
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing, China
| | - Duan Yi
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing, China
| | - Zhuoyin Yu
- Department of Anesthesiology, Peking University Third Hospital, Beijing, China
| | - Bin Zhu
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing, China
| | - Shuiqing Li
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing, China
| | - Xiaoguang Liu
- Department of Orthopedic, Peking University Third Hospital, Beijing, China
| |
Collapse
|
14
|
Prediction of Drug Side Effects with a Refined Negative Sample Selection Strategy. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2020; 2020:1573543. [PMID: 32454877 PMCID: PMC7232712 DOI: 10.1155/2020/1573543] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 04/14/2020] [Accepted: 04/23/2020] [Indexed: 01/07/2023]
Abstract
Drugs are an important way to treat various diseases. However, they inevitably produce side effects, bringing great risks to human bodies and pharmaceutical companies. How to predict the side effects of drugs has become one of the essential problems in drug research. Designing efficient computational methods is an alternative way. Some studies paired the drug and side effect as a sample, thereby modeling the problem as a binary classification problem. However, the selection of negative samples is a key problem in this case. In this study, a novel negative sample selection strategy was designed for accessing high-quality negative samples. Such strategy applied the random walk with restart (RWR) algorithm on a chemical-chemical interaction network to select pairs of drugs and side effects, such that drugs were less likely to have corresponding side effects, as negative samples. Through several tests with a fixed feature extraction scheme and different machine-learning algorithms, models with selected negative samples produced high performance. The best model even yielded nearly perfect performance. These models had much higher performance than those without such strategy or with another selection strategy. Furthermore, it is not necessary to consider the balance of positive and negative samples under such a strategy.
Collapse
|
15
|
Yi D, Wang K, Zhu B, Li S, Liu X. Identification of neuropathic pain-associated genes and pathways via random walk with restart algorithm. J Neurosurg Sci 2020; 65:414-420. [PMID: 32536116 DOI: 10.23736/s0390-5616.20.04920-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
BACKGROUND Neuropathic pain (NP) develops from neuropathic lesions or diseases affecting the nervous system, and has become a serious public health issue due to its complex symptoms, high incidence and long duration. At present, the exact pathogenesis of NP is still unclear. In this study, we sought to identify the genes as well as the related molecular mechanisms associated with NP occurrence and development. METHODS We firstly identified the differentially expressed genes between NP spinal nerve ligation (SNL) rats and control sham rats and then projected them onto a STRING network for functional association analysis. Then, Random Walk with Restart (RWR) was conducted to find some new NP-related genes, with their potential functions sequentially analyzed by GO annotation and KEGG pathway analysis. RESULTS Some new NP-related genes, like Gng13, C3 and Cxcl2, were identified by RWR analysis. Meanwhile, some biological functions like inflammatory responses, chemotaxis and immune responses, as well as some signaling pathways, such as those involved in neuroactive ligand-receptor interactions, complement and blood coagulation cascade reactions, and cytokine-receptor interactions that the new NP- related genes were most activated were found to be associated with NP occurrence and development. CONCLUSIONS This study extends our knowledge of NP occurrence and development and provides new therapeutic targets for future NP treatment.
Collapse
Affiliation(s)
- Duan Yi
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing China
| | - Kai Wang
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing China
| | - Bin Zhu
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing China
| | - Shuiqing Li
- Department of Pain Medicine Center, Peking University Third Hospital, Beijing China
| | - Xiaoguang Liu
- Department of Orthopedic, Peking University Third Hospital, Beijing China -
| |
Collapse
|
16
|
Zhu L, Xiang J, Wang Q, Wang A, Li C, Tian G, Zhang H, Chen S. Revealing the Interactions Between Diabetes, Diabetes-Related Diseases, and Cancers Based on the Network Connectivity of Their Related Genes. Front Genet 2020; 11:617136. [PMID: 33381155 PMCID: PMC7767993 DOI: 10.3389/fgene.2020.617136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Accepted: 11/18/2020] [Indexed: 11/25/2022] Open
Abstract
Diabetes-related diseases (DRDs), especially cancers pose a big threat to public health. Although people have explored pathological pathways of a few common DRDs, there is a lack of systematic studies on important biological processes (BPs) connecting diabetes and its related diseases/cancers. We have proposed and compared 10 protein-protein interaction (PPI)-based computational methods to study the connections between diabetes and 254 diseases, among which a method called DIconnectivity_eDMN performs the best in the sense that it infers a disease rank (according to its relation with diabetes) most consistent with that by literature mining. DIconnectivity_eDMN takes diabetes-related genes, other disease-related genes, a PPI network, and genes in BPs as input. It first maps genes in a BP into the PPI network to construct a BP-related subnetwork, which is expanded (in the whole PPI network) by a random walk with restart (RWR) process to generate a so-called expanded modularized network (eMN). Since the numbers of known disease genes are not high, an RWR process is also performed to generate an expanded disease-related gene list. For each eMN and disease, the expanded diabetes-related genes and disease-related genes are mapped onto the eMN. The association between diabetes and the disease is measured by the reachability of their genes on all eMNs, in which the reachability is estimated by a method similar to the Kolmogorov-Smirnov (KS) test. DIconnectivity_eDMN achieves an area under receiver operating characteristic curve (AUC) of 0.71 for predicting both Type 1 DRDs and Type 2 DRDs. In addition, DIconnectivity_eDMN reveals important BPs connecting diabetes and DRDs. For example, "respiratory system development" and "regulation of mRNA metabolic process" are critical in associating Type 1 diabetes (T1D) and many Type 1 DRDs. It is also found that the average proportion of diabetes-related genes interacting with DRDs is higher than that of non-DRDs.
Collapse
Affiliation(s)
- Lijuan Zhu
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| | - Ju Xiang
- Neuroscience Research Center, Department of Basic Medical Sciences, Changsha Medical University, Changsha, China
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Qiuling Wang
- Department of Endocrinology, The Affiliated Yantai Yuhuangding Hospital of Qingdao University, Yantai, China
| | - Ailan Wang
- Geneis Beijing Co., Ltd., Beijing, China
| | - Chao Li
- Geneis Beijing Co., Ltd., Beijing, China
| | - Geng Tian
- Geneis Beijing Co., Ltd., Beijing, China
- Qingdao Geneis Institute of Big Data Mining and Precision Medicine, Qingdao, China
| | - Huajun Zhang
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
- *Correspondence: Huajun Zhang,
| | - Size Chen
- Department of Oncology, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangdong Provincial Engineering Research Center for Esophageal Cancer Precision Treatment, Guangzhou, China
- Size Chen,
| |
Collapse
|
17
|
Inferring novel genes related to oral cancer with a network embedding method and one-class learning algorithms. Gene Ther 2019; 26:465-478. [PMID: 31455874 DOI: 10.1038/s41434-019-0099-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Revised: 06/18/2019] [Accepted: 07/15/2019] [Indexed: 12/14/2022]
Abstract
Oral cancer (OC) is one of the most common cancers threatening human lives. However, OC pathogenesis has yet to be fully uncovered, and thus designing effective treatments remains difficult. Identifying genes related to OC is an important way for achieving this purpose. In this study, we proposed three computational models for inferring novel OC-related genes. In contrast to previously proposed computational methods, which lacked the learning procedures, each proposed model adopted a one-class learning algorithm, which can provide a deep insight into features of validated OC-related genes. A network embedding algorithm (i.e., node2vec) was applied to the protein-protein interaction network to produce the representation of genes. The features of the OC-related genes were used in the training of the one-class algorithm, and the performance of the final inferring model was improved through a feature selection procedure. Then, candidate genes were produced by applying the trained inferring model to other genes. Three tests were performed to screen out the important candidate genes. Accordingly, we obtained three inferred gene sets, any two of which were different. The inferred genes were also different from previous reported genes and some of them have been included in the public Oral Cancer Gene Database. Finally, we analyzed several inferred genes to confirm whether they are novel OC-related genes.
Collapse
|
18
|
Lu S, Zhu ZG, Lu WC. Inferring novel genes related to colorectal cancer via random walk with restart algorithm. Gene Ther 2019; 26:373-385. [PMID: 31308477 DOI: 10.1038/s41434-019-0090-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2018] [Revised: 05/20/2019] [Accepted: 06/11/2019] [Indexed: 12/12/2022]
Abstract
Colorectal cancer (CRC) is the third most common type of cancer. In recent decades, genomic analysis has played an increasingly important role in understanding the molecular mechanisms of CRC. However, its pathogenesis has not been fully uncovered. Identification of genes related to CRC as complete as possible is an important way to investigate its pathogenesis. Therefore, we proposed a new computational method for the identification of novel CRC-associated genes. The proposed method is based on existing proven CRC-associated genes, human protein-protein interaction networks, and random walk with restart algorithm. The utility of the method is indicated by comparing it to the methods based on Guilt-by-association or shortest path algorithm. Using the proposed method, we successfully identified 298 novel CRC-associated genes. Previous studies have validated the involvement of the majority of these 298 novel genes in CRC-associated biological processes, thus suggesting the efficacy and accuracy of our method.
Collapse
Affiliation(s)
- Sheng Lu
- Department of General Surgery, Rui Jin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai Institute of Digestive Surgery, Shanghai, 200025, China
| | - Zheng-Gang Zhu
- Department of General Surgery, Rui Jin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai Institute of Digestive Surgery, Shanghai, 200025, China
| | - Wen-Cong Lu
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai, 200444, China.
| |
Collapse
|
19
|
Jiang W, Zhan H, Jiao Y, Li S, Gao W. A novel lncRNA-miRNA-mRNA network analysis identified the hub lncRNA RP11-159F24.1 in the pathogenesis of papillary thyroid cancer. Cancer Med 2018; 7:6290-6298. [PMID: 30474931 PMCID: PMC6308055 DOI: 10.1002/cam4.1900] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Revised: 08/28/2018] [Accepted: 10/01/2018] [Indexed: 12/13/2022] Open
Abstract
Papillary thyroid cancer (PTC) is one of the most common cancers worldwide, and its carcinogenesis is influenced by a complex network of gene interactions. In this study, the microarray expression profile was re-annotated into a lncRNA-mRNA biphasic profile. LncRNA-mRNA interactions were confirmed by established miRNA-RNA data and hypergeometric test. Then, a PTC-related lncRNA-miRNA-mRNA network (PTCRN) was constructed by integrating differentially expressed genes with the RNA-RNA networks. The new network consisted of 21 lncRNAs, 241 mRNAs and 803 edges. To prioritize PTC-related genes, we performed topological analysis and random walk with restart (PWR) algorithm analysis of PTCRN. Both analyses identified lncRNA RP11-159F24.1 as a hub node in the network, which could interact with 47 mRNAs by sponging miR-485. In functional enrichment analysis, these interacting mRNAs were associated with the pathways in cancer. In validation, RP11-159F24.1 (up-regulated; P = 0.0013) showed an opposite expression pattern with its target miR-485 (down-regulated; P = 0.0013) in PTC, indicating that the RP11-159F24.1/miR-485/mRNAs axis might play an important role in the development of PTC. In conclusion, this study has constructed a PTC-related lncRNA-miRNA-mRNA network and identified the hub lncRNA RP11-159F24.1 in the tumorigenesis, which provided novel insights to explore the underlying mechanism of PTC.
Collapse
Affiliation(s)
- Wei Jiang
- Department of Endocrinologythe First Affiliated Hospital of Harbin Medical UniversityHarbinChina
| | - Hua Zhan
- Department of Neurosurgerythe First Affiliated Hospital of Harbin Medical UniversityHarbinChina
| | - Yanyan Jiao
- Department of Endocrinologythe First Affiliated Hospital of Harbin Medical UniversityHarbinChina
| | - Sha Li
- Department of Endocrinologythe First Affiliated Hospital of Harbin Medical UniversityHarbinChina
| | - Weixu Gao
- Department of Endocrinologythe First Affiliated Hospital of Harbin Medical UniversityHarbinChina
| |
Collapse
|
20
|
Lu S, Zhao K, Wang X, Liu H, Ainiwaer X, Xu Y, Ye M. Use of Laplacian Heat Diffusion Algorithm to Infer Novel Genes With Functions Related to Uveitis. Front Genet 2018; 9:425. [PMID: 30349554 PMCID: PMC6186792 DOI: 10.3389/fgene.2018.00425] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 09/10/2018] [Indexed: 12/17/2022] Open
Abstract
Uveitis is the inflammation of the uvea and is a serious eye disease that can cause blindness for middle-aged and young people. However, the pathogenesis of this disease has not been fully uncovered and thus renders difficulties in designing effective treatments. Completely identifying the genes related to this disease can help improve and accelerate the comprehension of uveitis. In this study, a new computational method was developed to infer potential related genes based on validated ones. We employed a large protein–protein interaction network reported in STRING, in which Laplacian heat diffusion algorithm was applied using validated genes as seed nodes. Except for the validated ones, all genes in the network were filtered by three tests, namely, permutation, association, and function tests, which evaluated the genes based on their specialties and associations to uveitis. Results indicated that 59 inferred genes were accessed, several of which were confirmed to be highly related to uveitis by literature review. In addition, the inferred genes were compared with those reported in a previous study, indicating that our reported genes are necessary supplements.
Collapse
Affiliation(s)
- Shiheng Lu
- Department of Ophthalmology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Pudong, China
| | - Ke Zhao
- Department of Ophthalmology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Pudong, China
| | - Xuefei Wang
- Department of Ophthalmology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Pudong, China
| | - Hui Liu
- Department of Ophthalmology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Pudong, China
| | - Xiamuxiya Ainiwaer
- Department of Ophthalmology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Pudong, China
| | - Yan Xu
- School of Life Sciences, Shanghai University, Shanghai, China
| | - Min Ye
- Department of Ophthalmology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Pudong, China
| |
Collapse
|