1
|
Jiang XY, Hu JJ, Wang R, Zhang WY, Jin QQ, Yang YT, Mei J, Hong L, Yao H, Tao F, Li JJ, Liu Y, Zhang L, Chen SX, Chen G, Song Y, Zhou SG. Cuproptosis-Associated lncRNA Gene Signature Establishes New Prognostic Profile and Predicts Immunotherapy Response in Endometrial Carcinoma. Biochem Genet 2024; 62:3439-3466. [PMID: 38108937 PMCID: PMC11427535 DOI: 10.1007/s10528-023-10574-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/26/2023] [Indexed: 12/19/2023]
Abstract
Uterine corpus endometrial carcinoma (UCEC), a prevalent kind of cancerous tumor in female reproductive system that has a dismal prognosis in women worldwide. Given the very limited studies of cuproptosis-related lncRNAs (CRLs) in UCEC. Our purpose was to construct a prognostic profile based on CRLs and explore its assess prognostic value in UCEC victims and its correlation with the immunological microenvironment. METHODS 554 UCEC tumor samples and 23 normal samples' RNA-seq statistics and clinical details were compiled from data in the TCGA database. CRLs were obtained using Pearson correlation analysis. Using LASSO Cox regression, multivariate Cox regression, and univariate Cox regression analysis, six CRLs are confirmed to develop a risk prediction model at last.We identified two main molecular subtypes and observed that multilayer CRLs modifications were related to patient clinicopathological features, prognosis, and tumor microenvironment (TME) cell infiltration characteristics, and then we verified the prognostic hallmark of UCEC and examined its immunological landscape.Finally, using qRT-PCR, model hub genes' expression patterns were confirmed. RESULTS A unique CRL signature was established by the combination of six differently expressed CRLs that were highly linked with the prognosis of UCEC patients. According to their CRLs signatures, the patients were divided into two groups: the low-risk and the high-risk groups. Compared to individuals at high risk, patients at low risk had higher survival rates (p < 0.001). Additionally, Cox regression reveals that the profiles of lncRNAs linked to cuproptosis may independently predict prognosis in UCEC patients. The 1-, 3-, and 5-year risks' respective receiver operating characteristics (ROC) exhibited AUC values of 0.778, 0.810, and 0.854. Likewise, the signature could predict survival in different groups based on factors like stage, age, and grade, among others. Further investigation revealed differences between the different risk score groups in terms of drug sensitivity,immune cell infiltration,tumor mutation burden (TMB) score and microsatellite instability (MSI) score. Compared to the group of high risk, the low-risk group had greater rates of TMB and MSI. Results from qRT-PCR revealed that in UCEC vs normal tissues, AC026202.2, NRAV, AC079466.2, and AC090617.5 were upregulated,while LINC01545 and AL450384.1 were downregulated. CONCLUSIONS Our research clarified the relationship between CRLs signature and the immunological profile and prognosis of UCEC.This signature will establish the framework for future investigations into the endometrial cancer CRLs mechanism as well as the exploitation of new diagnostic tools and new therapeutic.
Collapse
Affiliation(s)
- Xi-Ya Jiang
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Jing-Jing Hu
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Reproduction, The First Affiliated Hospital of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Rui Wang
- Department of Clinical Laboratory, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
| | - Wei-Yu Zhang
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Qin-Qin Jin
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Yin-Ting Yang
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Jie Mei
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Lin Hong
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Hui Yao
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Feng Tao
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Jie-Jie Li
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Yu Liu
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Li Zhang
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Shun-Xia Chen
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Guo Chen
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China
| | - Yang Song
- Department of Pain, The First Affiliated Hospital of Anhui Medical University, Hefei, 230032, Anhui, China.
| | - Shu-Guang Zhou
- Department of Gynecology and Obstetrics, Maternity and Child Healthcare Hospital Affiliated to Anhui Medical University, Anhui Province Maternity and Child Healthcare Hospital, Hefei, 230001, Anhui, China.
- Department of Gynecology and Obstetrics, The Fifth Clinical College of Anhui Medical University, Hefei, 230032, Anhui, China.
| |
Collapse
|
2
|
Momanyi BM, Temesgen SA, Wang T, Gao H, Gao R, Tang H, Tang L. iGATTLDA: Integrative graph attention and transformer-based model for predicting lncRNA-Disease associations. IET Syst Biol 2024; 18:172-182. [PMID: 39308027 PMCID: PMC11490194 DOI: 10.1049/syb2.12098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 08/19/2024] [Accepted: 09/10/2024] [Indexed: 10/20/2024] Open
Abstract
Long non-coding RNAs (lncRNAs) have emerged as significant contributors to the regulation of various biological processes, and their dysregulation has been linked to a variety of human disorders. Accurate prediction of potential correlations between lncRNAs and diseases is crucial for advancing disease diagnostics and treatment procedures. The authors introduced a novel computational method, iGATTLDA, for the prediction of lncRNA-disease associations. The model utilised lncRNA and disease similarity matrices, with known associations represented in an adjacency matrix. A heterogeneous network was constructed, dissecting lncRNAs and diseases as nodes and their associations as edges. The Graph Attention Network (GAT) is employed to process initial features and corresponding adjacency information. GAT identified significant neighbouring nodes in the network, capturing intricate relationships between lncRNAs and diseases, and generating new feature representations. Subsequently, the transformer captures global dependencies and interactions across the entire sequence of features produced by the GAT. Consequently, iGATTLDA successfully captures complex relationships and interactions that conventional approaches may overlook. In evaluating iGATTLDA, it attained an area under the receiver operating characteristic (ROC) curve (AUC) of 0.95 and an area under the precision recall curve (AUPRC) of 0.96 with a two-layer multilayer perceptron (MLP) classifier. These results were notably higher compared to the majority of previously proposed models, further substantiating the model's efficiency in predicting potential lncRNA-disease associations by incorporating both local and global interactions. The implementation details can be obtained from https://github.com/momanyibiffon/iGATTLDA.
Collapse
Affiliation(s)
- Biffon Manyura Momanyi
- School of Computer Science and EngineeringCenter for Informational BiologyUniversity of Electronic Science and Technology of ChinaChengduChina
| | - Sebu Aboma Temesgen
- School of Life Science and TechnologyCenter for Informational BiologyUniversity of Electronic Science and Technology of ChinaChengduChina
| | - Tian‐Yu Wang
- School of Life Science and TechnologyCenter for Informational BiologyUniversity of Electronic Science and Technology of ChinaChengduChina
| | - Hui Gao
- School of Computer Science and EngineeringCenter for Informational BiologyUniversity of Electronic Science and Technology of ChinaChengduChina
| | - Ru Gao
- The People’s Hospital of WenjiangChengduChina
| | - Hua Tang
- School of Basic Medical SciencesSouthwest Medical UniversityLuzhouChina
- Medical Engineering & Medical Informatics Integration and Transformational Medicine Key Laboratory of Luzhou CityLuzhouChina
- Central Nervous System Drug Key Laboratory of Sichuan ProvinceLuzhouChina
| | - Li‐Xia Tang
- School of Life Science and TechnologyCenter for Informational BiologyUniversity of Electronic Science and Technology of ChinaChengduChina
| |
Collapse
|
3
|
Chen L, Chen K, Zhou B. Inferring drug-disease associations by a deep analysis on drug and disease networks. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:14136-14157. [PMID: 37679129 DOI: 10.3934/mbe.2023632] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
Drugs, which treat various diseases, are essential for human health. However, developing new drugs is quite laborious, time-consuming, and expensive. Although investments into drug development have greatly increased over the years, the number of drug approvals each year remain quite low. Drug repositioning is deemed an effective means to accelerate the procedures of drug development because it can discover novel effects of existing drugs. Numerous computational methods have been proposed in drug repositioning, some of which were designed as binary classifiers that can predict drug-disease associations (DDAs). The negative sample selection was a common defect of this method. In this study, a novel reliable negative sample selection scheme, named RNSS, is presented, which can screen out reliable pairs of drugs and diseases with low probabilities of being actual DDAs. This scheme considered information from k-neighbors of one drug in a drug network, including their associations to diseases and the drug. Then, a scoring system was set up to evaluate pairs of drugs and diseases. To test the utility of the RNSS, three classic classification algorithms (random forest, bayes network and nearest neighbor algorithm) were employed to build classifiers using negative samples selected by the RNSS. The cross-validation results suggested that such classifiers provided a nearly perfect performance and were significantly superior to those using some traditional and previous negative sample selection schemes.
Collapse
Affiliation(s)
- Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Kaiyu Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Bo Zhou
- Shanghai University of Medicine & Health Sciences, Shanghai 201318, China
| |
Collapse
|
4
|
Liu H, Bing P, Zhang M, Tian G, Ma J, Li H, Bao M, He K, He J, He B, Yang J. MNNMDA: Predicting human microbe-disease association via a method to minimize matrix nuclear norm. Comput Struct Biotechnol J 2023; 21:1414-1423. [PMID: 36824227 PMCID: PMC9941872 DOI: 10.1016/j.csbj.2022.12.053] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 12/29/2022] [Accepted: 12/30/2022] [Indexed: 01/03/2023] Open
Abstract
Identifying the potential associations between microbes and diseases is the first step for revealing the pathological mechanisms of microbe-associated diseases. However, traditional culture-based microbial experiments are expensive and time-consuming. Thus, it is critical to prioritize disease-associated microbes by computational methods for further experimental validation. In this study, we proposed a novel method called MNNMDA, to predict microbe-disease associations (MDAs) by applying a Matrix Nuclear Norm method into known microbe and disease data. Specifically, we first calculated Gaussian interaction profile kernel similarity and functional similarity for diseases and microbes. Then we constructed a heterogeneous information network by combining the integrated disease similarity network, the integrated microbe similarity network and the known microbe-disease bipartite network. Finally, we formulated the microbe-disease association prediction problem as a low-rank matrix completion problem, which was solved by minimizing the nuclear norm of a matrix with a few regularization terms. We tested the performances of MNNMDA in three datasets including HMDAD, Disbiome, and Combined Data with small, medium and large sizes respectively. We also compared MNNMDA with 5 state-of-the-art methods including KATZHMDA, LRLSHMDA, NTSHMDA, GATMDA, and KGNMDA, respectively. MNNMDA achieved area under the ROC curves (AUROC) of 0.9536 and 0.9364 respectively on HDMAD and Disbiome, better than the AUCs of compared methods under the 5-fold cross-validation for all microbe-disease associations. It also obtained a relatively good performance with AUROC 0.8858 in the combined data. In addition, MNNMDA was also better than other methods in area under precision and recall curve (AUPR) under the 5-fold cross-validation for all associations, and in both AUROC and AUPR under the 5-fold cross-validation for diseases and the 5-fold cross-validation for microbes. Finally, the case studies on colon cancer and inflammatory bowel disease (IBD) also validated the effectiveness of MNNMDA. In conclusion, MNNMDA is an effective method in predicting microbe-disease associations. Availability The codes and data for this paper are freely available at Github https://github.com/Haiyan-Liu666/MNNMDA.
Collapse
Affiliation(s)
- Haiyan Liu
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,College of Information Engineering, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China
| | - Pingping Bing
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China
| | - Meijun Zhang
- Geneis Beijing Co., Ltd., Beijing 100102, PR China
| | - Geng Tian
- Geneis Beijing Co., Ltd., Beijing 100102, PR China
| | - Jun Ma
- College of Information Engineering, Changsha Medical University, Changsha 410219, PR China
| | - Haigang Li
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China,School of pharmacy, Changsha Medical University, Changsha 410219, PR China
| | - Meihua Bao
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China,School of pharmacy, Changsha Medical University, Changsha 410219, PR China
| | - Kunhui He
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China,School of pharmacy, Changsha Medical University, Changsha 410219, PR China
| | - Jianjun He
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China,School of pharmacy, Changsha Medical University, Changsha 410219, PR China,Corresponding authors at: Academician Workstation, Changsha Medical University, Changsha 410219, PR China.
| | - Binsheng He
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China,School of pharmacy, Changsha Medical University, Changsha 410219, PR China,Corresponding authors at: Academician Workstation, Changsha Medical University, Changsha 410219, PR China.
| | - Jialiang Yang
- Academician Workstation, Changsha Medical University, Changsha 410219, PR China,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, PR China,Geneis Beijing Co., Ltd., Beijing 100102, PR China,School of pharmacy, Changsha Medical University, Changsha 410219, PR China,Corresponding authors at: Academician Workstation, Changsha Medical University, Changsha 410219, PR China.
| |
Collapse
|
5
|
He B, Wang K, Xiang J, Bing P, Tang M, Tian G, Guo C, Xu M, Yang J. DGHNE: network enhancement-based method in identifying disease-causing genes through a heterogeneous biomedical network. Brief Bioinform 2022; 23:6712302. [PMID: 36151744 DOI: 10.1093/bib/bbac405] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/01/2022] [Accepted: 08/21/2022] [Indexed: 12/14/2022] Open
Abstract
The identification of disease-causing genes is critical for mechanistic understanding of disease etiology and clinical manipulation in disease prevention and treatment. Yet the existing approaches in tackling this question are inadequate in accuracy and efficiency, demanding computational methods with higher identification power. Here, we proposed a new method called DGHNE to identify disease-causing genes through a heterogeneous biomedical network empowered by network enhancement. First, a disease-disease association network was constructed by the cosine similarity scores between phenotype annotation vectors of diseases, and a new heterogeneous biomedical network was constructed by using disease-gene associations to connect the disease-disease network and gene-gene network. Then, the heterogeneous biomedical network was further enhanced by using network embedding based on the Gaussian random projection. Finally, network propagation was used to identify candidate genes in the enhanced network. We applied DGHNE together with five other methods into the most updated disease-gene association database termed DisGeNet. Compared with all other methods, DGHNE displayed the highest area under the receiver operating characteristic curve and the precision-recall curve, as well as the highest precision and recall, in both the global 5-fold cross-validation and predicting new disease-gene associations. We further performed DGHNE in identifying the candidate causal genes of Parkinson's disease and diabetes mellitus, and the genes connecting hyperglycemia and diabetes mellitus. In all cases, the predicted causing genes were enriched in disease-associated gene ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways, and the gene-disease associations were highly evidenced by independent experimental studies.
Collapse
Affiliation(s)
- Binsheng He
- Academician Workstation, Changsha Medical University, Changsha 410219, China.,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, P. R. China.,School of pharmacy, Changsha Medical University, Changsha 410219, P. R. China
| | - Kun Wang
- School of Mathematical Sciences, Ocean University of China, Qingdao 266100, China
| | - Ju Xiang
- Academician Workstation, Changsha Medical University, Changsha 410219, China
| | - Pingping Bing
- Academician Workstation, Changsha Medical University, Changsha 410219, China.,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, P. R. China.,School of pharmacy, Changsha Medical University, Changsha 410219, P. R. China
| | - Min Tang
- School of Life Sciences, Jiangsu University, Zhenjiang 212001, Jiangsu, China
| | - Geng Tian
- Geneis (Beijing) Co., Ltd., Beijing 100102, China
| | - Cheng Guo
- Center for Infection and Immunity, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA
| | - Miao Xu
- Broad institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - Jialiang Yang
- Academician Workstation, Changsha Medical University, Changsha 410219, China.,Hunan Key Laboratory of the Research and Development of Novel Pharmaceutical Preparations, Changsha Medical University, Changsha 410219, P. R. China.,School of pharmacy, Changsha Medical University, Changsha 410219, P. R. China.,Geneis (Beijing) Co., Ltd., Beijing 100102, China
| |
Collapse
|
6
|
Li S, Yang M, Ji L, Fan H. A multi-omics machine learning framework in predicting the recurrence and metastasis of patients with pancreatic adenocarcinoma. Front Microbiol 2022; 13:1032623. [PMID: 36406449 PMCID: PMC9669652 DOI: 10.3389/fmicb.2022.1032623] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 10/17/2022] [Indexed: 10/15/2023] Open
Abstract
Local recurrence and distant metastasis are the main causes of death in patients with pancreatic adenocarcinoma (PDAC). Microbial content in PDAC metastasis is still not well-characterized. Here, the tissue microbiome was comprehensively compared between metastatic and non-metastatic PDAC patients. We found that the pancreatic tissue microbiome of metastatic patients was significantly different from that of non-metastatic patients. Further, 10 potential bacterial biomarkers (Kurthia, Gulbenkiania, Acetobacterium and Planctomyces etc.) were identified by differential analysis. Meanwhile, significant differences in expression patterns across multiple omics (lncRNA, miRNA, and mRNA) of PDAC patients were found. The highest accuracy was achieved when these 10 bacterial biomarkers were used as features to predict recurrence or metastasis in PDAC patients, with an AUC of 0.815. Finally, the recurrence and metastasis in PDAC patients were associated with reduced survival and this association was potentially driven by the 10 biomarkers we identified. Our studies highlight the association between the tissue microbiome and recurrence or metastasis of pancreatic adenocarcioma patients, as well as the survival of patients.
Collapse
Affiliation(s)
- Shenming Li
- Department of Hepatobiliary and Pancreaticosplenic Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
- Department of Nephrology, Essen University Hospital, University of Duisburg-Essen, Essen, Germany
| | - Min Yang
- School of Electrical and Information Engineering, Anhui University of Technology, Ma’anshan, Anhui, China
- Genesis Beijing Co., Ltd., Beijing, China
| | - Lei Ji
- Genesis Beijing Co., Ltd., Beijing, China
| | - Hua Fan
- Department of Hepatobiliary and Pancreaticosplenic Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
7
|
Jiang ZR, Yang LH, Jin LZ, Yi LM, Bing PP, Zhou J, Yang JS. Identification of novel cuproptosis-related lncRNA signatures to predict the prognosis and immune microenvironment of breast cancer patients. Front Oncol 2022; 12:988680. [PMID: 36203428 PMCID: PMC9531154 DOI: 10.3389/fonc.2022.988680] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 08/15/2022] [Indexed: 11/13/2022] Open
Abstract
Background Cuproptosis is a new modality of cell death regulation that is currently considered as a new cancer treatment strategy. Nevertheless, the prognostic predictive value of cuproptosis-related lncRNAs in breast cancer (BC) remains unknown. Using cuproptosis-related lncRNAs, this study aims to predict the immune microenvironment and prognosis of BC patients. and develop new therapeutic strategies that target the disease. Methods The Cancer Genome Atlas (TCGA) database provided the RNA-seq data along with the corresponding clinical and prognostic information. Univariate and multivariate Cox regression analyses were performed to acquire lncRNAs associated with cuproptosis to establish predictive features. The Kaplan-Meier method was used to calculate the overall survival rate (OS) in the high-risk and low-risk groups. High risk and low risk gene sets were enriched to explore functional discrepancies among risk teams. The mutation data were analyzed using the "MAFTools" r-package. The ties of predictive characteristics and immune status had been explored by single sample gene set enrichment analysis (ssGSEA). Last, the correlation between predictive features and treatment condition in patients with BC was analyzed. Based on prognostic risk models, we assessed associations between risk subgroups and immune scores and immune checkpoints. In addition, drug responses in at-risk populations were predicted. Results We identified a set of 11 Cuproptosis-Related lncRNAs (GORAB-AS1, AC 079922.2, AL 589765.4, AC 005696.4, Cytor, ZNF 197-AS1, AC 002398.1, AL 451085.3, YTH DF 3-AS1, AC 008771.1, LINC 02446), based on which to construct the risk model. In comparison to the high-risk group, the low-risk patients lived longer (p < 0.001). Moreover, cuproptosis-related lncRNA profiles can independently predict prognosis in BC patients. The AUC values for receiver operating characteristics (ROC) of 1-, 3-, and 5-year risk were 0.849, 0.779, and 0.794, respectively. Patients in the high-risk group had lower OS than those in the low-risk group when they were divided into groups based on various clinicopathological variables. The tumor burden mutations (TMB) correlation analysis showed that high TMB had a worse prognosis than low-TMB, and gene mutations were found to be different in high and low TMB groups, such as PIK3CA (36% versus 32%), SYNE1 (4% versus 6%). Gene enrichment analysis indicated that the differential genes were significantly concentrated in immune-related pathways. The predictive traits were significantly correlated with the immune status of BC patients, according to ssGSEA results. Finally, high-risk patients showed high sensitivity in anti-CD276 immunotherapy and conventional chemotherapeutic drugs such as imatinib, lapatinib, and pazopanib. Conclusion We successfully constructed of a cuproptosis-related lncRNA signature, which can independently predict the prognosis of BC patients and can be used to estimate OS and clinical treatment outcomes in BRCA patients. It will serve as a foundation for further research into the mechanism of cuproptosis-related lncRNAs in breast cancer, as well as for the development of new markers and therapeutic targets for the disease.
Collapse
Affiliation(s)
- Zi-Rong Jiang
- Department of Surgical Oncology, Ningde Municipal Hospital of Ningde Normal University, Teaching Hospital of Fujian Medical University, Ningde, China
| | - Lin-Hui Yang
- Department of Surgical Oncology, Ningde Municipal Hospital of Ningde Normal University, Teaching Hospital of Fujian Medical University, Ningde, China
| | - Liang-Zi Jin
- Institute of Medical Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Kunming, China
| | - Li-Mu Yi
- Department of Pathology, The First Affiliated Hospital of Guangdong University of Pharmacy, Guangzhou, China
| | - Ping-Ping Bing
- Academician Workstation, Changsha Medical University, Changsha, China
| | - Jun Zhou
- Academician Workstation, Changsha Medical University, Changsha, China
| | - Jia-Sheng Yang
- School of Electrical & Information Engineering, Anhui University of Technology, Ma’anshan, China
| |
Collapse
|
8
|
Zhang Y, Ye F, Gao X. MCA-Net: Multi-Feature Coding and Attention Convolutional Neural Network for Predicting lncRNA-Disease Association. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2907-2919. [PMID: 34283719 DOI: 10.1109/tcbb.2021.3098126] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
With the advent of the era of big data, it is troublesome to accurately predict the associations between lncRNAs and diseases based on traditional biological experiments due to its time-consuming and subjective. In this paper, we propose a novel deep learning method for predicting lncRNA-disease associations using multi-feature coding and attention convolutional neural network (MCA-Net). We first calculate six similarity features to extract different types of lncRNA and disease feature information. Second, a multi-feature coding method is proposed to construct the feature vectors of lncRNA-disease association samples by integrating the six similarity features. Furthermore, an attention convolutional neural network is developed to identify lncRNA-disease associations under 10-fold cross-validation. Finally, we evaluate the performance of MCA-Net from different perspectives including the effects of the model parameters, distinct deep learning models, and the necessity of attention mechanism. We also compare MCA-Net with several state-of-the-art methods on three publicly available datasets, i.e., LncRNADisease, Lnc2Cancer, and LncRNADisease2.0. The results show that our MCA-Net outperforms the state-of-the-art methods on all three dataset. Besides, case studies on breast cancer and lung cancer further verify that MCA-Net is effective and accurate for the lncRNA-disease association prediction.
Collapse
|
9
|
Linc00312 Single Nucleotide Polymorphism as Biomarker for Chemoradiotherapy Induced Hematotoxicity in Nasopharyngeal Carcinoma Patients. DISEASE MARKERS 2022; 2022:6707821. [PMID: 35990252 PMCID: PMC9381851 DOI: 10.1155/2022/6707821] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 03/30/2022] [Accepted: 07/06/2022] [Indexed: 12/08/2022]
Abstract
Background. Linc00312 is downregulated in nasopharyngeal carcinoma (NPC) and associates with poor treatment efficacy. Genetic variations are the main cause of individual differences in treatment response. The objective of this study is to explore the relationship between genetic variations of linc00312 and the risk of chemoradiotherapy induced toxic reactions in NPC patients. Methods. We used a bioinformatics approach to select 3 single nucleotide polymorphisms (SNPs) with regulatory feature in linc00312 (rs12497104, rs15734, and rs164966). 505 NPC patients receiving chemoradiotherapy with complete follow-up data were recruited. Genotyping was carried out by MassARRAY iPLEX platform. Univariate logistic and multivariate logistic regression were used to analyze the risk factors responsible for toxic reactions of NPC patients. Results. Our result demonstrated that linc00312 rs15734 (
) was significantly associated with severe leukopenia in NPC patients underwent chemoradiotherapy (AA vs. GG,
,
). In addition, the risk of severe leukopenia was remarkably increased to 5.635 times (
) in female with rs15734 AA genotype compared to male with rs15734 GG genotype. Moreover, patients with rs12497104 (
) AA genotype showed a 67.5% lower risk of thrombocytopenia than those with GG genotype (
). Remarkably, the younger patients (
) with rs12497104 AA genotype displayed a 90% decreased risk of thrombocytopenia compared with older patients (
) carrying rs12497104 GG genotype (
). Conclusions. Genetic variations of linc00312 affect the risk of chemoradiotherapy induced hematotoxicity in nasopharyngeal carcinoma patients and may serve as biomarkers for personalized medicine.
Collapse
|
10
|
Chen B, Wang T, Zhang J, Zhang S, Shang X. Identification of Colon Cancer-Related RNAs Based on Heterogeneous Networks and Random Walk. BIOLOGY 2022; 11:1003. [PMID: 36101384 PMCID: PMC9312154 DOI: 10.3390/biology11071003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 06/25/2022] [Accepted: 06/28/2022] [Indexed: 11/17/2022]
Abstract
Colon cancer is considered as a complex disease that consists of metastatic seeding in early stages. Such disease is not simply caused by the action of a single RNA, but is associated with disorders of many kinds of RNAs and their regulation relationships. Hence, it is of great significance to study the complex regulatory roles among mRNAs, miRNAs and lncRNAs for further understanding the pathogenic mechanism of colon cancer. In this study, we constructed a heterogeneous network consisting of differentially expressed mRNAs, miRNAs and lncRNAs. This contains three kinds of vertices and six types of edges. All RNAs were re-divided into three categories, which were "related", "irrelevant" and "unlabeled". They were processed by dynamic excitation restart random walk (RW-DIR) for identifying colon cancer-related RNAs. Ten RNAs were finally obtained related to colon cancer, which were hsa-miR-2682-5p, hsa-miR-1277-3p, ANGPTL1, SLC22A18AS, FENDRR, PHLPP2, hsa-miR-302a-5p, APCDD1, MEX3A and hsa-miR-509-3-5p. Numerical experiments have indicated that the proposed network construction framework and the following RW-DIR algorithm are effective for identifying colon cancer-related RNAs, and this kind of analysis framework can also be easily extended to other diseases, effectively narrowing the scope of biological experimental research.
Collapse
Affiliation(s)
- Bolin Chen
- School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China; (B.C.); (T.W.); (J.Z.)
| | - Teng Wang
- School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China; (B.C.); (T.W.); (J.Z.)
| | - Jinlei Zhang
- School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China; (B.C.); (T.W.); (J.Z.)
| | - Shengli Zhang
- School of Information Technology, Minzu Normal University of Xingyi, Xingyi 562400, China;
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China; (B.C.); (T.W.); (J.Z.)
| |
Collapse
|
11
|
Silva ABOV, Spinosa EJ. Graph Convolutional Auto-Encoders for Predicting Novel lncRNA-Disease Associations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2264-2271. [PMID: 33819159 DOI: 10.1109/tcbb.2021.3070910] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
LncRNAs are intermediate molecules that participate in the most diverse biological processes in humans, such as gene expression control and X-chromosome inactivation. Numerous researches have associated lncRNAs with a wide range of diseases, such as breast cancer, leukemia, and many other conditions. In this work, we propose a graph-based method named PANDA. This method treats the prediction of new associations between lncRNAs and diseases as a link prediction problem in a graph. We start by building a heterogeneous graph that contains the known associations between lncRNAs and diseases and additional information such as gene expression levels and symptoms of diseases. We then use a Graph Auto-encoder to learn the representation of the nodes' features and edges, finally applying a Neural Network to predict potentially interesting novel edges. The experimental results indicate that PANDA achieved a 0.976 AUC-ROC, surpassing state-of-the-art methods for the same problem, showing that PANDA could be a promising approach to generate embeddings to predict potentially novel lncRNA-disease associations.
Collapse
|
12
|
Jiang M, Zhou B, Chen L. Identification of drug side effects with a path-based method. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:5754-5771. [PMID: 35603377 DOI: 10.3934/mbe.2022269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The study of drug side effects is a significant task in drug discovery. Candidate drugs with unaccepted side effects must be eliminated to prevent risks for both patients and pharmaceutical companies. Thus, all side effects for any candidate drug should be determined. However, this task, which is carried out through traditional experiments, is time-consuming and expensive. Building computational methods has been increasingly used for the identification of drug side effects. In the present study, a new path-based method was proposed to determine drug side effects. A heterogeneous network was built to perform such method, which defined drugs and side effects as nodes. For any drug and side effect, the proposed path-based method determined all paths with limited length that connects them and further evaluated the association between them based on these paths. The strong association indicates that the drug has a side effect with a high probability. By using two types of jackknife test, the method yielded good performance and was superior to some other network-based methods. Furthermore, the effects of one parameter in the method and heterogeneous network was analyzed.
Collapse
Affiliation(s)
- Meng Jiang
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Bo Zhou
- Shanghai University of Medicine & Health Sciences, Shanghai 201318, China
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| |
Collapse
|
13
|
Xie G, Jiang J, Sun Y. LDA-LNSUBRW: lncRNA-Disease Association Prediction Based on Linear Neighborhood Similarity and Unbalanced bi-Random Walk. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:989-997. [PMID: 32870798 DOI: 10.1109/tcbb.2020.3020595] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Increasing number of experiments show that lncRNAs are involved in many biological processes, and their mutations and disorders are associated with many diseases. However, verifying the relationships between lncRNAs and diseases is time consuming and laborio. Searching for effective computational methods will contribute to our understanding of the underlying mechanisms of disease and identifying biomarkers of diseases. Therefore, we proposed a method called lncRNA-disease association prediction based on linear neighborhood similarity and unbalanced bi-random walk (LDA-LNSUBRW). Given that the known lncRNA-disease associations are rare, a pretreatment step should be performed to obtain the interaction possibility of unknown cases, so as to help us predict the potential associations. In the framework of leave-one-out cross-validation (LOOCV)and fivefold cross-validation (5-fold CV), LDA-LNSUBRW achieved effective performance with AUC of 0.8874 and 0.8632 ± 0.0051, respectively. The experimental results in this paper show that the proposed method is superior to five other state-of-the-art methods. In addition, case studies of three diseases (lung cancer, breast cancer, and osteosarcoma)were carried out to illustrate that LDA-LNSUBRW could predict the relevant lncRNAs.
Collapse
|
14
|
Chen M, Deng Y, Li A, Tan Y. Inferring Latent Disease-lncRNA Associations by Label-Propagation Algorithm and Random Projection on a Heterogeneous Network. Front Genet 2022; 13:798632. [PMID: 35186029 PMCID: PMC8854791 DOI: 10.3389/fgene.2022.798632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/18/2022] [Indexed: 11/13/2022] Open
Abstract
Long noncoding RNA (lncRNA), a type of more than 200 nucleotides non-coding RNA, is related to various complex diseases. To precisely identify the potential lncRNA–disease association is important to understand the disease pathogenesis, to develop new drugs, and to design individualized diagnosis and treatment methods for different human diseases. Compared with the complexity and high cost of biological experiments, computational methods can quickly and effectively predict potential lncRNA–disease associations. Thus, it is a promising avenue to develop computational methods for lncRNA-disease prediction. However, owing to the low prediction accuracy ofstate of the art methods, it is vastly challenging to accurately and effectively identify lncRNA-disease at present. This article proposed an integrated method called LPARP, which is based on label-propagation algorithm and random projection to address the issue. Specifically, the label-propagation algorithm is initially used to obtain the estimated scores of lncRNA–disease associations, and then random projections are used to accurately predict disease-related lncRNAs.The empirical experiments showed that LAPRP achieved good prediction on three golddatasets, which is superior to existing state-of-the-art prediction methods. It can also be used to predict isolated diseases and new lncRNAs. Case studies of bladder cancer, esophageal squamous-cell carcinoma, and colorectal cancer further prove the reliability of the method. The proposed LPARP algorithm can predict the potential lncRNA–disease interactions stably and effectively with fewer data. LPARP can be used as an effective and reliable tool for biomedical research.
Collapse
|
15
|
Fan Y, Dong X, Li M, Liu P, Zheng J, Li H, Zhang Y. LncRNA KRT19P3 Is Involved in Breast Cancer Cell Proliferation, Migration and Invasion. Front Oncol 2022; 11:799082. [PMID: 35059320 PMCID: PMC8763666 DOI: 10.3389/fonc.2021.799082] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 12/08/2021] [Indexed: 12/13/2022] Open
Abstract
Long non-coding RNAs (LncRNAs) have already been taken as critical regulatory molecules in breast carcinoma (BC). Besides, the progression of BC is closely associated with the immune system. However, the relationship between lncRNAs and the tumor immune system in BC has not been fully studied. LncRNA KRT19P3 has been reported to inhibit the progression of gastric cancer. In the present study, we first discovered that KRT19P3 was downregulated in BC tissues compared with para cancer tissue. Then we showed that KRT19P3 could be used as a marker to differentiate BC from para cancer tissue. Increased expression of KRT19P3 markedly inhibited the proliferation, migration, and invasion rate of BC cells in vitro and tumor growth of BC in vivo. Conversely, KRT19P3 knockdown by siRNA markedly promoted the proliferation, migration, and invasion rate of BC cells after being transfected. Comparison of clinical parameters showed an inverse relationship between the expression of KRT19P3 and pathological grade. Furthermore, immunohistochemistry (IHC) was applied to reveal the positive rate of the expression of Ki-67, programmed death-ligand 1 (PD-L1), and CD8 in BC tissues. Correlation analysis showed that Ki-67 and PD-L1 were inversely proportional to KRT19P3 but CD8 was directly proportional to KRT19P3. In conclusion, this study demonstrated that lncRNA KRT19P3 inhibits BC progression, and may affect the expression of PD-L1 in BC, which in turn affects CD8+ T (CD8 positive Cytotoxic T lymphocyte) cells in the immune microenvironment.
Collapse
Affiliation(s)
- Yanping Fan
- Pathology Department, First Affiliated Hospital of Weifang Medical University (Weifang People's Hospital), Weifang, China.,Department of Basic Medicine, Weifang Medical University, Weifang, China
| | - Xiaotong Dong
- Pathology Department, First Affiliated Hospital of Weifang Medical University (Weifang People's Hospital), Weifang, China.,Department of Basic Medicine, Weifang Medical University, Weifang, China
| | - Meizeng Li
- Pathology Department, First Affiliated Hospital of Weifang Medical University (Weifang People's Hospital), Weifang, China.,Department of Basic Medicine, Weifang Medical University, Weifang, China
| | - Pengju Liu
- School of Economics, Qingdao University, Qingdao, China
| | - Jie Zheng
- Department of Basic Medicine, Weifang Medical University, Weifang, China
| | - Hongli Li
- Department of Basic Medicine, Weifang Medical University, Weifang, China
| | - Yunxiang Zhang
- Pathology Department, First Affiliated Hospital of Weifang Medical University (Weifang People's Hospital), Weifang, China
| |
Collapse
|
16
|
Wang L, Shang M, Dai Q, He PA. Prediction of lncRNA-disease association based on a Laplace normalized random walk with restart algorithm on heterogeneous networks. BMC Bioinformatics 2022; 23:5. [PMID: 34983367 PMCID: PMC8729064 DOI: 10.1186/s12859-021-04538-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/15/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND More and more evidence showed that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human sophisticated diseases. Therefore, predicting human lncRNA-disease associations is a challenging and urgently task in bioinformatics to research of human sophisticated diseases. RESULTS In the work, a global network-based computational framework called as LRWRHLDA were proposed which is a universal network-based method. Firstly, four isomorphic networks include lncRNA similarity network, disease similarity network, gene similarity network and miRNA similarity network were constructed. And then, six heterogeneous networks include known lncRNA-disease, lncRNA-gene, lncRNA-miRNA, disease-gene, disease-miRNA, and gene-miRNA associations network were applied to design a multi-layer network. Finally, the Laplace normalized random walk with restart algorithm in this global network is suggested to predict the relationship between lncRNAs and diseases. CONCLUSIONS The ten-fold cross validation is used to evaluate the performance of LRWRHLDA. As a result, LRWRHLDA achieves an AUC of 0.98402, which is higher than other compared methods. Furthermore, LRWRHLDA can predict isolated disease-related lnRNA (isolated lnRNA related disease). The results for colorectal cancer, lung adenocarcinoma, stomach cancer and breast cancer have been verified by other researches. The case studies indicated that our method is effective.
Collapse
Affiliation(s)
- Liugen Wang
- School of Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Min Shang
- School of Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Qi Dai
- College of Life Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Ping-An He
- School of Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China.
| |
Collapse
|
17
|
Bao MH, Zhang RQ, Huang XS, Zhou J, Guo Z, Xu BF, Liu R. Transcriptomic and Proteomic Profiling of Human Stable and Unstable Carotid Atherosclerotic Plaques. Front Genet 2021; 12:755507. [PMID: 34804124 PMCID: PMC8599967 DOI: 10.3389/fgene.2021.755507] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 10/12/2021] [Indexed: 01/09/2023] Open
Abstract
Atherosclerosis is a chronic inflammatory disease with high prevalence and mortality. The rupture of atherosclerotic plaque is the main reason for the clinical events caused by atherosclerosis. Making clear the transcriptomic and proteomic profiles between the stabe and unstable atherosclerotic plaques is crucial to prevent the clinical manifestations. In the present study, 5 stable and 5 unstable human carotid atherosclerotic plaques were obtained by carotid endarterectomy. The samples were used for the whole transcriptome sequencing (RNA-Seq) by the Next-Generation Sequencing using the Illumina HiSeq, and for proteome analysis by HPLC-MS/MS. The lncRNA-targeted genes and circRNA-originated genes were identified by analyzing their location and sequence. Gene Ontology and KEGG enrichment was carried out to analyze the functions of differentially expressed RNAs and proteins. The protein-protein interactions (PPI) network was constructed by the online tool STRING. The consistency of transcriptome and proteome were analyzed, and the lncRNA/circRNA-miRNA-mRNA interactions were predicted. As a result, 202 mRNAs, 488 lncRNAs, 91 circRNAs, and 293 proteins were identified to be differentially expressed between stable and unstable atherosclerotic plaques. The 488 lncRNAs might target 381 protein-coding genes by cis-acting mechanisms. Sequence analysis indicated the 91 differentially expressed circRNAs were originated from 97 protein-coding genes. These differentially expressed RNAs and proteins were mainly enriched in the terms of the cellular response to stress or stimulus, the regulation of gene transcription, the immune response, the nervous system functions, the hematologic activities, and the endocrine system. These results were consistent with the previous reported data in the dataset GSE41571. Further analysis identified CD5L, S100A12, CKB (target gene of lncRNA MSTRG.11455.17), CEMIP (target gene of lncRNA MSTRG.12845), and SH3GLB1 (originated gene of hsacirc_000411) to be critical genes in regulating the stability of atherosclerotic plaques. Our results provided a comprehensive transcriptomic and proteomic knowledge on the stability of atherosclerotic plaques.
Collapse
Affiliation(s)
- Mei-Hua Bao
- Academician Workstation, Changsha, China.,School of Stomatology, Changsha Medical University, Changsha, China
| | - Ruo-Qi Zhang
- School of Stomatology, Changsha Medical University, Changsha, China
| | - Xiao-Shan Huang
- Department of Pharmacology, Changsha Health Vocational College, Changsha, China
| | - Ji Zhou
- Academician Workstation, Changsha, China
| | - Zhen Guo
- Academician Workstation, Changsha, China
| | - Bao-Feng Xu
- Academician Workstation, Changsha, China.,First Hospital of Jilin University, Changchun, Jilin, China
| | - Rui Liu
- Academician Workstation, Changsha, China.,Department of VIP Unit, China-Japan Union Hospital of Jilin University, Changchun, China
| |
Collapse
|
18
|
Yao Y, Ji B, Lv Y, Li L, Xiang J, Liao B, Gao W. Predicting LncRNA-Disease Association by a Random Walk With Restart on Multiplex and Heterogeneous Networks. Front Genet 2021; 12:712170. [PMID: 34490041 PMCID: PMC8417042 DOI: 10.3389/fgene.2021.712170] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 07/23/2021] [Indexed: 02/05/2023] Open
Abstract
Studies have found that long non-coding RNAs (lncRNAs) play important roles in many human biological processes, and it is critical to explore potential lncRNA-disease associations, especially cancer-associated lncRNAs. However, traditional biological experiments are costly and time-consuming, so it is of great significance to develop effective computational models. We developed a random walk algorithm with restart on multiplex and heterogeneous networks of lncRNAs and diseases to predict lncRNA-disease associations (MHRWRLDA). First, multiple disease similarity networks are constructed by using different approaches to calculate similarity scores between diseases, and multiple lncRNA similarity networks are also constructed by using different approaches to calculate similarity scores between lncRNAs. Then, a multiplex and heterogeneous network was constructed by integrating multiple disease similarity networks and multiple lncRNA similarity networks with the lncRNA-disease associations, and a random walk with restart on the multiplex and heterogeneous network was performed to predict lncRNA-disease associations. The results of Leave-One-Out cross-validation (LOOCV) showed that the value of Area under the curve (AUC) was 0.68736, which was improved compared with the classical algorithm in recent years. Finally, we confirmed a few novel predicted lncRNAs associated with specific diseases like colon cancer by literature mining. In summary, MHRWRLDA contributes to predict lncRNA-disease associations.
Collapse
Affiliation(s)
- Yuhua Yao
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
- Key Laboratory of Data Science and Intelligence Education, Ministry of Education, Hainan Normal University, Haikou, China
- Key Laboratory of Computational Science and Application of Hainan Province, Hainan Normal University, Haikou, China
| | - Binbin Ji
- Geneis Beijing Co., Ltd., Beijing, China
| | - Yaping Lv
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Ling Li
- Basic Courses Department, Zhejiang Shuren University, Hangzhou, China
| | - Ju Xiang
- School of Computer Science and Engineering, Central South University, Changsha, China
- Department of Basic Medical Sciences, Changsha Medical University, Changsha, China
- Department of Computer Science, Changsha Medical University, Changsha, China
| | - Bo Liao
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Wei Gao
- Departments of Internal Medicine-Oncology, Fujian Cancer Hospital & Fujian Medical University Cancer Hospital, Fuzhou, China
| |
Collapse
|
19
|
Yao Y, Ji B, Lv Y, Li L, Xiang J, Liao B, Gao W. Predicting LncRNA–Disease Association by a Random Walk With Restart on Multiplex and Heterogeneous Networks. Front Genet 2021. [DOI: https:/doi.org/10.3389/fgene.2021.712170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Studies have found that long non-coding RNAs (lncRNAs) play important roles in many human biological processes, and it is critical to explore potential lncRNA–disease associations, especially cancer-associated lncRNAs. However, traditional biological experiments are costly and time-consuming, so it is of great significance to develop effective computational models. We developed a random walk algorithm with restart on multiplex and heterogeneous networks of lncRNAs and diseases to predict lncRNA–disease associations (MHRWRLDA). First, multiple disease similarity networks are constructed by using different approaches to calculate similarity scores between diseases, and multiple lncRNA similarity networks are also constructed by using different approaches to calculate similarity scores between lncRNAs. Then, a multiplex and heterogeneous network was constructed by integrating multiple disease similarity networks and multiple lncRNA similarity networks with the lncRNA–disease associations, and a random walk with restart on the multiplex and heterogeneous network was performed to predict lncRNA–disease associations. The results of Leave-One-Out cross-validation (LOOCV) showed that the value of Area under the curve (AUC) was 0.68736, which was improved compared with the classical algorithm in recent years. Finally, we confirmed a few novel predicted lncRNAs associated with specific diseases like colon cancer by literature mining. In summary, MHRWRLDA contributes to predict lncRNA–disease associations.
Collapse
|
20
|
Liu Z, Hong ZP, Xi SX. RUNX3 Expression Level Is Correlated with the Clinical and Pathological Characteristics in Endometrial Cancer: A Systematic Review and Meta-analysis. BIOMED RESEARCH INTERNATIONAL 2021; 2021:9995384. [PMID: 34337071 PMCID: PMC8298141 DOI: 10.1155/2021/9995384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 05/07/2021] [Accepted: 06/30/2021] [Indexed: 11/20/2022]
Abstract
Human Runt-associated transcription factor 3 (RUNX3) plays an important role in the development and progression of endometrial cancer (EC). However, the clinical and pathological significance of RUNX3 in EC needs to be further studied. In order to clarify the clinical and pathological significance of RUNX3, a systematic review and meta-analysis was conducted in EC patients. Keywords RUNX3, endometrial cancer, and uterine cancer were searched in Cochrane Library, Web of Knowledge, PubMed, CBM, MEDLINE, and Chinese CNKI database for data up to Dec 31, 2018. References, abstracts, and meeting proceedings were manually searched in supplementary. Outcomes were various clinical and pathological features. The two reviewers performed the literature searching, data extracting, and method assessing independently. Meta-analysis was performed by RevMan5.3.0. A total of 563 EC patients were enrolled from eight studies. Meta-analysis results showed that the expression of RUNX3 has significant differences in these comparisons: lymph node (LN) metastasis vs. non-LN metastasis (P = 0.26), EC tissues vs. normal tissues (P < 0.00001), clinical stages I/II vs. II/IV (P < 0.00001), muscular infiltration < 1/2 vs. muscular infiltration ≥ 1/2 (P < 0.00001), and G1 vs. G2/G3 (P < 0.00001). The decreasing expression of RUNX3 is associated with poor TNM stage and muscular infiltration. It is indicated that RUNX3 was involved in the suppression effect of EC. However, further multicenter randomized controlled trials are needed considering the small sample size of the included trials.
Collapse
Affiliation(s)
- Zhen Liu
- Department of Gynecology, Chifeng Municipal Hospital, Chifeng Clinical Medical School of Inner Mongolia Medical University, Chifeng, China
| | - Zhi-pan Hong
- Department of Tumor Surgery, Chifeng Municipal Hospital, Chifeng Clinical Medical School of Inner Mongolia Medical University, Chifeng, China
| | - Shu-xue Xi
- Geneis (Beijing) Co. Ltd., Beijing 100102, China
- Qingdao Geneis Institute of Big Data Mining and Precision Medicine, Qingdao 266000, China
| |
Collapse
|
21
|
Chowdhary A, Satagopam V, Schneider R. Long Non-coding RNAs: Mechanisms, Experimental, and Computational Approaches in Identification, Characterization, and Their Biomarker Potential in Cancer. Front Genet 2021; 12:649619. [PMID: 34276764 PMCID: PMC8281131 DOI: 10.3389/fgene.2021.649619] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 04/20/2021] [Indexed: 01/09/2023] Open
Abstract
Long non-coding RNAs are diverse class of non-coding RNA molecules >200 base pairs of length having various functions like gene regulation, dosage compensation, epigenetic regulation. Dysregulation and genomic variations of several lncRNAs have been implicated in several diseases. Their tissue and developmental specific expression are contributing factors for them to be viable indicators of physiological states of the cells. Here we present an comprehensive review the molecular mechanisms and functions, state of the art experimental and computational pipelines and challenges involved in the identification and functional annotation of lncRNAs and their prospects as biomarkers. We also illustrate the application of co-expression networks on the TCGA-LIHC dataset for putative functional predictions of lncRNAs having a therapeutic potential in Hepatocellular carcinoma (HCC).
Collapse
Affiliation(s)
- Anshika Chowdhary
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
22
|
Yan H, Chai H, Zhao H. Detecting lncRNA-Cancer Associations by Combining miRNAs, Genes, and Prognosis With Matrix Factorization. Front Genet 2021; 12:639872. [PMID: 34262591 PMCID: PMC8273282 DOI: 10.3389/fgene.2021.639872] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 04/15/2021] [Indexed: 11/13/2022] Open
Abstract
Motivation: Long non-coding RNAs (lncRNAs) play important roles in cancer development. Prediction of lncRNA–cancer association is necessary for efficiently discovering biomarkers and designing treatment for cancers. Currently, several methods have been developed to predict lncRNA–cancer associations. However, most of them do not consider the relationships between lncRNA with other molecules and with cancer prognosis, which has limited the accuracy of the prediction. Method: Here, we constructed relationship matrices between 1,679 lncRNAs, 2,759 miRNAs, and 16,410 genes and cancer prognosis on three types of cancers (breast, lung, and colorectal cancers) to predict lncRNA–cancer associations. The matrices were iteratively reconstructed by matrix factorization to optimize low-rank size. This method is called detecting lncRNA cancer association (DRACA). Results: Application of this method in the prediction of lncRNAs–breast cancer, lncRNA–lung cancer, and lncRNA–colorectal cancer associations achieved an area under curve (AUC) of 0.810, 0.796, and 0.795, respectively, by 10-fold cross-validations. The performances of DRACA in predicting associations between lncRNAs with three kinds of cancers were at least 6.6, 7.2, and 6.9% better than other methods, respectively. To our knowledge, this is the first method employing cancer prognosis in the prediction of lncRNA–cancer associations. When removing the relationships between cancer prognosis and genes, the AUCs were decreased 7.2, 0.6, and 5% for breast, lung, and colorectal cancers, respectively. Moreover, the predicted lncRNAs were found with greater numbers of somatic mutations than the lncRNAs not predicted as cancer-associated for three types of cancers. DRACA predicted many novel lncRNAs, whose expressions were found to be related to survival rates of patients. The method is available at https://github.com/Yanh35/DRACA.
Collapse
Affiliation(s)
- Huan Yan
- Department of Medical Research Center, Sun Yat-sen Memorial Hospital, Guangzhou, China.,Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangzhou, China
| | - Hua Chai
- School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China
| | - Huiying Zhao
- Department of Medical Research Center, Sun Yat-sen Memorial Hospital, Guangzhou, China.,Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangzhou, China
| |
Collapse
|
23
|
Peng X, Chen L, Zhou JP. Identification of Carcinogenic Chemicals with Network Embedding and Deep Learning Methods. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200414084317] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Background:
Cancer is the second leading cause of human death in the world. To date,
many factors have been confirmed to be the cause of cancer. Among them, carcinogenic chemicals
have been widely accepted as the important ones. Traditional methods for detecting carcinogenic
chemicals are of low efficiency and high cost.
Objective:
The aim of this study was to design an efficient computational method for the
identification of carcinogenic chemicals.
Methods:
A new computational model was proposed for detecting carcinogenic chemicals. As a
data-driven model, carcinogenic and non-carcinogenic chemicals were obtained from Carcinogenic
Potency Database (CPDB). These chemicals were represented by features extracted from five
chemical networks, representing five types of chemical associations, via a network embedding
method, Mashup. Obtained features were fed into a powerful deep learning method, recurrent
neural network, to build the model.
Results:
The jackknife test on such model provided the F-measure of 0.971 and AUROC of 0.971.
Conclusion:
The proposed model was quite effective and was superior to the models with
traditional machine learning algorithms, classic chemical encoding schemes or direct usage of
chemical associations.
Collapse
Affiliation(s)
- Xuefei Peng
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Jian-Peng Zhou
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| |
Collapse
|
24
|
Seifuddin F, Pirooznia M. Bioinformatics Approaches for Functional Prediction of Long Noncoding RNAs. Methods Mol Biol 2021; 2254:1-13. [PMID: 33326066 DOI: 10.1007/978-1-0716-1158-6_1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
There is accumulating evidence that long noncoding RNAs (lncRNAs) play crucial roles in biological processes and diseases. In recent years, computational models have been widely used to predict potential lncRNA-disease relations. In this chapter, we systematically describe various computational algorithms and prediction tools that have been developed to elucidate the roles of lncRNAs in diseases, coding potential/functional characterization, or ascertaining their involvement in critical biological processes as well as provide a comprehensive summary of these applications.
Collapse
Affiliation(s)
- Fayaz Seifuddin
- Bioinformatics and Computational Biology, National Heart, Lung, and Blood Institute National Institutes of Health, Bethesda, MD, USA
| | - Mehdi Pirooznia
- Bioinformatics and Computational Biology, National Heart, Lung, and Blood Institute National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
25
|
Yan C, Zhang Z, Bao S, Hou P, Zhou M, Xu C, Sun J. Computational Methods and Applications for Identifying Disease-Associated lncRNAs as Potential Biomarkers and Therapeutic Targets. MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 21:156-171. [PMID: 32585624 PMCID: PMC7321789 DOI: 10.1016/j.omtn.2020.05.018] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 04/06/2020] [Accepted: 05/18/2020] [Indexed: 12/12/2022]
Abstract
Long non-coding RNAs (lncRNAs) have been recognized as critical components of a broad genomic regulatory network and play pivotal roles in physiological and pathological processes. Identification of disease-associated lncRNAs is becoming increasingly crucial for fundamentally improving our understanding of molecular mechanisms of disease and developing novel biomarkers and therapeutic targets. Considering lower efficiency and higher time and labor cost of biological experiments, computer-aided inference of disease-associated RNAs has become a promising avenue for facilitating the study of lncRNA functions and provides complementary value for experimental studies. In this study, we first summarize data and knowledge resources publicly available for the study of lncRNA-disease associations. Then, we present an updated systematic overview of dozens of computational methods and models for inferring lncRNA-disease associations proposed in recent years. Finally, we explore the perspectives and challenges for further studies. Our study provides a guide for biologists and medical scientists to look for dedicated resources and more competent tools for accelerating the unraveling of disease-associated lncRNAs.
Collapse
Affiliation(s)
- Congcong Yan
- School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, P.R. China
| | - Zicheng Zhang
- School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, P.R. China
| | - Siqi Bao
- School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, P.R. China
| | - Ping Hou
- School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, P.R. China
| | - Meng Zhou
- School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, P.R. China
| | - Chongyong Xu
- Department of Radiology, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, P.R. China.
| | - Jie Sun
- School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, P.R. China.
| |
Collapse
|
26
|
Zhang Y, Ye F, Xiong D, Gao X. LDNFSGB: prediction of long non-coding rna and disease association using network feature similarity and gradient boosting. BMC Bioinformatics 2020; 21:377. [PMID: 32883200 PMCID: PMC7469344 DOI: 10.1186/s12859-020-03721-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 08/21/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND A large number of experimental studies show that the mutation and regulation of long non-coding RNAs (lncRNAs) are associated with various human diseases. Accurate prediction of lncRNA-disease associations can provide a new perspective for the diagnosis and treatment of diseases. The main function of many lncRNAs is still unclear and using traditional experiments to detect lncRNA-disease associations is time-consuming. RESULTS In this paper, we develop a novel and effective method for the prediction of lncRNA-disease associations using network feature similarity and gradient boosting (LDNFSGB). In LDNFSGB, we first construct a comprehensive feature vector to effectively extract the global and local information of lncRNAs and diseases through considering the disease semantic similarity (DISSS), the lncRNA function similarity (LNCFS), the lncRNA Gaussian interaction profile kernel similarity (LNCGS), the disease Gaussian interaction profile kernel similarity (DISGS), and the lncRNA-disease interaction (LNCDIS). Particularly, two methods are used to calculate the DISSS (LNCFS) for considering the local and global information of disease semantics (lncRNA functions) respectively. An autoencoder is then used to reduce the dimensionality of the feature vector to obtain the optimal feature parameter from the original feature set. Furthermore, we employ the gradient boosting algorithm to obtain the lncRNA-disease association prediction. CONCLUSIONS In this study, hold-out, leave-one-out cross-validation, and ten-fold cross-validation methods are implemented on three publicly available datasets to evaluate the performance of LDNFSGB. Extensive experiments show that LDNFSGB dramatically outperforms other state-of-the-art methods. The case studies on six diseases, including cancers and non-cancers, further demonstrate the effectiveness of our method in real-world applications.
Collapse
Affiliation(s)
- Yuan Zhang
- School of Mathematics and Computational Science, Xiangtan University, Xiangtan 411105, China
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan 411105, China
| | - Fei Ye
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan 411105, China
| | - Dapeng Xiong
- Department of Computational Biology, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Xieping Gao
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan 411105, China.
- College of Medical Imaging and Inspection, Xiangnan University, Chenzhou 423000, China.
| |
Collapse
|
27
|
Wekesa JS, Meng J, Luan Y. Multi-feature fusion for deep learning to predict plant lncRNA-protein interaction. Genomics 2020; 112:2928-2936. [PMID: 32437848 DOI: 10.1016/j.ygeno.2020.05.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 04/22/2020] [Accepted: 05/05/2020] [Indexed: 12/28/2022]
Abstract
Long non-coding RNAs (lncRNAs) play key roles in regulating cellular biological processes through diverse molecular mechanisms including binding to RNA binding proteins. The majority of plant lncRNAs are functionally uncharacterized, thus, accurate prediction of plant lncRNA-protein interaction is imperative for subsequent functional studies. We present an integrative model, namely DRPLPI. Its uniqueness is that it predicts by multi-feature fusion. Structural and four groups of sequence features are used, including tri-nucleotide composition, gapped k-mer, recursive complement and binary profile. We design a multi-head self-attention long short-term memory encoder-decoder network to extract generative high-level features. To obtain robust results, DRPLPI combines categorical boosting and extra trees into a single meta-learner. Experiments on Zea mays and Arabidopsis thaliana obtained 0.9820 and 0.9652 area under precision/recall curve (AUPRC) respectively. The proposed method shows significant enhancement in the prediction performance compared with existing state-of-the-art methods.
Collapse
Affiliation(s)
- Jael Sanyanda Wekesa
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116023, China; School of Computing and Information Technology, Jomo Kenyatta University of Agriculture and Technology, Nairobi 62000-00200, Kenya
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116023, China.
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning 116023, China
| |
Collapse
|
28
|
A random forest based computational model for predicting novel lncRNA-disease associations. BMC Bioinformatics 2020; 21:126. [PMID: 32216744 PMCID: PMC7099795 DOI: 10.1186/s12859-020-3458-1] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 03/18/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Accumulated evidence shows that the abnormal regulation of long non-coding RNA (lncRNA) is associated with various human diseases. Accurately identifying disease-associated lncRNAs is helpful to study the mechanism of lncRNAs in diseases and explore new therapies of diseases. Many lncRNA-disease association (LDA) prediction models have been implemented by integrating multiple kinds of data resources. However, most of the existing models ignore the interference of noisy and redundancy information among these data resources. RESULTS To improve the ability of LDA prediction models, we implemented a random forest and feature selection based LDA prediction model (RFLDA in short). First, the RFLDA integrates the experiment-supported miRNA-disease associations (MDAs) and LDAs, the disease semantic similarity (DSS), the lncRNA functional similarity (LFS) and the lncRNA-miRNA interactions (LMI) as input features. Then, the RFLDA chooses the most useful features to train prediction model by feature selection based on the random forest variable importance score that takes into account not only the effect of individual feature on prediction results but also the joint effects of multiple features on prediction results. Finally, a random forest regression model is trained to score potential lncRNA-disease associations. In terms of the area under the receiver operating characteristic curve (AUC) of 0.976 and the area under the precision-recall curve (AUPR) of 0.779 under 5-fold cross-validation, the performance of the RFLDA is better than several state-of-the-art LDA prediction models. Moreover, case studies on three cancers demonstrate that 43 of the 45 lncRNAs predicted by the RFLDA are validated by experimental data, and the other two predicted lncRNAs are supported by other LDA prediction models. CONCLUSIONS Cross-validation and case studies indicate that the RFLDA has excellent ability to identify potential disease-associated lncRNAs.
Collapse
|
29
|
Zhang Y, Chen M, Li A, Cheng X, Jin H, Liu Y. LDAI-ISPS: LncRNA-Disease Associations Inference Based on Integrated Space Projection Scores. Int J Mol Sci 2020; 21:E1508. [PMID: 32098405 PMCID: PMC7073162 DOI: 10.3390/ijms21041508] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 02/18/2020] [Accepted: 02/19/2020] [Indexed: 12/14/2022] Open
Abstract
Long non-coding RNAs (long ncRNAs, lncRNAs) of all kinds have been implicated in a range of cell developmental processes and diseases, while they are not translated into proteins. Inferring diseases associated lncRNAs by computational methods can be helpful to understand the pathogenesis of diseases, but those current computational methods still have not achieved remarkable predictive performance: such as the inaccurate construction of similarity networks and inadequate numbers of known lncRNA-disease associations. In this research, we proposed a lncRNA-disease associations inference based on integrated space projection scores (LDAI-ISPS) composed of the following key steps: changing the Boolean network of known lncRNA-disease associations into the weighted networks via combining all the global information (e.g., disease semantic similarities, lncRNA functional similarities, and known lncRNA-disease associations); obtaining the space projection scores via vector projections of the weighted networks to form the final prediction scores without biases. The leave-one-out cross validation (LOOCV) results showed that, compared with other methods, LDAI-ISPS had a higher accuracy with area-under-the-curve (AUC) value of 0.9154 for inferring diseases, with AUC value of 0.8865 for inferring new lncRNAs (whose associations related to diseases are unknown), with AUC value of 0.7518 for inferring isolated diseases (whose associations related to lncRNAs are unknown). A case study also confirmed the predictive performance of LDAI-ISPS as a helper for traditional biological experiments in inferring the potential LncRNA-disease associations and isolated diseases.
Collapse
Affiliation(s)
- Yi Zhang
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| | - Min Chen
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang 421002, China
| | - Ang Li
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang 421002, China
| | - Xiaohui Cheng
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| | - Hong Jin
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| | - Yarong Liu
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| |
Collapse
|
30
|
Tan H, Sun Q, Li G, Xiao Q, Ding P, Luo J, Liang C. Multiview Consensus Graph Learning for lncRNA-Disease Association Prediction. Front Genet 2020; 11:89. [PMID: 32153646 PMCID: PMC7047769 DOI: 10.3389/fgene.2020.00089] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Accepted: 01/27/2020] [Indexed: 12/11/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) are a class of noncoding RNA molecules longer than 200 nucleotides. Recent studies have uncovered their functional roles in diverse cellular processes and tumorigenesis. Therefore, identifying novel disease-related lncRNAs might deepen our understanding of disease etiology. However, due to the relatively small number of verified associations between lncRNAs and diseases, it remains a challenging task to reliably and effectively predict the associated lncRNAs for given diseases. In this paper, we propose a novel multiview consensus graph learning method to infer potential disease-related lncRNAs. Specifically, we first construct a set of similarity matrices for lncRNAs and diseases by taking advantage of the known associations. We then iteratively learn a consensus graph from the multiple input matrices and simultaneously optimize the predicted association probability based on a multi-label learning framework. To convey the utility of our method, three state-of-the-art methods are compared with our method on three widely used datasets. The experiment results illustrate that our method could obtain the best prediction performance under different cross validation schemes. The case study analysis implemented for uterine cervical neoplasms further confirmed the utility of our method in identifying lncRNAs as potential prognostic biomarkers in practice.
Collapse
Affiliation(s)
- Haojiang Tan
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Quanmeng Sun
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Guanghui Li
- School of Information Engineering, East China Jiaotong University, Nanchang, China
| | - Qiu Xiao
- College of Information Science and Engineering, Hunan Normal University, Changsha, China
| | - Pingjian Ding
- School of Computer Science, University of South China, Hengyang, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Cheng Liang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| |
Collapse
|
31
|
Ru X, Wang L, Li L, Ding H, Ye X, Zou Q. Exploration of the correlation between GPCRs and drugs based on a learning to rank algorithm. Comput Biol Med 2020; 119:103660. [PMID: 32090901 DOI: 10.1016/j.compbiomed.2020.103660] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 02/04/2020] [Accepted: 02/12/2020] [Indexed: 02/01/2023]
Abstract
Exploring the protein - drug correlation can not only solve the problem of selecting candidate compounds but also solve related problems such as drug redirection and finding potential drug targets. Therefore, many researchers have proposed different machine learning methods for prediction of protein-drug correlations. However, many existing models simply divide the protein-drug relationship into related or irrelevant categories and do not deeply explore the most relevant target (or drug) for a given drug (or target). In order to solve this problem, this paper applies the ranking concept to the prediction of the GPCR (G Protein-Coupled Receptors)-drug correlation. This study uses two different types of data sets to explore candidate compound and potential target problems, and both sets achieved good results. In addition, this study also found that the family to which a protein belongs is not an inherent factor that affects the ranking of GPCR-drug correlations; however, if the drug affects other family members of the protein, then the protein is likely to be a potential target of the drug. This study showed that the learning to rank algorithm is a good tool for exploring protein-drug correlations.
Collapse
Affiliation(s)
- Xiaoqing Ru
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
| | - Lida Wang
- Scientific Research Department, Heilongjiang Agricultural Recalmation General Hospital, Harbin, China.
| | - Lihong Li
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
| | - Hui Ding
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba Science City, Japan
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China; Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
| |
Collapse
|
32
|
Li W, Wang S, Xu J, Mao G, Tian G, Yang J. Inferring Latent Disease-lncRNA Associations by Faster Matrix Completion on a Heterogeneous Network. Front Genet 2019; 10:769. [PMID: 31572428 PMCID: PMC6749816 DOI: 10.3389/fgene.2019.00769] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Accepted: 07/19/2019] [Indexed: 11/26/2022] Open
Abstract
Current studies have shown that long non-coding RNAs (lncRNAs) play a crucial role in a variety of fundamental biological processes related to complex human diseases. The prediction of latent disease-lncRNA associations can help to understand the pathogenesis of complex human diseases at the level of lncRNA, which also contributes to the detection of disease biomarkers, and the diagnosis, treatment, prognosis and prevention of disease. Nevertheless, it is still a challenging and urgent task to accurately identify latent disease-lncRNA association. Discovering latent links on the basis of biological experiments is time-consuming and wasteful, necessitating the development of computational prediction models. In this study, a computational prediction model has been remodeled as a matrix completion framework of the recommendation system by completing the unknown items in the rating matrix. A novel method named faster randomized matrix completion for latent disease-lncRNA association prediction (FRMCLDA) has been proposed by virtue of improved randomized partial SVD (rSVD-BKI) on a heterogeneous bilayer network. First, the correlated data source and experimentally validated information of diseases and lncRNAs are integrated to construct a heterogeneous bilayer network. Next, the integrated heterogeneous bilayer network can be formalized as a comprehensive adjacency matrix which includes lncRNA similarity matrix, disease similarity matrix, and disease-lncRNA association matrix where the uncertain disease-lncRNA associations are referred to as blank items. Then, a matrix approximate to the original adjacency matrix has been designed with predicted scores to retrieve the blank items. The construction of the approximate matrix could be equivalently resolved by the nuclear norm minimization. Finally, a faster singular value thresholding algorithm with a randomized partial SVD combing a new sub-space reuse technique has been utilized to complete the adjacency matrix. The results of leave-one-out cross-validation (LOOCV) experiments and 5-fold cross-validation (5-fold CV) experiments on three different benchmark databases have confirmed the availability and adaptability of FRMCLDA in inferring latent relationships of disease-lncRNA pairs, and in inferring lncRNAs correlated with novel diseases without any prior interaction information. Additionally, case studies have shown that FRMCLDA is able to effectively predict latent lncRNAs correlated with three widespread malignancies: prostate cancer, colon cancer, and gastric cancer.
Collapse
Affiliation(s)
- Wen Li
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Shulin Wang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Junlin Xu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Guo Mao
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Geng Tian
- Geneis Beijing Co., Ltd., Beijing, China
| | | |
Collapse
|
33
|
CNNDLP: A Method Based on Convolutional Autoencoder and Convolutional Neural Network with Adjacent Edge Attention for Predicting lncRNA-Disease Associations. Int J Mol Sci 2019; 20:ijms20174260. [PMID: 31480319 PMCID: PMC6747450 DOI: 10.3390/ijms20174260] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 08/28/2019] [Accepted: 08/28/2019] [Indexed: 12/11/2022] Open
Abstract
It is well known that the unusual expression of long non-coding RNAs (lncRNAs) is closely related to the physiological and pathological processes of diseases. Therefore, inferring the potential lncRNA–disease associations are helpful for understanding the molecular pathogenesis of diseases. Most previous methods have concentrated on the construction of shallow learning models in order to predict lncRNA-disease associations, while they have failed to deeply integrate heterogeneous multi-source data and to learn the low-dimensional feature representations from these data. We propose a method based on the convolutional neural network with the attention mechanism and convolutional autoencoder for predicting candidate disease-related lncRNAs, and refer to it as CNNDLP. CNNDLP integrates multiple kinds of data from heterogeneous sources, including the associations, interactions, and similarities related to the lncRNAs, diseases, and miRNAs. Two different embedding layers are established by combining the diverse biological premises about the cases that the lncRNAs are likely to associate with the diseases. We construct a novel prediction model based on the convolutional neural network with attention mechanism and convolutional autoencoder to learn the attention and the low-dimensional network representations of the lncRNA–disease pairs from the embedding layers. The different adjacent edges among the lncRNA, miRNA, and disease nodes have different contributions for association prediction. Hence, an attention mechanism at the adjacent edge level is established, and the left side of the model learns the attention representation of a pair of lncRNA and disease. A new type of lncRNA similarity and a new type of disease similarity are calculated by incorporating the topological structures of multiple bipartite networks. The low-dimensional network representation of the lncRNA-disease pairs is further learned by the autoencoder based convolutional neutral network on the right side of the model. The cross-validation experimental results confirm that CNNDLP has superior prediction performance compared to the state-of-the-art methods. Case studies on stomach cancer, breast cancer, and prostate cancer further show the ability of CNNDLP for discovering the potential disease lncRNAs.
Collapse
|
34
|
Xie G, Meng T, Luo Y, Liu Z. SKF-LDA: Similarity Kernel Fusion for Predicting lncRNA-Disease Association. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 18:45-55. [PMID: 31514111 PMCID: PMC6742806 DOI: 10.1016/j.omtn.2019.07.022] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Revised: 07/13/2019] [Accepted: 07/24/2019] [Indexed: 01/24/2023]
Abstract
Recently, prediction of lncRNA-disease associations has attracted more and more attentions. Various computational models have been proposed; however, there is still room to improve the prediction accuracy. In this paper, we propose a kernel fusion method with different types of similarities for the lncRNAs and diseases. The expression similarity and cosine similarity are used for lncRNAs, and the semantic similarity and cosine similarity are used for the diseases. To eliminate the noise effect, a neighbor constraint is enforced to refine all the similarity matrices before fusion. Experimental results show that the proposed similarity kernel fusion (SKF)-LDA method has the superiority performance in terms of AUC values and other measurements. In the schemes of LOOCV and 5-fold CV, AUC values of SKF-LDA achieve 0.9049 and 0.8743±0.0050 respectively. In addition, the conducted case studies of three diseases (hepatocellular carcinoma, lung cancer, and prostate cancer) show that SKF-LDA can predict related lncRNAs accurately.
Collapse
Affiliation(s)
- Guobo Xie
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Tengfei Meng
- School of Computer Science, Guangdong University of Technology, Guangzhou, China
| | - Yu Luo
- School of Computer Science, Guangdong University of Technology, Guangzhou, China.
| | - Zhenguo Liu
- Department of Thoracic Surgery, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
35
|
Xu L, Liang G, Liao C, Chen GD, Chang CC. An Efficient Classifier for Alzheimer's Disease Genes Identification. Molecules 2018; 23:molecules23123140. [PMID: 30501121 PMCID: PMC6321377 DOI: 10.3390/molecules23123140] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 11/17/2018] [Accepted: 11/19/2018] [Indexed: 11/16/2022] Open
Abstract
Alzheimer’s disease (AD) is considered to one of 10 key diseases leading to death in humans. AD is considered the main cause of brain degeneration, and will lead to dementia. It is beneficial for affected patients to be diagnosed with the disease at an early stage so that efforts to manage the patient can begin as soon as possible. Most existing protocols diagnose AD by way of magnetic resonance imaging (MRI). However, because the size of the images produced is large, existing techniques that employ MRI technology are expensive and time-consuming to perform. With this in mind, in the current study, AD is predicted instead by the use of a support vector machine (SVM) method based on gene-coding protein sequence information. In our proposed method, the frequency of two consecutive amino acids is used to describe the sequence information. The accuracy of the proposed method for identifying AD is 85.7%, which is demonstrated by the obtained experimental results. The experimental results also show that the sequence information of gene-coding proteins can be used to predict AD.
Collapse
Affiliation(s)
- Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen 518055, China.
| | - Guangmin Liang
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen 518055, China.
| | - Changrui Liao
- Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province, College of Optoelectronic Engineering, Shenzhen University, Shenzhen 518060, China.
| | - Gin-Den Chen
- Department of Obstetrics and Gynecology, Chung Shan Medical University Hospital, Taichung 40201, Taiwan.
| | - Chi-Chang Chang
- School of Medical Informatics, Chung Shan Medical University, Taichung 40201, Taiwan.
- IT Office, Chung Shan Medical University Hospital, Taichung 40201, Taiwan.
| |
Collapse
|