1
|
Baird T, Roychoudhuri R. GS-TCGA: Gene Set-Based Analysis of The Cancer Genome Atlas. J Comput Biol 2024; 31:229-240. [PMID: 38436570 DOI: 10.1089/cmb.2023.0278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2024] Open
Abstract
Most tools for analyzing large gene expression datasets, including The Cancer Genome Atlas (TCGA), have focused on analyzing the expression of individual genes or inference of the abundance of specific cell types from whole transcriptome information. While these methods provide useful insights, they can overlook crucial process-based information that may enhance our understanding of cancer biology. In this study, we describe three novel tools incorporated into an online resource; gene set-based analysis of The Cancer Genome Atlas (GS-TCGA). GS-TCGA is designed to enable user-friendly exploration of TCGA data using gene set-based analysis, leveraging gene sets from the Molecular Signatures Database. GS-TCGA includes three unique tools: GS-Surv determines the association between the expression of gene sets and survival in human cancers. Co-correlative gene set enrichment analysis (CC-GSEA) utilizes interpatient heterogeneity in cancer gene expression to infer functions of specific genes based on GSEA of coregulated genes in TCGA. GS-Corr utilizes interpatient heterogeneity in cancer gene expression profiles to identify genes coregulated with the expression of specific gene sets in TCGA. Users are also able to upload custom gene sets for analysis with each tool. These tools empower researchers to perform survival analysis linked to gene set expression, explore the functional implications of gene coexpression, and identify potential gene regulatory mechanisms.
Collapse
Affiliation(s)
- Tarrion Baird
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | - Rahul Roychoudhuri
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
2
|
Guo S, Li T, Xu D, Xu J, Wang H, Li J, Bi X, Cao M, Xu Z, Xia Q, Cui Y, Li K. Prognostic Implications and Immune Infiltration Characteristics of Chromosomal Instability-Related Dysregulated CeRNA in Lung Adenocarcinoma. Front Mol Biosci 2022; 9:843640. [PMID: 35419410 PMCID: PMC8995899 DOI: 10.3389/fmolb.2022.843640] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Accepted: 02/22/2022] [Indexed: 12/14/2022] Open
Abstract
An accumulating body of research indicates that long-noncoding RNAs (lncRNAs) regulate the target genes and act as competitive endogenous RNAs (ceRNAs) playing an indispensable role in lung adenocarcinoma (LUAD). LUAD is frequently accompanied by the feature of chromosomal instability (CIN); however, CIN-related ceRNAs have not been investigated yet. We systematically analyzed and integrated CIN-related dysregulated ceRNAs characteristics in LUAD samples for the first time. In TCGA LUAD cohort, CIN in tumor samples was significantly higher than that in those of adjacent, and patients with high CIN risk tended to have worse clinical outcomes. We constructed a double-weighted CIN-related dysregulated ceRNA network, in which edge weight and node weight represented the disorder extent of ceRNA and the correlation of RNA expression level and prognosis, respectively. After module mining and analysis, a potential prognostic biomarker composed of 12 RNAs (8 mRNAs and 4 lncRNAs) named CIN-related dysregulated ceRNAs (CRDC) was obtained. The CRDC risk score had a positive relation with clinical stage and CIN, and patients with high CRDC risk scores exhibited poor prognosis. Moreover, CRDC tended to be an independent risk factor with high robustness to overcome the effect of multicollinearity among other explanatory variables for disease-specific survival (DSS) in TCGA and two GEO cohorts. The result of functional analysis indicated that CRDC was involved in multiple cancer progresses, especially immune-related pathways. The patients with lower CRDC risk had higher B cell, T cell CD4+, T cell CD8+, neutrophil, macrophage, and myeloid dendritic cell infiltration than the patients with higher CRDC risk. Meanwhile, patients with lower CRDC risk could get more benefits from immunological therapy. The results suggested that the CRDC could be a potential prognostic biomarker and an immunotherapy predictor for lung adenocarcinoma.
Collapse
Affiliation(s)
- Shengnan Guo
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Tianhao Li
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Dahua Xu
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Jiankai Xu
- College of Bioinformatics Science and Technology, Cancer Hospital, Harbin Medical University, Harbin, China
| | - Hong Wang
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Jian Li
- College of Bioinformatics Science and Technology, Cancer Hospital, Harbin Medical University, Harbin, China
| | - Xiaoman Bi
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Meng Cao
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Zhizhou Xu
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
| | - Qianfeng Xia
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, NHC Key Laboratory of Control of Tropical Diseases, School of Tropical Medicine, The Second Affiliated Hospital, Hainan Medical University, Haikou, China
- *Correspondence: Qianfeng Xia, ; Ying Cui, ; Kongning Li,
| | - Ying Cui
- College of Bioinformatics Science and Technology, Cancer Hospital, Harbin Medical University, Harbin, China
- *Correspondence: Qianfeng Xia, ; Ying Cui, ; Kongning Li,
| | - Kongning Li
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Institute of Nephrology Second Affiliated Hospital and Hainan General Hospital, Hainan Medical University, Haikou, China
- *Correspondence: Qianfeng Xia, ; Ying Cui, ; Kongning Li,
| |
Collapse
|
3
|
Wang W, Liu W. PCLasso: a protein complex-based, group lasso-Cox model for accurate prognosis and risk protein complex discovery. Brief Bioinform 2021; 22:6291946. [PMID: 34086850 DOI: 10.1093/bib/bbab212] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 05/08/2021] [Accepted: 05/15/2021] [Indexed: 12/12/2022] Open
Abstract
For high-dimensional expression data, most prognostic models perform feature selection based on individual genes, which usually lead to unstable prognosis, and the identified risk genes are inherently insufficient in revealing complex molecular mechanisms. Since most genes carry out cellular functions by forming protein complexes-basic representatives of functional modules, identifying risk protein complexes may greatly improve our understanding of disease biology. Coupled with the fact that protein complexes have been shown to have innate resistance to batch effects and are effective predictors of disease phenotypes, constructing prognostic models and selecting features with protein complexes as the basic unit should improve the robustness and biological interpretability of the model. Here, we propose a protein complex-based, group lasso-Cox model (PCLasso) to predict patient prognosis and identify risk protein complexes. Experiments on three cancer types have proved that PCLasso has better prognostic performance than prognostic models based on individual genes. The resulting risk protein complexes not only contain individual risk genes but also incorporate close partners that synergize with them, which may promote the revealing of molecular mechanisms related to cancer progression from a comprehensive perspective. Furthermore, a pan-cancer prognostic analysis was performed to identify risk protein complexes of 19 cancer types, which may provide novel potential targets for cancer research.
Collapse
Affiliation(s)
- Wei Wang
- Heilongjiang Institute of Technology, Harbin 150050, China
| | - Wei Liu
- School of Science at Heilongjiang Institute of Technology, Harbin 150050, China
| |
Collapse
|
4
|
Sun D, Ren X, Ari E, Korcsmaros T, Csermely P, Wu LY. Discovering cooperative biomarkers for heterogeneous complex disease diagnoses. Brief Bioinform 2019; 20:89-101. [PMID: 28968712 DOI: 10.1093/bib/bbx090] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Indexed: 12/13/2022] Open
Abstract
Biomarkers with high reproducibility and accurate prediction performance can contribute to comprehending the underlying pathogenesis of related complex diseases and further facilitate disease diagnosis and therapy. Techniques integrating gene expression profiles and biological networks for the identification of network-based disease biomarkers are receiving increasing interest. The biomarkers for heterogeneous diseases often exhibit strong cooperative effects, which implies that a set of genes may achieve more accurate outcome prediction than any single gene. In this study, we evaluated various biomarker identification methods that consider gene cooperative effects implicitly or explicitly, and proposed the gene cooperation network to explicitly model the cooperative effects of gene combinations. The gene cooperation network-enhanced method, named as MarkRank, achieves superior performance compared with traditional biomarker identification methods in both simulation studies and real data sets. The biomarkers identified by MarkRank not only have a better prediction accuracy but also have stronger topological relationships in the biological network and exhibit high specificity associated with the related diseases. Furthermore, the top genes identified by MarkRank involve crucial biological processes of related diseases and give a good prioritization for known disease genes. In conclusion, MarkRank suggests that explicit modeling of gene cooperative effects can greatly improve biomarker identification for complex diseases, especially for diseases with high heterogeneity.
Collapse
Affiliation(s)
- Duanchen Sun
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, and School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Xianwen Ren
- Biodynamic Optical Imaging Center, Peking University, Beijing, China
| | - Eszter Ari
- Department of Genetics, Eötvös Loránd University, Budapest
| | - Tamas Korcsmaros
- Institute of Food Research and the Earlham Institute, Norwich, UK
| | - Peter Csermely
- Department of Medical Chemistry, Semmelweis University, Budapest, Hungary
| | - Ling-Yun Wu
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, and School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
5
|
Shao B, Bjaanæs MM, Helland Å, Schütte C, Conrad T. EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma. PLoS One 2019; 14:e0204186. [PMID: 30703089 PMCID: PMC6354965 DOI: 10.1371/journal.pone.0204186] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Accepted: 12/25/2018] [Indexed: 12/16/2022] Open
Abstract
Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility is criticized. The performance of feature selection algorithms is shown to be affected by the datasets, underlying networks and evaluation metrics. One of the causes is the curse of dimensionality, which makes it hard to select the features that generalize well on independent data. Even the integration of biological networks does not mitigate this issue because the networks are large and many of their components are not relevant for the phenotype of interest. With the availability of multi-omics data, integrative approaches are being developed to build more robust predictive models. In this scenario, the higher data dimensions create greater challenges. We proposed a phenotype relevant network-based feature selection (PRNFS) framework and demonstrated its advantages in lung cancer prognosis prediction. We constructed cancer prognosis relevant networks based on epithelial mesenchymal transition (EMT) and integrated them with different types of omics data for feature selection. With less than 2.5% of the total dimensionality, we obtained EMT prognostic signatures that achieved remarkable prediction performance (average AUC values >0.8), very significant sample stratifications, and meaningful biological interpretations. In addition to finding EMT signatures from different omics data levels, we combined these single-omics signatures into multi-omics signatures, which improved sample stratifications significantly. Both single- and multi-omics EMT signatures were tested on independent multi-omics lung cancer datasets and significant sample stratifications were obtained.
Collapse
Affiliation(s)
- Borong Shao
- Zuse Institute Berlin, Berlin, Germany
- Dept of mathematics and computer science, Freie Universität Berlin, Berlin, Germany
- * E-mail:
| | - Maria Moksnes Bjaanæs
- Dept of Oncology, Oslo University Hospital, Oslo, Norway
- Dept of Cancer Genetics, Oslo University Hospital, Oslo, Norway
- Dept of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Åslaug Helland
- Dept of Oncology, Oslo University Hospital, Oslo, Norway
- Dept of Cancer Genetics, Oslo University Hospital, Oslo, Norway
- Dept of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Christof Schütte
- Zuse Institute Berlin, Berlin, Germany
- Dept of mathematics and computer science, Freie Universität Berlin, Berlin, Germany
| | - Tim Conrad
- Zuse Institute Berlin, Berlin, Germany
- Dept of mathematics and computer science, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
6
|
An N, Zhao C, Yu Z, Yang X. Identification of prognostic genes in colorectal cancer through transcription profiling of multi-stage carcinogenesis. Oncol Lett 2018; 17:432-441. [PMID: 30655784 DOI: 10.3892/ol.2018.9632] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Accepted: 07/09/2018] [Indexed: 01/02/2023] Open
Abstract
Colorectal cancer is a complex multistage process following the adenoma-carcinoma sequence. Additional research on the basis of molecular dysregulations, particularly in the precancerous stage, may provide insight into the realization of potential biomarkers and therapeutic targets for the disease. In the present study, the expression profile of human multistage colorectal mucosa tissues, including healthy, adenoma and adenocarcinoma samples, was downloaded. Genes that were consistently differentially expressed in precancerous tissues and cancer samples were collected. Based on a merged biological network, the biggest connected component composed of these identified genes and their one-step neighbors were retrieved to conduct random walk with restart algorithm, in order to identify genes significantly affected during carcinogenesis. Therefore, 35 genes significantly affected by carcinogenic dysregulation were successfully identified. Survival and Cox analysis indicated that the expression of these genes was an independent prognostic factor confirmed by six cohorts. In summary, based on the transcription profile of multi-stage carcinogenesis and bioinformatics analysis, 35 genes significantly associated with patient survival were successfully identified, which may serve as promising therapeutic targets for the disease.
Collapse
Affiliation(s)
- Ning An
- Department of Oncology, Affiliated Hospital of Qingdao University, Qingdao, Shandong 266003, P.R. China
| | - Chen Zhao
- Department of Anatomy, School of Basic Medicine, Qingdao University, Qingdao, Shandong 266071, P.R. China
| | - Zhuang Yu
- Department of Oncology, Affiliated Hospital of Qingdao University, Qingdao, Shandong 266003, P.R. China
| | - Xue Yang
- Department of Oncology, Affiliated Hospital of Qingdao University, Qingdao, Shandong 266003, P.R. China
| |
Collapse
|
7
|
Wang W, Liu W. Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery. Sci Rep 2018; 8:13202. [PMID: 30181543 PMCID: PMC6123437 DOI: 10.1038/s41598-018-31497-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 08/20/2018] [Indexed: 02/05/2023] Open
Abstract
Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival prediction problems with high-dimensional variables, but also is capable of identifying important variables according to variable importance automatically calculated within the algorithm. However, RSF still suffers from some problems such as limited predictive accuracy on independent datasets and limited biological interpretation of survival biomarkers. In this study, we integrated gene interaction information into a Reweighted RSF model (RRSF) to improve predictive accuracy and identify biologically meaningful survival markers. We applied RRSF to the prediction of patients with glioblastoma multiforme (GBM) and esophageal squamous cell carcinoma (ESCC). With a reconstructed global pathway network and an mRNA-lncRNA co-expression network as the prior gene interaction information, RRSF showed better overall predictive performance than RSF on three GBM and two ESCC datasets. In addition, RRSF identified a two-gene and three-lncRNA signature, which showed robust prognostic values and had high biological relevance to the development of GBM and ESCC, respectively.
Collapse
Affiliation(s)
- Wei Wang
- Department of Mathematics, Heilongjiang Institute of Technology, Harbin, 150050, China
| | - Wei Liu
- Department of Mathematics, Heilongjiang Institute of Technology, Harbin, 150050, China.
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area, Shantou University Medical College, Shantou, 515041, China.
| |
Collapse
|
8
|
Zhang A, Tian S. Classification of early-stage non-small cell lung cancer by weighing gene expression profiles with connectivity information. Biom J 2017; 60:537-546. [PMID: 29206308 DOI: 10.1002/bimj.201700010] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Revised: 09/10/2017] [Accepted: 10/22/2017] [Indexed: 11/11/2022]
Abstract
Pathway-based feature selection algorithms, which utilize biological information contained in pathways to guide which features/genes should be selected, have evolved quickly and become widespread in the field of bioinformatics. Based on how the pathway information is incorporated, we classify pathway-based feature selection algorithms into three major categories-penalty, stepwise forward, and weighting. Compared to the first two categories, the weighting methods have been underutilized even though they are usually the simplest ones. In this article, we constructed three different genes' connectivity information-based weights for each gene and then conducted feature selection upon the resulting weighted gene expression profiles. Using both simulations and a real-world application, we have demonstrated that when the data-driven connectivity information constructed from the data of specific disease under study is considered, the resulting weighted gene expression profiles slightly outperform the original expression profiles. In summary, a big challenge faced by the weighting method is how to estimate pathway knowledge-based weights more accurately and precisely. Only until the issue is conquered successfully will wide utilization of the weighting methods be impossible.
Collapse
Affiliation(s)
- Ao Zhang
- Intensive Care Unit (ICU), The First Hospital of Jilin University, Changchun, 130021, China
| | - Suyan Tian
- Division of Clinical Research, The First Hospital of Jilin University, Changchun, 130021, China
| |
Collapse
|
9
|
Liu W, Wang W, Tian G, Xie W, Lei L, Liu J, Huang W, Xu L, Li E. Topologically inferring pathway activity for precise survival outcome prediction: breast cancer as a case. MOLECULAR BIOSYSTEMS 2017; 13:537-548. [PMID: 28098303 DOI: 10.1039/c6mb00757k] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Accurately predicting the survival outcome of patients is of great importance in clinical cancer research. In the past decade, building survival prediction models based on gene expression data has received increasing interest. However, the existing methods are mainly based on individual gene signatures, which are known to have limited prediction accuracy on independent datasets and unclear biological relevance. Here, we propose a novel pathway-based survival prediction method called DRWPSurv in order to accurately predict survival outcome. DRWPSurv integrates gene expression profiles and prior gene interaction information to topologically infer survival associated pathway activities, and uses the pathway activities as features to construct Lasso-Cox model. It uses topological importance of genes evaluated by directed random walk to enhance the robustness of pathway activities and thereby improve the predictive performance. We applied DRWPSurv on three independent breast cancer datasets and compared the predictive performance with a traditional gene-based method and four pathway-based methods. Results showed that pathway-based methods obtained comparable or better predictive performance than the gene-based method, whereas DRWPSurv could predict survival outcome with better accuracy and robustness among the pathway-based methods. In addition, the risk pathways identified by DRWPSurv provide biologically informative models for breast cancer prognosis and treatment.
Collapse
Affiliation(s)
- Wei Liu
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area, Shantou University Medical College, Shantou, 515041, China. and Department of Mathematics, Heilongjiang Institute of Technology, Harbin, 150050, China
| | - Wei Wang
- Department of Mathematics, Heilongjiang Institute of Technology, Harbin, 150050, China
| | - Guohua Tian
- Department of Mathematics, Heilongjiang Institute of Technology, Harbin, 150050, China
| | - Wenming Xie
- Network Information Center, Shantou University Medical College, Shantou, 515041, China
| | - Li Lei
- Network Information Center, Shantou University Medical College, Shantou, 515041, China
| | - Jiujin Liu
- Network Information Center, Shantou University Medical College, Shantou, 515041, China
| | - Wanxun Huang
- Network Information Center, Shantou University Medical College, Shantou, 515041, China
| | - Liyan Xu
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area, Shantou University Medical College, Shantou, 515041, China. and Institute of Oncologic Pathology, Shantou University Medical College, Shantou, 515041, China
| | - Enmin Li
- The Key Laboratory of Molecular Biology for High Cancer Incidence Coastal Chaoshan Area, Shantou University Medical College, Shantou, 515041, China. and Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
| |
Collapse
|
10
|
Hashemikhabir S, Budak G, Janga SC. ExSurv: A Web Resource for Prognostic Analyses of Exons Across Human Cancers Using Clinical Transcriptomes. Cancer Inform 2016; 15:17-24. [PMID: 27528797 PMCID: PMC4976794 DOI: 10.4137/cin.s39367] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Revised: 05/25/2016] [Accepted: 05/30/2016] [Indexed: 01/01/2023] Open
Abstract
Survival analysis in biomedical sciences is generally performed by correlating the levels of cellular components with patients' clinical features as a common practice in prognostic biomarker discovery. While the common and primary focus of such analysis in cancer genomics so far has been to identify the potential prognostic genes, alternative splicing - a posttranscriptional regulatory mechanism that affects the functional form of a protein due to inclusion or exclusion of individual exons giving rise to alternative protein products, has increasingly gained attention due to the prevalence of splicing aberrations in cancer transcriptomes. Hence, uncovering the potential prognostic exons can not only help in rationally designing exon-specific therapeutics but also increase specificity toward more personalized treatment options. To address this gap and to provide a platform for rational identification of prognostic exons from cancer transcriptomes, we developed ExSurv (https://exsurv.soic.iupui.edu), a web-based platform for predicting the survival contribution of all annotated exons in the human genome using RNA sequencing-based expression profiles for cancer samples from four cancer types available from The Cancer Genome Atlas. ExSurv enables users to search for a gene of interest and shows survival probabilities for all the exons associated with a gene and found to be significant at the chosen threshold. ExSurv also includes raw expression values across the cancer cohort as well as the survival plots for prognostic exons. Our analysis of the resulting prognostic exons across four cancer types revealed that most of the survival-associated exons are unique to a cancer type with few processes such as cell adhesion, carboxylic, fatty acid metabolism, and regulation of T-cell signaling common across cancer types, possibly suggesting significant differences in the posttranscriptional regulatory pathways contributing to prognosis.
Collapse
Affiliation(s)
- Seyedsasan Hashemikhabir
- Department of Biohealth Informatics, School of Informatics and Computing, Indiana University Purdue University, Indianapolis, IN, USA
| | - Gungor Budak
- Department of Biohealth Informatics, School of Informatics and Computing, Indiana University Purdue University, Indianapolis, IN, USA
| | - Sarath Chandra Janga
- Department of Biohealth Informatics, School of Informatics and Computing, Indiana University Purdue University, Indianapolis, IN, USA
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 5021 Health Information and Translational Sciences (HITS), Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Medical Research and Library Building, Indianapolis, IN, USA
| |
Collapse
|
11
|
Feng L, Tong R, Liu X, Zhang K, Wang G, Zhang L, An N, Cheng S. A network-based method for identifying prognostic gene modules in lung squamous carcinoma. Oncotarget 2016; 7:18006-20. [PMID: 26919109 PMCID: PMC4951267 DOI: 10.18632/oncotarget.7632] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 02/13/2016] [Indexed: 12/23/2022] Open
Abstract
Similarities in gene expression between both developing embryonic and precancerous tissues and cancer tissues may help identify much-needed biomarkers and therapeutic targets in lung squamous carcinoma. In this study, human lung samples representing ten successive time points, from embryonic development to carcinogenesis, were used to construct global gene expression profiles. Differentially expressed genes with similar expression in precancerous and cancer samples were identified. Using a network-based greedy searching algorithm to analyze the training cohort (n = 69) and three independent testing cohorts, we successfully identified a significant 22-gene module in which expression levels were correlated with overall survival in lung squamous carcinoma patients.
Collapse
Affiliation(s)
- Lin Feng
- State Key Laboratory of Molecular Oncology, Department of Etiology and Carcinogenesis, Peking Union Medical College and Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, China
| | - Run Tong
- Department of Respiratory and Critical Care Medicine, China-Japan Friendship Hospital, Beijing, China
| | - Xiaohong Liu
- Department of Gynecology and Obstetrics, Maternal and Child Health Care Hospital of Haidian, Beijing, China
| | - Kaitai Zhang
- State Key Laboratory of Molecular Oncology, Department of Etiology and Carcinogenesis, Peking Union Medical College and Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, China
| | - Guiqi Wang
- Department of Endoscopy, Cancer Hospital, Chinese Academy of Medical Sciences, Beijing, China
| | - Lei Zhang
- Department of Endoscopy, Cancer Hospital, Chinese Academy of Medical Sciences, Beijing, China
| | - Ning An
- State Key Laboratory of Molecular Oncology, Department of Etiology and Carcinogenesis, Peking Union Medical College and Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, China
| | - Shujun Cheng
- State Key Laboratory of Molecular Oncology, Department of Etiology and Carcinogenesis, Peking Union Medical College and Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, China
| |
Collapse
|
12
|
Martinez-Ledesma E, Verhaak RGW, Treviño V. Identification of a multi-cancer gene expression biomarker for cancer clinical outcomes using a network-based algorithm. Sci Rep 2015. [PMID: 26202601 PMCID: PMC5378879 DOI: 10.1038/srep11966] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Cancer types are commonly classified by histopathology and more recently through molecular characteristics such as gene expression, mutations, copy number variations, and epigenetic alterations. These molecular characterizations have led to the proposal of prognostic biomarkers for many cancer types. Nevertheless, most of these biomarkers have been proposed for a specific cancer type or even specific subtypes. Although more challenging, it is useful to identify biomarkers that can be applied for multiple types of cancer. Here, we have used a network-based exploration approach to identify a multi-cancer gene expression biomarker highly connected by ESR1, PRKACA, LRP1, JUN and SMAD2 that can be predictive of clinical outcome in 12 types of cancer from The Cancer Genome Atlas (TCGA) repository. The gene signature of this biomarker is highly supported by cancer literature, biological terms, and prognostic power in other cancer types. Additionally, the signature does not seem to be highly associated with specific mutations or copy number alterations. Comparisons with cancer-type specific and other multi-cancer biomarkers in TCGA and other datasets showed that the performance of the proposed multi-cancer biomarker is superior, making the proposed approach and multi-cancer biomarker potentially useful in research and clinical settings.
Collapse
Affiliation(s)
- Emmanuel Martinez-Ledesma
- 1] Grupo de Enfoque e Investigación en Bioinformática, Departamento de Investigación e Innovación, Escuela Nacional de Medicina, Tecnológico de Monterrey, Monterrey, Nuevo León 64849, México [2] Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Roeland G W Verhaak
- 1] Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA [2] Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Victor Treviño
- Grupo de Enfoque e Investigación en Bioinformática, Departamento de Investigación e Innovación, Escuela Nacional de Medicina, Tecnológico de Monterrey, Monterrey, Nuevo León 64849, México
| |
Collapse
|
13
|
Network-based survival-associated module biomarker and its crosstalk with cell death genes in ovarian cancer. Sci Rep 2015; 5:11566. [PMID: 26099452 PMCID: PMC4477367 DOI: 10.1038/srep11566] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Accepted: 05/28/2015] [Indexed: 12/27/2022] Open
Abstract
Ovarian cancer remains a dismal disease with diagnosing in the late, metastatic stages, therefore, there is a growing realization of the critical need to develop effective biomarkers for understanding underlying mechanisms. Although existing evidences demonstrate the important role of the single genetic abnormality in pathogenesis, the perturbations of interactors in the complex network are often ignored. Moreover, ovarian cancer diagnosis and treatment still exist a large gap that need to be bridged. In this work, we adopted a network-based survival-associated approach to capture a 12-gene network module based on differential co-expression PPI network in the advanced-stage, high-grade ovarian serous cystadenocarcinoma. Then, regulatory genes (protein-coding genes and non-coding genes) direct interacting with the module were found to be significantly overlapped with cell death genes. More importantly, these overlapping genes tightly clustered together pointing to the module, deciphering the crosstalk between network-based survival-associated module and cell death in ovarian cancer.
Collapse
|
14
|
Discover the molecular biomarker associated with cell death and extracellular matrix module in ovarian cancer. BIOMED RESEARCH INTERNATIONAL 2015; 2015:735689. [PMID: 25861644 PMCID: PMC4378326 DOI: 10.1155/2015/735689] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Revised: 02/10/2015] [Accepted: 02/15/2015] [Indexed: 02/01/2023]
Abstract
High throughput technologies have provided many new research methods for ovarian cancer investigation. In tradition, in order to find the underlying functional mechanisms of the survival-associated genes, gene sets enrichment analysis (GSEA) is always regarded as the important choice. However, GSEA produces too many candidate genes and cannot discover the signaling transduction cascades. In this work, we have used a network-based strategy to optimize the discovery of biomarkers using multifactorial data, including patient expression, clinical survival, and protein-protein interaction (PPI) data. The biomarkers discovered by this strategy belong to the network-based biomarker, which is apt to reveal the underlying functional mechanisms of the biomarker. In this work, over 400 expression arrays in ovarian cancer have been analyzed: the results showed that cell death and extracellular module are the main themes related to ovarian cancer progression.
Collapse
|
15
|
Chen H, Zhu Z, Zhu Y, Wang J, Mei Y, Cheng Y. Pathway mapping and development of disease-specific biomarkers: protein-based network biomarkers. J Cell Mol Med 2015; 19:297-314. [PMID: 25560835 PMCID: PMC4407592 DOI: 10.1111/jcmm.12447] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Accepted: 08/22/2014] [Indexed: 01/06/2023] Open
Abstract
It is known that a disease is rarely a consequence of an abnormality of a single gene, but reflects the interactions of various processes in a complex network. Annotated molecular networks offer new opportunities to understand diseases within a systems biology framework and provide an excellent substrate for network-based identification of biomarkers. The network biomarkers and dynamic network biomarkers (DNBs) represent new types of biomarkers with protein-protein or gene-gene interactions that can be monitored and evaluated at different stages and time-points during development of disease. Clinical bioinformatics as a new way to combine clinical measurements and signs with human tissue-generated bioinformatics is crucial to translate biomarkers into clinical application, validate the disease specificity, and understand the role of biomarkers in clinical settings. In this article, the recent advances and developments on network biomarkers and DNBs are comprehensively reviewed. How network biomarkers help a better understanding of molecular mechanism of diseases, the advantages and constraints of network biomarkers for clinical application, clinical bioinformatics as a bridge to the development of diseases-specific, stage-specific, severity-specific and therapy predictive biomarkers, and the potentials of network biomarkers are also discussed.
Collapse
Affiliation(s)
- Hao Chen
- Department of Cardiothoracic Surgery, Tongji Hospital, Tongji University, Shanghai, China
| | | | | | | | | | | |
Collapse
|
16
|
Liu W, Wang Q, Zhao J, Zhang C, Liu Y, Zhang J, Bai X, Li X, Feng H, Liao M, Wang W, Li C. Integration of pathway structure information into a reweighted partial Cox regression approach for survival analysis on high-dimensional gene expression data. MOLECULAR BIOSYSTEMS 2015; 11:1876-86. [DOI: 10.1039/c5mb00044k] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Accurately predicting the risk of cancer relapse or death is important for clinical utility.
Collapse
|
17
|
Cheng CP, Kuo IY, Alakus H, Frazer KA, Harismendy O, Wang YC, Tseng VS. Network-based analysis identifies epigenetic biomarkers of esophageal squamous cell carcinoma progression. ACTA ACUST UNITED AC 2014; 30:3054-61. [PMID: 25015989 DOI: 10.1093/bioinformatics/btu433] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
MOTIVATION A rapid progression of esophageal squamous cell carcinoma (ESCC) causes a high mortality rate because of the propensity for metastasis driven by genetic and epigenetic alterations. The identification of prognostic biomarkers would help prevent or control metastatic progression. Expression analyses have been used to find such markers, but do not always validate in separate cohorts. Epigenetic marks, such as DNA methylation, are a potential source of more reliable and stable biomarkers. Importantly, the integration of both expression and epigenetic alterations is more likely to identify relevant biomarkers. RESULTS We present a new analysis framework, using ESCC progression-associated gene regulatory network (GRN escc), to identify differentially methylated CpG sites prognostic of ESCC progression. From the CpG loci differentially methylated in 50 tumor-normal pairs, we selected 44 CpG loci most highly associated with survival and located in the promoters of genes more likely to belong to GRN escc. Using an independent ESCC cohort, we confirmed that 8/10 of CpG loci in the promoter of GRN escc genes significantly correlated with patient survival. In contrast, 0/10 CpG loci in the promoter genes outside the GRN escc were correlated with patient survival. We further characterized the GRN escc network topology and observed that the genes with methylated CpG loci associated with survival deviated from the center of mass and were less likely to be hubs in the GRN escc. We postulate that our analysis framework improves the identification of bona fide prognostic biomarkers from DNA methylation studies, especially with partial genome coverage.
Collapse
Affiliation(s)
- Chun-Pei Cheng
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan
| | - I-Ying Kuo
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan
| | - Hakan Alakus
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Ph
| | - Kelly A Frazer
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Ph
| | - Olivier Harismendy
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan
| | - Yi-Ching Wang
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan
| | - Vincent S Tseng
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan, Moores Cancer Center, University of California San Diego, La Jolla, CA 92093, USA, Institute of Basic Medical Sciences, National Cheng Kung University, Tainan 701, Taiwan, Department of Pediatrics and Rady Children's Hospital, University of California San Diego, La Jolla, CA 92093, USA, Department of General, Visceral and Cancer Surgery, University of Cologne, Köln, Germany, Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA, Department of Pharmacology and Institute of Medical Informatics, National Cheng Kung University, Tainan 701, Taiwan
| |
Collapse
|
18
|
Ben-Hamo R, Gidoni M, Efroni S. PhenoNet: identification of key networks associated with disease phenotype. Bioinformatics 2014; 30:2399-405. [PMID: 24812342 DOI: 10.1093/bioinformatics/btu199] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
MOTIVATION At the core of transcriptome analyses of cancer is a challenge to detect molecular differences affiliated with disease phenotypes. This approach has led to remarkable progress in identifying molecular signatures and in stratifying patients into clinical groups. Yet, despite this progress, many of the identified signatures are not robust enough to be clinically used and not consistent enough to provide a follow-up on molecular mechanisms. RESULTS To address these issues, we introduce PhenoNet, a novel algorithm for the identification of pathways and networks associated with different phenotypes. PhenoNet uses two types of input data: gene expression data (RMA, RPKM, FPKM, etc.) and phenotypic information, and integrates these data with curated pathways and protein-protein interaction information. Comprehensive iterations across all possible pathways and subnetworks result in the identification of key pathways or subnetworks that distinguish between the two phenotypes. AVAILABILITY AND IMPLEMENTATION Matlab code is available upon request. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rotem Ben-Hamo
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 5290002, Israel
| | - Moriah Gidoni
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 5290002, Israel
| | - Sol Efroni
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 5290002, Israel
| |
Collapse
|
19
|
Targeted development of specific biomarkers of endometrial stromal cell differentiation using bioinformatics: the IFITM1 model. Mod Pathol 2014; 27:569-79. [PMID: 24072182 DOI: 10.1038/modpathol.2013.123] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Revised: 05/13/2013] [Accepted: 05/14/2013] [Indexed: 12/24/2022]
Abstract
When classifying cellular uterine mesenchymal neoplasms, histological distinction of endometrial stromal from smooth muscle neoplasms can be difficult. The only widely established marker of endometrial stromal differentiation, CD10, has marginal specificity. We took a bioinformatics approach to identify more specific markers of endometrial stromal differentiation by searching the Human Protein Atlas, a public database of protein expression profiles. After screening the database using different methods, interferon-induced transmembrane protein 1 (IFITM1) was selected for further analysis. Immunohistochemistry for IFITM1 was performed using tissue sections from the selected cases of proliferative endometrium (22), secretory endometrium (6), inactive endometrium (19), adenomyosis (10), conventional leiomyoma (11), cellular leiomyoma (16), endometrial stromal nodule (2), low-grade endometrial stromal sarcoma (16), high-grade endometrial stromal sarcoma (2) and undifferentiated uterine sarcoma (2). Stained slides were scored in terms of intensity and distribution. Normal endometrial samples uniformly showed diffuse and strong IFITM1 staining. Endometrial stromal neoplasms, particularly low-grade endometrial stromal sarcoma, showed higher IFITM1 expression compared with smooth muscle neoplasms (P<0.0001). IFITM1 immunohistochemistry has high sensitivity and specificity, particularly in the distinction between low-grade endometrial stromal sarcoma and leiomyoma (81.2 and 86.7%, respectively). Our results indicate that IFITM1 is a sensitive and specific marker of endometrial stromal differentiation across the spectrum from proliferative endometrium to metastatic stromal sarcoma. IFITM1 is a potential valuable addition to immunohistochemical panels used in the diagnosis of cellular mesenchymal uterine tumors. Further studies with larger number of cases are necessary to corroborate this impression and determine the utility of IFITM1 in routine practice. This study is a clear example of how bioinformatics, particularly tools for mining genomic and proteomic databases, can enhance and accelerate biomarker development in diagnostic pathology.
Collapse
|