Reference Citation Analysis

For: Kim D, Shin H, Sohn KA, Verma A, Ritchie MD, Kim JH. Incorporating inter-relationships between different levels of genomic data into cancer clinical outcome prediction. Methods 2014;67:344-53. [PMID: 24561168 DOI: 10.1016/j.ymeth.2014.02.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Received: 11/24/2013] [Revised: 01/25/2014] [Accepted: 02/07/2014] [Indexed: 01/06/2023] Open

Cited by Other Article(s)

Hernández-Lemus E, Ochoa S. Methods for multi-omic data integration in cancer research. Front Genet 2024;15:1425456. [PMID: 39364009 PMCID: PMC11446849 DOI: 10.3389/fgene.2024.1425456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2024] [Accepted: 08/28/2024] [Indexed: 10/05/2024] Open

Jung AM, Furlong MA, Goodrich JM, Cardenas A, Beitel SC, Littau SR, Caban-Martinez AJ, Gulotta JJ, Wallentine DD, Urwin D, Gabriel J, Hughes J, Graber JM, Grant C, Burgess JL. Associations Between Epigenetic Age Acceleration and microRNA Expression Among U.S. Firefighters. Epigenet Insights 2023;16:25168657231206301. [PMID: 37953967 PMCID: PMC10634256 DOI: 10.1177/25168657231206301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2023] [Accepted: 09/20/2023] [Indexed: 11/14/2023] Open

Abstract

Epigenetic changes may be biomarkers of health. Epigenetic age acceleration (EAA), the discrepancy between epigenetic age measured via epigenetic clocks and chronological age, is associated with morbidity and mortality. However, the intersection of epigenetic clocks with microRNAs (miRNAs) and corresponding miRNA-based health implications have not been evaluated. We analyzed DNA methylation and miRNA profiles from blood sampled among 332 individuals enrolled across 2 U.S.-based firefighter occupational studies (2015-2018 and 2018-2020). We considered 7 measures of EAA in leukocytes (PhenoAge, GrimAge, Horvath, skin-blood, and Hannum epigenetic clocks, and extrinsic and intrinsic epigenetic age acceleration). We identified miRNAs associated with EAA using individual linear regression models, adjusted for sex, race/ethnicity, chronological age, and cell type estimates, and investigated downstream effects of associated miRNAs with miRNA enrichment analyses and genomic annotations. On average, participants were 38 years old, 88% male, and 75% non-Hispanic white. We identified 183 of 798 miRNAs associated with EAA (FDR q < 0.05); 126 with PhenoAge, 59 with GrimAge, 1 with Horvath, and 1 with the skin-blood clock. Among miRNAs associated with Horvath and GrimAge, there were 61 significantly enriched disease annotations including age-related metabolic and cardiovascular conditions and several cancers. Enriched pathways included those related to proteins and protein modification. We identified miRNAs associated with EAA of multiple epigenetic clocks. PhenoAge had more associations with individual miRNAs, but GrimAge and Horvath had greater implications for miRNA-associated pathways. Understanding the relationship between these epigenetic markers could contribute to our understanding of the molecular underpinnings of aging and aging-related diseases.

Affiliation(s)

Alesia M Jung: Department of Community, Environment & Policy, Mel & Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA; Department of Pharmacology & Toxicology, R. Ken Coit College of Pharmacy, College of Public Health, Tucson, AZ, USA
Melissa A Furlong: Department of Community, Environment & Policy, Mel & Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA
Jaclyn M Goodrich: Department of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, MI, USA
Andres Cardenas: Department of Epidemiology and Population Health, Stanford University, Stanford, CA, USA
Shawn C Beitel: Department of Community, Environment & Policy, Mel & Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA
Sally R Littau: Department of Community, Environment & Policy, Mel & Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA
Alberto J Caban-Martinez: Department of Public Health Sciences, Miller School of Medicine, University of Miami, Miami, FL, USA
John J Gulotta: Sarasota Fire Department, Sarasota, FL, USA
Darin D Wallentine: Sarasota Fire Department, Sarasota, FL, USA
Derek Urwin: Los Angeles County Fire Department, Los Angeles, CA, USA; Department of Chemistry & Biochemistry, University of California Los Angeles, Los Angeles, CA, USA; Division of Health Safety and Medicine, International Association of Fire Fighters, Washington, DC, USA
Jamie Gabriel: Los Angeles County Fire Department, Los Angeles, CA, USA
Jeffrey Hughes: Orange County Fire Authority, Irvine, CA, USA
Judith M Graber: Department of Biostatistics & Epidemiology, School of Public Health, Rutgers University, Piscataway, NJ, USA
Casey Grant: Fire Protection Research Foundation, Quincy, MA, USA
Jefferey L Burgess: Department of Community, Environment & Policy, Mel & Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ, USA

Woodward AA, Urbanowicz RJ, Naj AC, Moore JH. Genetic heterogeneity: Challenges, impacts, and methods through an associative lens. Genet Epidemiol 2022;46:555-571. [PMID: 35924480 PMCID: PMC9669229 DOI: 10.1002/gepi.22497] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 05/18/2022] [Revised: 07/06/2022] [Accepted: 07/19/2022] [Indexed: 01/07/2023]

Shivakumar M, Han S, Lee Y, Kim D. Epigenetic interplay between methylation and miRNA in bladder cancer: focus on isoform expression. BMC Genomics 2021;22:754. [PMID: 34674656 PMCID: PMC8529714 DOI: 10.1186/s12864-021-08052-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 09/20/2021] [Accepted: 09/24/2021] [Indexed: 11/10/2022] Open

Kim SY, Choe EK, Shivakumar M, Kim D, Sohn KA. Multi-layered network-based pathway activity inference using directed random walks: application to predicting clinical outcomes in urologic cancer. Bioinformatics 2021;37:2405-2413. [PMID: 33543748 PMCID: PMC8388033 DOI: 10.1093/bioinformatics/btab086] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/22/2020] [Revised: 12/11/2020] [Accepted: 02/02/2021] [Indexed: 12/13/2022] Open

Tong D, Tian Y, Zhou T, Ye Q, Li J, Ding K, Li J. Improving prediction performance of colon cancer prognosis based on the integration of clinical and multi-omics data. BMC Med Inform Decis Mak 2020;20:22. [PMID: 32033604 PMCID: PMC7006213 DOI: 10.1186/s12911-020-1043-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Received: 10/06/2019] [Accepted: 01/31/2020] [Indexed: 12/16/2022] Open

Hernández-Lemus E, Reyes-Gopar H, Espinal-Enríquez J, Ochoa S. The Many Faces of Gene Regulation in Cancer: A Computational Oncogenomics Outlook. Genes (Basel) 2019;10:E865. [PMID: 31671657 PMCID: PMC6896122 DOI: 10.3390/genes10110865] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Received: 07/31/2019] [Revised: 10/16/2019] [Accepted: 10/24/2019] [Indexed: 12/16/2022] Open

Lin X, Pavani KC, Smits K, Deforce D, Heindryckx B, Van Soom A, Peelman L. Bta-miR-10b Secreted by Bovine Embryos Negatively Impacts Preimplantation Embryo Quality. Front Genet 2019;10:757. [PMID: 31507632 PMCID: PMC6713719 DOI: 10.3389/fgene.2019.00757] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Received: 05/13/2019] [Accepted: 07/17/2019] [Indexed: 01/02/2023] Open

Kim TR, Jeong HH, Sohn KA. Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference. BMC Med Genomics 2019;12:94. [PMID: 31296204 PMCID: PMC6624183 DOI: 10.1186/s12920-019-0511-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 02/07/2023] Open

Abstract

BACKGROUND

The analysis of integrated multi-omics data enables the identification of disease-related biomarkers that cannot be identified from a single omics profile. Although protein-level data reflects the cellular status of cancer tissue more directly than gene-level data, past studies have mainly focused on multi-omics integration using gene-level data as opposed to protein-level data. However, the use of protein-level data (such as mass spectrometry) in multi-omics integration has some limitations. For example, the correlation between the characteristics of gene-level data (such as mRNA) and protein-level data is weak, and it is difficult to detect low-abundance signaling proteins that are used to target cancer. The reverse phase protein array (RPPA) is a highly sensitive antibody-based quantification method for signaling proteins. However, the number of protein features in RPPA data is extremely low compared to the number of gene features in gene-level data. In this study, we present a new method for integrating RPPA profiles with RNA-Seq and DNA methylation profiles for survival prediction based on the integrative directed random walk (iDRW) framework proposed in our previous study. In the iDRW framework, each omics profile is merged into a single pathway profile that reflects the topological information of the pathway. In order to address the sparsity of RPPA profiles, we employ the random walk with restart (RWR) approach on the pathway network.

RESULTS

Our model was validated using survival prediction analysis for a breast cancer dataset from The Cancer Genome Atlas. Our proposed model exhibited improved performance compared with other methods that utilize pathway information and also out-performed models that did not include the RPPA data utilized in our study. The risk pathways identified for breast cancer in this study were closely related to well-known breast cancer risk pathways.

CONCLUSIONS

Our results indicated that RPPA data is useful for survival prediction for breast cancer patients under our framework. We also observed that iDRW effectively integrates RNA-Seq, DNA methylation, and RPPA profiles, while variation in the composition of the omics data can affect both prediction performance and risk pathway identification. These results suggest that omics data composition is a critical parameter for iDRW.
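
The random walk with restart (RWR) named in the Background of this abstract can be illustrated with a generic sketch. This is not the authors' iDRW code: the toy four-node graph, the function name `random_walk_with_restart`, and the restart probability of 0.7 are all assumptions chosen for illustration. The walker repeatedly moves to a neighbouring node or, with probability `restart`, jumps back to a seed node, so nodes close to the seeds accumulate higher scores.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.7, tol=1e-10, max_iter=1000):
    """Generic RWR on a small dense adjacency matrix (list of lists)."""
    n = len(adjacency)
    # Column-normalise so each column is a probability distribution over next nodes.
    col_sums = [sum(adjacency[i][j] for i in range(n)) for j in range(n)]
    w = [[adjacency[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
         for i in range(n)]
    # Restart distribution: uniform over the seed nodes.
    p0 = [0.0] * n
    for s in seeds:
        p0[s] = 1.0 / len(seeds)
    p = p0[:]
    for _ in range(max_iter):
        # One update: p <- (1 - restart) * W p + restart * p0
        nxt = [(1 - restart) * sum(w[i][j] * p[j] for j in range(n)) + restart * p0[i]
               for i in range(n)]
        diff = sum(abs(a - b) for a, b in zip(nxt, p))
        p = nxt
        if diff < tol:
            break
    return p


# Hypothetical undirected path graph 0 - 1 - 2 - 3, seeded at node 0.
path_graph = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
scores = random_walk_with_restart(path_graph, seeds=[0])
```

In this toy graph the scores fall off with distance from the seed, which is the property that lets a sparse set of measured nodes spread signal to unmeasured neighbours.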

Kim SY, Jeong HH, Kim J, Moon JH, Sohn KA. Robust pathway-based multi-omics data integration using directed random walks for survival prediction in multiple cancer studies. Biol Direct 2019;14:8. [PMID: 31036036 PMCID: PMC6489180 DOI: 10.1186/s13062-019-0239-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Received: 10/16/2018] [Accepted: 04/10/2019] [Indexed: 01/15/2023] Open

Abstract

Background

Integrating the rich information from multi-omics data has been a popular approach to survival prediction and bio-marker identification for several cancer studies. To facilitate the integrative analysis of multiple genomic profiles, several studies have suggested utilizing pathway information rather than using individual genomic profiles.

Methods

We have recently proposed an integrative directed random walk-based method utilizing pathway information (iDRW) for more robust and effective genomic feature extraction. In this study, we applied iDRW to multiple genomic profiles for two different cancers, and designed a directed gene-gene graph which reflects the interaction between gene expression and copy number data. In the experiments, the performances of the iDRW method and four state-of-the-art pathway-based methods were compared using a survival prediction model which classifies samples into two survival groups.

Results

The results show that the integrative analysis guided by pathway information not only improves prediction performance, but also provides better biological insights into the top pathways and genes prioritized by the model in both the neuroblastoma and the breast cancer datasets. The pathways and genes selected by the iDRW method were shown to be related to the corresponding cancers.

Conclusions

In this study, we demonstrated the effectiveness of a directed random walk-based multi-omics data integration method applied to gene expression and copy number data for both breast cancer and neuroblastoma datasets. We revamped a directed gene-gene graph considering the impact of copy number variation on gene expression and redefined the weight initialization and gene-scoring method. The benchmark result for iDRW with four pathway-based methods demonstrated that the iDRW method improved survival prediction performance and jointly identified cancer-related pathways and genes for two different cancer datasets.

Reviewers

This article was reviewed by Helena Molina-Abril and Marta Hidalgo.

El-Manzalawy Y, Hsieh TY, Shivakumar M, Kim D, Honavar V. Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data. BMC Med Genomics 2018;11:71. [PMID: 30255801 PMCID: PMC6157248 DOI: 10.1186/s12920-018-0388-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Indexed: 12/11/2022] Open

Kim SY, Kim TR, Jeong HH, Sohn KA. Integrative pathway-based survival prediction utilizing the interaction between gene expression and DNA methylation in breast cancer. BMC Med Genomics 2018;11:68. [PMID: 30255812 PMCID: PMC6157196 DOI: 10.1186/s12920-018-0389-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Indexed: 12/18/2022] Open

Doostparast Torshizi A, Petzold LR. Graph-based semi-supervised learning with genomic data integration using condition-responsive genes applied to phenotype classification. J Am Med Inform Assoc 2018;25:99-108. [PMID: 28505320 PMCID: PMC7647127 DOI: 10.1093/jamia/ocx032] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Received: 11/16/2016] [Revised: 02/08/2017] [Accepted: 03/14/2017] [Indexed: 11/14/2022] Open

Kim D, Li R, Lucas A, Verma SS, Dudek SM, Ritchie MD. Using knowledge-driven genomic interactions for multi-omics data analysis: metadimensional models for predicting clinical outcomes in ovarian carcinoma. J Am Med Inform Assoc 2017;24:577-587. [PMID: 28040685 DOI: 10.1093/jamia/ocw165] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Received: 03/13/2016] [Accepted: 12/02/2016] [Indexed: 02/07/2023] Open

Abstract

It is common that cancer patients have different molecular signatures even though they have similar clinical features, such as histology, due to the heterogeneity of tumors. To overcome this variability, we previously developed a new approach incorporating prior biological knowledge that identifies knowledge-driven genomic interactions associated with outcomes of interest. However, no systematic approach has been proposed to identify interaction models between pathways based on multi-omics data. Here we have proposed such a novel methodological framework, called metadimensional knowledge-driven genomic interactions (MKGIs). To test the utility of the proposed framework, we applied it to an ovarian cancer dataset including multi-omics profiles from The Cancer Genome Atlas to predict grade, stage, and survival outcome. We found that each knowledge-driven genomic interaction model, based on different genomic datasets, contains different sets of pathway features, which suggests that each genomic data type may contribute to outcomes in ovarian cancer via a different pathway. In addition, MKGI models significantly outperformed the single knowledge-driven genomic interaction model. From the MKGI models, many interactions between pathways associated with outcomes were found, including the mitogen-activated protein kinase (MAPK) signaling pathway and the gonadotropin-releasing hormone (GnRH) signaling pathway, which are known to play important roles in cancer pathogenesis. The beauty of incorporating biological knowledge into the model based on multi-omics data is the ability to improve diagnosis and prognosis and provide better interpretability. Thus, determining variability in molecular signatures based on these interactions between pathways may lead to better diagnostic/treatment strategies for better precision medicine.

Lee G, Bang L, Kim SY, Kim D, Sohn KA. Identifying subtype-specific associations between gene expression and DNA methylation profiles in breast cancer. BMC Med Genomics 2017;10:28. [PMID: 28589855 PMCID: PMC5461552 DOI: 10.1186/s12920-017-0268-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Indexed: 02/08/2023] Open

Shivakumar M, Lee Y, Bang L, Garg T, Sohn KA, Kim D. Identification of epigenetic interactions between miRNA and DNA methylation associated with gene expression as potential prognostic markers in bladder cancer. BMC Med Genomics 2017;10:30. [PMID: 28589857 PMCID: PMC5461531 DOI: 10.1186/s12920-017-0269-y] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Indexed: 01/22/2023] Open

Abstract

Background

One of the fundamental challenges in cancer is to detect the regulators of gene expression changes during cancer progression. Through transcriptional silencing of critical cancer-related genes, epigenetic changes such as DNA methylation play a crucial role in cancer. In addition, miRNA, another major component of the epigenome, is a regulator at the post-transcriptional level that modulates transcriptome changes. However, the mechanistic role of synergistic interactions between DNA methylation and miRNA as epigenetic regulators of transcriptomic changes, and its association with clinical outcomes such as survival, remain largely unexplored in cancer.

Methods

In this study, we propose an integrative framework to identify epigenetic interactions between methylation and miRNA associated with transcriptomic changes. To test the utility of the proposed framework, the bladder cancer data set, including DNA methylation, miRNA expression, and gene expression data, from The Cancer Genome Atlas (TCGA) was analyzed for this study.

Results

First, we found 120 genes associated with interactions between the two epigenomic components. Then, 11 significant epigenetic interactions between miRNA and methylation, which target E2F3, CCND1, UTP6, CDADC1, SLC35E3, METRNL, TPCN2, NACC2, VGLL4, and PTEN, were found to be associated with survival. To this end, exploration of TCGA bladder cancer data identified epigenetic interactions that are associated with survival as potential prognostic markers in bladder cancer.

Conclusions

Given the importance and prevalence of these interactions between epigenetic events in bladder cancer, it is timely to understand further how different epigenetic components interact and influence each other.

Kim M, Nam Y, Shin H. An inference method from multi-layered structure of biomedical data. BMC Med Inform Decis Mak 2017;17:52. [PMID: 28539122 PMCID: PMC5444045 DOI: 10.1186/s12911-017-0450-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 12/12/2022] Open

Hassanzadeh HR, Phan JH, Wang MD. A Multi-Modal Graph-Based Semi-Supervised Pipeline for Predicting Cancer Survival. Proceedings. IEEE International Conference on Bioinformatics and Biomedicine 2016;2016:184-189. [PMID: 32655981 DOI: 10.1109/bibm.2016.7822516] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/09/2022]

Nam Y, Kim M, Lee K, Shin H. CLASH: Complementary Linkage with Anchoring and Scoring for Heterogeneous biomolecular and clinical data. BMC Med Inform Decis Mak 2016;16 Suppl 3:72. [PMID: 27454118 PMCID: PMC4959382 DOI: 10.1186/s12911-016-0315-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 12/11/2022] Open

Abstract

Background

Disease-disease associations are increasingly viewed and analyzed as a network in which the connections between diseases are derived from source information on interactome maps of biomolecules such as genes, proteins, and metabolites. Although abundant source information leads to tighter connections between diseases in the network, for certain groups of diseases, such as metabolic diseases, few connections occur because source information is insufficient; a large proportion of their associated genes are still unknown. One way to circumvent the lack of source information is to integrate available external information using an up-to-date integration or fusion method. However, if one wants a disease network that places heavy emphasis on the original data source and uses external sources only to complement it, integration may not be appropriate. Interpretation of the integrated network would be ambiguous: the meanings conferred on edges would be vague because the information is fused.

Methods

In this study, we propose a network-based algorithm that complements the original network by utilizing external information while preserving the network’s originality. The proposed algorithm links a disconnected node to the disease network by using complementary information from an external data source through four steps: anchoring, connecting, scoring, and stopping.

Results

When applied to a network of metabolic diseases sourced from protein-protein interaction data, the proposed algorithm recovered 97% of connections and improved AUC performance to as much as 0.71 (up from 0.55) by using external information drawn from text mining of the PubMed comorbidity literature. Experimental results also show that the proposed algorithm is robust to noisy external information.

Conclusion

The novelty of this research is that the proposed algorithm preserves the network’s originality while complementing it with external information. Furthermore, it can be used for original association recovery and novel association discovery in disease networks.

Świtnicki MP, Juul M, Madsen T, Sørensen KD, Pedersen JS. PINCAGE: probabilistic integration of cancer genomics data for perturbed gene identification and sample classification. Bioinformatics 2016;32:1353-65. [PMID: 26740525 DOI: 10.1093/bioinformatics/btv758] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 06/11/2015] [Accepted: 12/17/2015] [Indexed: 02/02/2023] Open

Abstract

MOTIVATION

Cancer development and progression is driven by a complex pattern of genomic and epigenomic perturbations. Both types of perturbations can affect gene expression levels and disease outcome. Integrative analysis of cancer genomics data may therefore improve detection of perturbed genes and prediction of disease state. As different data types are usually dependent, analysis based on independence assumptions will make inefficient use of the data and potentially lead to false conclusions.

MODEL

Here, we present PINCAGE (Probabilistic INtegration of CAncer GEnomics data), a method that uses probabilistic integration of cancer genomics data for combined evaluation of RNA-seq gene expression and 450k array DNA methylation measurements of promoters as well as gene bodies. It models the dependence between expression and methylation using modular graphical models, which also allows future inclusion of additional data types.

RESULTS

We apply our approach to a Breast Invasive Carcinoma dataset from The Cancer Genome Atlas consortium, which includes 82 adjacent normal and 730 cancer samples. We identify new biomarker candidates of breast cancer development (PTF1A, RABIF, RAG1AP1, TIMM17A, LOC148145) and progression (SERPINE3, ZNF706). PINCAGE discriminates better between normal and tumour tissue and between progressing and non-progressing tumours in comparison with established methods that assume independence between tested data types, especially when using evidence from multiple genes. Our method can be applied to any type of cancer or, more generally, to any genomic disease for which sufficient amount of molecular data is available.

AVAILABILITY AND IMPLEMENTATION

R scripts available at http://moma.ki.au.dk/prj/pincage/

CONTACT

michal.switnicki@clin.au.dk or jakob.skou@clin.au.dk

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Schmitz U, Wolkenhauer O. Taking Bioinformatics to Systems Medicine. Methods Mol Biol 2016;1386:17-41. [PMID: 26677177 PMCID: PMC7120931 DOI: 10.1007/978-1-4939-3283-2_2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Indexed: 12/27/2022]

Jeong HH, Leem S, Wee K, Sohn KA. Integrative network analysis for survival-associated gene-gene interactions across multiple genomic profiles in ovarian cancer. J Ovarian Res 2015;8:42. [PMID: 26138921 PMCID: PMC4491426 DOI: 10.1186/s13048-015-0171-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Received: 12/23/2014] [Accepted: 06/24/2015] [Indexed: 01/01/2023] Open

Abstract

BACKGROUND

Recent advances in high-throughput technology and the emergence of large-scale genomic datasets have enabled detection of genomic features that affect clinical outcomes. Although many previous computational studies have analysed the effect of each single gene or the additive effects of multiple genes on the clinical outcome, less attention has been devoted to the identification of gene-gene interactions of general type that are associated with the clinical outcome. Moreover, the integration of information from multiple molecular profiles adds another challenge to this problem. Recently, network-based approaches have gained huge popularity. However, previous network construction methods have been more concerned with the relationship between features only, rather than the effect of feature interactions on clinical outcome.

METHODS

We propose a mutual information-based integrative network analysis framework (MINA) that identifies gene pairs associated with clinical outcome and systematically analyses the resulting networks over multiple genomic profiles. We implement an efficient non-parametric testing scheme that ensures the significance of detected gene interactions. We develop a tool named MINA that automates the proposed analysis scheme of identifying outcome-associated gene interactions and generating various networks from those interacting pairs for downstream analysis.

RESULTS

We demonstrate the proposed framework using real data from ovarian cancer patients in The Cancer Genome Atlas (TCGA). Statistically significant gene pairs associated with survival were identified from multiple genomic profiles, which include many individual genes that have weak or no effect on survival. Moreover, we also show that integrated networks, constructed by merging networks from multiple genomic profiles, demonstrate better topological properties and biological significance than individual networks.

CONCLUSIONS

We have developed a simple but powerful analysis tool that is able to detect gene-gene interactions associated with clinical outcome on multiple genomic profiles. By being network-based, our approach provides a better insight into the underlying gene-gene interaction mechanisms that affect the clinical outcome of cancer patients.
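
To make the idea of an outcome-associated gene pair concrete, here is a minimal sketch of mutual information between a discretised gene pair and a binary outcome. It is not the MINA tool: the function name `mutual_information`, the toy data, and the plug-in (count-based) estimator are assumptions, and the non-parametric significance test described in the abstract is not reproduced. The toy data are constructed so that neither gene alone carries any information about the outcome while the pair determines it completely.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Plug-in estimate of I(X; Y) in bits from paired discrete observations."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in joint.items()
    )


# Hypothetical discretised states (0 = low, 1 = high) of two genes in four samples.
gene_a = [0, 0, 1, 1]
gene_b = [0, 1, 0, 1]
# Hypothetical binary outcome that depends only on the combination of both genes.
outcome = [0, 1, 1, 0]

mi_a = mutual_information(gene_a, outcome)                        # gene A alone
mi_b = mutual_information(gene_b, outcome)                        # gene B alone
mi_pair = mutual_information(list(zip(gene_a, gene_b)), outcome)  # the pair jointly
```

Here `mi_a` and `mi_b` are zero while `mi_pair` is one bit, mirroring the abstract's point that significant pairs can include genes with weak or no individual effect.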

Kim D, Li R, Dudek SM, Ritchie MD. Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer. J Biomed Inform 2015;56:220-8. [PMID: 26048077 DOI: 10.1016/j.jbi.2015.05.019] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Received: 02/06/2015] [Revised: 05/15/2015] [Accepted: 05/27/2015] [Indexed: 12/27/2022]

Abstract

Evaluation of survival models to predict cancer patient prognosis is one of the most important areas of emphasis in cancer research. A binary classification approach has difficulty directly predicting survival due to the characteristics of censored observations and the fact that the predictive power depends on the threshold used to set two classes. In contrast, the traditional Cox regression approach has some drawbacks in the sense that it does not allow for the identification of interactions between genomic features, which could have key roles associated with cancer prognosis. In addition, data integration is regarded as one of the important issues in improving the predictive power of survival models since cancer could be caused by multiple alterations through meta-dimensional genomic data including genome, epigenome, transcriptome, and proteome. Here we have proposed a new integrative framework designed to perform these three functions simultaneously: (1) predicting censored survival data; (2) integrating meta-dimensional omics data; (3) identifying interactions within/between meta-dimensional genomic features associated with survival. In order to predict censored survival time, martingale residuals were calculated as a new continuous outcome and a new fitness function used by the grammatical evolution neural network (GENN) based on mean absolute difference of martingale residuals was implemented. To test the utility of the proposed framework, a simulation study was conducted, followed by an analysis of meta-dimensional omics data including copy number, gene expression, DNA methylation, and protein expression data in breast cancer retrieved from The Cancer Genome Atlas (TCGA). On the basis of the results from breast cancer dataset, we were able to identify interactions not only within a single dimension of genomic data but also between meta-dimensional omics data that are associated with survival. 
Notably, the predictive power of our best meta-dimensional model was 73% which outperformed all of the other models conducted based on a single dimension of genomic data. Breast cancer is an extremely heterogeneous disease and the high levels of genomic diversity within/between breast tumors could affect the risk of therapeutic responses and disease progression. Thus, identifying interactions within/between meta-dimensional omics data associated with survival in breast cancer is expected to deliver direction for improved meta-dimensional prognostic biomarkers and therapeutic targets.
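
The martingale residuals used above as a continuous outcome can be sketched for the simplest case. The sketch assumes a null model with no covariates and a Nelson-Aalen estimate of the cumulative hazard; the abstract does not say which model the authors used to obtain their residuals, and the function name `martingale_residuals` and the toy data are hypothetical. Each residual is the observed event indicator minus the expected number of events accumulated up to that subject's follow-up time.

```python
def martingale_residuals(times, events):
    """Null-model martingale residuals: event indicator minus Nelson-Aalen cumulative hazard."""
    # Hazard increment d_j / n_j at each distinct event time t_j.
    increments = {}
    for t in sorted({t for t, e in zip(times, events) if e == 1}):
        deaths = sum(1 for u, e in zip(times, events) if u == t and e == 1)
        at_risk = sum(1 for u in times if u >= t)
        increments[t] = deaths / at_risk
    residuals = []
    for t, e in zip(times, events):
        # Cumulative hazard up to and including this subject's follow-up time.
        cum_hazard = sum(h for s, h in increments.items() if s <= t)
        residuals.append(e - cum_hazard)
    return residuals


# Hypothetical follow-up times and event indicators (1 = event observed, 0 = censored).
follow_up = [2, 3, 3, 5, 8]
observed = [1, 1, 0, 1, 0]
res = martingale_residuals(follow_up, observed)
```

Subjects who experience the event early get positive residuals and long-surviving censored subjects get negative ones, which is what makes the residual usable as a continuous regression target.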

Chen L. Systems biology with omics data. Methods 2015;67:267-8. [PMID: 24882145 DOI: 10.1016/j.ymeth.2014.05.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 01/28/2023] Open