1
|
Tafavvoghi M, Bongo LA, Shvetsov N, Busund LTR, Møllersen K. Publicly available datasets of breast histopathology H&E whole-slide images: A scoping review. J Pathol Inform 2024; 15:100363. [PMID: 38405160 PMCID: PMC10884505 DOI: 10.1016/j.jpi.2024.100363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/24/2023] [Accepted: 01/23/2024] [Indexed: 02/27/2024] Open
Abstract
Advancements in digital pathology and computing resources have made a significant impact in the field of computational pathology for breast cancer diagnosis and treatment. However, access to high-quality labeled histopathological images of breast cancer is a big challenge that limits the development of accurate and robust deep learning models. In this scoping review, we identified the publicly available datasets of breast H&E-stained whole-slide images (WSIs) that can be used to develop deep learning algorithms. We systematically searched 9 scientific literature databases and 9 research data repositories and found 17 publicly available datasets containing 10 385 H&E WSIs of breast cancer. Moreover, we reported image metadata and characteristics for each dataset to assist researchers in selecting proper datasets for specific tasks in breast cancer computational pathology. In addition, we compiled 2 lists of breast H&E patches and private datasets as supplementary resources for researchers. Notably, only 28% of the included articles utilized multiple datasets, and only 14% used an external validation set, suggesting that the performance of other developed models may be susceptible to overestimation. The TCGA-BRCA was used in 52% of the selected studies. This dataset has a considerable selection bias that can impact the robustness and generalizability of the trained algorithms. There is also a lack of consistent metadata reporting of breast WSI datasets that can be an issue in developing accurate deep learning models, indicating the necessity of establishing explicit guidelines for documenting breast WSI dataset characteristics and metadata.
Collapse
Affiliation(s)
- Masoud Tafavvoghi
- Department of Community Medicine, Uit The Arctic University of Norway, Tromsø, Norway
| | - Lars Ailo Bongo
- Department of Computer Science, Uit The Arctic University of Norway, Tromsø, Norway
| | - Nikita Shvetsov
- Department of Computer Science, Uit The Arctic University of Norway, Tromsø, Norway
| | | | - Kajsa Møllersen
- Department of Community Medicine, Uit The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
2
|
Dubey P, Kumar S. Advancing prostate cancer detection: a comparative analysis of PCLDA-SVM and PCLDA-KNN classifiers for enhanced diagnostic accuracy. Sci Rep 2023; 13:13745. [PMID: 37612436 PMCID: PMC10447543 DOI: 10.1038/s41598-023-40906-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 08/18/2023] [Indexed: 08/25/2023] Open
Abstract
This investigation aimed to assess the effectiveness of different classification models in diagnosing prostate cancer using a screening dataset obtained from the National Cancer Institute's Cancer Data Access System. The dataset was first reduced using the PCLDA method, which combines Principal Component Analysis and Linear Discriminant Analysis. Two classifiers, Support Vector Machine (SVM) and k-Nearest Neighbour (KNN), were then applied to compare their performance. The results showed that the PCLDA-SVM model achieved an impressive accuracy rate of 97.99%, with a precision of 0.92, sensitivity of 92.83%, specificity of 97.65%, and F1 score of 0.93. Additionally, it demonstrated a low error rate of 0.016 and a Matthews Correlation Coefficient (MCC) and Kappa coefficient of 0.946. On the other hand, the PCLDA-KNN model also performed well, achieving an accuracy of 97.8%, precision of 0.93, sensitivity of 93.39%, specificity of 97.86%, an F1 score of 0.92, a high MCC and Kappa coefficient of 0.98, and an error rate of 0.006. In conclusion, the PCLDA-SVM method exhibited improved efficacy in diagnosing prostate cancer compared to the PCLDA-KNN model. Both models, however, showed promising results, suggesting the potential of these classifiers in prostate cancer diagnosis.
Collapse
Affiliation(s)
- Priya Dubey
- Electrical and Electronics Engineering, Birla Institute of Technology, Ranchi, Jharkhand, 835215, India.
| | - Surendra Kumar
- Electrical and Electronics Engineering, Birla Institute of Technology, Ranchi, Jharkhand, 835215, India
| |
Collapse
|
3
|
Panic J, Defeudis A, Balestra G, Giannini V, Rosati S. Normalization Strategies in Multi-Center Radiomics Abdominal MRI: Systematic Review and Meta-Analyses. IEEE Open J Eng Med Biol 2023; 4:67-76. [PMID: 37283773 PMCID: PMC10241248 DOI: 10.1109/ojemb.2023.3271455] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/18/2023] [Accepted: 04/25/2023] [Indexed: 06/08/2023] Open
Abstract
Goal: Artificial intelligence applied to medical image analysis has been extensively used to develop non-invasive diagnostic and prognostic signatures. However, these imaging biomarkers should be largely validated on multi-center datasets to prove their robustness before they can be introduced into clinical practice. The main challenge is represented by the great and unavoidable image variability which is usually addressed using different pre-processing techniques including spatial, intensity and feature normalization. The purpose of this study is to systematically summarize normalization methods and to evaluate their correlation with the radiomics model performances through meta-analyses. This review is carried out according to the PRISMA statement: 4777 papers were collected, but only 74 were included. Two meta-analyses were carried out according to two clinical aims: characterization and prediction of response. Findings of this review demonstrated that there are some commonly used normalization approaches, but not a commonly agreed pipeline that can allow to improve performance and to bridge the gap between bench and bedside.
Collapse
Affiliation(s)
- Jovana Panic
- Department of Surgical Science, and Polytechnic of Turin, Department of Electronics and TelecommunicationsUniversity of Turin10129TurinItaly
| | - Arianna Defeudis
- Department of Surgical ScienceUniversity of Turin10129TurinItaly
- Candiolo Cancer InstituteFPO-IRCCS10060CandioloItaly
| | - Gabriella Balestra
- Department of Electronics and TelecommunicationsPolytechnic of Turin10129TurinItaly
| | - Valentina Giannini
- Department of Surgical ScienceUniversity of Turin10129TurinItaly
- Candiolo Cancer InstituteFPO-IRCCS10060CandioloItaly
| | - Samanta Rosati
- Department of Electronics and TelecommunicationsPolytechnic of Turin10129TurinItaly
| |
Collapse
|
4
|
Omar MI, MacLennan S, Ribal MJ, Roobol MJ, Dimitropoulos K, van den Broeck T, MacLennan SJ, Axelsson SE, Gandaglia G, Willemse PP, Mastris K, Ransohoff JB, Devecseri Z, Abbott T, De Meulder B, Bjartell A, Asiimwe A, N'Dow J. Unanswered questions in prostate cancer - findings of an international multi-stakeholder consensus by the PIONEER consortium. Nat Rev Urol 2023:10.1038/s41585-023-00748-9. [PMID: 37012441 DOI: 10.1038/s41585-023-00748-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/20/2023] [Indexed: 04/05/2023]
Abstract
PIONEER is a European network of excellence for big data in prostate cancer consisting of 37 private and public stakeholders from 9 countries across Europe. Many progresses have been done in prostate cancer management, but unanswered questions in the field still exist, and big data could help to answer these questions. The PIONEER consortium conducted a two-round modified Delphi survey aiming at building consensus between two stakeholder groups - health-care professionals and patients with prostate cancer - about the most important questions in the field of prostate cancer to be answered using big data. Respondents were asked to consider what would be the effect of answering the proposed questions on improving diagnosis and treatment outcomes for patients with prostate cancer and to score these questions on a scale of 1 (not important) to 9 (critically important). The mean percentage of participants who scored each of the proposed questions as critically important was calculated across the two stakeholder groups and used to rank the questions and identify the highest scoring questions in the critically important category. The identification of questions in prostate cancer that are important to various stakeholders will help the PIONEER consortium to provide answers to these questions to improve the clinical care of patients with prostate cancer.
Collapse
Affiliation(s)
- Muhammad Imran Omar
- Academic Urology Unit, University of Aberdeen, Aberdeen, UK.
- Guidelines Office, European Association of Urology, Arnhem, Netherlands.
| | | | - Maria J Ribal
- Guidelines Office, European Association of Urology, Arnhem, Netherlands
| | | | | | | | | | | | - Giorgio Gandaglia
- Department of Urology and Division of Experimental Oncology, Urological Research Institute, Vita-Salute San Raffaele University, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | | | - Ken Mastris
- European Cancer Patient Coalition, Brussels, Belgium
| | | | | | | | | | - Anders Bjartell
- Department of Translational Medicine, Lund University, Lund, Sweden
| | | | - James N'Dow
- Academic Urology Unit, University of Aberdeen, Aberdeen, UK
- Guidelines Office, European Association of Urology, Arnhem, Netherlands
| |
Collapse
|
5
|
Bleker J, Kwee TC, Yakar D. Quality of Multicenter Studies Using MRI Radiomics for Diagnosing Clinically Significant Prostate Cancer: A Systematic Review. Life (Basel) 2022; 12:946. [PMID: 35888036 DOI: 10.3390/life12070946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 06/20/2022] [Accepted: 06/21/2022] [Indexed: 11/17/2022]
Abstract
Background: Reproducibility and generalization are major challenges for clinically significant prostate cancer modeling using MRI radiomics. Multicenter data seem indispensable to deal with these challenges, but the quality of such studies is currently unknown. The aim of this study was to systematically review the quality of multicenter studies on MRI radiomics for diagnosing clinically significant PCa. Methods: This systematic review followed the 2020 Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist. Multicenter studies investigating the value of MRI radiomics for the diagnosis of clinically significant prostate cancer were included. Quality was assessed using the checklist for artificial intelligence in medical imaging (CLAIM) and the radiomics quality score (RQS). CLAIM consisted of 42 equally important items referencing different elements of good practice AI in medical imaging. RQS consisted of 36 points awarded over 16 items related to good practice radiomics. Final CLAIM and RQS scores were percentage-based, allowing for a total quality score consisting of the average of CLAIM and RQS. Results: Four studies were included. The average total CLAIM score was 74.6% and the average RQS was 52.8%. The corresponding average total quality score (CLAIM + RQS) was 63.7%. Conclusions: A very small number of multicenter radiomics PCa classification studies have been performed with the existing studies being of bad or average quality. Good multicenter studies might increase by encouraging preferably prospective data sharing and paying extra care to documentation in regards to reproducibility and clinical utility.
Collapse
|
6
|
Corradini D, Brizi L, Gaudiano C, Bianchi L, Marcelli E, Golfieri R, Schiavina R, Testa C, Remondini D. Challenges in the Use of Artificial Intelligence for Prostate Cancer Diagnosis from Multiparametric Imaging Data. Cancers (Basel) 2021; 13:3944. [PMID: 34439099 DOI: 10.3390/cancers13163944] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/02/2021] [Accepted: 08/02/2021] [Indexed: 11/18/2022] Open
Abstract
Simple Summary Prostate Cancer is one of the main threats to men’s health. Its accurate diagnosis is crucial to properly treat patients depending on the cancer’s level of aggressiveness. Tumor risk-stratification is still a challenging task due to the difficulties met during the reading of multi-parametric Magnetic Resonance Images. Artificial Intelligence models may help radiologists in staging the aggressiveness of the equivocal lesions, reducing inter-observer variability and evaluation time. However, these algorithms need many high-quality images to work efficiently, bringing up overfitting and lack of standardization and reproducibility as emerging issues to be addressed. This study attempts to illustrate the state of the art of current research of Artificial Intelligence methods to stratify prostate cancer for its clinical significance suggesting how widespread use of public databases could be a possible solution to these issues. Abstract Many efforts have been carried out for the standardization of multiparametric Magnetic Resonance (mp-MR) images evaluation to detect Prostate Cancer (PCa), and specifically to differentiate levels of aggressiveness, a crucial aspect for clinical decision-making. Prostate Imaging—Reporting and Data System (PI-RADS) has contributed noteworthily to this aim. Nevertheless, as pointed out by the European Association of Urology (EAU 2020), the PI-RADS still has limitations mainly due to the moderate inter-reader reproducibility of mp-MRI. In recent years, many aspects in the diagnosis of cancer have taken advantage of the use of Artificial Intelligence (AI) such as detection, segmentation of organs and/or lesions, and characterization. Here a focus on AI as a potentially important tool for the aim of standardization and reproducibility in the characterization of PCa by mp-MRI is reported. AI includes methods such as Machine Learning and Deep learning techniques that have shown to be successful in classifying mp-MR images, with similar performances obtained by radiologists. Nevertheless, they perform differently depending on the acquisition system and protocol used. Besides, these methods need a large number of samples that cover most of the variability of the lesion aspect and zone to avoid overfitting. The use of publicly available datasets could improve AI performance to achieve a higher level of generalizability, exploiting large numbers of cases and a big range of variability in the images. Here we explore the promise and the advantages, as well as emphasizing the pitfall and the warnings, outlined in some recent studies that attempted to classify clinically significant PCa and indolent lesions using AI methods. Specifically, we focus on the overfitting issue due to the scarcity of data and the lack of standardization and reproducibility in every step of the mp-MR image acquisition and the classifier implementation. In the end, we point out that a solution can be found in the use of publicly available datasets, whose usage has already been promoted by some important initiatives. Our future perspective is that AI models may become reliable tools for clinicians in PCa diagnosis, reducing inter-observer variability and evaluation time.
Collapse
|
7
|
Elkhader J, Elemento O. Artificial intelligence in oncology: From bench to clinic. Semin Cancer Biol 2021; 84:113-128. [PMID: 33915289 DOI: 10.1016/j.semcancer.2021.04.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 03/22/2021] [Accepted: 04/15/2021] [Indexed: 02/07/2023]
Abstract
In the past few years, Artificial Intelligence (AI) techniques have been applied to almost every facet of oncology, from basic research to drug development and clinical care. In the clinical arena where AI has perhaps received the most attention, AI is showing promise in enhancing and automating image-based diagnostic approaches in fields such as radiology and pathology. Robust AI applications, which retain high performance and reproducibility over multiple datasets, extend from predicting indications for drug development to improving clinical decision support using electronic health record data. In this article, we review some of these advances. We also introduce common concepts and fundamentals of AI and its various uses, along with its caveats, to provide an overview of the opportunities and challenges in the field of oncology. Leveraging AI techniques productively to provide better care throughout a patient's medical journey can fuel the predictive promise of precision medicine.
Collapse
Affiliation(s)
- Jamal Elkhader
- HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Dept. of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, 10021, USA; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, 10065, USA; Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA
| | - Olivier Elemento
- HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Dept. of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, 10021, USA; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, 10065, USA; Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA.
| |
Collapse
|
8
|
de Dieuleveult M, Marchal C, Jouinot A, Letessier A, Miotto B. Molecular and Clinical Relevance of ZBTB38 Expression Levels in Prostate Cancer. Cancers (Basel) 2020; 12:E1106. [PMID: 32365491 DOI: 10.3390/cancers12051106] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 04/12/2020] [Accepted: 04/23/2020] [Indexed: 12/18/2022] Open
Abstract
Prostate cancer is one of the most commonly diagnosed cancers in men. A number of genomic and clinical studies have led to a better understanding of prostate cancer biology. Still, the care of patients as well as the prediction of disease aggressiveness, recurrence and outcome remain challenging. Here, we showed that expression of the gene ZBTB38 is associated with poor prognosis in localised prostate cancer and could help discriminate aggressive localised prostate tumours from those who can benefit only from observation. Analysis of different prostate cancer cohorts indicates that low expression levels of ZBTB38 associate with increased levels of chromosomal abnormalities and more aggressive pathological features, including higher rate of biochemical recurrence of the disease. Importantly, gene expression profiling of these tumours, complemented with cellular assays on prostate cancer cell lines, unveiled that tumours with low levels of ZBTB38 expression might be targeted by doxorubicin, a compound generating reactive oxygen species. Our study shows that ZBTB38 is involved in prostate cancer pathogenesis and may represent a useful marker to identify high risk and highly rearranged localised prostate cancer susceptible to doxorubicin.
Collapse
|
9
|
Hulsen T. Sharing Is Caring-Data Sharing Initiatives in Healthcare. Int J Environ Res Public Health 2020; 17:ijerph17093046. [PMID: 32349396 PMCID: PMC7246891 DOI: 10.3390/ijerph17093046] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 03/17/2020] [Accepted: 04/24/2020] [Indexed: 02/05/2023]
Abstract
In recent years, more and more health data are being generated. These data come not only from professional health systems, but also from wearable devices. All these 'big data' put together can be utilized to optimize treatments for each unique patient ('precision medicine'). For this to be possible, it is necessary that hospitals, academia and industry work together to bridge the 'valley of death' of translational medicine. However, hospitals and academia often are reluctant to share their data with other parties, even though the patient is actually the owner of his/her own health data. Academic hospitals usually invest a lot of time in setting up clinical trials and collecting data, and want to be the first ones to publish papers on this data. There are some publicly available datasets, but these are usually only shared after study (and publication) completion, which means a severe delay of months or even years before others can analyse the data. One solution is to incentivize the hospitals to share their data with (other) academic institutes and the industry. Here, we show an analysis of the current literature around data sharing, and we discuss five aspects of data sharing in the medical domain: publisher requirements, data ownership, growing support for data sharing, data sharing initiatives and how the use of federated data might be a solution. We also discuss some potential future developments around data sharing, such as medical crowdsourcing and data generalists.
Collapse
Affiliation(s)
- Tim Hulsen
- Department of Professional Health Solutions & Services, Philips Research, 5656AE Eindhoven, The Netherlands
| |
Collapse
|
10
|
Affiliation(s)
- Rex E. FitzGerald
- Swiss Centre for Applied Human Toxicology SCAHT, University of Basel, Missionsstrasse 64, CH-4055 Basel, Switzerland
| |
Collapse
|
11
|
Lin Y, Zhao X, Miao Z, Ling Z, Wei X, Pu J, Hou J, Shen B. Data-driven translational prostate cancer research: from biomarker discovery to clinical decision. J Transl Med 2020; 18:119. [PMID: 32143723 PMCID: PMC7060655 DOI: 10.1186/s12967-020-02281-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 02/26/2020] [Indexed: 02/08/2023] Open
Abstract
Prostate cancer (PCa) is a common malignant tumor with increasing incidence and high heterogeneity among males worldwide. In the era of big data and artificial intelligence, the paradigm of biomarker discovery is shifting from traditional experimental and small data-based identification toward big data-driven and systems-level screening. Complex interactions between genetic factors and environmental effects provide opportunities for systems modeling of PCa genesis and evolution. We hereby review the current research frontiers in informatics for PCa clinical translation. First, the heterogeneity and complexity in PCa development and clinical theranostics are introduced to raise the concern for PCa systems biology studies. Then biomarkers and risk factors ranging from molecular alternations to clinical phenotype and lifestyle changes are explicated for PCa personalized management. Methodologies and applications for multi-dimensional data integration and computational modeling are discussed. The future perspectives and challenges for PCa systems medicine and holistic healthcare are finally provided.
Collapse
Affiliation(s)
- Yuxin Lin
- Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China
| | - Xiaojun Zhao
- Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China
| | - Zhijun Miao
- Department of Urology, Suzhou Dushuhu Public Hospital, Suzhou, 215123, China
| | - Zhixin Ling
- Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China
| | - Xuedong Wei
- Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China
| | - Jinxian Pu
- Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China
| | - Jianquan Hou
- Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China.
| | - Bairong Shen
- Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
12
|
Jean-Quartier C, Jeanquartier F, Holzinger A. Open Data for Differential Network Analysis in Glioma. Int J Mol Sci 2020; 21:E547. [PMID: 31952211 PMCID: PMC7013918 DOI: 10.3390/ijms21020547] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Revised: 12/29/2019] [Accepted: 01/03/2020] [Indexed: 12/20/2022] Open
Abstract
The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes. By using selected exemplary tools and open-access resources for cancer research and differential network analysis, we highlight disturbed signaling components in brain cancer subtypes of glioma.
Collapse
|
13
|
Affiliation(s)
- Tim Hulsen
- Department of Professional Health Solutions & Services, Philips Research, Eindhoven, The Netherlands. E-mail:
| |
Collapse
|
14
|
|