1
|
Turkki R, Byckhov D, Lundin M, Isola J, Nordling S, Kovanen PE, Verrill C, von Smitten K, Joensuu H, Lundin J, Linder N. Breast cancer outcome prediction with tumour tissue images and machine learning. Breast Cancer Res Treat 2019; 177:41-52. [PMID: 31119567 PMCID: PMC6647903 DOI: 10.1007/s10549-019-05281-1] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Accepted: 05/16/2019] [Indexed: 12/12/2022]
Abstract
PURPOSE Recent advances in machine learning have enabled better understanding of large and complex visual data. Here, we aim to investigate patient outcome prediction with a machine learning method using only an image of tumour sample as an input. METHODS Utilising tissue microarray (TMA) samples obtained from the primary tumour of patients (N = 1299) within a nationwide breast cancer series with long-term-follow-up, we train and validate a machine learning method for patient outcome prediction. The prediction is performed by classifying samples into low or high digital risk score (DRS) groups. The outcome classifier is trained using sample images of 868 patients and evaluated and compared with human expert classification in a test set of 431 patients. RESULTS In univariate survival analysis, the DRS classification resulted in a hazard ratio of 2.10 (95% CI 1.33-3.32, p = 0.001) for breast cancer-specific survival. The DRS classification remained as an independent predictor of breast cancer-specific survival in a multivariate Cox model with a hazard ratio of 2.04 (95% CI 1.20-3.44, p = 0.007). The accuracy (C-index) of the DRS grouping was 0.60 (95% CI 0.55-0.65), as compared to 0.58 (95% CI 0.53-0.63) for human expert predictions based on the same TMA samples. CONCLUSIONS Our findings demonstrate the feasibility of learning prognostic signals in tumour tissue images without domain knowledge. Although further validation is needed, our study suggests that machine learning algorithms can extract prognostically relevant information from tumour histology complementing the currently used prognostic factors in breast cancer.
Collapse
Affiliation(s)
- Riku Turkki
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland. .,Science for Life Laboratory (SciLifeLab), Karolinska Institutet, Solna, Sweden.
| | - Dmitrii Byckhov
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Mikael Lundin
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Jorma Isola
- Department of Cancer Biology, BioMediTech, University of Tampere, Tampere, Finland
| | - Stig Nordling
- Department of Pathology, Medicum, University of Helsinki, Helsinki, Finland
| | - Panu E Kovanen
- HUSLAB and Medicum, Helsinki University Hospital Cancer Center and University of Helsinki, Helsinki, Finland
| | - Clare Verrill
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK.,NIHR Oxford Biomedical Research Centre, Oxford, UK
| | | | - Heikki Joensuu
- Department of Oncology, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
| | - Johan Lundin
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,Department of Public Health Sciences, Karolinska Institutet, Stockholm, Sweden
| | - Nina Linder
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,Department of Women's and Children's Health, International Maternal and Child health (IMCH), Uppsala University, Uppsala, Sweden
| |
Collapse
|
2
|
Popovici V, Budinská E, Čápková L, Schwarz D, Dušek L, Feit J, Jaggi R. Joint analysis of histopathology image features and gene expression in breast cancer. BMC Bioinformatics 2016; 17:209. [PMID: 27170365 PMCID: PMC4864935 DOI: 10.1186/s12859-016-1072-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Accepted: 05/04/2016] [Indexed: 02/08/2023] Open
Abstract
Background Genomics and proteomics are nowadays the dominant techniques for novel biomarker discovery. However, histopathology images contain a wealth of information related to the tumor histology, morphology and tumor-host interactions that is not accessible through these techniques. Thus, integrating the histopathology images in the biomarker discovery workflow could potentially lead to the identification of new image-based biomarkers and the refinement or even replacement of the existing genomic and proteomic signatures. However, extracting meaningful and robust image features to be mined jointly with genomic (and clinical, etc.) data represents a real challenge due to the complexity of the images. Results We developed a framework for integrating the histopathology images in the biomarker discovery workflow based on the bag-of-features approach – a method that has the advantage of being assumption-free and data-driven. The images were reduced to a set of salient patterns and additional measurements of their spatial distribution, with the resulting features being directly used in a standard biomarker discovery application. We demonstrated this framework in a search for prognostic biomarkers in breast cancer which resulted in the identification of several prognostic image features and a promising multimodal (imaging and genomic) prognostic signature. The source code for the image analysis procedures is freely available. Conclusions The framework proposed allows for a joint analysis of images and gene expression data. Its application to a set of breast cancer cases resulted in image-based and combined (image and genomic) prognostic scores for relapse-free survival. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1072-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Vlad Popovici
- Institute of Biostatistics and Analyses, Faculty of Medicine, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic.
| | - Eva Budinská
- Institute of Biostatistics and Analyses, Faculty of Medicine, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic.,RECETOX, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic
| | - Lenka Čápková
- Institute of Biostatistics and Analyses, Faculty of Medicine, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic
| | - Daniel Schwarz
- Institute of Biostatistics and Analyses, Faculty of Medicine, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic
| | - Ladislav Dušek
- Institute of Biostatistics and Analyses, Faculty of Medicine, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic
| | - Josef Feit
- Institute of Biostatistics and Analyses, Faculty of Medicine, Masarykova Univerzita, Kamenice 5, Brno, 62500, Czech Republic
| | - Rolf Jaggi
- Department of Clinical Research, Faculty of Medicine, University of Bern, Bern, Switzerland
| |
Collapse
|
3
|
Current world literature. Curr Opin Oncol 2012; 24:756-68. [PMID: 23079785 DOI: 10.1097/cco.0b013e32835a4c91] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
|