1
|
Tafavvoghi M, Bongo LA, Shvetsov N, Busund LTR, Møllersen K. Publicly available datasets of breast histopathology H&E whole-slide images: A scoping review. J Pathol Inform 2024; 15:100363. [PMID: 38405160 PMCID: PMC10884505 DOI: 10.1016/j.jpi.2024.100363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/24/2023] [Accepted: 01/23/2024] [Indexed: 02/27/2024] Open
Abstract
Advancements in digital pathology and computing resources have made a significant impact in the field of computational pathology for breast cancer diagnosis and treatment. However, access to high-quality labeled histopathological images of breast cancer is a big challenge that limits the development of accurate and robust deep learning models. In this scoping review, we identified the publicly available datasets of breast H&E-stained whole-slide images (WSIs) that can be used to develop deep learning algorithms. We systematically searched 9 scientific literature databases and 9 research data repositories and found 17 publicly available datasets containing 10 385 H&E WSIs of breast cancer. Moreover, we reported image metadata and characteristics for each dataset to assist researchers in selecting proper datasets for specific tasks in breast cancer computational pathology. In addition, we compiled 2 lists of breast H&E patches and private datasets as supplementary resources for researchers. Notably, only 28% of the included articles utilized multiple datasets, and only 14% used an external validation set, suggesting that the performance of other developed models may be susceptible to overestimation. The TCGA-BRCA was used in 52% of the selected studies. This dataset has a considerable selection bias that can impact the robustness and generalizability of the trained algorithms. There is also a lack of consistent metadata reporting of breast WSI datasets that can be an issue in developing accurate deep learning models, indicating the necessity of establishing explicit guidelines for documenting breast WSI dataset characteristics and metadata.
Collapse
Affiliation(s)
- Masoud Tafavvoghi
- Department of Community Medicine, Uit The Arctic University of Norway, Tromsø, Norway
| | - Lars Ailo Bongo
- Department of Computer Science, Uit The Arctic University of Norway, Tromsø, Norway
| | - Nikita Shvetsov
- Department of Computer Science, Uit The Arctic University of Norway, Tromsø, Norway
| | | | - Kajsa Møllersen
- Department of Community Medicine, Uit The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
2
|
Rakaee M, Andersen S, Giannikou K, Paulsen EE, Kilvaer TK, Busund LTR, Berg T, Richardsen E, Lombardi AP, Adib E, Pedersen MI, Tafavvoghi M, Wahl SGF, Petersen RH, Bondgaard AL, Yde CW, Baudet C, Licht P, Lund-Iversen M, Grønberg BH, Fjellbirkeland L, Helland Å, Pøhl M, Kwiatkowski DJ, Donnem T. Machine learning-based immune phenotypes correlate with STK11/KEAP1 co-mutations and prognosis in resectable NSCLC: a sub-study of the TNM-I trial. Ann Oncol 2023; 34:578-588. [PMID: 37100205 DOI: 10.1016/j.annonc.2023.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/13/2023] [Indexed: 04/28/2023] Open
Abstract
BACKGROUND We aim to implement an immune cell score model in routine clinical practice for resected non-small-cell lung cancer (NSCLC) patients (NCT03299478). Molecular and genomic features associated with immune phenotypes in NSCLC have not been explored in detail. PATIENTS AND METHODS We developed a machine learning (ML)-based model to classify tumors into one of three categories: inflamed, altered, and desert, based on the spatial distribution of CD8+ T cells in two prospective (n = 453; TNM-I trial) and retrospective (n = 481) stage I-IIIA NSCLC surgical cohorts. NanoString assays and targeted gene panel sequencing were used to evaluate the association of gene expression and mutations with immune phenotypes. RESULTS Among the total of 934 patients, 24.4% of tumors were classified as inflamed, 51.3% as altered, and 24.3% as desert. There were significant associations between ML-derived immune phenotypes and adaptive immunity gene expression signatures. We identified a strong association of the nuclear factor-κB pathway and CD8+ T-cell exclusion through a positive enrichment in the desert phenotype. KEAP1 [odds ratio (OR) 0.27, Q = 0.02] and STK11 (OR 0.39, Q = 0.04) were significantly co-mutated in non-inflamed lung adenocarcinoma (LUAD) compared to the inflamed phenotype. In the retrospective cohort, the inflamed phenotype was an independent prognostic factor for prolonged disease-specific survival and time to recurrence (hazard ratio 0.61, P = 0.01 and 0.65, P = 0.02, respectively). CONCLUSIONS ML-based immune phenotyping by spatial distribution of T cells in resected NSCLC is able to identify patients at greater risk of disease recurrence after surgical resection. LUADs with concurrent KEAP1 and STK11 mutations are enriched for altered and desert immune phenotypes.
Collapse
Affiliation(s)
- M Rakaee
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA; Department of Clinical Pathology, University Hospital of North Norway, Tromso; Department of Clinical Medicine, UiT The Arctic University of Norway, Tromso.
| | - S Andersen
- Department of Clinical Medicine, UiT The Arctic University of Norway, Tromso; Department of Oncology, University Hospital of North Norway, Tromso, Norway
| | - K Giannikou
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA; Division of Hematology and Oncology, Cancer and Blood Disease Institute, Children's Hospital Los Angeles, Los Angeles, USA
| | - E-E Paulsen
- Department of Clinical Medicine, UiT The Arctic University of Norway, Tromso; Department of Pulmonology, University Hospital of North Norway, Tromso
| | - T K Kilvaer
- Department of Clinical Medicine, UiT The Arctic University of Norway, Tromso; Department of Oncology, University Hospital of North Norway, Tromso, Norway
| | - L-T R Busund
- Department of Clinical Pathology, University Hospital of North Norway, Tromso; Department of Medical Biology, UiT The Arctic University of Norway, Tromso, Norway
| | - T Berg
- Department of Clinical Pathology, University Hospital of North Norway, Tromso; Department of Medical Biology, UiT The Arctic University of Norway, Tromso, Norway
| | - E Richardsen
- Department of Clinical Pathology, University Hospital of North Norway, Tromso; Department of Medical Biology, UiT The Arctic University of Norway, Tromso, Norway
| | - A P Lombardi
- Department of Medical Biology, UiT The Arctic University of Norway, Tromso, Norway
| | - E Adib
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA; Lank Center for Genitourinary Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, USA
| | - M I Pedersen
- Department of Clinical Medicine, UiT The Arctic University of Norway, Tromso
| | - M Tafavvoghi
- Department of Community Medicine, UiT The Arctic University of Norway, Tromso
| | - S G F Wahl
- Department of Oncology, St. Olav's Hospital, Trondheim University Hospital, Trondheim; Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
| | - R H Petersen
- Department of Cardiothoracic Surgery, Copenhagen University Hospital, Rigshospitalet, Copenhagen; Department of Clinical Medicine, University of Copenhagen, Copenhagen
| | - A L Bondgaard
- Department of Pathology, Copenhagen University Hospital, Rigshospitalet, Copenhagen
| | - C W Yde
- Center for Genomic Medicine, Copenhagen University Hospital, Rigshospitalet, Copenhagen
| | - C Baudet
- Center for Genomic Medicine, Copenhagen University Hospital, Rigshospitalet, Copenhagen
| | - P Licht
- Department of Cardiothoracic Surgery, Odense University Hospital, Odense, Denmark
| | - M Lund-Iversen
- Department of Pathology, The Norwegian Radium Hospital, Oslo University Hospital, Oslo
| | - B H Grønberg
- Department of Oncology, St. Olav's Hospital, Trondheim University Hospital, Trondheim; Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
| | - L Fjellbirkeland
- Department of Respiratory Medicine, Oslo University Hospital, University of Oslo, Oslo
| | - Å Helland
- Department of Cancer Genetics, Institute for Cancer Research, Norwegian Radium Hospital, Oslo University Hospital, Oslo; Department of Oncology, Oslo University Hospital, Oslo; Department of Clinical Medicine, University of Oslo, Oslo, Norway
| | - M Pøhl
- Department of Oncology, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - D J Kwiatkowski
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA; Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, USA
| | - T Donnem
- Department of Clinical Medicine, UiT The Arctic University of Norway, Tromso; Department of Oncology, University Hospital of North Norway, Tromso, Norway
| |
Collapse
|