1
|
Dunn C, Brettle D, Hodgson C, Hughes R, Treanor D. An international study of stain variability in histopathology using qualitative and quantitative analysis. J Pathol Inform 2025; 17:100423. [PMID: 40145070 PMCID: PMC11938143 DOI: 10.1016/j.jpi.2025.100423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2024] [Revised: 01/06/2025] [Accepted: 02/07/2025] [Indexed: 03/28/2025] Open
Abstract
Hematoxylin and eosin (H&E) staining accounts for over 80% of slides stained worldwide. Although routinely used, there are high levels of variation between labs due to different staining methods. Staining is a pivotal part of slide preparation, but quality control is largely subjective, with overall clinical assurance provided by external quality assessment (EQA) services, underpinned by expert assessment. Digital pathology offers the potential to provide objective quantification of stain, through color analysis, to augment EQA assessment. This large-scale study evaluated H&E staining in 247 international labs participating in the UK NEQAS CPT EQA programme. Tissue sections were circulated to each lab to stain using their routine H&E staining protocol. The slides were reviewed by independent expert UK NEQAS CPT assessors, and quantitative digital analysis was conducted, comprising of H&E color deconvolution and color difference determination (ΔE). Most labs (69%) achieved an EQA score indicating good or excellent staining, with high inter-observer concordance to support this (92.5% within one mark of each other). H&E color difference, ΔE, showed 60% of labs were within 2 ΔE of the mean, which is considered as only perceptible through close observation. There was little correlation found between H&E intensity and assessor score, however, the H&E intensity ratio indicated a trend with assessor score suggesting there may be an optimal stain relationship that should be investigated further. The presented hybrid analysis combines expert analysis with objective data. This has the potential to inform upon optimal tissue staining and allows us to consider quantitative standards of H&E staining in pathology practice.
Collapse
Affiliation(s)
- Catriona Dunn
- National Pathology Imaging Co-operative, Leeds Teaching Hospitals NHS Trust, Beckett Street, Leeds, UK
| | - David Brettle
- National Pathology Imaging Co-operative, Leeds Teaching Hospitals NHS Trust, Beckett Street, Leeds, UK
| | - Chantell Hodgson
- UK NEQAS Cellular Pathology Technique, Haylofts, St Thomas Street, Haymarket, Newcastle, UK
| | - Robert Hughes
- UK NEQAS Cellular Pathology Technique, Haylofts, St Thomas Street, Haymarket, Newcastle, UK
| | - Darren Treanor
- National Pathology Imaging Co-operative, Leeds Teaching Hospitals NHS Trust, Beckett Street, Leeds, UK
- Department of Histopathology, Leeds Teaching Hospitals NHS Trust, Beckett Street, Leeds, UK
- Department of Pathology and Data Analytics, University of Leeds, Beckett Street, Leeds, UK
- Department of Clinical Pathology and Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Centre for Medical Image Science and Visualisation, Linköping University, Linköping, Sweden
| |
Collapse
|
2
|
Ahmed SR, Befano B, Egemen D, Rodriguez AC, Desai KT, Jeronimo J, Ajenifuja KO, Clark C, Perkins R, Campos NG, Inturrisi F, Wentzensen N, Han P, Guillen D, Norman J, Goldstein AT, Madeleine MM, Donastorg Y, Schiffman M, de Sanjose S, Kalpathy-Cramer J. Generalizable deep neural networks for image quality classification of cervical images. Sci Rep 2025; 15:6312. [PMID: 39984572 PMCID: PMC11845747 DOI: 10.1038/s41598-025-90024-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Accepted: 02/10/2025] [Indexed: 02/23/2025] Open
Abstract
Successful translation of artificial intelligence (AI) models into clinical practice, across clinical domains, is frequently hindered by the lack of image quality control. Diagnostic models are often trained on images with no denotation of image quality in the training data; this, in turn, can lead to misclassifications by these models when implemented in the clinical setting. In the case of cervical images, quality classification is a crucial task to ensure accurate detection of precancerous lesions or cancer; this is true for both gynecologic-oncologists' (manual) and diagnostic AI models' (automated) predictions. Factors that impact the quality of a cervical image include but are not limited to blur, poor focus, poor light, noise, obscured view of the cervix due to mucus and/or blood, improper position, and over- and/or under-exposure. Utilizing a multi-level image quality ground truth denoted by providers, we generated an image quality classifier following a multi-stage model selection process that investigated several key design choices on a multi-heterogenous "SEED" dataset of 40,534 images. We subsequently validated the best model on an external dataset ("EXT"), comprising 1,340 images captured using a different device and acquired in different geographies from "SEED". We assessed the relative impact of various axes of data heterogeneity, including device, geography, and ground-truth rater on model performance. Our best performing model achieved an area under the receiver operating characteristics curve (AUROC) of 0.92 (low quality, LQ vs. rest) and 0.93 (high quality, HQ vs. rest), and a minimal total %extreme misclassification (%EM) of 2.8% on the internal validation set. Our model also generalized well externally, achieving corresponding AUROCs of 0.83 and 0.82, and %EM of 3.9% when tested out-of-the-box on the external validation ("EXT") set. Additionally, our model was geography agnostic with no meaningful difference in performance across geographies, did not exhibit catastrophic forgetting upon retraining with new data, and mimicked the overall/average ground truth rater behavior well. Our work represents one of the first efforts at generating and externally validating an image quality classifier across multiple axes of data heterogeneity to aid in visual diagnosis of cervical precancer and cancer. We hope that this will motivate the accompaniment of adequate guardrails for AI-based pipelines to account for image quality and generalizability concerns.
Collapse
Affiliation(s)
- Syed Rakin Ahmed
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA, 02129, USA.
- Harvard Graduate Program in Biophysics, Harvard Medical School, Harvard University, Cambridge, MA, 02115, USA.
- Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
- Geisel School of Medicine at Dartmouth, Dartmouth College, Hanover, NH, 03755, USA.
| | - Brian Befano
- Information Management Services, Calverton, MD, 20705, USA
- University of Washington, Seattle, WA, 98195, USA
| | - Didem Egemen
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Ana Cecilia Rodriguez
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Kanan T Desai
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Jose Jeronimo
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Kayode O Ajenifuja
- Department of Obstetrics and Gynecology, Obafemi Awolowo University Teaching Hospital, Ile Ife, Nigeria
| | - Christopher Clark
- Department of Ophthalmology, University of Colorado Anschutz, Denver, CO, 80045, USA
| | - Rebecca Perkins
- Department of Obstetrics and Gynecology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, 02118, USA
| | - Nicole G Campos
- Center for Health Decision Science, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Federica Inturrisi
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Nicolas Wentzensen
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Paul Han
- Division of Cancer Control and Population Sciences, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | | | - Judy Norman
- Women's Health, Mercy Medical Center, Phnom Penh, Cambodia
| | | | | | - Yeycy Donastorg
- HIV Vaccine Trials Research Unit, Instituto Dermatológico y Cirugía de la Piel "Dr. Huberto Bogaert Díaz", Santo Domingo, Dominican Republic
| | - Mark Schiffman
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Silvia de Sanjose
- Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
- ISGlobal, Barcelona, Spain
| | - Jayashree Kalpathy-Cramer
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA, 02129, USA
- Department of Ophthalmology, University of Colorado Anschutz, Denver, CO, 80045, USA
| |
Collapse
|
3
|
Sun AK, Fan S, Choi SW. Exploring Multiplex Immunohistochemistry (mIHC) Techniques and Histopathology Image Analysis: Current Practice and Potential for Clinical Incorporation. Cancer Med 2025; 14:e70523. [PMID: 39764760 PMCID: PMC11705464 DOI: 10.1002/cam4.70523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 08/10/2024] [Accepted: 12/10/2024] [Indexed: 01/11/2025] Open
Abstract
BACKGROUND By simultaneously staining multiple immunomarkers on a single tissue section, multiplexed immunohistochemistry (mIHC) enhances the amount of information that can be observed in a single tissue section and thus can be a powerful tool to visualise cellular interactions directly in the tumour microenvironment. Performing mIHC remains technically and practically challenging, and this technique has many limitations if not properly validated. However, with proper validation, heterogeneity between histopathological images can be avoided. AIMS This review aimed to summarize the currently used methods and to propose a standardised method for effective mIHC. MATERIALS AND METHODS An extensive literature review was conducted to identify different methods currently in use for mIHC. RESULTS Guidelines for antibody selection, panel design, antibody validation and analytical strategies are given. The advantages and disadvantages of each method are discussed. CONCLUSION This review summarizes widely used pathology imaging software and discusses the potential for automation of pathology image analysis so that mIHC technology can be a truly powerful tool for research as well as clinical use.
Collapse
Affiliation(s)
- Aria Kaiyuan Sun
- Department of Anaesthesiology, School of Clinical Medicine, Faculty of MedicineThe University of Hong KongHong KongHong Kong
| | - Song Fan
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene RegulationSun Yat‐Sen Memorial HospitalGuangzhouChina
| | - Siu Wai Choi
- Department of Orthopaedics and Traumatology, School of Clinical Medicine, Faculty of MedicineThe University of Hong KongHong KongHong Kong
| |
Collapse
|
4
|
Broad A, Wright A, McGenity C, Treanor D, de Kamps M. Object-based feedback attention in convolutional neural networks improves tumour detection in digital pathology. Sci Rep 2024; 14:30400. [PMID: 39638839 PMCID: PMC11621113 DOI: 10.1038/s41598-024-80717-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Accepted: 11/21/2024] [Indexed: 12/07/2024] Open
Abstract
Human visual attention allows prior knowledge or expectations to influence visual processing, allocating limited computational resources to only that part of the image that are likely to behaviourally important. Here, we present an image recognition system based on biological vision that guides attention to more informative locations within a larger parent image, using a sequence of saccade-like motions. We demonstrate that at the end of the saccade sequence the system has an improved classification ability compared to the convolutional neural network (CNN) that represents the feedforward part of the model. Feedback activations highlight salient image features supporting the explainability of the classification. Our attention model deviates substantially from more common feedforward attention mechanisms, which linearly reweight part of the input. This model uses several passes of feedforward and backward activation, which interact non-linearly. We apply our feedback architecture to histopathology patch images, demonstrating a 3.5% improvement in accuracy (p < 0.001) when retrospectively processing 59,057 9-class patches from 689 colorectal cancer WSIs. In the saccade implementation, overall agreement between expert-labelled patches and model prediction reached 93.23% for tumour tissue, surpassing inter-pathologist agreement. Our method is adaptable to other areas of science which rely on the analysis of extremely large-scale images.
Collapse
Affiliation(s)
- Andrew Broad
- School of Computing, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- National Pathology Imaging Cooperative, Leeds, UK
| | - Alexander Wright
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Division of Pathology and Data Analytics, Leeds Institute of Medical Research, University of Leeds, Leeds, UK
- National Pathology Imaging Cooperative, Leeds, UK
| | - Clare McGenity
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Division of Pathology and Data Analytics, Leeds Institute of Medical Research, University of Leeds, Leeds, UK
- National Pathology Imaging Cooperative, Leeds, UK
| | - Darren Treanor
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- National Pathology Imaging Cooperative, Leeds, UK
- University of Leeds, Leeds, UK
- Department of Clinical Pathology, Linköping University, Linköping, Sweden
- Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Center for Medical Image Science and Visualization (CMIV), Linköping University, Linköping, Sweden
| | - Marc de Kamps
- School of Computing, University of Leeds, Leeds, UK.
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK.
| |
Collapse
|
5
|
Hosseini MS, Bejnordi BE, Trinh VQH, Chan L, Hasan D, Li X, Yang S, Kim T, Zhang H, Wu T, Chinniah K, Maghsoudlou S, Zhang R, Zhu J, Khaki S, Buin A, Chaji F, Salehi A, Nguyen BN, Samaras D, Plataniotis KN. Computational pathology: A survey review and the way forward. J Pathol Inform 2024; 15:100357. [PMID: 38420608 PMCID: PMC10900832 DOI: 10.1016/j.jpi.2023.100357] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Revised: 12/21/2023] [Accepted: 12/23/2023] [Indexed: 03/02/2024] Open
Abstract
Computational Pathology (CPath) is an interdisciplinary science that augments developments of computational approaches to analyze and model medical histopathology images. The main objective for CPath is to develop infrastructure and workflows of digital diagnostics as an assistive CAD system for clinical pathology, facilitating transformational changes in the diagnosis and treatment of cancer that are mainly address by CPath tools. With evergrowing developments in deep learning and computer vision algorithms, and the ease of the data flow from digital pathology, currently CPath is witnessing a paradigm shift. Despite the sheer volume of engineering and scientific works being introduced for cancer image analysis, there is still a considerable gap of adopting and integrating these algorithms in clinical practice. This raises a significant question regarding the direction and trends that are undertaken in CPath. In this article we provide a comprehensive review of more than 800 papers to address the challenges faced in problem design all-the-way to the application and implementation viewpoints. We have catalogued each paper into a model-card by examining the key works and challenges faced to layout the current landscape in CPath. We hope this helps the community to locate relevant works and facilitate understanding of the field's future directions. In a nutshell, we oversee the CPath developments in cycle of stages which are required to be cohesively linked together to address the challenges associated with such multidisciplinary science. We overview this cycle from different perspectives of data-centric, model-centric, and application-centric problems. We finally sketch remaining challenges and provide directions for future technical developments and clinical integration of CPath. For updated information on this survey review paper and accessing to the original model cards repository, please refer to GitHub. Updated version of this draft can also be found from arXiv.
Collapse
Affiliation(s)
- Mahdi S. Hosseini
- Department of Computer Science and Software Engineering (CSSE), Concordia Univeristy, Montreal, QC H3H 2R9, Canada
| | | | - Vincent Quoc-Huy Trinh
- Institute for Research in Immunology and Cancer of the University of Montreal, Montreal, QC H3T 1J4, Canada
| | - Lyndon Chan
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Danial Hasan
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Xingwen Li
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Stephen Yang
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Taehyo Kim
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Haochen Zhang
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Theodore Wu
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Kajanan Chinniah
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Sina Maghsoudlou
- Department of Computer Science and Software Engineering (CSSE), Concordia Univeristy, Montreal, QC H3H 2R9, Canada
| | - Ryan Zhang
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Jiadai Zhu
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Samir Khaki
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Andrei Buin
- Huron Digitial Pathology, St. Jacobs, ON N0B 2N0, Canada
| | - Fatemeh Chaji
- Department of Computer Science and Software Engineering (CSSE), Concordia Univeristy, Montreal, QC H3H 2R9, Canada
| | - Ala Salehi
- Department of Electrical and Computer Engineering, University of New Brunswick, Fredericton, NB E3B 5A3, Canada
| | - Bich Ngoc Nguyen
- University of Montreal Hospital Center, Montreal, QC H2X 0C2, Canada
| | - Dimitris Samaras
- Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, United States
| | - Konstantinos N. Plataniotis
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| |
Collapse
|
6
|
Meißner AK, Blau T, Reinecke D, Fürtjes G, Leyer L, Müller N, von Spreckelsen N, Stehle T, Al Shugri A, Büttner R, Goldbrunner R, Timmer M, Neuschmelting V. Image Quality Assessment and Reliability Analysis of Artificial Intelligence-Based Tumor Classification of Stimulated Raman Histology of Tumor Biobank Samples. Diagnostics (Basel) 2024; 14:2701. [PMID: 39682609 DOI: 10.3390/diagnostics14232701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 11/21/2024] [Accepted: 11/26/2024] [Indexed: 12/18/2024] Open
Abstract
BACKGROUND Stimulated Raman histology (SRH) is a label-free optical imaging method for rapid intraoperative analysis of fresh tissue samples. Analysis of SRH images using Convolutional Neural Networks (CNN) has shown promising results for predicting the main histopathological classes of neurooncological tumors. Due to the relatively low number of rare tumor representations in CNN training datasets, a valid prediction of rarer entities remains limited. To develop new reliable analysis tools, larger datasets and greater tumor variety are crucial. One way to accomplish this is through research biobanks storing frozen tumor tissue samples. However, there is currently no data available regarding the pertinency of previously frozen tissue samples for SRH analysis. The aim of this study was to assess image quality and perform a comparative reliability analysis of artificial intelligence-based tumor classification using SRH in fresh and frozen tissue samples. METHODS In a monocentric prospective study, tissue samples from 25 patients undergoing brain tumor resection were obtained. SRH was acquired in fresh and defrosted samples of the same specimen after varying storage durations at -80 °C. Image quality was rated by an experienced neuropathologist, and prediction of histopathological diagnosis was performed using two established CNNs. RESULTS The image quality of SRH in fresh and defrosted tissue samples was high, with a mean image quality score of 1.96 (range 1-5) for both groups. CNN analysis showed high internal consistency for histo-(Cα 0.95) and molecular (Cα 0.83) pathological tumor classification. The results were confirmed using a dataset with samples from the local tumor biobank (Cα 0.91 and 0.53). CONCLUSIONS Our results showed that SRH appears comparably reliable in fresh and frozen tissue samples, enabling the integration of tumor biobank specimens to potentially improve the diagnostic range and reliability of CNN prediction tools.
Collapse
Affiliation(s)
- Anna-Katharina Meißner
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Tobias Blau
- Institute for Neuropathology, University of Duisburg-Essen, 45141 Essen, Germany
| | - David Reinecke
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Gina Fürtjes
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Lili Leyer
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Nina Müller
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Niklas von Spreckelsen
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
- Institute for Neuropathology, University of Duisburg-Essen, 45141 Essen, Germany
- Department of Neurosurgery, Westküstenklinikum Heide, 25746 Heide, Germany
| | - Thomas Stehle
- Institute for Neuropathology, Faculty of Medicine, University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Abdulkader Al Shugri
- Institute for Neuropathology, Faculty of Medicine, University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Reinhard Büttner
- Department of Pathology, Faculty of Medicine, University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Roland Goldbrunner
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Marco Timmer
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Volker Neuschmelting
- Department of General Neurosurgery, Center for Neurosurgery, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| |
Collapse
|
7
|
Matthews GA, McGenity C, Bansal D, Treanor D. Public evidence on AI products for digital pathology. NPJ Digit Med 2024; 7:300. [PMID: 39455883 PMCID: PMC11511888 DOI: 10.1038/s41746-024-01294-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 10/08/2024] [Indexed: 10/28/2024] Open
Abstract
Novel products applying artificial intelligence (AI)-based methods to digital pathology images are touted to have many uses and benefits. However, publicly available information for products can be variable, with few sources of independent evidence. This review aimed to identify public evidence for AI-based products for digital pathology. Key features of products on the European Economic Area/Great Britain (EEA/GB) markets were examined, including their regulatory approval, intended use, and published validation studies. There were 26 AI-based products that met the inclusion criteria and, of these, 24 had received regulatory approval via the self-certification route as General in vitro diagnostic (IVD) medical devices. Only 10 of the products (38%) had peer-reviewed internal validation studies and 11 products (42%) had peer-reviewed external validation studies. To support transparency an online register was developed using identified public evidence ( https://osf.io/gb84r/ ), which we anticipate will provide an accessible resource on novel devices and support decision making.
Collapse
Affiliation(s)
| | - Clare McGenity
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | | | - Darren Treanor
- Leeds Teaching Hospitals NHS Trust, Leeds, UK.
- University of Leeds, Leeds, UK.
- Department of Clinical Pathology & Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden.
- Centre for Medical Image Science and Visualization (CMIV), Linköping University, Linköping, Sweden.
| |
Collapse
|
8
|
Kanwal N, Khoraminia F, Kiraz U, Mosquera-Zamudio A, Monteagudo C, Janssen EAM, Zuiverloon TCM, Rong C, Engan K. Equipping computational pathology systems with artifact processing pipelines: a showcase for computation and performance trade-offs. BMC Med Inform Decis Mak 2024; 24:288. [PMID: 39375719 PMCID: PMC11457387 DOI: 10.1186/s12911-024-02676-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Accepted: 09/09/2024] [Indexed: 10/09/2024] Open
Abstract
BACKGROUND Histopathology is a gold standard for cancer diagnosis. It involves extracting tissue specimens from suspicious areas to prepare a glass slide for a microscopic examination. However, histological tissue processing procedures result in the introduction of artifacts, which are ultimately transferred to the digitized version of glass slides, known as whole slide images (WSIs). Artifacts are diagnostically irrelevant areas and may result in wrong predictions from deep learning (DL) algorithms. Therefore, detecting and excluding artifacts in the computational pathology (CPATH) system is essential for reliable automated diagnosis. METHODS In this paper, we propose a mixture of experts (MoE) scheme for detecting five notable artifacts, including damaged tissue, blur, folded tissue, air bubbles, and histologically irrelevant blood from WSIs. First, we train independent binary DL models as experts to capture particular artifact morphology. Then, we ensemble their predictions using a fusion mechanism. We apply probabilistic thresholding over the final probability distribution to improve the sensitivity of the MoE. We developed four DL pipelines to evaluate computational and performance trade-offs. These include two MoEs and two multiclass models of state-of-the-art deep convolutional neural networks (DCNNs) and vision transformers (ViTs). These DL pipelines are quantitatively and qualitatively evaluated on external and out-of-distribution (OoD) data to assess generalizability and robustness for artifact detection application. RESULTS We extensively evaluated the proposed MoE and multiclass models. DCNNs-based MoE and ViTs-based MoE schemes outperformed simpler multiclass models and were tested on datasets from different hospitals and cancer types, where MoE using (MobileNet) DCNNs yielded the best results. The proposed MoE yields 86.15 % F1 and 97.93% sensitivity scores on unseen data, retaining less computational cost for inference than MoE using ViTs. This best performance of MoEs comes with relatively higher computational trade-offs than multiclass models. Furthermore, we apply post-processing to create an artifact segmentation mask, a potential artifact-free RoI map, a quality report, and an artifact-refined WSI for further computational analysis. During the qualitative evaluation, field experts assessed the predictive performance of MoEs over OoD WSIs. They rated artifact detection and artifact-free area preservation, where the highest agreement translated to a Cohen Kappa of 0.82, indicating substantial agreement for the overall diagnostic usability of the DCNN-based MoE scheme. CONCLUSIONS The proposed artifact detection pipeline will not only ensure reliable CPATH predictions but may also provide quality control. In this work, the best-performing pipeline for artifact detection is MoE with DCNNs. Our detailed experiments show that there is always a trade-off between performance and computational complexity, and no straightforward DL solution equally suits all types of data and applications. The code and HistoArtifacts dataset can be found online at Github and Zenodo , respectively.
Collapse
Affiliation(s)
- Neel Kanwal
- Department of Electrical Engineering and Computer Science, University of Stavanger, 4021, Stavanger, Norway.
| | - Farbod Khoraminia
- Department of Urology, University Medical Center Rotterdam, Erasmus MC Cancer Institute, 1035 GD, Rotterdam, The Netherlands
| | - Umay Kiraz
- Department of Pathology, Stavanger University Hospital, 4011, Stavanger, Norway
- Department of Chemistry, Bioscience and Environmental Engineering, University of Stavanger, 4021, Stavanger, Norway
| | - Andrés Mosquera-Zamudio
- Department of Pathology, INCLIVA Biomedical Research Institute, and University of Valencia, 46010, Valencia, Spain
| | - Carlos Monteagudo
- Department of Pathology, INCLIVA Biomedical Research Institute, and University of Valencia, 46010, Valencia, Spain
| | - Emiel A M Janssen
- Department of Pathology, Stavanger University Hospital, 4011, Stavanger, Norway
- Department of Chemistry, Bioscience and Environmental Engineering, University of Stavanger, 4021, Stavanger, Norway
| | - Tahlita C M Zuiverloon
- Department of Urology, University Medical Center Rotterdam, Erasmus MC Cancer Institute, 1035 GD, Rotterdam, The Netherlands
| | - Chunming Rong
- Department of Electrical Engineering and Computer Science, University of Stavanger, 4021, Stavanger, Norway
| | - Kjersti Engan
- Department of Electrical Engineering and Computer Science, University of Stavanger, 4021, Stavanger, Norway.
| |
Collapse
|
9
|
Humphries MP, Kaye D, Stankeviciute G, Halliwell J, Wright AI, Bansal D, Brettle D, Treanor D. Development of a multi-scanner facility for data acquisition for digital pathology artificial intelligence. J Pathol 2024; 264:80-89. [PMID: 38984400 DOI: 10.1002/path.6326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 04/22/2024] [Accepted: 05/31/2024] [Indexed: 07/11/2024]
Abstract
Whole slide imaging (WSI) of pathology glass slides using high-resolution scanners has enabled the large-scale application of artificial intelligence (AI) in pathology, to support the detection and diagnosis of disease, potentially increasing efficiency and accuracy in tissue diagnosis. Despite the promise of AI, it has limitations. 'Brittleness' or sensitivity to variation in inputs necessitates that large amounts of data are used for training. AI is often trained on data from different scanners but not usually by replicating the same slide across scanners. The utilisation of multiple WSI instruments to produce digital replicas of the same slides will make more comprehensive datasets and may improve the robustness and generalisability of AI algorithms as well as reduce the overall data requirements of AI training. To this end, the National Pathology Imaging Cooperative (NPIC) has built the AI FORGE (Facilitating Opportunities for Robust Generalisable data Emulation), a unique multi-scanner facility embedded in a clinical site in the NHS to (1) compare scanner performance, (2) replicate digital pathology image datasets across WSI systems, and (3) support the evaluation of clinical AI algorithms. The NPIC AI FORGE currently comprises 15 scanners from nine manufacturers. It can generate approximately 4,000 WSI images per day (approximately 7 TB of image data). This paper describes the process followed to plan and build such a facility. © 2024 The Author(s). The Journal of Pathology published by John Wiley & Sons Ltd on behalf of The Pathological Society of Great Britain and Ireland.
Collapse
Affiliation(s)
- Matthew P Humphries
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - Danny Kaye
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - Gaby Stankeviciute
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - Jacob Halliwell
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - Alexander I Wright
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - Daljeet Bansal
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - David Brettle
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
| | - Darren Treanor
- National Pathology Imaging Cooperative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
- Department of Histopathology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Department of Clinical Pathology and Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Centre for Medical Image Science and Visualization (CMIV), Linköping University, Linköping, Sweden
| |
Collapse
|
10
|
Chang J, Hatfield B. Advancements in computer vision and pathology: Unraveling the potential of artificial intelligence for precision diagnosis and beyond. Adv Cancer Res 2024; 161:431-478. [PMID: 39032956 DOI: 10.1016/bs.acr.2024.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/23/2024]
Abstract
The integration of computer vision into pathology through slide digitalization represents a transformative leap in the field's evolution. Traditional pathology methods, while reliable, are often time-consuming and susceptible to intra- and interobserver variability. In contrast, computer vision, empowered by artificial intelligence (AI) and machine learning (ML), promises revolutionary changes, offering consistent, reproducible, and objective results with ever-increasing speed and scalability. The applications of advanced algorithms and deep learning architectures like CNNs and U-Nets augment pathologists' diagnostic capabilities, opening new frontiers in automated image analysis. As these technologies mature and integrate into digital pathology workflows, they are poised to provide deeper insights into disease processes, quantify and standardize biomarkers, enhance patient outcomes, and automate routine tasks, reducing pathologists' workload. However, this transformative force calls for cross-disciplinary collaboration between pathologists, computer scientists, and industry innovators to drive research and development. While acknowledging its potential, this chapter addresses the limitations of AI in pathology, encompassing technical, practical, and ethical considerations during development and implementation.
Collapse
Affiliation(s)
- Justin Chang
- Virginia Commonwealth University Health System, Richmond, VA, United States
| | - Bryce Hatfield
- Virginia Commonwealth University Health System, Richmond, VA, United States.
| |
Collapse
|
11
|
Browning L, Jesus C, Malacrino S, Guan Y, White K, Puddle A, Alham NK, Haghighat M, Colling R, Birks J, Rittscher J, Verrill C. Artificial Intelligence-Based Quality Assessment of Histopathology Whole-Slide Images within a Clinical Workflow: Assessment of 'PathProfiler' in a Diagnostic Pathology Setting. Diagnostics (Basel) 2024; 14:990. [PMID: 38786288 PMCID: PMC11120465 DOI: 10.3390/diagnostics14100990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 04/17/2024] [Accepted: 04/28/2024] [Indexed: 05/25/2024] Open
Abstract
Digital pathology continues to gain momentum, with the promise of artificial intelligence to aid diagnosis and for assessment of features which may impact prognosis and clinical management. Successful adoption of these technologies depends upon the quality of digitised whole-slide images (WSI); however, current quality control largely depends upon manual assessment, which is inefficient and subjective. We previously developed PathProfiler, an automated image quality assessment tool, and in this feasibility study we investigate its potential for incorporation into a diagnostic clinical pathology setting in real-time. A total of 1254 genitourinary WSI were analysed by PathProfiler. PathProfiler was developed and trained on prostate tissue and, of the prostate biopsy WSI, representing 46% of the WSI analysed, 4.5% were flagged as potentially being of suboptimal quality for diagnosis. All had concordant subjective issues, mainly focus-related, 54% severe enough to warrant remedial action which resulted in improved image quality. PathProfiler was less reliable in assessment of non-prostate surgical resection-type cases, on which it had not been trained. PathProfiler shows potential for incorporation into a digitised clinical pathology workflow, with opportunity for image quality improvement. Whilst its reliability in the current form appears greatest for assessment of prostate specimens, other specimen types, particularly biopsies, also showed benefit.
Collapse
Affiliation(s)
- Lisa Browning
- Department of Cellular Pathology, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
| | - Christine Jesus
- Department of Cellular Pathology, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
| | - Stefano Malacrino
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford OX3 9DU, UK
| | - Yue Guan
- Department of Cellular Pathology, Royal Berkshire Hospital, Royal Berkshire NHS Foundation Trust, Reading RG1 5AN, UK
| | - Kieron White
- Department of Cellular Pathology, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
| | - Alison Puddle
- Department of Cellular Pathology, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
| | | | - Maryam Haghighat
- School of Electrical Engineering and Robotics, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Richard Colling
- Department of Cellular Pathology, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford OX3 9DU, UK
| | - Jacqueline Birks
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Jens Rittscher
- Department of Engineering Science, University of Oxford, Oxford OX1 3PJ, UK
- NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
| | - Clare Verrill
- Department of Cellular Pathology, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford OX3 9DU, UK
- NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford OX3 9DU, UK
| |
Collapse
|
12
|
Irmakci I, Nateghi R, Zhou R, Vescovo M, Saft M, Ross AE, Yang XJ, Cooper LAD, Goldstein JA. Tissue Contamination Challenges the Credibility of Machine Learning Models in Real World Digital Pathology. Mod Pathol 2024; 37:100422. [PMID: 38185250 PMCID: PMC10960671 DOI: 10.1016/j.modpat.2024.100422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 11/13/2023] [Accepted: 12/15/2023] [Indexed: 01/09/2024]
Abstract
Machine learning (ML) models are poised to transform surgical pathology practice. The most successful use attention mechanisms to examine whole slides, identify which areas of tissue are diagnostic, and use them to guide diagnosis. Tissue contaminants, such as floaters, represent unexpected tissue. Although human pathologists are extensively trained to consider and detect tissue contaminants, we examined their impact on ML models. We trained 4 whole-slide models. Three operate in placenta for the following functions: (1) detection of decidual arteriopathy, (2) estimation of gestational age, and (3) classification of macroscopic placental lesions. We also developed a model to detect prostate cancer in needle biopsies. We designed experiments wherein patches of contaminant tissue are randomly sampled from known slides and digitally added to patient slides and measured model performance. We measured the proportion of attention given to contaminants and examined the impact of contaminants in the t-distributed stochastic neighbor embedding feature space. Every model showed performance degradation in response to one or more tissue contaminants. Decidual arteriopathy detection--balanced accuracy decreased from 0.74 to 0.69 ± 0.01 with addition of 1 patch of prostate tissue for every 100 patches of placenta (1% contaminant). Bladder, added at 10% contaminant, raised the mean absolute error in estimating gestational age from 1.626 weeks to 2.371 ± 0.003 weeks. Blood, incorporated into placental sections, induced false-negative diagnoses of intervillous thrombi. Addition of bladder to prostate cancer needle biopsies induced false positives, a selection of high-attention patches, representing 0.033 mm2, and resulted in a 97% false-positive rate when added to needle biopsies. Contaminant patches received attention at or above the rate of the average patch of patient tissue. Tissue contaminants induce errors in modern ML models. The high level of attention given to contaminants indicates a failure to encode biological phenomena. Practitioners should move to quantify and ameliorate this problem.
Collapse
Affiliation(s)
- Ismail Irmakci
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Ramin Nateghi
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Rujoi Zhou
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Mariavittoria Vescovo
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Madeline Saft
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Ashley E Ross
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Ximing J Yang
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Lee A D Cooper
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois
| | - Jeffery A Goldstein
- Department of Pathology, Northwestern University, Feinberg School of Medicine, Chicago, Illinois.
| |
Collapse
|
13
|
Kanwal N, López-Pérez M, Kiraz U, Zuiverloon TCM, Molina R, Engan K. Are you sure it's an artifact? Artifact detection and uncertainty quantification in histological images. Comput Med Imaging Graph 2024; 112:102321. [PMID: 38199127 DOI: 10.1016/j.compmedimag.2023.102321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 11/08/2023] [Accepted: 12/12/2023] [Indexed: 01/12/2024]
Abstract
Modern cancer diagnostics involves extracting tissue specimens from suspicious areas and conducting histotechnical procedures to prepare a digitized glass slide, called Whole Slide Image (WSI), for further examination. These procedures frequently introduce different types of artifacts in the obtained WSI, and histological artifacts might influence Computational Pathology (CPATH) systems further down to a diagnostic pipeline if not excluded or handled. Deep Convolutional Neural Networks (DCNNs) have achieved promising results for the detection of some WSI artifacts, however, they do not incorporate uncertainty in their predictions. This paper proposes an uncertainty-aware Deep Kernel Learning (DKL) model to detect blurry areas and folded tissues, two types of artifacts that can appear in WSIs. The proposed probabilistic model combines a CNN feature extractor and a sparse Gaussian Processes (GPs) classifier, which improves the performance of current state-of-the-art artifact detection DCNNs and provides uncertainty estimates. We achieved 0.996 and 0.938 F1 scores for blur and folded tissue detection on unseen data, respectively. In extensive experiments, we validated the DKL model on unseen data from external independent cohorts with different staining and tissue types, where it outperformed DCNNs. Interestingly, the DKL model is more confident in the correct predictions and less in the wrong ones. The proposed DKL model can be integrated into the preprocessing pipeline of CPATH systems to provide reliable predictions and possibly serve as a quality control tool.
Collapse
Affiliation(s)
- Neel Kanwal
- Department of Electrical Engineering and Computer Science, University of Stavanger, 4021 Stavanger, Norway.
| | - Miguel López-Pérez
- Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain
| | - Umay Kiraz
- Department of Pathology, Stavanger University Hospital, 4011 Stavanger, Norway; Department of Chemistry, Bioscience and Environmental Engineering, University of Stavanger, 4021 Stavanger, Norway
| | - Tahlita C M Zuiverloon
- Department of Urology, University Medical Center Rotterdam, Erasmus MC Cancer Institute, 1035 GD Rotterdam, The Netherlands
| | - Rafael Molina
- Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain
| | - Kjersti Engan
- Department of Electrical Engineering and Computer Science, University of Stavanger, 4021 Stavanger, Norway
| |
Collapse
|
14
|
Dunn C, Brettle D, Cockroft M, Keating E, Revie C, Treanor D. Quantitative assessment of H&E staining for pathology: development and clinical evaluation of a novel system. Diagn Pathol 2024; 19:42. [PMID: 38395890 PMCID: PMC10885446 DOI: 10.1186/s13000-024-01461-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 02/03/2024] [Indexed: 02/25/2024] Open
Abstract
BACKGROUND Staining tissue samples to visualise cellular detail and tissue structure is at the core of pathology diagnosis, but variations in staining can result in significantly different appearances of the tissue sample. While the human visual system is adept at compensating for stain variation, with the growth of digital imaging in pathology, the impact of this variation can be more profound. Despite the ubiquity of haematoxylin and eosin staining in clinical practice worldwide, objective quantification is not yet available. We propose a method for quantitative haematoxylin and eosin stain assessment to facilitate quality assurance of histopathology staining, enabling truly quantitative quality control and improved standardisation. METHODS The stain quantification method comprises conventional microscope slides with a stain-responsive biopolymer film affixed to one side, called stain assessment slides. The stain assessment slides were characterised with haematoxylin and eosin, and implemented in one clinical laboratory to quantify variation levels. RESULTS Stain assessment slide stain uptake increased linearly with duration of haematoxylin and eosin staining (r = 0.99), and demonstrated linearly comparable staining to samples of human liver tissue (r values 0.98-0.99). Laboratory implementation of this technique quantified intra- and inter-instrument variation of staining instruments at one point in time and across a five-day period. CONCLUSION The proposed method has been shown to reliably quantify stain uptake, providing an effective laboratory quality control method for stain variation. This is especially important for whole slide imaging and the future development of artificial intelligence in digital pathology.
Collapse
Affiliation(s)
- Catriona Dunn
- National Pathology Imaging Co-operative, Leeds Teaching Hospitals NHS Trust, Leeds, UK.
- Department of Pathology and Data Analytics, University of Leeds, Leeds, UK.
| | - David Brettle
- National Pathology Imaging Co-operative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - Martin Cockroft
- New Technology Group, Futamura Chemical UK Limited, Wigton, UK
| | | | | | - Darren Treanor
- National Pathology Imaging Co-operative, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Department of Pathology and Data Analytics, University of Leeds, Leeds, UK
- Department of Histopathology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Department of Clinical Pathology and Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Centre for Medical Image Science and Visualisation, Linköping University, Linköping, Sweden
| |
Collapse
|
15
|
Schwen LO, Kiehl TR, Carvalho R, Zerbe N, Homeyer A. Digitization of Pathology Labs: A Review of Lessons Learned. J Transl Med 2023; 103:100244. [PMID: 37657651 DOI: 10.1016/j.labinv.2023.100244] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 07/18/2023] [Accepted: 08/23/2023] [Indexed: 09/03/2023] Open
Abstract
Pathology laboratories are increasingly using digital workflows. This has the potential of increasing laboratory efficiency, but the digitization process also involves major challenges. Several reports have been published describing the individual experiences of specific laboratories with the digitization process. However, a comprehensive overview of the lessons learned is still lacking. We provide an overview of the lessons learned for different aspects of the digitization process, including digital case management, digital slide reading, and computer-aided slide reading. We also cover metrics used for monitoring performance and pitfalls and corresponding values observed in practice. The overview is intended to help pathologists, information technology decision makers, and administrators to benefit from the experiences of others and to implement the digitization process in an optimal way to make their own laboratory future-proof.
Collapse
Affiliation(s)
- Lars Ole Schwen
- Fraunhofer Institute for Digital Medicine MEVIS, Bremen, Germany.
| | - Tim-Rasmus Kiehl
- Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Institute of Pathology, Berlin, Germany
| | - Rita Carvalho
- Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Institute of Pathology, Berlin, Germany
| | - Norman Zerbe
- Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Institute of Pathology, Berlin, Germany
| | - André Homeyer
- Fraunhofer Institute for Digital Medicine MEVIS, Bremen, Germany
| |
Collapse
|
16
|
Irmakci I, Nateghi R, Zhou R, Ross AE, Yang XJ, Cooper LAD, Goldstein JA. Tissue contamination challenges the credibility of machine learning models in real world digital pathology. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.04.28.23289287. [PMID: 37205404 PMCID: PMC10187357 DOI: 10.1101/2023.04.28.23289287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Machine learning (ML) models are poised to transform surgical pathology practice. The most successful use attention mechanisms to examine whole slides, identify which areas of tissue are diagnostic, and use them to guide diagnosis. Tissue contaminants, such as floaters, represent unexpected tissue. While human pathologists are extensively trained to consider and detect tissue contaminants, we examined their impact on ML models. We trained 4 whole slide models. Three operate in placenta for 1) detection of decidual arteriopathy (DA), 2) estimation of gestational age (GA), and 3) classification of macroscopic placental lesions. We also developed a model to detect prostate cancer in needle biopsies. We designed experiments wherein patches of contaminant tissue are randomly sampled from known slides and digitally added to patient slides and measured model performance. We measured the proportion of attention given to contaminants and examined the impact of contaminants in T-distributed Stochastic Neighbor Embedding (tSNE) feature space. Every model showed performance degradation in response to one or more tissue contaminants. DA detection balanced accuracy decreased from 0.74 to 0.69 +/- 0.01 with addition of 1 patch of prostate tissue for every 100 patches of placenta (1% contaminant). Bladder, added at 10% contaminant raised the mean absolute error in estimating gestation age from 1.626 weeks to 2.371 +/ 0.003 weeks. Blood, incorporated into placental sections, induced false negative diagnoses of intervillous thrombi. Addition of bladder to prostate cancer needle biopsies induced false positives, a selection of high-attention patches, representing 0.033mm2, resulted in a 97% false positive rate when added to needle biopsies. Contaminant patches received attention at or above the rate of the average patch of patient tissue. Tissue contaminants induce errors in modern ML models. The high level of attention given to contaminants indicates a failure to encode biological phenomena. Practitioners should move to quantify and ameliorate this problem.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Jeffery A. Goldstein
- To whom correspondence should be addressed: Olson 2-455, 710 N. Fairbanks Ave, Chicago IL, 60611,
| |
Collapse
|
17
|
Albahra S, Gorbett T, Robertson S, D'Aleo G, Kumar SVS, Ockunzzi S, Lallo D, Hu B, Rashidi HH. Artificial intelligence and machine learning overview in pathology & laboratory medicine: A general review of data preprocessing and basic supervised concepts. Semin Diagn Pathol 2023; 40:71-87. [PMID: 36870825 DOI: 10.1053/j.semdp.2023.02.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 02/10/2023] [Accepted: 02/14/2023] [Indexed: 02/17/2023]
Abstract
Machine learning (ML) is becoming an integral aspect of several domains in medicine. Yet, most pathologists and laboratory professionals remain unfamiliar with such tools and are unprepared for their inevitable integration. To bridge this knowledge gap, we present an overview of key elements within this emerging data science discipline. First, we will cover general, well-established concepts within ML, such as data type concepts, data preprocessing methods, and ML study design. We will describe common supervised and unsupervised learning algorithms and their associated common machine learning terms (provided within a comprehensive glossary of terms that are discussed within this review). Overall, this review will offer a broad overview of the key concepts and algorithms in machine learning, with a focus on pathology and laboratory medicine. The objective is to provide an updated useful reference for those new to this field or those who require a refresher.
Collapse
Affiliation(s)
- Samer Albahra
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States.
| | - Tom Gorbett
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Scott Robertson
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Giana D'Aleo
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Sushasree Vasudevan Suseel Kumar
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Samuel Ockunzzi
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Daniel Lallo
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Bo Hu
- Department of Quantitative Health Sciences, Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States
| | - Hooman H Rashidi
- Pathology and Laboratory Medicine Institute (PLMI), Cleveland Clinic, Cleveland, OH, United States; PLMI's Center for Artificial Intelligence & Data Science, Cleveland Clinic, Cleveland, OH, United States.
| |
Collapse
|
18
|
Zhou Y, Koyuncu C, Lu C, Grobholz R, Katz I, Madabhushi A, Janowczyk A. Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer. Med Image Anal 2023; 84:102702. [PMID: 36516556 PMCID: PMC9825103 DOI: 10.1016/j.media.2022.102702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 11/09/2022] [Accepted: 11/21/2022] [Indexed: 11/25/2022]
Abstract
Although deep learning (DL) has demonstrated impressive diagnostic performance for a variety of computational pathology tasks, this performance often markedly deteriorates on whole slide images (WSI) generated at external test sites. This phenomenon is due in part to domain shift, wherein differences in test-site pre-analytical variables (e.g., slide scanner, staining procedure) result in WSI with notably different visual presentations compared to training data. To ameliorate pre-analytic variances, approaches such as CycleGAN can be used to calibrate visual properties of images between sites, with the intent of improving DL classifier generalizability. In this work, we present a new approach termed Multi-Site Cross-Organ Calibration based Deep Learning (MuSClD) that employs WSIs of an off-target organ for calibration created at the same site as the on-target organ, based off the assumption that cross-organ slides are subjected to a common set of pre-analytical sources of variance. We demonstrate that by using an off-target organ from the test site to calibrate training data, the domain shift between training and testing data can be mitigated. Importantly, this strategy uniquely guards against potential data leakage introduced during calibration, wherein information only available in the testing data is imparted on the training data. We evaluate MuSClD in the context of the automated diagnosis of non-melanoma skin cancer (NMSC). Specifically, we evaluated MuSClD for identifying and distinguishing (a) basal cell carcinoma (BCC), (b) in-situ squamous cell carcinomas (SCC-In Situ), and (c) invasive squamous cell carcinomas (SCC-Invasive), using an Australian (training, n = 85) and a Swiss (held-out testing, n = 352) cohort. Our experiments reveal that MuSCID reduces the Wasserstein distances between sites in terms of color, contrast, and brightness metrics, without imparting noticeable artifacts to training data. The NMSC-subtyping performance is statistically improved as a result of MuSCID in terms of one-vs. rest AUC: BCC (0.92 vs 0.87, p = 0.01), SCC-In Situ (0.87 vs 0.73, p = 0.15) and SCC-Invasive (0.92 vs 0.82, p = 1e-5). Compared to baseline NMSC-subtyping with no calibration, the internal validation results of MuSClD (BCC (0.98), SCC-In Situ (0.92), and SCC-Invasive (0.97)) suggest that while domain shift indeed degrades classification performance, our on-target calibration using off-target tissue can safely compensate for pre-analytical variabilities, while improving the robustness of the model.
Collapse
Affiliation(s)
- Yufei Zhou
- Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH, USA
| | - Can Koyuncu
- Department of Biomedical Engineering, Emory University and Georgia Institute of Technology, Atlanta, GA, USA,Louis Stokes Cleveland Veterans Administration Medical Center, Cleveland, USA
| | - Cheng Lu
- Department of Biomedical Engineering, Emory University and Georgia Institute of Technology, Atlanta, GA, USA
| | - Rainer Grobholz
- Institute of Pathology, Cantonal Hospital Aarau, Aarau, Switzerland,Medical Faculty University of Zurich, Zurich, Switzerland
| | - Ian Katz
- Southern Sun Pathology, Sydney, NSW, Australia,University of Queensland, Brisbane, Qld, Australia
| | - Anant Madabhushi
- Department of Biomedical Engineering, Emory University and Georgia Institute of Technology, Atlanta, GA, USA; Atlanta VA Medical Center, Atlanta, USA.
| | - Andrew Janowczyk
- Department of Biomedical Engineering, Emory University and Georgia Institute of Technology, Atlanta, GA, USA,Department of Oncology, Lausanne University Hospital,Department of Diagnostics, Division of Clinical Pathology, Geneva University Hospitals
| |
Collapse
|
19
|
Patil A, Diwakar H, Sawant J, Kurian NC, Yadav S, Rane S, Bameta T, Sethi A. Efficient quality control of whole slide pathology images with human-in-the-loop training. J Pathol Inform 2023; 14:100306. [PMID: 37089617 PMCID: PMC10113897 DOI: 10.1016/j.jpi.2023.100306] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 03/11/2023] [Accepted: 03/13/2023] [Indexed: 04/25/2023] Open
Abstract
Histopathology whole slide images (WSIs) are being widely used to develop deep learning-based diagnostic solutions, especially for precision oncology. Most of these diagnostic softwares are vulnerable to biases and impurities in the training and test data which can lead to inaccurate diagnoses. For instance, WSIs contain multiple types of tissue regions, at least some of which might not be relevant to the diagnosis. We introduce HistoROI, a robust yet lightweight deep learning-based classifier to segregate WSI into 6 broad tissue regions-epithelium, stroma, lymphocytes, adipose, artifacts, and miscellaneous. HistoROI is trained using a novel human in-the-loop and active learning paradigm that ensures variations in training data for labeling efficient generalization. HistoROI consistently performs well across multiple organs, despite being trained on only a single dataset, demonstrating strong generalization. Further, we have examined the utility of HistoROI in improving the performance of downstream deep learning-based tasks using the CAMELYON breast cancer lymph node and TCGA lung cancer datasets. For the former dataset, the area under the receiver operating characteristic curve (AUC) for metastasis versus normal tissue of a neural network trained using weakly supervised learning increased from 0.88 to 0.92 by filtering the data using HistoROI. Similarly, the AUC increased from 0.88 to 0.93 for the classification between adenocarcinoma and squamous cell carcinoma on the lung cancer dataset. We also found that the performance of the HistoROI improves upon HistoQC for artifact detection on a test dataset of 93 annotated WSIs. The limitations of the proposed model are analyzed, and potential extensions are also discussed.
Collapse
Affiliation(s)
| | - Harsh Diwakar
- Indian Institute of Technology, Bombay, Mumbai, India
| | - Jay Sawant
- Indian Institute of Technology, Bombay, Mumbai, India
| | | | - Subhash Yadav
- Tata Memorial Centre - ACTREC, HBNI, Navi Mumbai, India
| | - Swapnil Rane
- Tata Memorial Centre - ACTREC, HBNI, Navi Mumbai, India
| | - Tripti Bameta
- Tata Memorial Centre - ACTREC, HBNI, Navi Mumbai, India
| | - Amit Sethi
- Indian Institute of Technology, Bombay, Mumbai, India
| |
Collapse
|
20
|
Michielli N, Caputo A, Scotto M, Mogetta A, Pennisi OAM, Molinari F, Balmativola D, Bosco M, Gambella A, Metovic J, Tota D, Carpenito L, Gasparri P, Salvi M. Stain normalization in digital pathology: Clinical multi-center evaluation of image quality. J Pathol Inform 2022; 13:100145. [PMID: 36268060 PMCID: PMC9577129 DOI: 10.1016/j.jpi.2022.100145] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 09/14/2022] [Accepted: 09/22/2022] [Indexed: 11/20/2022] Open
Abstract
In digital pathology, the final appearance of digitized images is affected by several factors, resulting in stain color and intensity variation. Stain normalization is an innovative solution to overcome stain variability. However, the validation of color normalization tools has been assessed only from a quantitative perspective, through the computation of similarity metrics between the original and normalized images. To the best of our knowledge, no works investigate the impact of normalization on the pathologist's evaluation. The objective of this paper is to propose a multi-tissue (i.e., breast, colon, liver, lung, and prostate) and multi-center qualitative analysis of a stain normalization tool with the involvement of pathologists with different years of experience. Two qualitative studies were carried out for this purpose: (i) a first study focused on the analysis of the perceived image quality and absence of significant image artifacts after the normalization process; (ii) a second study focused on the clinical score of the normalized image with respect to the original one. The results of the first study prove the high quality of the normalized image with a low impact artifact generation, while the second study demonstrates the superiority of the normalized image with respect to the original one in clinical practice. The normalization process can help both to reduce variability due to tissue staining procedures and facilitate the pathologist in the histological examination. The experimental results obtained in this work are encouraging and can justify the use of a stain normalization tool in clinical routine.
Collapse
Affiliation(s)
- Nicola Michielli
- Biolab, PolitoMed Lab, Department of Electronics and Telecommunications, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
| | - Alessandro Caputo
- Department of Medicine and Surgery, University Hospital of Salerno, Salerno, Italy
| | - Manuela Scotto
- Biolab, PolitoMed Lab, Department of Electronics and Telecommunications, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
| | - Alessandro Mogetta
- Biolab, PolitoMed Lab, Department of Electronics and Telecommunications, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
| | - Orazio Antonino Maria Pennisi
- Technology Transfer and Industrial Liaison Department, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
| | - Filippo Molinari
- Biolab, PolitoMed Lab, Department of Electronics and Telecommunications, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
| | - Davide Balmativola
- Pathology Unit, Humanitas Gradenigo Hospital, Corso Regina Margherita 8, 10153 Turin, Italy
| | - Martino Bosco
- Department of Pathology, Michele and Pietro Ferrero Hospital, 12060 Verduno, Italy
| | - Alessandro Gambella
- Pathology Unit, Department of Medical Sciences, University of Turin, Via Santena 7, 10126 Turin, Italy
| | - Jasna Metovic
- Pathology Unit, Department of Medical Sciences, University of Turin, Via Santena 7, 10126 Turin, Italy
| | - Daniele Tota
- Pathology Unit, Department of Medical Sciences, University of Turin, Via Santena 7, 10126 Turin, Italy
| | - Laura Carpenito
- Department of Pathology, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
- University of Milan, Milan, Italy
| | - Paolo Gasparri
- UOC di Anatomia Patologica, ASP Catania P.O. “Gravina”, Caltagirone, Italy
| | - Massimo Salvi
- Biolab, PolitoMed Lab, Department of Electronics and Telecommunications, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
| |
Collapse
|
21
|
A multi-view deep learning model for pathology image diagnosis. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03918-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
22
|
Broad A, Wright AI, de Kamps M, Treanor D. Attention-guided sampling for colorectal cancer analysis with digital pathology. J Pathol Inform 2022; 13:100110. [PMID: 36268074 PMCID: PMC9577057 DOI: 10.1016/j.jpi.2022.100110] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 05/27/2022] [Accepted: 05/31/2022] [Indexed: 11/25/2022] Open
Abstract
Improvements to patient care through the development of automated image analysis in pathology are restricted by the small image patch size that can be processed by convolutional neural networks (CNNs), when compared to the whole-slide image (WSI). Tile-by-tile processing across the entire WSI is slow and inefficient. While this may improve with future computing power, the technique remains vulnerable to noise from uninformative image areas. We propose a novel attention-inspired algorithm that selects image patches from informative parts of the WSI, first using a sparse randomised grid pattern, then iteratively re-sampling at higher density in regions where a CNN classifies patches as tumour. Subsequent uniform sampling across the enclosing region of interest (ROI) is used to mitigate sampling bias. Benchmarking tests informed the adoption of VGG19 as the main CNN architecture, with 79% classification accuracy. A further CNN was trained to separate false-positive normal epithelium from tumour epithelium, in a novel adaptation of a two-stage model used in brain imaging. These subsystems were combined in a processing pipeline to generate spatial distributions of classified patches from unseen WSIs. The ROI was predicted with a mean F1 (Dice) score of 86.6% over 100 evaluation WSIs. Several algorithms for evaluating tumour–stroma ratio (TSR) within the ROI were compared, giving a lowest root mean square (RMS) error of 11.3% relative to pathologists’ annotations, against 13.5% for an equivalent tile-by-tile pipeline. Our pipeline processed WSIs between 3.3x and 6.3x faster than tile-by-tile processing. We propose our attention-based sampling pipeline as a useful tool for pathology researchers, with the further potential for incorporating additional diagnostic calculations.
Collapse
|
23
|
Wahab N, Miligy IM, Dodd K, Sahota H, Toss M, Lu W, Jahanifar M, Bilal M, Graham S, Park Y, Hadjigeorghiou G, Bhalerao A, Lashen AG, Ibrahim AY, Katayama A, Ebili HO, Parkin M, Sorell T, Raza SEA, Hero E, Eldaly H, Tsang YW, Gopalakrishnan K, Snead D, Rakha E, Rajpoot N, Minhas F. Semantic annotation for computational pathology: multidisciplinary experience and best practice recommendations. J Pathol Clin Res 2022; 8:116-128. [PMID: 35014198 PMCID: PMC8822374 DOI: 10.1002/cjp2.256] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Revised: 11/25/2021] [Accepted: 12/10/2021] [Indexed: 02/06/2023]
Abstract
Recent advances in whole-slide imaging (WSI) technology have led to the development of a myriad of computer vision and artificial intelligence-based diagnostic, prognostic, and predictive algorithms. Computational Pathology (CPath) offers an integrated solution to utilise information embedded in pathology WSIs beyond what can be obtained through visual assessment. For automated analysis of WSIs and validation of machine learning (ML) models, annotations at the slide, tissue, and cellular levels are required. The annotation of important visual constructs in pathology images is an important component of CPath projects. Improper annotations can result in algorithms that are hard to interpret and can potentially produce inaccurate and inconsistent results. Despite the crucial role of annotations in CPath projects, there are no well-defined guidelines or best practices on how annotations should be carried out. In this paper, we address this shortcoming by presenting the experience and best practices acquired during the execution of a large-scale annotation exercise involving a multidisciplinary team of pathologists, ML experts, and researchers as part of the Pathology image data Lake for Analytics, Knowledge and Education (PathLAKE) consortium. We present a real-world case study along with examples of different types of annotations, diagnostic algorithm, annotation data dictionary, and annotation constructs. The analyses reported in this work highlight best practice recommendations that can be used as annotation guidelines over the lifecycle of a CPath project.
Collapse
Affiliation(s)
- Noorul Wahab
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | - Islam M Miligy
- PathologyUniversity of NottinghamNottinghamUK
- Department of Pathology, Faculty of MedicineMenoufia UniversityShebin El‐KomEgypt
| | - Katherine Dodd
- HistopathologyUniversity Hospital Coventry and WarwickshireCoventryUK
| | - Harvir Sahota
- HistopathologyUniversity Hospital Coventry and WarwickshireCoventryUK
| | | | - Wenqi Lu
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | | | - Mohsin Bilal
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | - Simon Graham
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | - Young Park
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | | | - Abhir Bhalerao
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | | | | | - Ayaka Katayama
- Graduate School of MedicineGunma UniversityMaebashiJapan
| | | | | | - Tom Sorell
- Department of Politics and International StudiesUniversity of WarwickCoventryUK
| | | | - Emily Hero
- HistopathologyUniversity Hospital Coventry and WarwickshireCoventryUK
- Leicester Royal Infirmary, HistopathologyUniversity Hospitals LeicesterLeicesterUK
| | - Hesham Eldaly
- HistopathologyUniversity Hospital Coventry and WarwickshireCoventryUK
| | - Yee Wah Tsang
- HistopathologyUniversity Hospital Coventry and WarwickshireCoventryUK
| | | | - David Snead
- HistopathologyUniversity Hospital Coventry and WarwickshireCoventryUK
| | - Emad Rakha
- PathologyUniversity of NottinghamNottinghamUK
| | - Nasir Rajpoot
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| | - Fayyaz Minhas
- Tissue Image Analytics CentreUniversity of WarwickCoventryUK
| |
Collapse
|
24
|
McGenity C, Wright A, Treanor D. AIM in Surgical Pathology. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
25
|
Atallah NM, Toss MS, Verrill C, Salto-Tellez M, Snead D, Rakha EA. Potential quality pitfalls of digitalized whole slide image of breast pathology in routine practice. Mod Pathol 2022; 35:903-910. [PMID: 34961765 PMCID: PMC8711290 DOI: 10.1038/s41379-021-01000-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 12/11/2021] [Accepted: 12/12/2021] [Indexed: 12/26/2022]
Abstract
Using digitalized whole slide images (WSI) in routine histopathology practice is a revolutionary technology. This study aims to assess the clinical impacts of WSI quality and representation of the corresponding glass slides. 40,160 breast WSIs were examined and compared with their corresponding glass slides. The presence, frequency, location, tissue type, and the clinical impacts of missing tissue were assessed. Scanning time, type of the specimens, time to WSIs implementation, and quality control (QC) measures were also considered. The frequency of missing tissue ranged from 2% to 19%. The area size of the missed tissue ranged from 1-70%. In most cases (>75%), the missing tissue area size was <10% and peripherally located. In all cases the missed tissue was fat with or without small entrapped normal breast parenchyma. No missing tissue was identified in WSIs of the core biopsy specimens. QC measures improved images quality and reduced WSI failure rates by seven-fold. A negative linear correlation between the frequency of missing tissue and both the scanning time and the image file size was observed (p < 0.05). None of the WSI with missing tissues resulted in a change in the final diagnosis. Missing tissue on breast WSI is observed but with variable frequency and little diagnostic consequence. Balancing between WSI quality and scanning time/image file size should be considered and pathology laboratories should undertake their own assessments of risk and provide the relevant mitigations with the appropriate level of caution.
Collapse
Affiliation(s)
- Nehal M. Atallah
- grid.4563.40000 0004 1936 8868Department of Histopathology, School of Medicine, the University of Nottingham and Nottingham University Hospitals NHS Trust, Nottingham, UK ,grid.411775.10000 0004 0621 4712Department of Pathology, Faculty of Medicine, Menoufia University, Shebin Elkom, Al-Menoufia, Egypt
| | - Michael S. Toss
- grid.4563.40000 0004 1936 8868Department of Histopathology, School of Medicine, the University of Nottingham and Nottingham University Hospitals NHS Trust, Nottingham, UK
| | - Clare Verrill
- grid.4991.50000 0004 1936 8948Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK ,grid.4991.50000 0004 1936 8948NIHR Oxford Biomedical Research Centre, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Manuel Salto-Tellez
- grid.4777.30000 0004 0374 7521Precision Medicine Centre of Excellence, The Patrick G Johnston Centre for Cancer Research, Queen’s University, Belfast, UK
| | - David Snead
- grid.15628.380000 0004 0393 1193Cellular Pathology, University Hospitals Coventry and Warwickshire NHS Trust, Coventry, Coventry, UK
| | - Emad A. Rakha
- grid.4563.40000 0004 1936 8868Department of Histopathology, School of Medicine, the University of Nottingham and Nottingham University Hospitals NHS Trust, Nottingham, UK ,grid.411775.10000 0004 0621 4712Department of Pathology, Faculty of Medicine, Menoufia University, Shebin Elkom, Al-Menoufia, Egypt
| |
Collapse
|
26
|
Patel AU, Shaker N, Erck S, Kellough DA, Palermini E, Li Z, Lujan G, Satturwar S, Parwani AV. Types and frequency of whole slide imaging scan failures in a clinical high throughput digital pathology scanning laboratory. J Pathol Inform 2022; 13:100112. [PMID: 36268081 PMCID: PMC9577040 DOI: 10.1016/j.jpi.2022.100112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 06/20/2022] [Accepted: 06/21/2022] [Indexed: 11/18/2022] Open
Abstract
Digital workflow transformation continues to sweep throughout a diversity of pathology departments spanning the globe following catalyzation of whole slide imaging (WSI) adoption by the SARS-CoV-2 (COVID-19) pandemic. The utility of WSI for a litany of use cases including primary diagnosis has been emphasized during this period, with WSI scanning devices gaining the approval of healthcare regulatory bodies and practitioners alike for clinical applications following extensive validatory efforts. As successful validation for WSI is predicated upon pathologist diagnostic interpretability of digital images with high glass slide concordance, departmental adoption of WSI is tantamount to the reliability of such images often predicated upon quality assessment notwithstanding image interpretability but extending to quality of practice following WSI adoption. Metrics of importance within this context include failure rates inclusive of different scanning errors that result in poor image quality and the potential such errors may incur upon departmental turnaround time (TAT). We sought to evaluate the impact of WSI implementation through retrospective evaluation of scan failure frequency in archival versus newly prepared slides, types of scanning error, and impact upon TAT following commencement of live WSI operation in May 2017 until the present period within a fully digitized high-volume academic institution. A 1.19% scan failure incidence rate was recorded during this period, with re-scanning requested and successfully executed for 1.19% of cases during the reported period of January 2019 until present. No significant impact upon TAT was deduced, suggesting an outcome which may be encouraging for departments considering digital workflow adoption.
Collapse
|
27
|
A Transfer Learning Architecture Based on a Support Vector Machine for Histopathology Image Classification. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11146380] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Recently, digital pathology is an essential application for clinical practice and medical research. Due to the lack of large annotated datasets, the deep transfer learning technique is often used to classify histopathology images. A softmax classifier is often used to perform classification tasks. Besides, a Support Vector Machine (SVM) classifier is also popularly employed, especially for binary classification problems. Accurately determining the category of the histopathology images is vital for the diagnosis of diseases. In this paper, the conventional softmax classifier and the SVM classifier-based transfer learning approach are evaluated to classify histopathology cancer images in a binary breast cancer dataset and a multiclass lung and colon cancer dataset. In order to achieve better classification accuracy, a methodology that attaches SVM classifier to the fully-connected (FC) layer of the softmax-based transfer learning model is proposed. The proposed architecture involves a first step training the newly added FC layer on the target dataset using the softmax-based model and a second step training the SVM classifier with the newly trained FC layer. Cross-validation is used to ensure no bias for the evaluation of the performance of the models. Experimental results reveal that the conventional SVM classifier-based model is the least accurate on either binary or multiclass cancer datasets. The conventional softmax-based model shows moderate classification accuracy, while the proposed synthetic architecture achieves the best classification accuracy.
Collapse
|
28
|
AIM in Surgical Pathology. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-58080-3_278-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|