1
|
Hallaj S, Chuter BG, Lieu AC, Singh P, Kalpathy-Cramer J, Xu BY, Christopher M, Zangwill LM, Weinreb RN, Baxter SL. Federated Learning in Glaucoma: A Comprehensive Review and Future Perspectives. Ophthalmol Glaucoma 2025; 8:92-105. [PMID: 39214457 PMCID: PMC11911940 DOI: 10.1016/j.ogla.2024.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 08/20/2024] [Accepted: 08/23/2024] [Indexed: 09/04/2024]
Abstract
CLINICAL RELEVANCE Glaucoma is a complex eye condition with varied morphological and clinical presentations, making diagnosis and management challenging. The lack of a consensus definition for glaucoma or glaucomatous optic neuropathy further complicates the development of universal diagnostic tools. Developing robust artificial intelligence (AI) models for glaucoma screening is essential for early detection and treatment but faces significant obstacles. Effective deep learning algorithms require large, well-curated datasets from diverse patient populations and imaging protocols. However, creating centralized data repositories is hindered by concerns over data sharing, patient privacy, regulatory compliance, and intellectual property. Federated Learning (FL) offers a potential solution by enabling data to remain locally hosted while facilitating distributed model training across multiple sites. METHODS A comprehensive literature review was conducted on the application of Federated Learning in training AI models for glaucoma screening. Publications from 1950 to 2024 were searched using databases such as PubMed and IEEE Xplore with keywords including "glaucoma," "federated learning," "artificial intelligence," "deep learning," "machine learning," "distributed learning," "privacy-preserving," "data sharing," "medical imaging," and "ophthalmology." Articles were included if they discussed the use of FL in glaucoma-related AI tasks or addressed data sharing and privacy challenges in ophthalmic AI development. RESULTS FL enables collaborative model development without centralizing sensitive patient data, addressing privacy and regulatory concerns. Studies show that FL can improve model performance and generalizability by leveraging diverse datasets while maintaining data security. FL models have achieved comparable or superior accuracy to those trained on centralized data, demonstrating effectiveness in real-world clinical settings. CONCLUSIONS Federated Learning presents a promising strategy to overcome current obstacles in developing AI models for glaucoma screening. By balancing the need for extensive, diverse training data with the imperative to protect patient privacy and comply with regulations, FL facilitates collaborative model training without compromising data security. This approach offers a pathway toward more accurate and generalizable AI solutions for glaucoma detection and management. FINANCIAL DISCLOSURE(S) Proprietary or commercial disclosure may be found after the references in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Shahin Hallaj
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California; Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California
| | - Benton G Chuter
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California; Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California
| | - Alexander C Lieu
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California; Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California
| | - Praveer Singh
- Division of Artificial Medical Intelligence, Department of Ophthalmology, University of Colorado School of Medicine, Aurora, Colorado
| | - Jayashree Kalpathy-Cramer
- Division of Artificial Medical Intelligence, Department of Ophthalmology, University of Colorado School of Medicine, Aurora, Colorado
| | - Benjamin Y Xu
- Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Mark Christopher
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California
| | - Linda M Zangwill
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California
| | - Robert N Weinreb
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California
| | - Sally L Baxter
- Division of Ophthalmology Informatics and Data Science, Hamilton Glaucoma Center, Shiley Eye Institute, Viterbi Family Department of Ophthalmology, University of California, San Diego, La Jolla, California; Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California.
| |
Collapse
|
2
|
Liu H, Huang J, Jia D, Wang Q, Xu J, Shen D. Transferring Adult-Like Phase Images for Robust Multi-View Isointense Infant Brain Segmentation. IEEE TRANSACTIONS ON MEDICAL IMAGING 2025; 44:69-78. [PMID: 39024078 DOI: 10.1109/tmi.2024.3430348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Accurate tissue segmentation of infant brain in magnetic resonance (MR) images is crucial for charting early brain development and identifying biomarkers. Due to ongoing myelination and maturation, in the isointense phase (6-9 months of age), the gray and white matters of infant brain exhibit similar intensity levels in MR images, posing significant challenges for tissue segmentation. Meanwhile, in the adult-like phase around 12 months of age, the MR images show high tissue contrast and can be easily segmented. In this paper, we propose to effectively exploit adult-like phase images to achieve robust multi-view isointense infant brain segmentation. Specifically, in one way, we transfer adult-like phase images to the isointense view, which have similar tissue contrast as the isointense phase images, and use the transferred images to train an isointense-view segmentation network. On the other way, we transfer isointense phase images to the adult-like view, which have enhanced tissue contrast, for training a segmentation network in the adult-like view. The segmentation networks of different views form a multi-path architecture that performs multi-view learning to further boost the segmentation performance. Since anatomy-preserving style transfer is key to the downstream segmentation task, we develop a Disentangled Cycle-consistent Adversarial Network (DCAN) with strong regularization terms to accurately transfer realistic tissue contrast between isointense and adult-like phase images while still maintaining their structural consistency. Experiments on both NDAR and iSeg-2019 datasets demonstrate a significant superior performance of our method over the state-of-the-art methods.
Collapse
|
3
|
Yanzhen M, Song C, Wanping L, Zufang Y, Wang A. Exploring approaches to tackle cross-domain challenges in brain medical image segmentation: a systematic review. Front Neurosci 2024; 18:1401329. [PMID: 38948927 PMCID: PMC11211279 DOI: 10.3389/fnins.2024.1401329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 05/28/2024] [Indexed: 07/02/2024] Open
Abstract
Introduction Brain medical image segmentation is a critical task in medical image processing, playing a significant role in the prediction and diagnosis of diseases such as stroke, Alzheimer's disease, and brain tumors. However, substantial distribution discrepancies among datasets from different sources arise due to the large inter-site discrepancy among different scanners, imaging protocols, and populations. This leads to cross-domain problems in practical applications. In recent years, numerous studies have been conducted to address the cross-domain problem in brain image segmentation. Methods This review adheres to the standards of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) for data processing and analysis. We retrieved relevant papers from PubMed, Web of Science, and IEEE databases from January 2018 to December 2023, extracting information about the medical domain, imaging modalities, methods for addressing cross-domain issues, experimental designs, and datasets from the selected papers. Moreover, we compared the performance of methods in stroke lesion segmentation, white matter segmentation and brain tumor segmentation. Results A total of 71 studies were included and analyzed in this review. The methods for tackling the cross-domain problem include Transfer Learning, Normalization, Unsupervised Learning, Transformer models, and Convolutional Neural Networks (CNNs). On the ATLAS dataset, domain-adaptive methods showed an overall improvement of ~3 percent in stroke lesion segmentation tasks compared to non-adaptive methods. However, given the diversity of datasets and experimental methodologies in current studies based on the methods for white matter segmentation tasks in MICCAI 2017 and those for brain tumor segmentation tasks in BraTS, it is challenging to intuitively compare the strengths and weaknesses of these methods. Conclusion Although various techniques have been applied to address the cross-domain problem in brain image segmentation, there is currently a lack of unified dataset collections and experimental standards. For instance, many studies are still based on n-fold cross-validation, while methods directly based on cross-validation across sites or datasets are relatively scarce. Furthermore, due to the diverse types of medical images in the field of brain segmentation, it is not straightforward to make simple and intuitive comparisons of performance. These challenges need to be addressed in future research.
Collapse
Affiliation(s)
- Ming Yanzhen
- School of Artificial Intelligence Academy, Wuhan Technology and Business University, Wuhan, Hubei, China
- Institute of Information and Intelligent Engineering Applications, Wuhan Technology and Business University, Wuhan, Hubei, China
| | - Chen Song
- Wuhan Dobest Information Technology Co., Ltd, Hubei, China
| | - Li Wanping
- School of Artificial Intelligence Academy, Wuhan Technology and Business University, Wuhan, Hubei, China
- Institute of Information and Intelligent Engineering Applications, Wuhan Technology and Business University, Wuhan, Hubei, China
| | - Yang Zufang
- School of Artificial Intelligence Academy, Wuhan Technology and Business University, Wuhan, Hubei, China
- Institute of Information and Intelligent Engineering Applications, Wuhan Technology and Business University, Wuhan, Hubei, China
| | - Alan Wang
- Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand
- Medical Imaging Research Center, Faculty of Medical and Health Sciences, The University of Auckland, Auckland, New Zealand
- Centre for Co-Created Ageing Research, The University of Auckland, Auckland, New Zealand
- Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
4
|
Simarro J, Meyer MI, Van Eyndhoven S, Phan TV, Billiet T, Sima DM, Ortibus E. A deep learning model for brain segmentation across pediatric and adult populations. Sci Rep 2024; 14:11735. [PMID: 38778071 PMCID: PMC11111768 DOI: 10.1038/s41598-024-61798-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 05/09/2024] [Indexed: 05/25/2024] Open
Abstract
Automated quantification of brain tissues on MR images has greatly contributed to the diagnosis and follow-up of neurological pathologies across various life stages. However, existing solutions are specifically designed for certain age ranges, limiting their applicability in monitoring brain development from infancy to late adulthood. This retrospective study aims to develop and validate a brain segmentation model across pediatric and adult populations. First, we trained a deep learning model to segment tissues and brain structures using T1-weighted MR images from 390 patients (age range: 2-81 years) across four different datasets. Subsequently, the model was validated on a cohort of 280 patients from six distinct test datasets (age range: 4-90 years). In the initial experiment, the proposed deep learning-based pipeline, icobrain-dl, demonstrated segmentation accuracy comparable to both pediatric and adult-specific models across diverse age groups. Subsequently, we evaluated intra- and inter-scanner variability in measurements of various tissues and structures in both pediatric and adult populations computed by icobrain-dl. Results demonstrated significantly higher reproducibility compared to similar brain quantification tools, including childmetrix, FastSurfer, and the medical device icobrain v5.9 (p-value< 0.01). Finally, we explored the potential clinical applications of icobrain-dl measurements in diagnosing pediatric patients with Cerebral Visual Impairment and adult patients with Alzheimer's Disease.
Collapse
Affiliation(s)
- Jaime Simarro
- icometrix, Leuven, Belgium.
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium.
| | | | | | | | | | | | - Els Ortibus
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Pediatric Neurology, UZ Leuven, Leuven, Belgium
- Child and Youth Institute, KU Leuven, Leuven, Belgium
| |
Collapse
|
5
|
Wittens MMJ, Allemeersch GJ, Sima DM, Vanderhasselt T, Raeymaeckers S, Fransen E, Smeets D, de Mey J, Bjerke M, Engelborghs S. Towards validation in clinical routine: a comparative analysis of visual MTA ratings versus the automated ratio between inferior lateral ventricle and hippocampal volumes in Alzheimer's disease diagnosis. Neuroradiology 2024; 66:487-506. [PMID: 38240767 PMCID: PMC10937807 DOI: 10.1007/s00234-024-03280-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 12/28/2023] [Indexed: 03/14/2024]
Abstract
PURPOSE To assess the performance of the inferior lateral ventricle (ILV) to hippocampal (Hip) volume ratio on brain MRI, for Alzheimer's disease (AD) diagnostics, comparing it to individual automated ILV and hippocampal volumes, and visual medial temporal lobe atrophy (MTA) consensus ratings. METHODS One-hundred-twelve subjects (mean age ± SD, 66.85 ± 13.64 years) with varying degrees of cognitive decline underwent MRI using a Philips Ingenia 3T. The MTA scale by Scheltens, rated on coronal 3D T1-weighted images, was determined by three experienced radiologists, blinded to diagnosis and sex. Automated volumetry was computed by icobrain dm (v. 5.10) for total, left, right hippocampal, and ILV volumes. The ILV/Hip ratio, defined as the percentage ratio between ILV and hippocampal volumes, was calculated and compared against a normative reference population (n = 1903). Inter-rater agreement, association, classification accuracy, and clinical interpretability on patient level were reported. RESULTS Visual MTA scores showed excellent inter-rater agreement. Ordinal logistic regression and correlation analyses demonstrated robust associations between automated brain segmentations and visual MTA ratings, with the ILV/Hip ratio consistently outperforming individual hippocampal and ILV volumes. Pairwise classification accuracy showed good performance without statistically significant differences between the ILV/Hip ratio and visual MTA across disease stages, indicating potential interchangeability. Comparison to the normative population and clinical interpretability assessments showed commensurability in classifying MTA "severity" between visual MTA and ILV/Hip ratio measurements. CONCLUSION The ILV/Hip ratio shows the highest correlation to visual MTA, in comparison to automated individual ILV and hippocampal volumes, offering standardized measures for diagnostic support in different stages of cognitive decline.
Collapse
Affiliation(s)
- Mandy M J Wittens
- Dept. of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, 2610, Antwerp, Belgium
- Dept. of Neurology, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium
| | - Gert-Jan Allemeersch
- Dept. of Radiology, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium.
| | - Diana M Sima
- Icometrix, Kolonel Begaultlaan 1b, 3012, Leuven, Belgium
- AI Supported Modelling in Clinical Sciences (AIMS), Vrije Universiteit Brussel, Pleinlaan 2, 1050, Brussels, Belgium
| | - Tim Vanderhasselt
- Dept. of Radiology, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium
| | - Steven Raeymaeckers
- Dept. of Radiology, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium
| | - Erik Fransen
- StatUa Center for Statistics, University of Antwerp, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Dirk Smeets
- Icometrix, Kolonel Begaultlaan 1b, 3012, Leuven, Belgium
| | - Johan de Mey
- Dept. of Radiology, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium
| | - Maria Bjerke
- Dept. of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, 2610, Antwerp, Belgium
- NEUR (Neuroprotection & Neuromodulation), Center for Neurosciences (C4N), Vrije Universiteit Brussel (VUB), Av. du Laerbeek 101, 1090, Brussels, Belgium
- Laboratory of Neurochemistry, Dept. of Clinical Chemistry, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium
| | - Sebastiaan Engelborghs
- Dept. of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, 2610, Antwerp, Belgium
- Dept. of Neurology, Universitair Ziekenhuis Brussel (UZ Brussel), Av. du Laerbeek 101, 1090, Brussels, Belgium
- NEUR (Neuroprotection & Neuromodulation), Center for Neurosciences (C4N), Vrije Universiteit Brussel (VUB), Av. du Laerbeek 101, 1090, Brussels, Belgium
| |
Collapse
|
6
|
Vernikouskaya I, Müller HP, Roselli F, Ludolph AC, Kassubek J, Rasche V. AI-assisted quantification of hypothalamic atrophy in amyotrophic lateral sclerosis by convolutional neural network-based automatic segmentation. Sci Rep 2023; 13:21505. [PMID: 38057503 PMCID: PMC10700600 DOI: 10.1038/s41598-023-48649-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 11/29/2023] [Indexed: 12/08/2023] Open
Abstract
The hypothalamus is a small structure of the brain with an essential role in metabolic homeostasis, sleep regulation, and body temperature control. Some neurodegenerative diseases such as amyotrophic lateral sclerosis (ALS) and dementia syndromes are reported to be related to hypothalamic volume alterations. Despite its crucial role in human body regulation, neuroimaging studies of this structure are rather scarce due to work-intensive operator-dependent manual delineations from MRI and lack of automated segmentation tools. In this study we present a fully automatic approach based on deep convolutional neural networks (CNN) for hypothalamic segmentation and volume quantification. We applied CNN of U-Net architecture with EfficientNetB0 backbone to allow for accurate automatic hypothalamic segmentation in seconds on a GPU. We further applied our approach for the quantification of the normalized hypothalamic volumes to a large neuroimaging dataset of 432 ALS patients and 112 healthy controls (without the ground truth labels). Using the automated volumetric analysis, we could reproduce hypothalamic atrophy findings associated with ALS by detecting significant volume differences between ALS patients and controls at the group level. In conclusion, a fast and unbiased AI-assisted hypothalamic quantification method is introduced in this study (whose acceptance rate based on the outlier removal strategy was estimated to be above 95%) and made publicly available for researchers interested in the conduction of hypothalamus studies at a large scale.
Collapse
Affiliation(s)
- Ina Vernikouskaya
- Department of Internal Medicine II, Ulm University Medical Center, Albert-Einstein-Allee 23, 89081, Ulm, Germany.
| | | | - Francesco Roselli
- Department of Neurology, University of Ulm, Ulm, Germany
- German Center for Neurodegenerative Diseases (DZNE), Ulm, Germany
| | - Albert C Ludolph
- Department of Neurology, University of Ulm, Ulm, Germany
- German Center for Neurodegenerative Diseases (DZNE), Ulm, Germany
| | - Jan Kassubek
- Department of Neurology, University of Ulm, Ulm, Germany
- German Center for Neurodegenerative Diseases (DZNE), Ulm, Germany
| | - Volker Rasche
- Department of Internal Medicine II, Ulm University Medical Center, Albert-Einstein-Allee 23, 89081, Ulm, Germany
- Core Facility Small Animal MRI, University of Ulm, Ulm, Germany
| |
Collapse
|
7
|
Mendelsohn Z, Pemberton HG, Gray J, Goodkin O, Carrasco FP, Scheel M, Nawabi J, Barkhof F. Commercial volumetric MRI reporting tools in multiple sclerosis: a systematic review of the evidence. Neuroradiology 2023; 65:5-24. [PMID: 36331588 PMCID: PMC9816195 DOI: 10.1007/s00234-022-03074-w] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 09/29/2022] [Indexed: 11/06/2022]
Abstract
PURPOSE MRI is integral to the diagnosis of multiple sclerosis (MS) and is important for clinical prognostication. Quantitative volumetric reporting tools (QReports) can improve the accuracy and objectivity of MRI-based assessments. Several QReports are commercially available; however, validation can be difficult to establish and does not currently follow a common pathway. To aid evidence-based clinical decision-making, we performed a systematic review of commercial QReports for use in MS including technical details and published reports of validation and in-use evaluation. METHODS We categorized studies into three types of testing: technical validation, for example, comparison to manual segmentation, clinical validation by clinicians or interpretation of results alongside clinician-rated variables, and in-use evaluation, such as health economic assessment. RESULTS We identified 10 companies, which provide MS lesion and brain segmentation and volume quantification, and 38 relevant publications. Tools received regulatory approval between 2006 and 2020, contextualize results to normative reference populations, ranging from 620 to 8000 subjects, and require T1- and T2-FLAIR-weighted input sequences for longitudinal assessment of whole-brain volume and lesions. In MS, six QReports provided evidence of technical validation, four companies have conducted clinical validation by correlating results with clinical variables, only one has tested their QReport by clinician end-users, and one has performed a simulated in-use socioeconomic evaluation. CONCLUSION We conclude that there is limited evidence in the literature regarding clinical validation and in-use evaluation of commercial MS QReports with a particular lack of clinician end-user testing. Our systematic review provides clinicians and institutions with the available evidence when considering adopting a quantitative reporting tool for MS.
Collapse
Affiliation(s)
- Zoe Mendelsohn
- grid.83440.3b0000000121901201Neuroradiological Academic Unit, UCL Queen Square Institute of Neurology, University College London, London, UK ,grid.83440.3b0000000121901201Department of Medical Physics and Bioengineering, Centre for Medical Image Computing (CMIC), University College London, London, UK ,grid.83440.3b0000000121901201Department of Neuroinflammation, Queen Square Multiple Sclerosis Centre, UCL Institute of Neurology, University College London, London, UK ,grid.6363.00000 0001 2218 4662Department of Neuroradiology, Charité School of Medicine and University Hospital Berlin, Berlin, Germany ,grid.6363.00000 0001 2218 4662Department of Radiology, Charité School of Medicine and University Hospital Berlin, Berlin, Germany
| | - Hugh G. Pemberton
- grid.83440.3b0000000121901201Neuroradiological Academic Unit, UCL Queen Square Institute of Neurology, University College London, London, UK ,grid.83440.3b0000000121901201Department of Medical Physics and Bioengineering, Centre for Medical Image Computing (CMIC), University College London, London, UK ,grid.420685.d0000 0001 1940 6527GE Healthcare, Amersham, UK
| | - James Gray
- grid.416626.10000 0004 0391 2793Stepping Hill Hospital, NHS Foundation Trust, Stockport, UK
| | - Olivia Goodkin
- grid.83440.3b0000000121901201Neuroradiological Academic Unit, UCL Queen Square Institute of Neurology, University College London, London, UK ,grid.83440.3b0000000121901201Department of Medical Physics and Bioengineering, Centre for Medical Image Computing (CMIC), University College London, London, UK ,grid.83440.3b0000000121901201Department of Neuroinflammation, Queen Square Multiple Sclerosis Centre, UCL Institute of Neurology, University College London, London, UK
| | - Ferran Prados Carrasco
- grid.83440.3b0000000121901201Department of Medical Physics and Bioengineering, Centre for Medical Image Computing (CMIC), University College London, London, UK ,grid.83440.3b0000000121901201Department of Neuroinflammation, Queen Square Multiple Sclerosis Centre, UCL Institute of Neurology, University College London, London, UK ,grid.36083.3e0000 0001 2171 6620E-Health Centre, Universitat Oberta de Catalunya, Barcelona, Spain
| | - Michael Scheel
- grid.6363.00000 0001 2218 4662Department of Neuroradiology, Charité School of Medicine and University Hospital Berlin, Berlin, Germany
| | - Jawed Nawabi
- grid.6363.00000 0001 2218 4662Department of Radiology, Charité School of Medicine and University Hospital Berlin, Berlin, Germany ,grid.484013.a0000 0004 6879 971XBerlin Institute of Health at Charité – Universitätsmedizin Berlin, BIH Biomedical Innovation Academy, BIH Charité Digital Clinician Scientist Program, Berlin, Germany
| | - Frederik Barkhof
- grid.83440.3b0000000121901201Neuroradiological Academic Unit, UCL Queen Square Institute of Neurology, University College London, London, UK ,grid.83440.3b0000000121901201Department of Medical Physics and Bioengineering, Centre for Medical Image Computing (CMIC), University College London, London, UK ,grid.83440.3b0000000121901201Department of Neuroinflammation, Queen Square Multiple Sclerosis Centre, UCL Institute of Neurology, University College London, London, UK ,grid.12380.380000 0004 1754 9227Radiology & Nuclear Medicine, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands
| |
Collapse
|
8
|
Barragán-Montero A, Bibal A, Dastarac MH, Draguet C, Valdés G, Nguyen D, Willems S, Vandewinckele L, Holmström M, Löfman F, Souris K, Sterpin E, Lee JA. Towards a safe and efficient clinical implementation of machine learning in radiation oncology by exploring model interpretability, explainability and data-model dependency. Phys Med Biol 2022; 67:10.1088/1361-6560/ac678a. [PMID: 35421855 PMCID: PMC9870296 DOI: 10.1088/1361-6560/ac678a] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 04/14/2022] [Indexed: 01/26/2023]
Abstract
The interest in machine learning (ML) has grown tremendously in recent years, partly due to the performance leap that occurred with new techniques of deep learning, convolutional neural networks for images, increased computational power, and wider availability of large datasets. Most fields of medicine follow that popular trend and, notably, radiation oncology is one of those that are at the forefront, with already a long tradition in using digital images and fully computerized workflows. ML models are driven by data, and in contrast with many statistical or physical models, they can be very large and complex, with countless generic parameters. This inevitably raises two questions, namely, the tight dependence between the models and the datasets that feed them, and the interpretability of the models, which scales with its complexity. Any problems in the data used to train the model will be later reflected in their performance. This, together with the low interpretability of ML models, makes their implementation into the clinical workflow particularly difficult. Building tools for risk assessment and quality assurance of ML models must involve then two main points: interpretability and data-model dependency. After a joint introduction of both radiation oncology and ML, this paper reviews the main risks and current solutions when applying the latter to workflows in the former. Risks associated with data and models, as well as their interaction, are detailed. Next, the core concepts of interpretability, explainability, and data-model dependency are formally defined and illustrated with examples. Afterwards, a broad discussion goes through key applications of ML in workflows of radiation oncology as well as vendors' perspectives for the clinical implementation of ML.
Collapse
Affiliation(s)
- Ana Barragán-Montero
- Molecular Imaging, Radiation and Oncology (MIRO) Laboratory, Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Belgium
| | - Adrien Bibal
- PReCISE, NaDI Institute, Faculty of Computer Science, UNamur and CENTAL, ILC, UCLouvain, Belgium
| | - Margerie Huet Dastarac
- Molecular Imaging, Radiation and Oncology (MIRO) Laboratory, Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Belgium
| | - Camille Draguet
- Molecular Imaging, Radiation and Oncology (MIRO) Laboratory, Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Belgium
- Department of Oncology, Laboratory of Experimental Radiotherapy, KU Leuven, Belgium
| | - Gilmer Valdés
- Department of Radiation Oncology, Department of Epidemiology and Biostatistics, University of California, San Francisco, United States of America
| | - Dan Nguyen
- Medical Artificial Intelligence and Automation (MAIA) Laboratory, Department of Radiation Oncology, UT Southwestern Medical Center, United States of America
| | - Siri Willems
- ESAT/PSI, KU Leuven Belgium & MIRC, UZ Leuven, Belgium
| | | | | | | | - Kevin Souris
- Molecular Imaging, Radiation and Oncology (MIRO) Laboratory, Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Belgium
| | - Edmond Sterpin
- Molecular Imaging, Radiation and Oncology (MIRO) Laboratory, Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Belgium
- Department of Oncology, Laboratory of Experimental Radiotherapy, KU Leuven, Belgium
| | - John A Lee
- Molecular Imaging, Radiation and Oncology (MIRO) Laboratory, Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Belgium
| |
Collapse
|