1
|
Doran SJ, Barfoot T, Wedlake L, Winfield JM, Petts J, Glocker B, Li X, Leach M, Kaiser M, Barwick TD, Chaidos A, Satchwell L, Soneji N, Elgendy K, Sheeka A, Wallitt K, Koh DM, Messiou C, Rockall A. Curation of myeloma observational study MALIMAR using XNAT: solving the challenges posed by real-world data. Insights Imaging 2024; 15:47. [PMID: 38361108 PMCID: PMC10869673 DOI: 10.1186/s13244-023-01591-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 12/06/2023] [Indexed: 02/17/2024] Open
Abstract
OBJECTIVES MAchine Learning In MyelomA Response (MALIMAR) is an observational clinical study combining "real-world" and clinical trial data, both retrospective and prospective. Images were acquired on three MRI scanners over a 10-year window at two institutions, leading to a need for extensive curation. METHODS Curation involved image aggregation, pseudonymisation, allocation between project phases, data cleaning, upload to an XNAT repository visible from multiple sites, annotation, incorporation of machine learning research outputs and quality assurance using programmatic methods. RESULTS A total of 796 whole-body MR imaging sessions from 462 subjects were curated. A major change in scan protocol part way through the retrospective window meant that approximately 30% of available imaging sessions had properties that differed significantly from the remainder of the data. Issues were found with a vendor-supplied clinical algorithm for "composing" whole-body images from multiple imaging stations. Historic weaknesses in a digital video disk (DVD) research archive (already addressed by the mid-2010s) were highlighted by incomplete datasets, some of which could not be completely recovered. The final dataset contained 736 imaging sessions for 432 subjects. Software was written to clean and harmonise data. Implications for the subsequent machine learning activity are considered. CONCLUSIONS MALIMAR exemplifies the vital role that curation plays in machine learning studies that use real-world data. A research repository such as XNAT facilitates day-to-day management, ensures robustness and consistency and enhances the value of the final dataset. The types of process described here will be vital for future large-scale multi-institutional and multi-national imaging projects. CRITICAL RELEVANCE STATEMENT This article showcases innovative data curation methods using a state-of-the-art image repository platform; such tools will be vital for managing the large multi-institutional datasets required to train and validate generalisable ML algorithms and future foundation models in medical imaging. KEY POINTS • Heterogeneous data in the MALIMAR study required the development of novel curation strategies. • Correction of multiple problems affecting the real-world data was successful, but implications for machine learning are still being evaluated. • Modern image repositories have rich application programming interfaces enabling data enrichment and programmatic QA, making them much more than simple "image marts".
Collapse
Affiliation(s)
- Simon J Doran
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK.
- National Cancer Imaging Translational Accelerator, London, UK.
| | - Theo Barfoot
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
- Department of Radiology, The Royal Marsden NHS Foundation Trust, London, UK
| | | | - Jessica M Winfield
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
- Joint Department of Physics, The Royal Marsden NHS Foundation Trust, London, UK
| | - James Petts
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
| | - Ben Glocker
- Department of Computing, Imperial College London, London, UK
| | - Xingfeng Li
- Division of Cancer, Department of Surgery and Cancer, Imperial College London, London, UK
| | - Martin Leach
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
- Joint Department of Physics, The Royal Marsden NHS Foundation Trust, London, UK
| | - Martin Kaiser
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
- Haemato-Oncology Unit, The Royal Marsden NHS Foundation Trust, London, UK
| | - Tara D Barwick
- Division of Cancer, Department of Surgery and Cancer, Imperial College London, London, UK
- Department of Radiology, Imperial College Healthcare NHS Trust, London, UK
| | - Aristeidis Chaidos
- Department of Haematology, Imperial College Healthcare NHS Trust, London, UK
| | - Laura Satchwell
- Research and Development Statistics Unit, The Royal Marsden NHS Foundation Trust, London, UK
| | - Neil Soneji
- Department of Radiology, Imperial College Healthcare NHS Trust, London, UK
| | - Khalil Elgendy
- Department of Radiology, Imperial College Healthcare NHS Trust, London, UK
| | - Alexander Sheeka
- Department of Radiology, Imperial College Healthcare NHS Trust, London, UK
| | - Kathryn Wallitt
- Department of Radiology, Imperial College Healthcare NHS Trust, London, UK
| | - Dow-Mu Koh
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
- National Cancer Imaging Translational Accelerator, London, UK
- Department of Radiology, The Royal Marsden NHS Foundation Trust, London, UK
| | - Christina Messiou
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
- Department of Radiology, The Royal Marsden NHS Foundation Trust, London, UK
| | - Andrea Rockall
- Division of Cancer, Department of Surgery and Cancer, Imperial College London, London, UK
- Department of Radiology, Imperial College Healthcare NHS Trust, London, UK
| |
Collapse
|
3
|
Sjöholm T, Tarai S, Malmberg F, Strand R, Korenyushkin A, Enblad G, Ahlström H, Kullberg J. A whole-body diffusion MRI normal atlas: development, evaluation and initial use. Cancer Imaging 2023; 23:87. [PMID: 37710346 PMCID: PMC10503210 DOI: 10.1186/s40644-023-00603-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 08/28/2023] [Indexed: 09/16/2023] Open
Abstract
BACKGROUND Statistical atlases can provide population-based descriptions of healthy volunteers and/or patients and can be used for region- and voxel-based analysis. This work aims to develop whole-body diffusion atlases of healthy volunteers scanned at 1.5T and 3T. Further aims include evaluating the atlases by establishing whole-body Apparent Diffusion Coefficient (ADC) values of healthy tissues and including healthy tissue deviations in an automated tumour segmentation task. METHODS Multi-station whole-body Diffusion Weighted Imaging (DWI) and water-fat Magnetic Resonance Imaging (MRI) of healthy volunteers (n = 45) were acquired at 1.5T (n = 38) and/or 3T (n = 29), with test-retest imaging for five subjects per scanner. Using deformable image registration, whole-body MRI data was registered and composed into normal atlases. Healthy tissue ADCmean was manually measured for ten tissues, with test-retest percentage Repeatability Coefficient (%RC), and effect of age, sex and scanner assessed. Voxel-wise whole-body analyses using the normal atlases were studied with ADC correlation analyses and an automated tumour segmentation task. For the latter, lymphoma patient MRI scans (n = 40) with and without information about healthy tissue deviations were entered into a 3D U-Net architecture. RESULTS Sex- and Body Mass Index (BMI)-stratified whole-body high b-value DWI and ADC normal atlases were created at 1.5T and 3T. %RC of healthy tissue ADCmean varied depending on tissue assessed (4-48% at 1.5T, 6-70% at 3T). Scanner differences in ADCmean were visualised in Bland-Altman analyses of dually scanned subjects. Sex differences were measurable for liver, muscle and bone at 1.5T, and muscle at 3T. Volume of Interest (VOI)-based multiple linear regression, and voxel-based correlations in normal atlas space, showed that age and ADC were negatively associated for liver and bone at 1.5T, and positively associated with brain tissue at 1.5T and 3T. Adding voxel-wise information about healthy tissue deviations in an automated tumour segmentation task gave numerical improvements in the segmentation metrics Dice score, sensitivity and precision. CONCLUSIONS Whole-body DWI and ADC normal atlases were created at 1.5T and 3T, and applied in whole-body voxel-wise analyses.
Collapse
Affiliation(s)
- Therese Sjöholm
- Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
| | - Sambit Tarai
- Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
| | - Filip Malmberg
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Robin Strand
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | | | - Gunilla Enblad
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Håkan Ahlström
- Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
- Antaros Medical AB, Mölndal, Sweden
| | - Joel Kullberg
- Department of Surgical Sciences, Uppsala University, Uppsala, Sweden.
- Antaros Medical AB, Mölndal, Sweden.
| |
Collapse
|
5
|
Lother D, Robert M, Elwood E, Smith S, Tunariu N, Johnston SRD, Parton M, Bhaludin B, Millard T, Downey K, Sharma B. Imaging in metastatic breast cancer, CT, PET/CT, MRI, WB-DWI, CCA: review and new perspectives. Cancer Imaging 2023; 23:53. [PMID: 37254225 DOI: 10.1186/s40644-023-00557-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 04/17/2023] [Indexed: 06/01/2023] Open
Abstract
BACKGROUND Breast cancer is the most frequent cancer in women and remains the second leading cause of death in Western countries. It represents a heterogeneous group of diseases with diverse tumoral behaviour, treatment responsiveness and prognosis. While major progress in diagnosis and treatment has resulted in a decline in breast cancer-related mortality, some patients will relapse and prognosis in this cohort of patients remains poor. Treatment is determined according to tumor subtype; primarily hormone receptor status and HER2 expression. Menopausal status and site of disease relapse are also important considerations in treatment protocols. MAIN BODY Staging and repeated evaluation of patients with metastatic breast cancer are central to the accurate assessment of disease extent at diagnosis and during treatment; guiding ongoing clinical management. Advances have been made in the diagnostic and therapeutic fields, particularly with new targeted therapies. In parallel, oncological imaging has evolved exponentially with the development of functional and anatomical imaging techniques. Consistent, reproducible and validated methods of assessing response to therapy is critical in effectively managing patients with metastatic breast cancer. CONCLUSION Major progress has been made in oncological imaging over the last few decades. Accurate disease assessment at diagnosis and during treatment is important in the management of metastatic breast cancer. CT (and BS if appropriate) is generally widely available, relatively cheap and sufficient in many cases. However, several additional imaging modalities are emerging and can be used as adjuncts, particularly in pregnancy or other diagnostically challenging cases. Nevertheless, no single imaging technique is without limitation. The authors have evaluated the vast array of imaging techniques - individual, combined parametric and multimodal - that are available or that are emerging in the management of metastatic breast cancer. This includes WB DW-MRI, CCA, novel PET breast cancer-epitope specific radiotracers and radiogenomics.
Collapse
Affiliation(s)
| | - Marie Robert
- Institut de Cancérologie de l'Ouest, St Herblain, France
| | | | - Sam Smith
- The Royal Marsden Hospital, London & Sutton, UK
| | - Nina Tunariu
- The Royal Marsden Hospital, London & Sutton, UK
- The Institute of Cancer Research (ICR), London & Sutton, UK
| | - Stephen R D Johnston
- The Royal Marsden Hospital, London & Sutton, UK
- The Institute of Cancer Research (ICR), London & Sutton, UK
| | | | | | | | - Kate Downey
- The Royal Marsden Hospital, London & Sutton, UK
- The Institute of Cancer Research (ICR), London & Sutton, UK
| | - Bhupinder Sharma
- The Royal Marsden Hospital, London & Sutton, UK.
- The Institute of Cancer Research (ICR), London & Sutton, UK.
| |
Collapse
|