1
|
Abbaspour E, Mansoori B, Karimzadhagh S, Chalian M, Pouramini A, Sheida F, Daskareh M, Haseli S. Machine learning and deep learning models for preoperative detection of lymph node metastasis in colorectal cancer: a systematic review and meta-analysis. Abdom Radiol (NY) 2025; 50:1927-1941. [PMID: 39522103 DOI: 10.1007/s00261-024-04668-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 10/26/2024] [Accepted: 10/28/2024] [Indexed: 11/16/2024]
Abstract
OBJECTIVE To evaluate the diagnostic performance of Machine Learning (ML) and Deep Learning (DL) models for predicting preoperative Lymph Node Metastasis (LNM) in Colorectal Cancer (CRC) patients. METHODS A systematic review and meta-analysis were conducted following PRISMA-DTA and AMSTAR-2 guidelines. We searched PubMed, Web of Science, Embase, and Cochrane Library databases until February 16, 2024. Study quality and risk of bias were assessed using the QUADAS-2 tool. Data were analyzed using STATA v18, applying random-effects models to all analyses. RESULTS Twelve studies involving 8321 patients were included, with most published in 2021-2024 (9/12). The pooled AUC of ML models for predicting LNM in CRC patients was 0.87 (95% CI: 0.82-0.91, I2:86.17) with a sensitivity of 78% (95% CI: 69-87%) and a specificity of 77% (95% CI: 64%-90%). In addition, when assessing the AUC reported by radiologists, both junior and senior radiologists had similar performance, significantly lower than the ML models. (P < 0.001). Subgroup analysis revealed higher AUCs in prospective studies (0.95, 95% CI: 0.87-1) compared to retrospective studies (0.85, 95% CI: 0.81-0.89) (P = 0.03). Studies without external validation exhibited significantly higher AUCs than those with external validation (P < 0.01). While there was no significant difference in AUC and sensitivity between the T1-T2 and T2-T4 stages, specificity was significantly higher in the T2-T4 stages than the low stages of T1 and T2 (95%, 95% CI: 92-98% vs. 61%, 95% CI: 44-78%; P < 0.01). CONCLUSION ML models demonstrate strong potential for preoperative LNM staging and treatment planning in CRC, potentially reducing the need for additional surgeries and related health and financial burdens. Further prospective multicenter studies, with standardized reporting of algorithms, modality parameters, and LNM staging, are needed to validate these findings.
Collapse
Affiliation(s)
- Elahe Abbaspour
- Division of Musculoskeletal Imaging and Intervention, Department of Radiology, University of Washington, Seattle, WA, USA
| | - Bahar Mansoori
- Division of Abdominal Imaging, Department of Radiology, University of Washington, Seattle, WA, USA
| | - Sahand Karimzadhagh
- Division of Musculoskeletal Imaging and Intervention, Department of Radiology, University of Washington, Seattle, WA, USA.
- Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Majid Chalian
- Division of Musculoskeletal Imaging and Intervention, Department of Radiology, University of Washington, Seattle, WA, USA
| | - Alireza Pouramini
- Division of Musculoskeletal Imaging and Intervention, Department of Radiology, University of Washington, Seattle, WA, USA
| | - Fateme Sheida
- Gastrointestinal and Liver Diseases Research Center, Guilan University of Medical Sciences, Rasht, Iran
- Cancer Research Center, Hamadan University of Medical Sciences, Hamedan, Iran
| | - Mahyar Daskareh
- Department of Radiology, University of California San Diego, San Diego, CA, USA
| | - Sara Haseli
- Division of Musculoskeletal Imaging and Intervention, Department of Radiology, University of Washington, Seattle, WA, USA
- Department of Radiology, The OncoRad Research Core, University of Washington/Fred Hutchinson Cancer Center, Seattle, WA, USA
| |
Collapse
|
2
|
Magnin CY, Lauer D, Ammeter M, Gote-Schniering J. From images to clinical insights: an educational review on radiomics in lung diseases. Breathe (Sheff) 2025; 21:230225. [PMID: 40104259 PMCID: PMC11915127 DOI: 10.1183/20734735.0225-2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Accepted: 12/16/2024] [Indexed: 03/20/2025] Open
Abstract
Radiological imaging is a cornerstone in the clinical workup of lung diseases. Radiomics represents a significant advancement in clinical lung imaging, offering a powerful tool to complement traditional qualitative image analysis. Radiomic features are quantitative and computationally describe shape, intensity, texture and wavelet characteristics from medical images that can uncover detailed and often subtle information that goes beyond the visual capabilities of radiological examiners. By extracting this quantitative information, radiomics can provide deep insights into the pathophysiology of lung diseases and support clinical decision-making as well as personalised medicine approaches. In this educational review, we provide a step-by-step guide to radiomics-based medical image analysis, discussing the technical challenges and pitfalls, and outline the potential clinical applications of radiomics in diagnosing, prognosticating and evaluating treatment responses in respiratory medicine.
Collapse
Affiliation(s)
- Cheryl Y Magnin
- Department of Rheumatology and Immunology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Lung Precision Medicine (LPM), Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
- Both authors contributed equally
| | - David Lauer
- Department of Rheumatology and Immunology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Lung Precision Medicine (LPM), Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
- Both authors contributed equally
| | - Michael Ammeter
- Department of Rheumatology and Immunology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Lung Precision Medicine (LPM), Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
- Graduate School for Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland
| | - Janine Gote-Schniering
- Department of Rheumatology and Immunology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Lung Precision Medicine (LPM), Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
- Department of Pulmonary Medicine, Allergology and Clinical Immunology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| |
Collapse
|
3
|
Hamada A, Yasaka K, Hatano S, Kurokawa M, Inui S, Kubo T, Watanabe Y, Abe O. Deep-Learning Reconstruction of High-Resolution CT Improves Interobserver Agreement for the Evaluation of Pulmonary Fibrosis. Can Assoc Radiol J 2024; 75:542-548. [PMID: 38293802 DOI: 10.1177/08465371241228468] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2024] Open
Abstract
Objective: This study aimed to investigate whether deep-learning reconstruction (DLR) improves interobserver agreement in the evaluation of honeycombing for patients with interstitial lung disease (ILD) who underwent high-resolution computed tomography (CT) compared with hybrid iterative reconstruction (HIR). Methods: In this retrospective study, 35 consecutive patients suspected of ILD who underwent CT including the chest region were included. High-resolution CT images of the unilateral lung with DLR and HIR were reconstructed for the right and left lungs. A radiologist placed regions of interest on the lung and measured standard deviation of CT attenuation (i.e., quantitative image noise). In the qualitative image analyses, 5 blinded readers assessed the presence of honeycombing and reticulation, qualitative image noise, artifacts, and overall image quality using a 5-point scale (except for artifacts which was evaluated using a 3-point scale). Results: The quantitative and qualitative image noise in DLR was remarkably reduced compared to that in HIR (P < .001). Artifacts and overall DLR quality were significantly improved compared to those of HIR (P < .001 for 4 out of 5 readers). Interobserver agreement in the evaluations of honeycombing and reticulation for DLR (0.557 [0.450-0.693] and 0.525 [0.470-0.541], respectively) were higher than those for HIR (0.321 [0.211-0.520] and 0.470 [0.354-0.533], respectively). A statistically significant difference was found for honeycombing (P = .014). Conclusions: DLR improved interobserver agreement in the evaluation of honeycombing in patients with ILD on CT compared to HIR.
Collapse
Affiliation(s)
- Akiyoshi Hamada
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Koichiro Yasaka
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Sosuke Hatano
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Mariko Kurokawa
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Shohei Inui
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Takatoshi Kubo
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Yusuke Watanabe
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| | - Osamu Abe
- Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
| |
Collapse
|
4
|
Fujita N, Yasaka K, Hatano S, Sakamoto N, Kurokawa R, Abe O. Deep learning reconstruction for high-resolution computed tomography images of the temporal bone: comparison with hybrid iterative reconstruction. Neuroradiology 2024; 66:1105-1112. [PMID: 38514472 DOI: 10.1007/s00234-024-03330-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 03/04/2024] [Indexed: 03/23/2024]
Abstract
PURPOSE We investigated whether the quality of high-resolution computed tomography (CT) images of the temporal bone improves with deep learning reconstruction (DLR) compared with hybrid iterative reconstruction (HIR). METHODS This retrospective study enrolled 36 patients (15 men, 21 women; age, 53.9 ± 19.5 years) who had undergone high-resolution CT of the temporal bone. Axial and coronal images were reconstructed using DLR, HIR, and filtered back projection (FBP). In qualitative image analyses, two radiologists independently compared the DLR and HIR images with FBP in terms of depiction of structures, image noise, and overall quality, using a 5-point scale (5 = better than FBP, 1 = poorer than FBP) to evaluate image quality. The other two radiologists placed regions of interest on the tympanic cavity and measured the standard deviation of CT attenuation (i.e., quantitative image noise). Scores from the qualitative and quantitative analyses of the DLR and HIR images were compared using, respectively, the Wilcoxon signed-rank test and the paired t-test. RESULTS Qualitative and quantitative image noise was significantly reduced in DLR images compared with HIR images (all comparisons, p ≤ 0.016). Depiction of the otic capsule, auditory ossicles, and tympanic membrane was significantly improved in DLR images compared with HIR images (both readers, p ≤ 0.003). Overall image quality was significantly superior in DLR images compared with HIR images (both readers, p < 0.001). CONCLUSION Compared with HIR, DLR provided significantly better-quality high-resolution CT images of the temporal bone.
Collapse
Affiliation(s)
- Nana Fujita
- Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Koichiro Yasaka
- Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan.
| | - Sosuke Hatano
- Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Naoya Sakamoto
- Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Ryo Kurokawa
- Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Osamu Abe
- Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| |
Collapse
|
5
|
Hsiao CC, Peng CH, Wu FZ, Cheng DC. Impact of Voxel Normalization on a Machine Learning-Based Method: A Study on Pulmonary Nodule Malignancy Diagnosis Using Low-Dose Computed Tomography (LDCT). Diagnostics (Basel) 2023; 13:3690. [PMID: 38132274 PMCID: PMC10742752 DOI: 10.3390/diagnostics13243690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/04/2023] [Accepted: 12/14/2023] [Indexed: 12/23/2023] Open
Abstract
Lung cancer (LC) stands as the foremost cause of cancer-related fatality rates worldwide. Early diagnosis significantly enhances patient survival rate. Nowadays, low-dose computed tomography (LDCT) is widely employed on the chest as a tool for large-scale lung cancer screening. Nonetheless, a large amount of chest radiographs creates an onerous burden for radiologists. Some computer-aided diagnostic (CAD) tools can provide insight to the use of medical images for diagnosis and can augment diagnostic speed. However, due to the variation in the parameter settings across different patients, substantial discrepancies in image voxels persist. We found that different voxel sizes can create a compromise between model generalization and diagnostic efficacy. This study investigates the performance disparities of diagnostic models trained on original images and LDCT images reconstructed to different voxel sizes while making isotropic. We examined the ability of our method to differentiate between benign and malignant nodules. Using 11 features, a support vector machine (SVM) was trained on LDCT images using an isotropic voxel with a side length of 1.5 mm for 225 patients in-house. The result yields a favorable model performance with an accuracy of 0.9596 and an area under the receiver operating characteristic curve (ROC/AUC) of 0.9855. In addition, to furnish CAD tools for clinical application, future research including LDCT images from multi-centers is encouraged.
Collapse
Affiliation(s)
- Chia-Chi Hsiao
- Department of Radiology, Kaohsiung Veterans General Hospital, Kaohsiung 813414, Taiwan;
| | - Chen-Hao Peng
- Department of Biomedical Imaging and Radiological Science, China Medical University, Taichung 40400, Taiwan;
| | - Fu-Zong Wu
- Department of Radiology, Kaohsiung Veterans General Hospital, Kaohsiung 813414, Taiwan;
| | - Da-Chuan Cheng
- Department of Biomedical Imaging and Radiological Science, China Medical University, Taichung 40400, Taiwan;
| |
Collapse
|