1
|
Yangi K, Hong J, Gholami AS, On TJ, Reed AG, Puppalla P, Chen J, Calderon Valero CE, Xu Y, Li B, Santello M, Lawton MT, Preul MC. Deep learning in neurosurgery: a systematic literature review with a structured analysis of applications across subspecialties. Front Neurol 2025; 16:1532398. [PMID: 40308224 PMCID: PMC12040697 DOI: 10.3389/fneur.2025.1532398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 03/04/2025] [Indexed: 05/02/2025] Open
Abstract
Objective This study systematically reviewed deep learning (DL) applications in neurosurgical practice to provide a comprehensive understanding of DL in neurosurgery. The review process included a systematic overview of recent developments in DL technologies, an examination of the existing literature on their applications in neurosurgery, and insights into the future of neurosurgery. The study also summarized the most widely used DL algorithms, their specific applications in neurosurgical practice, their limitations, and future directions. Materials and methods An advanced search using medical subject heading terms was conducted in Medline (via PubMed), Scopus, and Embase databases restricted to articles published in English. Two independent neurosurgically experienced reviewers screened selected articles. Results A total of 456 articles were initially retrieved. After screening, 162 were found eligible and included in the study. Reference lists of all 162 articles were checked, and 19 additional articles were found eligible and included in the study. The 181 included articles were divided into 6 categories according to the subspecialties: general neurosurgery (n = 64), neuro-oncology (n = 49), functional neurosurgery (n = 32), vascular neurosurgery (n = 17), neurotrauma (n = 9), and spine and peripheral nerve (n = 10). The leading procedures in which DL algorithms were most commonly used were deep brain stimulation and subthalamic and thalamic nuclei localization (n = 24) in the functional neurosurgery group; segmentation, identification, classification, and diagnosis of brain tumors (n = 29) in the neuro-oncology group; and neuronavigation and image-guided neurosurgery (n = 13) in the general neurosurgery group. Apart from various video and image datasets, computed tomography, magnetic resonance imaging, and ultrasonography were the most frequently used datasets to train DL algorithms in all groups overall (n = 79). Although there were few studies involving DL applications in neurosurgery in 2016, research interest began to increase in 2019 and has continued to grow in the 2020s. Conclusion DL algorithms can enhance neurosurgical practice by improving surgical workflows, real-time monitoring, diagnostic accuracy, outcome prediction, volumetric assessment, and neurosurgical education. However, their integration into neurosurgical practice involves challenges and limitations. Future studies should focus on refining DL models with a wide variety of datasets, developing effective implementation techniques, and assessing their affect on time and cost efficiency.
Collapse
Affiliation(s)
- Kivanc Yangi
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Jinpyo Hong
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Arianna S. Gholami
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Thomas J. On
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Alexander G. Reed
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Pravarakhya Puppalla
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Jiuxu Chen
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States
| | - Carlos E. Calderon Valero
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Yuan Xu
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Baoxin Li
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States
| | - Marco Santello
- School of Biological and Health Systems Engineering, Arizona State University, Tempe, AZ, United States
| | - Michael T. Lawton
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Mark C. Preul
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| |
Collapse
|
2
|
Ghosh A, Li H, Towbin A, Turpin B, Trout A. T2-weighted MRI radiomics for the prediction of pediatric and young adult rhabdomyosarcoma alveolar subtype and distant metastasis: a pilot study. Pediatr Radiol 2025:10.1007/s00247-025-06205-6. [PMID: 40100409 DOI: 10.1007/s00247-025-06205-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2024] [Revised: 02/14/2025] [Accepted: 02/19/2025] [Indexed: 03/20/2025]
Abstract
INTRODUCTION Rhabdomyosarcomas are the most common soft tissue sarcoma in children. While treatment outcomes have improved, risk-based therapy classification relies on staging and tumor subtypes for therapeutic planning. OBJECTIVE This study investigated the utility of T2-weighted MR radiomics features and machine learning models in identifying the presence of distant metastasis and alveolar histological subtypes at baseline imaging in children diagnosed with rhabdomyosarcoma. MATERIALS AND METHODS This retrospective cross-sectional study utilized MRIs from 86 patients, 49 (median age (IQR) 59 months (37-161), alveolar subtype=15, distant metastasis=9) of whom had been imaged at outside imaging centers (training set); and 37 (median age 52 months (24-164), alveolar subtype=14, distant metastasis=8) of whom were imaged at our institution (holdout validation set). Radiomic features were extracted from T2-weighted images. We selected features that demonstrated intra-scan repeatability and used maximum relevance and minimum redundancy supervised feature selection to identify the 50 most important features. Lasso logistic regression and support vector machine (SVM) classifiers were trained to predict binary outcomes. The median of all predictions for a given patient was used as patient-level predictions. DeLong's test compared the area under the receiver operating characteristic curves (AUC). Cut-offs obtained by maximizing the Youden index were evaluated on an external validation set, and accuracy metrics were reported. RESULTS On the validation set, the Lasso and SVM classifiers obtained patient level AUCs of 0.76 (95% CI 0.59-0.94) and 0.73 (0.54-0.92), respectively, in predicting alveolar subtype, with the Lasso regressor obtaining 71.4% (41.9-91.6) sensitivity and 60.9% (38.5-80.3) specificity. When predicting the presence of distant metastasis, the Lasso and SVM classifier had AUCs of 0.81 (0.67-0.95) and 0.77 (0.58-0.97), respectively. There were no differences between model performance (P>0.05). A total of 12 and 18 features had nonzero coefficients in the Lasso regressors for predicting alveolar subtype and tumor metastasis, respectively. CONCLUSION MRI radiomics from baseline T2-weighted MRI demonstrated potential in predicting alveolar subtype and distant metastatic disease at presentation. Larger studies are needed to explore multinomial multiclass models for better prognostication of pediatric rhabdomyosarcomas.
Collapse
Affiliation(s)
- Adarsh Ghosh
- Cincinnati Children's Hospital Medical Center, Cincinnati, USA.
- Nationwide Children's Hospital, 700 Children's Drive, Columbus, OH, 43205, USA.
| | - Hailong Li
- Cincinnati Children's Hospital Medical Center, Cincinnati, USA
| | | | - Brian Turpin
- Cincinnati Children's Hospital Medical Center, Cincinnati, USA
| | - Andrew Trout
- Cincinnati Children's Hospital Medical Center, Cincinnati, USA
| |
Collapse
|
3
|
Tixier F, Rodriguez D, Jones J, Martin L, Yassall A, Selvaraj B, Islam M, Ostendorf A, Hester ME, Ho ML. Radiomic detection of abnormal brain regions in tuberous sclerosis complex. Med Phys 2024; 51:9103-9114. [PMID: 39312593 DOI: 10.1002/mp.17400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 06/18/2024] [Accepted: 08/22/2024] [Indexed: 09/25/2024] Open
Abstract
BACKGROUND Radiomics refers to the extraction of quantitative information from medical images and is most commonly utilized in oncology to provide ancillary information for solid tumor diagnosis, prognosis, and treatment response. The traditional radiomic pipeline involves segmentation of volumes of interest with comparison to normal brain. In other neurologic disorders, such as epilepsy, lesion delineation may be difficult or impossible due to poor anatomic definition, small size, and multifocal or diffuse distribution. Tuberous sclerosis complex (TSC) is a rare genetic disease in which brain magnetic resonance imaging (MRI) demonstrates multifocal abnormalities with variable imaging and epileptogenic features. PURPOSE The purpose of this study was to develop a radiomic workflow for identification of abnormal brain regions in TSC, using a whole-brain atlas-based approach with generation of heatmaps based on signal deviation from normal controls. METHODS This was a retrospective pilot study utilizing high-resolution whole-brain 3D FLAIR MRI datasets from retrospective enrollment of tuberous sclerosis complex (TSC) patients and normal controls. Subjects underwent MRI including high-resolution 3D FLAIR sequences. Preprocessing included skull stripping, coregistration, and intensity normalization. Using the Brainnetome and Harvard-Oxford atlases, brain regions were parcellated into 318 discrete regions. Expert neuroradiologists spatially labeled all tubers in TSC patients using ITK-SNAP. The pyradiomics toolbox was used to extract 88 radiomic features based on IBSI guidelines, comparing tuber-affected and non-tuber-affected parenchyma in TSC patients, as well as normal brain tissue in control patients. For model training and validation, regions with tubers from 20 TSC patients and 30 normal control subjects were randomly divided into two training sets (80%) and two validation sets (20%). Additional model testing was performed on a separate group of 20 healthy controls. LASSO (least absolute shrinkage and selection operator) was used to perform variable selection and regularization to identify regions containing tubers. Relevant radiomic features selected by LASSO were combined to produce a radiomic score ω, defined as the sum of squared differences from average control group values. Region-specific ω scores were converted to heat maps and spatially coregistered with brain MRI to reflect overall radiomic deviation from normal. RESULTS The proposed radiomic workflow allows for quantification of deviation from normal in 318 regions of the brain with the use of a summative radiomic score ω. This score can be used to generate spatially registered heatmaps to identify brain regions with radiomic abnormalities. The pilot study of TSC showed radiomic scores ω that were statistically different in regions containing tubers from regions without tubers/normal brain (p < 0.0001). Our model exhibits an AUC of 0.81 (95% confidence interval: 0.78-0.84) on the testing set, and the best threshold obtained on the training set, when applied to the testing set, allows us to identify regions with tubers with a specificity of 0.91 and a sensitivity of 0.60. CONCLUSION We describe a whole-brain atlas-based radiomic approach to identify abnormal brain regions in TSC patients. This approach may be helpful for identifying specific regions of interest based on relatively greater signal deviation, particularly in clinical scenarios with numerous or poorly defined anatomic lesions.
Collapse
Affiliation(s)
- Florent Tixier
- Department of Radiology, Nationwide Children's Hospital, Columbus, Ohio, USA
| | - Diana Rodriguez
- Department of Radiology, Nationwide Children's Hospital, Columbus, Ohio, USA
| | - Jeremy Jones
- Department of Radiology, Nationwide Children's Hospital, Columbus, Ohio, USA
| | - Lisa Martin
- Department of Radiology, Nationwide Children's Hospital, Columbus, Ohio, USA
| | - Anthony Yassall
- Department of Radiology, Nationwide Children's Hospital, Columbus, Ohio, USA
| | - Bhavani Selvaraj
- Department of Radiology, Nationwide Children's Hospital, Columbus, Ohio, USA
| | - Monica Islam
- Department of Neurology, Nationwide Children's Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio, USA
| | - Adam Ostendorf
- Department of Neurology, Nationwide Children's Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio, USA
| | - Mark E Hester
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio, USA
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Columbus, Ohio, USA
- Department of Neuroscience, College of Medicine, Ohio State University, Columbus, Ohio, USA
| | - Mai-Lan Ho
- Department of Radiology, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
4
|
Shakeri M, Mostaar A, Sadeghi AZ, Hosseini SM, Joybari AY, Ghadiri H. A Comprehensive Evaluation of Radiomic Features in Normal Brain Magnetic Resonance Imaging: Investigating Robustness and Region Variations. J Med Phys 2024; 49:608-622. [PMID: 39926136 PMCID: PMC11801087 DOI: 10.4103/jmp.jmp_149_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 10/22/2024] [Accepted: 11/01/2024] [Indexed: 02/11/2025] Open
Abstract
Background Despite extensive research on various brain diseases, a few studies have focused on radiomic feature distribution in healthy brain images. The present study applied a novel radiomic framework to investigate the robustness and baseline values of radiomic features in normal brain magnetic resonance imaging (MRIs) regions. Materials and Methods Analyses were performed on T1 and T2 images including 276 normal brains and 14 healthy volunteers were scanned with three scanners using the same protocols. The images were divided into 1024 three-dimensional nonoverlap patches with the same pixel size. Seven patches located in the thalamus, putamen, hippocampus and brain stem were selected as volume of interest (VOI). Eighty-five radiomic features were generated. To investigate the variation of features across VOIs, the analysis of variance was performed and coefficient of variation (COV) and intraclass correlation coefficient (ICC) were explored to examine the features repeatability. Results Thalamus (right and left) and hippocampus (left) resulted in more stable features (COV ≤ 6%) in T1 and T2 images, respectively. The inter-scanner ICC analysis demonstrated the features of T2 sequences represented more repeatable results and the brain stem and thalamus (both T1 and T2) showed particularly high repeatability (higher ICC values). Robust results (ICC ≥ 0.9) were identified for energy and range features of the first order class and several textures features across different brain regions. Conclusion Our results indicated the baselines of the repeatable texture features in healthy brain structural MRI highlighting inter-scanner stability. According to the findings, MRI sequencing and VOI location impact feature robustness and should be considered in brain radiomic studies.
Collapse
Affiliation(s)
- Mahsa Shakeri
- Department of Medical Physics and Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Research Center for Molecular and Cellular Imaging (RCMCI), Advanced Medical Technologies and Equipment Institute (AMTEI), Tehran University of Medical Sciences (TUMS), Tehran, Iran
| | - Ahmad Mostaar
- Department of Medical Physics and Biomedical Engineering, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Arash Zare Sadeghi
- Medical Physics Department, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Seyyed Mohammad Hosseini
- Department of Medical Physics and Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Research Center for Molecular and Cellular Imaging (RCMCI), Advanced Medical Technologies and Equipment Institute (AMTEI), Tehran University of Medical Sciences (TUMS), Tehran, Iran
| | - Ali Yaghobi Joybari
- Department of Radiation Oncology, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hossein Ghadiri
- Department of Medical Physics and Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Research Center for Molecular and Cellular Imaging (RCMCI), Advanced Medical Technologies and Equipment Institute (AMTEI), Tehran University of Medical Sciences (TUMS), Tehran, Iran
| |
Collapse
|
5
|
Teng X, Wang Y, Nicol AJ, Ching JCF, Wong EKY, Lam KTC, Zhang J, Lee SWY, Cai J. Enhancing the Clinical Utility of Radiomics: Addressing the Challenges of Repeatability and Reproducibility in CT and MRI. Diagnostics (Basel) 2024; 14:1835. [PMID: 39202322 PMCID: PMC11353986 DOI: 10.3390/diagnostics14161835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 08/03/2024] [Accepted: 08/16/2024] [Indexed: 09/03/2024] Open
Abstract
Radiomics, which integrates the comprehensive characterization of imaging phenotypes with machine learning algorithms, is increasingly recognized for its potential in the diagnosis and prognosis of oncological conditions. However, the repeatability and reproducibility of radiomic features are critical challenges that hinder their widespread clinical adoption. This review aims to address the paucity of discussion regarding the factors that influence the reproducibility and repeatability of radiomic features and their subsequent impact on the application of radiomic models. We provide a synthesis of the literature on the repeatability and reproducibility of CT/MR-based radiomic features, examining sources of variation, the number of reproducible features, and the availability of individual feature repeatability indices. We differentiate sources of variation into random effects, which are challenging to control but can be quantified through simulation methods such as perturbation, and biases, which arise from scanner variability and inter-reader differences and can significantly affect the generalizability of radiomic model performance in diverse settings. Four suggestions for repeatability and reproducibility studies are suggested: (1) detailed reporting of variation sources, (2) transparent disclosure of calculation parameters, (3) careful selection of suitable reliability indices, and (4) comprehensive reporting of reliability metrics. This review underscores the importance of random effects in feature selection and harmonizing biases between development and clinical application settings to facilitate the successful translation of radiomic models from research to clinical practice.
Collapse
Affiliation(s)
- Xinzhi Teng
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Yongqiang Wang
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Alexander James Nicol
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Jerry Chi Fung Ching
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Edwin Ka Yiu Wong
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Kenneth Tsz Chun Lam
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Jiang Zhang
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Shara Wee-Yee Lee
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
| | - Jing Cai
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, 11 Yuk Choi Rd, Hung Hom, Hong Kong SAR, China; (X.T.); (Y.W.); (A.J.N.); (J.C.F.C.); (E.K.Y.W.); (K.T.C.L.); (J.Z.)
- Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen 518057, China
| |
Collapse
|
6
|
Bonada M, Rossi LF, Carone G, Panico F, Cofano F, Fiaschi P, Garbossa D, Di Meco F, Bianconi A. Deep Learning for MRI Segmentation and Molecular Subtyping in Glioblastoma: Critical Aspects from an Emerging Field. Biomedicines 2024; 12:1878. [PMID: 39200342 PMCID: PMC11352020 DOI: 10.3390/biomedicines12081878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 07/29/2024] [Accepted: 07/31/2024] [Indexed: 09/02/2024] Open
Abstract
Deep learning (DL) has been applied to glioblastoma (GBM) magnetic resonance imaging (MRI) assessment for tumor segmentation and inference of molecular, diagnostic, and prognostic information. We comprehensively overviewed the currently available DL applications, critically examining the limitations that hinder their broader adoption in clinical practice and molecular research. Technical limitations to the routine application of DL include the qualitative heterogeneity of MRI, related to different machinery and protocols, and the absence of informative sequences, possibly compensated by artificial image synthesis. Moreover, taking advantage from the available benchmarks of MRI, algorithms should be trained on large amounts of data. Additionally, the segmentation of postoperative imaging should be further addressed to limit the inaccuracies previously observed for this task. Indeed, molecular information has been promisingly integrated in the most recent DL tools, providing useful prognostic and therapeutic information. Finally, ethical concerns should be carefully addressed and standardized to allow for data protection. DL has provided reliable results for GBM assessment concerning MRI analysis and segmentation, but the routine clinical application is still limited. The current limitations could be prospectively addressed, giving particular attention to data collection, introducing new technical advancements, and carefully regulating ethical issues.
Collapse
Affiliation(s)
- Marta Bonada
- Neurosurgery Unit, Department of Neuroscience, University of Turin, Via Cherasco 15, 10126 Turin, Italy; (M.B.); (F.C.); (D.G.)
- Department of Neurosurgery, Fondazione IRCCS Istituto Neurologico Carlo Besta, Via Celoria 11, 20133 Milan, Italy; (G.C.)
| | - Luca Francesco Rossi
- Department of Informatics, Polytechnic University of Turin, Corso Castelfidardo 39, 10129 Turin, Italy;
| | - Giovanni Carone
- Department of Neurosurgery, Fondazione IRCCS Istituto Neurologico Carlo Besta, Via Celoria 11, 20133 Milan, Italy; (G.C.)
| | - Flavio Panico
- Neurosurgery Unit, Department of Neuroscience, University of Turin, Via Cherasco 15, 10126 Turin, Italy; (M.B.); (F.C.); (D.G.)
| | - Fabio Cofano
- Neurosurgery Unit, Department of Neuroscience, University of Turin, Via Cherasco 15, 10126 Turin, Italy; (M.B.); (F.C.); (D.G.)
| | - Pietro Fiaschi
- Division of Neurosurgery, Ospedale Policlinico San Martino, IRCCS for Oncology and Neurosciences, Largo Rosanna Benzi 10, 16132 Genoa, Italy;
- Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics and Maternal and Child Health, University of Genoa, Largo Rosanna Benzi 10, 16132 Genoa, Italy
| | - Diego Garbossa
- Neurosurgery Unit, Department of Neuroscience, University of Turin, Via Cherasco 15, 10126 Turin, Italy; (M.B.); (F.C.); (D.G.)
| | - Francesco Di Meco
- Department of Neurosurgery, Fondazione IRCCS Istituto Neurologico Carlo Besta, Via Celoria 11, 20133 Milan, Italy; (G.C.)
| | - Andrea Bianconi
- Neurosurgery Unit, Department of Neuroscience, University of Turin, Via Cherasco 15, 10126 Turin, Italy; (M.B.); (F.C.); (D.G.)
- Division of Neurosurgery, Ospedale Policlinico San Martino, IRCCS for Oncology and Neurosciences, Largo Rosanna Benzi 10, 16132 Genoa, Italy;
| |
Collapse
|
7
|
A nomogram for preoperative risk stratification based on MR morphological parameters in patients with endometrioid adenocarcinoma. Eur J Radiol 2023; 163:110789. [PMID: 37068415 DOI: 10.1016/j.ejrad.2023.110789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 03/02/2023] [Accepted: 03/14/2023] [Indexed: 03/19/2023]
Abstract
PURPOSE To develop and validate a nomogram based on MRI morphological parameters to preoperatively discriminate between low-risk and non-low-risk patients with endometrioid endometrial carcinoma (EEC). METHODS Two hundred eighty-one women with histologically confirmed EEC were divided into training (1.5-T MRI, n = 182) and validation cohorts (3.0-T MRI, n = 99). According to the European Society of Medical Oncology guidelines, the patients were divided into four risk groups: low, intermediate, high-intermediate, and high. Binary classification models were developed (low-risk vs. non-low-risk). Univariate logistic regression (LR) analyses were used to determine which variables to select to build the predictive models. Five classification models were constructed, and the best model was selected. The area under the receiver operating characteristic curve (AUC) was calculated to evaluate the performance of the prediction model and nomogram. P < 0.05 indicated a statistically significant difference. RESULTS Age and four morphological parameters (tumor size, tumor volume, maximum anteroposterior tumor diameter on sagittal T2-weighted images (APsag), and tumor area ratio (TAR)) were selected, and the LR model was used to construct an MRI morphological nomogram. The AUCs for the nomogram in predicting a non-low-risk of EEC among patients in the training and validation cohorts were 0.856 (sensitivity = 75.0%, specificity = 83.1%) and 0.849 (sensitivity = 74.6%, specificity = 85.0%), respectively. CONCLUSION An MRI morphological nomogram was developed and achieved high diagnostic performance for classifying low-risk and non-low-risk EEC preoperatively, which could provide support for therapeutic decision-making. Furthermore, our findings indicate that this nomogram is robust in the clinical application of various field strength data.
Collapse
|
8
|
Robustness of radiomics to variations in segmentation methods in multimodal brain MRI. Sci Rep 2022; 12:16712. [PMID: 36202934 PMCID: PMC9537186 DOI: 10.1038/s41598-022-20703-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 09/16/2022] [Indexed: 11/09/2022] Open
Abstract
Radiomics in neuroimaging uses fully automatic segmentation to delineate the anatomical areas for which radiomic features are computed. However, differences among these segmentation methods affect radiomic features to an unknown extent. A scan-rescan dataset (n = 46) of T1-weighted and diffusion tensor images was used. Subjects were split into a sleep-deprivation and a control group. Scans were segmented using four segmentation methods from which radiomic features were computed. First, we measured segmentation agreement using the Dice-coefficient. Second, robustness and reproducibility of radiomic features were measured using the intraclass correlation coefficient (ICC). Last, difference in predictive power was assessed using the Friedman-test on performance in a radiomics-based sleep deprivation classification application. Segmentation agreement was generally high (interquartile range = 0.77–0.90) and median feature robustness to segmentation method variation was higher (ICC > 0.7) than scan-rescan reproducibility (ICC 0.3–0.8). However, classification performance differed significantly among segmentation methods (p < 0.001) ranging from 77 to 84%. Accuracy was higher for more recent deep learning-based segmentation methods. Despite high agreement among segmentation methods, subtle differences significantly affected radiomic features and their predictive power. Consequently, the effect of differences in segmentation methods should be taken into account when designing and evaluating radiomics-based research methods.
Collapse
|
9
|
Fatania K, Mohamud F, Clark A, Nix M, Short SC, O'Connor J, Scarsbrook AF, Currie S. Intensity standardization of MRI prior to radiomic feature extraction for artificial intelligence research in glioma-a systematic review. Eur Radiol 2022; 32:7014-7025. [PMID: 35486171 PMCID: PMC9474349 DOI: 10.1007/s00330-022-08807-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 03/11/2022] [Accepted: 04/10/2022] [Indexed: 11/16/2022]
Abstract
OBJECTIVES Radiomics is a promising avenue in non-invasive characterisation of diffuse glioma. Clinical translation is hampered by lack of reproducibility across centres and difficulty in standardising image intensity in MRI datasets. The study aim was to perform a systematic review of different methods of MRI intensity standardisation prior to radiomic feature extraction. METHODS MEDLINE, EMBASE, and SCOPUS were searched for articles meeting the following eligibility criteria: MRI radiomic studies where one method of intensity normalisation was compared with another or no normalisation, and original research concerning patients diagnosed with diffuse gliomas. Using PRISMA criteria, data were extracted from short-listed studies including number of patients, MRI sequences, validation status, radiomics software, method of segmentation, and intensity standardisation. QUADAS-2 was used for quality appraisal. RESULTS After duplicate removal, 741 results were returned from database and reference searches and, from these, 12 papers were eligible. Due to a lack of common pre-processing and different analyses, a narrative synthesis was sought. Three different intensity standardisation techniques have been studied: histogram matching (5/12), limiting or rescaling signal intensity (8/12), and deep learning (1/12)-only two papers compared different methods. From these studies, histogram matching produced the more reliable features compared to other methods of altering MRI signal intensity. CONCLUSION Multiple methods of intensity standardisation have been described in the literature without clear consensus. Further research that directly compares different methods of intensity standardisation on glioma MRI datasets is required. KEY POINTS • Intensity standardisation is a key pre-processing step in the development of robust radiomic signatures to evaluate diffuse glioma. • A minority of studies compared the impact of two or more methods. • Further research is required to directly compare multiple methods of MRI intensity standardisation on glioma datasets.
Collapse
Affiliation(s)
- Kavi Fatania
- Department of Radiology, Leeds Teaching Hospitals NHS Trust, Leeds, UK.
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK.
- Department of Radiology, Leeds General Infirmary, Great George Street, Leeds, LS1 3EX, UK.
| | | | - Anna Clark
- Department of Medical Physics, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - Michael Nix
- Department of Medical Physics, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - Susan C Short
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK
- Department of Clinical Oncology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - James O'Connor
- Division of Cancer Sciences, The University of Manchester, Manchester, UK
- Department of Radiology, The Christie Hospital, Manchester, UK
- Division of Radiotherapy and Imaging, Institute of Cancer Research, London, UK
| | - Andrew F Scarsbrook
- Department of Radiology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK
| | - Stuart Currie
- Department of Radiology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK
| |
Collapse
|
10
|
Scalco E, Rizzo G, Mastropietro A. The stability of oncologic MRI radiomic features and the potential role of deep learning: a review. Phys Med Biol 2022; 67. [DOI: 10.1088/1361-6560/ac60b9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 03/24/2022] [Indexed: 11/11/2022]
Abstract
Abstract
The use of MRI radiomic models for the diagnosis, prognosis and treatment response prediction of tumors has been increasingly reported in literature. However, its widespread adoption in clinics is hampered by issues related to features stability. In the MRI radiomic workflow, the main factors that affect radiomic features computation can be found in the image acquisition and reconstruction phase, in the image pre-processing steps, and in the segmentation of the region of interest on which radiomic indices are extracted. Deep Neural Networks (DNNs), having shown their potentiality in the medical image processing and analysis field, can be seen as an attractive strategy to partially overcome the issues related to radiomic stability and mitigate their impact. In fact, DNN approaches can be prospectively integrated in the MRI radiomic workflow to improve image quality, obtain accurate and reproducible segmentations and generate standardized images. In this review, DNN methods that can be included in the image processing steps of the radiomic workflow are described and discussed, in the light of a detailed analysis of the literature in the context of MRI radiomic reliability.
Collapse
|
11
|
Xue C, Yuan J, Zhou Y, Wong OL, Cheung KY, Yu SK. Acquisition repeatability of MRI radiomics features in the head and neck: a dual-3D-sequence multi-scan study. Vis Comput Ind Biomed Art 2022; 5:10. [PMID: 35359245 PMCID: PMC8971276 DOI: 10.1186/s42492-022-00106-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 02/23/2022] [Indexed: 02/08/2023] Open
Abstract
Radiomics has increasingly been investigated as a potential biomarker in quantitative imaging to facilitate personalized diagnosis and treatment of head and neck cancer (HNC), a group of malignancies associated with high heterogeneity. However, the feature reliability of radiomics is a major obstacle to its broad validity and generality in application to the highly heterogeneous head and neck (HN) tissues. In particular, feature repeatability of radiomics in magnetic resonance imaging (MRI) acquisition, which is considered a crucial confounding factor of radiomics feature reliability, is still sparsely investigated. This study prospectively investigated the acquisition repeatability of 93 MRI radiomics features in ten HN tissues of 15 healthy volunteers, aiming for potential magnetic resonance-guided radiotherapy (MRgRT) treatment of HNC. Each subject underwent four MRI acquisitions with MRgRT treatment position and immobilization using two pulse sequences of 3D T1-weighed turbo spin-echo and 3D T2-weighed turbo spin-echo on a 1.5 T MRI simulator. The repeatability of radiomics feature acquisition was evaluated in terms of the intraclass correlation coefficient (ICC), whereas within-subject acquisition variability was evaluated in terms of the coefficient of variation (CV). The results showed that MRI radiomics features exhibited heterogeneous acquisition variability and uncertainty dependent on feature types, tissues, and pulse sequences. Only a small fraction of features showed excellent acquisition repeatability (ICC > 0.9) and low within-subject variability. Multiple MRI scans improved the accuracy and confidence of the identification of reliable features concerning MRI acquisition compared to simple test-retest repeated scans. This study contributes to the literature on the reliability of radiomics features with respect to MRI acquisition and the selection of reliable radiomics features for use in modeling in future HNC MRgRT applications.
Collapse
Affiliation(s)
- Cindy Xue
- Research Department, Hong Kong Sanatorium & Hospital, Hong Kong, China
| | - Jing Yuan
- Research Department, Hong Kong Sanatorium & Hospital, Hong Kong, China.
| | - Yihang Zhou
- Research Department, Hong Kong Sanatorium & Hospital, Hong Kong, China
| | - Oi Lei Wong
- Research Department, Hong Kong Sanatorium & Hospital, Hong Kong, China
| | - Kin Yin Cheung
- Medical Physics Department, Hong Kong Sanatorium & Hospital, Hong Kong, China
| | - Siu Ki Yu
- Medical Physics Department, Hong Kong Sanatorium & Hospital, Hong Kong, China
| |
Collapse
|
12
|
Brain MRI Radiomics Analysis of School-Aged Children with Tetralogy of Fallot. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021; 2021:2380346. [PMID: 34745322 PMCID: PMC8570890 DOI: 10.1155/2021/2380346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 10/16/2021] [Indexed: 11/18/2022]
Abstract
Introduction Radiomics could be potential imaging biomarkers by capturing and analyzing the features. Children and adolescents with CHD have worse neurodevelopmental and functional outcomes compared with their peers. Early diagnosis and intervention are the necessity to improve neurological outcomes in CHD patients. Methods School-aged TOF patients and their healthy peers were recruited for MRI and neurodevelopmental assessment. LASSO regression was used for dimension reduction. ROC curve graph showed the performance of the model. Results Six related features were finally selected for modeling. The final model AUC was 0.750. The radiomics features can be potential significant predictors for neurodevelopmental diagnoses. Conclusion The radiomics on the conventional MRI can help predict the neurodevelopment of school-aged children and provide parents with rehabilitation advice as early as possible.
Collapse
|
13
|
Xue C, Yuan J, Lo GG, Chang ATY, Poon DMC, Wong OL, Zhou Y, Chu WCW. Radiomics feature reliability assessed by intraclass correlation coefficient: a systematic review. Quant Imaging Med Surg 2021; 11:4431-4460. [PMID: 34603997 DOI: 10.21037/qims-21-86] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 05/17/2021] [Indexed: 12/13/2022]
Abstract
Radiomics research is rapidly growing in recent years, but more concerns on radiomics reliability are also raised. This review attempts to update and overview the current status of radiomics reliability research in the ever expanding medical literature from the perspective of a single reliability metric of intraclass correlation coefficient (ICC). To conduct this systematic review, Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were followed. After literature search and selection, a total of 481 radiomics studies using CT, PET, or MRI, covering a wide range of subject and disease types, were included for review. In these highly heterogeneous studies, feature reliability to image segmentation was much more investigated than reliability to other factors, such as image acquisition, reconstruction, post-processing, and feature quantification. The reported ICCs also suggested high radiomics feature reliability to image segmentation. Image acquisition was found to introduce much more feature variability than image segmentation, in particular for MRI, based on the reported ICC values. Image post-processing and feature quantification yielded different levels of radiomics reliability and might be used to mitigate image acquisition-induced variability. Some common flaws and pitfalls in ICC use were identified, and suggestions on better ICC use were given. Due to the extremely high study heterogeneities and possible risks of bias, the degree of radiomics feature reliability that has been achieved could not yet be safely synthesized or derived in this review. More future researches on radiomics reliability are warranted.
Collapse
Affiliation(s)
- Cindy Xue
- Medical Physics and Research Department, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China.,Department of Imaging and Interventional Radiology, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Jing Yuan
- Medical Physics and Research Department, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China
| | - Gladys G Lo
- Department of Diagnostic & Interventional Radiology, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China
| | - Amy T Y Chang
- Comprehensive Oncology Centre, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China
| | - Darren M C Poon
- Comprehensive Oncology Centre, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China
| | - Oi Lei Wong
- Medical Physics and Research Department, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China
| | - Yihang Zhou
- Medical Physics and Research Department, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong, China
| | - Winnie C W Chu
- Department of Imaging and Interventional Radiology, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| |
Collapse
|