1
|
Elmekki H, Alagha A, Sami H, Spilkin A, Zanuttini AM, Zakeri E, Bentahar J, Kadem L, Xie WF, Pibarot P, Mizouni R, Otrok H, Singh S, Mourad A. CACTUS: An open dataset and framework for automated Cardiac Assessment and Classification of Ultrasound images using deep transfer learning. Comput Biol Med 2025; 190:110003. [PMID: 40107020 DOI: 10.1016/j.compbiomed.2025.110003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 03/02/2025] [Accepted: 03/05/2025] [Indexed: 03/22/2025]
Abstract
Cardiac ultrasound (US) scanning is one of the most commonly used techniques in cardiology to diagnose the health of the heart and its proper functioning. During a typical US scan, medical professionals take several images of the heart to be classified based on the cardiac views they contain, with a focus on high-quality images. However, this task is time consuming and error prone. Therefore, it is necessary to consider ways to automate these tasks and assist medical professionals in classifying and assessing cardiac US images. Machine learning (ML) techniques are regarded as a prominent solution due to their success in the development of numerous applications aimed at enhancing the medical field, including addressing the shortage of echography technicians. However, the limited availability of medical data presents a significant barrier to the application of ML in the field of cardiology, particularly regarding US images of the heart. This paper addresses this challenge by introducing the first open graded dataset for Cardiac Assessment and ClassificaTion of UltraSound (CACTUS), which is available online. This dataset contains images obtained from scanning a CAE Blue Phantom and representing various heart views and different quality levels, exceeding the conventional cardiac views typically found in literature. Additionally, the paper introduces a Deep Learning (DL) framework consisting of two main components. The first component is responsible for classifying cardiac US images based on the heart view using a Convolutional Neural Network (CNN) architecture. The second component uses the concept of Transfer Learning (TL) to utilize knowledge from the first component and fine-tune it to create a model for grading and assessing cardiac images. The framework demonstrates high performance in both classification and grading, achieving up to 99.43% accuracy and as low as 0.3067 error, respectively. To showcase its robustness, the framework is further fine-tuned using new images representing additional cardiac views and also compared to several other state-of-the-art architectures. The framework's outcomes and its performance in handling real-time scans were also assessed using a questionnaire answered by cardiac experts.
Collapse
Affiliation(s)
- Hanae Elmekki
- Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada.
| | - Ahmed Alagha
- Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada.
| | - Hani Sami
- Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada; Artificial Intelligence & Cyber Systems Research Center, Department of CSM, Lebanese American University, Beirut, Lebanon.
| | - Amanda Spilkin
- Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada.
| | | | - Ehsan Zakeri
- Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada.
| | - Jamal Bentahar
- Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada; Department of Computer Science, 6G Research Center, Khalifa University, Abu Dhabi, United Arab Emirates.
| | - Lyes Kadem
- Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada.
| | - Wen-Fang Xie
- Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada.
| | | | - Rabeb Mizouni
- Department of Computer Science, Khalifa University, Abu Dhabi, United Arab Emirates.
| | - Hadi Otrok
- Department of Computer Science, Khalifa University, Abu Dhabi, United Arab Emirates.
| | - Shakti Singh
- Department of Computer Science, Khalifa University, Abu Dhabi, United Arab Emirates.
| | - Azzam Mourad
- Department of Computer Science, 6G Research Center, Khalifa University, Abu Dhabi, United Arab Emirates; Artificial Intelligence & Cyber Systems Research Center, Department of CSM, Lebanese American University, Beirut, Lebanon.
| |
Collapse
|
2
|
Shah AA, Daud A, Bukhari A, Alshemaimri B, Ahsan M, Younis R. DEL-Thyroid: deep ensemble learning framework for detection of thyroid cancer progression through genomic mutation. BMC Med Inform Decis Mak 2024; 24:198. [PMID: 39039464 PMCID: PMC11533268 DOI: 10.1186/s12911-024-02604-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 07/10/2024] [Indexed: 07/24/2024] Open
Abstract
Genes, expressed as sequences of nucleotides, are susceptible to mutations, some of which can lead to cancer. Machine learning and deep learning methods have emerged as vital tools in identifying mutations associated with cancer. Thyroid cancer ranks as the 5th most prevalent cancer in the USA, with thousands diagnosed annually. This paper presents an ensemble learning model leveraging deep learning techniques such as Long Short-Term Memory (LSTM), Gated Recurrent Units (GRUs), and Bi-directional LSTM (Bi-LSTM) to detect thyroid cancer mutations early. The model is trained on a dataset sourced from asia.ensembl.org and IntOGen.org, consisting of 633 samples with 969 mutations across 41 genes, collected from individuals of various demographics. Feature extraction encompasses techniques including Hahn moments, central moments, raw moments, and various matrix-based methods. Evaluation employs three testing methods: self-consistency test (SCT), independent set test (IST), and 10-fold cross-validation test (10-FCVT). The proposed ensemble learning model demonstrates promising performance, achieving 96% accuracy in the independent set test (IST). Statistical measures such as training accuracy, testing accuracy, recall, sensitivity, specificity, Mathew's Correlation Coefficient (MCC), loss, training accuracy, F1 Score, and Cohen's kappa are utilized for comprehensive evaluation.
Collapse
Affiliation(s)
- Asghar Ali Shah
- Center of Excellence in Artificial Intelligence (CoE-AI), Department of Computer Science, Bahria University, Islamabad, 04408, Pakistan
| | - Ali Daud
- Faculty of Resilience, Rabdan Academy, Abu Dhabi, United Arab Emirates.
| | - Amal Bukhari
- Department of Information Systems and Technology, Collage of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
| | - Bader Alshemaimri
- Software Engineering Department, College of Computing and Information Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Muhammad Ahsan
- Department of Computer Science, University of Alabama at Birmingham, 1402 10th Avenue S, Birmingham, AL, 35294, USA
| | - Rehmana Younis
- College of Letters and Sciences, Graduate Student of Robotics Engineering, Columbus State University, Columbus, USA
| |
Collapse
|
3
|
Sant VR, Radhachandran A, Ivezic V, Lee DT, Livhits MJ, Wu JX, Masamed R, Arnold CW, Yeh MW, Speier W. From Bench-to-Bedside: How Artificial Intelligence is Changing Thyroid Nodule Diagnostics, a Systematic Review. J Clin Endocrinol Metab 2024; 109:1684-1693. [PMID: 38679750 PMCID: PMC11180510 DOI: 10.1210/clinem/dgae277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/04/2024] [Accepted: 04/16/2024] [Indexed: 05/01/2024]
Abstract
CONTEXT Use of artificial intelligence (AI) to predict clinical outcomes in thyroid nodule diagnostics has grown exponentially over the past decade. The greatest challenge is in understanding the best model to apply to one's own patient population, and how to operationalize such a model in practice. EVIDENCE ACQUISITION A literature search of PubMed and IEEE Xplore was conducted for English-language publications between January 1, 2015 and January 1, 2023, studying diagnostic tests on suspected thyroid nodules that used AI. We excluded articles without prospective or external validation, nonprimary literature, duplicates, focused on nonnodular thyroid conditions, not using AI, and those incidentally using AI in support of an experimental diagnostic outside standard clinical practice. Quality was graded by Oxford level of evidence. EVIDENCE SYNTHESIS A total of 61 studies were identified; all performed external validation, 16 studies were prospective, and 33 compared a model to physician prediction of ground truth. Statistical validation was reported in 50 papers. A diagnostic pipeline was abstracted, yielding 5 high-level outcomes: (1) nodule localization, (2) ultrasound (US) risk score, (3) molecular status, (4) malignancy, and (5) long-term prognosis. Seven prospective studies validated a single commercial AI; strengths included automating nodule feature assessment from US and assisting the physician in predicting malignancy risk, while weaknesses included automated margin prediction and interobserver variability. CONCLUSION Models predominantly used US images to predict malignancy. Of 4 Food and Drug Administration-approved products, only S-Detect was extensively validated. Implementing an AI model locally requires data sanitization and revalidation to ensure appropriate clinical performance.
Collapse
Affiliation(s)
- Vivek R Sant
- Division of Endocrine Surgery, UT Southwestern Medical Center, Dallas, TX 75390, USA
| | - Ashwath Radhachandran
- Biomedical Artificial Intelligence Research Lab, UCLA Department of Bioengineering, Los Angeles, CA 90024, USA
| | - Vedrana Ivezic
- Biomedical Artificial Intelligence Research Lab, UCLA Department of Bioengineering, Los Angeles, CA 90024, USA
| | - Denise T Lee
- Department of Surgery, Icahn School of Medicine at Mount Sinai Hospital, New York, NY 10029, USA
| | - Masha J Livhits
- Section of Endocrine Surgery, UCLA David Geffen School of Medicine, Los Angeles, CA 90095, USA
| | - James X Wu
- Section of Endocrine Surgery, UCLA David Geffen School of Medicine, Los Angeles, CA 90095, USA
| | - Rinat Masamed
- Department of Radiology, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Corey W Arnold
- Biomedical Artificial Intelligence Research Lab, UCLA Department of Bioengineering, Los Angeles, CA 90024, USA
| | - Michael W Yeh
- Section of Endocrine Surgery, UCLA David Geffen School of Medicine, Los Angeles, CA 90095, USA
| | - William Speier
- Biomedical Artificial Intelligence Research Lab, UCLA Department of Bioengineering, Los Angeles, CA 90024, USA
| |
Collapse
|
4
|
Chen JH, Zhang YQ, Zhu TT, Zhang Q, Zhao AX, Huang Y. Applying machine-learning models to differentiate benign and malignant thyroid nodules classified as C-TIRADS 4 based on 2D-ultrasound combined with five contrast-enhanced ultrasound key frames. Front Endocrinol (Lausanne) 2024; 15:1299686. [PMID: 38633756 PMCID: PMC11021584 DOI: 10.3389/fendo.2024.1299686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 03/21/2024] [Indexed: 04/19/2024] Open
Abstract
Objectives To apply machine learning to extract radiomics features from thyroid two-dimensional ultrasound (2D-US) combined with contrast-enhanced ultrasound (CEUS) images to classify and predict benign and malignant thyroid nodules, classified according to the Chinese version of the thyroid imaging reporting and data system (C-TIRADS) as category 4. Materials and methods This retrospective study included 313 pathologically diagnosed thyroid nodules (203 malignant and 110 benign). Two 2D-US images and five CEUS key frames ("2nd second after the arrival time" frame, "time to peak" frame, "2nd second after peak" frame, "first-flash" frame, and "second-flash" frame) were selected to manually label the region of interest using the "Labelme" tool. A total of 7 images of each nodule and their annotates were imported into the Darwin Research Platform for radiomics analysis. The datasets were randomly split into training and test cohorts in a 9:1 ratio. Six classifiers, namely, support vector machine, logistic regression, decision tree, random forest (RF), gradient boosting decision tree and extreme gradient boosting, were used to construct and test the models. Performance was evaluated using a receiver operating characteristic curve analysis. The area under the curve (AUC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy (ACC), and F1-score were calculated. One junior radiologist and one senior radiologist reviewed the 2D-US image and CEUS videos of each nodule and made a diagnosis. We then compared their AUC and ACC with those of our best model. Results The AUC of the diagnosis of US, CEUS and US combined CEUS by junior radiologist and senior radiologist were 0.755, 0.750, 0.784, 0.800, 0.873, 0.890, respectively. The RF classifier performed better than the other five, with an AUC of 1 for the training cohort and 0.94 (95% confidence interval 0.88-1) for the test cohort. The sensitivity, specificity, accuracy, PPV, NPV, and F1-score of the RF model in the test cohort were 0.82, 0.93, 0.90, 0.85, 0.92, and 0.84, respectively. The RF model with 2D-US combined with CEUS key frames achieved equivalent performance as the senior radiologist (AUC: 0.94 vs. 0.92, P = 0.798; ACC: 0.90 vs. 0.92) and outperformed the junior radiologist (AUC: 0.94 vs. 0.80, P = 0.039, ACC: 0.90 vs. 0.81) in the test cohort. Conclusions Our model, based on 2D-US and CEUS key frames radiomics features, had good diagnostic efficacy for thyroid nodules, which are classified as C-TIRADS 4. It shows promising potential in assisting less experienced junior radiologists.
Collapse
Affiliation(s)
| | | | | | | | | | - Ying Huang
- Department of Ultrasound, Shengjing Hospital of China Medical University, Shenyang, China
| |
Collapse
|
5
|
Chen C, Jiang Y, Yao J, Lai M, Liu Y, Jiang X, Ou D, Feng B, Zhou L, Xu J, Wu L, Zhou Y, Yue W, Dong F, Xu D. Deep learning to assist composition classification and thyroid solid nodule diagnosis: a multicenter diagnostic study. Eur Radiol 2024; 34:2323-2333. [PMID: 37819276 DOI: 10.1007/s00330-023-10269-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 08/01/2023] [Accepted: 08/07/2023] [Indexed: 10/13/2023]
Abstract
OBJECTIVES This study aimed to propose a deep learning (DL)-based framework for identifying the composition of thyroid nodules and assessing their malignancy risk. METHODS We conducted a retrospective multicenter study using ultrasound images from four hospitals. Convolutional neural network (CNN) models were constructed to classify ultrasound images of thyroid nodules into solid and non-solid, as well as benign and malignant. A total of 11,201 images of 6784 nodules were used for training, validation, and testing. The area under the receiver-operating characteristic curve (AUC) was employed as the primary evaluation index. RESULTS The models had AUCs higher than 0.91 in the benign and malignant grading of solid thyroid nodules, with the Inception-ResNet AUC being the highest at 0.94. In the test set, the best algorithm for identifying benign and malignant thyroid nodules had a sensitivity of 0.88, and a specificity of 0.86. In the human vs. DL test set, the best algorithm had a sensitivity of 0.93, and a specificity of 0.86. The Inception-ResNet model performed better than the senior physicians (p < 0.001). The sensitivity and specificity of the optimal model based on the external test set were 0.90 and 0.75, respectively. CONCLUSIONS This research demonstrates that CNNs can assist thyroid nodule diagnosis and reduce the rate of unnecessary fine-needle aspiration (FNA). CLINICAL RELEVANCE STATEMENT High-resolution ultrasound has led to increased detection of thyroid nodules. This results in unnecessary fine-needle aspiration and anxiety for patients whose nodules are benign. Deep learning can solve these problems to some extent. KEY POINTS • Thyroid solid nodules have a high probability of malignancy. • Our models can improve the differentiation between benign and malignant solid thyroid nodules. • The differential performance of one model was superior to that of senior radiologists. Applying this could reduce the rate of unnecessary fine-needle aspiration of solid thyroid nodules.
Collapse
Affiliation(s)
- Chen Chen
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China
- Wenling Big Data and Artificial Intelligence Institute in Medicine, Taizhou, 317502, China
- Taizhou Key Laboratory of Minimally Invasive Interventional Therapy & Artificial Intelligence, Taizhou Campus of Zhejiang Cancer Hospital (Taizhou Cancer Hospital), Taizhou, 317502, China
| | - Yitao Jiang
- Illuminate, LLC, Shenzhen, Guangdong, 518000, China
| | - Jincao Yao
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China
- Key Laboratory of Head & Neck Cancer, Translational Research of Zhejiang Province, Hangzhou, 310022, China
- Zhejiang Provincial Research Center for Cancer Intelligent Diagnosis and Molecular Technology, Hangzhou, 310022, China
| | - Min Lai
- The Second Clinical Medical College, Zhejiang Chinese Medical University, Hangzhou, 310053, China
| | - Yuanzhen Liu
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China
- Wenling Big Data and Artificial Intelligence Institute in Medicine, Taizhou, 317502, China
- Taizhou Key Laboratory of Minimally Invasive Interventional Therapy & Artificial Intelligence, Taizhou Campus of Zhejiang Cancer Hospital (Taizhou Cancer Hospital), Taizhou, 317502, China
| | - Xianping Jiang
- Department of Ultrasound, Shengzhou People's Hospital (the First Affiliated Hospital of Zhejiang University Shengzhou Branch), Shengzhou, 312400, China
| | - Di Ou
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China
- Key Laboratory of Head & Neck Cancer, Translational Research of Zhejiang Province, Hangzhou, 310022, China
- Zhejiang Provincial Research Center for Cancer Intelligent Diagnosis and Molecular Technology, Hangzhou, 310022, China
| | - Bojian Feng
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China
- Wenling Big Data and Artificial Intelligence Institute in Medicine, Taizhou, 317502, China
- Taizhou Key Laboratory of Minimally Invasive Interventional Therapy & Artificial Intelligence, Taizhou Campus of Zhejiang Cancer Hospital (Taizhou Cancer Hospital), Taizhou, 317502, China
| | - Lingyan Zhou
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China
- Key Laboratory of Head & Neck Cancer, Translational Research of Zhejiang Province, Hangzhou, 310022, China
- Zhejiang Provincial Research Center for Cancer Intelligent Diagnosis and Molecular Technology, Hangzhou, 310022, China
| | - Jinfeng Xu
- Department of Ultrasound, The Second Clinical Medical College, Jinan University, The First Affiliated Hospital, Southern University of Science and Technology, Shenzhen People's Hospital, Shenzhen, 518020, China
| | - Linghu Wu
- Department of Ultrasound, The Second Clinical Medical College, Jinan University, The First Affiliated Hospital, Southern University of Science and Technology, Shenzhen People's Hospital, Shenzhen, 518020, China
| | - Yuli Zhou
- Department of Ultrasound, The Second Clinical Medical College, Jinan University, The First Affiliated Hospital, Southern University of Science and Technology, Shenzhen People's Hospital, Shenzhen, 518020, China
| | - Wenwen Yue
- Center of Minimally Invasive Treatment for Tumor, Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, 200072, China.
| | - Fajin Dong
- Department of Ultrasound, The Second Clinical Medical College, Jinan University, The First Affiliated Hospital, Southern University of Science and Technology, Shenzhen People's Hospital, Shenzhen, 518020, China.
| | - Dong Xu
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, 310022, China.
- Wenling Big Data and Artificial Intelligence Institute in Medicine, Taizhou, 317502, China.
- Taizhou Key Laboratory of Minimally Invasive Interventional Therapy & Artificial Intelligence, Taizhou Campus of Zhejiang Cancer Hospital (Taizhou Cancer Hospital), Taizhou, 317502, China.
| |
Collapse
|
6
|
Barinov L, Jairaj A, Middleton WD, D M, Beland, Kirsch J, Filice RW, Reverter JL, Arguelles I, Grant EG. Improving the Efficacy of ACR TI-RADS Through Deep Learning-Based Descriptor Augmentation. J Digit Imaging 2023; 36:2392-2401. [PMID: 37580483 PMCID: PMC10584788 DOI: 10.1007/s10278-023-00884-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 07/10/2023] [Accepted: 07/11/2023] [Indexed: 08/16/2023] Open
Abstract
Thyroid nodules occur in up to 68% of people, 95% of which are benign. Of the 5% of malignant nodules, many would not result in symptoms or death, yet 600,000 FNAs are still performed annually, with a PPV of 5-7% (up to 30%). Artificial intelligence (AI) systems have the capacity to improve diagnostic accuracy and workflow efficiency when integrated into clinical decision pathways. Previous studies have evaluated AI systems against physicians, whereas we aim to compare the benefits of incorporating AI into their final diagnostic decision. This work analyzed the potential for artificial intelligence (AI)-based decision support systems to improve physician accuracy, variability, and efficiency. The decision support system (DSS) assessed was Koios DS, which provides automated sonographic nodule descriptor predictions and a direct cancer risk assessment aligned to ACR TI-RADS. The study was conducted retrospectively between (08/2020) and (10/2020). The set of cases used included 650 patients (21% male, 79% female) of age 53 ± 15. Fifteen physicians assessed each of the cases in the set, both unassisted and aided by the DSS. The order of the reading condition was randomized, and reading blocks were separated by a period of 4 weeks. The system's impact on reader accuracy was measured by comparing the area under the ROC curve (AUC), sensitivity, and specificity of readers with and without the DSS with FNA as ground truth. The impact on reader variability was evaluated using Pearson's correlation coefficient. The impact on efficiency was determined by comparing the average time per read. There was a statistically significant increase in average AUC of 0.083 [0.066, 0.099] and an increase in sensitivity and specificity of 8.4% [5.4%, 11.3%] and 14% [12.5%, 15.5%], respectively, when aided by Koios DS. The average time per case decreased by 23.6% (p = 0.00017), and the observed Pearson's correlation coefficient increased from r = 0.622 to r = 0.876 when aided by Koios DS. These results indicate that providing physicians with automated clinical decision support significantly improved diagnostic accuracy, as measured by AUC, sensitivity, and specificity, and reduced inter-reader variability and interpretation times.
Collapse
Affiliation(s)
- Lev Barinov
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA.
- Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY, 10065, USA.
| | | | | | | | - Beland
- Warren Alpert Medical School, Providence, RI, USA
| | | | - Ross W Filice
- MedStar Georgetown University Hospital, Washington, DC, USA
| | | | | | | |
Collapse
|
7
|
Xu J, Xu HL, Cao YN, Huang Y, Gao S, Wu QJ, Gong TT. The performance of deep learning on thyroid nodule imaging predicts thyroid cancer: A systematic review and meta-analysis of epidemiological studies with independent external test sets. Diabetes Metab Syndr 2023; 17:102891. [PMID: 37907027 DOI: 10.1016/j.dsx.2023.102891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 10/06/2023] [Accepted: 10/15/2023] [Indexed: 11/02/2023]
Abstract
BACKGROUND AND AIMS It is still controversial whether deep learning (DL) systems add accuracy to thyroid nodule imaging classification based on the recent available evidence. We conducted this study to analyze the current evidence of DL in thyroid nodule imaging diagnosis in both internal and external test sets. METHODS Until the end of December 2022, PubMed, IEEE, Embase, Web of Science, and the Cochrane Library were searched. We included primary epidemiological studies using externally validated DL techniques in image-based thyroid nodule appraisal. This systematic review was registered on PROSPERO (CRD42022362892). RESULTS We evaluated evidence from 17 primary epidemiological studies using externally validated DL techniques in image-based thyroid nodule appraisal. Fourteen studies were deemed eligible for meta-analysis. The pooled sensitivity, specificity, and area under the curve (AUC) of these DL algorithms were 0.89 (95% confidence interval 0.87-0.90), 0.84 (0.82-0.86), and 0.93 (0.91-0.95), respectively. For the internal validation set, the pooled sensitivity, specificity, and AUC were 0.91 (0.89-0.93), 0.88 (0.85-0.91), and 0.96 (0.93-0.97), respectively. In the external validation set, the pooled sensitivity, specificity, and AUC were 0.87 (0.85-0.89), 0.81 (0.77-0.83), and 0.91 (0.88-0.93), respectively. Notably, in subgroup analyses, DL algorithms still demonstrated exceptional diagnostic validity. CONCLUSIONS Current evidence suggests DL-based imaging shows diagnostic performances comparable to clinicians for differentiating thyroid nodules in both the internal and external test sets.
Collapse
Affiliation(s)
- Jin Xu
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - He-Li Xu
- Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Yi-Ning Cao
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China; Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Ying Huang
- Department of Ultrasound, Shengjing Hospital of China Medical University, Shenyang, China
| | - Song Gao
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Qi-Jun Wu
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China; Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China; Key Laboratory of Reproductive and Genetic Medicine (China Medical University), National Health Commission, Shenyang, China.
| | - Ting-Ting Gong
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China.
| |
Collapse
|
8
|
Cheng PC, Chiang HHK. Diagnosis of Salivary Gland Tumors Using Transfer Learning with Fine-Tuning and Gradual Unfreezing. Diagnostics (Basel) 2023; 13:3333. [PMID: 37958229 PMCID: PMC10648910 DOI: 10.3390/diagnostics13213333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/25/2023] [Accepted: 10/27/2023] [Indexed: 11/15/2023] Open
Abstract
Ultrasound is the primary tool for evaluating salivary gland tumors (SGTs); however, tumor diagnosis currently relies on subjective features. This study aimed to establish an objective ultrasound diagnostic method using deep learning. We collected 446 benign and 223 malignant SGT ultrasound images in the training/validation set and 119 benign and 44 malignant SGT ultrasound images in the testing set. We trained convolutional neural network (CNN) models from scratch and employed transfer learning (TL) with fine-tuning and gradual unfreezing to classify malignant and benign SGTs. The diagnostic performances of these models were compared. By utilizing the pretrained ResNet50V2 with fine-tuning and gradual unfreezing, we achieved a 5-fold average validation accuracy of 0.920. The diagnostic performance on the testing set demonstrated an accuracy of 89.0%, a sensitivity of 81.8%, a specificity of 91.6%, a positive predictive value of 78.3%, and a negative predictive value of 93.2%. This performance surpasses that of other models in our study. The corresponding Grad-CAM visualizations were also presented to provide explanations for the diagnosis. This study presents an effective and objective ultrasound method for distinguishing between malignant and benign SGTs, which could assist in preoperative evaluation.
Collapse
Affiliation(s)
- Ping-Chia Cheng
- Department of Biomedical Engineering, National Yang Ming Chiao Tung University, Taipei 11221, Taiwan;
- Department of Otolaryngology Head and Neck Surgery, Far Eastern Memorial Hospital, New Taipei City 22060, Taiwan
- Department of Communication Engineering, Asia Eastern University of Science and Technology, New Taipei City 22060, Taiwan
| | - Hui-Hua Kenny Chiang
- Department of Biomedical Engineering, National Yang Ming Chiao Tung University, Taipei 11221, Taiwan;
| |
Collapse
|
9
|
Liang J, Pang T, Liu W, Li X, Huang L, Gong X, Diao X. Comparison of six machine learning methods for differentiating benign and malignant thyroid nodules using ultrasonographic characteristics. BMC Med Imaging 2023; 23:154. [PMID: 37828438 PMCID: PMC10571314 DOI: 10.1186/s12880-023-01117-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Accepted: 10/02/2023] [Indexed: 10/14/2023] Open
Abstract
BACKGROUND Several machine learning (ML) classifiers for thyroid nodule diagnosis have been compared in terms of their accuracy, sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and area under the receiver operating curve (AUC). A total of 525 patients with thyroid nodules (malignant, n = 228; benign, n = 297) underwent conventional ultrasonography, strain elastography, and contrast-enhanced ultrasound. Six algorithms were compared: support vector machine (SVM), linear discriminant analysis (LDA), random forest (RF), logistic regression (LG), GlmNet, and K-nearest neighbors (K-NN). The diagnostic performances of the 13 suspicious sonographic features for discriminating benign and malignant thyroid nodules were assessed using different ML algorithms. To compare these algorithms, a 10-fold cross-validation paired t-test was applied to the algorithm performance differences. RESULTS The logistic regression algorithm had better diagnostic performance than the other ML algorithms. However, it was only slightly higher than those of GlmNet, LDA, and RF. The accuracy, sensitivity, specificity, NPV, PPV, and AUC obtained by running logistic regression were 86.48%, 83.33%, 88.89%, 87.42%, 85.20%, and 92.84%, respectively. CONCLUSIONS The experimental results indicate that GlmNet, SVM, LDA, LG, K-NN, and RF exhibit slight differences in classification performance.
Collapse
Affiliation(s)
- Jianguang Liang
- School of Pharmacy & School of Biological and Food Engineering, Changzhou University, Changzhou, Jiangsu, 213164, China.
| | - Tiantian Pang
- Health Science Center, Shenzhen University, Shenzhen, 518060, China
- School of Biomedical Engineering, Shenzhen University, Shenzhen, 518060, China
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, Shenzhen, 518060, China
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Shenzhen, 518060, China
- College of Computer Science and Technology, Jilin University, Changchun, 130012, China
| | - Weixiang Liu
- Health Science Center, Shenzhen University, Shenzhen, 518060, China
- School of Biomedical Engineering, Shenzhen University, Shenzhen, 518060, China
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, Shenzhen, 518060, China
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Shenzhen, 518060, China
| | - Xiaogang Li
- Health Science Center, Shenzhen University, Shenzhen, 518060, China
- School of Biomedical Engineering, Shenzhen University, Shenzhen, 518060, China
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, Shenzhen, 518060, China
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Shenzhen, 518060, China
| | - Leidan Huang
- Guangzhou Medical University, Guangzhou, 510182, China
- Department of Ultrasound, First Affiliated Hospital of Shenzhen University, Second People's Hospital of Shenzhen, Shenzhen, 518035, China
| | - Xuehao Gong
- Department of Ultrasound, First Affiliated Hospital of Shenzhen University, Second People's Hospital of Shenzhen, Shenzhen, 518035, China.
| | - Xianfen Diao
- Health Science Center, Shenzhen University, Shenzhen, 518060, China.
- School of Biomedical Engineering, Shenzhen University, Shenzhen, 518060, China.
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, Shenzhen, 518060, China.
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Shenzhen, 518060, China.
| |
Collapse
|
10
|
Huang X, Chen X, Zhong X, Tian T. The CNN model aided the study of the clinical value hidden in the implant images. J Appl Clin Med Phys 2023; 24:e14141. [PMID: 37656066 PMCID: PMC10562019 DOI: 10.1002/acm2.14141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 08/14/2023] [Accepted: 08/16/2023] [Indexed: 09/02/2023] Open
Abstract
PURPOSE This article aims to construct a new method to evaluate radiographic image identification results based on artificial intelligence, which can complement the limited vision of researchers when studying the effect of various factors on clinical implantation outcomes. METHODS We constructed a convolutional neural network (CNN) model using the clinical implant radiographic images. Moreover, we used gradient-weighted class activation mapping (Grad-CAM) to obtain thermal maps to present identification differences before performing statistical analyses. Subsequently, to verify whether these differences presented by the Grad-CAM algorithm would be of value to clinical practices, we measured the bone thickness around the identified sites. Finally, we analyzed the influence of the implant type on the implantation according to the measurement results. RESULTS The thermal maps showed that the sites with significant differences between Straumann BL and Bicon implants as identified by the CNN model were mainly the thread and neck area. (2) The heights of the mesial, distal, buccal, and lingual bone of the Bicon implant post-op were greater than those of Straumann BL (P < 0.05). (3) Between the first and second stages of surgery, the amount of bone thickness variation at the buccal and lingual sides of the Bicon implant platform was greater than that of the Straumann BL implant (P < 0.05). CONCLUSION According to the results of this study, we found that the identified-neck-area of the Bicon implant was placed deeper than the Straumann BL implant, and there was more bone resorption on the buccal and lingual sides at the Bicon implant platform between the first and second stages of surgery. In summary, this study proves that using the CNN classification model can identify differences that complement our limited vision.
Collapse
Affiliation(s)
- Xinxu Huang
- State Key Laboratory of Oral DiseasesNational Clinical Research Center for Oral DiseasesWest China Hospital of StomatologySichuan UniversityChengduChina
| | - Xingyu Chen
- State Key Laboratory of Oral DiseasesNational Clinical Research Center for Oral DiseasesWest China Hospital of StomatologySichuan UniversityChengduChina
| | - Xinnan Zhong
- State Key Laboratory of Oral DiseasesNational Clinical Research Center for Oral DiseasesWest China Hospital of StomatologySichuan UniversityChengduChina
| | - Taoran Tian
- State Key Laboratory of Oral DiseasesNational Clinical Research Center for Oral DiseasesWest China Hospital of StomatologySichuan UniversityChengduChina
| |
Collapse
|
11
|
Nakrosis A, Paulauskaite-Taraseviciene A, Raudonis V, Narusis I, Gruzauskas V, Gruzauskas R, Lagzdinyte-Budnike I. Towards Early Poultry Health Prediction through Non-Invasive and Computer Vision-Based Dropping Classification. Animals (Basel) 2023; 13:3041. [PMID: 37835647 PMCID: PMC10571708 DOI: 10.3390/ani13193041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 09/17/2023] [Accepted: 09/25/2023] [Indexed: 10/15/2023] Open
Abstract
The use of artificial intelligence techniques with advanced computer vision techniques offers great potential for non-invasive health assessments in the poultry industry. Evaluating the condition of poultry by monitoring their droppings can be highly valuable as significant changes in consistency and color can be indicators of serious and infectious diseases. While most studies have prioritized the classification of droppings into two categories (normal and abnormal), with some relevant studies dealing with up to five categories, this investigation goes a step further by employing image processing algorithms to categorize droppings into six classes, based on visual information indicating some level of abnormality. To ensure a diverse dataset, data were collected in three different poultry farms in Lithuania by capturing droppings on different types of litter. With the implementation of deep learning, the object detection rate reached 92.41% accuracy. A range of machine learning algorithms, including different deep learning architectures, has been explored and, based on the obtained results, we have proposed a comprehensive solution by combining different models for segmentation and classification purposes. The results revealed that the segmentation task achieved the highest accuracy of 0.88 in terms of the Dice coefficient employing the K-means algorithm. Meanwhile, YOLOv5 demonstrated the highest classification accuracy, achieving an ACC of 91.78%.
Collapse
Affiliation(s)
- Arnas Nakrosis
- Faculty of Informatics, Kaunas University of Technology, Studentu 50, 51368 Kaunas, Lithuania; (A.N.); (I.N.); (I.L.-B.)
| | - Agne Paulauskaite-Taraseviciene
- Faculty of Informatics, Kaunas University of Technology, Studentu 50, 51368 Kaunas, Lithuania; (A.N.); (I.N.); (I.L.-B.)
- Artificial Intelligence Centre, Kaunas University of Technology, K. Barsausko 59, 51423 Kaunas, Lithuania; (V.R.); (V.G.); (R.G.)
| | - Vidas Raudonis
- Artificial Intelligence Centre, Kaunas University of Technology, K. Barsausko 59, 51423 Kaunas, Lithuania; (V.R.); (V.G.); (R.G.)
- Faculty of Electrical and Electronics, Kaunas University of Technology, Studentu 48, 51367 Kaunas, Lithuania
| | - Ignas Narusis
- Faculty of Informatics, Kaunas University of Technology, Studentu 50, 51368 Kaunas, Lithuania; (A.N.); (I.N.); (I.L.-B.)
| | - Valentas Gruzauskas
- Artificial Intelligence Centre, Kaunas University of Technology, K. Barsausko 59, 51423 Kaunas, Lithuania; (V.R.); (V.G.); (R.G.)
- Institute of Computer Science, Vilnius University, 08303 Vilnius, Lithuania
| | - Romas Gruzauskas
- Artificial Intelligence Centre, Kaunas University of Technology, K. Barsausko 59, 51423 Kaunas, Lithuania; (V.R.); (V.G.); (R.G.)
| | - Ingrida Lagzdinyte-Budnike
- Faculty of Informatics, Kaunas University of Technology, Studentu 50, 51368 Kaunas, Lithuania; (A.N.); (I.N.); (I.L.-B.)
| |
Collapse
|
12
|
Toro-Tobon D, Loor-Torres R, Duran M, Fan JW, Singh Ospina N, Wu Y, Brito JP. Artificial Intelligence in Thyroidology: A Narrative Review of the Current Applications, Associated Challenges, and Future Directions. Thyroid 2023; 33:903-917. [PMID: 37279303 PMCID: PMC10440669 DOI: 10.1089/thy.2023.0132] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Background: The use of artificial intelligence (AI) in health care has grown exponentially with the promise of facilitating biomedical research and enhancing diagnosis, treatment, monitoring, disease prevention, and health care delivery. We aim to examine the current state, limitations, and future directions of AI in thyroidology. Summary: AI has been explored in thyroidology since the 1990s, and currently, there is an increasing interest in applying AI to improve the care of patients with thyroid nodules (TNODs), thyroid cancer, and functional or autoimmune thyroid disease. These applications aim to automate processes, improve the accuracy and consistency of diagnosis, personalize treatment, decrease the burden for health care professionals, improve access to specialized care in areas lacking expertise, deepen the understanding of subtle pathophysiologic patterns, and accelerate the learning curve of less experienced clinicians. There are promising results for many of these applications. Yet, most are in the validation or early clinical evaluation stages. Only a few are currently adopted for risk stratification of TNODs by ultrasound and determination of the malignant nature of indeterminate TNODs by molecular testing. Challenges of the currently available AI applications include the lack of prospective and multicenter validations and utility studies, small and low diversity of training data sets, differences in data sources, lack of explainability, unclear clinical impact, inadequate stakeholder engagement, and inability to use outside of the research setting, which might limit the value of their future adoption. Conclusions: AI has the potential to improve many aspects of thyroidology; however, addressing the limitations affecting the suitability of AI interventions in thyroidology is a prerequisite to ensure that AI provides added value for patients with thyroid disease.
Collapse
Affiliation(s)
- David Toro-Tobon
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Ricardo Loor-Torres
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Mayra Duran
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Jungwei W. Fan
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota, USA
| | - Naykky Singh Ospina
- Division of Endocrinology, Department of Medicine, University of Florida, Gainesville, Florida, USA
| | - Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Juan P. Brito
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| |
Collapse
|
13
|
Zhang M, Xue M, Li S, Zou Y, Zhu Q. Fusion deep learning approach combining diffuse optical tomography and ultrasound for improving breast cancer classification. BIOMEDICAL OPTICS EXPRESS 2023; 14:1636-1646. [PMID: 37078047 PMCID: PMC10110311 DOI: 10.1364/boe.486292] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 02/25/2023] [Accepted: 03/04/2023] [Indexed: 05/03/2023]
Abstract
Diffuse optical tomography (DOT) is a promising technique that provides functional information related to tumor angiogenesis. However, reconstructing the DOT function map of a breast lesion is an ill-posed and underdetermined inverse process. A co-registered ultrasound (US) system that provides structural information about the breast lesion can improve the localization and accuracy of DOT reconstruction. Additionally, the well-known US characteristics of benign and malignant breast lesions can further improve cancer diagnosis based on DOT alone. Inspired by a fusion model deep learning approach, we combined US features extracted by a modified VGG-11 network with images reconstructed from a DOT deep learning auto-encoder-based model to form a new neural network for breast cancer diagnosis. The combined neural network model was trained with simulation data and fine-tuned with clinical data: it achieved an AUC of 0.931 (95% CI: 0.919-0.943), superior to those achieved using US images alone (0.860) or DOT images alone (0.842).
Collapse
Affiliation(s)
- Menghao Zhang
- Electrical and System Engineering Department, Washington University in St. Louis, 1 Brooking Dr, St. Louis, MO 63130, USA
| | - Minghao Xue
- Biomedical Engineering Department, Washington University in St. Louis, 1 Brooking Dr, St. Louis, MO 63130, USA
| | - Shuying Li
- Biomedical Engineering Department, Washington University in St. Louis, 1 Brooking Dr, St. Louis, MO 63130, USA
| | - Yun Zou
- Biomedical Engineering Department, Washington University in St. Louis, 1 Brooking Dr, St. Louis, MO 63130, USA
| | - Quing Zhu
- Electrical and System Engineering Department, Washington University in St. Louis, 1 Brooking Dr, St. Louis, MO 63130, USA
- Biomedical Engineering Department, Washington University in St. Louis, 1 Brooking Dr, St. Louis, MO 63130, USA
| |
Collapse
|
14
|
Aboudi N, Khachnaoui H, Moussa O, Khlifa N. Bilinear Pooling for Thyroid Nodule Classification in Ultrasound Imaging. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2023. [DOI: 10.1007/s13369-023-07674-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
|
15
|
Lee JH, Kim YG, Ahn Y, Park S, Kong HJ, Choi JY, Kim K, Nam IC, Lee MC, Masuoka H, Miyauchi A, Kim S, Kim YA, Choe EK, Chai YJ. Investigation of optimal convolutional neural network conditions for thyroid ultrasound image analysis. Sci Rep 2023; 13:1360. [PMID: 36693894 PMCID: PMC9873643 DOI: 10.1038/s41598-023-28001-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 01/11/2023] [Indexed: 01/26/2023] Open
Abstract
Neural network models have been used to analyze thyroid ultrasound (US) images and stratify malignancy risk of the thyroid nodules. We investigated the optimal neural network condition for thyroid US image analysis. We compared scratch and transfer learning models, performed stress tests in 10% increments, and compared the performance of three threshold values. All validation results indicated superiority of the transfer learning model over the scratch model. Stress test indicated that training the algorithm using 3902 images (70%) resulted in a performance which was similar to the full dataset (5575). Threshold 0.3 yielded high sensitivity (1% false negative) and low specificity (72% false positive), while 0.7 gave low sensitivity (22% false negative) and high specificity (23% false positive). Here we showed that transfer learning was more effective than scratch learning in terms of area under curve, sensitivity, specificity and negative/positive predictive value, that about 3900 images were minimally required to demonstrate an acceptable performance, and that algorithm performance can be customized according to the population characteristics by adjusting threshold value.
Collapse
Affiliation(s)
- Joon-Hyop Lee
- Department of Surgery, Gachon University College of Medicine, Gil Medical Center, Inchon, Korea
| | - Young-Gon Kim
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea
| | - Youngbin Ahn
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea
| | - Seyeon Park
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea
| | - Hyoun-Joong Kong
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea
| | - June Young Choi
- Department of Surgery, Seoul National University Bundang Hospital, Seongnam-si, Gyeonggi-do, Korea
| | - Kwangsoon Kim
- Department of Surgery, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Inn-Chul Nam
- Department of Otolaryngology-Head and Neck Surgery, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Myung-Chul Lee
- Department of Otorhinolaryngology-Head and Neck Surgery, Korea Cancer Center Hospital, Korea Institute of Radiological and Medical Science, Seoul, Korea
| | | | | | - Sungwan Kim
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea
| | - Young A Kim
- Department of Pathology, Seoul Metropolitan Government Seoul National University Boramae Medical Center, Seoul, Korea
| | - Eun Kyung Choe
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea. .,Department of Surgery, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea.
| | - Young Jun Chai
- Transdisciplinary Department of Medicine and Advanced Technology, Seoul National University Hospital, Seoul, Korea. .,Department of Surgery, Seoul Metropolitan Government, Seoul National University Boramae Medical Center, 20 Boramaep-ro 5-gil, Dongjak-gu, Seoul, 07061, Korea.
| |
Collapse
|
16
|
The Use of Artificial Intelligence in the Diagnosis and Classification of Thyroid Nodules: An Update. Cancers (Basel) 2023; 15:cancers15030708. [PMID: 36765671 PMCID: PMC9913834 DOI: 10.3390/cancers15030708] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 01/20/2023] [Accepted: 01/20/2023] [Indexed: 01/27/2023] Open
Abstract
The incidence of thyroid nodules diagnosed is increasing every year, leading to a greater risk of unnecessary procedures being performed or wrong diagnoses being made. In our paper, we present the latest knowledge on the use of artificial intelligence in diagnosing and classifying thyroid nodules. We particularly focus on the usefulness of artificial intelligence in ultrasonography for the diagnosis and characterization of pathology, as these are the two most developed fields. In our search of the latest innovations, we reviewed only the latest publications of specific types published from 2018 to 2022. We analyzed 930 papers in total, from which we selected 33 that were the most relevant to the topic of our work. In conclusion, there is great scope for the use of artificial intelligence in future thyroid nodule classification and diagnosis. In addition to the most typical uses of artificial intelligence in cancer differentiation, we identified several other novel applications of artificial intelligence during our review.
Collapse
|
17
|
Zhu PS, Zhang YR, Ren JY, Li QL, Chen M, Sang T, Li WX, Li J, Cui XW. Ultrasound-based deep learning using the VGGNet model for the differentiation of benign and malignant thyroid nodules: A meta-analysis. Front Oncol 2022; 12:944859. [PMID: 36249056 PMCID: PMC9554631 DOI: 10.3389/fonc.2022.944859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Accepted: 08/19/2022] [Indexed: 12/13/2022] Open
Abstract
Objective The aim of this study was to evaluate the accuracy of deep learning using the convolutional neural network VGGNet model in distinguishing benign and malignant thyroid nodules based on ultrasound images. Methods Relevant studies were selected from PubMed, Embase, Cochrane Library, China National Knowledge Infrastructure (CNKI), and Wanfang databases, which used the deep learning-related convolutional neural network VGGNet model to classify benign and malignant thyroid nodules based on ultrasound images. Cytology and pathology were used as gold standards. Furthermore, reported eligibility and risk bias were assessed using the QUADAS-2 tool, and the diagnostic accuracy of deep learning VGGNet was analyzed with pooled sensitivity, pooled specificity, diagnostic odds ratio, and the area under the curve. Results A total of 11 studies were included in this meta-analysis. The overall estimates of sensitivity and specificity were 0.87 [95% CI (0.83, 0.91)] and 0.85 [95% CI (0.79, 0.90)], respectively. The diagnostic odds ratio was 38.79 [95% CI (22.49, 66.91)]. The area under the curve was 0.93 [95% CI (0.90, 0.95)]. No obvious publication bias was found. Conclusion Deep learning using the convolutional neural network VGGNet model based on ultrasound images performed good diagnostic efficacy in distinguishing benign and malignant thyroid nodules. Systematic Review Registration https://www.crd.york.ac.nk/prospero, identifier CRD42022336701.
Collapse
Affiliation(s)
- Pei-Shan Zhu
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China
| | - Yu-Rui Zhang
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China
| | - Jia-Yu Ren
- Department of Medical Ultrasound, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Qiao-Li Li
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China
| | - Ming Chen
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China
| | - Tian Sang
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China
| | - Wen-Xiao Li
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China
| | - Jun Li
- Department of Ultrasound, the First Affiliated Hospital of Medical College, Shihezi University, Shihezi, China,NHC Key Laboratory of Prevention and Treatment of Central Asia High Incidence Diseases, First Affiliated Hospital, School of Medicine, Shihezi University, Shihezi, China,*Correspondence: Jun Li, ; Xin-Wu Cui,
| | - Xin-Wu Cui
- Department of Medical Ultrasound, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,*Correspondence: Jun Li, ; Xin-Wu Cui,
| |
Collapse
|
18
|
Sorrenti S, Dolcetti V, Radzina M, Bellini MI, Frezza F, Munir K, Grani G, Durante C, D’Andrea V, David E, Calò PG, Lori E, Cantisani V. Artificial Intelligence for Thyroid Nodule Characterization: Where Are We Standing? Cancers (Basel) 2022; 14:cancers14143357. [PMID: 35884418 PMCID: PMC9315681 DOI: 10.3390/cancers14143357] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 06/24/2022] [Accepted: 07/08/2022] [Indexed: 12/12/2022] Open
Abstract
Simple Summary In the present review, an up-to-date summary of the state of the art of artificial intelligence (AI) implementation for thyroid nodule characterization and cancer is provided. The opinion on the real effectiveness of AI systems remains controversial. Taking into consideration the largest and most scientifically valid studies, it is possible to state that AI provides results that are comparable or inferior to expert ultrasound specialists and radiologists. Promising data approve AI as a support tool and simultaneously highlight the need for a radiologist supervisory framework for AI provided results. Therefore, current solutions might be more suitable for educational purposes. Abstract Machine learning (ML) is an interdisciplinary sector in the subset of artificial intelligence (AI) that creates systems to set up logical connections using algorithms, and thus offers predictions for complex data analysis. In the present review, an up-to-date summary of the current state of the art regarding ML and AI implementation for thyroid nodule ultrasound characterization and cancer is provided, highlighting controversies over AI application as well as possible benefits of ML, such as, for example, training purposes. There is evidence that AI increases diagnostic accuracy and significantly limits inter-observer variability by using standardized mathematical algorithms. It could also be of aid in practice settings with limited sub-specialty expertise, offering a second opinion by means of radiomics and computer-assisted diagnosis. The introduction of AI represents a revolutionary event in thyroid nodule evaluation, but key issues for further implementation include integration with radiologist expertise, impact on workflow and efficiency, and performance monitoring.
Collapse
Affiliation(s)
- Salvatore Sorrenti
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
| | - Vincenzo Dolcetti
- Department of Radiological, Anatomo-Pathological Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (V.D.); (V.C.)
| | - Maija Radzina
- Radiology Research Laboratory, Riga Stradins University, LV-1007 Riga, Latvia;
- Medical Faculty, University of Latvia, Diagnostic Radiology Institute, Paula Stradina Clinical University Hospital, LV-1007 Riga, Latvia
| | - Maria Irene Bellini
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
- Correspondence:
| | - Fabrizio Frezza
- Department of Information Engineering, Electronics and Telecommunications, “Sapienza” University of Rome, 00184 Rome, Italy; (F.F.); (K.M.)
- Consorzio Nazionale Interuniversitario per le Telecomunicazioni (CNIT), Viale G.P. Usberti 181/A Sede Scientifica di Ingegneria-Palazzina 3, 43124 Parma, Italy
| | - Khushboo Munir
- Department of Information Engineering, Electronics and Telecommunications, “Sapienza” University of Rome, 00184 Rome, Italy; (F.F.); (K.M.)
| | - Giorgio Grani
- Department of Translational and Precision Medicine, “Sapienza” University of Rome, 00161 Rome, Italy; (G.G.); (C.D.); (E.D.)
| | - Cosimo Durante
- Department of Translational and Precision Medicine, “Sapienza” University of Rome, 00161 Rome, Italy; (G.G.); (C.D.); (E.D.)
| | - Vito D’Andrea
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
| | - Emanuele David
- Department of Translational and Precision Medicine, “Sapienza” University of Rome, 00161 Rome, Italy; (G.G.); (C.D.); (E.D.)
| | - Pietro Giorgio Calò
- Department of Surgical Sciences, “Policlinico Universitario Duilio Casula”, University of Cagliari, 09042 Monserrato, Italy;
| | - Eleonora Lori
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
| | - Vito Cantisani
- Department of Radiological, Anatomo-Pathological Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (V.D.); (V.C.)
| |
Collapse
|
19
|
Xue Y, Zhou Y, Wang T, Chen H, Wu L, Ling H, Wang H, Qiu L, Ye D, Wang B. Accuracy of Ultrasound Diagnosis of Thyroid Nodules Based on Artificial Intelligence-Assisted Diagnostic Technology: A Systematic Review and Meta-Analysis. Int J Endocrinol 2022; 2022:9492056. [PMID: 36193283 PMCID: PMC9525757 DOI: 10.1155/2022/9492056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 08/04/2022] [Accepted: 08/24/2022] [Indexed: 12/07/2022] Open
Abstract
BACKGROUND Ultrasonography (US) is the most common method of identifying thyroid nodules, but US images require an experienced surgeon for identification. Many artificial intelligence (AI) techniques such as computer-aided diagnostic systems (CAD), deep learning (DL), and machine learning (ML) have been used to assist in the diagnosis of thyroid nodules, but whether AI techniques can improve the diagnostic accuracy of thyroid nodules still needs to be explored. OBJECTIVE To clarify the accuracy of AI-based thyroid nodule US images for differentiating benign and malignant thyroid nodules. METHODS A search strategy of "subject terms + key words" was used to search PubMed, Cochrane Library, Embase, Web of Science, China Biology Medicine (CBM), and China National Knowledge Infrastructure (CNKI) for studies on AI-assisted diagnosis of thyroid nodules based on US images. The summarized receiver operating characteristic (SROC) curve and the pooled sensitivity and specificity were used to assess the performance of the diagnostic tests. The quality assessment of diagnostics accuracy studies-2 (QUADAS-2) tool was used to assess the methodological quality of the included studies. The Review Manager 5.3 and Stata 15 were used to process the data. Subgroup analysis was based on the integrity of data collection. RESULTS A total of 25 studies with 17,429 US images of thyroid nodules were included. AI-assisted diagnostic techniques had better diagnostic efficacy in the diagnosis of benign and malignant thyroid nodules: sensitivity 0.88 (95% CI: (0.85-0.90)), specificity 0.81 (95% CI: 0.74-0.86), diagnostic odds ratio (DOR) 30 (95% CI: 19-46). The SROC curve indicated that the area under the curve (AUC) was 0.92 (95% CI: 0.89-0.94). Threshold effect analysis showed a Spearman correlation coefficient: 0.17 < 0.5, suggesting no threshold effect for the included studies. After a meta-regression analysis of 4 different subgroups, the results showed a statistically significant effect of mean age ≥50 years on heterogeneity. Compared with studies with an average age of ≥50 years, AI-assisted diagnostic techniques had higher diagnostic performance in studies with an average age of <50 years (0.89 (95% CI: 0.87-0.92) vs. 0.80 (95% CI: 0.73-0.88)), (0.83 (95% CI: 0.77-0.88) vs. 0.73 (95% CI: 0.60-0.87)). CONCLUSIONS AI-assisted diagnostic techniques had good diagnostic efficacy for thyroid nodules. For the diagnosis of <50 year olds, AI-assisted diagnostic technology was more effective in diagnosis.
Collapse
Affiliation(s)
- Yu Xue
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Ying Zhou
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Tingrui Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Huijuan Chen
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Lingling Wu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Huayun Ling
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Hong Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Lijuan Qiu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Dongqing Ye
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| | - Bin Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, Anhui, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, Anhui, China
| |
Collapse
|