1
|
Costa MVL, de Aguiar EJ, Rodrigues LS, Traina C, Traina AJM. DEELE-Rad: exploiting deep radiomics features in deep learning models using COVID-19 chest X-ray images. Health Inf Sci Syst 2025; 13:11. [PMID: 39741501 PMCID: PMC11683036 DOI: 10.1007/s13755-024-00330-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 12/17/2024] [Indexed: 01/03/2025] Open
Abstract
Purpose Deep learning-based radiomics techniques have the potential to aid specialists and physicians in performing decision-making in COVID-19 scenarios. Specifically, a Deep Learning (DL) ensemble model is employed to classify medical images when addressing the diagnosis during the classification tasks for COVID-19 using chest X-ray images. It also provides feasible and reliable visual explicability concerning the results to support decision-making. Methods Our DEELE-Rad approach integrates DL and Machine Learning (ML) techniques. We use deep learning models to extract deep radiomics features and evaluate its performance regarding end-to-end classifiers. We avoid successive radiomics approach steps by employing these models with transfer learning techniques from ImageNet, such as VGG16, ResNet50V2, and DenseNet201 architectures. We extract 100 and 500 deep radiomics features from each DL model. We also placed these features into well-established ML classifiers and applied automatic parameter tuning and a cross-validation strategy. Besides, we exploit insights into the decision-making behavior by applying a visual explanation method. Results Experimental evaluation on our proposed approach achieved 89.97% AUC when using 500 deep radiomics features from the DenseNet201 end-to-end classifier. Besides, our ensemble DEELE-Rad method improves the results up to 96.19% AUC for the 500 dimensions. To outperform, ML DEELE-Rad reached the best results with an Accuracy of 98.39% and 99.19% AUC for the same setup. Our visual assessment employs additional possibilities for specialists and physicians to decision-making. Conclusion The results reflect that the DEELE-Rad approach provides robustness and confidence to the images' analysis. Our approach can benefit healthcare specialists when employed at clinical routines and respective decision-making procedures. For reproducibility, our code is available at https://github.com/usmarcv/deele-rad.
Collapse
Affiliation(s)
- Márcus V. L. Costa
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Erikson J. de Aguiar
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Lucas S. Rodrigues
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Caetano Traina
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Agma J. M. Traina
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| |
Collapse
|
2
|
Chihaoui M, Dhibi N, Ferchichi A. Optimization of convolutional neural network and visual geometry group-16 using genetic algorithms for pneumonia detection. Front Med (Lausanne) 2024; 11:1498403. [PMID: 39697204 PMCID: PMC11653186 DOI: 10.3389/fmed.2024.1498403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Accepted: 11/11/2024] [Indexed: 12/20/2024] Open
Abstract
Pneumonia is still a major global health issue, so effective diagnostic methods are needed. This research proposes a new methodology for improving convolutional neural networks (CNNs) and the Visual Geometry Group-16 (VGG16) model by incorporating genetic algorithms (GAs) to detect pneumonia. The work uses a dataset of 5,856 frontal chest radiography images critical in training and testing machine learning algorithms. The issue relates to challenges of medical image classification, the complexity of which can be significantly addressed by properly optimizing CNN. Moreover, our proposed methodology used GAs to determine the hyperparameters for CNNs and VGG16 and fine-tune the architecture to improve the existing performance measures. The evaluation of the optimized models showed some good performances with purely convolutional neural network archetyping, averaging 97% in terms of training accuracy and 94% based on the testing process. At the same time, it has a low error rate of 0.072. Although adding this layer affected the training and testing time, it created a new impression on the test accuracy and training accuracy of the VGG16 model, with 90.90% training accuracy, 90.90%, and a loss of 0.11. Future work will involve contributing more examples so that a richer database of radiographic images is attained, optimizing the GA parameters even more, and pursuing the use of ensemble applications so that the diagnosis capability is heightened. Apart from emphasizing the contribution of GAs in improving the CNN architecture, this study also seeks to contribute to the early detection of pneumonia to minimize the complications faced by patients, especially children.
Collapse
Affiliation(s)
- Mejda Chihaoui
- Computer Science Department, Applied College, University of Ha'il, Hail, Saudi Arabia
- REGIM: Research Groups on Intelligent Machines, University of Sfax, National School of Engineers (ENIS), Sfax, Tunisia
| | - Naziha Dhibi
- REGIM: Research Groups on Intelligent Machines, University of Sfax, National School of Engineers (ENIS), Sfax, Tunisia
| | - Ahlem Ferchichi
- Computer Science Department, Applied College, University of Ha'il, Hail, Saudi Arabia
| |
Collapse
|
3
|
Alam FB, Podder P, Mondal MRH. RVCNet: A hybrid deep neural network framework for the diagnosis of lung diseases. PLoS One 2023; 18:e0293125. [PMID: 38153925 PMCID: PMC10754458 DOI: 10.1371/journal.pone.0293125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 10/06/2023] [Indexed: 12/30/2023] Open
Abstract
Early evaluation and diagnosis can significantly reduce the life-threatening nature of lung diseases. Computer-aided diagnostic systems (CADs) can help radiologists make more precise diagnoses and reduce misinterpretations in lung disease diagnosis. Existing literature indicates that more research is needed to correctly classify lung diseases in the presence of multiple classes for different radiographic imaging datasets. As a result, this paper proposes RVCNet, a hybrid deep neural network framework for predicting lung diseases from an X-ray dataset of multiple classes. This framework is developed based on the ideas of three deep learning techniques: ResNet101V2, VGG19, and a basic CNN model. In the feature extraction phase of this new hybrid architecture, hyperparameter fine-tuning is used. Additional layers, such as batch normalization, dropout, and a few dense layers, are applied in the classification phase. The proposed method is applied to a dataset of COVID-19, non-COVID lung infections, viral pneumonia, and normal patients' X-ray images. The experiments take into account 2262 training and 252 testing images. Results show that with the Nadam optimizer, the proposed algorithm has an overall classification accuracy, AUC, precision, recall, and F1-score of 91.27%, 92.31%, 90.48%, 98.30%, and 94.23%, respectively. Finally, these results are compared with some recent deep-learning models. For this four-class dataset, the proposed RVCNet has a classification accuracy of 91.27%, which is better than ResNet101V2, VGG19, VGG19 over CNN, and other stand-alone models. Finally, the application of the GRAD-CAM approach clearly interprets the classification of images by the RVCNet framework.
Collapse
Affiliation(s)
- Fatema Binte Alam
- Institute of Information and Communication Technology, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Prajoy Podder
- Institute of Information and Communication Technology, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - M. Rubaiyat Hossain Mondal
- Institute of Information and Communication Technology, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| |
Collapse
|
4
|
Yao D, Xu Z, Lin Y, Zhan Y. Accurate and intelligent diagnosis of pediatric pneumonia using X-ray images and blood testing data. Front Bioeng Biotechnol 2023; 11:1058888. [PMID: 37292095 PMCID: PMC10245274 DOI: 10.3389/fbioe.2023.1058888] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 01/30/2023] [Indexed: 06/10/2023] Open
Abstract
Computer-aided diagnosis (CAD) methods such as the X-rays-based method is one of the cheapest and safe alternative options to diagnose the disease compared to other alternatives such as Computed Tomography (CT) scan, and so on. However, according to our experiments on X-ray public datasets and real clinical datasets, we found that there are two challenges in the current classification of pneumonia: existing public datasets have been preprocessed too well, making the accuracy of the results relatively high; existing models have weak ability to extract features from the clinical pneumonia X-ray dataset. To solve the dataset problems, we collected a new dataset of pediatric pneumonia with labels obtained through a comprehensive pathogen-radiology-clinical diagnostic screening. Then, to accurately capture the important features in imbalanced data, based on the new dataset, we proposed for the first time a two-stage training multimodal pneumonia classification method combining X-ray images and blood testing data, which improves the image feature extraction ability through a global-local attention module and mitigate the influence of class imbalance data on the results through the two-stage training strategy. In experiments, the performance of our proposed model is the best on new clinical data and outperforms the diagnostic accuracy of four experienced radiologists. Through further research on the performance of various blood testing indicators in the model, we analyzed the conclusions that are helpful for radiologists to diagnose.
Collapse
Affiliation(s)
- Dan Yao
- State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences and Biomedical Engineering, Hebei University of Technology, Tianjin, China
| | - Zhenghua Xu
- State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences and Biomedical Engineering, Hebei University of Technology, Tianjin, China
| | - Yi Lin
- Department of Radiology, Hainan Women and Children’s Medical Center, Haikou, China
| | - Yuefu Zhan
- Department of Radiology, Hainan Women and Children’s Medical Center, Haikou, China
| |
Collapse
|
5
|
COVID-19 Data Analytics Using Extended Convolutional Technique. Interdiscip Perspect Infect Dis 2022; 2022:4578838. [DOI: 10.1155/2022/4578838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 10/25/2022] [Accepted: 10/27/2022] [Indexed: 11/09/2022] Open
Abstract
The healthcare system, lifestyle, industrial growth, economy, and livelihood of human beings worldwide were affected due to the triggered global pandemic by the COVID-19 virus that originated and was first reported in Wuhan city, Republic Country of China. COVID cases are difficult to predict and detect in their early stages, and their spread and mortality are uncontrollable. The reverse transcription polymerase chain reaction (RT-PCR) is still the first and foremost diagnostical methodology accepted worldwide; hence, it creates a scope of new diagnostic tools and techniques of detection approach which can produce effective and faster results compared with its predecessor. Innovational through current studies that complement the existence of the novel coronavirus (COVID-19) to findings in the thorax (chest) X-ray imaging, the projected research’s method makes use of present deep learning (DL) models with the integration of various frameworks such as GoogleNet, U-Net, and ResNet50 to novel method those X-ray images and categorize patients as the corona positive (COVID + ve) or the corona negative (COVID -ve). The anticipated technique entails the pretreatment phase through dissection of the lung, getting rid of the environment which does now no longer provide applicable facts and can provide influenced consequences; then after this, the preliminary degree comes up with the category version educated below the switch mastering system; and in conclusion, consequences are evaluated and interpreted through warmth maps visualization. The proposed research method completed a detection accuracy of COVID-19 at around 99%.
Collapse
|
6
|
Sitaula C, Shahi TB. Monkeypox Virus Detection Using Pre-trained Deep Learning-based Approaches. J Med Syst 2022; 46:78. [PMID: 36201085 PMCID: PMC9535233 DOI: 10.1007/s10916-022-01868-2] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 09/22/2022] [Indexed: 11/17/2022]
Abstract
Monkeypox virus is emerging slowly with the decline of COVID-19 virus infections around the world. People are afraid of it, thinking that it would appear as a pandemic like COVID-19. As such, it is crucial to detect them earlier before widespread community transmission. AI-based detection could help identify them at the early stage. In this paper, we aim to compare 13 different pre-trained deep learning (DL) models for the Monkeypox virus detection. For this, we initially fine-tune them with the addition of universal custom layers for all of them and analyse the results using four well-established measures: Precision, Recall, F1-score, and Accuracy. After the identification of the best-performing DL models, we ensemble them to improve the overall performance using a majority voting over the probabilistic outputs obtained from them. We perform our experiments on a publicly available dataset, which results in average Precision, Recall, F1-score, and Accuracy of 85.44%, 85.47%, 85.40%, and 87.13%, respectively with the help of our proposed ensemble approach. These encouraging results, which outperform the state-of-the-art methods, suggest that the proposed approach is applicable to health practitioners for mass screening.
Collapse
Affiliation(s)
- Chiranjibi Sitaula
- Department of Electrical and Computer Systems Engineering, Monash University, Wellignton Rd, Clayton, VIC 3800 Australia
| | - Tej Bahadur Shahi
- School of Engineering and Technology, Central Queensland University, Norman Garden, QLD 4701 Australia
- Central Department of Computer Science and IT, Tribhuvan University, TU Rd, Kirtipur, Kathmandu 44618 Nepal
| |
Collapse
|
7
|
Furtado A, da Purificação CAC, Badaró R, Nascimento EGS. A Light Deep Learning Algorithm for CT Diagnosis of COVID-19 Pneumonia. Diagnostics (Basel) 2022; 12:1527. [PMID: 35885433 PMCID: PMC9319098 DOI: 10.3390/diagnostics12071527] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 06/20/2022] [Accepted: 06/20/2022] [Indexed: 11/24/2022] Open
Abstract
A large number of reports present artificial intelligence (AI) algorithms, which support pneumonia detection caused by COVID-19 from chest CT (computed tomography) scans. Only a few studies provided access to the source code, which limits the analysis of the out-of-distribution generalization ability. This study presents Cimatec-CovNet-19, a new light 3D convolutional neural network inspired by the VGG16 architecture that supports COVID-19 identification from chest CT scans. We trained the algorithm with a dataset of 3000 CT Scans (1500 COVID-19-positive) with images from different parts of the world, enhanced with 3000 images obtained with data augmentation techniques. We introduced a novel pre-processing approach to perform a slice-wise selection based solely on the lung CT masks and an empirically chosen threshold for the very first slice. It required only 16 slices from a CT examination to identify COVID-19. The model achieved a recall of 0.88, specificity of 0.88, ROC-AUC of 0.95, PR-AUC of 0.95, and F1-score of 0.88 on a test set with 414 samples (207 COVID-19). These results support Cimatec-CovNet-19 as a good and light screening tool for COVID-19 patients. The whole code is freely available for the scientific community.
Collapse
Affiliation(s)
- Adhvan Furtado
- Supercomputing Center SENAI CIMATEC, Av. Orlando Gomes, 1845, Piatã, Salvador 41560-010, Brazil; (A.F.); (C.A.C.d.P.)
| | | | - Roberto Badaró
- Instituto SENAI de Inovação em Saúde, Av. Orlando Gomes, 1845, Piatã, Salvador 41560-010, Brazil;
| | | |
Collapse
|
8
|
Li F, Zhou L, Wang Y, Chen C, Yang S, Shan F, Liu L. Modeling long-range dependencies for weakly supervised disease classification and localization on chest X-ray. Quant Imaging Med Surg 2022; 12:3364-3378. [PMID: 35655823 PMCID: PMC9131331 DOI: 10.21037/qims-21-1117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 03/14/2022] [Indexed: 12/31/2023]
Abstract
BACKGROUND Computer-aided diagnosis based on chest X-ray (CXR) is an exponentially growing field of research owing to the development of deep learning, especially convolutional neural networks (CNNs). However, due to the intrinsic locality of convolution operations, CNNs cannot model long-range dependencies. Although vision transformers (ViTs) have recently been proposed to alleviate this limitation, those trained on patches cannot learn any dependencies for inter-patch pixels and thus, are insufficient for medical image detection. To address this problem, in this paper, we propose a CXR detection method which integrates CNN with a ViT for modeling patch-wise and inter-patch dependencies. METHODS We experimented on the ChestX-ray14 dataset and followed the official training-test set split. Because the training data only had global annotations, the detection network was weakly supervised. A DenseNet with a feature pyramid structure was designed and integrated with an adaptive ViT to model inter-patch and patch-wise long-range dependencies and obtain fine-grained feature maps. We compared the performance using our method with that of other disease detection methods. RESULTS For disease classification, our method achieved the best result among all the disease detection methods, with a mean area under the curve (AUC) of 0.829. For lesion localization, our method achieved significantly higher intersection of the union (IoU) scores on the test images with bounding box annotations than did the other detection methods. The visualized results showed that our predictions were more accurate and detailed. Furthermore, evaluation of our method in an external validation dataset demonstrated its generalization ability. CONCLUSIONS Our proposed method achieves the new state of the art for thoracic disease classification and weakly supervised localization. It has potential to assist in clinical decision-making.
Collapse
Affiliation(s)
- Fangyun Li
- Institute of Biomedical Sciences, Fudan University, Shanghai, China
| | - Lingxiao Zhou
- Institute of Microscale Optoelectronics, Shenzhen University, Shenzhen, China
| | - Yunpeng Wang
- Institute of Biomedical Sciences, Fudan University, Shanghai, China
| | - Chuan Chen
- Department of Radiology, Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
| | - Shuyi Yang
- Department of Radiology, Zhongshan Hospital, Fudan University, Shanghai, China
- Shanghai Institute of Medical Imaging, Shanghai, China
| | - Fei Shan
- Department of Radiology, Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
| | - Lei Liu
- Institute of Biomedical Sciences, Fudan University, Shanghai, China
- School of Basic Medical Sciences, Fudan University, Shanghai, China
| |
Collapse
|
9
|
New Optimized Deep Learning Application for COVID-19 Detection in Chest X-ray Images. Symmetry (Basel) 2022. [DOI: 10.3390/sym14051003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Due to false negative results of the real-time Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR) test, the complemental practices such as computed tomography (CT) and X-ray in combination with RT-PCR are discussed to achieve a more accurate diagnosis of COVID-19 in clinical practice. Since radiology includes visual understanding as well as decision making under limited conditions such as uncertainty, urgency, patient burden, and hospital facilities, mistakes are inevitable. Therefore, there is an immediate requirement to carry out further investigation and develop new accurate detection and identification methods to provide automatically quantitative evaluation of COVID-19. In this paper, we propose a new computer-aided diagnosis application for COVID-19 detection using deep learning techniques. A new technique, which receives symmetric X-ray data as the input, is presented in this study by combining Convolutional Neural Networks (CNN) with Ant Lion Optimization Algorithm (ALO) and Multiclass Naïve Bayes Classifier (NB). Moreover, several other classifiers such as Softmax, Support Vector Machines (SVM), K-Nearest Neighbors (KNN) and Decision Tree (DT) are combined with CNN. The promising results of these classifiers are evaluated and presented for accuracy, precision, and F1-score metrics. NB classifier with Ant Lion Optimization Algorithm and CNN produced the best results with 98.31% accuracy, 100% precision and 98.25% F1-score and with the lowest execution time.
Collapse
|
10
|
Pneumonia Recognition by Deep Learning: A Comparative Investigation. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12094334] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Pneumonia is a common infectious disease. Currently, the most common method of pneumonia identification is manual diagnosis by professional doctors, but the accuracy and identification efficiency of this method is not satisfactory, and computer-aided diagnosis technology has emerged. With the development of artificial intelligence, deep learning has also been applied to pneumonia diagnosis and can achieve high accuracy. In this paper, we compare five deep learning models in different situations for pneumonia recognition. The objective was to employ five deep learning models to identify pneumonia X-ray images and to compare and analyze them in different cases, thus screening out the optimal model for each type of case to improve the efficiency of pneumonia recognition and further apply it to the computer-aided diagnosis of pneumonia species. In the proposed framework: (1) datasets are collected and processed, (2) five deep learning models for pneumonia recognition are built, (3) the five models are compared, and the optimal model for each case is selected. The results show that the LeNet5 and AlexNet models achieved better pneumonia recognition for small datasets, while the MobileNet and ResNet18 models were more suitable for pneumonia recognition for large datasets. The comparative analysis of each model under different situations can provide a deeper understanding of the efficiency of each model in identifying pneumonia, thus making the practical application and selection of deep learning models for pneumonia recognition more convenient.
Collapse
|
11
|
Deep Learning Applied to Chest Radiograph Classification—A COVID-19 Pneumonia Experience. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12083712] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Due to the recent COVID-19 pandemic, a large number of reports present deep learning algorithms that support the detection of pneumonia caused by COVID-19 in chest radiographs. Few studies have provided the complete source code, limiting testing and reproducibility on different datasets. This work presents Cimatec_XCOV19, a novel deep learning system inspired by the Inception-V3 architecture that is able to (i) support the identification of abnormal chest radiographs and (ii) classify the abnormal radiographs as suggestive of COVID-19. The training dataset has 44,031 images with 2917 COVID-19 cases, one of the largest datasets in recent literature. We organized and published an external validation dataset of 1158 chest radiographs from a Brazilian hospital. Two experienced radiologists independently evaluated the radiographs. The Cimatec_XCOV19 algorithm obtained a sensitivity of 0.85, specificity of 0.82, and AUC ROC of 0.93. We compared the AUC ROC of our algorithm with a well-known public solution and did not find a statistically relevant difference between both performances. We provide full access to the code and the test dataset, enabling this work to be used as a tool for supporting the fast screening of COVID-19 on chest X-ray exams, serving as a reference for educators, and supporting further algorithm enhancements.
Collapse
|
12
|
Shahi TB, Sitaula C, Neupane A, Guo W. Fruit classification using attention-based MobileNetV2 for industrial applications. PLoS One 2022; 17:e0264586. [PMID: 35213643 PMCID: PMC8880666 DOI: 10.1371/journal.pone.0264586] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 02/13/2022] [Indexed: 11/18/2022] Open
Abstract
Recent deep learning methods for fruits classification resulted in promising performance. However, these methods are with heavy-weight architectures in nature, and hence require a higher storage and expensive training operations due to feeding a large number of training parameters. There is a necessity to explore lightweight deep learning models without compromising the classification accuracy. In this paper, we propose a lightweight deep learning model using the pre-trained MobileNetV2 model and attention module. First, the convolution features are extracted to capture the high-level object-based information. Second, an attention module is used to capture the interesting semantic information. The convolution and attention modules are then combined together to fuse both the high-level object-based information and the interesting semantic information, which is followed by the fully connected layers and the softmax layer. Evaluation of our proposed method, which leverages transfer learning approach, on three public fruit-related benchmark datasets shows that our proposed method outperforms the four latest deep learning methods with a smaller number of trainable parameters and a superior classification accuracy. Our model has a great potential to be adopted by industries closely related to the fruit growing and retailing or processing chain for automatic fruit identification and classifications in the future.
Collapse
Affiliation(s)
- Tej Bahadur Shahi
- School of Engineering and Technology, Central Queensland University, North Rockhampton, QLD, Australia
- * E-mail:
| | - Chiranjibi Sitaula
- Department of Electrical and Computer Systems Engineering, Monash University, Melbourne, VIC, Australia
| | - Arjun Neupane
- School of Engineering and Technology, Central Queensland University, North Rockhampton, QLD, Australia
| | - William Guo
- School of Engineering and Technology, Central Queensland University, North Rockhampton, QLD, Australia
| |
Collapse
|
13
|
Aria M, Nourani E, Golzari Oskouei A. ADA-COVID: Adversarial Deep Domain Adaptation-Based Diagnosis of COVID-19 from Lung CT Scans Using Triplet Embeddings. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:2564022. [PMID: 35154300 PMCID: PMC8826267 DOI: 10.1155/2022/2564022] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 12/08/2021] [Accepted: 01/07/2022] [Indexed: 12/12/2022]
Abstract
Rapid diagnosis of COVID-19 with high reliability is essential in the early stages. To this end, recent research often uses medical imaging combined with machine vision methods to diagnose COVID-19. However, the scarcity of medical images and the inherent differences in existing datasets that arise from different medical imaging tools, methods, and specialists may affect the generalization of machine learning-based methods. Also, most of these methods are trained and tested on the same dataset, reducing the generalizability and causing low reliability of the obtained model in real-world applications. This paper introduces an adversarial deep domain adaptation-based approach for diagnosing COVID-19 from lung CT scan images, termed ADA-COVID. Domain adaptation-based training process receives multiple datasets with different input domains to generate domain-invariant representations for medical images. Also, due to the excessive structural similarity of medical images compared to other image data in machine vision tasks, we use the triplet loss function to generate similar representations for samples of the same class (infected cases). The performance of ADA-COVID is evaluated and compared with other state-of-the-art COVID-19 diagnosis algorithms. The obtained results indicate that ADA-COVID achieves classification improvements of at least 3%, 20%, 20%, and 11% in accuracy, precision, recall, and F1 score, respectively, compared to the best results of competitors, even without directly training on the same data. The implementation source code of the ADA-COVID is publicly available at https://github.com/MehradAria/ADA-COVID.
Collapse
Affiliation(s)
- Mehrad Aria
- Faculty of Information Technology and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran
| | - Esmaeil Nourani
- Faculty of Information Technology and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran
| | - Amin Golzari Oskouei
- Department of Computer Engineering, Faculty of Electrical and Computer Engineering, University of Tabriz, Tabriz, Iran
| |
Collapse
|
14
|
Sarki R, Ahmed K, Wang H, Zhang Y, Wang K. Automated detection of COVID-19 through convolutional neural network using chest x-ray images. PLoS One 2022; 17:e0262052. [PMID: 35061767 PMCID: PMC8782355 DOI: 10.1371/journal.pone.0262052] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 12/15/2021] [Indexed: 01/08/2023] Open
Abstract
The COVID-19 epidemic has a catastrophic impact on global well-being and public health. More than 27 million confirmed cases have been reported worldwide until now. Due to the growing number of confirmed cases, and challenges to the variations of the COVID-19, timely and accurate classification of healthy and infected patients is essential to control and treat COVID-19. We aim to develop a deep learning-based system for the persuasive classification and reliable detection of COVID-19 using chest radiography. Firstly, we evaluate the performance of various state-of-the-art convolutional neural networks (CNNs) proposed over recent years for medical image classification. Secondly, we develop and train CNN from scratch. In both cases, we use a public X-Ray dataset for training and validation purposes. For transfer learning, we obtain 100% accuracy for binary classification (i.e., Normal/COVID-19) and 87.50% accuracy for tertiary classification (Normal/COVID-19/Pneumonia). With the CNN trained from scratch, we achieve 93.75% accuracy for tertiary classification. In the case of transfer learning, the classification accuracy drops with the increased number of classes. The results are demonstrated by comprehensive receiver operating characteristics (ROC) and confusion metric analysis with 10-fold cross-validation.
Collapse
Affiliation(s)
- Rubina Sarki
- Institute for Sustainable Industries & Liveable Cities, Victoria University, Melbourne, Victoria, Australia
- * E-mail:
| | - Khandakar Ahmed
- Institute for Sustainable Industries & Liveable Cities, Victoria University, Melbourne, Victoria, Australia
| | - Hua Wang
- Institute for Sustainable Industries & Liveable Cities, Victoria University, Melbourne, Victoria, Australia
| | - Yanchun Zhang
- Institute for Sustainable Industries & Liveable Cities, Victoria University, Melbourne, Victoria, Australia
| | - Kate Wang
- RMIT, Melbourne, Victoria, Australia
| |
Collapse
|
15
|
Sitaula C, Shahi TB, Aryal S, Marzbanrad F. Fusion of multi-scale bag of deep visual words features of chest X-ray images to detect COVID-19 infection. Sci Rep 2021; 11:23914. [PMID: 34903792 PMCID: PMC8668931 DOI: 10.1038/s41598-021-03287-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 11/29/2021] [Indexed: 12/23/2022] Open
Abstract
Chest X-ray (CXR) images have been one of the important diagnosis tools used in the COVID-19 disease diagnosis. Deep learning (DL)-based methods have been used heavily to analyze these images. Compared to other DL-based methods, the bag of deep visual words-based method (BoDVW) proposed recently is shown to be a prominent representation of CXR images for their better discriminability. However, single-scale BoDVW features are insufficient to capture the detailed semantic information of the infected regions in the lungs as the resolution of such images varies in real application. In this paper, we propose a new multi-scale bag of deep visual words (MBoDVW) features, which exploits three different scales of the 4th pooling layer’s output feature map achieved from VGG-16 model. For MBoDVW-based features, we perform the Convolution with Max pooling operation over the 4th pooling layer using three different kernels: \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$1 \times 1$$\end{document}1×1, \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$2 \times 2$$\end{document}2×2, and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$3 \times 3$$\end{document}3×3. We evaluate our proposed features with the Support Vector Machine (SVM) classification algorithm on four CXR public datasets (CD1, CD2, CD3, and CD4) with over 5000 CXR images. Experimental results show that our method produces stable and prominent classification accuracy (84.37%, 88.88%, 90.29%, and 83.65% on CD1, CD2, CD3, and CD4, respectively).
Collapse
Affiliation(s)
- Chiranjibi Sitaula
- Department of Electrical and Computer Systems Engineering, Monash University, Clayton, VIC, 3800, Australia.
| | - Tej Bahadur Shahi
- School of Engineering and Technology, Central Queensland University, Rockhampton, QLD, 4701, Australia.,School of Information Technology, Deakin University, Waurn Ponds, VIC, 3216, Australia
| | - Sunil Aryal
- Central Department of Computer Science and IT, Tribhuvan University, Kathmandu, 44600, Nepal
| | - Faezeh Marzbanrad
- Department of Electrical and Computer Systems Engineering, Monash University, Clayton, VIC, 3800, Australia
| |
Collapse
|