1
|
Abad M, Casas-Roma J, Prados F. Generalizable disease detection using model ensemble on chest X-ray images. Sci Rep 2024; 14:5890. [PMID: 38467705 PMCID: PMC10928229 DOI: 10.1038/s41598-024-56171-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 03/03/2024] [Indexed: 03/13/2024] Open
Abstract
In the realm of healthcare, the demand for swift and precise diagnostic tools has been steadily increasing. This study delves into a comprehensive performance analysis of three pre-trained convolutional neural network (CNN) architectures: ResNet50, DenseNet121, and Inception-ResNet-v2. To ensure the broad applicability of our approach, we curated a large-scale dataset comprising a diverse collection of chest X-ray images, that included both positive and negative cases of COVID-19. The models' performance was evaluated using separate datasets for internal validation (from the same source as the training images) and external validation (from different sources). Our examination uncovered a significant drop in network efficacy, registering a 10.66% reduction for ResNet50, a 36.33% decline for DenseNet121, and a 19.55% decrease for Inception-ResNet-v2 in terms of accuracy. Best results were obtained with DenseNet121 achieving the highest accuracy at 96.71% in internal validation and Inception-ResNet-v2 attaining 76.70% accuracy in external validation. Furthermore, we introduced a model ensemble approach aimed at improving network performance when making inferences on images from diverse sources beyond their training data. The proposed method uses uncertainty-based weighting by calculating the entropy in order to assign appropriate weights to the outputs of each network. Our results showcase the effectiveness of the ensemble method in enhancing accuracy up to 97.38% for internal validation and 81.18% for external validation, while maintaining a balanced ability to detect both positive and negative cases.
Collapse
Affiliation(s)
- Maider Abad
- Universitat Oberta de Catalunya, e-Health Center, Barcelona, Spain.
| | - Jordi Casas-Roma
- Universitat Oberta de Catalunya, e-Health Center, Barcelona, Spain
- Department of Computer Science, Universitat Autònoma de Barcelona (UAB), Bellaterra, Spain
- Computer Vision Center (CVC), Universitat Autònoma de Barcelona (UAB), Bellaterra, Spain
| | - Ferran Prados
- Universitat Oberta de Catalunya, e-Health Center, Barcelona, Spain
- Queen Square MS Centre, Department of Neuroinflammation, UCL Queen Square Institute of Neurology, Faculty of Brain Science, University College of London, London, WC1N 3BG, UK
- Centre for Medical Image Computing (CMIC), Department of Medical Physics and Bioengineering, University College London, London, WC1V 6LJ, UK
| |
Collapse
|
2
|
Shayegan MJ. A brief review and scientometric analysis on ensemble learning methods for handling COVID-19. Heliyon 2024; 10:e26694. [PMID: 38420425 PMCID: PMC10901105 DOI: 10.1016/j.heliyon.2024.e26694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 02/07/2024] [Accepted: 02/19/2024] [Indexed: 03/02/2024] Open
Abstract
Numerous efforts and research have been conducted worldwide to combat the coronavirus disease 2019 (COVID-19) pandemic. In this regard, some researchers have focused on deep and machine-learning approaches to discover more about this disease. There have been many articles on using ensemble learning methods for COVID-19 detection. Still, there seems to be no scientometric analysis or a brief review of these researches. Hence, a combined method of scientometric analysis and brief review was used to study the published articles that employed an ensemble learning approach to detect COVID-19. This research used both methods to overcome their limitations, leading to enhanced and reliable outcomes. The related articles were retrieved from the Scopus database. Then a two-step procedure was employed. A concise review of the collected articles was conducted. Then they underwent scientometric and bibliometric analyses. The findings revealed that convolutional neural network (CNN) is the mostly employed algorithm, while support vector machine (SVM), random forest, Resnet, DenseNet, and visual geometry group (VGG) were also frequently used. Additionally, China has had a significant presence in the numerous top-ranking categories of this field of research. Both study phases yielded valuable results and rankings.
Collapse
|
3
|
Rahman MA, Brown DJ, Mahmud M, Harris M, Shopland N, Heym N, Sumich A, Turabee ZB, Standen B, Downes D, Xing Y, Thomas C, Haddick S, Premkumar P, Nastase S, Burton A, Lewis J. Enhancing biofeedback-driven self-guided virtual reality exposure therapy through arousal detection from multimodal data using machine learning. Brain Inform 2023; 10:14. [PMID: 37341863 DOI: 10.1186/s40708-023-00193-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 05/15/2023] [Indexed: 06/22/2023] Open
Abstract
Virtual reality exposure therapy (VRET) is a novel intervention technique that allows individuals to experience anxiety-evoking stimuli in a safe environment, recognise specific triggers and gradually increase their exposure to perceived threats. Public-speaking anxiety (PSA) is a prevalent form of social anxiety, characterised by stressful arousal and anxiety generated when presenting to an audience. In self-guided VRET, participants can gradually increase their tolerance to exposure and reduce anxiety-induced arousal and PSA over time. However, creating such a VR environment and determining physiological indices of anxiety-induced arousal or distress is an open challenge. Environment modelling, character creation and animation, psychological state determination and the use of machine learning (ML) models for anxiety or stress detection are equally important, and multi-disciplinary expertise is required. In this work, we have explored a series of ML models with publicly available data sets (using electroencephalogram and heart rate variability) to predict arousal states. If we can detect anxiety-induced arousal, we can trigger calming activities to allow individuals to cope with and overcome distress. Here, we discuss the means of effective selection of ML models and parameters in arousal detection. We propose a pipeline to overcome the model selection problem with different parameter settings in the context of virtual reality exposure therapy. This pipeline can be extended to other domains of interest where arousal detection is crucial. Finally, we have implemented a biofeedback framework for VRET where we successfully provided feedback as a form of heart rate and brain laterality index from our acquired multimodal data for psychological intervention to overcome anxiety.
Collapse
Affiliation(s)
- Muhammad Arifur Rahman
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - David J Brown
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - Mufti Mahmud
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK.
- Medical Technologies Innovation Facility, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK.
- Computing and Informatics Research Centre, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK.
| | - Matthew Harris
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - Nicholas Shopland
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - Nadja Heym
- School of Social Sciences, Nottingham Trent University, Shakespeare St, Nottingham, NG1 4FQ, UK
| | - Alexander Sumich
- School of Social Sciences, Nottingham Trent University, Shakespeare St, Nottingham, NG1 4FQ, UK
| | - Zakia Batool Turabee
- School of Social Sciences, Nottingham Trent University, Shakespeare St, Nottingham, NG1 4FQ, UK
| | - Bradley Standen
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - David Downes
- Nottingham School of Art & Design, Nottingham Trent University, Shakespeare St, Nottingham, NG1 4FQ, UK
| | - Yangang Xing
- School of ADBE, Nottingham Trent University, Shakespeare St, Nottingham, NG1 4FQ, UK
| | - Carolyn Thomas
- Nottingham School of Art & Design, Nottingham Trent University, Shakespeare St, Nottingham, NG1 4FQ, UK
| | - Sean Haddick
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - Preethi Premkumar
- Division of Psychology, London South Bank University, London, SE1 0AA, UK
| | | | - Andrew Burton
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| | - James Lewis
- Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham, NG11 8NS, UK
| |
Collapse
|
4
|
Hajamohideen F, Shaffi N, Mahmud M, Subramanian K, Al Sariri A, Vimbi V, Abdesselam A. Four-way classification of Alzheimer's disease using deep Siamese convolutional neural network with triplet-loss function. Brain Inform 2023; 10:5. [PMID: 36806042 PMCID: PMC9937523 DOI: 10.1186/s40708-023-00184-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 01/03/2023] [Indexed: 02/19/2023] Open
Abstract
Alzheimer's disease (AD) is a neurodegenerative disease that causes irreversible damage to several brain regions, including the hippocampus causing impairment in cognition, function, and behaviour. Early diagnosis of the disease will reduce the suffering of the patients and their family members. Towards this aim, in this paper, we propose a Siamese Convolutional Neural Network (SCNN) architecture that employs the triplet-loss function for the representation of input MRI images as k-dimensional embeddings. We used both pre-trained and non-pretrained CNNs to transform images into the embedding space. These embeddings are subsequently used for the 4-way classification of Alzheimer's disease. The model efficacy was tested using the ADNI and OASIS datasets which produced an accuracy of 91.83% and 93.85%, respectively. Furthermore, obtained results are compared with similar methods proposed in the literature.
Collapse
Affiliation(s)
- Faizal Hajamohideen
- College of Computing and Information Sciences, University of Technology and Applied Sciences, Jamia Street, 311 Sohar, Sultanate of Oman
| | - Noushath Shaffi
- College of Computing and Information Sciences, University of Technology and Applied Sciences, Jamia Street, 311 Sohar, Sultanate of Oman
| | - Mufti Mahmud
- Department of Computer Science, Nottingham Trent University, Clifton Lane, NG11 8NS Nottingham, UK
- Medical Technologies Innovation Facility, Nottingham Trent University, Clifton Lane, NG11 8NS Nottingham, UK
- Computing and Informatics Research Centre, Nottingham Trent University, Clifton Lane, NG11 8NS Nottingham, UK
| | - Karthikeyan Subramanian
- College of Computing and Information Sciences, University of Technology and Applied Sciences, Jamia Street, 311 Sohar, Sultanate of Oman
| | - Arwa Al Sariri
- College of Computing and Information Sciences, University of Technology and Applied Sciences, Jamia Street, 311 Sohar, Sultanate of Oman
| | - Viswan Vimbi
- College of Computing and Information Sciences, University of Technology and Applied Sciences, Jamia Street, 311 Sohar, Sultanate of Oman
| | - Abdelhamid Abdesselam
- Department of Computer Science, Sultan Qaboos University, 123 Muscat, Sultanate of Oman
| | - for the Alzheimer’s Disease Neuroimaging Initiative
- College of Computing and Information Sciences, University of Technology and Applied Sciences, Jamia Street, 311 Sohar, Sultanate of Oman
- Department of Computer Science, Nottingham Trent University, Clifton Lane, NG11 8NS Nottingham, UK
- Medical Technologies Innovation Facility, Nottingham Trent University, Clifton Lane, NG11 8NS Nottingham, UK
- Computing and Informatics Research Centre, Nottingham Trent University, Clifton Lane, NG11 8NS Nottingham, UK
- Department of Computer Science, Sultan Qaboos University, 123 Muscat, Sultanate of Oman
| |
Collapse
|
5
|
Ullah N, Khan JA, El-Sappagh S, El-Rashidy N, Khan MS. A Holistic Approach to Identify and Classify COVID-19 from Chest Radiographs, ECG, and CT-Scan Images Using ShuffleNet Convolutional Neural Network. Diagnostics (Basel) 2023; 13:diagnostics13010162. [PMID: 36611454 PMCID: PMC9818310 DOI: 10.3390/diagnostics13010162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/21/2022] [Accepted: 12/28/2022] [Indexed: 01/05/2023] Open
Abstract
Early and precise COVID-19 identification and analysis are pivotal in reducing the spread of COVID-19. Medical imaging techniques, such as chest X-ray or chest radiographs, computed tomography (CT) scan, and electrocardiogram (ECG) trace images are the most widely known for early discovery and analysis of the coronavirus disease (COVID-19). Deep learning (DL) frameworks for identifying COVID-19 positive patients in the literature are limited to one data format, either ECG or chest radiograph images. Moreover, using several data types to recover abnormal patterns caused by COVID-19 could potentially provide more information and restrict the spread of the virus. This study presents an effective COVID-19 detection and classification approach using the Shufflenet CNN by employing three types of images, i.e., chest radiograph, CT-scan, and ECG-trace images. For this purpose, we performed extensive classification experiments with the proposed approach using each type of image. With the chest radiograph dataset, we performed three classification experiments at different levels of granularity, i.e., binary, three-class, and four-class classifications. In addition, we performed a binary classification experiment with the proposed approach by classifying CT-scan images into COVID-positive and normal. Finally, utilizing the ECG-trace images, we conducted three experiments at different levels of granularity, i.e., binary, three-class, and five-class classifications. We evaluated the proposed approach with the baseline COVID-19 Radiography Database, SARS-CoV-2 CT-scan, and ECG images dataset of cardiac and COVID-19 patients. The average accuracy of 99.98% for COVID-19 detection in the three-class classification scheme using chest radiographs, optimal accuracy of 100% for COVID-19 detection using CT scans, and average accuracy of 99.37% for five-class classification scheme using ECG trace images have proved the efficacy of our proposed method over the contemporary methods. The optimal accuracy of 100% for COVID-19 detection using CT scans and the accuracy gain of 1.54% (in the case of five-class classification using ECG trace images) from the previous approach, which utilized ECG images for the first time, has a major contribution to improving the COVID-19 prediction rate in early stages. Experimental findings demonstrate that the proposed framework outperforms contemporary models. For example, the proposed approach outperforms state-of-the-art DL approaches, such as Squeezenet, Alexnet, and Darknet19, by achieving the accuracy of 99.98 (proposed method), 98.29, 98.50, and 99.67, respectively.
Collapse
Affiliation(s)
- Naeem Ullah
- Department of Software Engineering, University of Engineering and Technology Taxila, Taxila 47050, Pakistan
| | - Javed Ali Khan
- Department of Software Engineering, University of Science and Technology Bannu, Bannu 28100, Pakistan
- Correspondence: or (J.A.K.); (S.E.-S.)
| | - Shaker El-Sappagh
- Faculty of Computer Science and Engineering, Galala University, Suez 435611, Egypt
- Information Systems Department, Faculty of Computers and Artificial Intelligence, Benha University, Banha 13518, Egypt
- Correspondence: or (J.A.K.); (S.E.-S.)
| | - Nora El-Rashidy
- Department of Machine Learning and Information Retrieval, Faculty of Artificial Intelligence, Kafrelsheiksh University, Kafr Elsheikh 33516, Egypt
| | - Mohammad Sohail Khan
- Department of Computer Software Engineering, University of Engineering and Technology Mardan, Mardan 23200, Pakistan
| |
Collapse
|
6
|
Dey S, Bhattacharya R, Malakar S, Schwenker F, Sarkar R. CovidConvLSTM: A fuzzy ensemble model for COVID-19 detection from chest X-rays. EXPERT SYSTEMS WITH APPLICATIONS 2022; 206:117812. [PMID: 35754941 PMCID: PMC9212804 DOI: 10.1016/j.eswa.2022.117812] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 06/05/2022] [Accepted: 06/06/2022] [Indexed: 05/17/2023]
Abstract
The rapid outbreak of COVID-19 has affected the lives and livelihoods of a large part of the society. Hence, to confine the rapid spread of this virus, early detection of COVID-19 is extremely important. One of the most common ways of detecting COVID-19 is by using chest X-ray images. In the literature, it is found that most of the research activities applied convolutional neural network (CNN) models where the features generated by the last convolutional layer were directly passed to the classification models. In this paper, convolutional long short-term memory (ConvLSTM) layer is used in order to encode the spatial dependency among the feature maps obtained from the last convolutional layer of the CNN and to improve the image representational capability of the model. Additionally, the squeeze-and-excitation (SE) block, a spatial attention mechanism, is used to allocate weights to important local features. These two mechanisms are employed on three popular CNN models - VGG19, InceptionV3, and MobileNet to improve their classification strength. Finally, the Sugeno fuzzy integral based ensemble method is used on these classifiers' outputs to enhance the detection accuracy further. For experiments, three chest X-ray datasets, which are very prevalent for COVID-19 detection, are considered. For all the three datasets, it is found that the results obtained by the proposed method are comparable to state-of-the-art methods. The code, along with the pre-trained models, can be found at https://github.com/colabpro123/CovidConvLSTM.
Collapse
Affiliation(s)
- Subhrajit Dey
- Department of Electrical Engineering, Jadavpur University, Kolkata, India
| | - Rajdeep Bhattacharya
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | - Samir Malakar
- Department of Computer Science, Asutosh College, Kolkata, India
| | | | - Ram Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| |
Collapse
|
7
|
Pramanik R, Dey S, Malakar S, Mirjalili S, Sarkar R. TOPSIS aided ensemble of CNN models for screening COVID-19 in chest X-ray images. Sci Rep 2022; 12:15409. [PMID: 36104401 PMCID: PMC9471038 DOI: 10.1038/s41598-022-18463-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 08/12/2022] [Indexed: 11/23/2022] Open
Abstract
The novel coronavirus (COVID-19), has undoubtedly imprinted our lives with its deadly impact. Early testing with isolation of the individual is the best possible way to curb the spread of this deadly virus. Computer aided diagnosis (CAD) provides an alternative and cheap option for screening of the said virus. In this paper, we propose a convolution neural network (CNN)-based CAD method for COVID-19 and pneumonia detection from chest X-ray images. We consider three input types for three identical base classifiers. To capture maximum possible complementary features, we consider the original RGB image, Red channel image and the original image stacked with Robert's edge information. After that we develop an ensemble strategy based on the technique for order preference by similarity to an ideal solution (TOPSIS) to aggregate the outcomes of base classifiers. The overall framework, called TOPCONet, is very light in comparison with standard CNN models in terms of the number of trainable parameters required. TOPCONet achieves state-of-the-art results when evaluated on the three publicly available datasets: (1) IEEE COVID-19 dataset + Kaggle Pneumonia Dataset, (2) Kaggle Radiography dataset and (3) COVIDx.
Collapse
|
8
|
MXT: A New Variant of Pyramid Vision Transformer for Multi-label Chest X-ray Image Classification. Cognit Comput 2022. [DOI: 10.1007/s12559-022-10032-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|