1
|
Rickard D, Kabir MA, Homaira N. Machine learning-based approaches for distinguishing viral and bacterial pneumonia in paediatrics: A scoping review. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 268:108802. [PMID: 40349546 DOI: 10.1016/j.cmpb.2025.108802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2024] [Revised: 04/13/2025] [Accepted: 04/22/2025] [Indexed: 05/14/2025]
Abstract
BACKGROUND AND OBJECTIVE Pneumonia is the leading cause of hospitalisation and mortality among children under five, particularly in low-resource settings. Accurate differentiation between viral and bacterial pneumonia is essential for guiding appropriate treatment, yet it remains challenging due to overlapping clinical and radiographic features. Advances in machine learning (ML), particularly deep learning (DL), have shown promise in classifying pneumonia using chest X-ray (CXR) images. This scoping review summarises the evidence on ML techniques for classifying viral and bacterial pneumonia using CXR images in paediatric patients. METHODS This scoping review was conducted following the Joanna Briggs Institute methodology and the PRISMA-ScR guidelines. A comprehensive search was performed in PubMed, Embase, and Scopus to identify studies involving children (0-18 years) with pneumonia diagnosed through CXR, using ML models for binary or multiclass classification. Data extraction included ML models, dataset characteristics, and performance metrics. RESULTS A total of 35 studies, published between 2018 and 2025, were included in this review. Of these, 31 studies used the publicly available Kermany dataset, raising concerns about overfitting and limited generalisability to broader, real-world clinical populations. Most studies (n=33) used convolutional neural networks (CNNs) for pneumonia classification. While many models demonstrated promising performance, significant variability was observed due to differences in methodologies, dataset sizes, and validation strategies, complicating direct comparisons. For binary classification (viral vs bacterial pneumonia), a median accuracy of 92.3% (range: 80.8% to 97.9%) was reported. For multiclass classification (healthy, viral pneumonia, and bacterial pneumonia), the median accuracy was 91.8% (range: 76.8% to 99.7%). CONCLUSIONS Current evidence is constrained by a predominant reliance on a single dataset and variability in methodologies, which limit the generalisability and clinical applicability of findings. To address these limitations, future research should focus on developing diverse and representative datasets while adhering to standardised reporting guidelines. Such efforts are essential to improve the reliability, reproducibility, and translational potential of machine learning models in clinical settings.
Collapse
Affiliation(s)
- Declan Rickard
- School of Clinical Medicine, UNSW Sydney, Kensington, NSW, 2052, Australia.
| | - Muhammad Ashad Kabir
- School of Computing, Mathematics and Engineering, Charles Sturt University, Bathurst, NSW, 2795, Australia; Artificial Intelligence and Cyber Futures Institute, Charles Sturt University, Bathurst, NSW, 2795, Australia.
| | - Nusrat Homaira
- School of Clinical Medicine, UNSW Sydney, Kensington, NSW, 2052, Australia; Discipline of Pediatrics and Child Health, UNSW Sydney, Randwick, NSW, 2031, Australia; Respiratory Department, Sydney Children's Hospital, Randwick, NSW, 2031, Australia.
| |
Collapse
|
2
|
Hage Chehade A, Abdallah N, Marion JM, Hatt M, Oueidat M, Chauvet P. Advancing chest X-ray diagnostics: A novel CycleGAN-based preprocessing approach for enhanced lung disease classification in ChestX-Ray14. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 259:108518. [PMID: 39615193 DOI: 10.1016/j.cmpb.2024.108518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 10/28/2024] [Accepted: 11/15/2024] [Indexed: 12/11/2024]
Abstract
BACKGROUND AND OBJECTIVE Chest radiography is a medical imaging technique widely used to diagnose thoracic diseases. However, X-ray images may contain artifacts such as irrelevant objects, medical devices, wires and electrodes that can introduce unnecessary noise, making difficult the distinction of relevant anatomical structures, and hindering accurate diagnoses. We aim in this study to address the issue of these artifacts in order to improve lung diseases classification results. METHODS In this paper we present a novel preprocessing approach which begins by detecting images that contain artifacts and then we reduce the artifacts' noise effect by generating sharper images using a CycleGAN model. The DenseNet-121 model, used for the classification, incorporates channel and spatial attention mechanisms to specifically focus on relevant parts of the image. Additional information contained in the dataset, namely clinical characteristics, were also integrated into the model. RESULTS We evaluated the performance of the classification model before and after applying our proposed artifact preprocessing approach. These results clearly demonstrate that our preprocessing approach significantly improves the model's AUC by 5.91% for pneumonia and 6.44% for consolidation classification, outperforming previous studies for the 14 diseases in the ChestX-Ray14 dataset. CONCLUSION This research highlights the importance of considering the presence of artifacts when diagnosing lung diseases from radiographic images. By eliminating unwanted noise, our approach enables models to focus on relevant diagnostic features, thereby improving their performance. The results demonstrated that our approach is promising, highlighting its potential for broader applications in lung disease classification.
Collapse
Affiliation(s)
| | | | | | - Mathieu Hatt
- LaTIM, INSERM UMR 1101, University of Brest, Brest, France
| | | | | |
Collapse
|
3
|
Li Y, Zhang L, Yu H, Wang J, Wang S, Liu J, Zheng Q. A comprehensive segmentation of chest X-ray improves deep learning-based WHO radiologically confirmed pneumonia diagnosis in children. Eur Radiol 2024; 34:3471-3482. [PMID: 37930411 DOI: 10.1007/s00330-023-10367-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 08/29/2023] [Accepted: 08/31/2023] [Indexed: 11/07/2023]
Abstract
OBJECTIVES To investigate a comprehensive segmentation of chest X-ray (CXR) in promoting deep learning-based World Health Organization's (WHO) radiologically confirmed pneumonia diagnosis in children. METHODS A total of 4400 participants between January 2016 and June 2021were identified for a cross-sectional study and divided into primary endpoint pneumonia (PEP), other infiltrates, and normal groups according to WHO's diagnostic criteria. The CXR was divided into six segments of left lung, right lung, mediastinum, diaphragm, ext-left lung, and ext-right lung by adopting the RA-UNet. To demonstrate the benefits of lung field segmentation in pneumonia diagnosis, the segmented images and images that were not segmented, which constituted seven segmentation combinations, were fed into the CBAM-ResNet under a three-category classification comparison. The interpretability of the CBAM-ResNet for pneumonia diagnosis was also performed by adopting a Grad-CAM module. RESULTS The RA-UNet achieved a high spatial overlap between manual and automatic segmentation (averaged DSC = 0.9639). The CBAM-ResNet when fed with the six segments achieved superior three-category diagnosis performance (accuracy = 0.8243) over other segmentation combinations and deep learning models under comparison, which was increased by around 6% in accuracy, precision, specificity, sensitivity, F1-score, and around 3% in AUC. The Grad-CAM could capture the pneumonia lesions more accurately, generating a more interpretable visualization and enhancing the superiority and reliability of our study in assisting pediatric pneumonia diagnosis. CONCLUSIONS The comprehensive segmentation of CXR could improve deep learning-based pneumonia diagnosis in childhood with a more reasonable WHO's radiological standardized pneumonia classification instead of conventional dichotomous bacterial pneumonia and viral pneumonia. CLINICAL RELEVANCE STATEMENT The comprehensive segmentation of chest X-ray improves deep learning-based WHO confirmed pneumonia diagnosis in children, laying a strong foundation for the potential inclusion of computer-aided pediatric CXR readings in precise classification of pneumonia and PCV vaccine trials efficacy in children. KEY POINTS • The chest X-ray was comprehensively segmented into six anatomical structures of left lung, right lung, mediastinum, diaphragm, ext-left lung, and ext-right lung. • The comprehensive segmentation improved the three-category classification of primary endpoint pneumonia, other infiltrates, and normal with an increase by around 6% in accuracy, precision, specificity, sensitivity, F1-score, and around 3% in AUC. • The comprehensive segmentation gave rise to a more accurate and interpretable visualization results in capturing the pneumonia lesions.
Collapse
Affiliation(s)
- Yuemei Li
- School of Computer and Control Engineering, Yantai University, Yantai, 264005, China
| | - Lin Zhang
- Department of Radiology, Xiamen Children's Hospital, Children's Hospital of Fudan University at Xiamen, Xiamen, Fujian, China
| | - Hu Yu
- School of Computer and Control Engineering, Yantai University, Yantai, 264005, China
| | - Jian Wang
- Department of Radiology, Xiamen Children's Hospital, Children's Hospital of Fudan University at Xiamen, Xiamen, Fujian, China
| | - Shuo Wang
- Yantai University Trier College of Sustainable Technology, Yantai, 264005, Shandong Province, China
- Trier University of Applied Sciences, D-54208, Trier, Germany
| | - Jungang Liu
- Department of Radiology, Xiamen Children's Hospital, Children's Hospital of Fudan University at Xiamen, Xiamen, Fujian, China.
| | - Qiang Zheng
- School of Computer and Control Engineering, Yantai University, Yantai, 264005, China.
| |
Collapse
|
4
|
Fu Y, Xue P, Zhang Z, Dong E. PKA 2-Net: Prior Knowledge-Based Active Attention Network for Accurate Pneumonia Diagnosis on Chest X-Ray Images. IEEE J Biomed Health Inform 2023; 27:3513-3524. [PMID: 37058372 DOI: 10.1109/jbhi.2023.3267057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/15/2023]
Abstract
To accurately diagnose pneumonia patients on a limited annotated chest X-ray image dataset, a prior knowledge-based active attention network (PKA2-Net1) was constructed. The PKA2-Net uses improved ResNet as the backbone network and consists of residual blocks, novel subject enhancement and background suppression (SEBS) blocks and candidate template generators, where template generators are designed to generate candidate templates for characterizing the importance of different spatial locations in feature maps. The core of PKA2-Net is SEBS block, which is proposed based on the prior knowledge that highlighting distinctive features and suppressing irrelevant features can improve the recognition effect. The purpose of SEBS block is to generate active attention features without any high-level features and enhance the ability of the model to localize lung lesions. In SEBS block, first, a series of candidate templates T with different spatial energy distributions are generated and the controllability of the energy distribution in T enables active attention features to maintain the continuity and integrity of the feature space distributions. Second, Top-n templates are selected from T according to certain learning rules, which are then operated by a convolution layer for generating supervision information that can guide the inputs of SEBS block to form active attention features. We evaluated the PKA2-Net on the binary classification problem of identifying pneumonia and healthy controls on a dataset containing 5856 chest X-ray images (ChestXRay2017), the results showed that our method can achieve 97.63% accuracy and 0.9872 sensitivity.
Collapse
|
5
|
Lung Diseases Detection Using Various Deep Learning Algorithms. JOURNAL OF HEALTHCARE ENGINEERING 2023; 2023:3563696. [PMID: 36776955 PMCID: PMC9918362 DOI: 10.1155/2023/3563696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 08/17/2022] [Accepted: 11/24/2022] [Indexed: 02/05/2023]
Abstract
The primary objective of this proposed framework work is to detect and classify various lung diseases such as pneumonia, tuberculosis, and lung cancer from standard X-ray images and Computerized Tomography (CT) scan images with the help of volume datasets. We implemented three deep learning models namely Sequential, Functional & Transfer models and trained them on open-source training datasets. To augment the patient's treatment, deep learning techniques are promising and successful domains that extend the machine learning domain where CNNs are trained to extract features and offers great potential from datasets of images in biomedical application. Our primary aim is to validate our models as a new direction to address the problem on the datasets and then to compare their performance with other existing models. Our models were able to reach higher levels of accuracy for possible solutions and provide effectiveness to humankind for faster detection of diseases and serve as best performing models. The conventional networks have poor performance for tilted, rotated, and other abnormal orientation and have poor learning framework. The results demonstrated that the proposed framework with a sequential model outperforms other existing methods in terms of an F1 score of 98.55%, accuracy of 98.43%, recall of 96.33% for pneumonia and for tuberculosis F1 score of 97.99%, accuracy of 99.4%, and recall of 98.88%. In addition, the functional model for cancer outperformed with an accuracy of 99.9% and specificity of 99.89% and paves way to less number of trained parameters, leading to less computational overhead and less expensive than existing pretrained models. In our work, we implemented a state-of-the art CNN with various models to classify lung diseases accurately.
Collapse
|
6
|
Zhu Z, Li J, Huang J, Li Z, Zhang H, Chen S, Zhong Q, Xie Y, Hu S, Wang Y, Wang D, Yu G. An intelligent prediagnosis system for disease prediction and examination recommendation based on electronic medical record and a medical-semantic-aware convolution neural network (MSCNN) for pediatric chronic cough. Transl Pediatr 2022; 11:1216-1233. [PMID: 35958012 PMCID: PMC9360821 DOI: 10.21037/tp-22-275] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 07/10/2022] [Indexed: 11/06/2022] Open
Abstract
BACKGROUND Due to the phenotypic similarities among different pediatric respiratory diseases with chronic cough, primary doctors often misdiagnose and the misuse of examinations is prevalent. In the pre-diagnosis stage, the patients' chief complaints and other information in the electronic medical record (EMR) provide a powerful reference for respiratory experts to make preliminary disease judgment and examination plan. In this paper, we proposed an intelligent prediagnosis system to predict disease diagnosis and recommend examinations based on EMR text. METHODS We examined the clinical notes of 178,293 children with chronic cough symptoms from retrospective EMR data. The dataset is split into 7:3 for training and testing. From the testing set, we also extract 5% of samples for validation. We proposed a medical-semantic-aware convolution neural network (MSCNN) framework that can accomplish two downstream tasks from the same medical language model through transfer learning. First, a medical language model based on the word2vec algorithm was built to generate embeddings for the text data. Then, text convolutional neural network (TextCNN) was used to build models for disease prediction and examination recommendation. RESULTS We implemented 5 algorithms for disease prediction. In the disease prediction task, our algorithm outperformed the baseline methods on all metrics, with a top-1 accuracy (AC) of 0.68 and a top-3 AC of 0.923 on the testing set. By adding data enhancement, the top-3 AC reached 0.926. In the examination recommendation task, the overall AC on the testing set was 0.93 and the macro average (MA) F1-score was 0.88. The average area under the curve (AUC) on the training set was 0.97 while on the testing set it was 0.86. CONCLUSIONS We constructed an intelligent prediagnosis system with an MSCNN framework that can predict diseases and make examination recommendations based on EMR data. Our approach achieved good results on a retrospective clinical dataset and thus has great potential for the application of automated diagnosis assist in clinical practice during pre-diagnosis stage, which will provide help for primary level doctors or doctors in basic-level hospitals. Due to the generality of the proposed framework, it can be straight forwardly extended to prediagnosis for other diseases.
Collapse
Affiliation(s)
- Zhu Zhu
- Department of Data and Information, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.,National Clinical Research Center for Child Health, Hangzhou, China
| | - Jing Li
- Department of Data and Information, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.,National Clinical Research Center for Child Health, Hangzhou, China
| | - Jian Huang
- Department of Data and Information, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.,National Clinical Research Center for Child Health, Hangzhou, China
| | - Zheming Li
- Department of Data and Information, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.,National Clinical Research Center for Child Health, Hangzhou, China
| | | | - Siyu Chen
- Avain (Hangzhou) Technology Co., Ltd., Hangzhou, China
| | - Qianhui Zhong
- Avain (Hangzhou) Technology Co., Ltd., Hangzhou, China
| | - Yulan Xie
- Avain (Hangzhou) Technology Co., Ltd., Hangzhou, China
| | - Shasha Hu
- Department of Data and Information, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.,National Clinical Research Center for Child Health, Hangzhou, China
| | - Yinshuo Wang
- National Clinical Research Center for Child Health, Hangzhou, China.,Department of Pulmonology, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Dejian Wang
- Department of R&D, Hangzhou Healink Technology, Hangzhou, China
| | - Gang Yu
- Department of Data and Information, The Children's Hospital, Zhejiang University School of Medicine, Hangzhou, China.,National Clinical Research Center for Child Health, Hangzhou, China.,Polytechnic Institute, Zhejiang University, Hangzhou, China
| |
Collapse
|
7
|
Zhang Y, Xiao Q, Deng X, Jiang W. A multi-source information fusion method for ship target recognition based on Bayesian inference and evidence theory. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-211638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The ship target recognition (STR) is greatly related to the battlefield situation awareness, which has recently gained prominence in the military domains. With the diversification and complexity of military missions, ship targets are mostly performed in the form of formations. Therefore, using the formation information to improve the accuracy of the ship target type recognition is worth studying. To effectively identify ship target type, we in this paper jointly consider the ship dynamic, formation, and feature information to propose a STR method based on Bayesian inference and evidence theory. Specifically, we first calculate the ship position distance matrix and the directional distance matrix with the Dynamic Time Warping (DTW) and the difference-vector algorithm taken into account. Then, we use the two distance matrices to obtain the ship formation information at different distance thresholds by the hierarchical clustering method, based on which we can infer the ship type. Thirdly, formation information and other attribute information are as nodes of the Bayesian Network (BN) to infer the ship type. Afterward, we can convert the recognition results at different thresholds into body of evidences (BOEs) as multiple information sources. Finally, we fuse the BOEs to get the final recognition. The proposed method is verified in simulation battle scenario in this paper. The simulation results demonstrate that the proposed method achieves performance superiority as compared with other ship recognition methods in terms of recognition accuracy.
Collapse
Affiliation(s)
- Yu Zhang
- School of Electronics and Information, Northwestern Polytechnical University, Xi’an, Shaanxi, China
| | - Qunli Xiao
- School of Electronics and Information, Northwestern Polytechnical University, Xi’an, Shaanxi, China
| | - Xinyang Deng
- School of Electronics and Information, Northwestern Polytechnical University, Xi’an, Shaanxi, China
| | - Wen Jiang
- School of Electronics and Information, Northwestern Polytechnical University, Xi’an, Shaanxi, China
- Peng Cheng Laboratory, Shenzhen, China
| |
Collapse
|