1
|
Park JS, Park SY, Moon JW, Kim K, Suh DI. Artificial Intelligence Models for Pediatric Lung Sound Analysis: Systematic Review and Meta-Analysis. J Med Internet Res 2025; 27:e66491. [PMID: 40249944 PMCID: PMC12048790 DOI: 10.2196/66491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2024] [Revised: 02/14/2025] [Accepted: 03/13/2025] [Indexed: 04/20/2025] Open
Abstract
BACKGROUND Pediatric respiratory diseases, including asthma and pneumonia, are major causes of morbidity and mortality in children. Auscultation of lung sounds is a key diagnostic tool but is prone to subjective variability. The integration of artificial intelligence (AI) and machine learning (ML) with electronic stethoscopes offers a promising approach for automated and objective lung sound. OBJECTIVE This systematic review and meta-analysis assess the performance of ML models in pediatric lung sound analysis. The study evaluates the methodologies, model performance, and database characteristics while identifying limitations and future directions for clinical implementation. METHODS A systematic search was conducted in Medline via PubMed, Embase, Web of Science, OVID, and IEEE Xplore for studies published between January 1, 1990, and December 16, 2024. Inclusion criteria are as follows: studies developing ML models for pediatric lung sound classification with a defined database, physician-labeled reference standard, and reported performance metrics. Exclusion criteria are as follows: studies focusing on adults, cardiac auscultation, validation of existing models, or lacking performance metrics. Risk of bias was assessed using a modified Quality Assessment of Diagnostic Accuracy Studies (version 2) framework. Data were extracted on study design, dataset, ML methods, feature extraction, and classification tasks. Bivariate meta-analysis was performed for binary classification tasks, including wheezing and abnormal lung sound detection. RESULTS A total of 41 studies met the inclusion criteria. The most common classification task was binary detection of abnormal lung sounds, particularly wheezing. Pooled sensitivity and specificity for wheeze detection were 0.902 (95% CI 0.726-0.970) and 0.955 (95% CI 0.762-0.993), respectively. For abnormal lung sound detection, pooled sensitivity was 0.907 (95% CI 0.816-0.956) and specificity 0.877 (95% CI 0.813-0.921). The most frequently used feature extraction methods were Mel-spectrogram, Mel-frequency cepstral coefficients, and short-time Fourier transform. Convolutional neural networks were the predominant ML model, often combined with recurrent neural networks or residual network architectures. However, high heterogeneity in dataset size, annotation methods, and evaluation criteria were observed. Most studies relied on small, single-center datasets, limiting generalizability. CONCLUSIONS ML models show high accuracy in pediatric lung sound analysis, but face limitations due to dataset heterogeneity, lack of standard guidelines, and limited external validation. Future research should focus on standardized protocols and the development of large-scale, multicenter datasets to improve model robustness and clinical implementation.
Collapse
Affiliation(s)
- Ji Soo Park
- Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Sa-Yoon Park
- The Institute of Convergence Medicine with Innovative Technology, Seoul National University Hospital, Seoul, Republic of Korea
- Department of Physiology, College of Korean Medicine, Wonkwang University, Iksan, Republic of Korea
| | - Jae Won Moon
- Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Kwangsoo Kim
- The Institute of Convergence Medicine with Innovative Technology, Seoul National University Hospital, Seoul, Republic of Korea
- Department of Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Dong In Suh
- Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea
| |
Collapse
|
2
|
Hilberink L, Wehage P, Pashai Fakhri M, Gaedcke S, DeLuca D, Mattis P, Rademacher J. [Artificial intelligence and machine learning in auscultation: prospects of the project DigitaLung]. Pneumologie 2025; 79:229-235. [PMID: 39961349 DOI: 10.1055/a-2507-1486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2025]
Abstract
Auscultation is one of the key medical skills in physical examination. The main problem with auscultation is the lack of objectivity of the findings and great dependence on the experience of the examiner. Auscultation using machine learning and neural networks promises great potential for solving these problems in clinical practice.A selective search for studies in PubMed was carried out, which revealed the possibilities of machine learning in medical diagnostics.In all the studies identified, significant differences were shown between the respective test groups in favour of artificial intelligence (AI). In addition to the positive study results, the limitations of AI could also be analysed and critically scrutinised.Medical research in the field of artificial intelligence is still in its infancy. The prospects and limitations of AI must be further investigated and require close attention in the collaboration between clinicians, scientists and AI experts. Publicly funded projects such as DigitaLung (a digital auscultation system for the differential diagnosis of lung diseases using machine learning), which aims to improve lung auscultation using AI, will help to unlock the diagnostic benefits of AI for patient care and could improve care in the future.
Collapse
Affiliation(s)
- Luca Hilberink
- Respiratory Medicine and Infectious Diseases, Hannover Medical School, Hannover, Deutschland
| | - Pia Wehage
- Respiratory Medicine and Infectious Diseases, Hannover Medical School, Hannover, Deutschland
| | - Milad Pashai Fakhri
- Respiratory Medicine and Infectious Diseases, Hannover Medical School, Hannover, Deutschland
| | - Svenja Gaedcke
- Biomedical Research in Endstage and Obstructive Lung Disease Hannover (BREATH), German Center for Lung Research (DZL), Hannover Medical School, Hannover, Deutschland
| | - David DeLuca
- Biomedical Research in Endstage and Obstructive Lung Disease Hannover (BREATH), German Center for Lung Research (DZL), Hannover Medical School, Hannover, Deutschland
| | - Patricia Mattis
- Business Development Manager, Translational Biomedical Engineering, Fraunhofer Institute for Toxicology and Experimental Medicine ITEM, Hannover, Deutschland
| | - Jessica Rademacher
- Respiratory Medicine and Infectious Diseases, Hannover Medical School, Hannover, Deutschland
| |
Collapse
|
3
|
Hu J, Leow CS, Tao S, Goh WL, Gao Y. Supervised Contrastive Learning Framework and Hardware Implementation of Learned ResNet for Real-Time Respiratory Sound Classification. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2025; 19:185-195. [PMID: 38837933 DOI: 10.1109/tbcas.2024.3409584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]
Abstract
This paper presents a supervised contrastive learning (SCL) framework for respiratory sound classification and the hardware implementation of learned ResNet on field programmable gate array (FPGA) for real-time monitoring. At the algorithmic level, multiple techniques such as features augmentation and MixUp are combined holistically to mitigate the impact of data scarcity and imbalanced classes in the training dataset. Bayesian optimization further enhances the classification accuracy through parameter tuning in pre-processing and SCL. The proposed framework achieves 0.8725 total score (including runtime score) on a ResNet-18 model in both event and record multi-class classification tasks using the SJTU Paediatric Respiratory Sound Database (SPRSound). In addition, algorithm-hardware co-optimizations including Quantization-Aware Training (QAT), merge of network layers, optimization of memory size and number of parallel threads are performed for hardware implementation on FPGA. This approach reduces 40% model size and 70% computation latency. The learned ResNet is implemented on a Xilinx Zynq ZCU102 FPGA with 16ms latency and less than 2% inference score degradation compared to the software model.
Collapse
|
4
|
Romaszko-Wojtowicz A, Jaśkiewicz Ł, Jurczak P, Doboszyńska A. Telemedicine in Primary Practice in the Age of the COVID-19 Pandemic-Review. MEDICINA (KAUNAS, LITHUANIA) 2023; 59:1541. [PMID: 37763659 PMCID: PMC10532942 DOI: 10.3390/medicina59091541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 08/18/2023] [Accepted: 08/22/2023] [Indexed: 09/29/2023]
Abstract
Background and Objectives: In the era of the COVID-19 pandemic, telemedicine, so far underestimated, has gained in value. Currently, telemedicine is not only a telephone or chat consultation, but also the possibility of the remote recording of signals (such as ECG, saturation, and heart rate) or even remote auscultation of the lungs. The objective of this review article is to present a potential role for, and disseminate knowledge of, telemedicine during the COVID-19 pandemic. Material and Methods: In order to analyze the research material in accordance with PRISMA guidelines, a systematic search of the ScienceDirect, Web of Science, and PubMed databases was conducted. Out of the total number of 363 papers identified, 22 original articles were subjected to analysis. Results: This article presents the possibilities of remote patient registration, which contributes to an improvement in remote diagnostics and diagnoses. Conclusions: Telemedicine is, although not always and not by everyone, an accepted form of providing medical services. It cannot replace direct patient-doctor contact, but it can undoubtedly contribute to accelerating diagnoses and improving their quality at a distance.
Collapse
Affiliation(s)
- Anna Romaszko-Wojtowicz
- Department of Pulmonology, School of Public Health, Collegium Medicum, University of Warmia and Mazury in Olsztyn, 10-719 Olsztyn, Poland;
| | - Łukasz Jaśkiewicz
- Department of Human Physiology and Pathophysiology, School of Medicine, Collegium Medicum, University of Warmia and Mazury in Olsztyn, 10-082 Olsztyn, Poland;
| | - Paweł Jurczak
- Student Scientific Club of Cardiopulmonology and Rare Diseases of the Respiratory System, School of Medicine, Collegium Medicum, University of Warmia and Mazury in Olsztyn, 10-082 Olsztyn, Poland;
| | - Anna Doboszyńska
- Department of Pulmonology, School of Public Health, Collegium Medicum, University of Warmia and Mazury in Olsztyn, 10-719 Olsztyn, Poland;
| |
Collapse
|
5
|
Sethi AK, Muddaloor P, Anvekar P, Agarwal J, Mohan A, Singh M, Gopalakrishnan K, Yadav A, Adhikari A, Damani D, Kulkarni K, Aakre CA, Ryu AJ, Iyer VN, Arunachalam SP. Digital Pulmonology Practice with Phonopulmography Leveraging Artificial Intelligence: Future Perspectives Using Dual Microwave Acoustic Sensing and Imaging. SENSORS (BASEL, SWITZERLAND) 2023; 23:5514. [PMID: 37420680 DOI: 10.3390/s23125514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 06/01/2023] [Accepted: 06/05/2023] [Indexed: 07/09/2023]
Abstract
Respiratory disorders, being one of the leading causes of disability worldwide, account for constant evolution in management technologies, resulting in the incorporation of artificial intelligence (AI) in the recording and analysis of lung sounds to aid diagnosis in clinical pulmonology practice. Although lung sound auscultation is a common clinical practice, its use in diagnosis is limited due to its high variability and subjectivity. We review the origin of lung sounds, various auscultation and processing methods over the years and their clinical applications to understand the potential for a lung sound auscultation and analysis device. Respiratory sounds result from the intra-pulmonary collision of molecules contained in the air, leading to turbulent flow and subsequent sound production. These sounds have been recorded via an electronic stethoscope and analyzed using back-propagation neural networks, wavelet transform models, Gaussian mixture models and recently with machine learning and deep learning models with possible use in asthma, COVID-19, asbestosis and interstitial lung disease. The purpose of this review was to summarize lung sound physiology, recording technologies and diagnostics methods using AI for digital pulmonology practice. Future research and development in recording and analyzing respiratory sounds in real time could revolutionize clinical practice for both the patients and the healthcare personnel.
Collapse
Affiliation(s)
- Arshia K Sethi
- GIH Artificial Intelligence Laboratory (GAIL), Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Pratyusha Muddaloor
- GIH Artificial Intelligence Laboratory (GAIL), Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | | | - Joshika Agarwal
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Anmol Mohan
- GIH Artificial Intelligence Laboratory (GAIL), Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | | | - Keerthy Gopalakrishnan
- GIH Artificial Intelligence Laboratory (GAIL), Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
- Microwave Engineering and Imaging Laboratory (MEIL), Division of Gastroenterology & Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Ashima Yadav
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Aakriti Adhikari
- GIH Artificial Intelligence Laboratory (GAIL), Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Devanshi Damani
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA
- Department of Internal Medicine, Texas Tech University Health Science Center, El Paso, TX 79995, USA
| | - Kanchan Kulkarni
- INSERM, Centre de Recherche Cardio-Thoracique de Bordeaux, University of Bordeaux, U1045, F-33000 Bordeaux, France
- IHU Liryc, Heart Rhythm Disease Institute, Fondation Bordeaux Université, F-33600 Pessac, France
| | | | - Alexander J Ryu
- Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Vivek N Iyer
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Shivaram P Arunachalam
- GIH Artificial Intelligence Laboratory (GAIL), Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
- Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
- Department of Radiology, Mayo Clinic, Rochester, MN 55905, USA
- Microwave Engineering and Imaging Laboratory (MEIL), Division of Gastroenterology & Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA
| |
Collapse
|
6
|
Hong JW, Kim SH, Han GT. Detection of Multiple Respiration Patterns Based on 1D SNN from Continuous Human Breathing Signals and the Range Classification Method for Each Respiration Pattern. SENSORS (BASEL, SWITZERLAND) 2023; 23:s23115275. [PMID: 37300002 DOI: 10.3390/s23115275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 05/27/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023]
Abstract
Human respiratory information is being used as an important source of biometric information that can enable the analysis of health status in the healthcare domain. The analysis of the frequency or duration of a specific respiration pattern and the classification of respiration patterns in the corresponding section for a certain period of time are important for the utilization of respiratory information in various ways. Existing methods require window slide processing to classify sections for each respiration pattern from the breathing data for a certain time period. In this case, when multiple respiration patterns exist within one window, the recognition rate can be lowered. To solve this problem, a 1D Siamese neural network (SNN)-based human respiration pattern detection model and a merge-and-split algorithm for the classification of multiple respiration patterns in each region for all respiration sections are proposed in this study. When calculating the accuracy based on intersection over union (IOU) for the respiration range classification result for each pattern, the accuracy was found to be improved by approximately 19.3% compared with the existing deep neural network (DNN) and 12.4% compared with a 1D convolutional neural network (CNN). The accuracy of detection based on the simple respiration pattern was approximately 14.5% higher than that of the DNN and 5.3% higher than that of the 1D CNN.
Collapse
Affiliation(s)
- Jin-Woo Hong
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| | | | - Gi-Tae Han
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| |
Collapse
|
7
|
Zhang Q, Zhang J, Yuan J, Huang H, Zhang Y, Zhang B, Lv G, Lin S, Wang N, Liu X, Tang M, Wang Y, Ma H, Liu L, Yuan S, Zhou H, Zhao J, Li Y, Yin Y, Zhao L, Wang G, Lian Y. SPRSound: Open-Source SJTU Paediatric Respiratory Sound Database. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2022; 16:867-881. [PMID: 36070274 DOI: 10.1109/tbcas.2022.3204910] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
It has proved that the auscultation of respiratory sound has advantage in early respiratory diagnosis. Various methods have been raised to perform automatic respiratory sound analysis to reduce subjective diagnosis and physicians' workload. However, these methods highly rely on the quality of respiratory sound database. In this work, we have developed the first open-access paediatric respiratory sound database, SPRSound. The database consists of 2,683 records and 9,089 respiratory sound events from 292 participants. Accurate label is important to achieve a good prediction for adventitious respiratory sound classification problem. A custom-made sound label annotation software (SoundAnn) has been developed to perform sound editing, sound annotation, and quality assurance evaluation. A team of 11 experienced paediatric physicians is involved in the entire process to establish golden standard reference for the dataset. To verify the robustness and accuracy of the classification model, we have investigated the effects of different feature extraction methods and machine learning classifiers on the classification performance of our dataset. As such, we have achieved a score of 75.22%, 61.57%, 56.71%, and 37.84% for the four different classification challenges at the event level and record level.
Collapse
|