1
|
Idrisoglu A, Moraes ALD, Cheddad A, Anderberg P, Jakobsson A, Berglund JS. Vowel segmentation impact on machine learning classification for chronic obstructive pulmonary disease. Sci Rep 2025; 15:9930. [PMID: 40121302 PMCID: PMC11929820 DOI: 10.1038/s41598-025-95320-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Accepted: 03/20/2025] [Indexed: 03/25/2025] Open
Abstract
Vowel-based voice analysis is gaining attention as a potential non-invasive tool for COPD classification, offering insights into phonatory function. The growing need for voice data has necessitated the adoption of various techniques, including segmentation, to augment existing datasets for training comprehensive Machine Learning (ML) modelsThis study aims to investigate the possible effects of segmentation of the utterance of vowel "a" on the performance of ML classifiers CatBoost (CB), Random Forest (RF), and Support Vector Machine (SVM). This research involves training individual ML models using three distinct dataset constructions: full-sequence, segment-wise, and group-wise, derived from the utterance of the vowel "a" which consists of 1058 recordings belonging to 48 participants. This approach comprehensively analyzes how each data categorization impacts the model's performance and results. A nested cross-validation (nCV) approach was implemented with grid search for hyperparameter optimization. This rigorous methodology was employed to minimize overfitting risks and maximize model performance. Compared to the full-sequence dataset, the findings indicate that the second segment yielded higher results within the four-segment category. Specifically, the CB model achieved superior accuracy, attaining 97.8% and 84.6% on the validation and test sets, respectively. The same category for the CB model also demonstrated the best balance regarding true positive rate (TPR) and true negative rate (TNR), making it the most clinically effective choice. These findings suggest that time-sensitive properties in vowel production are important for COPD classification and that segmentation can aid in capturing these properties. Despite these promising results, the dataset size and demographic homogeneity limit generalizability, highlighting areas for future research.Trial registration The study is registered on clinicaltrials.gov with ID: NCT06160674.
Collapse
Affiliation(s)
- Alper Idrisoglu
- Department of Health, Blekinge Institute of Technology, 371 41, Karlskrona, Sweden.
| | | | - Abbas Cheddad
- Department of Health, Blekinge Institute of Technology, 371 41, Karlskrona, Sweden
- Institute of Computer Science, University of Tartu, Narva mnt 18, 51009, Tartu, Estonia
| | - Peter Anderberg
- Department of Health, Blekinge Institute of Technology, 371 41, Karlskrona, Sweden
| | | | | |
Collapse
|
2
|
Yousefi M, Akhbari M, Mohamadi Z, Karami S, Dasoomi H, Atabi A, Sarkeshikian SA, Abdoullahi Dehaki M, Bayati H, Mashayekhi N, Varmazyar S, Rahimian Z, Asadi Anar M, Shafiei D, Mohebbi A. Machine learning based algorithms for virtual early detection and screening of neurodegenerative and neurocognitive disorders: a systematic-review. Front Neurol 2024; 15:1413071. [PMID: 39717687 PMCID: PMC11663744 DOI: 10.3389/fneur.2024.1413071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Accepted: 11/05/2024] [Indexed: 12/25/2024] Open
Abstract
Background and aim Neurodegenerative disorders (e.g., Alzheimer's, Parkinson's) lead to neuronal loss; neurocognitive disorders (e.g., delirium, dementia) show cognitive decline. Early detection is crucial for effective management. Machine learning aids in more precise disease identification, potentially transforming healthcare. This comprehensive systematic review discusses how machine learning (ML), can enhance early detection of these disorders, surpassing traditional diagnostics' constraints. Methods In this review, databases were examined up to August 15th, 2023, for ML data on neurodegenerative and neurocognitive diseases using PubMed, Scopus, Google Scholar, and Web of Science. Two investigators used the RAYYAN intelligence tool for systematic reviews to conduct the screening. Six blinded reviewers reviewed titles/abstracts. Cochrane risk of bias tool was used for quality assessment. Results Our search found 7,069 research studies, of which 1,365 items were duplicates and thus removed. Four thousand three hundred and thirty four studies were screened, and 108 articles met the criteria for inclusion after preprocessing. Twelve ML algorithms were observed for dementia, showing promise in early detection. Eighteen ML algorithms were identified for Parkinson's, each effective in detection and diagnosis. Studies emphasized that ML algorithms are necessary for Alzheimer's to be successful. Fourteen ML algorithms were discovered for mild cognitive impairment, with LASSO logistic regression being the only one with unpromising results. Conclusion This review emphasizes the pressing necessity of integrating verified digital health resources into conventional medical practice. This integration may signify a new era in the early detection of neurodegenerative and neurocognitive illnesses, potentially changing the course of these conditions for millions globally. This study showcases specific and statistically significant findings to illustrate the progress in the area and the prospective influence of these advancements on the global management of neurocognitive and neurodegenerative illnesses.
Collapse
Affiliation(s)
- Milad Yousefi
- Institute for Cognitive and Brain Sciences, Shahid Beheshti University, Tehran, Iran
| | - Matin Akhbari
- Faculty of Medicine, Istanbul Yeni Yuzyil University, Istanbul, Türkiye
| | - Zhina Mohamadi
- School of Medicine, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Shaghayegh Karami
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Hediyeh Dasoomi
- Student Research Committee, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Alireza Atabi
- School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | | | - Mahdi Abdoullahi Dehaki
- Master’s of AI Engineering, Islamic Azad University Tehran Science and Research Branch, Tehran, Iran
| | - Hesam Bayati
- Department of Radiology, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Negin Mashayekhi
- Department of Neuroscience, Bahçeşehir University, Istanbul, Türkiye
| | - Shirin Varmazyar
- School of Medicine, Shahroud University of Medical Sciences, Shahrud, Iran
| | - Zahra Rahimian
- School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Mahsa Asadi Anar
- Student Research Committee, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Daniel Shafiei
- School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Alireza Mohebbi
- Students Research Committee, Ardabil University of Medical Sciences, Ardabil, Iran
| |
Collapse
|
3
|
Dubbioso R, Spisto M, Verde L, Iuzzolino VV, Senerchia G, Salvatore E, De Pietro G, De Falco I, Sannino G. Voice signals database of ALS patients with different dysarthria severity and healthy controls. Sci Data 2024; 11:800. [PMID: 39030186 PMCID: PMC11271596 DOI: 10.1038/s41597-024-03597-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 07/02/2024] [Indexed: 07/21/2024] Open
Abstract
This paper describes a new publicly-available database of VOiCe signals acquired in Amyotrophic Lateral Sclerosis (ALS) patients (VOC-ALS) and healthy controls performing different speech tasks. This dataset consists of 1224 voice signals recorded from 153 participants: 51 healthy controls (32 males and 19 females) and 102 ALS patients (65 males and 37 females) with different severity of dysarthria. Each subject's voice was recorded using a smartphone application (Vox4Health) while performing several vocal tasks, including a sustained phonation of the vowels /a/, /e/, /i/, /o/, /u/ and /pa/, /ta/, /ka/ syllable repetition. Basic derived speech metrics such as harmonics-to-noise ratio, mean and standard deviation of fundamental frequency (F0), jitter and shimmer were calculated. The F0 standard deviation of vowels and syllables showed an excellent ability to identify people with ALS and to discriminate the different severity of dysarthria. These data represent the most comprehensive database of voice signals in ALS and form a solid basis for research on the recognition of voice impairment in ALS patients for use in clinical applications.
Collapse
Affiliation(s)
- Raffaele Dubbioso
- Department of Neurosciences, Reproductive Sciences and Odontostomatology, University of Naples "Federico II", Naples, 80131, Italy
| | - Myriam Spisto
- Department of Psychology of the University of Campania "Luigi Vanvitelli", Caserta, 81100, Italy
| | - Laura Verde
- Department of Mathematics and Physics of the University of Campania "Luigi Vanvitelli", Caserta, 81100, Italy
| | - Valentina Virginia Iuzzolino
- Department of Neurosciences, Reproductive Sciences and Odontostomatology, University of Naples "Federico II", Naples, 80131, Italy
| | - Gianmaria Senerchia
- Department of Neurosciences, Reproductive Sciences and Odontostomatology, University of Naples "Federico II", Naples, 80131, Italy
| | - Elena Salvatore
- Department of Advanced Biomedical Sciences, University of Naples "Federico II", Naples, 80131, Italy
| | - Giuseppe De Pietro
- Department of Information Sciences and Technologies, Pegaso University, Naples, 80143, Italy
| | - Ivanoe De Falco
- National Research Council of Italy (CNR), Institute for High-Performance Computing and Networking (ICAR), Naples, 80131, Italy
| | - Giovanna Sannino
- National Research Council of Italy (CNR), Institute for High-Performance Computing and Networking (ICAR), Naples, 80131, Italy.
| |
Collapse
|
4
|
Milella G, Sciancalepore D, Cavallaro G, Piccirilli G, Nanni AG, Fraddosio A, D’Errico E, Paolicelli D, Fiorella ML, Simone IL. Acoustic Voice Analysis as a Useful Tool to Discriminate Different ALS Phenotypes. Biomedicines 2023; 11:2439. [PMID: 37760880 PMCID: PMC10525613 DOI: 10.3390/biomedicines11092439] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 08/24/2023] [Accepted: 08/29/2023] [Indexed: 09/29/2023] Open
Abstract
Approximately 80-96% of people with amyotrophic lateral sclerosis (ALS) become unable to speak during the disease progression. Assessing upper and lower motor neuron impairment in bulbar regions of ALS patients remains challenging, particularly in distinguishing spastic and flaccid dysarthria. This study aimed to evaluate acoustic voice parameters as useful biomarkers to discriminate ALS clinical phenotypes. Triangular vowel space area (tVSA), alternating motion rates (AMRs), and sequential motion rates (SMRs) were analyzed in 36 ALS patients and 20 sex/age-matched healthy controls (HCs). tVSA, AMR, and SMR values significantly differed between ALS and HCs, and between ALS with prevalent upper (pUMN) and lower motor neuron (pLMN) impairment. tVSA showed higher accuracy in discriminating pUMN from pLMN patients. AMR and SMR were significantly lower in patients with bulbar onset than those with spinal onset, both with and without bulbar symptoms. Furthermore, these values were also lower in patients with spinal onset associated with bulbar symptoms than in those with spinal onset alone. Additionally, AMR and SMR values correlated with the degree of dysphagia. Acoustic voice analysis may be considered a useful prognostic tool to differentiate spastic and flaccid dysarthria and to assess the degree of bulbar involvement in ALS.
Collapse
Affiliation(s)
- Giammarco Milella
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Diletta Sciancalepore
- Otolaryngology Unit, Department of Translational Biomedicine and Neurosciences (DiBraiN), University of Bari Aldo Moro, 70121 Bari, Italy; (D.S.); (G.C.); (M.L.F.)
| | - Giada Cavallaro
- Otolaryngology Unit, Department of Translational Biomedicine and Neurosciences (DiBraiN), University of Bari Aldo Moro, 70121 Bari, Italy; (D.S.); (G.C.); (M.L.F.)
| | - Glauco Piccirilli
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Alfredo Gabriele Nanni
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Angela Fraddosio
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Eustachio D’Errico
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Damiano Paolicelli
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Maria Luisa Fiorella
- Otolaryngology Unit, Department of Translational Biomedicine and Neurosciences (DiBraiN), University of Bari Aldo Moro, 70121 Bari, Italy; (D.S.); (G.C.); (M.L.F.)
| | | |
Collapse
|