1
|
Jang K, Li L, Le TH, Setiani A, Rami FZ, Kim H, Chung YC. Acoustic biomarkers for schizophrenia spectrum disorders and their associations with symptoms and cognitive functioning. Prog Neuropsychopharmacol Biol Psychiatry 2025; 138:111339. [PMID: 40120847 DOI: 10.1016/j.pnpbp.2025.111339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 03/13/2025] [Accepted: 03/16/2025] [Indexed: 03/25/2025]
Abstract
BACKGROUNDS Acoustic biomarkers for schizophrenia spectrum disorders (SSDs) hold great promise due to their capacity to capture emotional information, which is often impaired in these patients. These biomarkers are easily accessible, noninvasive, objective, and cost-effective. This study investigated the accuracy of different machine learning (ML) models in classifying patients with SSDs or schizophrenia (SZ) versus healthy controls (HCs), as well as patients with cognitive-deficit (Cog-D) versus cognitive-non-deficit (Cog-ND) versus HCs. Additionally, correlations of the top 25 features contributing to these classifications with psychopathology and cognitive functioning were explored. METHODS Speech data were collected from patients with SSDs (n = 238) and HCs (n = 157) using multiple tasks, including the reading of emotional sentences. The Extrapyramidal Symptom Rating Scale (ESRS) was used to control for potential medication effects on speech. Acoustic features were extracted using the openSMILE toolkit, and models were trained with 10-fold cross-validation. Partial correlation analysis, adjusted for ESRS and chlorpromazine (CPZ) equivalent, was conducted between the top 25 features and measures of psychopathology and cognitive functioning. RESULTS Among the five ML models, accuracy of support vector machine (SVM) model was the best. It classified SSDs versus HCs with 83 % accuracy when using all 7 tasks, and 85 % when using only the happy sentences task. The SVM classification accuracy for Cog-D versus Cog-ND within SSDs was poor across all tasks; however, the accuracy for Cog-D versus HCs was 79 % when using free speech or happy sentences. The accuracy for classifying SZ versus HCs and Cog-D versus Cog-ND versus HCs exhibited variations. Several of the top 25 acoustic features correlated significantly with attention and verbal memory in patients with SSDs. CONCLUSIONS Our findings suggested that acoustic analysis, combined with a ML approach, could be used to classify successfully SSDs or the Cog-D subtype versus HCs. Features related to pitch, loudness, and timbre were particularly associated with attention in patients with SSDs. Future research should explore further the potential applications of acoustic biomarkers in multi-class classification, treatment response, and relapse detection in patients with SSDs.
Collapse
Affiliation(s)
- Kangwook Jang
- School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Ling Li
- Department of Psychiatry, Jeonbuk National University Medical School, Jeonju, Republic of Korea; Research Institute of Clinical Medicine of Jeonbuk National University, Biomedical Research Institute of Jeonbuk National University Hospital, Republic of Korea
| | - Thi-Hung Le
- Department of Psychiatry, Jeonbuk National University Medical School, Jeonju, Republic of Korea; Research Institute of Clinical Medicine of Jeonbuk National University, Biomedical Research Institute of Jeonbuk National University Hospital, Republic of Korea
| | - Ariana Setiani
- Department of Psychiatry, Jeonbuk National University Medical School, Jeonju, Republic of Korea; Research Institute of Clinical Medicine of Jeonbuk National University, Biomedical Research Institute of Jeonbuk National University Hospital, Republic of Korea
| | - Fatima Zahra Rami
- Department of Psychiatry, Jeonbuk National University Medical School, Jeonju, Republic of Korea; Research Institute of Clinical Medicine of Jeonbuk National University, Biomedical Research Institute of Jeonbuk National University Hospital, Republic of Korea
| | - Hoirin Kim
- School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Young Chul Chung
- Department of Psychiatry, Jeonbuk National University Hospital, Jeonju, Republic of Korea; Department of Psychiatry, Jeonbuk National University Medical School, Jeonju, Republic of Korea; Research Institute of Clinical Medicine of Jeonbuk National University, Biomedical Research Institute of Jeonbuk National University Hospital, Republic of Korea.
| |
Collapse
|
2
|
Lee J, Kim N, Ha JW, Kang K, Park E, Yoon J, Park KS. Exploring Voice Acoustic Features Associated with Cognitive Status in Korean Speakers: A Preliminary Machine Learning Study. Diagnostics (Basel) 2024; 14:2837. [PMID: 39767198 PMCID: PMC11675567 DOI: 10.3390/diagnostics14242837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Revised: 12/02/2024] [Accepted: 12/12/2024] [Indexed: 01/11/2025] Open
Abstract
Objective: To develop a non-invasive cognitive impairment detection system using speech data analysis, addressing the growing global dementia crisis and enabling accessible early screening through daily health monitoring. Methods: Speech data from 223 Korean patients were collected across eight tasks. Patients were classified based on Korean Mini-Mental State Examination scores. Four machine learning models were tested for three binary classification tasks. Voice acoustic features were extracted and analyzed. Results: The Deep Neural Network model performed best in two classification tasks, with Precision-Recall Area Under the Curve scores of 0.737 for severe vs. no impairment and 0.726 for mild vs. no impairment, while Random Forest achieved 0.715 for severe + mild vs. no impairment. Several acoustic features emerged as potentially important indicators, with DDA shimmer from the /i/ task and stdevF0 from the /puh-tuh-kuh/ task showing consistent patterns across classification tasks. Conclusions: This preliminary study suggests that certain acoustic features may be associated with cognitive status, though demographic factors significantly influence these relationships. Further research with demographically matched populations is needed to validate these findings.
Collapse
Affiliation(s)
- Jiho Lee
- Neopons Inc., Daegu 41260, Republic of Korea; (J.L.); (N.K.)
| | - Nayeon Kim
- Neopons Inc., Daegu 41260, Republic of Korea; (J.L.); (N.K.)
| | - Ji-Wan Ha
- Department of Speech-Language Pathology, Daegu University, Gyeongsan 38453, Republic of Korea;
| | - Kyunghun Kang
- Department of Neurology, School of Medicine, Kyungpook National University, Daegu 41404, Republic of Korea;
| | - Eunhee Park
- Department of Rehabilitation Medicine, School of Medicine, Kyungpook National University, Daegu 41404, Republic of Korea;
| | - Janghyeok Yoon
- Department of Industrial Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Ki-Su Park
- Department of Neurosurgery, School of Medicine, Kyungpook National University, Daegu 41404, Republic of Korea
| |
Collapse
|
3
|
Martínez-Cano A, de la Sacristana RFBG, Martín-Conty JL, Mordillo-Mateos L, Bernal-Jimenéz JJ, Polonio-López B, Martínez-Lorca M. Fundamental Frequency of the Voice in Schizophrenia and Its Value as a Biomarker of the Disease. J Voice 2024:S0892-1997(24)00394-1. [PMID: 39690086 DOI: 10.1016/j.jvoice.2024.11.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Revised: 11/02/2024] [Accepted: 11/05/2024] [Indexed: 12/19/2024]
Abstract
Recent research on schizophrenia seeks to identify objective biomarkers of the disease. The voice, and in particular the fundamental frequency (F0), could be one of them. METHODOLOGY We conducted a cross-sectional and descriptive study with a sample of 154 people. Of these, 46 were diagnosed with schizophrenia, 41 were at substance abuse, and 67 formed the control group, matched in variables of sex, age, and educational level, but without substance use compared with the high-risk group. RESULTS The biomechanical analyses of the voice indicated significant differences between the groups, differentiated by gender: in men (F = 5.316; P = 0.006) and in women (F = 4.13; P = 0.004). The greatest differences between groups were observed in the group of vulnerable individuals, with some stability of the F0 in people with schizophrenia. Furthermore, we found correlations between positive symptoms and decreased F0 (r = -0.353; P = 0.016). CONCLUSIONS Our study shows that schizophrenia is associated with decreased F0 in both men and women, and that medication could stabilize this decrease. These findings have important implications for the objective monitoring and diagnosis of schizophrenia.
Collapse
Affiliation(s)
- Alfonso Martínez-Cano
- Faculty of Health Sciences, University of Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Department of Medical Sciences, University of Castilla-La Mancha, Albacete 02071, Spain
| | - Roberto Fernández-Baillo Gallego de la Sacristana
- University of Alcalá de Henares, Madrid 28801, Spain; Department of Surgery, Medical and Social Sciences, Human Anatomy and Embryology, University Campus-C/19, Ctra. Madrid-Barcelona, Km 33.600, Alcalá de Henares 28805 Madrid, Spain
| | - Jose Luis Martín-Conty
- Faculty of Health Sciences, University of Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Department of Nursing, Physiotherapy and Occupational Therapy, Faculty of Health Sciences, Universidad de Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Technological Innovation Applied to Health Research Group (ITAS Group), Faculty of Health Sciences, University of de Castilla-La Mancha, Talavera de la Reina, 45600, Spain
| | - Laura Mordillo-Mateos
- Faculty of Health Sciences, University of Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Department of Nursing, Physiotherapy and Occupational Therapy, Faculty of Health Sciences, Universidad de Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Technological Innovation Applied to Health Research Group (ITAS Group), Faculty of Health Sciences, University of de Castilla-La Mancha, Talavera de la Reina, 45600, Spain
| | - Juan José Bernal-Jimenéz
- Faculty of Health Sciences, University of Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Department of Nursing, Physiotherapy and Occupational Therapy, Faculty of Health Sciences, Universidad de Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Technological Innovation Applied to Health Research Group (ITAS Group), Faculty of Health Sciences, University of de Castilla-La Mancha, Talavera de la Reina, 45600, Spain
| | - Begoña Polonio-López
- Faculty of Health Sciences, University of Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Technological Innovation Applied to Health Research Group (ITAS Group), Faculty of Health Sciences, University of de Castilla-La Mancha, Talavera de la Reina, 45600, Spain.
| | - Manuela Martínez-Lorca
- Faculty of Health Sciences, University of Castilla-La Mancha, Talavera de la Reina, 45600, Spain; Department of Psychology, University of Castilla-La Mancha, Albacete 02071, Spain
| |
Collapse
|
4
|
He R, de la Foz VO, Cacho LMF, Homan P, Sommer I, Ayesa-arriola R, Hinzen W. Task-voting for schizophrenia spectrum disorders prediction using machine learning across linguistic feature domains.. [DOI: 10.1101/2024.08.31.24312886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/06/2024]
Abstract
AbstractBackground and HypothesisIdentifying schizophrenia spectrum disorders (SSD) from spontaneous speech features is a key focus in computational psychiatry today.Study DesignWe present a task-voting procedure using different speech-elicitation tasks to predict SSD in Spanish, followed by ablation studies highlighting the roles of specific tasks and feature domains. Speech from five tasks was recorded from 92 subjects (49 with SSD and 41 controls). A total of 319 features were automatically extracted, from which 24 were pre-selected based on between-feature correlations and ANOVA F-values, covering acoustic-prosody, morphosyntax, and semantic similarity metrics.Study ResultsExtraTrees-based classification using these features yielded an accuracy of 0.840 on hold-out data. Ablating picture descriptions impaired performance most, followed by story reading, retelling, and free speech. Removing morphosyntactic measures impaired performance most, followed by acoustic and semantic measures. Mixed-effect models suggested significant group differences on all 24 features. In SSD, speech patterns were slower and more variable temporally, while variations in pitch, amplitude, and sound intensity decreased. Semantic similarity between speech and prompts decreased, while minimal distances from embedding centroids to each word increased, and word-to-word similarity arrays became more predictable, all replicating patterns documented in other languages. Morphosyntactically, SSD patients used more first-person pronouns together with less third-person pronouns, and more punctuations and negations. Semantic metrics correlated with a range of positive symptoms, and multiple acoustic-prosodic features with negative symptoms.ConclusionsThis study highlights the importance of combining different speech tasks and features for SSD detection, and validates previously found patterns in psychosis for Spanish.
Collapse
|
5
|
Tan EJ, Rossell SL. Exploring associations between trait symptoms and speech patterns in schizophrenia spectrum disorders: A mediation analysis. Schizophr Res 2024; 270:188-190. [PMID: 38917556 DOI: 10.1016/j.schres.2024.06.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 05/23/2024] [Accepted: 06/15/2024] [Indexed: 06/27/2024]
Affiliation(s)
- Eric Josiah Tan
- Memory, Aging and Cognition Centre, National University Health System, Singapore; Department of Pharmacology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Centre for Mental Health and Brain Sciences, School of Health Sciences, Swinburne University of Technology, Hawthorn, Australia.
| | - Susan Lee Rossell
- Centre for Mental Health and Brain Sciences, School of Health Sciences, Swinburne University of Technology, Hawthorn, Australia; Department of Psychiatry, St Vincent's Hospital, Melbourne, Australia
| |
Collapse
|
6
|
Huang J, Zhao Y, Tian Z, Qu W, Du X, Zhang J, Tan Y, Wang Z, Tan S. Evaluating the clinical utility of speech analysis and machine learning in schizophrenia: A pilot study. Comput Biol Med 2023; 164:107359. [PMID: 37591160 DOI: 10.1016/j.compbiomed.2023.107359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 07/04/2023] [Accepted: 08/12/2023] [Indexed: 08/19/2023]
Abstract
BACKGROUND Schizophrenia is a serious mental disorder that significantly impacts social functioning and quality of life. However, current diagnostic methods lack objective biomarker support. While some studies have indicated differences in audio features between patients with schizophrenia and healthy controls, these findings are influenced by demographic information and variations in experimental paradigms. Therefore, it is crucial to explore stable and reliable audio biomarkers for an auxiliary diagnosis and disease severity prediction of schizophrenia. METHOD A total of 130 individuals (65 patients with schizophrenia and 65 healthy controls) read three fixed texts containing positive, neutral, and negative emotions, and recorded them. All audio signals were preprocessed and acoustic features were extracted by a librosa-0.9.2 toolkit. Independent sample t-tests were performed on two sets of acoustic features, and Pearson correlation on the acoustic features and Positive and Negative Syndrome Scale (PANSS) scores of the schizophrenia group. Classification algorithms in scikit-learn were used to diagnose schizophrenia and predict the level of negative symptoms. RESULTS Significant differences were observed between the two groups in the mfcc_8, mfcc_11, and mfcc_33 of mel-frequency cepstral coefficient (MFCC). Furthermore, a significant correlation was found between mfcc_7 and the negative PANSS scores. Through acoustic features, we could not only differentiate patients with schizophrenia from healthy controls with an accuracy of 0.815 but also predict the grade of the negative symptoms in schizophrenia with an average accuracy of 0.691. CONCLUSIONS The results demonstrated the considerable potential of acoustic characteristics as reliable biomarkers for diagnosing schizophrenia and predicting clinical symptoms.
Collapse
Affiliation(s)
- Jie Huang
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Yanli Zhao
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Zhanxiao Tian
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Wei Qu
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Xia Du
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Jie Zhang
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Yunlong Tan
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Zhiren Wang
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China
| | - Shuping Tan
- Beijing HuiLongGuan Hospital, Peking University HuiLongGuan Clinical Medical School, Beijing, 100096, China.
| |
Collapse
|