1
|
Short-term effects of transcranial direct current stimulation on motor speech in Parkinson's disease: a pilot study. J Neural Transm (Vienna) 2024:10.1007/s00702-024-02771-5. [PMID: 38592459 DOI: 10.1007/s00702-024-02771-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 03/25/2024] [Indexed: 04/10/2024]
Abstract
INTRODUCTION Hypokinetic dysarthria (HD) is a common motor speech symptom of Parkinson's disease (PD) which does not respond well to PD treatments. We investigated short-term effects of transcranial direct current stimulation (tDCS) on HD in PD using acoustic analysis of speech. Based on our previous studies we focused on stimulation of the right superior temporal gyrus (STG) - an auditory feedback area. METHODS In 14 PD patients with HD, we applied anodal, cathodal and sham tDCS to the right STG using a cross-over design. A protocol consisting of speech tasks was performed prior to and immediately after each stimulation session. Linear mixed models were used for the evaluation of the effects of each stimulation condition on the relative change of acoustic parameters. We also performed a simulation of the mean electric field induced by tDCS. RESULTS Linear mixed model showed a statistically significant effect of the stimulation condition on the relative change of median duration of silences longer than 50 ms (p = 0.015). The relative change after the anodal stimulation (mean = -5.9) was significantly lower as compared to the relative change after the sham stimulation (mean = 12.8), p = 0.014. We also found a correlation between the mean electric field magnitude in the right STG and improvement of articulation precision after anodal tDCS (R = 0.637; p = 0.019). CONCLUSIONS The exploratory study showed that anodal tDCS applied over the auditory feedback area may lead to shorter pauses in a speech of PD patients.
Collapse
|
2
|
Effects of Deep-Brain Stimulation on Speech: Perceptual and Acoustic Data. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:1090-1106. [PMID: 38498664 PMCID: PMC11005955 DOI: 10.1044/2024_jslhr-23-00511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 12/15/2023] [Accepted: 01/16/2024] [Indexed: 03/20/2024]
Abstract
PURPOSE This study examined speech changes induced by deep-brain stimulation (DBS) in speakers with Parkinson's disease (PD) using a set of auditory-perceptual and acoustic measures. METHOD Speech recordings from nine speakers with PD and DBS were compared between DBS-On and DBS-Off conditions using auditory-perceptual and acoustic analyses. Auditory-perceptual ratings included voice quality, articulation precision, prosody, speech intelligibility, and listening effort obtained from 44 listeners. Acoustic measures were made for voicing proportion, second formant frequency slope, vowel dispersion, articulation rate, and range of fundamental frequency and intensity. RESULTS No significant changes were found between DBS-On and DBS-Off for the five perceptual ratings. Four of six acoustic measures revealed significant differences between the two conditions. While articulation rate and acoustic vowel dispersion increased, voicing proportion and intensity range decreased from the DBS-Off to DBS-On condition. However, a visual examination of the data indicated that the statistical significance was mostly driven by a small number of participants, while the majority did not show a consistent pattern of such changes. CONCLUSIONS Our data, in general, indicate no-to-minimal changes in speech production ensued from DBS stimulation. The findings are discussed with a focus on large interspeaker variability in PD in terms of their speech characteristics and the potential effects of DBS on speech.
Collapse
|
3
|
Automatic speech-based assessment to discriminate Parkinson's disease from essential tremor with a cross-language approach. NPJ Digit Med 2024; 7:37. [PMID: 38368458 PMCID: PMC10874421 DOI: 10.1038/s41746-024-01027-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 02/05/2024] [Indexed: 02/19/2024] Open
Abstract
Parkinson's disease (PD) and essential tremor (ET) are prevalent movement disorders that mainly affect elderly people, presenting diagnostic challenges due to shared clinical features. While both disorders exhibit distinct speech patterns-hypokinetic dysarthria in PD and hyperkinetic dysarthria in ET-the efficacy of speech assessment for differentiation remains unexplored. Developing technology for automatic discrimination could enable early diagnosis and continuous monitoring. However, the lack of data for investigating speech behavior in these patients has inhibited the development of a framework for diagnostic support. In addition, phonetic variability across languages poses practical challenges in establishing a universal speech assessment system. Therefore, it is necessary to develop models robust to the phonetic variability present in different languages worldwide. We propose a method based on Gaussian mixture models to assess domain adaptation from models trained in German and Spanish to classify PD and ET patients in Czech. We modeled three different speech dimensions: articulation, phonation, and prosody and evaluated the models' performance in both bi-class and tri-class classification scenarios (with the addition of healthy controls). Our results show that a fusion of the three speech dimensions achieved optimal results in binary classification, with accuracies up to 81.4 and 86.2% for monologue and /pa-ta-ka/ tasks, respectively. In tri-class scenarios, incorporating healthy speech signals resulted in accuracies of 63.3 and 71.6% for monologue and /pa-ta-ka/ tasks, respectively. Our findings suggest that automated speech analysis, combined with machine learning is robust, accurate, and can be adapted to different languages to distinguish between PD and ET patients.
Collapse
|
4
|
Levodopa may modulate specific speech impairment in Parkinson's disease: an fMRI study. J Neural Transm (Vienna) 2024; 131:181-187. [PMID: 37943390 DOI: 10.1007/s00702-023-02715-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 10/22/2023] [Indexed: 11/10/2023]
Abstract
Hypokinetic dysarthria (HD) is a difficult-to-treat symptom affecting quality of life in patients with Parkinson's disease (PD). Levodopa may partially alleviate some symptoms of HD in PD, but the neural correlates of these effects are not fully understood. The aim of our study was to identify neural mechanisms by which levodopa affects articulation and prosody in patients with PD. Altogether 20 PD patients participated in a task fMRI study (overt sentence reading). Using a single dose of levodopa after an overnight withdrawal of dopaminergic medication, levodopa-induced BOLD signal changes within the articulatory pathway (in regions of interest; ROIs) were studied. We also correlated levodopa-induced BOLD signal changes with the changes in acoustic parameters of speech. We observed no significant changes in acoustic parameters due to acute levodopa administration. After levodopa administration as compared to the OFF dopaminergic condition, patients showed task-induced BOLD signal decreases in the left ventral thalamus (p = 0.0033). The changes in thalamic activation were associated with changes in pitch variation (R = 0.67, p = 0.006), while the changes in caudate nucleus activation were related to changes in the second formant variability which evaluates precise articulation (R = 0.70, p = 0.003). The results are in line with the notion that levodopa does not have a major impact on HD in PD, but it may induce neural changes within the basal ganglia circuitries that are related to changes in speech prosody and articulation.
Collapse
|
5
|
Effects of deep brain stimulation of the subthalamic nucleus on patients with Parkinson's disease: a machine-learning voice analysis. Front Neurol 2023; 14:1267360. [PMID: 37928137 PMCID: PMC10622670 DOI: 10.3389/fneur.2023.1267360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 09/20/2023] [Indexed: 11/07/2023] Open
Abstract
Introduction Deep brain stimulation of the subthalamic nucleus (STN-DBS) can exert relevant effects on the voice of patients with Parkinson's disease (PD). In this study, we used artificial intelligence to objectively analyze the voices of PD patients with STN-DBS. Materials and methods In a cross-sectional study, we enrolled 108 controls and 101 patients with PD. The cohort of PD was divided into two groups: the first group included 50 patients with STN-DBS, and the second group included 51 patients receiving the best medical treatment. The voices were clinically evaluated using the Unified Parkinson's Disease Rating Scale part-III subitem for voice (UPDRS-III-v). We recorded and then analyzed voices using specific machine-learning algorithms. The likelihood ratio (LR) was also calculated as an objective measure for clinical-instrumental correlations. Results Clinically, voice impairment was greater in STN-DBS patients than in those who received oral treatment. Using machine learning, we objectively and accurately distinguished between the voices of STN-DBS patients and those under oral treatments. We also found significant clinical-instrumental correlations since the greater the LRs, the higher the UPDRS-III-v scores. Discussion STN-DBS deteriorates speech in patients with PD, as objectively demonstrated by machine-learning voice analysis.
Collapse
|
6
|
Consonant articulation acoustics and intelligibility in Swedish speakers with Parkinson's disease: a pilot study. CLINICAL LINGUISTICS & PHONETICS 2023; 37:845-865. [PMID: 35833475 DOI: 10.1080/02699206.2022.2095926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 05/16/2022] [Accepted: 06/22/2022] [Indexed: 06/15/2023]
Abstract
Imprecise consonant articulation is common in speakers with Parkinson's disease and can affect intelligibility. The research on the relationship between acoustic speech measures and intelligibility in Parkinson's disease is limited, and most of the research has been conducted on English. This pilot study investigated aspects of consonant articulation acoustics in eleven Swedish speakers with Parkinson's disease and six neurologically healthy persons. The focus of the study was on consonant cluster production, articulatory motion rate and variation, and voice onset time, and how these acoustic features correlate with speech intelligibility. Among the measures in the present study, typicality ratings of heterorganic consonant clusters /spr/ and /skr/ had the strongest correlations with intelligibility. Measures based on syllable repetition, such as repetition rate and voice onset time, showed varying results with weak to moderate correlations with intelligibility. One conclusion is that some acoustic measures may be more sensitive than others to the impact of the underlying sensory-motor impairment and dysarthria on speech production and intelligibility in speakers with Parkinson's disease. Some aspects of articulation appear to be equally demanding in terms of acoustic realisation for elderly healthy speakers and for speakers with Parkinson's disease, such as sequential motion rate measures. Clinically, this would imply that for the purpose of detecting signs of disordered speech motor control, choosing measures with less variation among older speakers without articulation impairment would lead to more robust results.
Collapse
|
7
|
Quantifying articulatory impairments in neurodegenerative motor diseases: A scoping review and meta-analysis of interpretable acoustic features. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023; 25:486-499. [PMID: 36001500 PMCID: PMC9950294 DOI: 10.1080/17549507.2022.2089234] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
PURPOSE Neurodegenerative motor diseases (NMDs) have devastating effects on the lives of patients and their loved ones, in part due to the impact of neurologic abnormalities on speech, which significantly limits functional communication. Clinical speech researchers have thus spent decades investigating speech features in populations suffering from NMDs. Features of impaired articulatory function are of particular interest given their detrimental impact on intelligibility, their ability to encode a variety of distinct movement disorders, and their potential as diagnostic indicators of neurodegenerative diseases. The objectives of this scoping review were to identify (1) which components of articulation (i.e. coordination, consistency, speed, precision, and repetition rate) are the most represented in the acoustic literature on NMDs; (2) which acoustic articulatory features demonstrate the most potential for detecting speech motor dysfunction in NMDs; and (3) which articulatory components are the most impaired within each NMD. METHOD This review examined literature published between 1976 and 2020. Studies were identified from six electronic databases using predefined key search terms. The first research objective was addressed using a frequency count of studies investigating each articulatory component, while the second and third objectives were addressed using meta-analyses. RESULT Findings from 126 studies revealed a considerable emphasis on articulatory precision. Of the 24 features included in the meta-analyses, vowel dispersion/distance and stop gap duration exhibited the largest effects when comparing the NMD population to controls. The meta-analyses also revealed divergent patterns of articulatory performance across disease types, providing evidence of unique profiles of articulatory impairment. CONCLUSION This review illustrates the current state of the literature on acoustic articulatory features in NMDs. By highlighting the areas of need within each articulatory component and disease group, this work provides a foundation on which clinical researchers, speech scientists, neurologists, and computer science engineers can develop research questions that will both broaden and deepen the understanding of articulatory impairments in NMDs.
Collapse
|
8
|
Repetitive transcranial magnetic stimulation for hypokinetic dysarthria in Parkinson's disease enhances white matter integrity of the auditory-motor loop. Eur J Neurol 2023; 30:881-886. [PMID: 36529528 DOI: 10.1111/ene.15665] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/08/2022] [Indexed: 12/23/2022]
Abstract
BACKGROUND AND PURPOSE In our previous study, repeated sessions of repetitive transcranial magnetic stimulation (rTMS) over the auditory feedback area were shown to improve hypokinetic dysarthria (HD) in Parkinson's disease (PD) and led to changes in functional connectivity within the left-sided articulatory networks. We analyzed data from this previous study and assessed the effects of rTMS for HD in PD on the diffusion parameters of the left anterior arcuate fasciculus (AAF), which connects the auditory feedback area with motor regions involved in articulation. METHODS Patients were assigned to 10 sessions of real or sham 1-Hz stimulation over the right posterior superior temporal gyrus. Stimulation effects were evaluated using magnetic resonance diffusion tensor imaging and by a speech therapist using a validated tool (Phonetics score of the Dysarthric Profile) at baseline, immediately after 2 weeks of stimulation, and at follow-up visits at Weeks 6 and 10 after the baseline. RESULTS Altogether, data from 33 patients were analyzed. A linear mixed model revealed significant time-by-group interaction (p = 0.006) for the relative changes of fractional anisotropy of the AAF; the value increases were associated with the temporal evolution of the Phonetics score (R = 0.367, p = 0.028) in the real stimulation group. CONCLUSIONS Real rTMS treatment for HD in PD as compared to sham stimulation led to increases of white matter integrity of the auditory-motor loop during the 2-month follow-up period. The changes were related to motor speech improvements.
Collapse
|
9
|
Asymmetric slowness and dystonic posturing. Pract Neurol 2023; 23:183-185. [PMID: 36549888 DOI: 10.1136/pn-2022-003593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/16/2022] [Indexed: 12/24/2022]
|
10
|
Speech and gait abnormalities in motor subtypes of de-novo Parkinson's disease. CNS Neurosci Ther 2023. [PMID: 36942517 DOI: 10.1111/cns.14158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 02/22/2023] [Accepted: 02/24/2023] [Indexed: 03/23/2023] Open
Abstract
AIM To investigate the presence and relationship of temporal speech and gait parameters in patients with postural instability/gait disorder (PIGD) and tremor-dominant (TD) motor subtypes of Parkinson's disease (PD). METHODS Speech samples and instrumented walkway system assessments were acquired from a total of 60 de-novo PD patients (40 in TD and 20 in PIGD subtype) and 40 matched healthy controls. Objective acoustic vocal assessment of seven distinct speech timing dimensions was related to instrumental gait measures including velocity, cadence, and stride length. RESULTS Compared to controls, PIGD subtype showed greater consonant timing abnormalities by prolonged voice onset time (VOT) while also shorter stride length during both normal walking and dual task, while decreased velocity and cadence only during dual task. Speaking rate was faster in PIGD than TD subtype. In PIGD subtype, prolonged VOT correlated with slower gait velocity (r = -0.56, p = 0.01) and shorter stride length (r = -0.59, p = 0.008) during normal walking, whereas relationships were also found between decreased cadence in dual task and irregular alternating motion rates (r = -0.48, p = 0.04) and prolonged pauses (r = -0.50, p = 0.03). No correlation between speech and gait was detected in TD subtype. CONCLUSION Our findings suggest that speech and gait rhythm disorder share similar underlying pathomechanisms specific for PIGD subtype.
Collapse
|
11
|
Has machine learning over-promised in healthcare? A critical analysis and a proposal for improved evaluation, with evidence from Parkinson’s disease. Artif Intell Med 2023; 139:102524. [PMID: 37100503 DOI: 10.1016/j.artmed.2023.102524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 02/22/2023] [Accepted: 02/28/2023] [Indexed: 03/17/2023]
Abstract
Adoption of artificial intelligence (AI) by the medical community has long been anticipated, endorsed by a stream of machine learning literature showcasing AI systems that yield extraordinary performance. However, many of these systems are likely over-promising and will under-deliver in practice. One key reason is the community's failure to acknowledge and address the presence of inflationary effects in the data. These simultaneously inflate evaluation performance and prevent a model from learning the underlying task, thus severely misrepresenting how that model would perform in the real world. This paper investigated the impact of these inflationary effects on healthcare tasks, as well as how these effects can be addressed. Specifically, we defined three inflationary effects that occur in medical data sets and allow models to easily reach small training losses and prevent skillful learning. We investigated two data sets of sustained vowel phonation from participants with and without Parkinson's disease, and revealed that published models which have achieved high classification performances on these were artificially enhanced due to the inflationary effects. Our experiments showed that removing each inflationary effect corresponded with a decrease in classification accuracy, and that removing all inflationary effects reduced the evaluated performance by up to 30%. Additionally, the performance on a more realistic test set increased, suggesting that the removal of these inflationary effects enabled the model to better learn the underlying task and generalize. Source code is available at https://github.com/Wenbo-G/pd-phonation-analysis under the MIT license.
Collapse
|
12
|
Multilingual evaluation of interpretable biomarkers to represent language and speech patterns in Parkinson's disease. Front Neurol 2023; 14:1142642. [PMID: 36937510 PMCID: PMC10017962 DOI: 10.3389/fneur.2023.1142642] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 02/08/2023] [Indexed: 03/06/2023] Open
Abstract
Motor impairments are only one aspect of Parkinson's disease (PD), which also include cognitive and linguistic impairments. Speech-derived interpretable biomarkers may help clinicians diagnose PD at earlier stages and monitor the disorder's evolution over time. This study focuses on the multilingual evaluation of a composite array of biomarkers that facilitate PD evaluation from speech. Hypokinetic dysarthria, a motor speech disorder associated with PD, has been extensively analyzed in previously published studies on automatic PD evaluation, with a relative lack of inquiry into language and task variability. In this study, we explore certain acoustic, linguistic, and cognitive information encoded within the speech of several cohorts with PD. A total of 24 biomarkers were analyzed from American English, Italian, Castilian Spanish, Colombian Spanish, German, and Czech by conducting a statistical analysis to evaluate which biomarkers best differentiate people with PD from healthy participants. The study leverages conceptual robustness as a criterion in which a biomarker behaves the same, independent of the language. Hence, we propose a set of speech-based biomarkers that can effectively help evaluate PD while being language-independent. In short, the best acoustic and cognitive biomarkers permitting discrimination between experimental groups across languages were fundamental frequency standard deviation, pause time, pause percentage, silence duration, and speech rhythm standard deviation. Linguistic biomarkers representing the length of the narratives and the number of nouns and auxiliaries also provided discrimination between groups. Altogether, in addition to being significant, these biomarkers satisfied the robustness requirements.
Collapse
|
13
|
Long-Term Averaged Spectrum Descriptors of Dysarthria in Patients With Parkinson's Disease Treated With Subthalamic Nucleus Deep Brain Stimulation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:4690-4699. [PMID: 36472939 DOI: 10.1044/2022_jslhr-22-00308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
PURPOSE This study aimed to evaluate whether long-term averaged spectrum (LTAS) descriptors for reading and monologue are suitable to detect worsening of dysarthria in patients with Parkinson's disease (PD) treated with subthalamic nucleus deep brain stimulation (STN-DBS) with potential effect of ON and OFF stimulation conditions and types of connected speech. METHOD Four spectral moments based on LTAS were computed for monologue and reading passage collected from 23 individuals with PD treated with bilateral STN-DBS and 23 age- and gender-matched healthy controls. Speech performance of patients with PD was compared in ON and OFF STN-DBS conditions. RESULTS All LTAS spectral moments including mean, standard deviation, skewness, and kurtosis across both monologue and reading passage were able to significantly distinguish between patients with PD in both stimulation conditions and control speakers. The spectral mean was the only LTAS measure sensitive to capture better speech performance in STN-DBS ON, as compared to the STN-DBS OFF stimulation condition (p < .05). Standardized reading passage was more sensitive compared to monologue in detecting dysarthria severity via LTAS descriptors with an area under the curve of up to 0.92 obtained between PD and control groups. CONCLUSIONS Our findings confirmed that LTAS is a suitable approach to objectively describe changes in speech impairment severity due to STN-DBS therapy in patients with PD. We envisage these results as an important step toward a continuum development of technological solutions for the automated assessment of stimulation-induced dysarthria. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21644798.
Collapse
|
14
|
The efficacy of acoustic-based articulatory phenotyping for characterizing and classifying four divergent neurodegenerative diseases using sequential motion rates. J Neural Transm (Vienna) 2022; 129:1487-1511. [PMID: 36305960 PMCID: PMC9859630 DOI: 10.1007/s00702-022-02550-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 10/13/2022] [Indexed: 01/25/2023]
Abstract
Despite the impacts of neurodegeneration on speech function, little is known about how to comprehensively characterize the resulting speech abnormalities using a set of objective measures. Quantitative phenotyping of speech motor impairments may have important implications for identifying clinical syndromes and their underlying etiologies, monitoring disease progression over time, and improving treatment efficacy. The goal of this research was to investigate the validity and classification accuracy of comprehensive acoustic-based articulatory phenotypes in speakers with distinct neurodegenerative diseases. Articulatory phenotypes were characterized based on acoustic features that were selected to represent five components of motor performance: Coordination, Consistency, Speed, Precision, and Rate. The phenotypes were first used to characterize the articulatory abnormalities across four progressive neurologic diseases known to have divergent speech motor deficits: amyotrophic lateral sclerosis (ALS), progressive ataxia (PA), Parkinson's disease (PD), and the nonfluent variant of primary progressive aphasia and progressive apraxia of speech (nfPPA + PAOS). We then examined the efficacy of articulatory phenotyping for disease classification. Acoustic analyses were conducted on audio recordings of 217 participants (i.e., 46 ALS, 52 PA, 60 PD, 20 nfPPA + PAOS, and 39 controls) during a sequential speech task. Results revealed evidence of distinct articulatory phenotypes for the four clinical groups and that the phenotypes demonstrated strong classification accuracy for all groups except ALS. Our results highlight the phenotypic variability present across neurodegenerative diseases, which, in turn, may inform (1) the differential diagnosis of neurological diseases and (2) the development of sensitive outcome measures for monitoring disease progression or assessing treatment efficacy.
Collapse
|
15
|
Computerized analysis of speech and voice for Parkinson's disease: A systematic review. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107133. [PMID: 36183641 DOI: 10.1016/j.cmpb.2022.107133] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 09/13/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Speech impairment is an early symptom of Parkinson's disease (PD). This study has summarized the literature related to speech and voice in detecting PD and assessing its severity. METHODS A systematic review of the literature from 2010 to 2021 to investigate analysis methods and signal features. The keywords "Automatic analysis" in conjunction with "PD speech" or "PD voice" were used, and the PubMed and ScienceDirect databases were searched. A total of 838 papers were found on the first run, of which 189 were selected. One hundred and forty-seven were found to be suitable for the review. The different datasets, recording protocols, signal analysis methods and features that were reported are listed. Values of the features that separate PD patients from healthy controls were tabulated. Finally, the barriers that limit the wide use of computerized speech analysis are discussed. RESULTS Speech and voice may be valuable markers for PD. However, large differences between the datasets make it difficult to compare different studies. In addition, speech analytic methods that are not informed by physiological understanding may alienate clinicians. CONCLUSIONS The potential usefulness of speech and voice for the detection and assessment of PD is confirmed by evidence from the classification and correlation results.
Collapse
|
16
|
Speech acoustic indices for differential diagnosis between Parkinson's disease, multiple system atrophy and progressive supranuclear palsy. NPJ Parkinsons Dis 2022; 8:142. [PMID: 36302780 PMCID: PMC9613976 DOI: 10.1038/s41531-022-00389-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 09/01/2022] [Indexed: 11/05/2022] Open
Abstract
While speech disorder represents an early and prominent clinical feature of atypical parkinsonian syndromes such as multiple system atrophy (MSA) and progressive supranuclear palsy (PSP), little is known about the sensitivity of speech assessment as a potential diagnostic tool. Speech samples were acquired from 215 subjects, including 25 MSA, 20 PSP, 20 Parkinson's disease participants, and 150 healthy controls. The accurate differential diagnosis of dysarthria subtypes was based on the quantitative acoustic analysis of 26 speech dimensions related to phonation, articulation, prosody, and timing. A semi-supervised weighting-based approach was then applied to find the best feature combinations for separation between PSP and MSA. Dysarthria was perceptible in all PSP and MSA patients and consisted of a combination of hypokinetic, spastic, and ataxic components. Speech features related to respiratory dysfunction, imprecise consonants, monopitch, slow speaking rate, and subharmonics contributed to worse performance in PSP than MSA, whereas phonatory instability, timing abnormalities, and articulatory decay were more distinctive for MSA compared to PSP. The combination of distinct speech patterns via objective acoustic evaluation was able to discriminate between PSP and MSA with very high accuracy of up to 89% as well as between PSP/MSA and PD with up to 87%. Dysarthria severity in MSA/PSP was related to overall disease severity. Speech disorders reflect the differing underlying pathophysiology of tauopathy in PSP and α-synucleinopathy in MSA. Vocal assessment may provide a low-cost alternative screening method to existing subjective clinical assessment and imaging diagnostic approaches.
Collapse
|
17
|
Articulatory undershoot of vowels in isolated REM sleep behavior disorder and early Parkinson's disease. NPJ Parkinsons Dis 2022; 8:137. [PMID: 36266347 PMCID: PMC9584921 DOI: 10.1038/s41531-022-00407-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 10/04/2022] [Indexed: 11/09/2022] Open
Abstract
Imprecise vowels represent a common deficit associated with hypokinetic dysarthria resulting from a reduced articulatory range of motion in Parkinson's disease (PD). It is not yet unknown whether the vowel articulation impairment is already evident in the prodromal stages of synucleinopathy. We aimed to assess whether vowel articulation abnormalities are present in isolated rapid eye movement sleep behaviour disorder (iRBD) and early-stage PD. A total of 180 male participants, including 60 iRBD, 60 de-novo PD and 60 age-matched healthy controls performed reading of a standardized passage. The first and second formant frequencies of the corner vowels /a/, /i/, and /u/ extracted from predefined words, were utilized to construct articulatory-acoustic measures of Vowel Space Area (VSA) and Vowel Articulation Index (VAI). Compared to controls, VSA was smaller in both iRBD (p = 0.01) and PD (p = 0.001) while VAI was lower only in PD (p = 0.002). iRBD subgroup with abnormal olfactory function had smaller VSA compared to iRBD subgroup with preserved olfactory function (p = 0.02). In PD patients, the extent of bradykinesia and rigidity correlated with VSA (r = -0.33, p = 0.01), while no correlation between axial gait symptoms or tremor and vowel articulation was detected. Vowel articulation impairment represents an early prodromal symptom in the disease process of synucleinopathy. Acoustic assessment of vowel articulation may provide a surrogate marker of synucleinopathy in scenarios where a single robust feature to monitor the dysarthria progression is needed.
Collapse
|
18
|
Layer recurrent neural network-based diagnosis of Parkinson’s disease using voice features. BIOMED ENG-BIOMED TE 2022; 67:249-266. [DOI: 10.1515/bmt-2022-0022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Accepted: 05/18/2022] [Indexed: 12/13/2022]
Abstract
Abstract
Parkinson’s disease (PD), a slow-progressing neurological disease, affects a large percentage of the world’s elderly population, and this population is expected to grow over the next decade. As a result, early detection is crucial for community health and the future of the globe in order to take proper safeguards and have a less arduous treatment procedure. Recent research has begun to focus on the motor system deficits caused by PD. Because practically most of the PD patients suffer from voice abnormalities, researchers working on automated diagnostic systems investigate vocal impairments. In this paper, we undertake extensive experiments with features extracted from voice signals. We propose a layer Recurrent Neural Network (RNN) based diagnosis for PD. To prove the efficiency of the model, different network models are compared. To the best of our knowledge, several neural network topologies, namely RNN, Cascade Forward Neural Networks (CFNN), and Feed Forward Neural Networks (FFNN), are used and compared for voice-based PD detection for the first time. In addition, the impacts of data normalization and feature selection (FS) are thoroughly examined. The findings reveal that normalization increases classifier performance and Laplacian-based FS outperforms. The proposed RNN model with 300 voice features achieves 99.74% accuracy.
Collapse
|
19
|
An investigation about the relationship between dysarthria level of speech and the neurological state of Parkinson’s patients. Biocybern Biomed Eng 2022. [DOI: 10.1016/j.bbe.2022.04.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
20
|
Short-term effect of dopaminergic medication on speech in early-stage Parkinson's disease. NPJ Parkinsons Dis 2022; 8:22. [PMID: 35256614 PMCID: PMC8901688 DOI: 10.1038/s41531-022-00286-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 02/01/2022] [Indexed: 11/24/2022] Open
Abstract
The effect of dopaminergic medication on speech has rarely been examined in early-stage Parkinson’s disease (PD) and the respective literature is inconclusive and limited by inappropriate design with lack of PD control group. The study aims to examine the short-term effect of dopaminergic medication on speech in PD using patients with good motor responsiveness to levodopa challenge compared to a control group of PD patients with poor motor responsiveness. A total of 60 early-stage PD patients were investigated before (OFF) and after (ON) acute levodopa challenge and compared to 30 age-matched healthy controls. PD patients were categorised into two clinical subgroups (PD responders vs. PD nonresponders) according to the comparison of their motor performance based on movement disorder society-unified Parkinson’s disease rating scale, part III. Seven distinctive parameters of hypokinetic dysarthria were examined using quantitative acoustic analysis. We observed increased monopitch (p > 0.01), aggravated monoloudness (p > 0.05) and longer duration of stop consonants (p > 0.05) in PD compared to healthy controls, confirming the presence of hypokinetic dysarthria in early PD. No speech alterations from OFF to ON state were revealed in any of the two PD groups and speech dimensions investigated including monopitch, monoloudness, imprecise consonants, harsh voice, slow sequential motion rates, articulation rate, or inappropriate silences, although a subgroup of PD responders manifested obvious improvement in motor function after levodopa intake (p > 0.001). Since the short-term usage of levodopa does not easily affect voice and speech performance in PD, speech assessment may provide a medication state-independent motor biomarker of PD.
Collapse
|
21
|
Abstract
Introduction Parkinson's disease (PD) is characterized by specific voice disorders collectively termed hypokinetic dysarthria. We here investigated voice changes by using machine learning algorithms, in a large cohort of patients with PD in different stages of the disease, OFF and ON therapy. Methods We investigated 115 patients affected by PD (mean age: 68.2 ± 9.2 years) and 108 age-matched healthy subjects (mean age: 60.2 ± 11.0 years). The PD cohort included 57 early-stage patients (Hoehn &Yahr ≤ 2) who never took L-Dopa for their disease at the time of the study, and 58 mid-advanced-stage patients (Hoehn &Yahr >2) who were chronically-treated with L-Dopa. We clinically evaluated voices using specific subitems of the Unified Parkinson's Disease Rating Scale and the Voice Handicap Index. Voice samples recorded through a high-definition audio recorder underwent machine learning analysis based on the support vector machine classifier. We also calculated the receiver operating characteristic curves to examine the diagnostic accuracy of the analysis and assessed possible clinical-instrumental correlations. Results Voice is abnormal in early-stage PD and as the disease progresses, voice increasingly degradres as demonstrated by high accuracy in the discrimination between healthy subjects and PD patients in the early-stage and mid-advanced-stage. Also, L-dopa therapy improves but not restore voice in PD as shown by high accuracy in the comparison between patients OFF and ON therapy. Finally, for the first time we achieved significant clinical-instrumental correlations by using a new score (LR value) calculated by machine learning. Conclusion Voice is abnormal in early-stage PD, progressively degrades in mid-advanced-stage and can be improved but not restored by L-Dopa. Lastly, machine learning allows tracking disease severity and quantifying the symptomatic effect of L-Dopa on voice parameters with previously unreported high accuracy, thus representing a potential new biomarker of PD.
Collapse
|
22
|
Things to Consider When Automatically Detecting Parkinson’s Disease Using the Phonation of Sustained Vowels: Analysis of Methodological Issues. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12030991] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Diagnosing Parkinson’s Disease (PD) necessitates monitoring symptom progression. Unfortunately, diagnostic confirmation often occurs years after disease onset. A more sensitive and objective approach is paramount to the expedient diagnosis and treatment of persons with PD (PwPDs). Recent studies have shown that we can train accurate models to detect signs of PD from audio recordings of confirmed PwPDs. However, disparities exist between studies and may be caused, in part, by differences in employed corpora or methodologies. Our hypothesis is that unaccounted covariates in methodology, experimental design, and data preparation resulted in overly optimistic results in studies of PD automatic detection employing sustained vowels. These issues include record-wise fold creation rather than subject-wise; an imbalance of age between the PwPD and control classes; using too small of a corpus compared to the sizes of feature vectors; performing cross-validation without including development data; and the absence of cross-corpora testing to confirm results. In this paper, we evaluate the influence of these methodological issues in the automatic detection of PD employing sustained vowels. We perform several experiments isolating each issue to measure its influence employing three different corpora. Moreover, we analyze if the perceived dysphonia of the speakers could be causing differences in results between the corpora. Results suggest that each independent methodological issue analyzed has an effect on classification accuracy. Consequently, we recommend a list of methodological steps to be considered in future experiments to avoid overoptimistic or misleading results.
Collapse
|
23
|
Early auditory responses to speech sounds in Parkinson's disease: preliminary data. Sci Rep 2022; 12:1019. [PMID: 35046514 PMCID: PMC8770631 DOI: 10.1038/s41598-022-05128-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2021] [Accepted: 01/06/2022] [Indexed: 11/14/2022] Open
Abstract
Parkinson’s disease (PD), as a manifestation of basal ganglia dysfunction, is associated with a number of speech deficits, including reduced voice modulation and vocal output. Interestingly, previous work has shown that participants with PD show an increased feedback-driven motor response to unexpected fundamental frequency perturbations during speech production, and a heightened ability to detect differences in vocal pitch relative to control participants. Here, we explored one possible contributor to these enhanced responses. We recorded the frequency-following auditory brainstem response (FFR) to repetitions of the speech syllable [da] in PD and control participants. Participants with PD displayed a larger amplitude FFR related to the fundamental frequency of speech stimuli relative to the control group. The current preliminary results suggest the dysfunction of the basal ganglia in PD contributes to the early stage of auditory processing and may reflect one component of a broader sensorimotor processing impairment associated with the disease.
Collapse
|
24
|
Parkinson's disease patients with freezing of gait have more severe voice impairment than non-freezers during "ON state". J Neural Transm (Vienna) 2022; 129:277-286. [PMID: 34989833 DOI: 10.1007/s00702-021-02458-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/26/2021] [Indexed: 12/22/2022]
Abstract
BACKGROUND Speech disorders and freezing of gait (FOG) in Parkinson's disease (PD) may have some common pathological mechanisms. The purpose of this study was to compare the acoustic parameters of PD patients with dopamine-responsive FOG (PD-FOG) and without FOG (PD-nFOG) during "ON state" and explore the ability of "ON state" voice features in distinguishing PD-FOG from PD-nFOG. METHODS A total of 120 subjects, including 40 PD patients with dopamine-responsive FOG, 40 PD-nFOG, and 40 healthy controls (HCs) were recruited. All subjects underwent neuropsychological tests. Speech samples were recorded through the sustained vowel pronunciation tasks during the "ON state" and then analyzed by the Praat software. A set of 27 voice features was extracted from each sample for comparison. Support vector machine (SVM) was used to build mathematical models to classify PD-FOG and PD-nFOG. RESULTS Compared with PD-nFOG, the jitter, the standard deviation of fundamental frequency (F0SD), the standard deviation of pulse period (pulse period SD) and the noise-homophonic-ratio (NHR) were increased, and the maximum phonation time (MPT) was decreased in PD-FOG. The above voice features were correlated with the freezing of gait questionnaire (FOGQ). The average accuracy, specificity, and sensitivity of SVM models based on 27 voice features for classifying PD-FOG and PD-nFOG were 73.57%, 75.71%, and 71.43%, respectively. CONCLUSIONS PD-FOG have more severe voice impairment than PD-nFOG during "ON state".
Collapse
|
25
|
Lee Silverman Voice Treatment to Improve Speech in Parkinson's Disease: A Systemic Review and Meta-Analysis. PARKINSON'S DISEASE 2021; 2021:3366870. [PMID: 35070257 PMCID: PMC8782619 DOI: 10.1155/2021/3366870] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 12/07/2021] [Accepted: 12/10/2021] [Indexed: 02/07/2023]
Abstract
BACKGROUND Speech changes occur in the early stages of Parkinson's disease (PD) and cause communication difficulties, leading to social isolation. Lee Silverman voice treatment (LSVT) is a speech therapy approach designed to improve patients' language and voice capabilities. OBJECTIVE The effectiveness of the LSVT was compared with that of other speech interventions or no treatment to evaluate PD patients with dysarthria. DESIGN Systematic review with meta-analysis of randomized trials. Data Sources: PubMed, Embase, Cochrane Library, CNKI, and SinoMed library were searched from inception to December 2021 related to PD and LSVT. METHOD Abstracts were screened and reviewed against the eligibility criteria (intervention group participants were PD assessed based on LSVT (LSVT Loud) and randomized control). RESULT Ten randomized controlled trials were identified on speech symptoms in patients with PD. Compared with the respiratory therapy (RET) exercise, or no training group, a significant improvement was detected in the sound press level (SPL) after immediate treatment during the reading of vowel and rainbow passages and an increase in semitone standard deviation (STSD). Furthermore, the LSVT training significantly increased the participants' scores on unified Parkinson's disease rating scale (UPDRS-III) and speech intelligibility. CONCLUSION This meta-analysis demonstrated the efficacy of LSVT in increasing vocal loudness and functional communication among individuals with PD. However, most studies included participants with mild-moderate PD. Thus, additional randomized controlled trials (RCTs) with large sample sizes are needed to validate the efficacy of LSVT in patients with different progressions of PD, including severe PD.
Collapse
|
26
|
Validation of cepstral peak prominence in assessing early voice changes of Parkinson's disease: Effect of speaking task and ambient noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:4522. [PMID: 34972306 DOI: 10.1121/10.0009063] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 12/03/2021] [Indexed: 06/14/2023]
Abstract
Although the cepstral peak prominence (CPP) and its variant, the cepstral peak prominence smooth (CPPS), are considered to be robust acoustic measures for the evaluation of dysphonia, whether they are sensitive to capture early voice changes in Parkinson's disease (PD) has not yet been explored. This study aimed to investigate the voice changes via the CPP measures in the idiopathic rapid eye movement sleep behavior disorder (iRBD), a special case of prodromal neurodegeneration, and recently diagnosed and advanced-stage Parkinson's disease (AS-PD) patients using different speaking tasks across noise-free and noisy environments. The sustained vowel phonation, reading of passages, and monologues of 60 early stage untreated PD, 30 advanced-stage Parkinson's disease, 60 iRBD, and 60 healthy control (HC) participants were evaluated. Significant differences were found between the PD groups and controls in sustained phonation via the CPP (p < 0.05) and CPPS (p < 0.01) and the monologue via the CPP (p < 0.01), although neither the CPP nor CPPS measures were sufficiently sensitive to capture the possible prodromal dysphonia in the iRBD. The quality of the CPP and CPPS measures was influenced substantially by the addition of ambient noise. It was anticipated that the CPP measures might serve as a promising digital biomarker in assessing the dysphonia from the early stages of PD.
Collapse
|
27
|
Dysprosody in Isolated REM Sleep Behavior Disorder with Impaired Olfaction but Intact Nigrostriatal Pathway. Mov Disord 2021; 37:619-623. [PMID: 34837250 DOI: 10.1002/mds.28873] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 11/01/2021] [Accepted: 11/08/2021] [Indexed: 11/08/2022] Open
Abstract
BACKGROUND Impairments of olfactory and speech function are likely early prodromal symptoms of α-synucleinopathy. OBJECTIVE The aim of this study is to assess whether dysprosody is present in isolated rapid eye movement sleep behavior disorder (iRBD) with hyposmia/anosmia and a normal nigrostriatal system. METHODS Pitch variability during speech was investigated in 17 iRBD subjects with normal olfactory function (iRBD-NOF), 30 iRBD subjects with abnormal olfactory function (iRBD-AOF), and 50 healthy controls. iRBD subjects were evaluated using the University of Pennsylvania Smell Identification Test and [123I]-2ß-carbomethoxy-3ß-(4-iodophenyl)-N-(3-fluoropropyl)-nortropane dopamine transporter single-photon emission computed tomography (DAT-SPECT). All iRBD subjects completed the 24-month follow-up with DAT-SPECT, speech, and olfactory testing. RESULTS At baseline, only iRBD-AOF showed monopitch when compared to iRBD-NOF (P = 0.04) and controls (P = 0.03), with no difference between iRBD-NOF and controls (P = 1). At follow-up, dysprosody progressed only in iRBD-AOF with abnormal DAT-SPECT (P = 0.03). CONCLUSION Prosody is impaired in hyposmic but not in normosmic iRBD subjects before the nigrostriatal dopaminergic transmission is affected (Braak stage 2). © 2021 International Parkinson and Movement Disorder Society.
Collapse
|