1
|
Sonkaya ZZ, Özturk B, Sonkaya R, Taskiran E, Karadas Ö. Using Objective Speech Analysis Techniques for the Clinical Diagnosis and Assessment of Speech Disorders in Patients with Multiple Sclerosis. Brain Sci 2024; 14:384. [PMID: 38672033 PMCID: PMC11047916 DOI: 10.3390/brainsci14040384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 04/11/2024] [Accepted: 04/12/2024] [Indexed: 04/28/2024] Open
Abstract
Multiple sclerosis (MS) is one of the chronic and neurodegenerative diseases of the central nervous system (CNS). It generally affects motor, sensory, cerebellar, cognitive, and language functions. It is thought that identifying MS speech disorders using quantitative methods will make a significant contribution to physicians in the diagnosis and follow-up of MS patients. In this study, it was aimed to investigate the speech disorders of MS via objective speech analysis techniques. The study was conducted on 20 patients diagnosed with MS according to McDonald's 2017 criteria and 20 healthy volunteers without any speech or voice pathology. Speech data obtained from patients and healthy individuals were analyzed with the PRAAT speech analysis program, and classification algorithms were tested to determine the most effective classifier in separating specific speech features of MS disease. As a result of the study, the K-nearest neighbor algorithm (K-NN) was found to be the most successful classifier (95%) in distinguishing pathological sounds which were seen in MS patients from those in healthy individuals. The findings obtained in our study can be considered as preliminary data to determine the voice characteristics of MS patients.
Collapse
Affiliation(s)
- Zeynep Z. Sonkaya
- Department of Experimental Linguistics, Ankara University, 06590 Ankara, Turkey
| | - Bilgin Özturk
- Department of Neurology, Gülhane Medicine Faculty, Health Science University, 06010 Ankara, Turkey; (B.Ö.); (R.S.); (Ö.K.)
| | - Rıza Sonkaya
- Department of Neurology, Gülhane Medicine Faculty, Health Science University, 06010 Ankara, Turkey; (B.Ö.); (R.S.); (Ö.K.)
| | - Esra Taskiran
- Department of Neurology, Antalya Training and Research Hospital, 07100 Antalya, Turkey;
| | - Ömer Karadas
- Department of Neurology, Gülhane Medicine Faculty, Health Science University, 06010 Ankara, Turkey; (B.Ö.); (R.S.); (Ö.K.)
| |
Collapse
|
2
|
Isaev DY, Vlasova RM, Di Martino JM, Stephen CD, Schmahmann JD, Sapiro G, Gupta AS. Uncertainty of Vowel Predictions as a Digital Biomarker for Ataxic Dysarthria. CEREBELLUM (LONDON, ENGLAND) 2024; 23:459-470. [PMID: 37039956 PMCID: PMC10826261 DOI: 10.1007/s12311-023-01539-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 02/27/2023] [Indexed: 04/12/2023]
Abstract
Dysarthria is a common manifestation across cerebellar ataxias leading to impairments in communication, reduced social connections, and decreased quality of life. While dysarthria symptoms may be present in other neurological conditions, ataxic dysarthria is a perceptually distinct motor speech disorder, with the most prominent characteristics being articulation and prosody abnormalities along with distorted vowels. We hypothesized that uncertainty of vowel predictions by an automatic speech recognition system can capture speech changes present in cerebellar ataxia. Speech of participants with ataxia (N=61) and healthy controls (N=25) was recorded during the "picture description" task. Additionally, participants' dysarthric speech and ataxia severity were assessed on a Brief Ataxia Rating Scale (BARS). Eight participants with ataxia had speech and BARS data at two timepoints. A neural network trained for phoneme prediction was applied to speech recordings. Average entropy of vowel tokens predictions (AVE) was computed for each participant's recording, together with mean pitch and intensity standard deviations (MPSD and MISD) in the vowel segments. AVE and MISD demonstrated associations with BARS speech score (Spearman's rho=0.45 and 0.51), and AVE demonstrated associations with BARS total (rho=0.39). In the longitudinal cohort, Wilcoxon pairwise signed rank test demonstrated an increase in BARS total and AVE, while BARS speech and acoustic measures did not significantly increase. Relationship of AVE to both BARS speech and BARS total, as well as the ability to capture disease progression even in absence of measured speech decline, indicates the potential of AVE as a digital biomarker for cerebellar ataxia.
Collapse
Affiliation(s)
- Dmitry Yu Isaev
- Department of Biomedical Engineering, Duke University, Durham, NC, USA.
| | - Roza M Vlasova
- Department of Psychiatry, UNC School of Medicine, University of North Carolina, Chapel Hill, NC, USA
| | - J Matias Di Martino
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA
| | - Christopher D Stephen
- Ataxia Center & Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Jeremy D Schmahmann
- Ataxia Center & Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Guillermo Sapiro
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA
- Departments of Mathematics & Computer Science, Duke University, Durham, NC, USA
| | - Anoopum S Gupta
- Ataxia Center & Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
3
|
Moya-Galé G, Kim Y, Fabiano L. Raising Awareness About Language- and Culture-Specific Considerations in the Management of Dysarthria Associated With Parkinson's Disease Within the United States. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-9. [PMID: 37902554 DOI: 10.1044/2023_jslhr-23-00365] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/31/2023]
Abstract
PURPOSE The purpose of this article is to raise awareness about the importance of diverting from English-centric approaches in the management of dysarthria associated with Parkinson's disease (PD) in the United States, and embracing a language- and culture-specific perspective when working with linguistically and culturally diverse populations within the context of culturally responsive, precision medicine. METHOD This tutorial is divided into two primary components: a critical review of language universal and language-specific characteristics of dysarthria associated with PD and their relationship with speech intelligibility, and a practical guide to culturally responsive evidence-based practice for speech-language pathologists. CONCLUSIONS We offer a framework for linguistically and culturally appropriate considerations when working with clients with dysarthria associated with PD. While "universal" representations of dysarthria may be part of the big picture, language-specific contributions to speakers' intelligibility should be carefully examined to maximize treatment outcomes. Additionally, an evidence-based model that fully embraces clients' wishes and values within the context of culturally responsive, precision medicine should be prioritized, a practice that may include the use of interpreters.
Collapse
|
4
|
Convey RB, Ihalainen T, Liu Y, Räsänen O, Ylinen S, Penttilä N. A comparative study of automatic vowel articulation index and auditory-perceptual assessments of speech intelligibility in Parkinson's disease. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023:1-11. [PMID: 37800979 DOI: 10.1080/17549507.2023.2251725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/07/2023]
Abstract
PURPOSE The purpose of this study was to analyse the relationship between automatic vowel articulation index (aVAI) and direct magnitude estimation (DME) among speakers with Parkinson's disease (PD) and healthy controls. We further analysed the potential of aVAI to serve as an objective measure of speech impairment in the clinical setting. METHOD Speech samples from native Finnish speakers were utilised. Expert raters utilised DME to scale the intelligibility of speech samples. aVAI scores for PD speakers and healthy control speakers were analysed in relationship to DME speech intelligibility ratings and, among PD speakers, disease stage utilising nonparametric statistical analysis. RESULT Mean DME intelligibility ratings were lower among PD speakers compared to healthy controls. Mean aVAI scores were nearly the same between speaker groups. DME intelligibility ratings and aVAI were strongly correlated within the PD speaker group. aVAI and DME intelligibility ratings were moderately correlated with disease stage as measured by the Hoehn and Yahr scale. CONCLUSION aVAI was observed to be a promising tool for analysing vowel articulation in PD speakers. Further research is warranted on the application of aVAI as an objective measure of severity of speech impairment in the clinical setting, with varying patient populations and speech samples.
Collapse
Affiliation(s)
- Rachel B Convey
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Tiina Ihalainen
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Yuanyuan Liu
- Information Technology and Communication Sciences, Tampere University, Tampere, Finland
| | - Okko Räsänen
- Information Technology and Communication Sciences, Tampere University, Tampere, Finland
| | - Sari Ylinen
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Nelly Penttilä
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| |
Collapse
|
5
|
Milella G, Sciancalepore D, Cavallaro G, Piccirilli G, Nanni AG, Fraddosio A, D’Errico E, Paolicelli D, Fiorella ML, Simone IL. Acoustic Voice Analysis as a Useful Tool to Discriminate Different ALS Phenotypes. Biomedicines 2023; 11:2439. [PMID: 37760880 PMCID: PMC10525613 DOI: 10.3390/biomedicines11092439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 08/24/2023] [Accepted: 08/29/2023] [Indexed: 09/29/2023] Open
Abstract
Approximately 80-96% of people with amyotrophic lateral sclerosis (ALS) become unable to speak during the disease progression. Assessing upper and lower motor neuron impairment in bulbar regions of ALS patients remains challenging, particularly in distinguishing spastic and flaccid dysarthria. This study aimed to evaluate acoustic voice parameters as useful biomarkers to discriminate ALS clinical phenotypes. Triangular vowel space area (tVSA), alternating motion rates (AMRs), and sequential motion rates (SMRs) were analyzed in 36 ALS patients and 20 sex/age-matched healthy controls (HCs). tVSA, AMR, and SMR values significantly differed between ALS and HCs, and between ALS with prevalent upper (pUMN) and lower motor neuron (pLMN) impairment. tVSA showed higher accuracy in discriminating pUMN from pLMN patients. AMR and SMR were significantly lower in patients with bulbar onset than those with spinal onset, both with and without bulbar symptoms. Furthermore, these values were also lower in patients with spinal onset associated with bulbar symptoms than in those with spinal onset alone. Additionally, AMR and SMR values correlated with the degree of dysphagia. Acoustic voice analysis may be considered a useful prognostic tool to differentiate spastic and flaccid dysarthria and to assess the degree of bulbar involvement in ALS.
Collapse
Affiliation(s)
- Giammarco Milella
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Diletta Sciancalepore
- Otolaryngology Unit, Department of Translational Biomedicine and Neurosciences (DiBraiN), University of Bari Aldo Moro, 70121 Bari, Italy; (D.S.); (G.C.); (M.L.F.)
| | - Giada Cavallaro
- Otolaryngology Unit, Department of Translational Biomedicine and Neurosciences (DiBraiN), University of Bari Aldo Moro, 70121 Bari, Italy; (D.S.); (G.C.); (M.L.F.)
| | - Glauco Piccirilli
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Alfredo Gabriele Nanni
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Angela Fraddosio
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Eustachio D’Errico
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Damiano Paolicelli
- Neurology Unit, Department of Translational Biomedicine and Neurosciences, 70121 Bari, Italy; (G.M.); (G.P.); (A.G.N.); (A.F.); (E.D.); (D.P.)
| | - Maria Luisa Fiorella
- Otolaryngology Unit, Department of Translational Biomedicine and Neurosciences (DiBraiN), University of Bari Aldo Moro, 70121 Bari, Italy; (D.S.); (G.C.); (M.L.F.)
| | | |
Collapse
|
6
|
Moya-Galé G, Wisler AA, Walsh SJ, McAuliffe MJ, Levy ES. Acoustic Predictors of Ease of Understanding in Spanish Speakers With Dysarthria Associated With Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:2999-3012. [PMID: 36508721 DOI: 10.1044/2022_jslhr-22-00284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
PURPOSE The purpose of this study was to examine selected baseline acoustic features of hypokinetic dysarthria in Spanish speakers with Parkinson's disease (PD) and identify potential acoustic predictors of ease of understanding in Spanish. METHOD Seventeen Spanish-speaking individuals with mild-to-moderate hypokinetic dysarthria secondary to PD and eight healthy controls were recorded reading a translation of the Rainbow Passage. Acoustic measures of vowel space area, as indicated by the formant centralization ratio (FCR), envelope modulation spectra (EMS), and articulation rate were derived from the speech samples. Additionally, 15 healthy adults rated ease of understanding of the recordings on a visual analogue scale. A multiple linear regression model was implemented to investigate the predictive value of the selected acoustic parameters on ease of understanding. RESULTS Listeners' ease of understanding was significantly lower for speakers with dysarthria than for healthy controls. The FCR, EMS from the first 10 s of the reading passage, and the difference in EMS between the end and the beginning sections of the passage differed significantly between the two groups of speakers. Findings indicated that 67.7% of the variability in ease of understanding was explained by the predictive model, suggesting a moderately strong relationship between the acoustic and perceptual domains. CONCLUSIONS Measures of envelope modulation spectra were found to be highly significant model predictors of ease of understanding of Spanish-speaking individuals with hypokinetic dysarthria associated with PD. Articulation rate was also found to be important (albeit to a lesser degree) in the predictive model. The formant centralization ratio should be further examined with a larger sample size and more severe dysarthria to determine its efficacy in predicting ease of understanding.
Collapse
Affiliation(s)
| | | | | | | | - Erika S Levy
- Teachers College, Columbia University, New York, NY
| |
Collapse
|
7
|
Kuo C, Berry J. The Relationship Between Acoustic and Kinematic Vowel Space Areas With and Without Normalization for Speakers With and Without Dysarthria. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023; 32:1923-1937. [PMID: 37105919 PMCID: PMC10561967 DOI: 10.1044/2023_ajslp-22-00158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 09/09/2022] [Accepted: 01/17/2023] [Indexed: 06/19/2023]
Abstract
PURPOSE Few studies have reported on the vowel space area (VSA) in both acoustic and kinematic domains. This study examined acoustic and kinematic VSAs for speakers with and without dysarthria and evaluated effects of normalization on acoustic and kinematic VSAs and the relationship between these measures. METHOD Vowel data from 12 speakers with and without dysarthria, presenting with a range of speech abilities, were examined. The speakers included four speakers with Parkinson's disease (PD), four speakers with brain injury (BI), and four neurotypical (NT) speakers. Speech acoustic and kinematic data were acquired simultaneously using electromagnetic articulography during a passage reading task. Raw and normalized VSAs calculated from corner vowels /i/, /æ/, /ɑ/, and /u/ were evaluated. Normalization was achieved through z score transformations to the acoustic and kinematic data. The effect of normalization on variability within and across groups was evaluated. Regression analysis was used across speakers to assess the association between acoustic and kinematic VSAs for both raw and normalized data. RESULTS When evaluating the speakers as three different groups (i.e., PD, BI, and NT), normalization reduced the standard deviations within each group and changed the relative differences in average magnitude between groups. Regression analysis revealed a significant relationship between normalized, but not raw, acoustic and kinematic VSAs, after the exclusion of an outlier speaker. CONCLUSIONS Normalization reduces the variability across speakers, within groups, and changes average magnitudes affecting speaker group comparisons. Normalization also influences the correlation between acoustic and kinematic measures. Further investigation of the impact of normalization techniques upon acoustic and kinematic measures is warranted. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.22669747.
Collapse
Affiliation(s)
- Christina Kuo
- Department of Communication Sciences and Disorders, James Madison University, Harrisonburg, VA
| | - Jeffrey Berry
- Department of Speech Pathology and Audiology, Marquette University, Milwaukee, WI
| |
Collapse
|
8
|
Thompson A, Hirsch ME, Lansford KL, Kim Y. Vowel Acoustics as Predictors of Speech Intelligibility in Dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:3100-3114. [PMID: 36795536 PMCID: PMC10569402 DOI: 10.1044/2022_jslhr-22-00287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 09/27/2022] [Accepted: 12/12/2022] [Indexed: 06/18/2023]
Abstract
PURPOSE This study sought to determine if alternative vowel space area (VSA) measures (i.e., novel trajectory-based measures: vowel space hull area and vowel space density) predicted speech intelligibility to the same extent as two traditional vowel measures (i.e., token-based measures: VSA and corner dispersion) in speakers with dysarthria. Additionally, this study examined if the strength of the relationship between acoustic vowel measures and intelligibility differed based on how intelligibility was measured (i.e., orthographic transcriptions [OTs] and visual analog scale [VAS] ratings). METHOD The Grandfather Passage was read aloud by 40 speakers with dysarthria of varying etiologies, including Parkinson's disease (n = 10), amyotrophic lateral sclerosis (n = 10), Huntington's disease (n = 10), and cerebellar ataxia (n = 10). Token- and trajectory-based acoustic vowel measures were calculated from the passage. Naïve listeners (N = 140) were recruited via crowdsourcing to provide OTs and VAS intelligibility ratings. Hierarchical linear regression models were created to model OTs and VAS intelligibility ratings using the acoustic vowel measures as predictors. RESULTS Traditional VSA was the sole significant predictor of speech intelligibility for both the OTs (R 2 = .259) and VAS (R 2 = .236) models. In contrast, the trajectory-based measures were not significant predictors of intelligibility. Additionally, the OTs and VAS intelligibility ratings conveyed similar information. CONCLUSIONS The findings suggest that traditional token-based vowel measures better predict intelligibility than trajectory-based measures. Additionally, the findings suggest that VAS methods are comparable to OT methods for estimating speech intelligibility for research purposes.
Collapse
Affiliation(s)
- Austin Thompson
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | - Micah E. Hirsch
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | - Kaitlin L. Lansford
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | - Yunjung Kim
- School of Communication Science and Disorders, Florida State University, Tallahassee
| |
Collapse
|
9
|
Lehner K, Ziegler W. Clinical measures of communication limitations in dysarthria assessed through crowdsourcing: specificity, sensitivity, and retest-reliability. CLINICAL LINGUISTICS & PHONETICS 2022; 36:988-1009. [PMID: 34763588 DOI: 10.1080/02699206.2021.1979658] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 08/05/2021] [Accepted: 09/05/2021] [Indexed: 06/13/2023]
Abstract
Assessing the impact of dysarthria on a patient's ability to communicate should be an integral part of patient management. However, due to the high demands on reliable quantification of communication limitations, hardly any formal clinical tests with approved psychometric properties have been developed so far. This study investigates a web-based assessment of communication impairment in dysarthria, named KommPaS. The test comprises measures of intelligibility, naturalness, perceived listener effort and communication efficiency, as well as a total score that integrates these parameters. The approach is characterized by a quasi-random access to a large inventory of test materials and to a large group of naïve listeners, recruited via crowdsourcing. As part of a larger research program to establish the clinical applicability of this new approach, the present paper focuses on two psychometric issues, namely specificity and sensitivity (study 1) and retest-reliability (study 2). Study 1: KommPaS was administered to 54 healthy adults and 100 adult persons with dysarthria (PWD). Non-parametric criterion-based norms (specificity: 0.95) were used to derive a standard metric for each of the four component variables, and corresponding sensitivity values for the presence of dysarthria were identified. Overall classification accuracy of the total score was determined using a ROC analysis. The resulting cutscores showed a high accuracy in the separation of PWD from healthy speakers for the naturalness and the total score. Study 2: A sub-group of 20 PWD enrolled in study 1 were administered a second KommPaS examination. ICC analyses revealed good to excellent retest reliabilities for all parameters.
Collapse
Affiliation(s)
- Katharina Lehner
- Clinical Neuropsychology Research Group, Institute of Phonetics and Speech Processing, Ludwig-Maximilians-University, Munich, Germany
| | - Wolfram Ziegler
- Clinical Neuropsychology Research Group, Institute of Phonetics and Speech Processing, Ludwig-Maximilians-University, Munich, Germany
| |
Collapse
|
10
|
Marczyk A, O'Brien B, Tremblay P, Woisard V, Ghio A. Correlates of vowel clarity in the spectrotemporal modulation domain: Application to speech impairment evaluation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:2675. [PMID: 36456260 DOI: 10.1121/10.0015024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 10/13/2022] [Indexed: 06/17/2023]
Abstract
This article reports on vowel clarity metrics based on spectrotemporal modulations of speech signals. Motivated by previous findings on the relevance of modulation-based metrics for speech intelligibility assessment and pathology classification, the current study used factor analysis to identify regions within a bi-dimensional modulation space, the magnitude power spectrum, as in Elliott and Theunissen [(2009). PLoS Comput. Biol. 5(3), e1000302] by relating them to a set of conventional acoustic metrics of vowel space area and vowel distinctiveness. Two indices based on the energy ratio between high and low modulation rates across temporal and spectral dimensions of the modulation space emerged from the analyses. These indices served as input for measurements of central tendency and classification analyses that aimed to identify vowel-related speech impairments in French native speakers with head and neck cancer (HNC) and Parkinson dysarthria (PD). Following the analysis, vowel-related speech impairment was identified in HNC speakers, but not in PD. These results were consistent with findings based on subjective evaluations of speech intelligibility. The findings reported are consistent with previous studies indicating that impaired speech is associated with attenuation in energy in higher spectrotemporal modulation bands.
Collapse
Affiliation(s)
- Anna Marczyk
- Aix-Marseille Université, CNRS, LPL, UMR 7309, Aix-en-Provence, France
| | - Benjamin O'Brien
- Aix-Marseille Université, CNRS, LPL, UMR 7309, Aix-en-Provence, France
| | - Pascale Tremblay
- Universite Laval, Faculte de Medecine, Departement de Readaptation, Quebec City, Quebec G1V 0A6, Canada
| | | | - Alain Ghio
- Aix-Marseille Université, CNRS, LPL, UMR 7309, Aix-en-Provence, France
| |
Collapse
|
11
|
Hirsch ME, Thompson A, Kim Y, Lansford KL. The Reliability and Validity of Speech-Language Pathologists’ Estimations of Intelligibility in Dysarthria. Brain Sci 2022; 12:brainsci12081011. [PMID: 36009074 PMCID: PMC9406197 DOI: 10.3390/brainsci12081011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 07/27/2022] [Accepted: 07/28/2022] [Indexed: 02/01/2023] Open
Abstract
This study examined the reliability and validity of speech-language pathologists’ (SLP) estimations of speech intelligibility in dysarthria, including a visual analog scale (VAS) method and a percent estimation method commonly used in clinical settings. Speech samples from 20 speakers with dysarthria of varying etiologies were used to collect orthographic transcriptions from naïve listeners n=70 and VAS ratings and percent estimations of intelligibility from SLPs n=21. Intra- and interrater reliability for the two SLP intelligibility measures were evaluated, and the relationship between these measures was assessed. Finally, linear regression was used to evaluate the relationship between the naïve listeners’ orthographic transcription scores and the two SLP intelligibility measures. The results indicated that the intrarater reliability for both SLP intelligibility measures was strong, and the interrater reliability between the SLP ratings was moderate to excellent. A moderate positive relationship between SLPs’ VAS ratings and percent estimations was also observed. Finally, both SLPs’ percent estimations and VAS ratings were predictive of naïve listeners’ orthographic transcription scores, with SLPs’ percent estimations being the strongest predictor. In conclusion, the average SLP percent estimations and VAS ratings are valid and reliable intelligibility measures. However, the validity and reliability of these measures vary between SLPs.
Collapse
|
12
|
Olmstead AJ, Lee J, Chen J. Perceptual Learning of Altered Vowel Space Improves Identification of Vowels Produced by Individuals With Dysarthria Secondary to Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2204-2214. [PMID: 35623135 DOI: 10.1044/2022_jslhr-21-00567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
PURPOSE This study examines the efficacy of perceptual training for improving typical listeners' identification of vowels produced by individuals with dysarthria. We examined whether training on a subset of vowels can generalize to (a) untrained vowels and (b) other speakers with similar overall intelligibility. METHOD Sixty naive listeners completed a pretest/posttest perceptual learning task. In the pretraining test and posttraining test, participants identified nine American English monophthongs produced by two speakers with dysarthria secondary to amyotrophic lateral sclerosis (ALS). In the 20-min training task, a two-alternative forced choice (2AFC) task with feedback trained listeners on a subset of the vowels and speakers presented in the pretraining test. RESULTS Vowel identification accuracy improved overall as a function of training. However, patterns of generalization between speakers and vowel types were not symmetric. Specifically, listeners generalized training from front vowels to back vowels but not vice versa. Likewise, listeners generalized from one speaker to another but not in the opposite direction. Examination of confusion matrices for the pretraining and posttraining revealed complex patterns of vowel-specific improvement. CONCLUSIONS This study demonstrates that listeners benefit from a very simple training paradigm targeting vowels. Additionally, error patterns revealed that vowels are both resistant to and responsive to perceptual learning. Implications for future research and clinical training paradigms are discussed.
Collapse
Affiliation(s)
- Annie J Olmstead
- Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park
| | - Jimin Lee
- Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park
| | - Janice Chen
- Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park
| |
Collapse
|
13
|
Carl M, Levy ES, Icht M. Speech treatment for Hebrew-speaking adolescents and young adults with developmental dysarthria: A comparison of mSIT and Beatalk. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2022; 57:660-679. [PMID: 35363414 DOI: 10.1111/1460-6984.12715] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 02/16/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Individuals with developmental dysarthria typically demonstrate reduced functioning of one or more of the speech subsystems, which negatively impacts speech intelligibility and communication within social contexts. A few treatment approaches are available for improving speech production and intelligibility among individuals with developmental dysarthria. However, these approaches have only limited application and research findings among adolescents and young adults. AIMS To determine and compare the effectiveness of two treatment approaches, the modified Speech Intelligibility Treatment (mSIT) and the Beatalk technique, on speech production and intelligibility among Hebrew-speaking adolescents and young adults with developmental dysarthria. METHODS & PROCEDURES Two matched groups of adolescents and young adults with developmental dysarthria participated in the study. Each received one of the two treatments, mSIT or Beatalk, over the course of 9 weeks. Measures of speech intelligibility, articulatory accuracy, voice and vowel acoustics were assessed both pre- and post-treatment. OUTCOMES & RESULTS Both the mSIT and Beatalk groups demonstrated gains in at least some of the outcome measures. Participants in the mSIT group exhibited improvement in speech intelligibility and voice measures, while participants in the Beatalk group demonstrated increased articulatory accuracy and gains in voice measures from pre- to post-treatment. Significant increases were noted post-treatment for first formant values for select vowels. CONCLUSIONS & IMPLICATIONS Results of this preliminary study are promising for both treatment approaches. The differentiated results indicate their distinct application to speech intelligibility deficits. The current findings also hold clinical significance for treatment among adolescents and young adults with motor speech disorders and application for a language other than English. WHAT THIS PAPER ADDS What is already known on the subject Developmental dysarthria (e.g., secondary to cerebral palsy) is a motor speech disorder that negatively impacts speech intelligibility, and thus communication participation. Select treatment approaches are available with the aim of improving speech intelligibility in individuals with developmental dysarthria; however, these approaches are limited in number and have only seldomly been applied specifically to adolescents and young adults. What this paper adds to existing knowledge The current study presents preliminary data regarding two treatment approaches, the mSIT and Beatalk technique, administered to Hebrew-speaking adolescents and young adults with developmental dysarthria in a group setting. Results demonstrate the initial effectiveness of the treatment approaches, with different gains noted for each approach across speech and voice domains. What are the potential or actual clinical implications of this work? The findings add to the existing literature on potential treatment approaches aiming to improve speech production and intelligibility among individuals with developmental dysarthria. The presented approaches also show promise for group-based treatments as well as the potential for improvement among adolescents and young adults with motor speech disorders.
Collapse
Affiliation(s)
- Micalle Carl
- Department of Communication Disorders, Ariel University, Ariel, Israel
| | - Erika S Levy
- Teachers College, Columbia University, New York, NY, USA
| | - Michal Icht
- Department of Communication Disorders, Ariel University, Ariel, Israel
| |
Collapse
|
14
|
Tamura T, Tanaka Y, Watanabe Y, Sato K. Relationships between maximum tongue pressure and second formant transition in speakers with different types of dysarthria. PLoS One 2022; 17:e0264995. [PMID: 35259200 PMCID: PMC8903301 DOI: 10.1371/journal.pone.0264995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 02/21/2022] [Indexed: 11/18/2022] Open
Abstract
The effects of muscle weakness on speech are currently not fully known. We investigated the relationships between maximum tongue pressure and second formant transition in adults with different types of dysarthria. It focused on the slope in the second formant transition because it reflects the tongue velocity during articulation. Sixty-three Japanese speakers with dysarthria (median age, 68 years; interquartile range, 58–77 years; 44 men and 19 women) admitted to acute and convalescent hospitals were included. Thirty neurologically normal speakers aged 19–85 years (median age, 22 years; interquartile range, 21.0–23.8 years; 14 men and 16 women) were also included. The relationship between the maximum tongue pressure and speech function was evaluated using correlation analysis in the dysarthria group. Speech intelligibility, the oral diadochokinesis rate, and the second formant slope were based on the impaired speech index. More than half of the speakers had mild to moderate dysarthria. Speakers with dysarthria showed significantly lower maximum tongue pressure, speech intelligibility, oral diadochokinesis rate, and second formant slope than neurologically normal speakers. Only the second formant slope was significantly correlated with the maximum tongue pressure (r = 0.368, p = 0.003). The relationship between the second formant slope and maximum tongue pressure showed a similar correlation in the analysis of subgroups divided by sex. The oral diadochokinesis rate, which is related to the speed of articulation, is affected by voice on/off, mandibular opening/closing, and range of motion. In contrast, the second formant slope was less affected by these factors. These results suggest that the maximum isometric tongue strength is associated with tongue movement speed during articulation.
Collapse
Affiliation(s)
- Toshiaki Tamura
- Department of Speech, Language, and Hearing Sciences, Niigata University of Health and Welfare, Niigata city, Niigata, Japan
- Major in Health and Welfare, Niigata University of Health and Welfare Graduate School, Niigata city, Niigata, Japan
- * E-mail:
| | - Yasuhiro Tanaka
- Faculty of Psychological and Physical Science, Aichi Gakuin University, Nisshin city, Aichi, Japan
| | - Yoshihiro Watanabe
- Department of Rehabilitation, Uonuma Kikan Hospital, Uonuma city, Niigata, Japan
| | - Katsuro Sato
- Department of Speech, Language, and Hearing Sciences, Niigata University of Health and Welfare, Niigata city, Niigata, Japan
- Major in Health and Welfare, Niigata University of Health and Welfare Graduate School, Niigata city, Niigata, Japan
| |
Collapse
|
15
|
Rowe HP, Stipancic KL, Lammert AC, Green JR. Validation of an Acoustic-Based Framework of Speech Motor Control: Assessing Criterion and Construct Validity Using Kinematic and Perceptual Measures. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:4736-4753. [PMID: 34735295 PMCID: PMC9150673 DOI: 10.1044/2021_jslhr-21-00201] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 06/29/2021] [Accepted: 08/16/2021] [Indexed: 05/19/2023]
Abstract
PURPOSE This study investigated the criterion (analytical and clinical) and construct (divergent) validity of a novel, acoustic-based framework composed of five key components of motor control: Coordination, Consistency, Speed, Precision, and Rate. METHOD Acoustic and kinematic analyses were performed on audio recordings from 22 subjects with amyotrophic lateral sclerosis during a sequential motion rate task. Perceptual analyses were completed by two licensed speech-language pathologists, who rated each subject's speech on the five framework components and their overall severity. Analytical and clinical validity were assessed by comparing performance on the acoustic features to their kinematic correlates and to clinician ratings of the five components, respectively. Divergent validity of the acoustic-based framework was then assessed by comparing performance on each pair of acoustic features to determine whether the features represent distinct articulatory constructs. Bivariate correlations and partial correlations with severity as a covariate were conducted for each comparison. RESULTS Results revealed moderate-to-strong analytical validity for every acoustic feature, both with and without controlling for severity, and moderate-to-strong clinical validity for all acoustic features except Coordination, without controlling for severity. When severity was included as a covariate, the strong associations for Speed and Precision became weak. Divergent validity was supported by weak-to-moderate pairwise associations between all acoustic features except Speed (second-formant [F2] slope of consonant transition) and Precision (between-consonant variability in F2 slope). CONCLUSIONS This study demonstrated that the acoustic-based framework has potential as an objective, valid, and clinically useful tool for profiling articulatory deficits in individuals with speech motor disorders. The findings also suggest that compared to clinician ratings, instrumental measures are more sensitive to subtle differences in articulatory function. With further research, this framework could provide more accurate and reliable characterizations of articulatory impairment, which may eventually increase clinical confidence in the diagnosis and treatment of patients with different articulatory phenotypes.
Collapse
Affiliation(s)
| | - Kaila L. Stipancic
- MGH Institute of Health Professions, Boston, MA
- Department of Communicative Disorders and Sciences, The State University of New York at Buffalo
| | - Adam C. Lammert
- Department of Biomedical Engineering, Worcester Polytechnic Institute, MA
| | | |
Collapse
|
16
|
Soriano JU, Olivieri A, Hustad KC. Utility of the Intelligibility in Context Scale for Predicting Speech Intelligibility of Children with Cerebral Palsy. Brain Sci 2021; 11:1540. [PMID: 34827539 PMCID: PMC8615948 DOI: 10.3390/brainsci11111540] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Revised: 11/11/2021] [Accepted: 11/12/2021] [Indexed: 11/22/2022] Open
Abstract
The Intelligibility in Context Scale (ICS) is a widely used, efficient tool for describing a child's speech intelligibility. Few studies have explored the relationship between ICS scores and transcription intelligibility scores, which are the gold standard for clinical measurement. This study examined how well ICS composite scores predicted transcription intelligibility scores among children with cerebral palsy (CP), how well individual questions from the ICS differentially predicted transcription intelligibility scores, and how well the ICS composite scores differentiated between children with and without speech motor impairment. Parents of 48 children with CP, who were approximately 13 years of age, completed the ICS. Ninety-six adult naïve listeners provided orthographic transcriptions of children's speech. Transcription intelligibility scores were regressed on ICS composite scores and individual item scores. Dysarthria status was regressed on ICS composite scores. Results indicated that ICS composite scores were moderately strong predictors of transcription intelligibility scores. One individual ICS item differentially predicted transcription intelligibility scores, and dysarthria severity influenced how well ICS composite scores differentiated between children with and without speech motor impairment. Findings suggest that the ICS has potential clinical utility for children with CP, especially when used with other objective measures of speech intelligibility.
Collapse
Affiliation(s)
- Jennifer U. Soriano
- Waisman Center, University of Wisconsin–Madison, Madison, WI 53705, USA;
- Department of Communication Sciences & Disorders, University of Wisconsin–Madison, Madison, WI 53706, USA;
| | - Abby Olivieri
- Department of Communication Sciences & Disorders, University of Wisconsin–Madison, Madison, WI 53706, USA;
| | - Katherine C. Hustad
- Waisman Center, University of Wisconsin–Madison, Madison, WI 53705, USA;
- Department of Communication Sciences & Disorders, University of Wisconsin–Madison, Madison, WI 53706, USA;
| |
Collapse
|
17
|
Knowles T, Adams SG, Jog M. Speech Rate Mediated Vowel and Stop Voicing Distinctiveness in Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:4096-4123. [PMID: 34582276 DOI: 10.1044/2021_jslhr-21-00160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Purpose The purpose of this study was to quantify changes in acoustic distinctiveness in two groups of talkers with Parkinson's disease as they modify across a wide range of speaking rates. Method People with Parkinson's disease with and without deep brain stimulation and older healthy controls read 24 carrier phrases at different speech rates. Target nonsense words in the carrier phrases were designed to elicit stop consonants and corner vowels. Participants spoke at seven self-selected speech rates from very slow to very fast, elicited via magnitude production. Speech rate was measured in absolute words per minute and as a proportion of each talker's habitual rate. Measures of segmental distinctiveness included a temporal consonant measure, namely, voice onset time, and a spectral vowel measure, namely, vowel articulation index. Results All talkers successfully modified their rate of speech from slow to fast. Talkers with Parkinson's disease and deep brain stimulation demonstrated greater baseline speech impairment and produced smaller proportional changes at the fast end of the continuum. Increasingly slower speaking rates were associated with increased temporal contrasts (voice onset time) but not spectral contrasts (vowel articulation). Faster speech was associated with decreased contrasts in both domains. Talkers with deep brain stimulation demonstrated more aberrant productions across all speaking rates. Conclusions Findings suggest that temporal and spectral segmental distinctiveness are asymmetrically affected by speaking rate modifications in Parkinson's disease. Talkers with deep brain stimulation warrant further investigation with regard to speech changes they make as they adjust their speaking rate.
Collapse
Affiliation(s)
- Thea Knowles
- Department of Communicative Disorders and Sciences, University at Buffalo, NY
| | - Scott G Adams
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Health & Rehabilitation Sciences, Western University, London, Ontario, Canada
- Department of Clinical Neurological Sciences, University Hospital, London, Ontario, Canada
| | - Mandar Jog
- Department of Clinical Neurological Sciences, University Hospital, London, Ontario, Canada
| |
Collapse
|
18
|
van Brenk F, Kain A, Tjaden K. Investigating Acoustic Correlates of Intelligibility Gains and Losses During Slowed Speech: A Hybridization Approach. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:1343-1360. [PMID: 34048663 PMCID: PMC8702861 DOI: 10.1044/2021_ajslp-20-00172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Purpose This exploratory study sought to identify acoustic variables explaining rate-related variation in intelligibility for speakers with dysarthria secondary to multiple sclerosis. Method Seven speakers with dysarthria due to multiple sclerosis produced the same set of Harvard sentences at habitual and slow rates. Speakers were selected from a larger corpus on the basis of rate-related intelligibility characteristics. Four speakers demonstrated improved intelligibility and three speakers demonstrated reduced intelligibility when rate was slowed. A speech analysis resynthesis paradigm termed hybridization was used to create stimuli in which segmental (i.e., short-term spectral) and suprasegmental variables (i.e., sentence-level fundamental frequency, energy characteristics, and duration) of sentences produced at the slow rate were donated individually or in combination to habitually produced sentences. Online crowdsourced orthographic transcription was used to quantify intelligibility for six hybridized sentence types and the original habitual and slow productions. Results Sentence duration alone was not a contributing factor to improved intelligibility associated with slowed rate. Speakers whose intelligibility improved with slowed rate showed higher intelligibility scores for duration spectrum hybrids and energy hybrids compared to the original habitual rate sentences, suggesting these acoustic cues contributed to improved intelligibility for sentences produced with a slowed rate. Energy contour characteristics were also found to play a role in intelligibility losses for speakers with decreased intelligibility at slowed rate. The relative contribution of speech acoustic variables to intelligibility gains and losses varied considerably between speakers. Conclusions Hybridization can be used to identify acoustic correlates of intelligibility variation associated with slowed rate. This approach has further elucidated speaker-specific and individualized speech production adjustments when slowing rate.
Collapse
Affiliation(s)
- Frits van Brenk
- Department of Communicative Disorders and Sciences, University at Buffalo, NY
| | - Alexander Kain
- Department of Pediatrics, Oregon Health & Science University, Portland
| | - Kris Tjaden
- Department of Communicative Disorders and Sciences, University at Buffalo, NY
| |
Collapse
|
19
|
Knowles T, Adams SG, Jog M. Variation in Speech Intelligibility Ratings as a Function of Speech Rate Modification in Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1773-1793. [PMID: 33950711 DOI: 10.1044/2021_jslhr-20-00593] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose The aim of this study was to quantify changes in speech intelligibility in two cohorts of people with Parkinson's disease (PD; those with and without deep brain stimulation [DBS]) across a broad range of self-selected speech rate alterations in (a) read sentences and (b) extemporaneous speech (monologues). Method Four speaker groups participated in this study: younger and older controls, people with PD undergoing standard pharmaceutical treatment, and people with PD and DBS. Naïve listeners rated the intelligibility of read sentences and extemporaneous monologues, spoken by participants at seven self-selected speech rates from very slow to very fast. Intelligibility was modeled as a function of group, speech rate condition, and speech task. Results Overall, compared to habitual speech rate, slower speech rate conditions were not associated with changes in speech intelligibility, whereas faster-than-habitual conditions were associated in declines in intelligibility. Results were mediated by group and task effects, such that talkers with PD and DBS were more likely to see intelligibility benefits at slower self-selected speech rates and less likely to see detriments at faster rates, and these differences were amplified in monologues compared to sentences. Conclusion Findings suggest differences in the ways in which slower and faster speech rate adjustments impact speech intelligibility in people with PD with and without DBS, with the latter demonstrating greater magnitudes of change.
Collapse
Affiliation(s)
- Thea Knowles
- Department of Communicative Disorders and Sciences, University at Buffalo, NY
| | - Scott G Adams
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
- Department of Clinical Neurological Sciences, University Hospital, London, Ontario, Canada
| | - Mandar Jog
- Department of Clinical Neurological Sciences, University Hospital, London, Ontario, Canada
| |
Collapse
|
20
|
Borrie SA, Lansford KL, Barrett TS. A Clinical Advantage: Experience Informs Recognition and Adaptation to a Novel Talker With Dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1503-1514. [PMID: 33831307 PMCID: PMC8608160 DOI: 10.1044/2021_jslhr-20-00663] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 12/22/2020] [Accepted: 01/21/2021] [Indexed: 05/19/2023]
Abstract
Purpose Perceptual training paradigms, which leverage the mechanism of perceptual learning, show that naïve listeners, those with no prior experience with dysarthria, benefit from explicit familiarization with a talker with dysarthria. It is theorized that familiarization affords listeners an opportunity to acquire distributional knowledge of the degraded speech signal. Here, we extend investigations to clinically experienced listeners, speech-language pathologists (SLPs), and advance models of listener recognition and adaptation to dysarthric speech. Method Forty-seven SLPs completed a standard three-phase perceptual training protocol (pretest, familiarization, and posttest) with a novel talker with dysarthria. Intelligibility scores were compared with historical data from naïve listeners. Potential relationships between intelligibility scores and characteristics of clinical experience were examined. Results Intelligibility scores of SLPs improved by an average of 19% from pretest to posttest. This intelligibility improvement was lower than naïve listeners, although the difference was small. Moreover, clinical characteristics related to level of dysarthria experience (e.g., percent of caseload composed of dysarthria) predicted pretest/initial intelligibility. No predictive relationships between clinical characteristics and intelligibility improvement were revealed. Conclusions As a group, SLPs benefitted from perceptual training, suggesting that, despite prior experience, the opportunity to acquire knowledge of talker-specific cue distributions is crucial for optimal adaptation. However, SLPs with greater dysarthria experience were better at initially understanding the talker with dysarthria. This suggests that, through regular interaction with individuals with dysarthria, clinicians acquire knowledge of the cue distributions of dysarthric speech more generally and can generalize this group-specific knowledge to aid in understanding other talkers with dysarthria. Consistent with theoretical models of perceptual learning, both talker- and group-specific knowledge informed recognition and adaptation to dysarthric speech.
Collapse
Affiliation(s)
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
| | - Kaitlin L. Lansford
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | | |
Collapse
|
21
|
Pommée T, Balaguer M, Pinquier J, Mauclair J, Woisard V, Speyer R. Relationship between phoneme-level spectral acoustics and speech intelligibility in healthy speech: a systematic review. SPEECH, LANGUAGE AND HEARING 2021. [DOI: 10.1080/2050571x.2021.1913300] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Timothy Pommée
- Institut de Recherche en Informatique de Toulouse, CNRS, Université de Toulouse – Paul Sabatier, Toulouse, France
| | - Mathieu Balaguer
- Institut de Recherche en Informatique de Toulouse, CNRS, Université de Toulouse – Paul Sabatier, Toulouse, France
- Centre Hospitalier Universitaire Larrey, Toulouse, France
| | - Julien Pinquier
- Institut de Recherche en Informatique de Toulouse, CNRS, Université de Toulouse – Paul Sabatier, Toulouse, France
| | - Julie Mauclair
- Institut de Recherche en Informatique de Toulouse, CNRS, Université de Toulouse – Paul Sabatier, Toulouse, France
| | - Virginie Woisard
- Centre Hospitalier Universitaire Larrey, Toulouse, France
- Oncopole, Toulouse, France
- Unité de Recherche Interdisciplinaire Octogone Lordat, Maison de la Recherche, Université de Toulouse – Jean-Jaurès, Toulouse, France
| | - Renée Speyer
- Faculty of Educational Sciences, University of Oslo, Oslo, Norway
| |
Collapse
|
22
|
Advances in Parkinson's Disease detection and assessment using voice and speech: A review of the articulatory and phonatory aspects. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102418] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
23
|
Carl M, Icht M. Acoustic vowel analysis and speech intelligibility in young adult Hebrew speakers: Developmental dysarthria versus typical development. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2021; 56:283-298. [PMID: 33522087 DOI: 10.1111/1460-6984.12598] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 12/08/2020] [Accepted: 12/31/2020] [Indexed: 06/12/2023]
Abstract
BACKGROUND Developmental dysarthria is a motor speech impairment commonly characterized by varying levels of reduced speech intelligibility. The relationship between intelligibility deficits and acoustic vowel space among these individuals has long been noted in the literature, with evidence of vowel centralization (e.g., in English and Mandarin). However, the degree to which this centralization occurs and the intelligibility-acoustic relationship is maintained in different vowel systems has yet to be studied thoroughly. In comparison with American English, the Hebrew vowel system is significantly smaller, with a potentially smaller vowel space area, a factor that may impact upon the comparisons of the acoustic vowel space and its correlation with speech intelligibility. Data on vowel space and speech intelligibility are particularly limited for Hebrew speakers with motor speech disorders. AIMS To determine the nature and degree of vowel space centralization in Hebrew-speaking adolescents and young adults with dysarthria, in comparison with typically developing (TD) peers, and to correlate these findings with speech intelligibility scores. METHODS & PROCEDURES Adolescents and young adults with developmental dysarthria (secondary to cerebral palsy (CP) and other motor deficits, n = 17) and their TD peers (n = 17) were recorded producing Hebrew corner vowels within single words. For intelligibility assessments, naïve listeners transcribed those words produced by speakers with CP, and intelligibility scores were calculated. OUTCOMES & RESULTS Acoustic analysis of vowel formants (F1, F2) revealed a centralization of vowel space among speakers with CP for all acoustic metrics of vowel formants, and mainly for the formant centralization ratio (FCR), in comparison with TD peers. Intelligibility scores were correlated strongly with the FCR metric for speakers with CP. CONCLUSIONS & IMPLICATIONS The main results, vowel space centralization for speakers with CP in comparison with TD peers, echo previous cross-linguistic results. The correlation of acoustic results with speech intelligibility carries clinical implications. Taken together, the results contribute to better characterization of the speech production deficit in Hebrew speakers with motor speech disorders. Furthermore, they may guide clinical decision-making and intervention planning to improve speech intelligibility. What this paper adds What is already known on the subject Speech production and intelligibility deficits among individuals with developmental dysarthria (e.g., secondary to CP) are well documented. These deficits have also been correlated with centralization of the acoustic vowel space, although primarily in English speakers. Little is known about the acoustic characteristics of vowels in Hebrew speakers with motor speech disorders, and whether correlations with speech intelligibility are maintained. What this paper adds to existing knowledge This study is the first to describe the acoustic characteristics of vowel space in Hebrew-speaking adolescents and young adults with developmental dysarthria. The results demonstrate a centralization of the acoustic vowel space in comparison with TD peers for all measures, as found in other languages. Correlation between acoustic measures and speech intelligibility scores were also documented. We discuss these results within the context of cross-linguistic comparisons. What are the potential or actual clinical implications of this work? The results confirm the use of objective acoustic measures in the assessment of individuals with motor speech disorders, providing such data for Hebrew-speaking adolescents and young adults. These measures can be used to determine the nature and severity of the speech deficit across languages, may guide intervention planning, as well as measure the effectiveness of intelligibility-based treatment programmes.
Collapse
|
24
|
Hirsch ME, Lansford KL, Barrett TS, Borrie SA. Generalized Learning of Dysarthric Speech Between Male and Female Talkers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:444-451. [PMID: 33508210 PMCID: PMC8632474 DOI: 10.1044/2020_jslhr-20-00313] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 09/28/2020] [Accepted: 10/21/2020] [Indexed: 05/29/2023]
Abstract
Purpose Perceptual training is a listener-targeted means for improving intelligibility of dysarthric speech. Recent work has shown that training with one talker generalizes to a novel talker of the same sex and that the magnitude of benefit is maximized when the talkers are perceptually similar. The current study expands previous findings by investigating whether perceptual training effects generalize between talkers of different sex. Method Forty new listeners were recruited for this study and completed a pretest, familiarization, and posttest perceptual training paradigm. Historical data collected using the same three-phase protocol were included in the data analysis. All listeners were exposed to the same talker with dysarthria during the pretest and posttest phases. For the familiarization phase, listeners were exposed to one of four talkers with dysarthria, differing in sex and level of perceptual similarity to the test talker or a control talker. During the testing phases, listener transcribed phrases produced by the test talker with dysarthria. Listener transcriptions were then used to calculate a percent words correct intelligibility score. Results Multiple linear regression analysis revealed that intelligibility at posttest was not predicted by sex of the training talker. Consistent with earlier work, the magnitude of intelligibility gain was greater when the familiarization and test talkers were perceptually similar. Additional analyses revealed greater between-listeners variability in the dissimilar conditions as compared to the similar conditions. Conclusions Learning as a result of perceptual training with one talker with dysarthria generalized to another talker regardless of sex. In addition, listeners trained with perceptually similar talkers had greater and more consistent intelligibility improvement. Together, these results add to previous evidence demonstrating that learning generalizes to novel talkers with dysarthria and that perceptual training is suitable for many listeners.
Collapse
Affiliation(s)
- Micah E. Hirsch
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | - Kaitlin L. Lansford
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | | | - Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
| |
Collapse
|
25
|
Chiaramonte R, Vecchio M. A Systematic Review of Measures of Dysarthria Severity in Stroke Patients. PM R 2020; 13:314-324. [PMID: 32818305 DOI: 10.1002/pmrj.12469] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 08/05/2020] [Accepted: 08/10/2020] [Indexed: 12/29/2022]
Abstract
OBJECTIVE To define methods to measure dysarthria due to stroke and guide physicians in delineating a diagnostic protocol using the best current strategies. DESIGN Systematic review. LITERATURE SURVEY A search was conducted on PubMed, EMBASE, the Cochrane Library, and Web of Science to identify measurement methods for dysarthria severity in adults after stroke. METHODS Two reviewers independently reviewed articles and came to a consensus about which ones to include. The authors excluded all duplicates, articles involving individuals with aphasia or other speech problems other than dysarthria, and articles unrelated to stroke. Articles were included if diagnostic measures were used to examine the effectiveness of speech rehabilitation in stroke patients. SYNTHESIS The search identified 1154 articles with the keywords "stroke" OR "ictus" OR "cerebral vascular accident" AND "dysarthria" OR "Speech and Language Disorders" AND "diagnosis" OR "assessment." The reviewers analyzed 86 full texts. There were 37 publications that met the criteria and were included in the systematic review. These articles were used to describe the main methods used for measuring the severity of stroke-related dysarthria before and after speech rehabilitation. CONCLUSION Despite the range of diagnostic tools available, robust trials are lacking, and the diagnostic approaches are always different. More research is needed to find the best diagnostic methodologies and delineate a definitive diagnostic protocol.
Collapse
Affiliation(s)
- Rita Chiaramonte
- Department of Biomedical and Biotechnological Sciences, Section of Pharmacology, University of Catania, Catania, Italy
| | - Michele Vecchio
- Department of Biomedical and Biotechnological Sciences, Section of Pharmacology, University of Catania, Catania, Italy.,Rehabilitation Unit, "AOU Policlinico Vittorio Emanuele, University of Catania, Catania, Italy
| |
Collapse
|
26
|
Lee J, Kim H, Jung Y. Patterns of Misidentified Vowels in Individuals With Dysarthria Secondary to Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2649-2666. [PMID: 32777194 DOI: 10.1044/2020_jslhr-19-00237] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose The current study examines the pattern of misidentified vowels produced by individuals with dysarthria secondary to amyotrophic lateral sclerosis (ALS). Method Twenty-three individuals with ALS and 22 typical individuals produced 10 monophthongs in an /h/-vowel-/d/ context. One hundred thirty-five listeners completed a forced-choice vowel identification test. Misidentified vowels were examined in terms of the target vowel categories (front-back; low-mid-high) and the direction of misidentification (the directional pattern when the target vowel was misidentified, e.g., misidentification "to a lower vowel"). In addition, acoustic predictors of vowel misidentifications were tested based on log first formant (F1), log second formant, log F1 vowel inherent spectral change, log second formant vowel inherent spectral change, and vowel duration. Results First, high and mid vowels were more frequently misidentified than low vowels for all speaker groups. Second, front and back vowels were misidentified at a similar rate for both the Mild and Severe groups, whereas back vowels were more frequently misidentified than front vowels in typical individuals. Regarding the direction of vowel misidentification, vowel errors were mostly made within the same backness (front-back) category for all groups. In addition, more errors were found toward a lower vowel category than toward a higher vowel category in the Severe group, but not in the Mild group. Overall, log F1 difference was identified as a consistent acoustic predictor of the main vowel misidentification pattern. Conclusion Frequent misidentifications in the vowel height dimension and the acoustic predictor, F1, suggest that limited tongue height control is the major articulatory dysfunction in individuals with ALS. Clinical implications regarding this finding are discussed.
Collapse
Affiliation(s)
- Jimin Lee
- Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park
| | - Heejin Kim
- Department of Linguistics, University of Illinois at Urbana-Champaign, Urbana
| | - Yong Jung
- Graduate Program in Bioinformatics and Genomics, The Pennsylvania State University, University Park
| |
Collapse
|
27
|
Langlois C, Tucker BV, Sawatzky AN, Reed A, Boliek CA. Effects of an intensive voice treatment on articulatory function and speech intelligibility in children with motor speech disorders: A phase one study. JOURNAL OF COMMUNICATION DISORDERS 2020; 86:106003. [PMID: 32505858 DOI: 10.1016/j.jcomdis.2020.106003] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 04/14/2020] [Accepted: 04/19/2020] [Indexed: 06/11/2023]
Abstract
Producing speech that is clear, audible, and intelligible to others is a challenge for many children with cerebral palsy (CP) and children with Down syndrome (DS). Previous studies have demonstrated the effectiveness of using the Lee Silverman Voice Treatment (LSVT LOUD®) to increase vocal loudness and improve speech intelligibility in individuals with dysarthria secondary to Parkinson's disease (PD), and some research suggests that it also may be effective for individuals with dysarthria secondary to other conditions, including CP and DS. Although LSVT LOUD targets healthy vocal loudness, there is some evidence of spreading effects to the articulatory system. Acoustic data from two groups of children with secondary motor speech disorders [one with CP (n = 17) and one with DS (n = 9)] who received a full dose of LSVT LOUD and for whom post-treatment intelligibility gains have been previously reported, were analyzed for treatment effects on: 1) vowel duration, 2) acoustic vowel space and 3) the ratio of F2/i/ to F2/u/. Statistically significant changes in vowel duration and acoustic vowel space occurred pre-treatment to 12 weeks post-treatment in the CP group, and increased acoustic vowel space was observed in 5 of the DS participants. The present study provides preliminary evidence of intensive voice treatment spreading effects to the articulatory system in some children with CP and children with DS consistent with previous findings in other populations.
Collapse
Affiliation(s)
- Colette Langlois
- Faculty of Rehabilitation Medicine, University of Alberta, Canada
| | - Benjamin V Tucker
- Department of Linguistics, University of Alberta, Canada; Department of Communication Sciences and Disorders, University of Alberta, Canada
| | - Ashley N Sawatzky
- Department of Communication Sciences and Disorders, University of Alberta, Canada
| | - Alesha Reed
- Faculty of Rehabilitation Medicine, University of Alberta, Canada; Department of Communication Sciences and Disorders, University of Alberta, Canada
| | - Carol A Boliek
- Department of Communication Sciences and Disorders, University of Alberta, Canada; Neuroscience and Mental Health Institute, University of Alberta, Canada.
| |
Collapse
|
28
|
Lansford KL, Borrie SA, Barrett TS, Flechaus C. When Additional Training Isn't Enough: Further Evidence That Unpredictable Speech Inhibits Adaptation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1700-1711. [PMID: 32437259 PMCID: PMC7839029 DOI: 10.1044/2020_jslhr-19-00380] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 02/21/2020] [Accepted: 02/24/2020] [Indexed: 05/29/2023]
Abstract
Purpose Robust improvements in intelligibility following familiarization, a listener-targeted perceptual training paradigm, have been revealed for talkers diagnosed with spastic, ataxic, and hypokinetic dysarthria but not for talkers with hyperkinetic dysarthria. While the theoretical explanation for the lack of intelligibility improvement following training with hyperkinetic talkers is that there is insufficient distributional regularity in the speech signals to support perceptual adaptation, it could simply be that the standard training protocol was inadequate to facilitate learning of the unpredictable talker. In a pair of experiments, we addressed this possible alternate explanation by modifying the levels of exposure and feedback provided by the perceptual training protocol to offer listeners a more robust training experience. Method In Experiment 1, we examined the exposure modifications, testing whether perceptual adaptation to an unpredictable talker with hyperkinetic dysarthria could be achieved with greater or more diverse exposure to dysarthric speech during the training phase. In Experiment 2, we examined feedback modifications, testing whether perceptual adaptation to the unpredictable talker could be achieved with the addition of internally generated somatosensory feedback, via vocal imitation, during the training phase. Results Neither task modification led to improved intelligibility of the unpredictable talker with hyperkinetic dysarthria. Furthermore, listeners who completed the vocal imitation task demonstrated significantly reduced intelligibility at posttest. Conclusion Together, the results from Experiments 1 and 2 replicate and extend findings from our previous work, suggesting perceptual adaptation is inhibited for talkers whose speech is largely characterized by unpredictable degradations. Collectively, these results underscore the importance of integrating signal predictability into theoretical models of perceptual learning.
Collapse
Affiliation(s)
- Kaitlin L. Lansford
- School of Communication Science & Disorders, Florida State University, Tallahassee
| | - Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
| | | | - Cassidy Flechaus
- School of Communication Science & Disorders, Florida State University, Tallahassee
| |
Collapse
|
29
|
Chiaramonte R, Vecchio M. Dysarthria and stroke. The effectiveness of speech rehabilitation. A systematic review and meta-analysis of the studies. Eur J Phys Rehabil Med 2020; 57:24-43. [PMID: 32519528 DOI: 10.23736/s1973-9087.20.06242-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
INTRODUCTION Speech difficulties, such as dysarthria or aphasia, in addition to motor impairments are frequently seen in post-stroke patients. EVIDENCE ACQUISITION Literature searches with the keywords: "stroke" and "dysarthria" and "diagnosis" and "stroke" and "dysarthria" and "assessment" were conducted using PubMed, EMBASE, Cochrane Library, and Web of Science databases to perform the systematic review about the methods used to measure the severity of dysarthria in subjects post-stroke. The search was performed by two authors from 15 January to 22 February 2020. The research identified a total of 402 articles for the search using the keywords "stroke" and "dysarthria," and "diagnosis" and 84 references for the search using the keywords "stroke" and "dysarthria" and "assessment." Sixty-nine selected articles were analyzed by the reviewers. Thirty-seven publications met the inclusion criteria and were included in the systematic review. Thirty-two articles were excluded for several reasons: 1) 12 involved individuals with aphasia or other speech problems different from dysarthria; 2) 12 examined different topics from our aim; and 3) eight did not include post-stroke cases. EVIDENCE SYNTHESIS The systematic review identified methods for measuring the severity of post-stroke dysarthria. The meta-analysis showed the acoustic parameters affected in dysarthria secondary to stroke and the differences in these parameters after speech therapy. CONCLUSIONS The alternating and sequential motion rate (AMR- Pə, AMR-Tə, AMR-Kə, and SMR-PəTəKə) and maximum phonation time were significantly improved after speech rehabilitation.
Collapse
Affiliation(s)
- Rita Chiaramonte
- Section of Pharmacology, Department of Biomedical and Biotechnological Sciences, University of Catania, Catania, Italy - .,Department of Physical Medicine and Rehabilitation, ASP 7, Hospital of Scicli, Scicli, Ragusa, Italy -
| | - Michele Vecchio
- Section of Pharmacology, Department of Biomedical and Biotechnological Sciences, University of Catania, Catania, Italy.,Unit of Rehabilitation, Vittorio Emanuele University Hospital, Catania, Italy
| |
Collapse
|
30
|
Carl M, Kent RD, Levy ES, Whalen DH. Vowel Acoustics and Speech Intelligibility in Young Adults With Down Syndrome. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:674-687. [PMID: 32160481 DOI: 10.1044/2019_jslhr-19-00204] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Purpose Speech production deficits and reduced intelligibility are frequently noted in individuals with Down syndrome (DS) and are attributed to a combination of several factors. This study reports acoustic data on vowel production in young adults with DS and relates these findings to perceptual analysis of speech intelligibility. Method Participants were eight young adults with DS as well as eight age- and gender-matched typically developing (TD) controls. Several different acoustic measures of vowel centralization and variability were applied to tokens of corner vowels (/ɑ/, /æ/, /i/, /u/) produced in common English words. Intelligibility was assessed for single-word productions of speakers with DS, by means of transcriptions from 14 adult listeners. Results Group differentiation was found for some, but not all, of the acoustic measures. Low vowels were more acoustically centralized and variable in speakers with DS than TD controls. Acoustic findings were associated with overall intelligibility scores. Vowel formant dispersion was the most sensitive measure in distinguishing DS and TD formant data. Conclusion Corner vowels are differentially affected in speakers with DS. The acoustic characterization of vowel production and its association with speech intelligibility scores within the DS group support the conclusion of motor control deficits in the overall speech impairment. Implications are discussed for effective treatment planning.
Collapse
Affiliation(s)
- Micalle Carl
- Program in Speech-Language-Hearing Sciences, City University of New York Graduate Center, New York
| | | | - Erika S Levy
- Teachers College, Columbia University, New York, NY
| | - D H Whalen
- Program in Speech-Language-Hearing Sciences, City University of New York Graduate Center, New York
- Haskins Laboratories, New Haven, CT
- Yale University, New Haven, CT
| |
Collapse
|
31
|
Quantitative analysis of vowel production in cerebral palsy children with dysarthria. J Clin Neurosci 2019; 66:77-82. [DOI: 10.1016/j.jocn.2019.05.020] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Accepted: 05/21/2019] [Indexed: 11/17/2022]
|
32
|
Chiu YF, Forrest K, Loux T. Relationship Between F2 Slope and Intelligibility in Parkinson's Disease: Lexical Effects and Listening Environment. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2019; 28:887-894. [PMID: 31306601 DOI: 10.1044/2018_ajslp-msc18-18-0098] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Purpose There is a complex relationship between speech production and intelligibility of speech. The current study sought to evaluate the interaction of the factors of lexical characteristics, listening environment, and the 2nd formant transition (F2 slope) on intelligibility of speakers with Parkinson's disease (PD). Method Twelve speakers with PD and 12 healthy controls read sentences that included words with the diphthongs /aɪ/, /ɔɪ/, and /aʊ/. The F2 slope of the diphthong transition was measured and averaged across the 3 diphthongs for each speaker. Young adult listeners transcribed the sentences to assess intelligibility of words with high and low word frequency and high and low neighborhood density in quiet and noisy listening conditions. The average F2 slope and intelligibility scores were entered into regression models to examine their relationship. Results F2 slope was positively related to intelligibility in speakers with PD in both listening conditions with a stronger relationship in noise than in quiet. There was no significant relationship between F2 slope and intelligibility of healthy speakers. In the quiet condition, F2 slope was only correlated with intelligibility in less-frequent words produced by the PD group. In the noise condition, F2 slope was related to intelligibility in high- and low-frequency words and high-density words in PD. Conclusions The relationship between F2 slope and intelligibility in PD was affected by lexical factors and listening conditions. F2 slope was more strongly related to intelligibility in noise than in quiet for speakers with PD. This relationship was absent in highly frequent words presented in quiet and those with fewer lexical neighbors.
Collapse
Affiliation(s)
- Yi-Fang Chiu
- Department of Communication Sciences and Disorders, Saint Louis University, MO
| | - Karen Forrest
- Department of Speech and Hearing Sciences, Indiana University Bloomington
| | - Travis Loux
- Department of Epidemiology and Biostatistics, Saint Louis University, MO
| |
Collapse
|
33
|
Vizza P, Tradigo G, Mirarchi D, Bossio RB, Lombardo N, Arabia G, Quattrone A, Veltri P. Methodologies of speech analysis for neurodegenerative diseases evaluation. Int J Med Inform 2019; 122:45-54. [DOI: 10.1016/j.ijmedinf.2018.11.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 09/18/2018] [Accepted: 11/20/2018] [Indexed: 10/27/2022]
|
34
|
Mou Z, Chen Z, Yang J, Xu L. Acoustic properties of vowel production in Mandarin-speaking patients with post-stroke dysarthria. Sci Rep 2018; 8:14188. [PMID: 30242251 PMCID: PMC6155015 DOI: 10.1038/s41598-018-32429-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 09/05/2018] [Indexed: 11/17/2022] Open
Abstract
This study investigated the acoustic features of vowel production in Mandarin-speaking patients with post-stroke dysarthria (PSD). The subjects included 31 native Mandarin-speaking patients with PSD (age: 25–83 years old) and 38 neurologically normal adults in a similar age range (age: 21–76 years old). Each subject was recorded producing a list of Mandarin monosyllables that included six monophthong vowels (i.e., /a, i, u, ɤ, y, o/) embedded in the /CV/ context. The patients’ speech samples were evaluated by two native Mandarin speakers. The evaluation scores were then used to classify all patients into two levels of severity: mild or moderate-to-severe. Formants (F1 and F2) were extracted from each vowel token. Results showed that all vowel categories in the patients with PSD were produced with more variability than in the healthy speakers. Great overlaps between vowel categories and reduced vowel space were observed in the patients. The magnitude of the vowel dispersion and overlap between vowel categories increased as a function of the severity of the disorder. The deviations of the vowel acoustic features in the patients in comparison to the healthy speakers may provide guidance for clinical rehabilitation to improve the speech intelligibility of patients with PSD.
Collapse
Affiliation(s)
- Zhiwei Mou
- Department of Rehabilitation, the First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, 510630, China
| | - Zhuoming Chen
- Department of Rehabilitation, the First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, 510630, China
| | - Jing Yang
- Department of Communication Sciences and Disorders, University of Wisconsin-Milwaukee, Milwaukee, WI, 53211, USA
| | - Li Xu
- School of Rehabilitation and Communication Sciences, Ohio University, Athens, Ohio, 45701, USA.
| |
Collapse
|
35
|
Whitfield JA, Dromey C, Palmer P. Examining Acoustic and Kinematic Measures of Articulatory Working Space: Effects of Speech Intensity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:1104-1117. [PMID: 29710247 DOI: 10.1044/2018_jslhr-s-17-0388] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 01/24/2018] [Indexed: 06/08/2023]
Abstract
PURPOSE The purpose of this study was to examine the effect of speech intensity on acoustic and kinematic vowel space measures and conduct a preliminary examination of the relationship between kinematic and acoustic vowel space metrics calculated from continuously sampled lingual marker and formant traces. METHOD Young adult speakers produced 3 repetitions of 2 different sentences at 3 different loudness levels. Lingual kinematic and acoustic signals were collected and analyzed. Acoustic and kinematic variants of several vowel space metrics were calculated from the formant frequencies and the position of 2 lingual markers. Traditional metrics included triangular vowel space area and the vowel articulation index. Acoustic and kinematic variants of sentence-level metrics based on the articulatory-acoustic vowel space and the vowel space hull area were also calculated. RESULTS Both acoustic and kinematic variants of the sentence-level metrics significantly increased with an increase in loudness, whereas no statistically significant differences in traditional vowel-point metrics were observed for either the kinematic or acoustic variants across the 3 loudness conditions. In addition, moderate-to-strong relationships between the acoustic and kinematic variants of the sentence-level vowel space metrics were observed for the majority of participants. CONCLUSIONS These data suggest that both kinematic and acoustic vowel space metrics that reflect the dynamic contributions of both consonant and vowel segments are sensitive to within-speaker changes in articulation associated with manipulations of speech intensity.
Collapse
Affiliation(s)
- Jason A Whitfield
- Department of Communication Sciences and Disorders, Bowling Green State University, OH
| | - Christopher Dromey
- Department of Communication Disorders, Brigham Young University, Provo, UT
| | - Panika Palmer
- Department of Communication Disorders, Brigham Young University, Provo, UT
| |
Collapse
|
36
|
Borrie SA, Lansford KL, Barrett TS. Understanding dysrhythmic speech: When rhythm does not matter and learning does not happen. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:EL379. [PMID: 29857710 PMCID: PMC5959736 DOI: 10.1121/1.5037620] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 04/19/2018] [Accepted: 04/24/2018] [Indexed: 05/22/2023]
Abstract
A positive relationship between rhythm perception and improved understanding of a naturally dysrhythmic speech signal, ataxic dysarthria, has been previously reported [Borrie, Lansford, and Barrett. (2017). J. Speech Lang. Hear. Res. 60, 3110-3117]. The current follow-on investigation suggests that this relationship depends on the nature of the dysrhythmia. When the corrupted rhythm cues are relatively predictable, affording some learnable acoustic regularity, the relationship is replicated. However, this relationship is nonexistent, along with any intelligibility improvements, when the corrupted rhythm cues are unpredictable. Findings highlight a key role for rhythm perception and distributional regularities in adaptation to dysrhythmic speech.
Collapse
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84322, USA
| | - Kaitlin L Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee, Florida 32306, USA
| | - Tyson S Barrett
- Department of Kinesiology and Health Sciences, Utah State University, Logan, Utah 84322, USA
| |
Collapse
|
37
|
Knowles T, Adams S, Abeyesekera A, Mancinelli C, Gilmore G, Jog M. Deep Brain Stimulation of the Subthalamic Nucleus Parameter Optimization for Vowel Acoustics and Speech Intelligibility in Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:510-524. [PMID: 29471373 DOI: 10.1044/2017_jslhr-s-17-0157] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 11/16/2017] [Indexed: 06/08/2023]
Abstract
PURPOSE The settings of 3 electrical stimulation parameters were adjusted in 12 speakers with Parkinson's disease (PD) with deep brain stimulation of the subthalamic nucleus (STN-DBS) to examine their effects on vowel acoustics and speech intelligibility. METHOD Participants were tested under permutations of low, mid, and high STN-DBS frequency, voltage, and pulse width settings. At each session, participants recited a sentence. Acoustic characteristics of vowel production were extracted, and naive listeners provided estimates of speech intelligibility. RESULTS Overall, lower-frequency STN-DBS stimulation (60 Hz) was found to lead to improvements in intelligibility and acoustic vowel expansion. An interaction between speaker sex and STN-DBS stimulation was found for vowel measures. The combination of low frequency, mid to high voltage, and low to mid pulse width led to optimal speech outcomes; however, these settings did not demonstrate significant speech outcome differences compared with the standard clinical STN-DBS settings, likely due to substantial individual variability. CONCLUSIONS Although lower-frequency STN-DBS stimulation was found to yield consistent improvements in speech outcomes, it was not found to necessarily lead to the best speech outcomes for all participants. Nevertheless, frequency may serve as a starting point to explore settings that will optimize an individual's speech outcomes following STN-DBS surgery. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.5899228.
Collapse
Affiliation(s)
- Thea Knowles
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
| | - Scott Adams
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
- Department of Clinical Neurological Sciences, University Hospital, London, Ontario, Canada
| | - Anita Abeyesekera
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
| | - Cynthia Mancinelli
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
| | - Greydon Gilmore
- Department of Biomedical Engineering, Western University, London, Ontario, Canada
| | - Mandar Jog
- Department of Clinical Neurological Sciences, University Hospital, London, Ontario, Canada
| |
Collapse
|
38
|
Ingvalson EM, Lansford KL, Fedorova V, Fernandez G. Receptive Vocabulary, Cognitive Flexibility, and Inhibitory Control Differentially Predict Older and Younger Adults' Success Perceiving Speech by Talkers With Dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:3632-3641. [PMID: 29204601 DOI: 10.1044/2017_jslhr-h-17-0119] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 07/28/2017] [Indexed: 06/07/2023]
Abstract
PURPOSE Previous research has demonstrated equivocal findings related to the effect of listener age on intelligibility ratings of dysarthric speech. The aim of the present study was to investigate the mechanisms that support younger and older adults' perception of speech by talkers with dysarthria. METHOD Younger and older adults identified words in phrases produced by talkers with dysarthria. Listeners also completed assessments on peripheral hearing, receptive vocabulary, and executive control functions. RESULTS Older and younger adults did not differ in their ability to perceive speech by talkers with dysarthria. Younger adults' success in identifying words produced by talkers with dysarthria was associated only with their hearing acuity. In contrast, older adults showed effects of working memory and cognitive flexibility and interactions between hearing acuity and receptive vocabulary and between hearing acuity and inhibitory control. CONCLUSIONS Although older and younger adults had equivalent performance identifying words produced by talkers with dysarthria, older adults appear to utilize more cognitive support to identify those words.
Collapse
Affiliation(s)
- Erin M Ingvalson
- School of Communication Science and Disorders, Florida State University, Tallahassee
- Department of Otolaryngology-Head and Neck Surgery, Northwestern University, Chicago, IL
| | - Kaitlin L Lansford
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | - Valeriya Fedorova
- School of Communication Science and Disorders, Florida State University, Tallahassee
| | - Gabriel Fernandez
- School of Communication Science and Disorders, Florida State University, Tallahassee
| |
Collapse
|
39
|
Borrie SA, Lansford KL, Barrett TS. Generalized Adaptation to Dysarthric Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:3110-3117. [PMID: 29075754 PMCID: PMC5945074 DOI: 10.1044/2017_jslhr-s-17-0127] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2017] [Accepted: 06/02/2017] [Indexed: 05/04/2023]
Abstract
PURPOSE Generalization of perceptual learning has received limited attention in listener adaptation studies with dysarthric speech. This study investigated whether adaptation to a talker with dysarthria could be predicted by the nature of the listener's prior familiarization experience, specifically similarity of perceptual features, and level of intelligibility. METHOD Following an intelligibility pretest involving a talker with ataxic dysarthria, 160 listeners were familiarized with 1 of 7 talkers with dysarthria-who differed from the test talker in terms of perceptual similarity (same, similar, dissimilar) and level of intelligibility (low, mid, high)-or a talker with no neurological impairment (control). Listeners then completed an intelligibility posttest on the test talker. RESULTS All listeners benefited from familiarization with a talker with dysarthria; however, adaptation to the test talker was superior when the familiarization talker had similar perceptual features and reduced when the familiarization talker had low intelligibility. CONCLUSION Evidence for both generalization and specificity of learning highlights the differential value of listeners' prior experiences for adaptation to, and improved understanding of, a talker with dysarthria. These findings broaden our theoretical knowledge of adaptation to degraded speech, as well as the clinical application of training paradigms that exploit perceptual processes for therapeutic gain.
Collapse
Affiliation(s)
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
| | - Kaitlin L. Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
| | | |
Collapse
|
40
|
Reilly KJ, Pettibone C. Vowel generalization and its relation to adaptation during perturbations of auditory feedback. J Neurophysiol 2017; 118:2925-2934. [PMID: 28835529 PMCID: PMC5686240 DOI: 10.1152/jn.00702.2016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Revised: 08/21/2017] [Accepted: 08/22/2017] [Indexed: 11/22/2022] Open
Abstract
Repeated perturbations of auditory feedback during vowel production elicit changes not only in the production of the perturbed vowel (adaptation) but also in the production of nearby vowels that were not perturbed (generalization). The finding that adaptation generalizes to other, nonperturbed vowels suggests that sensorimotor representations for vowels are not independent; instead, the goals for producing any one vowel may depend in part on the goals for other vowels. The present study investigated the dependence or independence of vowel representations by evaluating adaptation and generalization in two groups of speakers exposed to auditory perturbations of their first formant (F1) during different vowels. The speakers in both groups who adapted to the perturbation exhibited generalization in two nonperturbed vowels that were produced under masking noise. Correlation testing was performed to evaluate the relations between adaptation and generalization as well as between the generalization in the two nonperturbed vowels. These tests identified significant coupling between the F1 changes of adjacent vowels but not nonadjacent vowels. The pattern of correlation findings indicates that generalization was due in part to feedforward representations that are partly shared across adjacent vowels, possibly to maintain their acoustic contrast.NEW & NOTEWORTHY Speech adaptations to alterations, or perturbations, of auditory feedback have provided important insights into sensorimotor representations underlying speech. One finding from these studies that is yet to be accounted for is vowel generalization, which describes the effects of repeated perturbations to one vowel on the production of other vowels that were not perturbed. The present study used correlation testing to quantify the effects of changes in a perturbed vowel on neighboring (i.e., similar) nonperturbed vowels. The results identified significant correlations between the changes of adjacent, but not nonadjacent, vowel pairs. This finding suggests that generalization is partly a response to adaptation and not solely due to the auditory perturbation.
Collapse
Affiliation(s)
- Kevin J Reilly
- Department of Audiology and Speech Pathology, University of Tennessee Health Science Center, Knoxville, Tennessee; and
| | | |
Collapse
|
41
|
Kim Y, Choi Y. A Cross-Language Study of Acoustic Predictors of Speech Intelligibility in Individuals With Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:2506-2518. [PMID: 28821018 PMCID: PMC5831618 DOI: 10.1044/2017_jslhr-s-16-0121] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2016] [Revised: 09/13/2016] [Accepted: 04/21/2017] [Indexed: 05/28/2023]
Abstract
PURPOSE The present study aimed to compare acoustic models of speech intelligibility in individuals with the same disease (Parkinson's disease [PD]) and presumably similar underlying neuropathologies but with different native languages (American English [AE] and Korean). METHOD A total of 48 speakers from the 4 speaker groups (AE speakers with PD, Korean speakers with PD, healthy English speakers, and healthy Korean speakers) were asked to read a paragraph in their native languages. Four acoustic variables were analyzed: acoustic vowel space, voice onset time contrast scores, normalized pairwise variability index, and articulation rate. Speech intelligibility scores were obtained from scaled estimates of sentences extracted from the paragraph. RESULTS The findings indicated that the multiple regression models of speech intelligibility were different in Korean and AE, even with the same set of predictor variables and with speakers matched on speech intelligibility across languages. Analysis of the descriptive data for the acoustic variables showed the expected compression of the vowel space in speakers with PD in both languages, lower normalized pairwise variability index scores in Korean compared with AE, and no differences within or across language in articulation rate. CONCLUSIONS The results indicate that the basis of an intelligibility deficit in dysarthria is likely to depend on the native language of the speaker and listener. Additional research is required to explore other potential predictor variables, as well as additional language comparisons to pursue cross-linguistic considerations in classification and diagnosis of dysarthria types.
Collapse
Affiliation(s)
- Yunjung Kim
- Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge
| | - Yaelin Choi
- Department of Speech-Language Pathology, Myongji University, Seoul, Republic of Korea
| |
Collapse
|
42
|
Allison KM, Annear L, Policicchio M, Hustad KC. Range and Precision of Formant Movement in Pediatric Dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:1864-1876. [PMID: 28655064 PMCID: PMC5831086 DOI: 10.1044/2017_jslhr-s-15-0438] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Revised: 06/17/2016] [Accepted: 02/08/2017] [Indexed: 05/21/2023]
Abstract
PURPOSE This study aimed to improve understanding of speech characteristics associated with dysarthria in children with cerebral palsy by analyzing segmental and global formant measures in single-word and sentence contexts. METHOD Ten 5-year-old children with cerebral palsy and dysarthria and 10 age-matched, typically developing children participated in this study. Vowel space area and second formant interquartile range were measured from children's elicited productions of single words and sentences. RESULTS Results showed that the children with dysarthria had significantly smaller vowel space areas than typically developing children in both word and sentence contexts; however, overall ranges of second formant movement did not differ between groups in word or sentence contexts. Additional analysis of single words revealed that, compared to typical children, children with dysarthria had smaller second formant interquartile ranges in single words with phonetic contexts requiring large changes in vocal tract configuration, but not in single words with monophthongs. CONCLUSIONS Results of this study suggest that children with dysarthria may not have globally reduced ranges of articulatory movement compared to typically developing peers; however, they do exhibit reduced precision in producing phonetic targets.
Collapse
Affiliation(s)
- Kristen M. Allison
- Department of Communication Sciences and Disorders and the Waisman Center, University of Wisconsin–Madison
| | - Lucas Annear
- Department of Communication Sciences and Disorders and the Waisman Center, University of Wisconsin–Madison
| | - Marisa Policicchio
- Department of Communication Sciences and Disorders and the Waisman Center, University of Wisconsin–Madison
| | - Katherine C. Hustad
- Department of Communication Sciences and Disorders and the Waisman Center, University of Wisconsin–Madison
| |
Collapse
|
43
|
Berry J, Kolb A, Schroeder J, Johnson MT. Jaw Rotation in Dysarthria Measured With a Single Electromagnetic Articulography Sensor. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2017; 26:596-610. [PMID: 28654942 DOI: 10.1044/2017_ajslp-16-0104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Accepted: 12/22/2016] [Indexed: 06/07/2023]
Abstract
PURPOSE This study evaluated a novel method for characterizing jaw rotation using orientation data from a single electromagnetic articulography sensor. This method was optimized for clinical application, and a preliminary examination of clinical feasibility and value was undertaken. METHOD The computational adequacy of the single-sensor orientation method was evaluated through comparisons of jaw-rotation histories calculated from dual-sensor positional data for 16 typical talkers. The clinical feasibility and potential value of single-sensor jaw rotation were assessed through comparisons of 7 talkers with dysarthria and 19 typical talkers in connected speech. RESULTS The single-sensor orientation method allowed faster and safer participant preparation, required lower data-acquisition costs, and generated less high-frequency artifact than the dual-sensor positional approach. All talkers with dysarthria, regardless of severity, demonstrated jaw-rotation histories with more numerous changes in movement direction and reduced smoothness compared with typical talkers. CONCLUSIONS Results suggest that the single-sensor orientation method for calculating jaw rotation during speech is clinically feasible. Given the preliminary nature of this study and the small participant pool, the clinical value of such measures remains an open question. Further work must address the potential confound of reduced speaking rate on movement smoothness.
Collapse
Affiliation(s)
- Jeff Berry
- Department of Speech Pathology & Audiology, Marquette University, Milwaukee, WI
| | - Andrew Kolb
- Department of Electrical & Computer Engineering, Marquette University, Milwaukee, WI
| | - James Schroeder
- Department of Electrical & Computer Engineering, Marquette University, Milwaukee, WI
| | - Michael T Johnson
- Department of Electrical & Computer Engineering, Marquette University, Milwaukee, WI
| |
Collapse
|
44
|
Borrie SA, Lansford KL, Barrett TS. Rhythm Perception and Its Role in Perception and Learning of Dysrhythmic Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:561-570. [PMID: 28241307 DOI: 10.1044/2016_jslhr-s-16-0094] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Accepted: 09/11/2016] [Indexed: 05/13/2023]
Abstract
PURPOSE The perception of rhythm cues plays an important role in recognizing spoken language, especially in adverse listening conditions. Indeed, this has been shown to hold true even when the rhythm cues themselves are dysrhythmic. This study investigates whether expertise in rhythm perception provides a processing advantage for perception (initial intelligibility) and learning (intelligibility improvement) of naturally dysrhythmic speech, dysarthria. METHOD Fifty young adults with typical hearing participated in 3 key tests, including a rhythm perception test, a receptive vocabulary test, and a speech perception and learning test, with standard pretest, familiarization, and posttest phases. Initial intelligibility scores were calculated as the proportion of correct pretest words, while intelligibility improvement scores were calculated by subtracting this proportion from the proportion of correct posttest words. RESULTS Rhythm perception scores predicted intelligibility improvement scores but not initial intelligibility. On the other hand, receptive vocabulary scores predicted initial intelligibility scores but not intelligibility improvement. CONCLUSIONS Expertise in rhythm perception appears to provide an advantage for processing dysrhythmic speech, but a familiarization experience is required for the advantage to be realized. Findings are discussed in relation to the role of rhythm in speech processing and shed light on processing models that consider the consequence of rhythm abnormalities in dysarthria.
Collapse
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
| | - Kaitlin L Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
| | | |
Collapse
|
45
|
McAuliffe MJ, Fletcher AR, Kerr SE, O'Beirne GA, Anderson T. Effect of Dysarthria Type, Speaking Condition, and Listener Age on Speech Intelligibility. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2017; 26:113-123. [PMID: 28124068 DOI: 10.1044/2016_ajslp-15-0182] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Accepted: 07/11/2016] [Indexed: 06/06/2023]
Abstract
PURPOSE The aim of this study was to examine the effect of loud and slow speech cues on younger and older listeners' comprehension of dysarthric speech, specifically, (a) whether one strategy, as opposed to the other, promoted greater intelligibility gains for different speaker groups; (b) whether older and younger listeners' understandings were differentially affected by these strategies; and (c) which acoustic changes best predicted intelligibility gain in individual speakers. METHOD Twenty younger and 40 older listeners completed a perceptual task. Six individuals with dysarthria produced phrases across habitual, loud, and slow conditions. The primary dependent variable was proportion of words correct; follow-up acoustic analyses linked perceptual outcomes to changes in acoustic speech features. RESULTS Regardless of dysarthria type, the loud condition produced significant intelligibility gains. Overall, older listeners' comprehension was reduced relative to younger listeners. Follow-up analysis revealed considerable interspeaker differences in intelligibility outcomes across conditions. Although the most successful speaking mode varied, intelligibility gains were strongly associated with the degree of change participants made to their vowel formants. CONCLUSIONS Perceptual outcomes vary across speaking modes, even when speakers with dysarthria are grouped according to similar perceptual profiles. Further investigation of interspeaker differences is needed to inform individually tailored intervention approaches.
Collapse
Affiliation(s)
- Megan J McAuliffe
- Department of Communication Disorders, University of Canterbury, Christchurch, New ZealandNew Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
| | - Annalise R Fletcher
- Department of Communication Disorders, University of Canterbury, Christchurch, New ZealandNew Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
| | - Sarah E Kerr
- Department of Communication Disorders, University of Canterbury, Christchurch, New ZealandNew Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
| | - Greg A O'Beirne
- Department of Communication Disorders, University of Canterbury, Christchurch, New ZealandNew Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
| | - Tim Anderson
- New Zealand Brain Research Institute, Christchurch, New Zealand
| |
Collapse
|
46
|
Fletcher AR, McAuliffe MJ, Lansford KL, Liss JM. Assessing Vowel Centralization in Dysarthria: A Comparison of Methods. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:341-354. [PMID: 28124069 PMCID: PMC6194930 DOI: 10.1044/2016_jslhr-s-15-0355] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 07/15/2016] [Indexed: 05/15/2023]
Abstract
PURPOSE The strength of the relationship between vowel centralization measures and perceptual ratings of dysarthria severity has varied considerably across reports. This article evaluates methods of acoustic-perceptual analysis to determine whether procedural changes can strengthen the association between these measures. METHOD Sixty-one speakers (17 healthy individuals and 44 speakers with dysarthria) read a standard passage. To obtain acoustic data, 2 points of formant extraction (midpoint and articulatory point) and 2 frequency measures (Hz and Bark) were trialed. Both vowel space area and an adapted formant centralization ratio were calculated using first and second formants of speakers' corner vowels. Twenty-eight listeners rated speech samples using different prompts: one with a focus on intelligibility, the other on speech precision. RESULTS Perceptually, listener ratings of speech precision provided the best index of acoustic change. Acoustically, the combined use of an articulatory-based formant extraction point, Bark frequency units, and the formant centralization ratio was most effective in explaining perceptual ratings. This combination of procedures resulted in an increase of 17% to 27% explained variance between measures. CONCLUSIONS The procedures researchers use to assess articulatory impairment can significantly alter the strength of relationship between acoustic and perceptual measures. Procedures that maximize this relationship are recommended.
Collapse
Affiliation(s)
- Annalise R. Fletcher
- Department of Communication Disorders and New Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch
| | - Megan J. McAuliffe
- Department of Communication Disorders and New Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch
| | - Kaitlin L. Lansford
- School of Communication Science & Disorders, Florida State University, Tallahassee
| | - Julie M. Liss
- Department of Speech and Hearing Science, Arizona State University, Tempe
| |
Collapse
|
47
|
Connaghan KP, Patel R. The Impact of Contrastive Stress on Vowel Acoustics and Intelligibility in Dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:38-50. [PMID: 28114612 PMCID: PMC5533559 DOI: 10.1044/2016_jslhr-s-15-0291] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Revised: 04/29/2016] [Accepted: 07/09/2016] [Indexed: 05/19/2023]
Abstract
PURPOSE To compare vowel acoustics and intelligibility in words produced with and without contrastive stress by speakers with spastic (mixed-spastic) dysarthria secondary to cerebral palsy (DYSCP) and healthy controls (HCs). METHOD Fifteen participants (9 men, 6 women; age M = 42 years) with DYSCP and 15 HCs (9 men, 6 women; age M = 36 years) produced sentences containing target words with and without contrastive stress. Forty-five healthy listeners (age M = 25 years) completed a vowel identification task of DYSCP productions. Vowel acoustics were compared across stress conditions and groups using 1st (F1) and 2nd (F2) formant measures. Perceptual intelligibility was compared across stress conditions and dysarthria severity. RESULTS F1 and F2 significantly increased in stressed words for both groups, although the degree of change differed. Mean Euclidian distance between vowels also increased with stress. The relative probability of vowels falling within the target F1 × F2 space was greater for HCs but did not differ with stress. Stress production resulted in greater listener vowel identification accuracy for speakers with mild dysarthria. CONCLUSIONS Contrastive stress affected vowel formants for both groups. Perceptual results suggest that some speakers with dysarthria may benefit from a contrastive stress strategy to improve vowel intelligibility.
Collapse
|
48
|
Daudet L, Yadav N, Perez M, Poellabauer C, Schneider S, Huebner A. Portable mTBI Assessment Using Temporal and Frequency Analysis of Speech. IEEE J Biomed Health Inform 2016; 21:496-506. [PMID: 27913365 DOI: 10.1109/jbhi.2016.2633509] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
This paper shows that extraction and analysis of various acoustic features from speech using mobile devices can allow the detection of patterns that could be indicative of neurological trauma. This may pave the way for new types of biomarkers and diagnostic tools. Toward this end, we created a mobile application designed to diagnose mild traumatic brain injuries (mTBI) such as concussions. Using this application, data were collected from youth athletes from 47 high schools and colleges in the Midwestern United States. In this paper, we focus on the design of a methodology to collect speech data, the extraction of various temporal and frequency metrics from that data, and the statistical analysis of these metrics to find patterns that are indicative of a concussion. Our results suggest a strong correlation between certain temporal and frequency features and the likelihood of a concussion.
Collapse
|
49
|
Kuo C, Weismer G. Vowel reduction across tasks for male speakers of American English. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:369. [PMID: 27475161 PMCID: PMC6910000 DOI: 10.1121/1.4955310] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Revised: 06/16/2016] [Accepted: 06/23/2016] [Indexed: 06/06/2023]
Abstract
This study examined acoustic variation of vowels within speakers across speech tasks. The overarching goal of the study was to understand within-speaker variation as one index of the range of normal speech motor behavior for American English vowels. Ten male speakers of American English performed four speech tasks including citation form sentence reading with a clear-speech style (clear-speech), citation form sentence reading (citation), passage reading (reading), and conversational speech (conversation). Eight monophthong vowels in a variety of consonant contexts were studied. Clear-speech was operationally defined as the reference point for describing variation. Acoustic measures associated with the conventions of vowel targets were obtained and examined. These included temporal midpoint formant frequencies for the first three formants (F1, F2, and F3) and the derived Euclidean distances in the F1-F2 and F2-F3 planes. Results indicated that reduction toward the center of the F1-F2 and F2-F3 planes increased in magnitude across the tasks in the order of clear-speech, citation, reading, and conversation. The cross-task variation was comparable for all speakers despite fine-grained individual differences. The characteristics of systematic within-speaker acoustic variation across tasks have potential implications for the understanding of the mechanisms of speech motor control and motor speech disorders.
Collapse
Affiliation(s)
- Christina Kuo
- Department of Communication Sciences and Disorders, James Madison University, Harrisonburg, Virginia 22807, USA
| | - Gary Weismer
- Department of Communication Sciences and Disorders, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
50
|
Lansford KL, Liss JM, Norton RE. Free-classification of perceptually similar speakers with dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:2051-64. [PMID: 25057892 PMCID: PMC9589826 DOI: 10.1044/2014_jslhr-s-13-0177] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2013] [Accepted: 06/27/2014] [Indexed: 05/15/2023]
Abstract
PURPOSE In this investigation, the construct of perceptual similarity was explored in the dysarthrias. Specifically, we employed an auditory free-classification task to determine whether listeners could cluster speakers by perceptual similarity, whether the clusters mapped to acoustic metrics, and whether the clusters were constrained by dysarthria subtype diagnosis. METHOD Twenty-three listeners blinded to speakers' medical and dysarthria subtype diagnoses participated. The task was to group together (drag and drop) the icons corresponding to 33 speakers with dysarthria on the basis of how similar they sounded. Cluster analysis and multidimensional scaling (MDS) modeled the perceptual dimensions underlying similarity. Acoustic metrics and perceptual judgments were used in correlation analyses to facilitate interpretation of the derived dimensions. RESULTS Six clusters of similar-sounding speakers and 3 perceptual dimensions underlying similarity were revealed. The clusters of similar-sounding speakers were not constrained by dysarthria subtype diagnosis. The 3 perceptual dimensions revealed by MDS were correlated with metrics for articulation rate, intelligibility, and vocal quality, respectively. CONCLUSIONS This study shows (a) feasibility of a free-classification approach for studying perceptual similarity in dysarthria, (b) correspondence between acoustic and perceptual metrics to clusters of similar-sounding speakers, and (c) similarity judgments transcended dysarthria subtype diagnosis.
Collapse
|