1
|
Dahl KL, Cádiz MD, Zuk J, Guenther FH, Stepp CE. Controlling Pitch for Prosody: Sensorimotor Adaptation in Linguistically Meaningful Contexts. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:440-454. [PMID: 38241671 PMCID: PMC11000799 DOI: 10.1044/2023_jslhr-23-00460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 10/09/2023] [Accepted: 11/02/2023] [Indexed: 01/21/2024]
Abstract
PURPOSE This study examined how speakers adapt to fundamental frequency (fo) errors that affect the use of prosody to convey linguistic meaning, whether fo adaptation in that context relates to adaptation in linguistically neutral sustained vowels, and whether cue trading is reflected in responses in the prosodic cues of fo and amplitude. METHOD Twenty-four speakers said vowels and sentences while fo was digitally altered to induce predictable errors. Shifts in fo (±200 cents) were applied to the entire sustained vowel and one word (emphasized or unemphasized) in sentences. Two prosodic cues-fo and amplitude-were extracted. The effects of fo shifts, shift direction, and emphasis on fo response magnitude were evaluated with repeated-measures analyses of variance. Relationships between adaptive fo responses in sentences and vowels and between adaptive fo and amplitude responses were evaluated with Spearman correlations. RESULTS Speakers adapted to fo errors in both linguistically meaningful sentences and linguistically neutral vowels. Adaptive fo responses of unemphasized words were smaller than those of emphasized words when fo was shifted upward. There was no relationship between adaptive fo responses in vowels and emphasized words, but adaptive fo and amplitude responses were strongly, positively correlated. CONCLUSIONS Sensorimotor adaptation occurs in response to fo errors regardless of how disruptive the error is to linguistic meaning. Adaptation to fo errors during sustained vowels may not involve the exact same mechanisms as sensorimotor adaptation as it occurs in meaningful speech. The relationship between adaptive responses in fo and amplitude supports an integrated model of prosody. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.25008908.
Collapse
Affiliation(s)
- Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Manuel Díaz Cádiz
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Jennifer Zuk
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Frank H. Guenther
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
2
|
Miller HE, Kearney E, Nieto-Castañón A, Falsini R, Abur D, Acosta A, Chao SC, Dahl KL, Franken M, Heller Murray ES, Mollaei F, Niziolek CA, Parrell B, Perrachione T, Smith DJ, Stepp CE, Tomassi N, Guenther FH. Do Not Cut Off Your Tail: A Mega-Analysis of Responses to Auditory Perturbation Experiments. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:4315-4331. [PMID: 37850867 PMCID: PMC10715843 DOI: 10.1044/2023_jslhr-23-00315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 08/04/2023] [Accepted: 08/06/2023] [Indexed: 10/19/2023]
Abstract
PURPOSE The practice of removing "following" responses from speech perturbation analyses is increasingly common, despite no clear evidence as to whether these responses represent a unique response type. This study aimed to determine if the distribution of responses to auditory perturbation paradigms represents a bimodal distribution, consisting of two distinct response types, or a unimodal distribution. METHOD This mega-analysis pooled data from 22 previous studies to examine the distribution and magnitude of responses to auditory perturbations across four tasks: adaptive pitch, adaptive formant, reflexive pitch, and reflexive formant. Data included at least 150 unique participants for each task, with studies comprising younger adult, older adult, and Parkinson's disease populations. A Silverman's unimodality test followed by a smoothed bootstrap resampling technique was performed for each task to evaluate the number of modes in each distribution. Wilcoxon signed-ranks tests were also performed for each distribution to confirm significant compensation in response to the perturbation. RESULTS Modality analyses were not significant (p > .05) for any group or task, indicating unimodal distributions. Our analyses also confirmed compensatory reflexive responses to pitch and formant perturbations across all groups, as well as adaptive responses to sustained formant perturbations. However, analyses of sustained pitch perturbations only revealed evidence of adaptation in studies with younger adults. CONCLUSION The demonstration of a clear unimodal distribution across all tasks suggests that following responses do not represent a distinct response pattern, but rather the tail of a unimodal distribution. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.24282676.
Collapse
Affiliation(s)
- Hilary E. Miller
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Elaine Kearney
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | | | - Riccardo Falsini
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Defne Abur
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Alexander Acosta
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Sara-Ching Chao
- College of Health Solutions, Arizona State University, Tempe
| | - Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Matthias Franken
- Department of Psychology, McGill University, Montréal, Québec, Canada
| | | | - Fatemeh Mollaei
- School of Psychology and Clinical Language Sciences, University of Reading, England
| | - Caroline A. Niziolek
- Department of Communication Sciences and Disorders, University of Wisconsin–Madison
| | - Benjamin Parrell
- Department of Communication Sciences and Disorders, University of Wisconsin–Madison
| | - Tyler Perrachione
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Dante J. Smith
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Nicole Tomassi
- Graduate Program for Neuroscience, Boston University, MA
| | - Frank H. Guenther
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| |
Collapse
|
3
|
Abur D, Hillman RE, Stepp CE. Auditory-Motor Function Pre- and Post-Therapy in Hyperfunctional Voice Disorders: A Case Series. J Voice 2023:S0892-1997(23)00264-3. [PMID: 37716889 DOI: 10.1016/j.jvoice.2023.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/16/2023] [Accepted: 08/16/2023] [Indexed: 09/18/2023]
Abstract
OBJECTIVE/HYPOTHESIS Behavioral voice therapy is the most common treatment for hyperfunctional voice disorders (HVDs) but has limited long-term effectiveness since the comprehensive mechanisms underlying HVDs remain unclear. Recent work has implicated disordered sensorimotor integration during speech in some speakers with HVDs and suggests that auditory processing is a key factor to consider in HVD assessment and therapy. The purpose of this case-series study was to assess whether current voice therapy approaches for HVDs resulted in improvements to auditory-motor function. STUDY DESIGN Longitudinal (pre-post) study. METHOD Pre and postvoice therapy for HVDs, 11 speakers underwent an assessment of auditory-motor function via auditory discrimination of vocal pitch, responses to unanticipated auditory perturbations, and responses to predictable auditory perturbations of vocal pitch. RESULTS At the post-therapy session, 10 out of 11 participants demonstrated voice therapy success (via self-reported voice problems and/or auditory-perceptual judgements of voice by a clinician) and eight of the 11 participants demonstrated improvements in at least one measure of auditory discrimination and/or auditory-motor control. Specifically, three speakers demonstrated improvements in auditory discrimination, five speakers demonstrated improved (within typical cutoffs) responses to predictable perturbations, and two speakers demonstrated improvements in both auditory discrimination and auditory-motor measures. CONCLUSIONS Together, these findings support that voice therapy in individuals with HVDs may impact auditory-motor control and highlight the potential benefit of systematically addressing auditory function in voice therapy and assessment for HVDs.
Collapse
Affiliation(s)
- Defne Abur
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA; Department of Computational Linguistics, Center for Language and Cognition Groningen, University of Groningen, Groningen, the Netherlands; Research School of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, the Netherlands.
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA; Department of Surgery, Harvard Medical School, Boston, MA; MGH Institute of Health Professions, Boston, MA
| | - Cara E Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA; Department of Biomedical Engineering, Boston University, Boston, MA; Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, MA
| |
Collapse
|
4
|
Liu D, Chang Y, Dai G, Guo Z, Jones JA, Li T, Chen X, Chen M, Li J, Wu X, Liu P, Liu H. Right, but not left, posterior superior temporal gyrus is causally involved in vocal feedback control. Neuroimage 2023; 278:120282. [PMID: 37468021 DOI: 10.1016/j.neuroimage.2023.120282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 06/25/2023] [Accepted: 07/16/2023] [Indexed: 07/21/2023] Open
Abstract
The posterior superior temporal gyrus (pSTG) has been implicated in the integration of auditory feedback and motor system for controlling vocal production. However, the question as to whether and how the pSTG is causally involved in vocal feedback control is currently unclear. To this end, the present study selectively stimulated the left or right pSTG with continuous theta burst stimulation (c-TBS) in healthy participants, then used event-related potentials to investigate neurobehavioral changes in response to altered auditory feedback during vocal pitch regulation. The results showed that, compared to control (vertex) stimulation, c-TBS over the right pSTG led to smaller vocal compensations for pitch perturbations accompanied by smaller cortical N1 and larger P2 responses. Enhanced P2 responses received contributions from the right-lateralized temporal and parietal regions as well as the insula, and were significantly correlated with suppressed vocal compensations. Surprisingly, these effects were not found when comparing c-TBS over the left pSTG with control stimulation. Our findings provide evidence, for the first time, that supports a causal relationship between right, but not left, pSTG and auditory-motor integration for vocal pitch regulation. This lends support to a right-lateralized contribution of the pSTG in not only the bottom-up detection of vocal feedback errors but also the involvement of driving motor commands for error correction in a top-down manner.
Collapse
Affiliation(s)
- Dongxu Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Yichen Chang
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Guangyan Dai
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Zhiqiang Guo
- School of Computer, Zhuhai College of Science and Technology, Zhuhai, China
| | - Jeffery A Jones
- Department of Psychology and Laurier Centre for Cognitive Neuroscience, Wilfrid Laurier University, Waterloo, Ontario N2L 3C5, Canada
| | - Tingni Li
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China; Centre for Eye and Vision Research, 17W Science Park, Hong Kong SAR, China
| | - Xi Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Mingyun Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Jingting Li
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Xiuqin Wu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Peng Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China; Guangdong Provincial Key Laboratory of Brain Function and Disease, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
5
|
Kearney E, Nieto-Castañón A, Falsini R, Daliri A, Heller Murray ES, Smith DJ, Guenther FH. Quantitatively characterizing reflexive responses to pitch perturbations. Front Hum Neurosci 2022; 16:929687. [PMID: 36405080 PMCID: PMC9666385 DOI: 10.3389/fnhum.2022.929687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 10/04/2022] [Indexed: 11/06/2022] Open
Abstract
Background Reflexive pitch perturbation experiments are commonly used to investigate the neural mechanisms underlying vocal motor control. In these experiments, the fundamental frequency–the acoustic correlate of pitch–of a speech signal is shifted unexpectedly and played back to the speaker via headphones in near real-time. In response to the shift, speakers increase or decrease their fundamental frequency in the direction opposing the shift so that their perceived pitch is closer to what they intended. The goal of the current work is to develop a quantitative model of responses to reflexive perturbations that can be interpreted in terms of the physiological mechanisms underlying the response and that captures both group-mean data and individual subject responses. Methods A model framework was established that allowed the specification of several models based on Proportional-Integral-Derivative and State-Space/Directions Into Velocities of Articulators (DIVA) model classes. The performance of 19 models was compared in fitting experimental data from two published studies. The models were evaluated in terms of their ability to capture both population-level responses and individual differences in sensorimotor control processes. Results A three-parameter DIVA model performed best when fitting group-mean data from both studies; this model is equivalent to a single-rate state-space model and a first-order low pass filter model. The same model also provided stable estimates of parameters across samples from individual subject data and performed among the best models to differentiate between subjects. The three parameters correspond to gains in the auditory feedback controller’s response to a perceived error, the delay of this response, and the gain of the somatosensory feedback controller’s “resistance” to this correction. Excellent fits were also obtained from a four-parameter model with an additional auditory velocity error term; this model was better able to capture multi-component reflexive responses seen in some individual subjects. Conclusion Our results demonstrate the stereotyped nature of an individual’s responses to pitch perturbations. Further, we identified a model that captures population responses to pitch perturbations and characterizes individual differences in a stable manner with parameters that relate to underlying motor control capabilities. Future work will evaluate the model in characterizing responses from individuals with communication disorders.
Collapse
Affiliation(s)
- Elaine Kearney
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- *Correspondence: Elaine Kearney,
| | - Alfonso Nieto-Castañón
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- The McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, United States
| | - Riccardo Falsini
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
| | - Ayoub Daliri
- College of Health Solutions, Arizona State University, Tempe, AZ, United States
| | | | - Dante J. Smith
- Gradutate Program for Neuroscience, Boston University, Boston, MA, United States
| | - Frank H. Guenther
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- Department of Biomedical Engineering, Boston University, Boston, MA, United States
- The Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA, United States
| |
Collapse
|
6
|
Auditory and somatosensory feedback mechanisms of laryngeal and articulatory speech motor control. Exp Brain Res 2022; 240:2155-2173. [PMID: 35736994 DOI: 10.1007/s00221-022-06395-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 06/02/2022] [Indexed: 11/04/2022]
Abstract
PURPOSE Speech production is a complex motor task involving multiple subsystems. The relationships between these subsystems need to be comprehensively investigated to understand the underlying mechanisms of speech production. The goal of this paper is to examine the differential contributions of 1) auditory and somatosensory feedback control mechanisms, and 2) laryngeal and articulatory speech production subsystems on speech motor control at an individual speaker level using altered auditory and somatosensory feedback paradigms. METHODS Twenty young adults completed speaking tasks in which sudden and unpredictable auditory and physical perturbations were applied to the laryngeal and articulatory speech production subsystems. Auditory perturbations were applied to laryngeal or articulatory acoustic features of speech. Physical perturbations were applied to the larynx and the jaw. Pearson-product moment correlation coefficients were calculated between 1) auditory and somatosensory reflexive responses to investigate relationships between auditory and somatosensory feedback control mechanisms, and 2) laryngeal and articulatory reflexive responses as well as acuity measures to investigate the relationship between auditory-motor features of laryngeal and articulatory subsystems. RESULTS No statistically significant correlations were found concerning the relationships between auditory and somatosensory feedback. No statistically significant correlations were found between auditory-motor features in the laryngeal and articulatory control subsystems. CONCLUSION Results suggest that the laryngeal and articulatory speech production subsystems operate with differential auditory and somatosensory feedback control mechanisms. The outcomes suggest that current models of speech motor control should consider decoupling laryngeal and articulatory domains to better model speech motor control processes.
Collapse
|
7
|
Yüksel M. Reliability and Efficiency of Pitch-Shifting Plug-Ins in Voice and Hearing Research. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:878-889. [PMID: 35077652 DOI: 10.1044/2021_jslhr-21-00440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
PURPOSE Auditory feedback perturbation with voice pitch manipulation has been widely used in previous studies. There are several hardware and software tools for such manipulations, but audio plug-ins developed for music, movies, and radio applications that operate in digital audio workstations may be extremely beneficial and are easy to use, accessible, and cost effective. However, it is unknown whether these plug-ins can perform similarly to tools that have been described in previous literature. Hence, this study aimed to evaluate the reliability and efficiency of these plug-ins. METHOD Six different plug-ins were used at +1 and -1 st pitch shifting with formant correction on and off to pitch shift the sustained /ɑ/ voice recording sample of 12 healthy participants (six cisgender males and six cisgender females). Pitch-shifting accuracy, formant shifting amount, intensity changes, and total latency values were reported. RESULTS Some variability was observed between different plug-ins and pitch shift settings. One plug-in managed to perform similarly in all four measured aspects with well-known hardware and software units with 1-cent pitch-shifting accuracy, low latency values, negligible intensity difference, and preserved formants. Other plug-ins performed similarly in some respects. CONCLUSIONS Audio plug-ins may be used effectively in pitch-shifting applications. Researchers and clinicians can access these plug-ins easily and test whether the features also fit their aims.
Collapse
Affiliation(s)
- Mustafa Yüksel
- Department of Speech and Language Therapy, School of Health Sciences, Ankara Medipol University, Turkey
- Department of Otorhinolaryngology, University Medical Center Groningen, the Netherlands
| |
Collapse
|
8
|
Tomassi NE, Weerathunge HR, Cushman MR, Bohland JW, Stepp CE. Assessing Ecologically Valid Methods of Auditory Feedback Measurement in Individuals With Typical Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:121-135. [PMID: 34941381 PMCID: PMC9153919 DOI: 10.1044/2021_jslhr-21-00377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 09/15/2021] [Accepted: 09/16/2021] [Indexed: 06/14/2023]
Abstract
PURPOSE Auditory feedback is thought to contribute to the online control of speech production. Yet, the standard method of estimating auditory feedback control (i.e., reflexive responses to auditory-motor perturbations), although sound, requires specialized instrumentation, meticulous calibration, unnatural tasks, and specific acoustic environments. The purpose of this study was to explore more ecologically valid features of speech production to determine their relationships with auditory feedback mechanisms. METHOD Two previously proposed measures of within-utterance variability (centering and baseline variability) were compared with reflexive response magnitudes in 30 adults with typical speech. These three measures were estimated for both the laryngeal and articulatory subsystems of speech. RESULTS Regardless of the speech subsystem, neither centering nor baseline variability was shown to be related to reflexive response magnitudes. Likewise, no relationships were found between centering and baseline variability. CONCLUSIONS Despite previous suggestions that centering and baseline variability may be related to auditory feedback mechanisms, this study did not support these assertions. However, the detection of such relationships may have required a larger degree of variability in responses, relative to that found in those with typical speech. Future research on these relationships is warranted in populations with more heterogeneous responses, such as children or clinical populations. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.17330546.
Collapse
Affiliation(s)
- Nicole E. Tomassi
- Graduate Program for Neuroscience, Boston University, MA
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | - Hasini R. Weerathunge
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Megan R. Cushman
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | - Jason W. Bohland
- Department of Communication Science and Disorders, University of Pittsburgh, PA
| | - Cara E. Stepp
- Graduate Program for Neuroscience, Boston University, MA
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology—Head & Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
9
|
Abur D, Subaciute A, Daliri A, Lester-Smith RA, Lupiani AA, Cilento D, Enos NM, Weerathunge HR, Tardif MC, Stepp CE. Feedback and Feedforward Auditory-Motor Processes for Voice and Articulation in Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:4682-4694. [PMID: 34731577 PMCID: PMC9150666 DOI: 10.1044/2021_jslhr-21-00153] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 06/03/2021] [Accepted: 07/27/2021] [Indexed: 06/13/2023]
Abstract
PURPOSE Unexpected and sustained manipulations of auditory feedback during speech production result in "reflexive" and "adaptive" responses, which can shed light on feedback and feedforward auditory-motor control processes, respectively. Persons with Parkinson's disease (PwPD) have shown aberrant reflexive and adaptive responses, but responses appear to differ for control of vocal and articulatory features. However, these responses have not been examined for both voice and articulation in the same speakers and with respect to auditory acuity and functional speech outcomes (speech intelligibility and naturalness). METHOD Here, 28 PwPD on their typical dopaminergic medication schedule and 28 age-, sex-, and hearing-matched controls completed tasks yielding reflexive and adaptive responses as well as auditory acuity for both vocal and articulatory features. RESULTS No group differences were found for any measures of auditory-motor control, conflicting with prior findings in PwPD while off medication. Auditory-motor measures were also compared with listener ratings of speech function: first formant frequency acuity was related to speech intelligibility, whereas adaptive responses to vocal fundamental frequency manipulations were related to speech naturalness. CONCLUSIONS These results support that auditory-motor processes for both voice and articulatory features are intact for PwPD receiving medication. This work is also the first to suggest associations between measures of auditory-motor control and speech intelligibility and naturalness.
Collapse
Affiliation(s)
- Defne Abur
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | | | - Ayoub Daliri
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- College of Health Solutions, Arizona State University, Tempe
| | - Rosemary A. Lester-Smith
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Moody College of Communication, The University of Texas at Austin
| | - Ashling A. Lupiani
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Joint Department of Biomedical Engineering, University of North Carolina Chapel Hill and North Carolina State University, Raleigh
| | - Dante Cilento
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | - Nicole M. Enos
- Department of Biomedical Engineering, Boston University, MA
- Department of Electrical & Computer Engineering, Boston University, MA
| | | | - Monique C. Tardif
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Communication Science and Disorders, University of Pittsburgh, PA
| | - Cara E. Stepp
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology—Head & Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
10
|
Thomas A, Mirza N, Eliades SJ. Auditory Feedback Control of Vocal Pitch in Spasmodic Dysphonia. Laryngoscope 2020; 131:2070-2075. [PMID: 33169850 DOI: 10.1002/lary.29254] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 10/07/2020] [Accepted: 10/21/2020] [Indexed: 11/12/2022]
Abstract
OBJECTIVES/HYPOTHESIS Hearing plays an important role in the maintenance of vocal control in normal individuals. In patients with spasmodic dysphonia (SD), however, the ability to maintain sustained control of phonation is impaired. The origins of SD are unknown, and it is unclear whether auditory feedback-dependent vocal control is compromised in these patients. STUDY DESIGN Prospective case-control study. METHODS We tested 15 SD patients and 11 age-matched controls. Voice recordings were performed while subjects repeated the vowel /e/ and auditory feedback of their vocal sounds was altered in real-time to introduce a pitch-shift (±2 semitones), presented back to subjects using headphones. Recordings were analyzed to determine voice changes following the pitch-shifted feedback. Results were further compared with patient demographics and subjective measures of dysphonia, including the Voice Handicap Index (VHI). RESULTS Despite considerable pitch variability and vocal breaks, SD patients exhibited significantly higher average vocal pitch compensation than control subjects. SD patients also exhibited greater variability than controls. However, there were no significant correlations between vocal compensation and patient demographics, although there was a significant inverse correlation with VHI. CONCLUSIONS In this pilot study, patients with SD exhibited increased sensitivity to altered auditory feedback during sustained phonation. These results are consistent with recent theories of SD as a disorder of sensory-motor feedback processing, and suggest possible avenues for future investigation. LEVEL OF EVIDENCE 3 Laryngoscope, 131:2070-2075, 2021.
Collapse
Affiliation(s)
- Arthur Thomas
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, U.S.A
| | - Natasha Mirza
- Department of Otorhinolaryngology - Head and Neck Surgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, U.S.A
| | - Steven J Eliades
- Department of Otorhinolaryngology - Head and Neck Surgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, U.S.A
| |
Collapse
|
11
|
Weerathunge HR, Abur D, Enos NM, Brown KM, Stepp CE. Auditory-Motor Perturbations of Voice Fundamental Frequency: Feedback Delay and Amplification. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2846-2860. [PMID: 32755506 PMCID: PMC7890227 DOI: 10.1044/2020_jslhr-19-00407] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 03/30/2020] [Accepted: 06/10/2020] [Indexed: 06/11/2023]
Abstract
Purpose Gradual and sudden perturbations of vocal fundamental frequency (f o), also known as adaptive and reflexive f o perturbations, are techniques to study the influence of auditory feedback on voice f o control mechanisms. Previous vocal f o perturbations have incorporated varied setup-specific feedback delays and amplifications. Here, we investigated the effects of feedback delays (10-100 ms) and amplifications on both adaptive and reflexive f o perturbation paradigms, encapsulating the variability in equipment-specific delays (3-45 ms) and amplifications utilized in previous experiments. Method Responses to adaptive and reflexive f o perturbations were recorded in 24 typical speakers for four delay conditions (10, 40, 70, and 100 ms) or three amplification conditions (-10, +5, and +10 dB relative to microphone) in a counterbalanced order. Repeated-measures analyses of variance were carried out on the magnitude of f o responses to determine the effect of feedback condition. Results There was a statistically significant effect of the level of auditory feedback amplification on the response magnitude during adaptive f o perturbations, driven by the difference between +10- and -10-dB amplification conditions (hold phase difference: M = 38.3 cents, SD = 51.2 cents; after-effect phase: M = 66.1 cents, SD = 84.6 cents). No other statistically significant effects of condition were found for either paradigm. Conclusions Experimental equipment delays below 100 ms in behavioral paradigms do not affect the results of f o perturbation paradigms. As there is no statistically significant difference between the response magnitudes elicited by +5- and +10-dB auditory amplification conditions, this study is a confirmation that an auditory feedback amplification of +5 dB relative to microphone is sufficient to elicit robust compensatory responses for f o perturbation paradigms.
Collapse
Affiliation(s)
| | - Defne Abur
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Nicole M. Enos
- Department of Biomedical Engineering, Boston University, MA
- Department of Computer Engineering, Boston University, MA
| | - Katherine M. Brown
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
12
|
Kim KS, Wang H, Max L. It's About Time: Minimizing Hardware and Software Latencies in Speech Research With Real-Time Auditory Feedback. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2522-2534. [PMID: 32640180 PMCID: PMC7872729 DOI: 10.1044/2020_jslhr-19-00419] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Purpose Various aspects of speech production related to auditory-motor integration and learning have been examined through auditory feedback perturbation paradigms in which participants' acoustic speech output is experimentally altered and played back via earphones/headphones "in real time." Scientific rigor requires high precision in determining and reporting the involved hardware and software latencies. Many reports in the literature, however, are not consistent with the minimum achievable latency for a given experimental setup. Here, we focus specifically on this methodological issue associated with implementing real-time auditory feedback perturbations, and we offer concrete suggestions for increased reproducibility in this particular line of work. Method Hardware and software latencies as well as total feedback loop latency were measured for formant perturbation studies with the Audapter software. Measurements were conducted for various audio interfaces, desktop and laptop computers, and audio drivers. An approach for lowering Audapter's software latency through nondefault parameter specification was also tested. Results Oft-overlooked hardware-specific latencies were not negligible for some of the tested audio interfaces (adding up to 15 ms). Total feedback loop latencies (including both hardware and software latency) were also generally larger than claimed in the literature. Nondefault parameter values can improve Audapter's own processing latency without negative impact on formant tracking. Conclusions Audio interface selection and software parameter optimization substantially affect total feedback loop latency. Thus, the actual total latency (hardware plus software) needs to be correctly measured and described in all published reports. Future speech research with "real-time" auditory feedback perturbations should increase scientific rigor by minimizing this latency.
Collapse
Affiliation(s)
- Kwang S. Kim
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| | - Hantao Wang
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| | - Ludo Max
- Department of Speech and Hearing Sciences, University of Washington, Seattle
- Haskins Laboratories, New Haven, CT
| |
Collapse
|
13
|
Heller Murray ES, Stepp CE. Relationships between vocal pitch perception and production: a developmental perspective. Sci Rep 2020; 10:3912. [PMID: 32127585 PMCID: PMC7054315 DOI: 10.1038/s41598-020-60756-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 02/11/2020] [Indexed: 01/17/2023] Open
Abstract
The purpose of this study was to examine the relationships between vocal pitch discrimination abilities and vocal responses to auditory pitch-shifts. Twenty children (6.6-11.7 years) and twenty adults (18-28 years) completed a listening task to determine auditory discrimination abilities to vocal fundamental frequency (fo) as well as two vocalization tasks in which their perceived fo was modulated in real-time. These pitch-shifts were either unexpected, providing information on auditory feedback control, or sustained, providing information on sensorimotor adaptation. Children were subdivided into two groups based on their auditory pitch discrimination abilities; children within two standard deviations of the adult group were classified as having adult-like discrimination abilities (N = 11), whereas children outside of this range were classified as having less sensitive discrimination abilities than adults (N = 9). Children with less sensitive auditory pitch discrimination abilities had significantly larger vocal response magnitudes to unexpected pitch-shifts and significantly smaller vocal response magnitudes to sustained pitch-shifts. Children with less sensitive auditory pitch discrimination abilities may rely more on auditory feedback and thus may be less adept at updating their stored motor programs.
Collapse
Affiliation(s)
| | - Cara E Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, MA, USA
- Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, MA, USA
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| |
Collapse
|
14
|
Murray ESH, Hseu AF, Nuss RC, Woodnorth GH, Stepp CE. Vocal Pitch Discrimination in Children with and without Vocal Fold Nodules. APPLIED SCIENCES (BASEL, SWITZERLAND) 2019; 9:3042. [PMID: 31840003 PMCID: PMC6910133 DOI: 10.3390/app9153042] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Vocal pitch discrimination abilities were compared in sixteen children with vocal fold nodules (CwVN) and sixteen matched controls with typical voices (CwTV). Vocal pitch discrimination was also evaluated in thirty-five vocally healthy children and twenty adults to examine potential changes as a function of maturation. CwTV were categorized as either younger (N = 15, 5.6-7.7 years) or older (N = 20, 8.2-11.7 years). Participants completed two-alternative, forced choice listening tasks in which they judged whether pairs of sustained /α/ tokens were different in pitch. Each pair consisted of a base token with a fundamental frequency fo ) of 216.2 Hz and a test token with a fo that was adaptively modified, according to the participant's prior judgments. There were no significant differences in pitch discrimination abilities between CwVN and CwTV. Pitch discrimination abilities were significantly poorer in younger and older CwTV as compared to adults. Additionally, younger CwTV had significantly poorer discrimination abilities than older CwTV. Findings from this study suggest that CwVN do not have differences in pitch discrimination abilities, yet, therapies designed for CwVN should consider this developmental trend in perceptual abilities.
Collapse
Affiliation(s)
- Elizabeth S. Heller Murray
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology and Communication Enhancement, Boston Children’s Hospital, Boston, MA 02115, USA
| | - Anne F. Hseu
- Department of Otolaryngology and Communication Enhancement, Boston Children’s Hospital, Boston, MA 02115, USA
| | - Roger C. Nuss
- Department of Otolaryngology and Communication Enhancement, Boston Children’s Hospital, Boston, MA 02115, USA
| | - Geralyn Harvey Woodnorth
- Department of Otolaryngology and Communication Enhancement, Boston Children’s Hospital, Boston, MA 02115, USA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| |
Collapse
|