1
|
Anand S, Park Y, Shrivastav R, Eddins DA. Evaluating the Effect of Voice Quality Covariance on Auditory-Perceptual Evaluation Using a Novel Two-Dimensional Magnitude Estimation Task. J Speech Lang Hear Res 2023; 66:4849-4859. [PMID: 37902504 PMCID: PMC11001379 DOI: 10.1044/2023_jslhr-23-00226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 08/12/2023] [Accepted: 09/03/2023] [Indexed: 10/31/2023]
Abstract
PURPOSE Most people with dysphonia present with voices that vary along more than one voice quality (VQ) dimension. This study sought to examine the effect of covariance between breathy and rough VQ in natural voices. METHOD A two-dimensional matrix of 16 /a/ vowels was selected such that two VQ dimensions (breathiness and roughness) were sampled on a 4-point severity scale (none, mild, moderate, and severe). Ten listeners evaluated 480 stimuli (16 stimuli × 10 repetitions × 3 blocks) on one-dimensional magnitude estimation (1DME) tasks and a novel two-dimensional magnitude estimation (2DME) task that allowed for simultaneous measurement of breathiness and roughness. RESULTS Data indicated high intra- and interrater reliabilities for both breathiness and roughness in the 2DME and 1DME tasks. Correlation analyses revealed a strong correlation between 2DME and 1DME judgments for breathiness and roughness (r > .95). There was also a minimal correlation between breathy and rough VQ in the 2DME task (r < .10). CONCLUSIONS Covarying roughness or breathiness had less impact on the perception of the other VQ in natural dysphonic voices in 2DME compared to 1DME. An understanding and quantification of the perceptual interactions among the dimensions will aid in the refinement of computational models and in the establishment of the validity of clinical scales for VQ perception.
Collapse
Affiliation(s)
- Supraja Anand
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Yeonggwang Park
- Department of Communication Sciences and Disorders, University of Central Florida, Orlando
| | - Rahul Shrivastav
- Office of the Provost & Executive Vice President, Indiana University Bloomington
| | - David A. Eddins
- Department of Communication Sciences and Disorders, University of Central Florida, Orlando
| |
Collapse
|
2
|
Park Y, Baker Brehm S, Kelchner L, Weinrich B, McElfresh K, Anand S, Shrivastav R, de Alarcon A, Eddins DA. Effects of Vibratory Source on Auditory-Perceptual and Bio-Inspired Computational Measures of Pediatric Voice Quality. J Voice 2023:S0892-1997(23)00254-0. [PMID: 37739862 PMCID: PMC10950844 DOI: 10.1016/j.jvoice.2023.08.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 08/11/2023] [Accepted: 08/14/2023] [Indexed: 09/24/2023]
Abstract
OBJECTIVE The vibratory source for voicing in children with dysphonia is classified into three categories including a glottal vibratory source (GVS) observed in those with vocal lesions or hyperfunction; supraglottal vibratory sources (SGVS) observed secondary to laryngeal airway injuries, malformations, or reconstruction surgeries; and a combination of both glottal and supraglottal vibratory sources called mixed vibratory source (MVS). This study evaluated the effects of vibratory source on three primary dimensions of voice quality (breathiness, roughness, and strain) in children with GVS, SGVS, and MVS using single-variable matching tasks and computational measures obtained from bio-inspired auditory models. METHODS A total of 44 dysphonic voice samples from children aged 4-11 years were selected. Seven listeners rated breathiness, roughness, and strain of 1000-ms /ɑ/ samples using single-variable matching tasks. Computational estimates of pitch strength, amplitude modulation filterbank output, and sharpness were obtained through custom-designed MATLAB algorithms. RESULTS Perceived roughness and strain were significantly higher in children with SGVS and MVS compared to children with GVS. Among the computational measures, only the modulation filterbank output resulted in significant differences among vibratory sources; a posthoc test revealed that children with SGVS had greater amplitude modulation than children with GVS, as expected from their rougher voice quality. CONCLUSIONS The results indicate that the output of an auditory amplitude modulation filterbank model may capture characteristics of SGVS that are strongly related to the rough voice quality.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Communication Sciences and Disorders, University of Central Florida, Orlando, Florida.
| | - Susan Baker Brehm
- Department of Speech Pathology and Audiology, Miami University, Oxford, Ohio; Division of Speech-Language Pathology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Lisa Kelchner
- Division of Speech-Language Pathology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio; Department of Communication Sciences and Disorders, University of Cincinnati, Cincinnati, Ohio
| | - Barbara Weinrich
- Department of Speech Pathology and Audiology, Miami University, Oxford, Ohio; Division of Speech-Language Pathology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Kevin McElfresh
- Division of Speech-Language Pathology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
| | - Rahul Shrivastav
- Office of the Provost & Executive Vice President, Indiana University, Bloomington, Indiana
| | - Alessandro de Alarcon
- Pediatric Otolaryngology Head & Neck Surgery, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - David A Eddins
- Department of Communication Sciences and Disorders, University of Central Florida, Orlando, Florida
| |
Collapse
|
3
|
Park Y, Anand S, Gifford SM, Shrivastav R, Eddins DA. Development and Validation of a Single-Variable Comparison Stimulus for Matching Strained Voice Quality Using a Psychoacoustic Framework. J Speech Lang Hear Res 2023; 66:16-29. [PMID: 36516473 PMCID: PMC10023177 DOI: 10.1044/2022_jslhr-22-00280] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/17/2022] [Accepted: 09/01/2022] [Indexed: 06/17/2023]
Abstract
PURPOSE Acoustic and perceptual quantification of vocal strain has been a vexing problem for years. To increase measurement rigor, a suitable single-variable matching stimulus for strain was developed and validated, based on the matching stimulus used previously for breathy and rough voice qualities. METHOD A set of 21 comparison stimuli for a single-variable matching task (SVMT) was synthesized based on a speech-shaped sawtooth waveform mixed with speech-shaped noise. Variable bandpass filter gain in mid-to-high frequencies achieved a wide range of computed sharpness (in constant sharpness steps) and served as the independent variable for the SVMT. Ten natural /ɑ/ stimuli with a wide range of the primary voice quality of strain and a minimum of breathiness or roughness were selected and assessed using the SVMT. Natural voice samples and synthetic comparison stimuli were also assessed using a perceptual magnitude estimation (ME) task. RESULTS ME data validated the correspondence of the set of comparison stimuli to varying perceived strain. Perceived strain magnitudes of the comparison stimuli increased significantly and linearly with computed sharpness (r 2 = .99). A linear regression revealed that strain matching values were significantly predicted by computed sharpness (r 2 = .96) and perceived strain magnitudes (r 2 = .95) of the natural voice stimuli. CONCLUSION The perception of vocal strain is strongly associated with computed sharpness and is captured accurately and precisely using an SVMT, in which the independent variable is the bandpass filter gain (in steps of equal sharpness) applied to the comparison stimuli.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Supraja Anand
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Sophia M. Gifford
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Rahul Shrivastav
- Office of the Provost & Executive Vice President, Indiana University, Bloomington
| | - David A. Eddins
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| |
Collapse
|
4
|
Park Y, Anand S, Kopf LM, Shrivastav R, Eddins DA. Interactions Between Breathy and Rough Voice Qualities and Their Contributions to Overall Dysphonia Severity. J Speech Lang Hear Res 2022; 65:4071-4084. [PMID: 36260821 PMCID: PMC9940885 DOI: 10.1044/2022_jslhr-22-00012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
PURPOSE Dysphonic voices typically present multiple voice quality dimensions. This study investigated potential interactions between perceived breathiness and roughness and their contributions to overall dysphonia severity. METHOD Synthetic stimuli based on four talkers were created to systematically map out potential interactions. For each talker, a stimulus matrix composed of 49 stimuli (seven breathiness steps × seven roughness steps) was created by varying aspiration noise and open quotient to manipulate breathiness and superimposing amplitude modulation of varying depths to simulate roughness. One-dimensional matching (1DMA) and magnitude estimation (1DME) tasks were used to measure perceived breathiness, roughness, their potential interactions, and overall dysphonia severity. Additional 1DME tasks were used to assess a set of natural stimuli that varied along both breathiness and roughness. RESULTS For the synthetic stimuli, the 1DMA task indicated little interaction between the two voice qualities. For the 1DME task, breathiness magnitude was influenced by roughness step to a greater extent than roughness magnitude was influenced by breathiness step. The additive contributions of breathiness and roughness to overall severity gradually diminished with increasing breathiness and roughness steps, possibly reflecting a ceiling effect in the 1DME task. For the natural stimuli, little consistent interaction was observed between breathiness and roughness. CONCLUSIONS The matching task revealed minimal interaction between perceived breathiness and roughness, whereas the magnitude estimation task revealed some interaction between the two qualities and their cumulative contributions to overall dysphonia severity. Task differences are discussed in terms of differences in response bias and the role of perceptual anchors. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21313701.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Lisa M. Kopf
- Department of Speech, Language and Hearing Sciences, The George Washington University, Washington, DC
| | - Rahul Shrivastav
- Office of the Provost and Executive Vice President, Indiana University, Bloomington
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing
| | - David A. Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing
| |
Collapse
|
5
|
Shrivastav R. In Memoriam: Harry H. Hollien, 1926-2022. J Voice 2022; 36:S0892-1997(22)00247-8. [PMID: 35995652 DOI: 10.1016/j.jvoice.2022.08.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
- Rahul Shrivastav
- Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana
| |
Collapse
|
6
|
Park Y, Anand S, Ozmeral EJ, Shrivastav R, Eddins DA. Predicting Perceived Vocal Roughness Using a Bio-Inspired Computational Model of Auditory Temporal Envelope Processing. J Speech Lang Hear Res 2022; 65:2748-2758. [PMID: 35867607 PMCID: PMC9911094 DOI: 10.1044/2022_jslhr-22-00101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 04/14/2022] [Accepted: 04/25/2022] [Indexed: 06/15/2023]
Abstract
PURPOSE Vocal roughness is often present in many voice disorders but the assessment of roughness mainly depends on the subjective auditory-perceptual evaluation and lacks acoustic correlates. This study aimed to apply the concept of roughness in general sound quality perception to vocal roughness assessment and to characterize the relationship between vocal roughness and temporal envelop fluctuation measures obtained from an auditory model. METHOD Ten /ɑ/ recordings with a wide range of roughness were selected from an existing database. Ten listeners rated the roughness of the recordings in a single-variable matching task. Temporal envelope fluctuations of the recordings were analyzed with an auditory processing model of amplitude modulation that utilizes a modulation filterbank of different modulation frequencies. Pitch strength and the smoothed cepstral peak prominence were also obtained for comparison. RESULTS Individual simple regression models yielded envelope standard deviation from a modulation filter with a low center frequency (64.3 Hz) as a statistically significant predictor of vocal roughness with a strong coefficient of determination (r 2 = .80). Pitch strength and CPPS were not significant predictors of roughness. CONCLUSION This result supports the possible utility of envelope fluctuation measures from an auditory model as objective correlates of vocal roughness.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Erol J. Ozmeral
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Rahul Shrivastav
- Office of the Provost & Executive Vice President, Indiana University Bloomington
| | - David A. Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| |
Collapse
|
7
|
Arora S, Singh GK, Das P, Rahman R, Bellad P, Shrivastav R, Bahuguna A, Sapra D, Gupta A. A study of COVID‐19 vaccine (Covishield) induced dermatological adverse effects from India. J Eur Acad Dermatol Venereol 2022; 36:e402-e404. [DOI: 10.1111/jdv.17951] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 12/25/2021] [Accepted: 01/19/2022] [Indexed: 11/26/2022]
Affiliation(s)
- S Arora
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | - G K Singh
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | - P Das
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | - R Rahman
- Department of Physiology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | - P Bellad
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | | | - Amit Bahuguna
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | - Devyani Sapra
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| | - Akanksha Gupta
- Department of Dermatology Base Hospital Delhi Cantt & Army College of Medical Sciences New Delhi India
| |
Collapse
|
8
|
Devaraju DS, Kemp A, Eddins DA, Shrivastav R, Chandrasekaran B, Hampton Wray A. Effects of Task Demands on Neural Correlates of Acoustic and Semantic Processing in Challenging Listening Conditions. J Speech Lang Hear Res 2021; 64:3697-3706. [PMID: 34403278 DOI: 10.1044/2021_jslhr-21-00006] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Purpose Listeners shift their listening strategies between lower level acoustic information and higher level semantic information to prioritize maximum speech intelligibility in challenging listening conditions. Although increasing task demands via acoustic degradation modulates lexical-semantic processing, the neural mechanisms underlying different listening strategies are unclear. The current study examined the extent to which encoding of lower level acoustic cues is modulated by task demand and associations with lexical-semantic processes. Method Electroencephalography was acquired while participants listened to sentences in the presence of four-talker babble that contained either higher or lower probability final words. Task difficulty was modulated by time available to process responses. Cortical tracking of speech-neural correlates of acoustic temporal envelope processing-were estimated using temporal response functions. Results Task difficulty did not affect cortical tracking of temporal envelope of speech under challenging listening conditions. Neural indices of lexical-semantic processing (N400 amplitudes) were larger with increased task difficulty. No correlations were observed between the cortical tracking of temporal envelope of speech and lexical-semantic processes, even after controlling for the effect of individualized signal-to-noise ratios. Conclusions Cortical tracking of the temporal envelope of speech and semantic processing are differentially influenced by task difficulty. While increased task demands modulated higher level semantic processing, cortical tracking of the temporal envelope of speech may be influenced by task difficulty primarily when the demand is manipulated in terms of acoustic properties of the stimulus, consistent with an emerging perspective in speech perception.
Collapse
Affiliation(s)
- Dhatri S Devaraju
- Department of Communication Science and Disorders, University of Pittsburgh, PA
| | - Amy Kemp
- Department of Communication Sciences and Special Education, University of Georgia, Athens
| | - David A Eddins
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | | | | | - Amanda Hampton Wray
- Department of Communication Science and Disorders, University of Pittsburgh, PA
| |
Collapse
|
9
|
Eddins DA, Anand S, Lang A, Shrivastav R. Developing Clinically Relevant Scales of Breathy and Rough Voice Quality. J Voice 2021; 35:663.e9-663.e16. [PMID: 31932189 PMCID: PMC10413723 DOI: 10.1016/j.jvoice.2019.12.021] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 12/22/2019] [Accepted: 12/23/2019] [Indexed: 01/14/2023]
Abstract
The most common measurement tools used in the perceptual evaluation of voice quality yield ordinal data and thus do not support the establishment of mathematical relationships among different measurement values. This makes their interpretation challenging. Among the many desirable features of any psychophysical measurement tool is the ability to quantify the difference between two or more measurements and the ability to interpret the measurements in a manner that is related to the experience of the observer. The former allows one to compare among measurements using simple mathematics, while the latter allows that comparison to be interpreted in constructive ways. In this paper we describe the development of standard measurement scales for two dimensions of voice quality, following an approach that has been applied successfully to the perception of loudness. The scales follow step-by-step procedures used to develop the sone scale of loudness, which ties physical units to the perceptual estimates of loudness magnitude. Goals of the current work include development of analogous scales for the perception of breathy and rough voice qualities. First, the relationship between perceived voice quality and physical units were established using single-variable matching tasks. Second, the relationship between a change in physical units from the single-variable matching tasks and perceived voice quality magnitude were established using magnitude estimation tasks. Third, single reference points were identified on breathy and rough continuums. Finally, all points on the newly established voice quality continuums were rescaled relative to these arbitrary reference points. The proposed breathiness and roughness scales result in ratio-level data with standard measurement units that support quantitative comparisons of perceptual judgments. Such judgments can be used, for example, to compare magnitude of change pre- and post-treatment.
Collapse
Affiliation(s)
- David A Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida.
| | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
| | | | - Rahul Shrivastav
- Office of the Vice President for Instruction, University of Georgia, Athens, Georgia
| |
Collapse
|
10
|
Bottalico P, Codino J, Cantor-Cutiva LC, Marks K, Nudelman CJ, Skeffington J, Shrivastav R, Jackson-Menaldi MC, Hunter EJ, Rubin AD. Reproducibility of Voice Parameters: The Effect of Room Acoustics and Microphones. J Voice 2020; 34:320-334. [PMID: 30471944 PMCID: PMC6529301 DOI: 10.1016/j.jvoice.2018.10.016] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 10/30/2018] [Accepted: 10/31/2018] [Indexed: 10/27/2022]
Abstract
INTRODUCTION Computer analysis of voice recordings is an integral part of the evaluation and management of voice disorders. In many practices, voice samples are taken in rooms that are not sound attenuated and/or sound-proofed; further, the technology used is rarely consistent. This will likely affect the recordings, and therefore, their analyses. OBJECTIVES The objective of this study is to compare various acoustic outcome measures taken from samples recorded in a sound-proofed booth to those recorded in more common clinic environments. Further, the effects from six different commonly used microphones will be compared. METHODS Thirty-six speakers were recorded while reading a text and producing sustained vowels in a controlled acoustic environment. The collected samples were reproduced by a Head and Torso Simulator and recorded in three clinical rooms and in a sound booth using six different microphones. Newer measures (eg, Pitch Strength, cepstral peak prominence, Acoustic Voice Quality Index), as well as more traditional measures (eg Jitter, Shimmer, harmonics-to-noise ratio and Spectrum Tilt), were calculated from the samples collected with each microphone and within each room. RESULTS The measures which are more robust to room acoustic differences, background noise, and microphone quality include Jitter and smooth cepstral peak prominence, followed by Shimmer, Acoustic Voice Quality Index, harmonics-to-noise ratio, Pitch Strength, and Spectrum Tilt. CONCLUSIONS The effect of room acoustics and background noise on voice parameters appears to be stronger than the type of microphone used for the recording. Consequently, an appropriate acoustical clinical space may be more important than the quality of the microphone.
Collapse
Affiliation(s)
- Pasquale Bottalico
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois.
| | - Juliana Codino
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, Michigan
| | - Lady Catherine Cantor-Cutiva
- Department of Collective Health, Universidad Nacional de Colombia, Bogota, Colombia; Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| | - Katherine Marks
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, Michigan
| | - Charles J Nudelman
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois
| | - Jean Skeffington
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, Michigan
| | | | - Maria Cristina Jackson-Menaldi
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, Michigan; Department of Otolaryngology, School of Medicine, Wayne State University, Detroit, Michigan
| | - Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| | - Adam D Rubin
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, Michigan
| |
Collapse
|
11
|
Rubin AD, Jackson-Menaldi C, Kopf LM, Marks K, Skeffington J, Skowronski MD, Shrivastav R, Hunter EJ. Comparison of Pitch Strength With Perceptual and Other Acoustic Metric Outcome Measures Following Medialization Laryngoplasty. J Voice 2020; 33:795-800. [PMID: 29773324 DOI: 10.1016/j.jvoice.2018.03.019] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2017] [Accepted: 03/27/2018] [Indexed: 11/15/2022]
Abstract
INTRODUCTION The diagnoses of voice disorders, as well as treatment outcomes, are often tracked using visual (eg, stroboscopic images), auditory (eg, perceptual ratings), objective (eg, from acoustic or aerodynamic signals), and patient report (eg, Voice Handicap Index and Voice-Related Quality of Life) measures. However, many of these measures are known to have low to moderate sensitivity and specificity for detecting changes in vocal characteristics, including vocal quality. OBJECTIVE The objective of this study was to compare changes in estimated pitch strength (PS) with other conventionally used acoustic measures based on the cepstral peak prominence (smoothed cepstral peak prominence, cepstral spectral index of dysphonia, and acoustic voice quality index), and clinical judgments of voice quality (GRBAS [grade, roughness, breathiness, asthenia, strain] scale) following laryngeal framework surgery. METHODS This study involved post hoc analysis of recordings from 22 patients pretreatment and post treatment (thyroplasty and behavioral therapy). Sustained vowels and connected speech were analyzed using objective measures (PS, smoothed cepstral peak prominence, cepstral spectral index of dysphonia, and acoustic voice quality index), and these results were compared with mean auditory-perceptual ratings by expert clinicians using the GRBAS scale. RESULTS All four acoustic measures changed significantly in the direction that usually indicates improved voice quality following treatment (P < 0.005). Grade and breathiness correlated the strongest with the acoustic measures (|r| ~ 0.7) with strain being the least correlated. CONCLUSIONS Acoustic analysis on running speech highly correlates with judged ratings. PS is a robust, easily obtained acoustic measure of voice quality that could be useful in the clinical environment to follow treatment of voice disorders.
Collapse
Affiliation(s)
- Adam D Rubin
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, St. Clair Shores, Michigan; Department of Surgery, Oakland University William Beaumont School of Medicine, Rochester, Michigan.
| | - Cristina Jackson-Menaldi
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, St. Clair Shores, Michigan; Department of Otolaryngology, School of Medicine, Wayne State University, Detroit, Michigan
| | - Lisa M Kopf
- Department of Communication Sciences and Disorders, University of Northern Iowa, Cedar Falls, Iowa
| | - Katherine Marks
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, St. Clair Shores, Michigan
| | - Jean Skeffington
- Lakeshore Ear, Nose, and Throat Center, Lakeshore Professional Voice Center, St. Clair Shores, Michigan
| | - Mark D Skowronski
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
| | - Rahul Shrivastav
- Office of the Vice President for Instruction, University of Georgia, Athens, Georgia
| | - Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| |
Collapse
|
12
|
Anand S, Kopf LM, Shrivastav R, Eddins DA. Objective Indices of Perceived Vocal Strain. J Voice 2019; 33:838-845. [DOI: 10.1016/j.jvoice.2018.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Revised: 06/06/2018] [Accepted: 06/07/2018] [Indexed: 10/28/2022]
|
13
|
Anand S, Skowronski MD, Shrivastav R, Eddins DA. Perceptual and Quantitative Assessment of Dysphonia Across Vowel Categories. J Voice 2019; 33:473-481. [DOI: 10.1016/j.jvoice.2017.12.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 12/21/2017] [Indexed: 10/16/2022]
|
14
|
Kemp A, Eddins D, Shrivastav R, Hampton Wray A. Effects of Task Difficulty on Neural Processes Underlying Semantics: An Event-Related Potentials Study. J Speech Lang Hear Res 2019; 62:367-386. [PMID: 30950685 DOI: 10.1044/2018_jslhr-h-17-0396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Purpose Improving the ability to listen efficiently in noisy environments is a critical goal for hearing rehabilitation. However, understanding of the impact of difficult listening conditions on language processing is limited. The current study evaluated the neural processes underlying semantics in challenging listening conditions. Method Thirty adults with normal hearing completed an auditory sentence processing task in 4-talker babble. Event-related brain potentials were elicited by the final word in high- or low-context sentences, where the final word was either highly expected or not expected, followed by a 4-alternative forced-choice response with either longer (1,000 ms), middle (700 ms), or shorter (400 ms) response time deadlines (RTDs). Results Behavioral accuracy was reduced, and reactions times were faster for shorter RTDs. N400 amplitudes, reflecting ease of lexical access, were larger when elicited by target words in low-context sentences followed by shorter compared with longer RTDs. Conclusions These results reveal that more neural resources are allocated for semantic processing/lexical access when listening difficulty increases. Differences between RTDs may reflect increased attentional allocation for shorter RTDs. These findings suggest that situational listening demands can impact the demands for cognitive resources engaged in language processing, which could significantly impact listener experiences across environments.
Collapse
Affiliation(s)
- Amy Kemp
- Department of Communicative Sciences & Disorders, Michigan State University, East Lansing
| | - David Eddins
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | | | - Amanda Hampton Wray
- Department of Communicative Sciences & Disorders, Michigan State University, East Lansing
| |
Collapse
|
15
|
Perimbeti SP, Shrivastav R, Ward K, Styler M. Abstract P2-08-32: Withdrawn. Cancer Res 2019. [DOI: 10.1158/1538-7445.sabcs18-p2-08-32] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
This abstract was withdrawn by the authors.
Citation Format: Perimbeti SP, Shrivastav R, Ward K, Styler M. Withdrawn [abstract]. In: Proceedings of the 2018 San Antonio Breast Cancer Symposium; 2018 Dec 4-8; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2019;79(4 Suppl):Abstract nr P2-08-32.
Collapse
Affiliation(s)
- SP Perimbeti
- Mount Sinai St Luke's Roosevelt Hospital, New York, NY; Drexel University, Philadelphia, PA
| | - R Shrivastav
- Mount Sinai St Luke's Roosevelt Hospital, New York, NY; Drexel University, Philadelphia, PA
| | - K Ward
- Mount Sinai St Luke's Roosevelt Hospital, New York, NY; Drexel University, Philadelphia, PA
| | - M Styler
- Mount Sinai St Luke's Roosevelt Hospital, New York, NY; Drexel University, Philadelphia, PA
| |
Collapse
|
16
|
Shrivastav R, Correa A, Pati P, Modi V, Perimbeti S, Palazzo A. P5425Increased mortality in systemic lupus erythematosus affected patients with myocardial infarction: a nationwide study using National Inpatient Sample (NIS) of 1999-2014. Eur Heart J 2018. [DOI: 10.1093/eurheartj/ehy566.p5425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Affiliation(s)
- R Shrivastav
- St Luke's Roosevelt Hospital, New York, United States of America
| | - A Correa
- St Luke's Roosevelt Hospital, New York, United States of America
| | - P Pati
- University of Pittsburgh, Pittsburgh, United States of America
| | - V Modi
- St Luke's Roosevelt Hospital, New York, United States of America
| | - S Perimbeti
- St Luke's Roosevelt Hospital, New York, United States of America
| | - A Palazzo
- St Luke's Roosevelt Hospital, New York, United States of America
| |
Collapse
|
17
|
Kopf LM, Skowronski MD, Anand S, Eddins DA, Shrivastav R. The Perception of Breathiness in the Voices of Pediatric Speakers. J Voice 2017; 33:204-213. [PMID: 29162356 DOI: 10.1016/j.jvoice.2017.09.024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Revised: 09/27/2017] [Accepted: 09/28/2017] [Indexed: 10/18/2022]
Abstract
BACKGROUND The perception of pediatric voice quality has been investigated using clinical protocols developed for adult voices and acoustic analyses designed to identify important physical parameters associated with normal and dysphonic pediatric voices. Laboratory investigations of adult dysphonia have included sophisticated methods, including a psychoacoustic approach that involves a single-variable matching task (SVMT), characterized by high inter- and intra-listener reliability, and analyses that include bio-inspired models of auditory perception that have provided valuable information regarding adult voice quality. OBJECTIVES To establish the utility of a psychoacoustic approach to the investigation of voice quality perception in the context of pediatric voices? METHODS Six listeners judged the breathiness of 20 synthetic vowel stimuli using an SVMT. To support comparisons with previous data, stimuli were modeled after four pediatric speakers and synthesized using Klatt with five parameter settings that influence the perception of breathiness. The population average breathiness judgments were modeled with acoustic measures of loudness ratio, pitch strength, and cepstral peak. RESULTS Listeners reliably judged the perceived breathiness of pediatric voices, as with previous investigations of breathiness in adult dysphonic voices. Breathiness judgments were accurately modeled by loudness ratio (r2 = 0.93), pitch strength (r2 = 0.91), and cepstral peak (r2 = 0.82). Model accuracy was not affected significantly by including stimulus fundamental frequency and was slightly higher for pediatric than for adult voices. CONCLUSIONS The SVMT proved robust for pediatric voices spanning a wide range of breathiness. The data indicate that this is a promising approach for future investigation of pediatric voice quality.
Collapse
Affiliation(s)
- Lisa M Kopf
- Department of Communication Sciences and Disorders, University of Northern Iowa, Cedar Falls, Iowa
| | | | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
| | - David A Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida.
| | - Rahul Shrivastav
- Office of the Vice President for Instruction, University of Georgia, Athens, Georgia
| |
Collapse
|
18
|
Ranieri P, Shrivastav R, Wang M, Lin A, Fridman G, Fridman AA, Han LH, Miller V. Nanosecond-Pulsed Dielectric Barrier Discharge–Induced Antitumor Effects Propagate through Depth of Tissue via Intracellular Signaling. Plasma Med 2017. [DOI: 10.1615/plasmamed.2017019883] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
19
|
Eddins DA, Kopf LM, Shrivastav R. The psychophysics of roughness applied to dysphonic voice. J Acoust Soc Am 2015; 138:3820-5. [PMID: 26723336 PMCID: PMC4691258 DOI: 10.1121/1.4937753] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2015] [Revised: 11/19/2015] [Accepted: 11/25/2015] [Indexed: 05/19/2023]
Abstract
Roughness is a sound quality that has been related to the amplitude modulation characteristics of the acoustic stimulus. Roughness also is considered one of the primary elements of voice quality associated with natural variations across normal voices and is a salient feature of many dysphonic voices. It is known that the roughness of tonal stimuli is dependent on the frequency and depth of amplitude modulation and on the carrier frequency. Here, it is determined if similar dependencies exist for voiced speech stimuli. Knowledge of such dependencies can lead to a better understanding of the acoustic characteristics of vocal roughness along the continuum of normal to dysphonic and may facilitate computational estimates of vocal roughness. Synthetic vowel stimuli were modeled after talkers selected from the Satloff/Heman-Ackah disordered voice database. To parametrically control amplitude modulation frequency and depth, synthesized stimuli had minimal amplitude fluctuations, and amplitude modulation was superimposed with the desired frequency and depth. Perceptual roughness judgments depended on amplitude modulation frequency and depth in a manner that closely matched data from tonal carriers. The dependence of perceived roughness on amplitude modulation frequency and depth closely matched the roughness of sinusoidal carriers as reported by Fastl and Zwicker [(2007) Psychoacoustics: Facts and Models, 3rd ed. (Springer, New York)].
Collapse
Affiliation(s)
- David A Eddins
- Department of Communication Sciences and Disorders, University of South Florida, 4202 East Fowler Avenue, PCD 1017, Tampa, Florida 33620, USA
| | - Lisa M Kopf
- Department of Communicative Sciences and Disorders, Michigan State University, 1026 Red Cedar Road, Room 109, Oyer Speech and Hearing Building, East Lansing Michigan 48824, USA
| | - Rahul Shrivastav
- Office of the Vice President for Instruction, University of Georgia, 0309 New College, Athens, Georgia 30602, USA
| |
Collapse
|
20
|
Skowronski MD, Shrivastav R, Hunter EJ. Cepstral Peak Sensitivity: A Theoretic Analysis and Comparison of Several Implementations. J Voice 2015; 29:670-81. [PMID: 25944288 DOI: 10.1016/j.jvoice.2014.11.005] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2014] [Accepted: 11/11/2014] [Indexed: 10/23/2022]
Abstract
OBJECTIVE The aim of this study was to develop a theoretic analysis of the cepstral peak (CP), to compare several CP software programs, and to propose methods for reducing variability in CP estimation. STUDY DESIGN Descriptive, experimental study. METHODS The theoretic CP value of a pulse train was derived and compared with estimates computed for pulse train WAV files using available CP software programs: (1) Hillenbrand's CP prominence (CPP) software (Western Michigan University, Kalamazoo, MI), (2) KayPENTAX (Montvale, NJ) Multi-Speech implementation of CPP, and (3) a MATLAB (The Mathworks, Natick, MA, version R2014a) implementation using cepstral interpolation. The CP variation was also investigated for synthetic breathy vowels. RESULTS For pulse trains with period T samples, the theoretic CP is 1/2+ε/T, |ε|<0.1 for all pulse trains (ε=0 for integer T). For fundamental frequencies between 70 and 230Hz, the CP mean±standard deviation was 0.496±0.002 using cepstral interpolation and 0.29±0.03 using Hillenbrand's software, whereas CPP was 35.0±3.8dB using Hillenbrand's software and 20.5±2.7dB using KayPENTAX's software. The CP and CPP versus signal-to-noise ratio for synthetic breathy vowels were fit to a logistic model for the Hillenbrand (R(2)=0.92) and KayPENTAX (R(2)=0.82) estimators as well as an ideal estimator (R(2)=0.98), which used a period-synchronous analysis. CONCLUSIONS The findings indicate that several variables unrelated to the signal itself impact CP values, with some factors introducing large variability in CP values that would otherwise be attributed to the signal (eg, voice quality). Variability may be reduced by using a period-synchronous analysis with Hann windows.
Collapse
Affiliation(s)
- Mark D Skowronski
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan.
| | - Rahul Shrivastav
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| | - Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| |
Collapse
|
21
|
Eddins DA, Shrivastav R. Psychometric properties associated with perceived vocal roughness using a matching task. J Acoust Soc Am 2013; 134:EL294-300. [PMID: 24116533 PMCID: PMC3779263 DOI: 10.1121/1.4819183] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Accepted: 08/09/2013] [Indexed: 05/19/2023]
Abstract
A psychophysical matching paradigm has been used to better quantify voice quality under laboratory conditions. The goals of this study were to establish which of two candidate comparison stimuli would best ensure that the range of perceived vocal roughness could be adequately bracketed using a matching task and to provide a general solution to the problem of estimating vocal roughness. Psychometric functions for roughness matching indicated that a speech-like sawtooth-plus-noise complex (20 dB signal-to-noise ratio) amplitude modulated by a sinusoidal function raised to the 4th power yielded a comparison stimulus with a perceptual dynamic range well suited for roughness matching.
Collapse
Affiliation(s)
- David A Eddins
- Department of Communication Sciences and Disorders, Global Center for Hearing and Speech Research, University of South Florida, 4202 East Fowler Avenue, PCD 1017, Tampa, Florida 33620
| | | |
Collapse
|
22
|
Abstract
Spontaneous swallowing in dysphagic individuals has been shown to occur at a lower rate compared to healthy controls, and passive swallowing detection may function as a valid screening test to identify dysphagia in at-risk populations. To automate swallow identification, acoustic source and vocal tract features were extracted from two types of swallows and eight upper airway movements from nine healthy subjects. Swallow vs non-swallow classification accuracy was 96.3 ± 1.1%. The results provide useful methods for further development of automated tools for identification of patients with swallowing impairment.
Collapse
Affiliation(s)
- Mark D Skowronski
- Department of Communicative Sciences and Disorders, 216 Oyer Speech and Hearing Clinic, Michigan State University, East Lansing, Michigan 48824, USA.
| | | | | |
Collapse
|
23
|
Anand S, Shrivastav R, Wingate JM, Chheda NN. An Acoustic-Perceptual Study of Vocal Tremor. J Voice 2012; 26:811.e1-7. [DOI: 10.1016/j.jvoice.2012.02.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Accepted: 02/27/2012] [Indexed: 10/28/2022]
|
24
|
|
25
|
Abstract
PURPOSE Perceptual estimates of voice quality obtained using rating scales are subject to contextual biases that influence how individuals assign numbers to estimate the magnitude of vocal quality. Because rating scales are commonly used in clinical settings, assessments of voice quality are also subject to the limitations of these scales. Instead, a matching task can be used to obtain objective measures of voice quality, thereby facilitating model development and tools for clinical use. METHOD Twenty-seven individuals participated in a rating task or at least 1 of 3 matching tests (named after their modulation functions: SINE, SQUARE, POWER) to quantify the degree of roughness in dysphonic voice stimuli. Participants evaluated the roughness of 34 voice samples using an amplitude-modulated complex carrier. RESULTS The matching thresholds were highly correlated with the ratings estimates. Reliability of thresholds did not significantly differ across tasks, but linear regressions showed that the POWER test resulted in larger perceptual distances. CONCLUSIONS A matching task can be used to obtain reliable estimates of roughness in dysphonic voices. The POWER comparison is recommended because the variability in matching thresholds across the range of roughness was evenly distributed, and the perceptual distances between stimuli were maximized.
Collapse
Affiliation(s)
- Sona Patel
- University of Florida, Gainesville, FL, USA.
| | | | | |
Collapse
|
26
|
Plowman EK, Maling N, Rivera BJ, Larson K, Thomas NJ, Fowler SC, Manfredsson FP, Shrivastav R, Kleim JA. Differential sensitivity of cranial and limb motor function to nigrostriatal dopamine depletion. Behav Brain Res 2012; 237:157-63. [PMID: 23018122 DOI: 10.1016/j.bbr.2012.09.031] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Revised: 09/14/2012] [Accepted: 09/17/2012] [Indexed: 10/27/2022]
Abstract
The present study determined the differential effects of unilateral striatal dopamine depletion on cranial motor versus limb motor function. Forty male Long Evans rats were first trained on a comprehensive motor testing battery that dissociated cranial versus limb motor function and included: cylinder forepaw placement, single pellet reaching, vermicelli pasta handling; sunflower seed opening, pasta biting acoustics, and a licking task. Following baseline testing, animals were randomized to either a 6-hydroxydopamine (6-OHDA) (n=20) or control (n=20) group. Animals in the 6-OHDA group received unilateral intrastriatal 6-OHDA infusions to induce striatal dopamine depletion. Six-weeks following infusion, all animals were re-tested on the same battery of motor tests. Near infrared densitometry was performed on sections taken through the striatum that were immunohistochemically stained for tyrosine hydroxylase (TH). Animals in the 6-OHDA condition showed a mean reduction in TH staining of 88.27%. Although 6-OHDA animals were significantly impaired on all motor tasks, limb motor deficits were more severe than cranial motor impairments. Further, performance on limb motor tasks was correlated with degree of TH depletion while performance on cranial motor impairments showed no significant correlation. These results suggest that limb motor function may be more sensitive to striatal dopaminergic depletion than cranial motor function and is consistent with the clinical observation that therapies targeting the nigrostriatal dopaminergic system in Parkinson's disease are more effective for limb motor symptoms than cranial motor impairments.
Collapse
Affiliation(s)
- Emily K Plowman
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, FL 33620, United States.
| | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Abstract
OBJECTIVE This study describes an innovative method to optimize cochlear implant (CI) devices for individual patients using speech stimuli. DESIGN Baseline performance of clinically created CI settings was evaluated objectively (CNC and BKB-SIN) and subjectively (listening situation questionnaire). Participants underwent the Clarujust optimization procedure during two sessions, each two weeks apart, allowing for acclimatization and experience with each group of programs. Outcome testing was completed on both optimization settings during the two week follow-up session. CNC and BKB-SIN scores were compared within participants across test sessions. STUDY SAMPLE Twenty adult experienced CI users were reprogrammed using the Clarujust™ fitting procedure. RESULTS Significant improvements (p < 0.05) on CNC words and BKB-SIN sentences were observed with optimized maps compared to maps programmed using standard methods. Subjective performance also showed increased satisfaction with the optimized maps. The maps for individual users varied significantly and no single fitting applied for all patients. CONCLUSIONS The Clarujust optimization method of programming CIs shows promise to improve patient performance and increase patient satisfaction in a shorter clinical test time.
Collapse
Affiliation(s)
- Alice E Holmes
- Department of Speech, Language and Hearing Science, University of Florida, Gainesville, FL 32611, USA.
| | | | | | | | | |
Collapse
|
28
|
Abstract
PURPOSE In this experiment, a single comparison stimulus was developed as a reference in a perceptual matching task for the quantification of breathy voice quality. Perceptual judgments of a set of synthetic voice samples were compared to previous data obtained using multiple comparison stimuli "customized" for different voices (Patel, Shrivastav, & Eddins, 2010). METHOD Five male and 5 female samples of the vowel /a/ were selected from the Kay Elemetrics Disordered Voice Database and resynthesized using a Klatt synthesizer. Eleven samples were created for each base voice by manipulating the aspiration noise level. Five samples from each continuum were evaluated in a perceptual matching task in which a single sawtooth and noise comparison stimulus was used to obtain breathiness judgments. Linear regression was used to compare measurements obtained using the new comparison stimulus against the customized comparison stimuli. RESULTS Results indicated that the noncustomized sawtooth comparison provides reliability and perceptual distances between stimuli similar to those obtained using customized comparison stimuli. CONCLUSION A single-variable matching task using a single comparison stimulus can be used to obtain perceptual estimates of breathiness across voices and experiments in a laboratory setting. This technique will help develop models of voice-quality perception.
Collapse
Affiliation(s)
- Sona Patel
- University of Florida, Gainesville, FL, USA.
| | | | | |
Collapse
|
29
|
Abstract
Two sounds with the same pitch may vary from each other based on saliency of their pitch sensation. This perceptual attribute is called "pitch strength." The study of voice pitch strength may be important in quantifying of normal and pathological qualities. The present study investigated how pitch strength varies across normal and dysphonic voices. A set of voices (vowel /a/) selected from the Kay Elemetrics Disordered Voice Database served as the stimuli. These stimuli demonstrated a wide range of voice quality. Ten listeners judged the pitch strength of these stimuli in an anchored magnitude estimation task. On a given trial, listeners heard three different stimuli. The first stimulus represented very low pitch strength (wide-band noise), the second stimulus consisted of the target voice and the third stimulus represented very high pitch strength (pure tone). Listeners estimated pitch strength of the target voice by positioning a continuous slider labeled with values between 0 and 1, reflecting the two anchor stimuli. Results revealed that listeners can judge pitch strength reliably in dysphonic voices. Moderate to high correlations with perceptual judgments of voice quality suggest that pitch strength may contribute to voice quality judgments.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Malcom Randall VAMC and University of Florida, Dauer Hall, P.O. Box 117420, Gainesville, Florida 32611, USA.
| | | | | |
Collapse
|
30
|
Silverman EP, Garvan C, Shrivastav R, Sapienza CM. Combined Modality Treatment of Adductor Spasmodic Dysphonia. J Voice 2012; 26:77-86. [DOI: 10.1016/j.jvoice.2010.08.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2010] [Accepted: 08/27/2010] [Indexed: 11/16/2022]
|
31
|
|
32
|
Shrivastav R, Camacho A, Patel S, Eddins DA. A model for the prediction of breathiness in vowels. J Acoust Soc Am 2011; 129:1605-15. [PMID: 21428523 PMCID: PMC3077964 DOI: 10.1121/1.3543993] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2010] [Revised: 10/29/2010] [Accepted: 12/29/2010] [Indexed: 05/19/2023]
Abstract
The perception of breathiness in vowels is cued by multiple acoustic cues, including changes in aspiration noise (AH) and the open quotient (OQ) [Klatt and Klatt, J. Acoust. Soc. Am. 87(2), 820-857 (1990)]. A loudness model can be used to determine the extent to which AH masks the harmonic components in voice. The resulting "partial loudness" (PL) and loudness of AH ["noise loudness" (NL)] have been shown to be good predictors of perceived breathiness [Shrivastav and Sapienza, J. Acoust. Soc. Am. 114(1), 2217-2224 (2003)]. The levels of AH and OQ were systematically manipulated for ten synthetic vowels. Perceptual judgments of breathiness were obtained and regression functions to predict breathiness from the ratio of NL to PL (η) were derived. Results show that breathiness can be modeled as a power function of η. The power parameter of this function appears to be affected by the fundamental frequency of the vowel. A second experiment was conducted to determine if the resulting power function could estimate breathiness in a different set of voices. The breathiness of these stimuli, both natural and synthetic, was determined in a listening test. The model estimates of breathiness were highly correlated with perceptual data but the absolute predicted values showed some discrepancies.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Department of Communication Sciences and Disorders, University of Florida and Malcom Randall VAMC, Dauer Hall, P.O. Box 117420, Gainesville, Florida 32611, USA.
| | | | | | | |
Collapse
|
33
|
Shrivastav R, Punde RP, Pandey H, Samarth R, Maudar K. Evolutionary Development of Molecular Tools in the Diagnosis of Mycobacterium tuberculosis: A Review. J of Medical Sciences 2010. [DOI: 10.3923/jms.2010.124.129] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
34
|
Stigler MH, Arora M, Dhavan P, Tripathy V, Shrivastav R, Reddy KS, Perry CL. Measuring obesity among school-aged youth in India: a comparison of three growth references. Indian Pediatr 2010; 48:105-10. [PMID: 20972296 DOI: 10.1007/s13312-011-0041-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2009] [Accepted: 03/02/2010] [Indexed: 11/28/2022]
Abstract
OBJECTIVE To compare three growth references that can be used to assess the weight status of school-aged youth living in India, with a particular focus on identifying overweight and obese youth. STUDY DESIGN Cross-sectional study. Kappa scores were used to measure agreement between growth references. Regression models were used to test for differences in weight status by grade level, gender, and school type, using each growth reference. SETTING Private (n=4) and Government schools (n=4) in Delhi, India. PARTICIPANTS Students (n=1818) in eighth and tenth grade attending the schools. MAIN OUTCOME MEASURES Weight status was derived using age- and gender-specific cut-points provided by: (a) a national growth reference specific to India; (b) an international reference recommended by the International Obesity Task Force (IOTF); and (c) a new international reference recommended by the World Health Organization (WHO). RESULTS The IOTF reference consistently classified participants in a lower weight status category, compared with the national reference (k=0.57) and the WHO reference (k=0.69). The agreement between the WHO and the national references was higher (k=0.84). CONCLUSIONS To date, all published studies of childhood obesity in India have used the IOTF reference, the national reference, or an old WHO reference to measure weight status among school-going youth. The new WHO reference may be a better choice. Compared to the IOTF reference, it does not appear to underestimate obesity and can still be used to compare trends, globally.
Collapse
Affiliation(s)
- M H Stigler
- Michael and Susan Dell Center for Advancement of Healthy Living, School of Public Health, University of Texas, Austin and Houston, Texas, USA.
| | | | | | | | | | | | | |
Collapse
|
35
|
Shrivastav R, Camacho A. A computational model to predict changes in breathiness resulting from variations in aspiration noise level. J Voice 2009; 24:395-405. [PMID: 19896328 DOI: 10.1016/j.jvoice.2008.12.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2008] [Accepted: 12/01/2008] [Indexed: 10/20/2022]
Abstract
Perception of breathy voice quality is cued by a number of acoustic changes including an increase in aspiration noise level (AH) and spectral slope. Changes in AH in a vowel may be evaluated through measures such as the harmonic-to-noise ratio, cepstral peak prominence (CPP), or via auditory measures such as the partial loudness of harmonic energy and loudness of aspiration noise. Although a number of experiments have reported high correlation between such measures and ratings of perceived breathiness, a formal model to predict breathiness of a vowel has not been proposed. This research describes two computational models to predict changes in breathiness resulting from variations in AH. One model uses auditory measures, whereas the other uses CPP as independent variables to predict breathiness. For both cases, a translated and truncated power function is required to predict breathiness. Some parameters in both of these models were observed to be pitch dependent. The "unified" model based on auditory measures was observed to be more accurate than one based on CPP.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, Florida 32611, USA.
| | | |
Collapse
|
36
|
Plowman-Prine E, Okun M, Sapienza C, Shrivastav R, Fernandez H, Foote K, Ellis C, Rodriguez A, Burkhead L, Rosenbek J. Perceptual characteristics of Parkinsonian speech: A comparison of the pharmacological effects of levodopa across speech and non-speech motor systems. NeuroRehabilitation 2009; 24:131-44. [DOI: 10.3233/nre-2009-0462] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- E.K Plowman-Prine
- Department of Neurology, University of Florida, Gainesville, FL, USA
- Brain Rehabilitation Research Center, Malcom Randall Veterans Affairs Hospital, Gainesville, FL, USA
| | - M.S. Okun
- Department of Neurology, University of Florida, Gainesville, FL, USA
| | - C.M. Sapienza
- Brain Rehabilitation Research Center, Malcom Randall Veterans Affairs Hospital, Gainesville, FL, USA
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, FL, USA
| | - R. Shrivastav
- Brain Rehabilitation Research Center, Malcom Randall Veterans Affairs Hospital, Gainesville, FL, USA
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, FL, USA
| | - H.H. Fernandez
- Department of Neurology, University of Florida, Gainesville, FL, USA
| | - K.D. Foote
- Department of Neurosurgery, University of Florida, Gainesville, FL, USA
| | - C. Ellis
- Department of Health Professions, Medical University of South Carolina, SC, USA
| | - A.D. Rodriguez
- Brain Rehabilitation Research Center, Malcom Randall Veterans Affairs Hospital, Gainesville, FL, USA
- Department of Communication Disorders, University of Florida, Gainesville, FL, USA
| | - L.M. Burkhead
- Department of Otolaryngology, Medical College of Georgia, GA, USA
| | - J.C. Rosenbek
- Brain Rehabilitation Research Center, Malcom Randall Veterans Affairs Hospital, Gainesville, FL, USA
- Department of Communication Disorders, University of Florida, Gainesville, FL, USA
| |
Collapse
|
37
|
Patel S, Shrivastav R, Eddins DA. Perceptual distances of breathy voice quality: a comparison of psychophysical methods. J Voice 2009; 24:168-77. [PMID: 19185451 DOI: 10.1016/j.jvoice.2008.08.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2008] [Accepted: 08/04/2008] [Indexed: 10/21/2022]
Abstract
Experiments to study voice quality have typically used rating scales or direct magnitude estimation to obtain listener judgments. Unfortunately, the data obtained using these tasks are context dependent, which makes it difficult to compare perceptual judgments of voice quality across experiments. The present experiment describes a simple matching task to quantify voice quality. The data obtained through this task were compared to perceptual judgments obtained using rating scale and direct magnitude estimation tasks to determine whether the three tasks provide equivalent perceptual distances across stimuli. Ten synthetic vowel continua that varied in terms of their aspiration noise were evaluated for breathiness using each of the three tasks. Linear and nonlinear regressions were used to compare the perceptual distances between stimuli obtained through each technique. Results show that the perceptual distances estimated from matching and direct magnitude estimation task are similar, but both differ from the rating scale task, suggesting that the matching task provides perceptual distances with ratio-level measurement properties. The matching task is advantageous for measurement of vocal quality because it provides reliable measurement with ratio-level scale properties. It allows the use of a fixed reference signal for all comparisons, thus allowing researchers to directly compare findings across different experiments.
Collapse
Affiliation(s)
- Sona Patel
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, Florida 32611, USA
| | | | | |
Collapse
|
38
|
Perrin PB, Heesacker M, Shrivastav R. Removing the Tinted Spectacles: Accurate Client Emotionality Assessment Despite Therapists' Gender Stereotypes. Journal of Social and Clinical Psychology 2008. [DOI: 10.1521/jscp.2008.27.7.711] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
39
|
Ludlow CL, Hoit J, Kent R, Ramig LO, Shrivastav R, Strand E, Yorkston K, Sapienza CM. Translating principles of neural plasticity into research on speech motor control recovery and rehabilitation. J Speech Lang Hear Res 2008; 51:S240-58. [PMID: 18230849 PMCID: PMC2364711 DOI: 10.1044/1092-4388(2008/019)] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
PURPOSE To review the principles of neural plasticity and make recommendations for research on the neural bases for rehabilitation of neurogenic speech disorders. METHOD A working group in speech motor control and disorders developed this report, which examines the potential relevance of basic research on the brain mechanisms involved in neural plasticity and discusses possible similarities and differences for application to speech motor control disorders. The possible involvement of neural plasticity in changes in speech production in normalcy, development, aging, and neurological diseases and disorders was considered. This report focuses on the appropriate use of functional and structural neuroimaging and the design of feasibility studies aimed at understanding how brain mechanisms are altered by environmental manipulations such as training and stimulation and how these changes might enhance the future development of rehabilitative methods for persons with speech motor control disorders. CONCLUSIONS Increased collaboration with neuroscientists working in clinical research centers addressing human communication disorders might foster research in this area. It is hoped that this article will encourage future research on speech motor control disorders to address the principles of neural plasticity and their application for rehabilitation.
Collapse
Affiliation(s)
- Christy L Ludlow
- National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA.
| | | | | | | | | | | | | | | |
Collapse
|
40
|
Affiliation(s)
- Sona Patel
- Department of Communication Sciences and Disorders, University of FloridaGainesville, FL
| | - Rahul Shrivastav
- Department of Communication Sciences and Disorders, University of FloridaGainesville, FL
| |
Collapse
|
41
|
Abstract
Professional voice users comprise 25% to 35% of the U.S. working population. Their voice problems may interfere with job performance and impact costs for both employers and employees. The purpose of this study was to examine treatment outcomes of two specific rehabilitation programs for a group of professional voice users. Eighteen professional voice users participated in this study; half had complaints of throat pain or vocal fatigue (Dysphonia Group), and half were found to have benign vocal fold lesions (Lesion Group). One group received 5 weeks of expiratory muscle strength training followed by six sessions of traditional voice therapy. Treatment order was reversed for the second group. The study was designed as a repeated measures study with independent variables of treatment order, laryngeal diagnosis (lesion vs non-lesion), gender, and time. Dependent variables included maximum expiratory pressure (MEP), Voice Handicap Index (VHI) score, Vocal Rating Scale (VRS) score, Voice Effort Scale score, phonetogram measures, subglottal pressures, and acoustic and perceptual measures. Results showed significant improvements in MEP, VHI scores, and VRS scores, subglottal pressure for loud intensity, phonetogram area, and dynamic range. No significant difference was found between laryngeal diagnosis groups. A significant difference was not observed for treatment order. It was concluded that the combined treatment was responsible for the improvements observed. The results indicate that a combined modality treatment may be successful in the remediation of vocal problems for professional voice users.
Collapse
Affiliation(s)
- Judith M Wingate
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, Florida 32611-7420, USA.
| | | | | | | | | |
Collapse
|
42
|
|
43
|
Harnsberger JD, Shrivastav R, Brown WS, Rothman H, Hollien H. Speaking rate and fundamental frequency as speech cues to perceived age. J Voice 2006; 22:58-69. [PMID: 16968663 DOI: 10.1016/j.jvoice.2006.07.004] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2006] [Accepted: 06/16/2006] [Indexed: 11/22/2022]
Abstract
This study aimed to specify a set of acoustic cues fundamental to vocal aging and to establish their perceptual relevance, using acoustic analysis and perceptual testing. Three experiments were conducted to identify the perceptual correlates of the aging voice. The first experiment analyzed important voice parameters that signal a person's age for 16 older males and 14 younger males. In the second and third experiments, these acoustic patterns were systematically shifted through resynthesis to see if perceived age would be significantly influenced. In the second experiment, the older and younger male voices were resynthesized by manipulating speaking rate and fundamental frequency to shift the perceived age of the groups toward each other. In the third experiment, older and middle-aged male voices were resynthesized in a similar manner. In both perceptual studies, an age estimation task with naive listeners was used. The results of the first experiment showed that, in older speakers, sentence, word, and diphthong durations were all significantly longer and mean fundamental frequency was significantly higher than for the younger group. In the second experiment, only the manipulation of speaking rate resulted in a significant shift in perceived age, and it did so only for the older subjects. In the third experiment, a significant shift in age estimates was observed for the middle-aged, but not the older, voices when speaking rate was manipulated. The results of both perception tests suggest that speaking rate, but possibly not fundamental frequency, is a perceptually relevant cue to age in voice.
Collapse
Affiliation(s)
- James D Harnsberger
- Department of Communication Sciences and Disorders, University of Florida, Dauer Hall, Gainesville, Florida, USA.
| | | | | | | | | |
Collapse
|
44
|
Abstract
Perception of breathy voice quality appears to be cued by changes in the vowel spectrum. These changes are related to alterations in the intensity of aspiration noise and spectral slope of the harmonic energy [Shrivastav and Sapienza, J. Acoust. Soc. Am., 114 (4), 2217-2224 (2003)]. Ten young-adult listeners with normal hearing were tested using an adaptive listening task to determine the smallest change in signal-to-noise ratio that resulted in a change in breathiness. Six vowel continua, three female and three male, were generated using a Klatt synthesizer and served as stimuli. Results showed that listeners needed as much as 20-dB increase in aspiration noise to perceive a change in breathiness against a relatively normal voice. In contrast, listeners needed approximately an 11-dB increase in aspiration noise to discriminate breathiness against a severely breathy voice. The difference limens for breathiness were observed to vary across the six talkers. Voices having aspiration noise that was predominantly in the high frequencies had smaller difference limens. No significant differences for male and female voice were observed.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Department of Communication Science and Disorders, Dauer Hall, P.O. Box 117420, University of Florida, Gainesville, Florida 32611, USA.
| | | |
Collapse
|
45
|
Abstract
Experiments on disordered voice quality with multidimensional scaling (MDS) have resulted in solutions with low R-square and have failed to show consistent dimensions across different listeners. These findings have been suggested to indicate large individual differences in the perception of voice quality. However, these inconsistencies may originate from several factors, including random stimulus selection, instructions that encourage listeners to respond to global difference in pairs of voices, and noisy perceptual data. This experiment used MDS techniques to study individual differences in perception of breathiness. The voices in the experiment were selected to have a relatively wide variation in breathiness but only minimal variation in roughness, strain, and fundamental frequency. Additionally, listeners were instructed specifically to rate similarities in breathiness rather than judging global differences in voices, and several judgments from each listener were averaged to minimize noise in the data. It was hypothesized that these modifications would result in an MDS solution that accounted for greater variance in perceptual data than previously shown. Results show that averaging multiple responses from each listener increased the R-square from 45% to approximately 75%. The poor R-square and large individual differences in voice quality perception observed in past research may have partly resulted from the experimental procedures in previous studies. These findings suggest that individual differences in the perception of voice quality are not as large as previously thought, and a model of voice quality perception for an "average" listener may be a good representation for the general population.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, Florida 32611, USA.
| |
Collapse
|
46
|
Shrivastav R, Sapienza CM, Nandur V. Application of psychometric theory to the measurement of voice quality using rating scales. J Speech Lang Hear Res 2005; 48:323-35. [PMID: 15989395 DOI: 10.1044/1092-4388(2005/022)] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2003] [Revised: 03/02/2004] [Accepted: 08/05/2004] [Indexed: 05/03/2023]
Abstract
Rating scales are commonly used to study voice quality. However, recent research has demonstrated that perceptual measures of voice quality obtained using rating scales suffer from poor interjudge agreement and reliability, especially in the mid-range of the scale. These findings, along with those obtained using multidimensional scaling (MDS), have been interpreted to show that listeners perceive voice quality in an idiosyncratic manner. Based on psychometric theory, the present research explored an alternative explanation for the poor interlistener agreement observed in previous research. This approach suggests that poor agreement between listeners may result, in part, from measurement errors related to a variety of factors rather than true differences in the perception of voice quality. In this study, 10 listeners rated breathiness for 27 vowel stimuli using a 5-point rating scale. Each stimulus was presented to the listeners 10 times in random order. Interlistener agreement and reliability were calculated from these ratings. Agreement and reliability were observed to improve when multiple ratings of each stimulus from each listener were averaged and when standardized scores were used instead of absolute ratings. The probability of exact agreement was found to be approximately .9 when using averaged ratings and standardized scores. In contrast, the probability of exact agreement was only .4 when a single rating from each listener was used to measure agreement. These findings support the hypothesis that poor agreement reported in past research partly arises from errors in measurement rather than individual differences in the perception of voice quality.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Department of Communicative Science and Disorders, University of Florida, Gainesville 32611, USA.
| | | | | |
Collapse
|
47
|
Abstract
Despite much research, the relationship between vocal acoustic signals and perceived voice quality is not well understood. The present study used an auditory model proposed by Moore et al to study how changes in the acoustic spectrum may relate to changes in perceptual ratings of breathiness. Perceptual ratings of breathiness were obtained using a multidimensional scaling (MDS) design. The stimulus distances on the dominant MDS dimension were correlated with several commonly used acoustic measures for voice quality. These distances were also compared with measures obtained from the output of the auditory model. Results show that the partial loudness of the harmonic energy obtained with the aspiration noise acting as a masker was the most important predictor of perceptual ratings of breathiness. Results also demonstrate that measures obtained from the auditory spectrum were better predictors of perceptual ratings of breathiness than were commonly used acoustic spectral measures.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Communication Sciences and Disorders, University of Florida, Gainesville, Florida 32611, USA.
| |
Collapse
|
48
|
Abstract
While several acoustic measures have been proposed to quantify listener ratings of breathy voice quality, most have failed to give a consistent and high correlation with perceptual ratings of breathiness. One reason for these limitations is that most acoustic measures do not address the nonlinear processes that occur in the peripheral auditory system during the auditory perceptual process. It was hypothesized that modeling such nonlinear events during signal processing may provide objective parameters that better correspond to perceptual ratings of breathy voice quality. Ten listeners rated 27 voice stimuli using a five-point rating scale. Acoustic measures were determined from these stimuli and were selected based on their history of having a moderate to strong correlation to perceptual ratings of breathiness. The stimuli were also analyzed using an auditory model proposed by Moore, Glasberg, and Baer [J. Audio Eng. Soc. 45(4), 224-239 (1997)], and new measures were calculated from the output of this model. These measures included the partial loudness of the signal and the loudness of the aspiration noise. Measures obtained from the output of the auditory model were found to account for a high amount of variance in the perceptual ratings of breathiness.
Collapse
Affiliation(s)
- Rahul Shrivastav
- Department of Communication Sciences and Disorders, University of Florida, Gainesville, Florida 32611, USA.
| | | |
Collapse
|
49
|
Yamaguchi H, Shrivastav R, Andrews ML, Niimi S. A comparison of voice quality ratings made by Japanese and American listeners using the GRBAS scale. Folia Phoniatr Logop 2003; 55:147-57. [PMID: 12771466 DOI: 10.1159/000070726] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The GRBAS scale is a widely used method for perceptual evaluation of voice quality. Two linguistically diverse groups of listeners (Japanese and American) rated 35 voice samples using the GRBAS scale. The ratings obtained from the two groups were compared to determine if the different linguistic background affected the use of the GRBAS scale. Results show that there are no significant differences between the Japanese and American listeners in the use of the Grade, Roughness and Breathiness scales. Ratings on the Asthenia and Strain scales, however, were different between the two groups of listeners. Despite these discrepancies, the GRBAS scale may be an excellent tool for perceptual evaluation of voice quality by linguistically diverse groups.
Collapse
|
50
|
Abstract
Variation in duration and frequency during two readings of each of four sentences by 15 normal 9- and 10-year-old children were compared. Instructions to the children included overt cues designed specifically to elicit durational and frequency changes. Children demonstrated increased sentence variability in their voices when they were cued. Specific key words in the four sentences were also analyzed and results indicated that semantic content in addition to parameter-specific cognitive cues provided a significant effect. The male 9 and 10-year-old subjects showed less variability in "no cue" readings than the females, but showed a greater increase in voice change during the "cognitive cue" readings. Implications for theory and practice are discussed.
Collapse
|