Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Anand S, Kopf LM, Shrivastav R, Eddins DA. Objective Indices of Perceived Vocal Strain. J Voice 2019;33:838-845. [DOI: 10.1016/j.jvoice.2018.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Revised: 06/06/2018] [Accepted: 06/07/2018] [Indexed: 10/28/2022]

For:	Anand S, Kopf LM, Shrivastav R, Eddins DA. Objective Indices of Perceived Vocal Strain. J Voice 2019;33:838-845. [DOI: 10.1016/j.jvoice.2018.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Revised: 06/06/2018] [Accepted: 06/07/2018] [Indexed: 10/28/2022]

Number

Cited by Other Article(s)

Stone TC, Erickson ML. Experienced and Inexperienced Listeners' Perception of Vocal Strain. J Voice 2024:S0892-1997(24)00024-9. [PMID: 38443265 DOI: 10.1016/j.jvoice.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/01/2024] [Accepted: 02/02/2024] [Indexed: 03/07/2024]

Dragicevic DA, Dahl KL, Perkins Z, Abur D, Stepp CE. Effects of a Concurrent Working Memory Task on Speech Acoustics in Parkinson's Disease. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024;33:418-434. [PMID: 38081054 PMCID: PMC11001185 DOI: 10.1044/2023_ajslp-23-00214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 08/30/2023] [Accepted: 10/26/2023] [Indexed: 01/05/2024]

Sauder CL, Kapsner-Smith MR, Simmons E, Meyer T, Doyle PC, Eadie TL. The Effect of Rating Method on Reliability of Judgments of Strain Across Populations. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024;33:393-405. [PMID: 38060689 PMCID: PMC11000812 DOI: 10.1044/2023_ajslp-23-00174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/17/2023] [Accepted: 10/17/2023] [Indexed: 01/05/2024]

Abstract

PURPOSE

Variability in auditory-perceptual ratings of voice limits their utility, with the poorest reliability often noted for vocal strain. The purpose of this study was to determine whether an experimental method, called visual sort and rate (VSR), promoted stronger rater reliability than visual analog scale (VAS), for ratings of strain in two clinical populations: adductor laryngeal dystonia (ADLD) and vocal hyperfunction (VH).

METHOD

Connected speech samples from speakers with ADLD and VH as well as age- and sex-matched controls were selected from a database. Fifteen inexperienced listeners rated strain for two speaker sets (25 ADLD speakers and five controls; 25 VH speakers and five controls) across four rating blocks: VAS-ADLD, VSR-ADLD, VAS-VH, and VSR-VH. For the VAS task, listeners rated each speaker for strain using a vertically oriented 100-mm VAS. For the VSR task, stimuli were distributed into sets of samples with a range of severities in each set. Listeners sorted and ranked samples for strain within each set, and final ratings were captured on a vertically oriented 100-mm VAS. Intrarater reliability (Pearson's r) and interrater variability (mean of the squared differences between a listener's ratings and group mean ratings) were compared across rating methods and populations using two repeated-measures analyses of variance.

RESULTS

Intrarater reliability of strain was significantly stronger when listeners used VSR compared to VAS; listeners also showed significantly better intrarater reliability in ADLD than VH. Listeners demonstrated significantly less interrater variability (better reliability) when using VSR compared to VAS. No significant effect of population or interactions was found between listeners for measures of interrater variability.

CONCLUSIONS

VSR increases intrarater reliability for ratings of vocal strain in speakers with VH and ADLD. VSR decreases variability of auditory-perceptual judgments of strain between inexperienced listeners in these clinical populations. Future research should determine whether benefits of VSR extend to voice clinicians and/or clinical settings.

Collapse

Cacace AT, Berri B. Blast Overpressures as a Military and Occupational Health Concern. Am J Audiol 2023;32:779-792. [PMID: 37713532 DOI: 10.1044/2023_aja-23-00125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/17/2023] Open

Fujiki RB, Thibeault SL. Are Children with Cleft Palate at Increased Risk for Laryngeal Pathology? Cleft Palate Craniofac J 2023;60:1385-1394. [PMID: 35912443 DOI: 10.1177/10556656221104027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Park Y, Baker Brehm S, Kelchner L, Weinrich B, McElfresh K, Anand S, Shrivastav R, de Alarcon A, Eddins DA. Effects of Vibratory Source on Auditory-Perceptual and Bio-Inspired Computational Measures of Pediatric Voice Quality. J Voice 2023:S0892-1997(23)00254-0. [PMID: 37739862 PMCID: PMC10950844 DOI: 10.1016/j.jvoice.2023.08.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 08/11/2023] [Accepted: 08/14/2023] [Indexed: 09/24/2023]

Nguyen DD, Madill C. Auditory-perceptual Parameters as Predictors of Voice Acoustic Measures. J Voice 2023:S0892-1997(23)00088-7. [PMID: 37003863 DOI: 10.1016/j.jvoice.2023.02.030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 02/23/2023] [Accepted: 02/23/2023] [Indexed: 04/03/2023]

Abstract

BACKGROUND

Much research has examined the relationship between perceptual and acoustic measures. However, little is known about the prediction values of perceptual measures on an acoustic parameter.

AIMS

This study utilized simulated and disordered voice samples to investigate the prediction values of breathiness, roughness, and strain ratings on the selection of some time-based and spectral-based measures of voice quality.

METHOD

This study retrospectively analysed two sets of precollected data. The experimental data had been collected from nine trained speakers manipulating false vocal fold activity, true vocal fold mass, and larynx height. The voice-disordered data had been extracted from a clinical database for 68 patients with muscle tension voice disorders (MTVD). Both data sets had been perceptually rated for breathiness, roughness, and strain. Voice samples (prolonged vowel /ɑ/ and Rainbow Passage readings) had undergone acoustic analysis using Praat for harmonics-to-noise ratio (HNR) and the program "Analysis of Dysphonia in Speech and Voice" (ADSV) for cepstral peak prominence (CPP), Cepstral/Spectral Index of Dysphonia (CSID), and Low/High spectral ratio (L/H ratio). Perceptual parameters were regressed against these acoustic measures to test their prediction values.

RESULTS

Reliability data showed satisfactory intra- and inter-reliability of perceptual ratings for both data sets. Breathiness significantly predicted CPP (both vocal tasks) and CSID (Rainbow Passage) in experimental data and predicted all the acoustic measures in MTVD data. Roughness significantly predicted HNR, CPP, and CSID in experimental data, and CPP (Rainbow Passage) and CSID (both vocal tasks) in MTVD data. Strain (both vocal tasks) significantly predicted L/H ratio in both data sets.

CONCLUSIONS

Breathiness ratings predicted selection of HNR, CPP and CSID; roughness ratings predicted selection of CPP and CSID, and strain ratings predicted L/H ratio.

Collapse

Anand S. Perceptual and Computational Estimates of Vocal Breathiness and Roughness in Sustained Phonation and Connected Speech. J Voice 2023:S0892-1997(23)00069-3. [PMID: 36933971 DOI: 10.1016/j.jvoice.2023.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 02/10/2023] [Accepted: 02/13/2023] [Indexed: 03/18/2023]

Abstract

OBJECTIVE

Clinical assessment of voice quality (VQ) often uses a combination of sustained phonations and more prolonged and more complex vocalizations. The purpose of this study was to compare the perceived vocal breathiness and vocal roughness of sustained phonations and connected speech over a wide range of dysphonia severity and to evaluate their relationship with acoustic measures and bioinspired models of breathiness and roughness.

METHODS

VQ dimension-specific single-variable matching task (SVMT) was used to index the perceived breathiness or roughness of five male and five female talkers on the basis of a sustained /a/ phonation and the 5th CAPE-V sentence. Acoustic measures of cepstral peak, autocorrelation peak and psychoacoustic measures of pitch strength, and temporal envelope standard deviation (EnvSD) was used to predict perceived breathiness and roughness judgments obtained from 10 listeners, respectively.

RESULTS

High intra- and inter-listener reliability was observed for sustained phonations and connected speech. Perceived breathiness and roughness of sustained vowels and sentences obtained using SVMT were highly correlated for most dysphonic voices. The pitch strength model of breathiness was able to capture larger amount of perceptual variance compared to cepstral peak in both vowels and sentences. Autocorrelation peak was strongly correlated to perceived roughness in sentences while EnvSD was strongly correlated to perceived roughness in vowels.

CONCLUSIONS

Results provide evidence that perception of VQ via SVMT can be successfully extended to connected speech. Computational models of VQ can be easily adapted to connected speech. Such automated models of VQ perception are valuable due to their computational efficiency and their ability to accurately capture the non-linearities of the human auditory system.

Collapse

Maffei MF, Green JR, Murton O, Yunusova Y, Rowe HP, Wehbe F, Diana K, Nicholson K, Berry JD, Connaghan KP. Acoustic Measures of Dysphonia in Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023;66:872-887. [PMID: 36802910 PMCID: PMC10205101 DOI: 10.1044/2022_jslhr-22-00363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 10/25/2022] [Accepted: 12/01/2022] [Indexed: 05/25/2023]

Abstract

PURPOSE

Identifying efficacious measures to characterize dysphonia in complex neurodegenerative diseases is key to optimal assessment and intervention. This study evaluates the validity and sensitivity of acoustic features of phonatory disruption in amyotrophic lateral sclerosis (ALS).

METHOD

Forty-nine individuals with ALS (40-79 years old) were audio-recorded while producing a sustained vowel and continuous speech. Perturbation/noise-based (jitter, shimmer, and harmonics-to-noise ratio) and cepstral/spectral (cepstral peak prominence, low-high spectral ratio, and related features) acoustic measures were extracted. The criterion validity of each measure was assessed using correlations with perceptual voice ratings provided by three speech-language pathologists. Diagnostic accuracy of the acoustic features was evaluated using area-under-the-curve analysis.

RESULTS

Perturbation/noise-based and cepstral/spectral features extracted from /a/ were significantly correlated with listener ratings of roughness, breathiness, strain, and overall dysphonia. Fewer and smaller correlations between cepstral/spectral measures and perceptual ratings were observed for the continuous speech task, although post hoc analyses revealed stronger correlations in speakers with less perceptually impaired speech. Area-under-the-curve analyses revealed that multiple acoustic features, particularly from the sustained vowel task, adequately differentiated between individuals with ALS with and without perceptually dysphonic voices.

CONCLUSIONS

Our findings support using both perturbation/noise-based and cepstral/spectral measures of sustained /a/ to assess phonatory quality in ALS. Results from the continuous speech task suggest that multisubsystem involvement impacts cepstral/spectral analyses in complex motor speech disorders such as ALS. Further investigation of the validity and sensitivity of cepstral/spectral measures during continuous speech in ALS is warranted.

Collapse

Park Y, Anand S, Gifford SM, Shrivastav R, Eddins DA. Development and Validation of a Single-Variable Comparison Stimulus for Matching Strained Voice Quality Using a Psychoacoustic Framework. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023;66:16-29. [PMID: 36516473 PMCID: PMC10023177 DOI: 10.1044/2022_jslhr-22-00280] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/17/2022] [Accepted: 09/01/2022] [Indexed: 06/17/2023]

Kopf LM, Huh-Yoo J. A User-Centered Design Approach to Developing a Voice Monitoring System for Disorder Prevention. J Voice 2023;37:48-59. [PMID: 33189486 DOI: 10.1016/j.jvoice.2020.10.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 10/22/2020] [Accepted: 10/23/2020] [Indexed: 01/11/2023]

Nagle KF. Clinical Use of the CAPE-V Scales: Agreement, Reliability and Notes on Voice Quality. J Voice 2022:S0892-1997(22)00366-6. [PMID: 36543606 DOI: 10.1016/j.jvoice.2022.11.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 11/09/2022] [Accepted: 11/10/2022] [Indexed: 12/24/2022]

Hidaka S, Lee Y, Nakanishi M, Wakamiya K, Nakagawa T, Kaburagi T. Automatic GRBAS Scoring of Pathological Voices using Deep Learning and a Small Set of Labeled Voice Data. J Voice 2022:S0892-1997(22)00347-2. [PMID: 36437171 DOI: 10.1016/j.jvoice.2022.10.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 10/27/2022] [Accepted: 10/27/2022] [Indexed: 11/26/2022]

Abstract

OBJECTIVES

Auditory-perceptual evaluation frameworks, such as the grade-roughness-breathiness-asthenia-strain (GRBAS) scale, are the gold standard for the quantitative evaluation of pathological voice quality. However, the evaluation is subjective; thus, the ratings lack reproducibility due to inter- and intra-rater variation. Prior researchers have proposed deep-learning-based automatic GRBAS score estimation to address this problem. However, these methods require large amounts of labeled voice data. Therefore, this study investigates the potential of automatic GRBAS estimation using deep learning with smaller amounts of data.

METHODS

A dataset consisting of 300 pathological sustained /a/ vowel samples was created and rated by eight experts (200 for training, 50 for validation, and 50 for testing). A neural network model that predicts the probability distribution of GRBAS scores from an onset-to-offset waveform was proposed. Random speed perturbation, random crop, and frequency masking were investigated as data augmentation techniques, and power, instantaneous frequency, and group delay were investigated as time-frequency representations.

RESULTS

Five-fold cross-validation was conducted, and the automatic scoring performance was evaluated using the quadratic weighted Cohen's kappa. The results showed that the kappa values of the automatic scoring performance were comparable to those of the inter-rater reliability of experts for all GRBAS items and the intra-rater reliability of experts for items G, B, A, and S. Random speed perturbation was the most effective data augmentation technique overall. When data augmentation was applied, power was the most effective for items G, R, A, and S; for Item B, combining group delay and power yielded additional performance gains.

CONCLUSION

The automatic GRBAS scoring achieved by the proposed model using scant labeled data was comparable to that of experts. This suggests that the challenges resulting from insufficient data can be alleviated. The findings of this study can also contribute to performance improvements in other tasks such as automatic voice disorder detection.

Collapse

de Abreu SR, Sousa ESDS, de Moraes RM, Lopes LW. Performance of Acoustic Measures for the Discrimination Among Healthy, Rough, Breathy, and Strained Voices Using the Feedforward Neural Network. J Voice 2022:S0892-1997(22)00203-X. [PMID: 36028370 DOI: 10.1016/j.jvoice.2022.07.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 07/03/2022] [Accepted: 07/05/2022] [Indexed: 10/15/2022]

Park Y, Anand S, Ozmeral EJ, Shrivastav R, Eddins DA. Predicting Perceived Vocal Roughness Using a Bio-Inspired Computational Model of Auditory Temporal Envelope Processing. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:2748-2758. [PMID: 35867607 PMCID: PMC9911094 DOI: 10.1044/2022_jslhr-22-00101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 04/14/2022] [Accepted: 04/25/2022] [Indexed: 06/15/2023]

Fujiki RB, Huber JE, Sivasankar MP. The effects of vocal exertion on lung volume measurements and acoustics in speakers reporting high and low vocal fatigue. PLoS One 2022;17:e0268324. [PMID: 35551535 PMCID: PMC9098027 DOI: 10.1371/journal.pone.0268324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 04/26/2022] [Indexed: 12/02/2022] Open

Abstract

Purpose

Vocal exertion is common and often results in reduced respiratory and laryngeal efficiency. It is unknown, however, whether the respiratory kinematic and acoustic adjustments employed during vocal exertion differ between speakers reporting vocal fatigue and those who do not. This study compared respiratory kinematics and acoustic measures in individuals reporting low and high levels of vocal fatigue during a vocal exertion task.

Methods

Individuals reporting low (N = 20) and high (N = 10) vocal fatigue participated in a repeated measures design study over 2 days. On each day, participants completed a 10-minute vocal exertion task consisting of repeated, loud vowel productions at elevated F0 sustained for maximum phonation time. Respiratory kinematic and acoustic measures were analyzed on the 1^st vowel production (T0), and the vowels produced 2 minutes (T2), 5 minutes (T5), 7 minutes (T7), and 10 minutes (T10) into the vocal exertion task. Vowel durations were also measured at each time point.

Results

No differences in respiratory kinematics were observed between low and high vocal fatigue groups at T0. As the vocal exertion task progressed (T2-T10), individuals reporting high vocal fatigue initiated phonation at lower lung volumes while individuals with low vocal fatigue initiated phonation at higher lung volumes. As the exertion task progressed, total lung volume excursion decreased in both groups. Differences in acoustic measures were observed, as individuals reporting high vocal fatigue produced softer, shorter vowels from T0 through T10.

Conclusions

Individuals reporting high vocal fatigue employed less efficient respiratory strategies during periods of increased vocal demand when compared with individuals reporting low vocal fatigue. Individuals reporting high vocal fatigue had shorter maximum phonation time on loud vowels. Further study should examine the potential screening value of loud maximum phonation time, as well as the clinical implications of the observed respiratory patterns for managing vocal fatigue.

Collapse

Kapsner-Smith MR, Díaz-Cádiz ME, Vojtech JM, Buckley DP, Mehta DD, Hillman RE, Tracy LF, Noordzij JP, Eadie TL, Stepp CE. Clinical Cutoff Scores for Acoustic Indices of Vocal Hyperfunction That Combine Relative Fundamental Frequency and Cepstral Peak Prominence. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:1349-1369. [PMID: 35263546 PMCID: PMC9499364 DOI: 10.1044/2021_jslhr-21-00466] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]

Abur D, Perkell JS, Stepp CE. Impact of Vocal Effort on Respiratory and Articulatory Kinematics. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:5-21. [PMID: 34843405 PMCID: PMC9150749 DOI: 10.1044/2021_jslhr-21-00323] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 07/27/2021] [Accepted: 08/24/2021] [Indexed: 06/13/2023]

Kapsner-Smith MR, Opuszynski A, Stepp CE, Eadie TL. The Effect of Visual Sort and Rate Versus Visual Analog Scales on the Reliability of Judgments of Dysphonia. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021;64:1571-1580. [PMID: 33909472 PMCID: PMC8608224 DOI: 10.1044/2021_jslhr-20-00623] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Abstract

Purpose The reliability of auditory-perceptual judgments between listeners is a long-standing problem in the assessment of voice disorders. The purpose of this study was to determine whether a relatively novel experimental scaling method, called visual sort and rate (VSR), yielded stronger reliability than the more frequently used method of visual analog scales (VAS) for ratings of overall severity (OS) and breathiness (BR) in speakers with voicedisorders. Method Fifty speech samples were selected from a database of speakers with voice disorders. Twenty-two inexperienced listeners provided ratings of OS or BR in four rating blocks: VSR-OS, VSR-BR, VAS-OS, and VSR-BR. For the VAS task, listeners rated each speaker for BR or OS using a vertically oriented 100-mm VAS. For the VSR task, stimuli were distributed into sets of samples with a range of speaker severities in each set. Listeners sorted and ranked samples for OS or BR within each set, and final ratings were captured on a vertically oriented 100-mm VAS. Interrater variability, defined as the mean of the squared differences between a listener's ratings and group mean ratings, and intrarater reliability (Pearson r) were compared across rating tasks for OS and BR using paired t tests. Results Results showed that listeners had significantly less interrater variability (better reliability) when using VSR methods compared to VAS for judgments of both OS and BR. Intrarater reliability was high across rating tasks and dimensions; however, ratings of BR were significantly more consistent within individual listeners when using VAS than when using VSR. Conclusions VSR is an experimental method that decreases variability of auditory-perceptual judgments between inexperienced listeners when rating speakers with a range of dysphonic severities and disorders. Future research should determine whether a clinically viable tool may be developed based on VSR principles and whether such benefits extend to experienced listeners.

Collapse

Fujiki RB, Thibeault SL. The Relationship Between Auditory-Perceptual Rating Scales and Objective Voice Measures in Children With Voice Disorders. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021;30:228-238. [PMID: 33439742 DOI: 10.1044/2020_ajslp-20-00188] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Abstract

Purpose The purpose of this study was to determine concurrent validity of the Grade, Roughness, Breathiness, Asthenia, and Strain (GRBAS) and Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) auditory-perceptual scales in children with voice disorders. A secondary purpose was to determine correlation between the GRBAS, CAPE-V, and objective voice measures. Method GRBAS and CAPE-V ratings and acoustic and aerodynamic measures were collected from the University of Wisconsin-Madison Voice and Swallow Outcomes Database. Correlations between CAPE-V and GRBAS ratings were calculated for overall severity of dysphonia, roughness, breathiness, and strain. Correlations between auditory-perceptual voice ratings and objective voice measures were also examined. Results One hundred thirty GRBAS and CAPE-V auditory-perceptual ratings were significantly correlated for overall severity, roughness, breathiness, and strain. r ² values were highest for overall severity of dysphonia (r ² = .75) and lowest for strain (r ² = .54). CAPE-V and GRBAS ratings were largely associated with similar acoustic and aerodynamic measures. The highest correlations were observed for auditory-perceptual ratings of breathiness and jitter% (CAPE-V r ² = .44, GRBAS r ² = .44), shimmer% (CAPE-V r ² = .45, GRBAS r ² = .45), noise-to-harmonic ratio (CAPE-V r ² = .42, GRBAS r ² = .40), fundamental frequency (CAPE-V r ² = .47, GRBAS r ² = .44), and maximum phonation time (CAPE-V r ² = .56, GRBAS r ² = .51). Akaike information criterion values indicated that CAPE-V ratings were more strongly correlated with objective voice measures than GRBAS ratings. Conclusions CAPE-V and GRBAS scales have concurrent validity in children with voice disorders. CAPE-V ratings are more strongly correlated with acoustic and aerodynamic voice measures.

Collapse

Park Y, Cádiz MD, Nagle KF, Stepp CE. Perceptual and Acoustic Assessment of Strain Using Synthetically Modified Voice Samples. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020;63:3897-3908. [PMID: 33151770 PMCID: PMC8608200 DOI: 10.1044/2020_jslhr-20-00294] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 07/23/2020] [Accepted: 08/17/2020] [Indexed: 06/11/2023]

Barsties V Latoszek B, Kim GH, Delgado Hernández J, Hosokawa K, Englert M, Neumann K, Hetjens S. The validity of the Acoustic Breathiness Index in the evaluation of breathy voice quality: A Meta-Analysis. Clin Otolaryngol 2020;46:31-40. [PMID: 32770718 DOI: 10.1111/coa.13629] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 07/03/2020] [Accepted: 07/31/2020] [Indexed: 02/01/2023]

Abstract

BACKGROUND

The evaluation of voice quality with acoustic measurements is useful to objectify the diagnostic process. Particularly, breathiness was highly evaluated and the Acoustic Breathiness Index (ABI) might have promising features.

OBJECTIVE OF REVIEW

The goal of the present meta-analysis is to quantify, from existing cross-validation studies, the evidence for the diagnostic accuracy of ABI, including its sensitivity and specificity.

TYPE OF REVIEW

Meta-analysis.

SEARCH STRATEGY

We searched in MEDLINE, Google Scholar and Science Citation Index, and as manual search for the term Acoustic Breathiness Index from inception to February 2020. Studies were included that used equal proportion of continuous speech and sustained vowel segments, a recording hardware with a sufficient standard for voice signal analyses, the software Praat for signal processing and the customised Praat script, and two groups of subjects (vocally healthy and voice-disordered). Furthermore, the diagnostic accuracy of ABI was measured.

EVALUATION METHOD

The primary outcome variable was ABI. The score ranged from 0 to 10 with varying thresholds according to different languages to determine the absence or presence of breathiness. A meta-analysis was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-analyses of diagnostic test accuracy study guidelines. Data were extracted, and the risk of bias was assessed using the QUADAS-2 tool. The pooled sensitivity and specificity of ABI were determined using a summary receiver operating characteristic (SROC) approach to calculate also a weighted threshold value of ABI with its sensitivity and specificity.

RESULTS

A total of 34 unique citations were screened, and 10 full-text articles were reviewed, including six studies. In total, 3603 voice samples were considered for further analysis separating into 467 vocally healthy and 3136 voice-disordered voice samples. The pooled sensitivity was 0.84 (95% CI, 0.83-0.85), and the pooled specificity was 0.92 (95% CI, 0.89-0.94). The area under the curve of the SROC curve of this analysis showed an excellent value of 0.94. The weighted ABI threshold was determined at 3.40 (sensitivity: 0.86, 95% CI, 0.84-0.87.; specificity: 0.90, 95% CI 0.88-0.92).

CONCLUSIONS

The results confirm the ABI as robust and valid objective measure for evaluating breathiness.

Collapse

Kochilas HL, Cacace AT, Arnold A, Seidman MD, Tarver WB. Vagus nerve stimulation paired with tones for tinnitus suppression: Effects on voice and hearing. Laryngoscope Investig Otolaryngol 2020;5:286-296. [PMID: 32337360 PMCID: PMC7178458 DOI: 10.1002/lio2.364] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 01/23/2020] [Accepted: 02/08/2020] [Indexed: 12/16/2022] Open

Abstract

OBJECTIVE

In individuals with chronic tinnitus, our interest was to determine whether daily low-level electrical stimulation of the vagus nerve paired with tones (paired-VNSt) for tinnitus suppression had any adverse effects on motor-speech production and physiological acoustics of sustained vowels. Similarly, we were also interested in evaluating for changes in pure-tone thresholds, word-recognition performance, and minimum-masking levels. Both voice and hearing functions were measured repeatedly over a period of 1 year.

STUDY DESIGN

Longitudinal with repeated-measures.

METHODS

Digitized samples of sustained frontal, midline, and back vowels (/e/, /o/, /ah/) were analyzed with computer software to quantify the degree of jitter, shimmer, and harmonic-to-noise ratio contained in these waveforms. Pure-tone thresholds, monosyllabic word-recognition performance, and MMLs were also evaluated for VNS alterations. Linear-regression analysis was the benchmark statistic used to document change over time in voice and hearing status from a baseline condition.

RESULTS

Most of the regression functions for the vocal samples and audiometric variables had slope values that were not significantly different from zero. Four of the nine vocal functions showed a significant improvement over time, whereas three of the pure tone regression functions at 2-4 kHz showed some degree of decline; all changes observed were for the left ear, all were at adjacent frequencies, and all were ipsilateral to the side of VNS. However, mean pure-tone threshold changes did not exceed 4.29 dB from baseline and therefore, would not be considered clinically significant. In some individuals, larger threshold shifts were observed. No significant regression/slope effects were observed for word-recognition or MMLs.

CONCLUSION

Quantitative voice analysis and assessment of audiometric variables showed minimal if any evidence of adverse effects using paired-VNSt over a treatment period of 1 year. Therefore, we conclude that paired-VNSt is a safe tool for tinnitus abatement in humans without significant side effects.

LEVEL OF EVIDENCE

Level IV.

Collapse

Vojtech JM, Segina RK, Buckley DP, Kolin KR, Tardif MC, Noordzij JP, Stepp CE. Refining algorithmic estimation of relative fundamental frequency: Accounting for sample characteristics and fundamental frequency estimation method. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;146:3184. [PMID: 31795681 PMCID: PMC6847943 DOI: 10.1121/1.5131025] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Revised: 10/07/2019] [Accepted: 10/08/2019] [Indexed: 05/26/2023]