1
|
Gama ACC, da Mata SM, Martins Dos Santos PC, Vieira MN, Sansão JPH, Quinino RDC. Auditory Training With Synthesized Voice Anchors: Effects on Rater Agreement. J Voice 2024; 38:366-375. [PMID: 34702611 DOI: 10.1016/j.jvoice.2021.09.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 09/15/2021] [Accepted: 09/16/2021] [Indexed: 11/23/2022]
Abstract
PURPOSE To investigate the effects of auditory training with synthesized voices on intra- and inter-rater agreement of the auditory-perceptual voice analysis of roughness and breathiness. METHODS This was an experimental study consisting of four auditory training sessions. The sample consisted of twenty raters, students from a Speech-Language Pathology course, who had previous experience with auditory-perceptual assessment. The raters participated in the four training sessions with a seven-day break in between sessions. Each training consisted of three tasks: 1) Pre-training activity: Participants were asked to rate 20 natural voices, normal and dysphonic, from zero to three, according to the parameters of roughness and breathiness; 2) Training activity: Synthesized voice anchor stimuli were presented, and participants were asked to rate them from zero to three. Four stimuli were related to roughness and four to breathiness. Participants heard 20 voice stimuli and were instructed to pair the natural voice with the synthesized anchor stimulus that most resembled it; and 3) Post-training activity: the 20 voices from the pre-training activity were randomized and participants rated the same voices, without prior knowledge that these were repeated. Statistical analysis of data was performed using the AC2 test, to assess the extent of agreement between raters, and the Friedman test to compare the training sessions. A 5% significance level was considered. RESULTS For the auditory-perceptual voice analysis of roughness, intra-rater agreement results ranged from 79% to 86% between the first and fourth auditory training session, with improvement in intra-rater agreement from the fourth session forward (P = 0.005). For the analysis of breathiness, results ranged from 88% to 92% between the first and fourth auditory training sessions, with improvement from the fourth session forward (P = 0.036). Inter-rater agreement results for the auditory-perceptual analysis of roughness ranged from 23% to 34%, and from 48% to 61% for breathiness, with no differences regarding training (P = 0.855). CONCLUSION The auditory-perceptual breathiness parameter had a higher AC2 indicator compared to the roughness parameter, suggesting better agreement. The intra-rater agreement showed improvement starting from the fourth auditory training session for the assessment of roughness and breathiness. The auditory training program did not show a positive inter-rater agreement impact.
Collapse
Affiliation(s)
- Ana Cristina Côrtes Gama
- Department of Speech-language Pathology, Federal University of Minas Gerais. Belo Horizonte, Minas Gerais, Brazil; Researcher of Productivity at National Council of Scientific Research, Brazil.
| | - Sabrina Martins da Mata
- Speech-language Pathology, Department of Speech-language Pathology, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Priscila Campos Martins Dos Santos
- Speech-language Pathology, Post-graduate program (MSc) in speech therapy Sciences; Department of Speech-language Pathology, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Maurílio Nunes Vieira
- Department of Electronic Engineering, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - João Pedro Hallack Sansão
- Department of Technology in Civil Engineering, Computing, Automation, Telematics and Humanities, Federal University of São João Del Rei - UFSJ - Ouro Branco, Minas Gerais, Brazil
| | - Roberto da Costa Quinino
- Department of Statistics of the Institute of Exact Sciences, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| |
Collapse
|
2
|
Benoy JJ, Jayakumar T. Effect of Anchor Voices and Listener Expertise on Auditory-Perceptual Judgments of Voice Quality Using the GRBAS Scale. J Voice 2024:S0892-1997(23)00397-1. [PMID: 38199908 DOI: 10.1016/j.jvoice.2023.12.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/10/2023] [Accepted: 12/11/2023] [Indexed: 01/12/2024]
Abstract
OBJECTIVES This study aimed to determine the effect of anchor voices and listener expertise on auditory-perceptual judgment of voice quality using the GRBAS scale. METHODS This study utilized a modified crossover design with counterbalancing. Anchor voices for each parameter of the GRBAS scale were chosen based on expert consensus. A total of 28 participants were divided into three groups based on their expertise. The first and second groups consisted of nine undergraduate (UG) and nine postgraduate (PG) students of speech-language pathology. The third group consisted of 10 practicing speech-language pathologists (SLPs). These participants carried out auditory-perceptual judgment of 60 dysphonic voice samples under two counterbalanced experimental conditions (with and without anchor voices). Each of the three groups was randomly divided into two subgroups to balance the experimental conditions. Interrater reliability for each subgroup was calculated using Krippendorff's α and 95% confidence intervals. RESULTS For all the groups involved in the study, interrater reliability was higher when anchor voices aided perceptual judgment for most parameters of the GRBAS scale. For the different parameters of GRBAS, interrater reliability for the UG group varied from fair (20 < α ≤ 40) to moderate (40 < α ≤ 60). In contrast, it was fair (20 < α ≤ 40) to substantial (60 < α ≤ 80) for the PG group and moderate (40 < α ≤ 60) to substantial (60 < α ≤ 80) for the SLP group. Variations in reliability were the least for the SLP group compared to the UG and PG groups. However, there were overlaps in interrater reliability between the groups, as revealed by the 95% confidence intervals. CONCLUSIONS Anchor voices help improve the auditory-perceptual judgment of voice quality, especially interrater reliability. Listener expertise is also shown to influence the interrater reliability of auditory-perceptual judgment of voice quality.
Collapse
Affiliation(s)
- Jesnu Jose Benoy
- Department of Speech-Language Sciences, All India Institute of Speech and Hearing, Mysuru, Karnataka, India
| | - Thirunavukkarasu Jayakumar
- Department of Speech-Language Sciences, All India Institute of Speech and Hearing, Mysuru, Karnataka, India.
| |
Collapse
|
3
|
Kankare E, Laukkanen AM. Validation of the Acoustic Breathiness Index in Speakers of Finnish Language. J Clin Med 2023; 12:7607. [PMID: 38137676 PMCID: PMC10743974 DOI: 10.3390/jcm12247607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 11/27/2023] [Accepted: 12/07/2023] [Indexed: 12/24/2023] Open
Abstract
Breathiness (perception of turbulence noise in the voice) is one of the major components of hoarseness in dysphonic voices. This study aims to validate a multiparameter analysis tool, the Acoustic Breathiness Index (ABI), for quantification of breathiness in the speaking voice, including both sustained vowels and continuous speech. One hundred and eight speakers with dysphonia (28 M, 80 F, mean age 50, SD 15.4 years) and 87 non-dysphonic controls (18 M, 69 F, mean age 42, SD 14 years) volunteered as participants. They read a standard text and sustained vowel /a:/. Acoustic recordings were made using a head-mounted microphone. Acoustic samples were evaluated perceptually by nine voice experts of different backgrounds (speech therapists, vocologists and laryngologists). Breathiness (B) from the GRBAS scale was rated. Headphones were used in the perceptual analysis. The dysphonic and non-dysphonic speakers differed significantly from each other in the auditory perceptual evaluation of breathiness. A significant difference was also found for ABI, which had a mean value of 2.26 (SD 1.15) for non-dysphonic and 3.07 (SD 1.75) for dysphonic speakers. ABI correlated strongly with B (rs = 0.823, p = 0.01). ABI's power to distinguish the groups was high (88.6%). The highest sensitivity and specificity of ABI (80%) was obtained at threshold value 2.68. ABI is a valid tool for differentiating breathiness in non-dysphonic and dysphonic speakers of Finnish.
Collapse
Affiliation(s)
- Elina Kankare
- Department of Rehabilitation and Psychosocial Support, Logopedics, Phoniatrics, Tampere University Hospital, 33520 Tampere, Finland
| | - Anne-Maria Laukkanen
- Speech and Voice Research Laboratory, Tampere University, 33100 Tampere, Finland;
| |
Collapse
|
4
|
Labaere A, De Bodt M, Van Nuffelen G. Construction of an Anchor and Training Sample Set for Auditory-Perceptual Voice Evaluation With the GRBAS-scale. J Voice 2023:S0892-1997(23)00307-7. [PMID: 37981533 DOI: 10.1016/j.jvoice.2023.09.033] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 09/28/2023] [Accepted: 09/29/2023] [Indexed: 11/21/2023]
Abstract
OBJECTIVE The aim of this study is to construct a representative and comprehensive set of reliable anchor and training voice samples for all parameters and grades of the GRBAS-scale, in order to develop a standardized online training tool for perceptual voice evaluation. METHODS 140 voice samples varying from normal to severely dysphonic, consisting of the vowel /a/ and a Dutch standard text, were recorded. Samples were duplicated and added to a data folder in random order. A panel of five highly experienced judges rated GRBAS parameters for each voice sample. Intra- and interrater reliability were determined using intraclass correlation coefficient (ICC) and percentage agreement. Samples with agreement percentages of >0.90 for at least one parameter were qualified as anchor voices, whereas samples with a percentage agreement between 0.70 and 0.80 were considered valid training samples. RESULTS Intrarater reliability was moderate to good for all raters and parameters. Interrater reliability was good for parameter G and moderate for R, B, A and S. 83 samples were found to be valid anchor samples with 90-100% expert agreement for at least one parameter, covering all of the GRBAS parameter/score combinations. An additional 37 samples fulfilled the criteria for adoption in the online tool as a training sample, showing 70-80% agreement for one or more parameters. CONCLUSION Based on well-defined and stringent selection criteria, this study identifies a substantial number of valid anchor and training voices that qualify for implementation in a standardized multi-level training tool for the auditory-perceptual evaluation of voice by means of the GRBAS scale.
Collapse
Affiliation(s)
- Annelies Labaere
- Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium; Department of Speech Pathology and audiology, Thomas More University of Applied Sciences, Antwerp, Belgium.
| | - Marc De Bodt
- Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium; Department of Rehabilitation Sciences, Faculty of Medicine and health sciences, Ghent University, Ghent, Belgium
| | - Gwen Van Nuffelen
- Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium; Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital, Antwerp, Belgium; Department of Rehabilitation Sciences, Faculty of Medicine and health sciences, Ghent University, Ghent, Belgium
| |
Collapse
|
5
|
Stipancic KL, Golzy M, Zhao Y, Pinkerton L, Rohl A, Kuruvilla-Dugdale M. Improving Perceptual Speech Ratings: The Effects of Auditory Training on Judgments of Dysarthric Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:4236-4258. [PMID: 37774738 PMCID: PMC10715846 DOI: 10.1044/2023_jslhr-23-00322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/24/2023] [Accepted: 07/30/2023] [Indexed: 10/01/2023]
Abstract
PURPOSE Auditory training has been shown to reduce rater variability in perceptual voice assessment. Because rater variability is also a central issue in the auditory-perceptual assessment of dysarthria, this study sought to determine if training produces a meaningful change in rater reliability, criterion validity, and scaling magnitude of four features: overall speech impairment, articulatory imprecision, monotony, and slow rate. METHOD Forty-four nonexperts randomized to training and nontraining listener groups completed a pretest and posttest. Only the former group underwent auditory training between pre- and posttests. For both testing and training, listeners rated samples from speakers with amyotrophic lateral sclerosis (ALS), speakers with Parkinson's disease (PD), and neurologically healthy control speakers using separate visual analog scales (VASs) for each of the four features. Intraclass correlation coefficients were used to compare inter- and intrarater reliability between pre- and posttest for both listener groups. For criterion validity, severity ratings from the two nonexpert listener groups were compared to those of two experienced listeners for all four features. To determine changes in scaling magnitude, raw VAS scores for each feature were compared from pre- to posttest within the two nonexpert listener groups. Scaling changes were also compared between the two listener groups for the pre- and posttest conditions. RESULTS AND CONCLUSIONS In the training group, a meaningful improvement in interrater reliability was observed for some features in all three speaker groups, but not in the nontraining group. In contrast, for intrarater reliability, in the nontraining group, a meaningful improvement was observed for many features in all three speaker groups, but only for PD monotony and slow rate in the training group. All ratings from the nonexpert listeners were valid except for monotony. Raw VAS scores did not meaningfully change from pre- to posttest for any of the features, but there was a trend toward lower scores posttraining, mainly for the ALS samples. Modifications to the auditory training paradigm to further improve reliability and validity, along with future goals for optimizing training, are discussed.
Collapse
Affiliation(s)
- Kaila L. Stipancic
- Department of Communicative Disorders and Sciences, University at Buffalo, NY
| | - Mojgan Golzy
- Department of Health Management and Informatics, University of Missouri, Columbia
| | - Yunxin Zhao
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia
| | - Louise Pinkerton
- Department of Communication Sciences and Disorders, The University of Iowa, Iowa City
| | - Andrea Rohl
- Department of Neurosurgery, The University of Iowa, Iowa City
| | | |
Collapse
|
6
|
Paiva MAAD, Machado LDS, Lopes LW. Proposal of requirements for the development of a training simulator for the auditory-perceptual judgment of voice. Codas 2023; 35:e20220209. [PMID: 37820100 DOI: 10.1590/2317-1782/20232022209pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 12/05/2022] [Indexed: 10/13/2023] Open
Abstract
PURPOSE to identify a set of requirements for the development of an auditory-perceptual training simulator (APT) based on the experience of professors who provide APT. METHODS This is a cross-sectional, descriptive study with a quantitative approach. Twenty-two professors answered an online questionnaire containing 31 items related to APT, involving items about the professional profile, conditions for APT in undergraduate and postgraduate courses in Speech Therapy, APT structure, and evaluation of the APT effect. RESULT it was observed that there is a variation in APT procedures performed in Brazil. The main requirements indicated by the respondents for the APT involve the use of synthesized voices in the initial moments, followed by human voices later; the use of speech tasks with sustained vowels and connected speech; the insertion of complementary information such as gender, age, the profession of the speaker and the spectrography of the vocal signal; training with a minimum time of six hours; the evaluation of the training effect by comparing intra- and inter-judge agreement before and after training; the addition of the parameters of general degree of vocal deviation, roughness, breathiness, and strain; the use of validated continuous and numerical scales; and offering it from the second year of the undergraduate program. CONCLUSION although there is variability in the response of experts, a minimum set of requirements indicated for performing APT with new judges was identified.
Collapse
|
7
|
Diaz JA, Rangarathnam AB, Medina A, Khare VD, Fisher HR. Preliminary Perceptual Findings of Singing After Flow Phonation Voice Exercises for Singers in Training. J Voice 2023:S0892-1997(23)00256-4. [PMID: 37718141 DOI: 10.1016/j.jvoice.2023.08.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 08/11/2023] [Accepted: 08/14/2023] [Indexed: 09/19/2023]
Abstract
INTRODUCTION Goals of voice therapy and vocal pedagogy share similar principles and therefore, therapy approaches designed to improve the disordered voice may be equally effective when used to enhance healthy voice and vice versa. The purpose of this study was to investigate the effects of flow phonation voice exercises on vocal characteristics of students in undergraduate vocal music training programs and examine their potential use in vocal pedagogy. METHODS A total of 10 cis females were recruited and were randomized into two groups: an experimental group (group 1, n = 6) receiving five sessions of flow phonation intervention across 5 weeks; and a control group (group 2, n = 4) that did not receive any direct intervention other than vocal hygiene education. Participants provided data pertaining to auditory perception, subjective respiratory measures, voice-related quality of life, and vocal fatigue before and after 5 weeks. Flow phonation exercises consisted of cup bubble blowing, gargling, and stretch and flow. Wilcoxon signed ranks test was administered to compare outcomes across time points and between groups. RESULTS Data indicate statistically significant changes in auditory perception of the singing voice, and voice-related quality of life for the group that received flow phonation exercises alone. No other measures showed statistical significance. DISCUSSION AND CONCLUSION Overall, this study indicates that the use of flow phonation voice exercises has the potential to improve voice instruction within the voice studio. While our target enrollments were not met to achieve optimal statistical power, our hypotheses were at least partially supported. In particular, positive intervention-related changes were observed in self-perceived voice handicap, and auditory perception of singing which were not observed in the control group.
Collapse
Affiliation(s)
- Jorge A Diaz
- Department of Communication Sciences and Disorders, Florida International University, Miami, FL
| | | | - Angela Medina
- Department of Communication Sciences and Disorders, Florida International University, Miami, FL
| | | | - Hélène R Fisher
- Department of Speech-Language Pathology, Nova Southeastern University, Fort Lauderdale, FL
| |
Collapse
|
8
|
Wang Z, Müller M, Caffier F, Caffier PP. Harnessing Machine Learning in Vocal Arts Medicine: A Random Forest Application for "Fach" Classification in Opera. Diagnostics (Basel) 2023; 13:2870. [PMID: 37761237 PMCID: PMC10528521 DOI: 10.3390/diagnostics13182870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/28/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Vocal arts medicine provides care and prevention strategies for professional voice disorders in performing artists. The issue of correct "Fach" determination depending on the presence of a lyric or dramatic voice structure is of crucial importance for opera singers, as chronic overuse often leads to vocal fold damage. To avoid phonomicrosurgery or prevent a premature career end, our aim is to offer singers an improved, objective fach counseling using digital sound analyses and machine learning procedures. For this purpose, a large database of 2004 sound samples from professional opera singers was compiled. Building on this dataset, we employed a classic ensemble learning method, namely the Random Forest algorithm, to construct an efficient fach classifier. This model was trained to learn from features embedded within the sound samples, subsequently enabling voice classification as either lyric or dramatic. As a result, the developed system can decide with an accuracy of about 80% in most examined voice types whether a sound sample has a lyric or dramatic character. To advance diagnostic tools and health in vocal arts medicine and singing voice pedagogy, further machine learning methods will be applied to find the best and most efficient classification method based on artificial intelligence approaches.
Collapse
Affiliation(s)
- Zehui Wang
- Institute for Digital Transformation, University of Applied Sciences Ravensburg-Weingarten, Doggenriedstraße, 88250 Weingarten, Germany;
| | - Matthias Müller
- Occupational College of Music BFSM Krumbach, Mindelheimer Str. 47, 86381 Krumbach, Germany;
| | - Felix Caffier
- School of Computing, Communication and Business, HTW Berlin University of Applied Sciences, Treskowallee 8, 10318 Berlin, Germany;
| | - Philipp P. Caffier
- Department of Audiology and Phoniatrics, Charité—Universitätsmedizin Berlin, Campus Charité Mitte, Charitéplatz 1, 10117 Berlin, Germany
| |
Collapse
|
9
|
Curtis JA, Borders JC, Dakin AE, Troche MS. Auditory-Perceptual Assessments of Cough: Characterizing Rater Reliability and the Effects of a Standardized Training Protocol. Folia Phoniatr Logop 2023; 76:77-90. [PMID: 37544291 DOI: 10.1159/000533372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 07/24/2023] [Indexed: 08/08/2023] Open
Abstract
INTRODUCTION Auditory-perceptual assessments of cough are commonly used by speech-language pathologists working with people with swallowing disorders with emerging evidence beginning to demonstrate their validity; however, their reliability among novice clinicians is unknown. Therefore, the primary aim of this study was to characterize the reliability of auditory-perceptual assessments of cough among a group of novice clinicians. As a secondary aim, we assessed the effects of a standardized training protocol on the reliability of auditory-perceptual assessments of cough. METHODS Twelve novice clinicians blindly rated ten auditory-perceptual cough descriptors for 120 cough audio clips. Standardized training was then completed by the group of clinicians. The same cough audio clips were then re-randomized and blindly rated. Reliability was analyzed pre- and post-training within each clinician (intra-rater), between each unique pair of raters (dyad-level inter-rater), and for the entire group of raters (group-level inter-rater) using intraclass correlation coefficients and Cohen's Kappa. RESULTS Pre-training reliability was greatest for measures of strength, effectiveness, and normality and lowest when judging the type of expiratory maneuver (cough, throat clear, huff, other). The measures that improved the most with training were ratings of perceived crispness, amount of voicing, and type of expiratory maneuver. Intra-rater reliability coefficients ranged from 0.580 to 0.903 pre-training and 0.756-0.904 post-training. Dyad-level inter-rater reliability coefficients ranged from 0.295 to 0.745 pre-training and 0.450-0.804 post-training. Group-level inter-rater reliability coefficients ranged from 0.454 to 0.919 pre-training and 0.558-0.948 post-training. CONCLUSION Reliability of auditory-perceptual assessments varied across perceptual cough descriptors, but all appeared within the range of what has been historically reported for auditory-perceptual assessments of voice and visual-perceptual assessments of swallowing and cough airflow. Reliability improved for most cough descriptors following 30-60 min of standardized training. Future research is needed to examine the validity of auditory-perceptual assessments of cough by assessing the relationship between perceptual cough descriptors and instrumental measures of cough effectiveness to better understand the role of perceptual assessments in clinical practice.
Collapse
Affiliation(s)
- James A Curtis
- Department of Otolaryngology-Head and Neck Surgery, Aerodigestive Innovations Research Lab (AIR), Weill Cornell Medical College, New York, New York, USA
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| | - James C Borders
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| | - Avery E Dakin
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| | - Michelle S Troche
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| |
Collapse
|
10
|
Park Y, Anand S, Gifford SM, Shrivastav R, Eddins DA. Development and Validation of a Single-Variable Comparison Stimulus for Matching Strained Voice Quality Using a Psychoacoustic Framework. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:16-29. [PMID: 36516473 PMCID: PMC10023177 DOI: 10.1044/2022_jslhr-22-00280] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/17/2022] [Accepted: 09/01/2022] [Indexed: 06/17/2023]
Abstract
PURPOSE Acoustic and perceptual quantification of vocal strain has been a vexing problem for years. To increase measurement rigor, a suitable single-variable matching stimulus for strain was developed and validated, based on the matching stimulus used previously for breathy and rough voice qualities. METHOD A set of 21 comparison stimuli for a single-variable matching task (SVMT) was synthesized based on a speech-shaped sawtooth waveform mixed with speech-shaped noise. Variable bandpass filter gain in mid-to-high frequencies achieved a wide range of computed sharpness (in constant sharpness steps) and served as the independent variable for the SVMT. Ten natural /ɑ/ stimuli with a wide range of the primary voice quality of strain and a minimum of breathiness or roughness were selected and assessed using the SVMT. Natural voice samples and synthetic comparison stimuli were also assessed using a perceptual magnitude estimation (ME) task. RESULTS ME data validated the correspondence of the set of comparison stimuli to varying perceived strain. Perceived strain magnitudes of the comparison stimuli increased significantly and linearly with computed sharpness (r 2 = .99). A linear regression revealed that strain matching values were significantly predicted by computed sharpness (r 2 = .96) and perceived strain magnitudes (r 2 = .95) of the natural voice stimuli. CONCLUSION The perception of vocal strain is strongly associated with computed sharpness and is captured accurately and precisely using an SVMT, in which the independent variable is the bandpass filter gain (in steps of equal sharpness) applied to the comparison stimuli.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Supraja Anand
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Sophia M. Gifford
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| | - Rahul Shrivastav
- Office of the Provost & Executive Vice President, Indiana University, Bloomington
| | - David A. Eddins
- Department of Communication Sciences & Disorders, University of South Florida, Tampa
| |
Collapse
|
11
|
Park Y, Anand S, Kopf LM, Shrivastav R, Eddins DA. Interactions Between Breathy and Rough Voice Qualities and Their Contributions to Overall Dysphonia Severity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:4071-4084. [PMID: 36260821 PMCID: PMC9940885 DOI: 10.1044/2022_jslhr-22-00012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
PURPOSE Dysphonic voices typically present multiple voice quality dimensions. This study investigated potential interactions between perceived breathiness and roughness and their contributions to overall dysphonia severity. METHOD Synthetic stimuli based on four talkers were created to systematically map out potential interactions. For each talker, a stimulus matrix composed of 49 stimuli (seven breathiness steps × seven roughness steps) was created by varying aspiration noise and open quotient to manipulate breathiness and superimposing amplitude modulation of varying depths to simulate roughness. One-dimensional matching (1DMA) and magnitude estimation (1DME) tasks were used to measure perceived breathiness, roughness, their potential interactions, and overall dysphonia severity. Additional 1DME tasks were used to assess a set of natural stimuli that varied along both breathiness and roughness. RESULTS For the synthetic stimuli, the 1DMA task indicated little interaction between the two voice qualities. For the 1DME task, breathiness magnitude was influenced by roughness step to a greater extent than roughness magnitude was influenced by breathiness step. The additive contributions of breathiness and roughness to overall severity gradually diminished with increasing breathiness and roughness steps, possibly reflecting a ceiling effect in the 1DME task. For the natural stimuli, little consistent interaction was observed between breathiness and roughness. CONCLUSIONS The matching task revealed minimal interaction between perceived breathiness and roughness, whereas the magnitude estimation task revealed some interaction between the two qualities and their cumulative contributions to overall dysphonia severity. Task differences are discussed in terms of differences in response bias and the role of perceptual anchors. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21313701.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Lisa M. Kopf
- Department of Speech, Language and Hearing Sciences, The George Washington University, Washington, DC
| | - Rahul Shrivastav
- Office of the Provost and Executive Vice President, Indiana University, Bloomington
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing
| | - David A. Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing
| |
Collapse
|
12
|
Sanuki T, Oridate N, Tateya I, Nito T, Mizoguchi K, Tanabe K. Validity of intraoperative voice monitoring undergoing type 2 thyroplasty with titanium bridges for adductor spasmodic dysphonia. Laryngoscope Investig Otolaryngol 2022; 7:1481-1490. [PMID: 36258849 PMCID: PMC9575097 DOI: 10.1002/lio2.898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 08/08/2022] [Indexed: 11/09/2022] Open
Abstract
Objectives The success of type 2 thyroplasty (TP2) for adductor spasmodic dysphonia (AdSD) depends on the selection of optimally sized titanium bridges, which requires accurate assessment of intraoperative vocal changes. While this procedure has traditionally been performed according to the laryngologist's experience, the most appropriate method for voice monitoring and selection of titanium bridge size remains to be determined. This study aimed to investigate evaluation parameters useful for voice monitoring, as these may allow less experienced surgeons to perform TP2 properly. Methods In this prospective study, voice monitoring was performed in 18 patients with AdSD patients undergoing TP2. Evaluations were performed preoperatively, intraoperatively, 13 weeks postoperatively, and 52 weeks postoperatively using GRBAS (grade, roughness, breathiness, asthenia, and strain), as well as perceptual judgment and acoustic analyses. Results Preoperative and intraoperative assessments of the G, R, B, and S parameters, perceptual judgment, and harmonic‐to‐noise ratio (HNR) were in moderate or better agreement. Intraoperative and 13‐ or 52‐week postoperative measurements of the R, B, and G parameters and strangulation, tremor, and HNR were also in high agreement. When two different sizes of titanium bridges were compared (unselected vs. selected), ratings for G, R, S, strangulation, tremor, jitter, shimmer, HNR, standard deviation of F0, and degree of voice breaks were better for the selected width than the unselected width. Conclusion The candidate items for intraoperative voice monitoring during TP2 for AdSD are G, R, strangulation, tremor, and HNR. The use of these items may help to ensure successful TP2 and contribute to the advancement of laryngeal framework surgery. Level of evidence Level 4.
Collapse
Affiliation(s)
- Tetsuji Sanuki
- Department of Otolaryngology – Head and Neck Surgery Nagoya City University Graduate School of Medical Sciences Nagoya Aichi Japan
| | - Nobuhiko Oridate
- Department of Otolaryngology – Head and Neck Surgery Yokohama City University Yokohama Kanagawa Japan
| | - Ichiro Tateya
- Department of Otolaryngology – Head and Neck Surgery School of Medicine, Fujita Health University Toyoake Aichi Japan
| | - Takaharu Nito
- Department of Otolaryngology Saitama Medical Center, Saitama Medical University Kawagoe Saitama Japan
| | - Kenji Mizoguchi
- Department of Otorhinolaryngology National Hospital Organization Hokkaido Medical Center Sapporo Hokkaido Japan
| | - Kenichiro Tanabe
- Pathophysiology and Bioregulation, St. Marianna University Graduate School of Medicine Kawasaki Kanagawa Japan
| |
Collapse
|
13
|
Behlau M, Almeida AA, Amorim G, Balata P, Bastos S, Cassol M, Constantini AC, Eckley C, Englert M, Gama ACC, Gielow I, Guimarães B, Lima LR, Lopes L, Madazio G, Moreti F, Mouffron V, Nemr K, Oliveira P, Padovani M, Ribeiro VV, Silverio K, Vaiano T, Yamasaki R. Reducing the GAP between science and clinic: lessons from academia and professional practice - part A: perceptual-auditory judgment of vocal quality, acoustic vocal signal analysis and voice self-assessment. Codas 2022; 34:e20210240. [PMID: 35920467 PMCID: PMC9886186 DOI: 10.1590/2317-1782/20212021240pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 10/11/2021] [Indexed: 02/03/2023] Open
Abstract
During the XXVIII Brazilian Congress of SBFa, 24 specialists met and, from a leading position on scientific research as a tool for connecting laboratory and clinic, five fronts of knowledge of the voice specialty were discussed as following: Perceptual-auditory judgment of vocal quality; 2. Acoustic analysis of the vocal signal; 3. Voice self-assessment; 4. Traditional techniques of therapy; 5. Modern techniques of electrostimulation and photobiomodulation (PBMT) in voice. Part "a" of this publication was associated with the consolidation of the analyses of the first three aspects. The trend in the perceptual-auditory judgment of vocal quality was related to the use of standard protocols. The acoustic evaluation of the vocal signal is accessible and can be done descriptively or by extraction of parameters, thus preferring multiparametric measures. Finally, the analysis of the individual himself closes this triad of voice documentation, which will be the basis for the conclusion of the evaluation, reference for monitoring progress, and evaluation of treatment results.
Collapse
Affiliation(s)
- Mara Behlau
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Escola Paulista de Medicina – EPM, Universidade Federal de São Paulo – UNIFESP - São Paulo (SP), Brasil.
| | - Anna Alice Almeida
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Universidade Federal da Paraíba – UFPB - João Pessoa (PB), Brasil.
| | - Geová Amorim
- Universidade Federal de Alagoas – UFAL - Maceió, AL, Brasil.
| | - Patrícia Balata
- APTA COMUNICAÇÃO - Recife (PE), Brasil.,Universidade Federal de Pernambuco – UFPE - Recife (PE), Brasil.
| | - Sávio Bastos
- Centro de Fotobiomodulação e Saúde – CFOTOBIOS - Belém (PA), Brasil.
| | - Mauricéia Cassol
- Universidade Federal de Ciências da Saúde de Porto Alegre – UFCSPA - Porto Alegre (RS), Brasil.
| | | | - Claudia Eckley
- Faculdade de Ciências Médicas da Santa Casa de São Paulo – FCMSCSP - São Paulo (SP), Brasil.
| | - Marina Englert
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.
| | | | - Ingrid Gielow
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.
| | - Bruno Guimarães
- Clínica Bruno Guimarães Serviços de Fonoaudiologia e Fisioterapia - Fortaleza (CE), Brasil.
| | | | - Leonardo Lopes
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Universidade Federal da Paraíba – UFPB - João Pessoa (PB), Brasil.
| | | | - Felipe Moreti
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Centro Universitário da Faculdade de Medicina do ABC – FMABC - Santo André (SP), Brasil.,Complexo Hospitalar Municipal de São Bernardo do Campo – CHMSBC - São Bernardo do Campo (SP), Brasil.
| | - Vanessa Mouffron
- Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| | - Katia Nemr
- Universidade de São Paulo – USP - São Paulo (SP), Brasil.
| | | | - Marina Padovani
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Faculdade de Ciências Médicas da Santa Casa de São Paulo – FCMSCSP - São Paulo (SP), Brasil.
| | - Vanessa Veis Ribeiro
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Universidade Federal da Paraíba – UFPB - João Pessoa (PB), Brasil.
| | - Kelly Silverio
- Faculdade de Odontologia de Bauru – FOB, Universidade de São Paulo – USP - Bauru (SP), Brasil.
| | - Thays Vaiano
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.
| | - Rosiane Yamasaki
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.,Escola Paulista de Medicina – EPM, Universidade Federal de São Paulo – UNIFESP - São Paulo (SP), Brasil.
| |
Collapse
|
14
|
Bispo NDO, Yamasaki R, Padovani MMP, Behlau M. Repetition of anchor stimuli and nature of vocal samples in the perceptual auditory judgment performed by speech-language pathology students. Codas 2022; 34:e20210064. [PMID: 35081197 PMCID: PMC9886121 DOI: 10.1590/2317-1782/20212021064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 07/20/2021] [Indexed: 02/03/2023] Open
Abstract
PURPOSE Verify the effect of anchor repetition in the perceptual auditory judgement of the type of vocal deviation performed by speech-language pathology (SLP) students; analyze the correlation between the amount of different vocal dimensions in the same stimuli and accuracy; investigate the correlation between type of vocal deviation and accuracy. METHODS 54 SLP students were divided in two groups: Group with repetition (GwR), 28 students; and, Group with no repetition (GnR), 26 students. The analyzed sample counted with 220 dysphonic human voices, vowel /ε/. The GwR heard three anchor stimuli before the judgement and every 20 voices during the assessment. The GnR heard the anchor only before beginning the judgement. The anchor stimuli counted with one rough, one breathy and one strain voice. These classifications were compared with reference judgements from three expert SLPs. The intra and inter-rater reliability, the correlation between the amount of different vocal dimensions in the same stimuli and type of vocal deviation with accuracy were assessed. RESULTS The accuracy between type of deviation was similar among groups. The GwR presented slightly higher intra and inter-rater reliability. The student's accuracy was inversely proportional to the amount of different vocal dimensions in the stimuli. Breathiness presented the highest accuracy and strain presented the lowest accuracy. CONCLUSION The repetition of anchor stimuli improved intra and inter-rater reliability. However, it was not effective in the accuracy of the type of vocal deviation. The amount of different vocal dimensions in the stimuli have influence in the students' accuracy.
Collapse
Affiliation(s)
| | - Rosiane Yamasaki
- Universidade Federal de São Paulo – UNIFESP - São Paulo (SP), Brasil.
| | - Marina Martins Pereira Padovani
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.
- Faculdade de Ciências Médicas Santa Casa de São Paulo - São Paulo (SP), Brasil.
| | - Mara Behlau
- Centro de Estudos da Voz – CEV - São Paulo (SP), Brasil.
| |
Collapse
|
15
|
Behlau M, Almeida AA, Amorim G, Balata P, Bastos S, Cassol M, Constantini AC, Eckley C, Englert M, Gama ACC, Gielow I, Guimarães B, Lima LR, Lopes L, Madazio G, Moreti F, Mouffron V, Nemr K, Oliveira P, Padovani M, Ribeiro VV, Silverio K, Vaiano T, Yamasaki R. Reducing the GAP between science and clinic: lessons from academia and professional practice - part A: perceptual-auditory judgment of vocal quality, acoustic vocal signal analysis and voice self-assessment. Codas 2022. [DOI: 10.1590/2317-1782/20212021240en] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023] Open
Abstract
ABSTRACT During the XXVIII Brazilian Congress of SBFa, 24 specialists met and, from a leading position on scientific research as a tool for connecting laboratory and clinic, five fronts of knowledge of the voice specialty were discussed as following: Perceptual-auditory judgment of vocal quality; 2. Acoustic analysis of the vocal signal; 3. Voice self-assessment; 4. Traditional techniques of therapy; 5. Modern techniques of electrostimulation and photobiomodulation (PBMT) in voice. Part “a” of this publication was associated with the consolidation of the analyses of the first three aspects. The trend in the perceptual-auditory judgment of vocal quality was related to the use of standard protocols. The acoustic evaluation of the vocal signal is accessible and can be done descriptively or by extraction of parameters, thus preferring multiparametric measures. Finally, the analysis of the individual himself closes this triad of voice documentation, which will be the basis for the conclusion of the evaluation, reference for monitoring progress, and evaluation of treatment results.
Collapse
Affiliation(s)
- Mara Behlau
- Centro de Estudos da Voz, Brasil; Universidade Federal de São Paulo, Brasil
| | - Anna Alice Almeida
- Centro de Estudos da Voz, Brasil; Universidade Federal da Paraíba, Brasil
| | | | - Patrícia Balata
- APTA COMUNICAÇÃO, Brasil; Universidade Federal de Pernambuco, Brasil
| | | | - Mauricéia Cassol
- Universidade Federal de Ciências da Saúde de Porto Alegre, Brasil
| | | | - Claudia Eckley
- Faculdade de Ciências Médicas da Santa Casa de São Paulo, Brasil
| | | | | | | | - Bruno Guimarães
- Clínica Bruno Guimarães Serviços de Fonoaudiologia e Fisioterapia, Brasil
| | | | - Leonardo Lopes
- Centro de Estudos da Voz, Brasil; Universidade Federal da Paraíba, Brasil
| | | | - Felipe Moreti
- Centro de Estudos da Voz, Brasil; Centro Universitário da Faculdade de Medicina do ABC, Brasil; Complexo Hospitalar Municipal de São Bernardo do Campo, Brasil
| | | | | | | | - Marina Padovani
- Centro de Estudos da Voz, Brasil; Faculdade de Ciências Médicas da Santa Casa de São Paulo, Brasil
| | | | | | | | - Rosiane Yamasaki
- Centro de Estudos da Voz, Brasil; Universidade Federal de São Paulo, Brasil
| |
Collapse
|
16
|
Santos PCMD, Vieira MN, Sansão JPH, Gama ACC. Effect of synthesized voice anchors on auditory-perceptual voice evaluation. Codas 2021; 33:e20190197. [PMID: 33950146 DOI: 10.1590/2317-1782/20202019197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Accepted: 03/25/2020] [Indexed: 11/21/2022] Open
Abstract
PURPOSE To analyze if the use of synthesized voice anchor emissions in auditory-perceptual evaluation improves intra- and inter-rater agreement. METHODS This is a quantitative study. Thirty-two inexperienced evaluators were selected and performed two activities on a Programming Interface created by the authors: Active Calibrator Activity - auditory-perceptual evaluation of the roughness and breathiness parameters as 0-no deviation, 1-slight deviation, 2-moderate deviation, or 3-intense deviation of 25 voices with the support of anchored emissions of synthesized voices; and Inactive Calibrator Activity - auditory-perceptual evaluation of these same voices without the support of anchored vocal emissions. The voices were randomized for each activity, and the order of the activities was drawn randomly for each evaluator. The second activity was performed 15 days after the first. The Kappa coefficient was used to analyze intra- and inter-rater agreement, and the confidence interval (CI) was employed to compare concordances. RESULTS Inter-rater agreement was higher for the intense degree of the breathiness parameter in the Active Calibrator Activity when compared to the Inactive Calibrator Activity, as well as the intra-rater agreement of the roughness parameter.333. CONCLUSION Use of anchor emissions of synthesized voices directly in the evaluation improves intra- and inter-rater agreement in auditory-perceptual voice analysis.
Collapse
Affiliation(s)
| | - Maurílio Nunes Vieira
- Departamento de Engenharia Eletrônica, Escola de Engenharia, Universidade Federal de Minas Gerais - UFMG - Belo Horizonte (MG), Brasil
| | - João Pedro Hallack Sansão
- Departamento de Tecnologia em Engenharia Civil, Computação, Automação, Telemática e Humanidades, Universidade Federal de São João Del Rei - UFSJ - Ouro Branco (MG), Brasil
| | - Ana Cristina Côrtes Gama
- Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais - UFMG - Belo Horizonte (MG), Brasil
| |
Collapse
|
17
|
Wong DWM, Chan RW, Wu CH. Effect of Training With Anchors on Auditory-Perceptual Evaluation of Dysphonia in Speech-Language Pathology Students. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1136-1156. [PMID: 33769849 DOI: 10.1044/2020_jslhr-20-00214] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose Auditory-perceptual evaluation is essential for the clinical assessment of voice disorders. Unstable perceptual voice evaluation has been shown for inexperienced listeners as compared to expert listeners. We examined the effects of perceptual training with external auditory anchors with and without immediate feedback on the evaluation of roughness and breathiness of natural, nonsynthesized speech stimuli (reading of a standard passage) in speech-language pathology students. Method Perceptual voice evaluation and training with anchors using a visual analog scale was implemented with a computer software. Forty-eight speech-language pathology students were randomly assigned into three groups, feedback group (Group F), no feedback group (Group NF), and control group (Group C), attending one training session and four assessment sessions (before training, immediately after training, and 1 and 7 weeks after training). Group F received training with anchors with immediate feedback, Group NF received training without immediate feedback, and Group C received sham training (exposure session). Results Training with anchors significantly increased the rating accuracy (agreement with expert ratings) on both roughness and breathiness for Group F, with the effects lasting for 7 weeks. No significant changes in rating accuracy with training were observed for Group NF and Group C. No improvements in intra- and interrater reliability as well as intrarater agreement were observed in all three groups, whereas interrater agreement on breathiness (but not roughness) significantly increased for all groups, with the effect lasting for 7 weeks only for Group F. Conclusions These findings suggested that perceptual training with external auditory anchors and the use of immediate feedback could be effective for facilitating the development of perceptual voice evaluation skills in speech-language pathology students. Further studies involving more extensive training with stimuli covering a full range of dysphonia severity categories and improvements in design of the training protocol are recommended to verify these results.
Collapse
Affiliation(s)
- Denise Wai-Man Wong
- Department of Speech Language Pathology and Audiology, National Taipei University of Nursing and Health Sciences, Taiwan
| | - Roger W Chan
- Department of Speech Language Pathology and Audiology, National Taipei University of Nursing and Health Sciences, Taiwan
| | - Chia-Hsin Wu
- Department of Speech Language Pathology and Audiology, National Taipei University of Nursing and Health Sciences, Taiwan
| |
Collapse
|
18
|
Walden PR, Khayumov J. The Use of Auditory-Perceptual Training as a Research Method: A Summary Review. J Voice 2020; 36:322-334. [PMID: 32747174 DOI: 10.1016/j.jvoice.2020.06.032] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Revised: 06/23/2020] [Accepted: 06/26/2020] [Indexed: 11/15/2022]
Abstract
OBJECTIVE The purpose of this descriptive review was to document the current state of training to perform auditory-perceptual analysis as reported in the voice literature. METHODS A review of the literature was performed. RESULTS Thirty-six articles were included in the review. The theoretical basis of training, specific training methods employed, duration of training, stimuli used to train, vocal qualities trained, and the type of listeners used are reported. CONCLUSION There is wide variation to training procedures used in research including auditory-perceptual evaluation of voice quality. In order to begin to discover how to best train listeners for research and clinical settings, attention to the training methods used in research is necessary. Further, these training methods must be explicitly acknowledged and described to allow for adequate evaluation of research findings, comparison across studies, and to determine for which populations results might be applicable. The conceptual framework outlined in this study is a starting point to review voice quality research and to design future studies for which auditory-perceptual evaluation is taught to listeners.
Collapse
|
19
|
Kankare E, Barsties V. Latoszek B, Maryn Y, Asikainen M, Rorarius E, Vilpas S, Ilomäki I, Tyrmi J, Rantala L, Laukkanen AM. The acoustic voice quality index version 02.02 in the Finnish-speaking population. LOGOP PHONIATR VOCO 2019; 45:49-56. [DOI: 10.1080/14015439.2018.1556332] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Elina Kankare
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
- Speech and Voice Research Laboratory, University of Tampere, Tampere, Finland
| | | | - Youri Maryn
- Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
- Sint-Augustinus Hospital, European Institute for ORL-HNS, Antwerp, Belgium
- Faculty of Education Health & Social Work, University College Ghent, Ghent, Belgium
- Department of Speech Language and Hearing Sciences, University of Ghent, Ghent, Belgium
| | - Marja Asikainen
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
| | - Eija Rorarius
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
| | - Sarkku Vilpas
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
| | - Irma Ilomäki
- Speech and Voice Research Laboratory, University of Tampere, Tampere, Finland
| | - Jaana Tyrmi
- Speech and Voice Research Laboratory, University of Tampere, Tampere, Finland
| | - Leena Rantala
- Logopedics, School of Social Sciences and Humanities, University of Tampere, Tampere, Finland
| | | |
Collapse
|