1
|
Pommée T, Mbagira D, Morsomme D. French-Language Adaptation of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). J Voice 2024:S0892-1997(24)00085-7. [PMID: 38582726 DOI: 10.1016/j.jvoice.2024.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 03/11/2024] [Accepted: 03/12/2024] [Indexed: 04/08/2024]
Abstract
OBJECTIVES This study aimed to adapt the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) protocol for perceptual voice assessment to the French language. The primary objective was to achieve consensus among an international panel of voice experts on the content of the adapted protocol. METHODS To ensure the relevance and robustness of the French CAPE-V protocol, this study employed a systematic Delphi method and involved an international panel primarily comprising speech therapists and lecturers from France and Belgium. The multi-stage process included an initial panel size of 15 experts. Three rounds of online questionnaires, integrating both quantitative and qualitative data collection, were conducted. Participants provided feedback and ratings on various protocol elements until a consensus was reached. Adaptations targeted the choice of task stimuli (sustained vowel, sentence reading, semi-spontaneous speech), of the rating scales, and vocal quality terminology. RESULTS The Delphi process achieved consensus on all elements of the adapted CAPE-V protocol. Notably, the sustained vowel task saw consensus in favor of the vowel /a/. Sentence adaptations achieved substantial agreement, with the final set unanimously approved. The simple Visual Analog Scale emerged as the preferred rating scale. Agreement on terms for describing vocal qualities marked a crucial step in establishing a shared vocabulary among French-speaking voice experts. CONCLUSIONS The study successfully adapted the CAPE-V protocol for perceptual voice assessment to the French language through a systematic Delphi process. The final protocol closely resembles the original English version, maintaining its structure and core objectives. Consensus on sustained vowel tasks, sentence adaptations, rating scales, and vocal quality terminology indicates the relevance and robustness of the adapted protocol. Ongoing validation studies in France demonstrate the potential clinical utility of the adapted CAPE-V in French-speaking contexts, representing a significant step toward standardized and validated voice assessment tools for clinicians and researchers globally.
Collapse
Affiliation(s)
- Timothy Pommée
- Sunnybrook Research Institute, Toronto, Ontario, Canada; Department of Speech-Language Pathology, Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada.
| | - Déborah Mbagira
- Research Unit for a Life-Course Perspective on Health and Education, Université de Liège, Liège, Belgium
| | - Dominique Morsomme
- Research Unit for a Life-Course Perspective on Health and Education, Université de Liège, Liège, Belgium
| |
Collapse
|
2
|
Wang Z, Müller M, Caffier F, Caffier PP. Harnessing Machine Learning in Vocal Arts Medicine: A Random Forest Application for "Fach" Classification in Opera. Diagnostics (Basel) 2023; 13:2870. [PMID: 37761237 PMCID: PMC10528521 DOI: 10.3390/diagnostics13182870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/28/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Vocal arts medicine provides care and prevention strategies for professional voice disorders in performing artists. The issue of correct "Fach" determination depending on the presence of a lyric or dramatic voice structure is of crucial importance for opera singers, as chronic overuse often leads to vocal fold damage. To avoid phonomicrosurgery or prevent a premature career end, our aim is to offer singers an improved, objective fach counseling using digital sound analyses and machine learning procedures. For this purpose, a large database of 2004 sound samples from professional opera singers was compiled. Building on this dataset, we employed a classic ensemble learning method, namely the Random Forest algorithm, to construct an efficient fach classifier. This model was trained to learn from features embedded within the sound samples, subsequently enabling voice classification as either lyric or dramatic. As a result, the developed system can decide with an accuracy of about 80% in most examined voice types whether a sound sample has a lyric or dramatic character. To advance diagnostic tools and health in vocal arts medicine and singing voice pedagogy, further machine learning methods will be applied to find the best and most efficient classification method based on artificial intelligence approaches.
Collapse
Affiliation(s)
- Zehui Wang
- Institute for Digital Transformation, University of Applied Sciences Ravensburg-Weingarten, Doggenriedstraße, 88250 Weingarten, Germany;
| | - Matthias Müller
- Occupational College of Music BFSM Krumbach, Mindelheimer Str. 47, 86381 Krumbach, Germany;
| | - Felix Caffier
- School of Computing, Communication and Business, HTW Berlin University of Applied Sciences, Treskowallee 8, 10318 Berlin, Germany;
| | - Philipp P. Caffier
- Department of Audiology and Phoniatrics, Charité—Universitätsmedizin Berlin, Campus Charité Mitte, Charitéplatz 1, 10117 Berlin, Germany
| |
Collapse
|
3
|
Curtis JA, Borders JC, Dakin AE, Troche MS. Auditory-Perceptual Assessments of Cough: Characterizing Rater Reliability and the Effects of a Standardized Training Protocol. Folia Phoniatr Logop 2023; 76:77-90. [PMID: 37544291 DOI: 10.1159/000533372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 07/24/2023] [Indexed: 08/08/2023] Open
Abstract
INTRODUCTION Auditory-perceptual assessments of cough are commonly used by speech-language pathologists working with people with swallowing disorders with emerging evidence beginning to demonstrate their validity; however, their reliability among novice clinicians is unknown. Therefore, the primary aim of this study was to characterize the reliability of auditory-perceptual assessments of cough among a group of novice clinicians. As a secondary aim, we assessed the effects of a standardized training protocol on the reliability of auditory-perceptual assessments of cough. METHODS Twelve novice clinicians blindly rated ten auditory-perceptual cough descriptors for 120 cough audio clips. Standardized training was then completed by the group of clinicians. The same cough audio clips were then re-randomized and blindly rated. Reliability was analyzed pre- and post-training within each clinician (intra-rater), between each unique pair of raters (dyad-level inter-rater), and for the entire group of raters (group-level inter-rater) using intraclass correlation coefficients and Cohen's Kappa. RESULTS Pre-training reliability was greatest for measures of strength, effectiveness, and normality and lowest when judging the type of expiratory maneuver (cough, throat clear, huff, other). The measures that improved the most with training were ratings of perceived crispness, amount of voicing, and type of expiratory maneuver. Intra-rater reliability coefficients ranged from 0.580 to 0.903 pre-training and 0.756-0.904 post-training. Dyad-level inter-rater reliability coefficients ranged from 0.295 to 0.745 pre-training and 0.450-0.804 post-training. Group-level inter-rater reliability coefficients ranged from 0.454 to 0.919 pre-training and 0.558-0.948 post-training. CONCLUSION Reliability of auditory-perceptual assessments varied across perceptual cough descriptors, but all appeared within the range of what has been historically reported for auditory-perceptual assessments of voice and visual-perceptual assessments of swallowing and cough airflow. Reliability improved for most cough descriptors following 30-60 min of standardized training. Future research is needed to examine the validity of auditory-perceptual assessments of cough by assessing the relationship between perceptual cough descriptors and instrumental measures of cough effectiveness to better understand the role of perceptual assessments in clinical practice.
Collapse
Affiliation(s)
- James A Curtis
- Department of Otolaryngology-Head and Neck Surgery, Aerodigestive Innovations Research Lab (AIR), Weill Cornell Medical College, New York, New York, USA
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| | - James C Borders
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| | - Avery E Dakin
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| | - Michelle S Troche
- Department of Biobehavioral Sciences, Laboratory for the Study of Upper Airway Dysfunction, Teachers College, Columbia University, New York, New York, USA
| |
Collapse
|
4
|
Venkatraman Y, Mahalingam S, Boominathan P. Development and Validation of Sentences in Tamil for Psychoacoustic Evaluation of Voice Using the Consensus Auditory-Perceptual Evaluation of Voice. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:4539-4556. [PMID: 36368051 DOI: 10.1044/2022_jslhr-22-00169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
PURPOSE The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) is a standardized instrument used in voice assessment to assess voice quality. It has been translated and culturally adapted in several languages. This study aimed at developing and validating a Tamil version of CAPE-V through auditory perceptual evaluation of remotely recorded voice samples. METHOD The Tamil version was adapted with permission from the American Speech-Language-Hearing Association to match the rationale in English CAPE-V. The sentences were constructed by the first author and validated for content by two panels of experts. Forty-five participants (15 cases and 30 controls) were included in the study. Data recording was conducted online (Zoom Video Communications, Inc., app) for all samples. Three raters participated in the auditory evaluation and scored all samples using Grade, Roughness, Breathiness, Asthenia, and Strain (GRBAS) scale followed by CAPE-V with a week gap in between. Twenty percent of samples were repeated to assess intrarater reliability. The intrarater and interrater reliability measures for Tamil CAPE-V were established using intraclass coefficients (ICCs). To ensure construct validity, group differences were determined between the cases and controls. The concurrent validity was established by correlating Tamil CAPE-V with the GRBAS scale. RESULTS The intrarater reliability for Tamil CAPE-V ranged from moderate to excellent (ICC: .610-.998). The Tamil CAPE-V obtained moderate to good interrater reliability for all parameters (ICC: .525-.790) except pitch (ICC: .405). The differences between the cases and controls were statistically significant (p < .01). The correlation between CAPE-V and GRBAS was strong for overall severity, breathiness, and strain (r s = .725-.861) and moderate for roughness (r s = .678). CONCLUSION The Tamil CAPE-V is a reliable and valid tool for auditory perceptual evaluation in Tamil-speaking populations. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21513885.
Collapse
Affiliation(s)
- Yamini Venkatraman
- Department of Speech, Language and Hearing Sciences, Sri Ramachandra Faculty of Audiology & Speech Language Pathology, Sri Ramachandra Institute of Higher Education and Research (Deemed to be University), Chennai, India
| | - Shenbagavalli Mahalingam
- Department of Speech, Language and Hearing Sciences, Sri Ramachandra Faculty of Audiology & Speech Language Pathology, Sri Ramachandra Institute of Higher Education and Research (Deemed to be University), Chennai, India
| | - Prakash Boominathan
- Department of Speech, Language and Hearing Sciences, Sri Ramachandra Faculty of Audiology & Speech Language Pathology, Sri Ramachandra Institute of Higher Education and Research (Deemed to be University), Chennai, India
| |
Collapse
|
5
|
Aghadoost S, Molazeinal Y, Khoddami SM, Shokuhifar G, Dabirmoghaddam P, Saffari M. Dysphonia Severity Index and Consensus Auditory-Perceptual Evaluation of Voice Outcomes, and Their Relation in Hospitalized Patients with COVID-19. J Voice 2022:S0892-1997(22)00384-8. [PMID: 36642593 PMCID: PMC9712076 DOI: 10.1016/j.jvoice.2022.11.034] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 11/24/2022] [Accepted: 11/28/2022] [Indexed: 12/05/2022]
Abstract
OBJECTIVES This study aimed to compare the results of the Dysphonia Severity Index (DSI) and Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) between patients hospitalized with COVID-19 and healthy subjects, as well as to investigate the correlation between DSI and CAPE-V. STUDY DESIGN Cross-sectional survey. MATERIAL AND METHODS Eighty subjects, 40 COVID-19 patients (with a mean age of 41.2± 5.41) and 40 healthy subjects (with a mean age of 44.50± 3.50) participated in this study. Assessments included the DSI for aerodynamic-acoustic measurement and the Persian version of Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) for evaluating auditory-perceptual voice quality. Data were analyzed by means of the independent t-test and Pearson correlation at the 5% significance level. RESULTS The results showed COVID-19 patients got significantly lower score in DSI compared to healthy subjects (P < 0.05). Moreover, the patients with COVID-19 had higher scores in all categories of voice production (severity, roughness, loudness, pitch, strain and breathiness) than the healthy group (P < 0.05). Comparing the result of the two voice assessments in each group revealed that there was a greater negative significant correlation in the diseased group (r p: -0.68, P: 0.001) than in the healthy group (r p: -0.37,P: 0.049). CONCLUSIONS Hospitalized COVID-19 patients experience deviations in the voice quality and acoustic-aerodynamic features of their voice. Also, the results of this study showed the patient group had higher perceptual dysphonia and lower voice quality compared to the healthy group. Further studies are recommended to determine the relationship between objective and subjective voice evaluation in patients with COVID-19 after recovery.
Collapse
Affiliation(s)
- Samira Aghadoost
- Department of Speech Therapy, School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Tehran, Iran.
| | - Yasamin Molazeinal
- School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Tehran, Iran
| | - Seyyedeh Maryam Khoddami
- Department of Speech Therapy, School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Tehran, Iran
| | - Ghazaal Shokuhifar
- Department of audiology, University of Social Welfare and Rehabilitation, Tehran, Tehran, Iran
| | - Payman Dabirmoghaddam
- Otolaryngology Research Center, Tehran University of Medical Sciences, Tehran, Tehran, Iran
| | - Maryam Saffari
- Department of radiology, faculty of medicine, Kashan University of Medical Sciences, Tehran, Tehran, Iran
| |
Collapse
|
6
|
Walden PR, Rau S. Individual Voice Dimensions' Prediction of Overall Dysphonia Severity on Two Auditory-Perceptual Scales. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2759-2777. [PMID: 35868295 DOI: 10.1044/2022_jslhr-21-00689] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
BACKGROUND Auditory-perceptual evaluation of dysphonic voice is an essential clinical activity that characterizes the nature of dysphonia and aids in planning its clinical management. Although there are multidimensional acoustic measures that correlate well with overall severity ratings, they tend to include measures that have only small or moderate correlations with individual voice characteristics frequently perceptually measured (e.g., breathiness or roughness). Given this difference between perceptual and acoustic measures, it is unclear how much individual voice characteristics contribute to a listener's perception of overall severity of dysphonia. PURPOSE The purpose of this study was to explore individual voice characteristics' relative contribution to the rating of overall dysphonia severity and to explore sex-related differences. METHOD Two hundred ninety-six voice samples were accessed from the Perceptual Voice Qualities Database. Roughness, breathiness, asthenia, strain, pitch, and loudness ratings from the Grade, Roughness, Breathiness, Asthenia, Strain and Consensus Auditory-Perceptual Evaluation of Voice scales were used to predict overall voice quality severity in linear regression with bootstrapped coefficients. RESULTS Roughness, breathiness, and strain were the strongest predictors of overall severity. Asthenia and, to a lesser extent, pitch were also significant predictors of overall severity. Loudness was not a significant predictor. There were several sex-related differences noted, as well as differences related to the scale used. CONCLUSIONS Breathiness, roughness, and strain were all important predictors of overall severity for all regressions. Clinicians should be aware of scale-related differences if they are using auditory-perceptual measures to choose voice therapy targets. Analyses accounting for perceptual strategy differences were recommended for future studies.
Collapse
Affiliation(s)
| | - Sydney Rau
- Department of Communication Sciences and Disorders, St. John's University, Queens, NY
| |
Collapse
|