1
|
Kim HB, Song J, Park S, Lee YO. Classification of laryngeal diseases including laryngeal cancer, benign mucosal disease, and vocal cord paralysis by artificial intelligence using voice analysis. Sci Rep 2024; 14:9297. [PMID: 38654036 DOI: 10.1038/s41598-024-58817-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 04/03/2024] [Indexed: 04/25/2024] Open
Abstract
Voice change is often the first sign of laryngeal cancer, leading to diagnosis through hospital laryngoscopy. Screening for laryngeal cancer solely based on voice could enhance early detection. However, identifying voice indicators specific to laryngeal cancer is challenging, especially when differentiating it from other laryngeal ailments. This study presents an artificial intelligence model designed to distinguish between healthy voices, laryngeal cancer voices, and those of the other laryngeal conditions. We gathered voice samples of individuals with laryngeal cancer, vocal cord paralysis, benign mucosal diseases, and healthy participants. Comprehensive testing was conducted to determine the best mel-frequency cepstral coefficient conversion and machine learning techniques, with results analyzed in-depth. In our tests, laryngeal diseases distinguishing from healthy voices achieved an accuracy of 0.85-0.97. However, when multiclass classification, accuracy ranged from 0.75 to 0.83. These findings highlight the challenges of artificial intelligence-driven voice-based diagnosis due to overlaps with benign conditions but also underscore its potential.
Collapse
Affiliation(s)
- Hyun-Bum Kim
- Department of Otolaryngology-Head and Neck Surgery, The Catholic University of Korea, Seoul, South Korea
| | - Jaemin Song
- Department of Industrial and Data Engineering, Hongik University, Seoul, South Korea
| | - Seho Park
- Department of Industrial and Data Engineering, Hongik University, Seoul, South Korea
| | - Yong Oh Lee
- Department of Industrial and Data Engineering, Hongik University, Seoul, South Korea.
| |
Collapse
|
2
|
Anderer S, Hswen Y. Digital Avatars and Personalized Voices-How AI Is Helping to Restore Speech to Patients. JAMA 2024; 331:1259-1261. [PMID: 38517420 DOI: 10.1001/jama.2023.23556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/23/2024]
Abstract
In this Medical News article, Edward Chang, MD, chair of the department of neurological surgery at the University of California, San Francisco Weill Institute for Neurosciences joins JAMA Editor in Chief Kirsten Bibbins-Domingo, PhD, MD, MAS, to discuss the potential for AI to revolutionize communication for those unable to speak due to aphasia.
Collapse
|
3
|
Che Z, Wan X, Xu J, Duan C, Zheng T, Chen J. Speaking without vocal folds using a machine-learning-assisted wearable sensing-actuation system. Nat Commun 2024; 15:1873. [PMID: 38472193 PMCID: PMC10933441 DOI: 10.1038/s41467-024-45915-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 02/06/2024] [Indexed: 03/14/2024] Open
Abstract
Voice disorders resulting from various pathological vocal fold conditions or postoperative recovery of laryngeal cancer surgeries, are common causes of dysphonia. Here, we present a self-powered wearable sensing-actuation system based on soft magnetoelasticity that enables assisted speaking without relying on the vocal folds. It holds a lightweighted mass of approximately 7.2 g, skin-alike modulus of 7.83 × 105 Pa, stability against skin perspiration, and a maximum stretchability of 164%. The wearable sensing component can effectively capture extrinsic laryngeal muscle movement and convert them into high-fidelity and analyzable electrical signals, which can be translated into speech signals with the assistance of machine learning algorithms with an accuracy of 94.68%. Then, with the wearable actuation component, the speech could be expressed as voice signals while circumventing vocal fold vibration. We expect this approach could facilitate the restoration of normal voice function and significantly enhance the quality of life for patients with dysfunctional vocal folds.
Collapse
Affiliation(s)
- Ziyuan Che
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Xiao Wan
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Jing Xu
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Chrystal Duan
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Tianqi Zheng
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Jun Chen
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
| |
Collapse
|
4
|
Feinstein H, Daşdöğen Ü, Libertus ME, Awan SN, Galera RI, Dohar JE, Abbott KV. Cognitive Mechanisms in Pediatric Voice Therapy - An Initial Examination. J Voice 2024; 38:538.e11-538.e22. [PMID: 34750034 PMCID: PMC9068827 DOI: 10.1016/j.jvoice.2021.09.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 09/19/2021] [Accepted: 09/21/2021] [Indexed: 11/26/2022]
Abstract
INTRODUCTION Voice disorders are a common communication disorder in children. Behavioral voice therapy is recommended by both Otolaryngologists and Speech-Language Pathologists as a first-line approach for treatment of benign vocal fold lesions that affect children in large numbers. However, the role of cognitive mechanisms critical to voice therapy have not yet been explored. OBJECTIVE This proof-of-concept study aims to provide preliminary data on the potential relation between cognitive abilities and behavioral results of voice therapy for children with benign phonotraumatic vocal fold lesions. METHODS Six children (4;05 -9;02 years) diagnosed with vocal fold nodules completed a battery of cognitive tests from the NEuroPSYchological (NEPSY-II) Assessment and a standard course of "Adventures in Voice" therapy. Recordings pre and post intervention were analyzed acoustically using the Cepstral Spectral Index of Dysphonia (CSID) and perceptually using visual analog scales (VAS) for severity and resonance separately. Raw and age-corrected scaled scores from the NEPSY-II were then examined for their possible relation to voice outcomes. RESULTS Multiple cognitive functions correlated with voice outcomes. Raw score measures for Design Fluency, Inhibition, Fingertip Tapping, and Narrative Memory correlated favorably with all voice outcome measures. Age correlated with all NEPSY-II raw scores and perceptual voice outcome measures. Scaled scores for Auditory Attention, Design Fluency, Fingertip Tapping (Dominant hand), and Narrative Memory correlated with all voice outcome measures. CONCLUSION Results suggest that there is merit to further investigation of the relation between cognitive skills and their development and voice treatment outcomes in children with benign phonotraumatic lesions. Future studies with larger samples will build on present findings.
Collapse
Affiliation(s)
- Hagar Feinstein
- Department of Communication Sciences and Disorders, University of Delaware, Newark, Delaware, USA.
| | - Ümit Daşdöğen
- Department of Communication Sciences and Disorders, University of Delaware, Newark, Delaware, USA
| | - Melissa E Libertus
- Department of Psychology and Learning Research and Development Center, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Shaheen N Awan
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida, USA
| | - Rhona I Galera
- Department of Rehabilitation Services, Lucile Packard Children's Hospital Stanford University, Palo Alto, California, USA
| | - Joseph E Dohar
- Division of Pediatric Otolaryngology, Children's Hospital of Pittsburgh - UPMC, Pittsburgh, Pennsylvania, USA; Department of Otolaryngology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA; Department of Communication Science and Disorders, University of Pittsburgh School of Health and Rehabilitation, Pittsburgh, Pennsylvania, USA
| | | |
Collapse
|
5
|
Chung HR, Lee Y, Reddy NK, Zhang Z, Chhetri DK. Effects of Thyroarytenoid Activation Induced Vibratory Asymmetry on Voice Acoustics and Perception. Laryngoscope 2024; 134:1327-1332. [PMID: 37676064 DOI: 10.1002/lary.31046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 06/25/2023] [Accepted: 08/22/2023] [Indexed: 09/08/2023]
Abstract
INTRODUCTION Asymmetry of vocal fold (VF) vibration is common in patients with voice complaints and also observed in 10% of normophonic individuals. Although thyroarytenoid (TA) muscle activation plays a crucial role in regulating VF vibration, how TA activation asymmetry relates to voice acoustics and perception is unclear. We evaluated the relationship between TA activation asymmetry and the resulting acoustics and perception. METHODS An in vivo canine model of phonation was used to create symmetric and increasingly asymmetric VF vibratory conditions via graded stimulation of bilateral TA muscles. Naïve listeners (n = 89) rated the perceptual quality of 100 unique voice samples using a visual sort-and-rate task. For each phonatory condition, cepstral peak prominence (CPP), harmonic amplitude (H1-H2), and root-mean-square (RMS) energy of the voice were measured. The relationships between these metrics, vibratory asymmetry, and perceptual ratings were evaluated. RESULTS Increasing levels of TA asymmetry resulted in declining listener preference. Furthermore, only severely asymmetric audio samples were perceptually distinguishable from symmetric and mildly asymmetric conditions. CPP was negatively correlated with TA asymmetry: voices produced with larger degrees of asymmetry were associated with lower CPP values. Listeners preferred audio samples with higher values of CPP, high RMS energy, and lower H1-H2 (less breathy). CONCLUSION Listeners are sensitive to changes in voice acoustics related to vibratory asymmetry. Although increasing vibratory asymmetry is correlated with decreased perceptual ratings, mild asymmetries are perceptually tolerated. This study contributes to our understanding of voice production and quality by identifying perceptually salient and clinically meaningful asymmetry. LEVEL OF EVIDENCE N/A (Basic Science Study) Laryngoscope, 134:1327-1332, 2024.
Collapse
Affiliation(s)
- Hye Rhyn Chung
- David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Los Angeles, California, U.S.A
| | - Yoonjeong Lee
- Department of Head & Neck Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, U.S.A
- Department of Linguistics, University of Michigan, Ann Arbor, Michigan, U.S.A
| | - Neha K Reddy
- David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Los Angeles, California, U.S.A
| | - Zhaoyan Zhang
- Department of Head & Neck Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, U.S.A
| | - Dinesh K Chhetri
- Department of Head & Neck Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, U.S.A
| |
Collapse
|
6
|
Crews T, Vonk J, McGuire M. Catcalls: exotic cats discriminate the voices of familiar caregivers. PeerJ 2024; 12:e16904. [PMID: 38371372 PMCID: PMC10874594 DOI: 10.7717/peerj.16904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Accepted: 01/17/2024] [Indexed: 02/20/2024] Open
Abstract
Background The ability to differentiate familiar from unfamiliar humans has been considered a product of domestication or early experience. Few studies have focused on voice recognition in Felidae despite the fact that this family presents the rare opportunity to compare domesticated species to their wild counterparts and to examine the role of human rearing. Methods We tested whether non-domesticated Felidae species recognized familiar human voices by exposing them to audio playbacks of familiar and unfamiliar humans. In a pilot study, we presented seven cats of five species with playbacks of voices that varied in familiarity and use of the cats' names. In the main study, we presented 24 cats of 10 species with unfamiliar and then familiar voice playbacks using a habituation-dishabituation paradigm. We anticipated that human rearing and use of the cats' names would result in greater attention to the voices, as measured by the latency, intensity, and duration of responses regardless of subject sex and subfamily. Results Cats responded more quickly and with greater intensity (e.g., full versus partial head turn, both ears moved versus one ear twitching) to the most familiar voice in both studies. They also responded for longer durations to the familiar voice compared to the unfamiliar voices in the main study. Use of the cats' name and rearing history did not significantly impact responding. These findings suggest that close human contact rather than domestication is associated with the ability to discriminate between human voices and that less social species may have socio-cognitive abilities akin to those of more gregarious species. With cats of all species being commonly housed in human care, it is important to know that they differentiate familiar from unfamiliar human voices.
Collapse
Affiliation(s)
- Taylor Crews
- Department of Psychology, Oakland University, Rochester, MI, United States of America
| | - Jennifer Vonk
- Department of Psychology, Oakland University, Rochester, MI, United States of America
| | - Molly McGuire
- Animal Wellbeing, Zoo Miami, Miami, FL, United States of America
| |
Collapse
|
7
|
Morton-Jones ME, Gladden LB, Kavazis AN, Sandage MJ. A Tutorial on Skeletal Muscle Metabolism and the Role of Blood Lactate: Implications for Speech Production. J Speech Lang Hear Res 2024; 67:369-383. [PMID: 38157288 DOI: 10.1044/2023_jslhr-23-00531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]
Abstract
PURPOSE The purpose of this tutorial is threefold: (a) present relevant exercise science literature on skeletal muscle metabolism and synthesize the limited available research on metabolism of the adult human speech musculature in an effort to elucidate the role of metabolism in speech production; (b) introduce a well-studied metabolic serum biomarker in exercise science, lactate, and the potential usefulness of investigating this metabolite, through a well-established exercise science methodology, to better understand metabolism of the musculature involved in voice production; and (c) discuss exercise physiology considerations for future voice science research that seeks to investigate blood lactate and metabolism in voice physiology in an ecologically valid manner. METHOD This tutorial begins with relevant exercise science literature on the basic cellular processes of muscle contraction that require energy and the metabolic mechanisms that regenerate the energy required for task execution. The tutorial next synthesizes the available research investigating metabolism of the adult human speech musculature. This is followed by the authors proposing a hypothesis of speech metabolism based on the voice science literature and the application of well-studied exercise science principles of muscle physiology. The tutorial concludes with a discussion and the potential usefulness of lactate in investigations to better understand the metabolism of the musculature involved in vocal demand tasks. CONCLUSION The role of metabolism during speech (respiratory, laryngeal, and articulatory) is an understudied yet critical aspect of speech physiology that warrants further study to better understand the metabolic systems that are used to meet vocal demands.
Collapse
Affiliation(s)
| | | | | | - Mary J Sandage
- Department of Speech, Language, and Hearing Sciences, Auburn University, AL
| |
Collapse
|
8
|
Kapsner-Smith MR, Abur D, Eadie TL, Stepp CE. Test-Retest Reliability of Behavioral Assays of Feedforward and Feedback Auditory-Motor Control of Voice and Articulation. J Speech Lang Hear Res 2024; 67:34-48. [PMID: 37992404 PMCID: PMC11000789 DOI: 10.1044/2023_jslhr-23-00038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 07/24/2023] [Accepted: 09/25/2023] [Indexed: 11/24/2023]
Abstract
PURPOSE Behavioral assays of feedforward and feedback auditory-motor control of voice and articulation frequently are used to make inferences about underlying neural mechanisms and to study speech development and disorders. However, no studies have examined the test-retest reliability of such measures, which is critical for rigorous study of auditory-motor control. Thus, the purpose of the present study was to assess the reliability of assays of feedforward and feedback control in voice versus articulation domains. METHOD Twenty-eight participants (14 cisgender women, 12 cisgender men, one transgender man, one transmasculine/nonbinary) who denied any history of speech, hearing, or neurological impairment were measured for responses to predictable versus unexpected auditory feedback perturbations of vocal (fundamental frequency, fo) and articulatory (first formant, F1) acoustic parameters twice, with 3-6 weeks between sessions. Reliability was measured with intraclass correlations. RESULTS Opposite patterns of reliability were observed for fo and F1; fo reflexive responses showed good reliability and fo adaptive responses showed poor reliability, whereas F1 reflexive responses showed poor reliability and F1 adaptive responses showed moderate reliability. However, a criterion-referenced categorical measurement of fo adaptive responses as typical versus atypical showed substantial test-retest agreement. CONCLUSIONS Individual responses to some behavioral assays of auditory-motor control of speech should be interpreted with caution, which has implications for several fields of research. Additional research is needed to establish reliable criterion-referenced measures of F1 adaptive responses as well as fo and F1 reflexive responses. Furthermore, the opposite patterns of test-retest reliability observed for voice versus articulation add to growing evidence for differences in underlying neural control mechanisms.
Collapse
Affiliation(s)
| | - Defne Abur
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Computational Linguistics, Center for Language and Cognition, University of Groningen, the Netherlands
- Research School of Behavioral and Cognitive Neurosciences, University of Groningen, the Netherlands
| | - Tanya L. Eadie
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
9
|
Toles LE, Shembel AC. Acoustic and Physiologic Correlates of Vocal Effort in Individuals With and Without Primary Muscle Tension Dysphonia. Am J Speech Lang Pathol 2024; 33:237-247. [PMID: 37931092 PMCID: PMC11000796 DOI: 10.1044/2023_ajslp-23-00159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 08/23/2023] [Accepted: 09/16/2023] [Indexed: 11/08/2023]
Abstract
OBJECTIVES The aims of this study were to determine relationships between vocal effort and (a) acoustic correlates of vocal output and (b) supraglottic compression in individuals with primary muscle tension dysphonia (pMTD) and without voice disorders (controls) in the context of a vocal load challenge. METHOD Twenty-six individuals with pMTD and 35 vocally healthy controls participated in a 30-min vocal load challenge. The pre- and postload relationships among self-ratings of vocal effort, various acoustic voice measures, and supraglottic compression (mediolateral and anteroposterior) were tested with multiple regression models and post hoc Pearson's correlations. Acoustic measures included cepstral peak prominence (CPP), low-to-high spectral ratio, difference in intensity between the first two harmonics, fundamental frequency, and sound pressure level (dB SPL). RESULTS Regression models for CPP and mediolateral compression were statistically significant. Vocal effort, diagnosis of pMTD, and vocal demand were each significant variables influencing CPP measures. CPP was lower in the pMTD group across stages. There was no statistical change in CPP following the vocal load challenge within either group, but both groups had an increase in vocal effort postload. Vocal effort and diagnosis influenced the mediolateral compression model. Mediolateral compression was higher in the pMTD group across stages and had a negative relationship with vocal effort, but it did not differ after vocal loading. CONCLUSIONS CPP and mediolateral supraglottic compression were influenced by vocal effort and diagnosis of pMTD. Increased vocal effort was associated with lower CPP, particularly after vocal load, and decreased mediolateral supraglottic compression in the pMTD group.
Collapse
Affiliation(s)
- Laura E. Toles
- Department of Otolaryngology–Head and Neck Surgery, The University of Texas Southwestern Medical Center, Dallas
| | - Adrianna C. Shembel
- Department of Otolaryngology–Head and Neck Surgery, The University of Texas Southwestern Medical Center, Dallas
- School of Behavioral and Brain Sciences, Department of Speech, Language, and Hearing, The University of Texas at Dallas, Richardson
| |
Collapse
|
10
|
Khoshhal Mollasaraei Z, Behroozmand R. Impairment of the internal forward model and feedback mechanisms for vocal sensorimotor control in post-stroke aphasia: evidence from directional responses to altered auditory feedback. Exp Brain Res 2024; 242:225-239. [PMID: 37999725 PMCID: PMC10849397 DOI: 10.1007/s00221-023-06743-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 11/05/2023] [Indexed: 11/25/2023]
Abstract
The present study examined opposing and following vocal responses to altered auditory feedback (AAF) to determine how damage to left-hemisphere brain networks impairs the internal forward model and feedback mechanisms in post-stroke aphasia. Forty-nine subjects with aphasia and sixty age-matched controls performed speech vowel production tasks while their auditory feedback was altered using randomized ± 100 cents upward and downward pitch-shift stimuli. Data analysis revealed that when vocal responses were averaged across all trials (i.e., opposing and following), the overall magnitude of vocal compensation was significantly reduced in the aphasia group compared with controls. In addition, when vocal responses were analyzed separately for opposing and following trials, subjects in the aphasia group showed a significantly lower percentage of opposing and higher percentage of following vocal response trials compared with controls, particularly for the upward pitch-shift stimuli. However, there was no significant difference in the magnitude of opposing and following vocal responses between the two groups. These findings further support previous evidence on the impairment of vocal sensorimotor control in aphasia and provide new insights into the distinctive impact of left-hemisphere stroke on the internal forward model and feedback mechanisms. In this context, we propose that the lower percentage of opposing responses in aphasia may be accounted for by deficits in feedback-dependent mechanisms of audio-vocal integration and motor control. In addition, the higher percentage of following responses may reflect aberrantly increased reliance of the speech system on the internal forward model for generating sensory predictions during vocal error detection and motor control.
Collapse
Affiliation(s)
- Zeinab Khoshhal Mollasaraei
- NeuroSyntax Lab, Department of Communication Sciences and Disorders, Arnold School of Public Health, University of South Carolina, 915 Greene Street, Columbia, SC, 29208, USA
| | - Roozbeh Behroozmand
- Speech Neuroscience Lab, Department of Speech, Language, and Hearing, Callier Center for Communication Disorders, School of Behavioral and Brain Sciences, The University of Texas at Dallas, 2811 N. Floyd Rd, Richardson, TX, 75080, USA.
| |
Collapse
|
11
|
Gartling GJ, van Mersbergen M, Crow K, Lewandowski A, Smith LJ, Gartner-Schmidt JL. The Patient Experience: The Relationship Between Vocal Handicap, Congruency, Perceived Present Control, and Mood Across Four Voice Disorders. J Voice 2024; 38:244.e15-244.e27. [PMID: 34615615 DOI: 10.1016/j.jvoice.2021.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 08/19/2021] [Accepted: 08/25/2021] [Indexed: 10/20/2022]
Abstract
OBJECTIVE The potential for negative sequalae in psychosocial well-being presents clinical importance to the assessment of voice disorders. Despite the impairment voice disorders cause in the psychosocial domain, the clinical assessment of these disorders relies heavily on visual perceptual judgments of the larynx, audio-perceptual, as well as acoustic and aerodynamic measures. While these measures aid in accurate diagnosis and are necessary for standard of care, they present little insight into the patient experience of having a voice disorder. DESIGN Retrospective between-subject, non-experimental design. METHODS Data from 335 patients from the University of Pittsburgh Voice Center were collected from scores of the Voice Handicap Index-10 (VHI-10) and two recent questionnaires, the Voice Present Perceived Control scale (VPPC), and the Vocal Congruency Scale (VCS). Examining how these voice-specific scales related to three mental health screeners for stress (Perceived Stress Scale-4), anxiety (Generalized Anxiety Disorder-7) and depression (Patient Health Questionnaire-9) were also examined. Patient diagnoses included primary muscle tension dysphonia (pMTD), unilateral vocal fold paralysis (UVFP), vocal fold atrophy, and mid membranous vocal fold lesions. RESULTS There were significant differences in scores from the voice-specific scales between diagnostic groups with UVFP being the highest (worst) in VHI-10 and UVFP being the lowest (worst) in VCS compared to healthy controls. There was no significant difference in VPPC scores between diagnostic groups. Results showed statistically significant inverse relationships between the VHI-10 and the VPPC and between the VHI-10 and VCS for all diagnostic groups. A significant direct relationship was found between the VPPC and the VCS for patients diagnosed with MTD, UVFP and Lesions. In sum, patients with UVFP presented with the most frequent and sometimes strongest relationships between voice and mental health measures. DISCUSSION This study marks an initial investigation into the nuanced patient experience of having a voice disorder. Three theoretically unrelated voice constructs: handicap, perceived control, and sense of self, were measured via self-report. Results from this study describe the patient experience correlating to these constructs with weak correlations to stress, anxiety, and depression. Findings also clearly suggest that patient experience varies among diagnostic groups, as well as varying constructs. Measures of multiple constructs of patient perception provide valuable insight into a patient's experience of their voice disorder, guidance on the direction of voice treatment, and justification for such treatments.
Collapse
Affiliation(s)
- Gary J Gartling
- Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Miriam van Mersbergen
- School of Communication Sciences and Disorders, The University of Memphis, Memphis, Tennessee
| | - Karen Crow
- Louisville Center for Voice Care, Louisville, Kentucky
| | - Ali Lewandowski
- University of Pittsburgh Voice Center, Department of Otolaryngology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
| | - Libby J Smith
- University of Pittsburgh Voice Center, Department of Otolaryngology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
| | - Jackie L Gartner-Schmidt
- Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania; University of Pittsburgh Voice Center, Department of Otolaryngology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania.
| |
Collapse
|
12
|
Lester-Smith RA, Derrick E, Larson CR. Characterization of Source-Filter Interactions in Vocal Vibrato Using a Neck-Surface Vibration Sensor: A Pilot Study. J Voice 2024; 38:1-9. [PMID: 34649740 PMCID: PMC8995401 DOI: 10.1016/j.jvoice.2021.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 11/23/2022]
Abstract
PURPOSE Vocal vibrato is a singing technique that involves periodic modulation of fundamental frequency (fo) and intensity. The physiological sources of modulation within the speech mechanism and the interactions between the laryngeal source and vocal tract filter in vibrato are not fully understood. Therefore, the purpose of this study was to determine if differences in the rate and extent of fo and intensity modulation could be captured using simultaneously recorded signals from a neck-surface vibration sensor and a microphone, which represent features of the source before and after supraglottal vocal tract filtering. METHOD Nine classically-trained singers produced sustained vowels with vibrato while simultaneous signals were recorded using a vibration sensor and a microphone. Acoustical analyses were performed to measure the rate and extent of fo and intensity modulation for each trial. Paired-samples sign tests were used to analyze differences between the rate and extent of fo and intensity modulation in the vibration sensor and microphone signals. RESULTS The rate and extent of fo modulation and the extent of intensity modulation were equivalent in the vibration sensor and microphone signals, but the rate of intensity modulation was significantly higher in the microphone signal than in the vibration sensor signal. Larger differences in the rate of intensity modulation were seen with vowels that typically have smaller differences between the first and second formant frequencies. CONCLUSIONS This study demonstrated that the rate of intensity modulation at the source prior to supraglottal vocal tract filtering, as measured in neck-surface vibration sensor signals, was lower than the rate of intensity modulation after supraglottal vocal tract filtering, as measured in microphone signals. The difference in rate varied based on the vowel. These findings provide further support of the resonance-harmonics interaction in vocal vibrato. Further investigation is warranted to determine if differences in the physiological source(s) of vibrato account for inconsistent relationships between the extent of intensity modulation in neck-surface vibration sensor and microphone signals.
Collapse
Affiliation(s)
- Rosemary A Lester-Smith
- Department of Physical Medicine & Rehabilitation, Feinberg School of Medicine, Northwestern University, Chicago, Illinois.
| | - Elaina Derrick
- Department of Speech, Language and Hearing Sciences, Moody College of Communication, The University of Texas at Austin, Austin, Texas
| | - Charles R Larson
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| |
Collapse
|
13
|
Yan J, Hou J, Zhang H, Yang X, Sheng Y, Du X, Kong D, Wang Z, Ren X, Wu L. Immediate effect of recurrent laryngeal nerve stimulation in patients with idiopathic unilateral vocal fold paralysis. Acta Otolaryngol 2024; 144:65-70. [PMID: 38265886 DOI: 10.1080/00016489.2024.2306961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 01/12/2024] [Indexed: 01/26/2024]
Abstract
BACKGROUND There is a lack of effective treatment for idiopathic unilateral vocal fold paralysis (IUVFP). A better phonation was reported by patients after laryngeal nerve stimulation during our clinical examination. OBJECTIVES This study aims to investigate immediate effect of recurrent laryngeal nerve (RLN) stimulation on phonation in patients with IUVFP. MATERIAL AND METHODS Sixty-two patients with clinically identified IUVFP underwent RLN stimulation with needle electrodes. Laryngoscopy, acoustic analysis, and voice perception assessment were performed for quantitative comparison of vocal function and voice quality before and after the intervention. RESULTS Laryngoscopic images showed a larger motion range of the paralyzed vocal fold (p < .01) and better glottal closure (p < .01) after RLN stimulation. Acoustic analysis revealed that the dysphonia severity index increased significantly (p < .01) while the jitter and shimmer decreased after the intervention (p < .05). According to perceptual evaluation, RLN stimulation significantly increased RBH grades in patients with IUVFP (p < .01). Furthermore, the improvement in voice perception had a moderate positive correlation with the decrease in the glottal closure. CONCLUSIONS AND SIGNIFICANCE This study shows a short-term improvement of phonation in IUVFP patients after RLN stimulation, which provides proof-of-concept for trialing a controlled delivery of RLN stimulation and assessing durability of any observed responses.
Collapse
Affiliation(s)
- Jing Yan
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Jin Hou
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Huihui Zhang
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Xinyi Yang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Ying Sheng
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Xiaoying Du
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Demin Kong
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Zhenghui Wang
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Xiaoyong Ren
- Department of Otorhinolaryngology Head and Neck Surgery, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| | - Liang Wu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, P. R. China
| |
Collapse
|
14
|
Sundberg J, Salomão GL, Scherer KR. Emotional expressivity in singing. Assessing physiological and acoustic indicators of two opera singers' voice characteristics. J Acoust Soc Am 2024; 155:18-28. [PMID: 38169520 DOI: 10.1121/10.0023938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 11/21/2023] [Indexed: 01/05/2024]
Abstract
In an earlier study, we analyzed how audio signals obtained from three professional opera singers varied when they sang one octave wide eight-tone scales in ten different emotional colors. The results showed systematic variations in voice source and long-term-average spectrum (LTAS) parameters associated with major emotion "families". For two of the singers, subglottal pressure (PSub) also was recorded, thus allowing analysis of an additional main physiological voice control parameter, glottal resistance (defined as the ratio between PSub and glottal flow), and related to glottal adduction. In the present study, we analyze voice source and LTAS parameters derived from the audio signal and their correlation with Psub and glottal resistance. The measured parameters showed a systematic relationship with the four emotion families observed in our previous study. They also varied systematically with values of the ten emotions along the valence, power, and arousal dimensions; valence showed a significant correlation with the ratio between acoustic voice source energy and subglottal pressure, while Power varied significantly with sound level and two measures related to the spectral dominance of the lowest spectrum partial. the fundamental.
Collapse
Affiliation(s)
- Johan Sundberg
- Department of Speech Music and Hearing, School of Electrical Engineering, Royal Institute of Technology (KTH), Stockholm, Sweden
| | - Gláucia Laís Salomão
- Stockholm University Brain Imaging Centre (SUBIC), Department of Linguistics, Stockholm University, Stockholm, Sweden
| | - Klaus R Scherer
- Department of Psychology, University of Geneva, Geneva, Switzerland
| |
Collapse
|
15
|
Alves MDC, Mancini PC, Teixeira LC. Modifications of auditory feedback and its effects on the voice of adult subjects: a scoping review. Codas 2023; 36:e20220202. [PMID: 38126424 PMCID: PMC10750862 DOI: 10.1590/2317-1782/20232022202pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 05/29/2023] [Indexed: 12/23/2023] Open
Abstract
INTRODUCTION The auditory perception of voice and its production involve auditory feedback, kinesthetic cues and the feedforward system that produce different effects for the voice. The Lombard, Sidetone and Pitch-Shift-Reflex effects are the most studied. The mapping of scientific experiments on changes in auditory feedback for voice motor control makes it possible to examine the existing literature on the phenomenon and may contribute to voice training or therapies. PURPOSE To map experiments and research results with manipulation of auditory feedback for voice motor control in adults. METHOD Scope review following the Checklist Preferred Reporting Items for Systematic reviews and Meta-Analyses extension (PRISMA-ScR) to answer the question: "What are the investigation methods and main research findings on the manipulation of auditory feedback in voice self-monitoring of adults?". The search protocol was based on the Population, Concept, and Context (PCC) mnemonic strategy, in which the population is adult individuals, the concept is the manipulation of auditory feedback and the context is on motor voice control. Articles were searched in the databases: BVS/Virtual Health Library, MEDLINE/Medical Literature Analysis and Retrieval System online, COCHRANE, CINAHL/Cumulative Index to Nursing and Allied Health Literature, SCOPUS and WEB OF SCIENCE. RESULTS 60 articles were found, 19 on the Lombard Effect, 25 on the Pitch-shift-reflex effect, 12 on the Sidetone effect and four on the Sidetone/Lombard effect. The studies are in agreement that the insertion of a noise that masks the auditory feedback causes an increase in the individual's speech intensity and that the amplification of the auditory feedback promotes the reduction of the sound pressure level in the voice production. A reflex response to the change in pitch is observed in the auditory feedback, however, with particular characteristics in each study. CONCLUSION The material and method of the experiments are different, there are no standardizations in the tasks, the samples are varied and often reduced. The methodological diversity makes it difficult to generalize the results. The main findings of research on auditory feedback on voice motor control confirm that in the suppression of auditory feedback, the individual tends to increase the intensity of the voice. In auditory feedback amplification, the individual decreases the intensity and has greater control over the fundamental frequency, and in frequency manipulations, the individual tends to correct the manipulation. The few studies with dysphonic individuals show that they behave differently from non-dysphonic individuals.
Collapse
Affiliation(s)
- Moisés do Carmo Alves
- Programa de Pós-graduação em Ciências Fonoaudiológicas, Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| | - Patrícia Cotta Mancini
- Programa de Pós-graduação em Ciências Fonoaudiológicas, Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| | - Leticia Caldas Teixeira
- Programa de Pós-graduação em Ciências Fonoaudiológicas, Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| |
Collapse
|
16
|
Bradshaw AR, Lametti DR, Shiller DM, Jasmin K, Huang R, McGettigan C. Speech motor adaptation during synchronous and metronome-timed speech. J Exp Psychol Gen 2023; 152:3476-3489. [PMID: 37616075 DOI: 10.1037/xge0001459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2023]
Abstract
Sensorimotor integration during speech has been investigated by altering the sound of a speaker's voice in real time; in response, the speaker learns to change their production of speech sounds in order to compensate (adaptation). This line of research has however been predominantly limited to very simple speaking contexts, typically involving (a) repetitive production of single words and (b) production of speech while alone, without the usual exposure to other voices. This study investigated adaptation to a real-time perturbation of the first and second formants during production of sentences either in synchrony with a prerecorded voice (synchronous speech group) or alone (solo speech group). Experiment 1 (n = 30) found no significant difference in the average magnitude of compensatory formant changes between the groups; however, synchronous speech resulted in increased between-individual variability in such formant changes. Participants also showed acoustic-phonetic convergence to the voice they were synchronizing with prior to introduction of the feedback alteration. Furthermore, the extent to which the changes required for convergence agreed with those required for adaptation was positively correlated with the magnitude of subsequent adaptation. Experiment 2 tested an additional group with a metronome-timed speech task (n = 15) and found a similar pattern of increased between-participant variability in formant changes. These findings demonstrate that speech motor adaptation can be measured robustly at the group level during performance of more complex speaking tasks; however, further work is needed to resolve whether self-voice adaptation and other-voice convergence reflect additive or interactive effects during sensorimotor control of speech. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
Collapse
Affiliation(s)
- Abigail R Bradshaw
- Department of Speech, Hearing and Phonetic Sciences, University College London
| | | | - Douglas M Shiller
- School of Speech-Language Pathology and Audiology, Universite de Montreal
| | - Kyle Jasmin
- Department of Psychology, Royal Holloway, University of London
| | - Ruiling Huang
- Department of Speech, Hearing and Phonetic Sciences, University College London
| | - Carolyn McGettigan
- Department of Speech, Hearing and Phonetic Sciences, University College London
| |
Collapse
|
17
|
Pautz N, McDougall K, Mueller-Johnson K, Nolan F, Paver A, Smith HMJ. Identifying unfamiliar voices: Examining the system variables of sample duration and parade size. Q J Exp Psychol (Hove) 2023; 76:2804-2822. [PMID: 36718784 PMCID: PMC10655699 DOI: 10.1177/17470218231155738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 01/13/2023] [Accepted: 01/17/2023] [Indexed: 02/01/2023]
Abstract
Voice identification parades can be unreliable due to the error-prone nature of earwitness responses. UK government guidelines recommend that voice parades should have nine voices, each played for 60 s. This makes parades resource-consuming to construct. In this article, we conducted two experiments to see if voice parade procedures could be simplified. In Experiment 1 (N = 271, 135 female), we investigated if reducing the duration of the voice samples on a nine-voice parade would negatively affect identification performance using both conventional logistic and signal detection approaches. In Experiment 2 (N = 270, 136 female), we first explored if the same sample duration conditions used in Experiment 1 would lead to different outcomes if we reduced the parade size to include only six voices. Following this, we pooled the data from both experiments to investigate the influence of target-position effects. The results show that 15-s sample durations result in statistically equivalent voice identification performance to the longer 60-s sample durations, but that the 30-s sample duration suffers in terms of overall signal sensitivity. This pattern of results was replicated using both a nine- and a six-voice parade. Performance on target-absent parades were at chance levels in both parade sizes, and response criteria were mostly liberal. In addition, unwanted position effects were present. The results provide initial evidence that the sample duration used in a voice parade may be reduced, but we argue that the guidelines recommending a parade with nine voices should be maintained to provide additional protection for a potentially innocent suspect given the low target-absent accuracy.
Collapse
|
18
|
Filippa M, Benis D, Adam-Darque A, Grandjean D, Hüppi PS. Preterm infants show an atypical processing of the mother's voice. Brain Cogn 2023; 173:106104. [PMID: 37949001 DOI: 10.1016/j.bandc.2023.106104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/27/2023] [Accepted: 10/30/2023] [Indexed: 11/12/2023]
Abstract
To understand the consequences of prematurity on language perception, it is fundamental to determine how atypical early sensory experience affects brain development. At term equivalent age, ten preterm and ten full-term newborns underwent high-density EEG during mother or stranger speech presentation, in the forward or backward order. A general group effect terms > preterms is evident in the theta frequency band, in the left temporal area, with preterms showing significant activation for strangers' and terms for the mother's voice. A significant group contrast in the low and high theta in the right temporal regions indicates higher activations for the stranger's voice in preterms. Finally, only full terms presented a late gamma band increase for the maternal voice, indicating a more mature brain response. EEG time-frequency analysis demonstrate that preterm infants are selectively responsive to stranger voices in both temporal hemispheres, and that they lack selective brain responses to their mother's forward voice.
Collapse
Affiliation(s)
- Manuela Filippa
- Division of Development and Growth, Child and Adolescent Department, Rue Willy-Donzé 1205 Genève, University of Geneva, Geneva, Switzerland; Swiss Center for Affective Sciences, Department of Psychology and Educational Sciences, University of Geneva, Boulevard Carl-Vogt 101 Genève, Geneva, Switzerland.
| | - Damien Benis
- Division of Development and Growth, Child and Adolescent Department, Rue Willy-Donzé 1205 Genève, University of Geneva, Geneva, Switzerland; Swiss Center for Affective Sciences, Department of Psychology and Educational Sciences, University of Geneva, Boulevard Carl-Vogt 101 Genève, Geneva, Switzerland
| | - Alexandra Adam-Darque
- Laboratory of Cognitive Neurorehabilitation, Department of Clinical Neuroscience, Division of Neurorehabilitation, University Hospital of Geneva and University of Geneva, Rue Gabrielle-Perret-Gentil 4, 1211 Geneva, Switzerland
| | - Didier Grandjean
- Swiss Center for Affective Sciences, Department of Psychology and Educational Sciences, University of Geneva, Boulevard Carl-Vogt 101 Genève, Geneva, Switzerland
| | - Petra S Hüppi
- Division of Development and Growth, Child and Adolescent Department, Rue Willy-Donzé 1205 Genève, University of Geneva, Geneva, Switzerland
| |
Collapse
|
19
|
Serry MA, Alzamendi GA, Zañartu M, Peterson SD. An Euler-Bernoulli-type beam model of the vocal folds for describing curved and incomplete glottal closure patterns. J Mech Behav Biomed Mater 2023; 147:106130. [PMID: 37774440 PMCID: PMC10787517 DOI: 10.1016/j.jmbbm.2023.106130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 06/30/2023] [Accepted: 09/16/2023] [Indexed: 10/01/2023]
Abstract
Incomplete glottal closure is a laryngeal configuration wherein the glottis is not fully obstructed prior to phonation. It has been linked to inefficient voice production and voice disorders. Various incomplete glottal closure patterns can arise and the mechanisms driving them are not well understood. In this work, we introduce an Euler-Bernoulli composite beam vocal fold (VF) model that produces qualitatively similar incomplete glottal closure patterns as those observed in experimental and high-fidelity numerical studies, thus offering insights into the potential underlying physical mechanisms. Refined physiological insights are pursued by incorporating the beam model into a VF posturing model that embeds the five intrinsic laryngeal muscles. Analysis of the combined model shows that co-activating the lateral cricoarytenoid (LCA) and interarytenoid (IA) muscles without activating the thyroarytenoid (TA) muscle results in a bowed (convex) VF geometry with closure at the posterior margin only; this is primarily attributed to the reactive moments at the anterior VF margin. This bowed pattern can also arise during VF compression (due to extrinsic laryngeal muscle activation for example), wherein the internal moment induced passively by the TA muscle tissue is the predominant mechanism. On the other hand, activating the TA muscle without incorporating other adductory muscles results in anterior and mid-membranous glottal closure, a concave VF geometry, and a posterior glottal opening driven by internal moments induced by TA muscle activation. In the case of initial full glottal closure, the posterior cricoarytenoid (PCA) muscle activation cancels the adductory effects of the LCA and IA muscles, resulting in a concave VF geometry and posterior glottal opening. Furthermore, certain maneuvers involving co-activation of all adductory muscles result in an hourglass glottal shape due to a reactive moment at the anterior VF margin and moderate internal moment induced by TA muscle activation. These findings have implications regarding potential laryngeal maneuvers in patients with voice disorders involving imbalances or excessive tension in the laryngeal muscles such as muscle tension dysphonia.
Collapse
Affiliation(s)
- Mohamed A Serry
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Gabriel A Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Entre Ríos 3100, Argentina
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Sean D Peterson
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada.
| |
Collapse
|
20
|
Savareh S, Moradi N, Yazdi MJS, Soltani M, Latifi M. Immediate Effects of Semi-occluded Vocal Tract Exercises as a Vocal Warm-Up in Singers. J Voice 2023; 37:875-880. [PMID: 34256979 DOI: 10.1016/j.jvoice.2021.05.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 05/20/2021] [Accepted: 05/26/2021] [Indexed: 10/20/2022]
Abstract
OBJECTIVE The present study aimed to investigate the Immediate Effects of Semi-occluded Vocal Tract Exercises (SOVTEs) as a vocal warm-up on the Electrical Activity of Extrinsic Laryngeal Muscles and acoustic parameters of voice in singers. STUDY DESIGN This study used a pre-/post-interventional (quasi-experimental) design and a simple non-random sampling method. SETTING Musculoskeletal Rehabilitation Research Center. METHOD Surface electromyography of the suprahyoid and infrahyoid muscle areas and acoustic measurements were assessed before and immediately after performing three SOVTEs: lip-trill, humming, and straw phonation for 20 minutes as a vocal warm-up in eleven healthy male singers with an average age of 26.5 ± 4.2. RESULTS After performing the vocal warm-up, root mean square in channels two and three (left and right infrahyoid muscle areas) in all vocal tasks, including /i/, /u/, and counting from 20 to 30 was significantly decreased (P < 0.05). The acoustic parameters, F1, and F1-F0 difference were significantly decreased (P < 0.05). CONCLUSION The results suggest that after performing SOVTEs as a vocal warm-up, the electrical activity of the extrinsic laryngeal muscles is reduced. The acoustic analysis also demonstrated a decrease in F0, F1, and F1-F0 variables. Therefore, it seems that the results of acoustic and electromyographic assessments are in line and it is likely that SOVTEs through the vocal economy lead to a reduction in the load applied to the phonatory system.
Collapse
Affiliation(s)
- Sirvan Savareh
- Musculoskeletal Rehabilitation Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Negin Moradi
- Musculoskeletal Rehabilitation Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran.
| | | | - Majid Soltani
- Musculoskeletal Rehabilitation Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Mahmood Latifi
- Department of Statistics and Epidemiology, School of Health, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| |
Collapse
|
21
|
Marks KL, Díaz Cádiz ME, Toles LE, Buckley DP, Tracy LF, Noordzji JP, Grillone GA, Stepp CE. Automated Creak Differentiates Adductor Laryngeal Dystonia and Muscle Tension Dysphonia. Laryngoscope 2023; 133:2687-2694. [PMID: 36715109 PMCID: PMC10387123 DOI: 10.1002/lary.30588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 12/21/2022] [Accepted: 01/11/2023] [Indexed: 01/31/2023]
Abstract
OBJECTIVE The purpose of this study was to determine whether automated estimates of vocal creak would differentiate speakers with adductor laryngeal dystonia (AdLD) from speakers with muscle tension dysphonia (MTD) and speakers without voice disorders. METHODS Sixteen speakers with AdLD, sixteen speakers with MTD, and sixteen speakers without voice disorders were recorded in a quiet environment reading aloud a standard paragraph. An open-source creak detector was used to calculate the percentage of creak (% creak) in each of the speaker's six recorded sentences. RESULTS A Kruskal-Wallis one-way analysis of variance revealed a statistically significant effect of group on the % creak with a large effect size. Pairwise Wilcoxon tests revealed a statistically significant difference in % creak between speakers with AdLD and controls as well as between speakers with AdLD and MTD. Receiver operating characteristic curve analyses indicated that % creak differentiated AdLD from both controls and speakers with MTD with high sensitivity and specificity (area under the curve statistics of 0.94 and 0.86, respectively). CONCLUSION Percentage of creak as calculated by an automated creak detector may be useful as a quantitative indicator of AdLD, demonstrating the potential for use as a screening tool or to aid in a differential diagnosis. LEVEL OF EVIDENCE 3 Laryngoscope, 133:2687-2694, 2023.
Collapse
Affiliation(s)
- Katherine L Marks
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, U.S.A
| | - Manuel E Díaz Cádiz
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, U.S.A
| | - Laura E Toles
- Department of Otolaryngology - Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, Texas, U.S.A
| | - Daniel P Buckley
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, U.S.A
- Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, U.S.A
| | - Lauren F Tracy
- Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, U.S.A
| | - J Pieter Noordzji
- Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, U.S.A
| | - Gregory A Grillone
- Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, U.S.A
| | - Cara E Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, U.S.A
- Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, U.S.A
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, U.S.A
| |
Collapse
|
22
|
Schwambergová D, Pátková Ž, Třebická Fialová J, Třebický V, Stella D, Havlíček J. Immunoactivation Affects Perceived Body Odor and Facial but Not Vocal Attractiveness. Evol Psychol 2023; 21:14747049231218010. [PMID: 38087426 PMCID: PMC10722934 DOI: 10.1177/14747049231218010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Accepted: 11/16/2023] [Indexed: 12/18/2023] Open
Abstract
Several previous studies have shown that in mammals, the health status of conspecifics can be assessed based on perceptual cues. Olfactory, visual, or acoustic cues may lead to avoidant behavior, thus reducing the risk of contagion by close contact with infected individuals. We tested whether immune system activation after immunization leads to perceptible changes in body odor and facial and vocal attractiveness in humans. We have experimentally activated the immune system of male participants using vaccination against hepatitis A/B and meningococcus. Their body odor, facial photographs, and vocal recordings were collected before and 14 days after vaccination. Subsequently, the body odor samples, facial photographs, and vocal recordings were assessed by female raters for their attractiveness and healthiness. We have also measured skin coloration (from facial photographs and in vivo using a spectrophotometer), vocal parameters, and C-reactive protein (CRP) levels as a marker of inflammation. We found an increase in perceived body odor attractiveness, a decrease in facial attractiveness and healthiness, and no change in vocal attractiveness 14 days after vaccination compared to the prevaccination condition. Moreover, there was no change in facial coloration or vocal parameters between the prevaccination and postvaccination conditions. Prevaccination CRP levels were negatively associated with body odor and facial attractiveness and positively associated with body odor intensity. Overall, our results suggest that perceived body odor as well as facial but not vocal attractiveness may provide cues to activation of the immune response and that each modality may carry specific information about the individual's condition.
Collapse
Affiliation(s)
- Dagmar Schwambergová
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Žaneta Pátková
- Department of Philosophy and History of Sciences, Faculty of Science, Charles University, Prague, Czech Republic
| | | | - Vít Třebický
- Faculty of Physical Education and Sport, Charles University, Prague, Czech Republic
| | - David Stella
- Department of the Human Dimensions of Global Change, Global Change Research Institute CAS, Brno, Czech Republic
| | - Jan Havlíček
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| |
Collapse
|
23
|
Redman YG, Glasner JD, D'Orazio D, Bottalico P. Singing in different performance spaces: The effect of room acoustics on singers' perception. J Acoust Soc Am 2023; 154:2256-2264. [PMID: 37819023 DOI: 10.1121/10.0021331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 09/24/2023] [Indexed: 10/13/2023]
Abstract
Classical singers' performances vary across different acoustic environments. The changes in the delivery are influenced by the singer's perception of the venue's acoustics. This study investigated these relationships using nine professional or semi-professional classical singers. Participants performed Giordani's "Caro mio ben" aria in five venues, and the acoustic parameters reverberance (T30 and EDT), clarity (C80), early vocal support (STv), and tonal color (EDTf) were measured. From a factor analysis of the subjective analysis three major factors emerged that, we propose, would represent three generalized percepts of Room Supportiveness, Room Noiselessness, and Room Timbre. These percepts correlated significantly with objective acoustic parameters traditionally linked to vocal support, reverberation, and timbre. Room Supportiveness and Room Noiselessness significantly contributed to the singers' likability of the acoustic environment, while Room Timbre did not. This indicates that singers' perceptual preference for a performance space may be influenced by factors affecting both auditory feedback and vocal function. These findings underscore the need for performing space designers to consider the unique needs of all stakeholders, including listeners and performers. The study contributes to the bridging of the gap between subjective perceptions and objective measurements, providing valuable insights for acoustic design considerations.
Collapse
Affiliation(s)
- Yvonne Gonzales Redman
- School of Music, University of Illinois Urbana Champaign, Champaign, Illinois 61820, USA
| | - Joshua D Glasner
- School of Graduate and Professional Studies, Delaware Valley University, Doylestown, Pennsylvania 18901, USA
| | - Dario D'Orazio
- Department of Industrial Engineering, University of Bologna, Bologna 40136, Italy
| | - Pasquale Bottalico
- Department of Speech and Hearing Science, University of Illinois Urbana Champaign, Champaign, Illinois 61820, USA
| |
Collapse
|
24
|
Misono S, Xu J, Oh J, Sombrio A, Stockness A, Mahnan A, Konczak J. Atypical Activation of Laryngeal Somatosensory-Motor Cortex During Vocalization in People With Unexplained Chronic Cough. JAMA Otolaryngol Head Neck Surg 2023; 149:820-827. [PMID: 37471077 PMCID: PMC10360007 DOI: 10.1001/jamaoto.2023.1757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 05/23/2023] [Indexed: 07/21/2023]
Abstract
Importance Unexplained chronic cough is common and has substantial negative quality-of-life implications, yet its causes are not well understood. A better understanding of how peripheral and central neural processes contribute to chronic cough is essential for treatment design. Objective To determine if people with chronic cough exhibit signs of abnormal neural processing over laryngeal sensorimotor cortex during voluntary laryngeal motor activity such as vocalization. Design, Setting, and Participants This was a cross-sectional study of a convenience sample of participants with chronic cough and healthy participants. Testing was performed in an acoustically and electromagnetically shielded chamber. In a single visit, electroencephalographic (EEG) signals were recorded from participants with chronic cough and healthy participants during voice production. The chronic cough group participants presented with unexplained cough of 8 weeks or longer duration with prior medical evaluation including negative results of chest imaging. None of the participants had a history of any neurologic disease known to impair vocalization or swallowing. Data collection for the healthy control group occurred from February 2 to June 28, 2018, and for the chronic cough group, from November 22, 2021, to June 21, 2022. Data analysis was performed from May 1 to October 30, 2022. Exposure Participants with or without chronic cough. Main Outcome Measures Event-related spectral perturbation over the laryngeal area of somatosensory-motor cortex from 0 to 30 Hz (ie, θ, α, and β bands) and event-related coherence as a measure of synchronous activity between somatosensory and motor cortical regions. Results The chronic cough group comprised 13 participants with chronic cough (mean [SD] age, 63.5 [7.8] years; 9 women and 4 men) and the control group, 10 healthy age-matched individuals (mean [SD] age, 60.3 [13.9] years; 6 women and 4 men). In the chronic cough group, the typical movement-related desynchronization over somatosensory-motor cortex during vocalization was significantly reduced across θ, α, and β frequency bands when compared with the control group. Conclusions and Relevance This cross-sectional study found that the typical movement-related suppression of brain oscillatory activity during vocalization is weak or absent in people with chronic cough. Thus, chronic cough affects sensorimotor cortical activity during the asymptomatic voluntary activation of laryngeal muscles.
Collapse
Affiliation(s)
- Stephanie Misono
- Department of Otolaryngology, Head and Neck Surgery, University of Minnesota, Minneapolis
| | - Jiapeng Xu
- Human Sensorimotor Control Laboratory, School of Kinesiology, University of Minnesota, Minneapolis
| | - Jinseok Oh
- Human Sensorimotor Control Laboratory, School of Kinesiology, University of Minnesota, Minneapolis
- Center for Clinical Movement Science, University of Minnesota, Minneapolis
- Department of Behavioral Pediatrics, Children’s Hospital Los Angeles, Los Angeles, California
| | - Anna Sombrio
- Department of Otolaryngology, Head and Neck Surgery, University of Minnesota, Minneapolis
| | - Ali Stockness
- Department of Otolaryngology, Head and Neck Surgery, University of Minnesota, Minneapolis
| | - Arash Mahnan
- Reality Labs Health and Safety UXR, Meta, Redmond, Washington
| | - Jürgen Konczak
- Human Sensorimotor Control Laboratory, School of Kinesiology, University of Minnesota, Minneapolis
- Center for Clinical Movement Science, University of Minnesota, Minneapolis
| |
Collapse
|
25
|
Liu D, Chang Y, Dai G, Guo Z, Jones JA, Li T, Chen X, Chen M, Li J, Wu X, Liu P, Liu H. Right, but not left, posterior superior temporal gyrus is causally involved in vocal feedback control. Neuroimage 2023; 278:120282. [PMID: 37468021 DOI: 10.1016/j.neuroimage.2023.120282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 06/25/2023] [Accepted: 07/16/2023] [Indexed: 07/21/2023] Open
Abstract
The posterior superior temporal gyrus (pSTG) has been implicated in the integration of auditory feedback and motor system for controlling vocal production. However, the question as to whether and how the pSTG is causally involved in vocal feedback control is currently unclear. To this end, the present study selectively stimulated the left or right pSTG with continuous theta burst stimulation (c-TBS) in healthy participants, then used event-related potentials to investigate neurobehavioral changes in response to altered auditory feedback during vocal pitch regulation. The results showed that, compared to control (vertex) stimulation, c-TBS over the right pSTG led to smaller vocal compensations for pitch perturbations accompanied by smaller cortical N1 and larger P2 responses. Enhanced P2 responses received contributions from the right-lateralized temporal and parietal regions as well as the insula, and were significantly correlated with suppressed vocal compensations. Surprisingly, these effects were not found when comparing c-TBS over the left pSTG with control stimulation. Our findings provide evidence, for the first time, that supports a causal relationship between right, but not left, pSTG and auditory-motor integration for vocal pitch regulation. This lends support to a right-lateralized contribution of the pSTG in not only the bottom-up detection of vocal feedback errors but also the involvement of driving motor commands for error correction in a top-down manner.
Collapse
Affiliation(s)
- Dongxu Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Yichen Chang
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Guangyan Dai
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Zhiqiang Guo
- School of Computer, Zhuhai College of Science and Technology, Zhuhai, China
| | - Jeffery A Jones
- Department of Psychology and Laurier Centre for Cognitive Neuroscience, Wilfrid Laurier University, Waterloo, Ontario N2L 3C5, Canada
| | - Tingni Li
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China; Centre for Eye and Vision Research, 17W Science Park, Hong Kong SAR, China
| | - Xi Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Mingyun Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Jingting Li
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
| | - Xiuqin Wu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Peng Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China; Guangdong Provincial Key Laboratory of Brain Function and Disease, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
26
|
Calvache Mora CA, Cantor-Cutiva LC, Hunter EJ, Guzmán M, Soláque L. Systematic Review of Literature on Vocal Demand Response: Understanding Physiology, Measurements, and Associated Factors. Folia Phoniatr Logop 2023; 76:1-21. [PMID: 37393892 DOI: 10.1159/000531678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 06/13/2023] [Indexed: 07/04/2023] Open
Abstract
PURPOSE Considering the conceptual migration from vocal load and vocal loading to vocal demand and vocal demand response, this review of literature aimed to identify physiological explanations, reported measurements, and associated factors (vocal demands) reported in the literature when considering the phonatory response to a vocal demand. METHODS A systematic review of literature, following the PRISMA Statement, was conducted using Web of Science, PubMed, Scopus, and ScienceDirect. Data were analyzed and presented in two parts. First, a bibliometric analysis, co-occurrence analysis, and content analysis were performed. Three criteria that got article inclusion were defined: (1) written in English, Spanish, and Portuguese; (2) published between 2009 and 2021; and (3) focused on vocal load and loading, vocal demand response, and voice assessment parameters. A total of 54 publications met the criteria and were included in this review. The second part included a conceptual framework based on the content analysis of three aspects of vocal demand response: (1) physiological explanations, (2) reported measurements, and (3) vocal demands. RESULTS AND CONCLUSION As would be expected since vocal demand response is a relatively new term and not yet commonly used in literature when discussing way that the speakers respond to communicative scenarios, most of the studies reviewed (both historical and recent) still use the term of vocal load and vocal loading. Although there is a broad variety of literature discussing a wide range of vocal demands and voice parameters used to characterize the vocal demand response, results show that there is consistency across the studies. While vocal demand response is unique and intrinsic to the talker, associated factors that contribute to this response include both internal talker and external talker factors. Internal factors include muscle stiffness, viscosity in the phonatory system, vocal fold tissue damage, elevated sound pressure levels during occupational voice demands, extended periods of voice use, suboptimal body posture, difficulties in breathing technique, and sleep disturbances. Associated external factors include the working environment (noise, acoustics, temperature, humidity). In conclusion, although vocal demand response is intrinsic to the speaker, the speaker's response is affected by external vocal demands. However, due to the wide methods to evaluate vocal demand response, it has been difficult to establish its contribution to voice disorders in the general population and, specifically, among occupational voice users. This literature review identified commonly reported parameters and factors that may help clinicians and researchers define vocal demand response.
Collapse
Affiliation(s)
- Carlos Alberto Calvache Mora
- Department of Communicative Sciences and Disorders, Corporación Universitaria Iberoamericana, Bogotá, Colombia
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
- Vocology Center, Bogotá, Colombia
| | - Lady Catherine Cantor-Cutiva
- Vocology Center, Bogotá, Colombia
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan, USA
| | - Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan, USA
| | - Marco Guzmán
- Department of Communication Sciences and Disorders, Universidad de los Andes, Santiago, Chile
| | - Leonardo Soláque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
27
|
Wen F, Gao J, Ke W, Zuo B, Dai Y, Ju Y, Long J. The Effect of Face-Voice Gender Consistency on Impression Evaluation. Arch Sex Behav 2023; 52:1123-1139. [PMID: 36719490 DOI: 10.1007/s10508-022-02524-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Revised: 12/24/2022] [Accepted: 12/25/2022] [Indexed: 06/18/2023]
Abstract
Face and voice are important information cues of interpersonal interaction. Most previous studies have investigated the cross-modal perception of face and voice from the perspective of cognitive psychology, but few empirical studies have focused on the effect of gender consistency of face and voice on the impression evaluation of the target from the perspective of social cognition. Based on the two-stage model of stereotype activation and the stereotype content model, this research examined the effects of face-voice gender consistency on impression evaluation (gender categorization and warmth competence evaluation) by using a cross-modal priming paradigm (Study 1, 20 males and 23 females, Mage = 21.00, SDage = 2.59), a sequential presentation task (Study 2a, 57 males and 70 females, Mage = 18.54, SDage = 1.54; Study 2b, 52 males and 51 females, Mage = 18.54, SDage = 1.36), and a simultaneous presentation task (Study 3, 51 males and 55 females, Mage = 23.58, SDage = 3.20), respectively. The results showed that: (1) there was a face-voice gender consistency preference in gender categorization, and the response of face-voice consistent condition was faster than that of inconsistent condition; (2) compared with the face-voice gender-inconsistent individuals, the participants showed a higher and more stable evaluation of the warmth and competence of the gender-consistent individuals, indicating the effect of matching preference of the face-voice gender consistency on the impression evaluation; (3) people paid more attention to the gender information of faces in the impression evaluation, and the female face could improve people's evaluation on the target's warmth and competence; (4) males were more intolerant of face-voice gender inconsistency when presented sequentially; the "voice needs to match face" effect was stronger for females when presented simultaneously. These findings, on the one hand, enrich and expand previous theories and research on cross-modal processing of face and voice from the perspective of social cognitive impression evaluation; on the other hand, these findings have important practical implications for impression management and decision-making in social interaction.
Collapse
Affiliation(s)
- Fangfang Wen
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China
| | - Jia Gao
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China
| | - Wenlin Ke
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China
| | - Bin Zuo
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China.
| | - Yu Dai
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China
| | - Yiyan Ju
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China
| | - Jiahui Long
- School of Psychology, Center for Studies of Social Psychology, Central China Normal University, Wuhan, 430079, China
| |
Collapse
|
28
|
Jeong H, Yoo JY, Ouyang W, Greane ALJX, Wiebe AJ, Huang I, Lee YJ, Lee JY, Kim J, Ni X, Kim S, Huynh HLT, Zhong I, Chin YX, Gu J, Johnson AM, Brancaccio T, Rogers JA. Closed-loop network of skin-interfaced wireless devices for quantifying vocal fatigue and providing user feedback. Proc Natl Acad Sci U S A 2023; 120:e2219394120. [PMID: 36802437 PMCID: PMC9992836 DOI: 10.1073/pnas.2219394120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 01/12/2023] [Indexed: 02/23/2023] Open
Abstract
Vocal fatigue is a measurable form of performance fatigue resulting from overuse of the voice and is characterized by negative vocal adaptation. Vocal dose refers to cumulative exposure of the vocal fold tissue to vibration. Professionals with high vocal demands, such as singers and teachers, are especially prone to vocal fatigue. Failure to adjust habits can lead to compensatory lapses in vocal technique and an increased risk of vocal fold injury. Quantifying and recording vocal dose to inform individuals about potential overuse is an important step toward mitigating vocal fatigue. Previous work establishes vocal dosimetry methods, that is, processes to quantify vocal fold vibration dose but with bulky, wired devices that are not amenable to continuous use during natural daily activities; these previously reported systems also provide limited mechanisms for real-time user feedback. This study introduces a soft, wireless, skin-conformal technology that gently mounts on the upper chest to capture vibratory responses associated with vocalization in a manner that is immune to ambient noises. Pairing with a separate, wirelessly linked device supports haptic feedback to the user based on quantitative thresholds in vocal usage. A machine learning-based approach enables precise vocal dosimetry from the recorded data, to support personalized, real-time quantitation and feedback. These systems have strong potential to guide healthy behaviors in vocal use.
Collapse
Affiliation(s)
- Hyoyoung Jeong
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
- Department of Electrical and Computer Engineering, University of California, Davis, CA95616
| | - Jae-Young Yoo
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
| | - Wei Ouyang
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
| | - Aurora Lee Jean Xue Greane
- Bienen School of Music, Northwestern University, Evanston, IL60208
- Department of Biomedical Engineering, Northwestern University, Evanston, IL60208
| | | | - Ivy Huang
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
- Department of Materials Science Engineering, Northwestern University, Evanston, IL60208
| | - Young Joong Lee
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
- Department of Mechanical Engineering, Northwestern University, Evanston, IL60208
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA02142
| | - Jong Yoon Lee
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
- Sibel Health, Niles, IL60714
| | - Joohee Kim
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
- Center for Bionics, Biomedical Research Institute, Korea Institute of Science and Technology, Seoul02792, South Korea
| | - Xinchen Ni
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
| | - Suyeon Kim
- Department of Biomedical Engineering, Northwestern University, Evanston, IL60208
| | - Huong Le-Thien Huynh
- Department of Biomedical Engineering, Northwestern University, Evanston, IL60208
| | - Isabel Zhong
- Department of Biomedical Engineering, Northwestern University, Evanston, IL60208
| | - Yu Xuan Chin
- Department of Biomedical Engineering, Northwestern University, Evanston, IL60208
| | - Jianyu Gu
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
| | - Aaron M. Johnson
- Department of Otolaryngology-Head and Neck Surgery, Grossman School of Medicine, New York University, New York, NY10016
- Department of Rehabilitation Medicine, Grossman School of Medicine, New York University, New York, NY10016
| | | | - John A. Rogers
- Querrey Simpson Institute for Bioelectronics, Northwestern University, Evanston, IL60208
- Department of Biomedical Engineering, Northwestern University, Evanston, IL60208
- Department of Materials Science Engineering, Northwestern University, Evanston, IL60208
- Department of Mechanical Engineering, Northwestern University, Evanston, IL60208
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL60208
- Department of Chemistry, Northwestern University, Evanston, IL60208
- Department of Neurological Surgery, Northwestern University, Evanston, IL60208
| |
Collapse
|
29
|
Rhone AE, Rupp K, Hect JL, Harford E, Tranel D, Howard MA, Abel TJ. Electrocorticography reveals the dynamics of famous voice responses in human fusiform gyrus. J Neurophysiol 2023; 129:342-346. [PMID: 36576268 PMCID: PMC9886354 DOI: 10.1152/jn.00459.2022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 12/14/2022] [Accepted: 12/20/2022] [Indexed: 12/29/2022] Open
Abstract
Voice and face processing occur through convergent neural systems that facilitate speaker recognition. Neuroimaging studies suggest that familiar voice processing engages early visual cortex, including the bilateral fusiform gyrus (FG) on the basal temporal lobe. However, what role the FG plays in voice processing and whether it is driven by bottom-up or top-down mechanisms is unresolved. In this study we directly examined neural responses to famous voices and faces in human FG with direct cortical surface recordings (electrocorticography) in epilepsy surgery patients. We tested the hypothesis that neural populations in human FG respond to famous voices and investigated the temporal properties of voice responses in FG. Recordings were acquired from five adult participants during a person identification task using visual and auditory stimuli from famous speakers (U.S. Presidents Barack Obama, George W. Bush, and Bill Clinton). Patients were presented with images of presidents or clips of their voices and asked to identify the portrait/speaker. Our results demonstrate that a subset of face-responsive sites in and near FG also exhibit voice responses that are both lower in magnitude and delayed (300-600 ms) compared with visual responses. The dynamics of voice processing revealed by direct cortical recordings suggests a top-down feedback-mediated response to famous voices in FG that may facilitate speaker identification.NEW & NOTEWORTHY Interactions between auditory and visual cortices play an important role in person identification, but the dynamics of these interactions remain poorly understood. We performed direct brain recordings of fusiform face cortex in human epilepsy patients performing a famous voice naming task, revealing the dynamics of famous voice processing in human fusiform face cortex. The findings support a model of top-down interactions from auditory to visual cortex to facilitate famous voice recognition.
Collapse
Affiliation(s)
- Ariane E Rhone
- Department of Neurosurgery, University of Iowa, Iowa City, Iowa
| | - Kyle Rupp
- Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Jasmine L Hect
- Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Emily Harford
- Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Daniel Tranel
- Department of Psychology, University of Iowa, Iowa City, Iowa
| | | | - Taylor J Abel
- Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, Pennsylvania
| |
Collapse
|
30
|
Patel S, Hebert K, Korzyukov O, Larson CR. Effects of sensorimotor voice training on event-related potentials to pitch-shifted auditory feedback. PLoS One 2023; 18:e0269326. [PMID: 36662730 PMCID: PMC9858400 DOI: 10.1371/journal.pone.0269326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 11/20/2022] [Indexed: 01/21/2023] Open
Abstract
The pitch perturbation technique is a validated technique that has been used for over 30 years to understand how people control their voice. This technique involves altering a person's voice pitch in real-time while they produce a vowel (commonly, a prolonged /a/ sound). Although post-task changes in the voice have been observed in several studies (e.g., a change in mean fo across the duration of the experiment), the potential for using the pitch perturbation technique as a training tool for voice pitch regulation and/or modification has not been explored. The present study examined changes in event related potentials (ERPs) and voice pitch in three groups of subjects due to altered voice auditory feedback following a brief, four-day training period. Participants in the opposing group were trained to change their voice fo in the opposite direction of a pitch perturbation stimulus. Participants in the following group were trained to change their voice fo in the same direction as the pitch perturbation stimulus. Participants in the non-varying group did not voluntarily change their pitch, but instead were asked to hold their voice constant when they heard pitch perturbations. Results showed that all three types of training affected the ERPs and the voice pitch-shift response from pre-training to post-training (i.e., "hold your voice pitch steady" task; an indicator of voice pitch regulation). Across all training tasks, the N1 and P2 components of the ERPs occurred earlier, and the P2 component of the ERPs occurred with larger amplitude post-training. The voice responses also occurred earlier but with a smaller amplitude following training. These results demonstrate that participation in pitch-shifted auditory feedback tasks even for brief periods of time can modulate the automatic tendency to compensate for alterations in voice pitch feedback and has therapeutic potential.
Collapse
Affiliation(s)
- Sona Patel
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, United States of America
- Department of Speech-Language Pathology, Seton Hall University, Nutley, NJ, United States of America
| | - Karen Hebert
- Department of Occupational Therapy, University of South Dakota, Vermillion, SD, United States of America
| | - Oleg Korzyukov
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, United States of America
- Department of Communication Sciences and Disorders, University of Wisconsin—Whitewater, Whitewater, WI, United States of America
| | - Charles R. Larson
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, United States of America
| |
Collapse
|
31
|
Ngo QC, Motin MA, Pah ND, Drotár P, Kempster P, Kumar D. Computerized analysis of speech and voice for Parkinson's disease: A systematic review. Comput Methods Programs Biomed 2022; 226:107133. [PMID: 36183641 DOI: 10.1016/j.cmpb.2022.107133] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 09/13/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Speech impairment is an early symptom of Parkinson's disease (PD). This study has summarized the literature related to speech and voice in detecting PD and assessing its severity. METHODS A systematic review of the literature from 2010 to 2021 to investigate analysis methods and signal features. The keywords "Automatic analysis" in conjunction with "PD speech" or "PD voice" were used, and the PubMed and ScienceDirect databases were searched. A total of 838 papers were found on the first run, of which 189 were selected. One hundred and forty-seven were found to be suitable for the review. The different datasets, recording protocols, signal analysis methods and features that were reported are listed. Values of the features that separate PD patients from healthy controls were tabulated. Finally, the barriers that limit the wide use of computerized speech analysis are discussed. RESULTS Speech and voice may be valuable markers for PD. However, large differences between the datasets make it difficult to compare different studies. In addition, speech analytic methods that are not informed by physiological understanding may alienate clinicians. CONCLUSIONS The potential usefulness of speech and voice for the detection and assessment of PD is confirmed by evidence from the classification and correlation results.
Collapse
Affiliation(s)
| | - Mohammod Abdul Motin
- Biosignals Lab, RMIT University, Melbourne, Australia; Department of Electrical & Electronic Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Nemuel Daniel Pah
- Biosignals Lab, RMIT University, Melbourne, Australia; Universitas Surabaya, Indonesia
| | - Peter Drotár
- Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001, Kosice, Slovakia
| | - Peter Kempster
- Neurosciences Department, Monash Health, Clayton, VIC, Australia; Department of Medicine, School of Clinical Sciences, Monash University, Clayton, VIC, Australia
| | - Dinesh Kumar
- Biosignals Lab, RMIT University, Melbourne, Australia.
| |
Collapse
|
32
|
Abstract
Categorising voices is crucial for auditory-based social interactions. This Primer explores a PLOS Biiology study that capitalises on human intracranial recordings to describe the spatiotemporal pattern of neural activity leading to voice-selective responses in associative auditory cortex.
Collapse
Affiliation(s)
- Benjamin Morillon
- Aix Marseille University, Inserm, Institut de Neurosciences des Systèmes (INS), Marseille, France
- * E-mail:
| | - Luc H. Arnal
- Institut de l’Audition, Inserm unit 1120, Institut Pasteur, Paris, France
| | - Pascal Belin
- Aix Marseille University, CNRS, La Timone Neuroscience Institute (INT), Marseille, France
| |
Collapse
|
33
|
Rinke P, Schmidt T, Beier K, Kaul R, Scharinger M. Rapid pre-attentive processing of a famous speaker: Electrophysiological effects of Angela Merkel's voice. Neuropsychologia 2022; 173:108312. [PMID: 35781011 DOI: 10.1016/j.neuropsychologia.2022.108312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 06/27/2022] [Accepted: 06/27/2022] [Indexed: 11/18/2022]
Abstract
The recognition of human speakers by their voices is a remarkable cognitive ability. Previous research has established a voice area in the right temporal cortex involved in the integration of speaker-specific acoustic features. This integration appears to occur rapidly, especially in case of familiar voices. However, the exact time course of this process is less well understood. To this end, we here investigated the automatic change detection response of the human brain while listening to the famous voice of German chancellor Angela Merkel, embedded in the context of acoustically matched voices. A classic passive oddball paradigm contrasted short word stimuli uttered by Merkel with word stimuli uttered by two unfamiliar female speakers. Electrophysiological voice processing indices from 21 participants were quantified as mismatch negativities (MMNs) and P3a differences. Cortical sources were approximated by variable resolution electromagnetic tomography. The results showed amplitude and latency effects for both MMN and P3a: The famous (familiar) voice elicited a smaller but earlier MMN than the unfamiliar voices. The P3a, by contrast, was both larger and later for the familiar than for the unfamiliar voices. Familiar-voice MMNs originated from right-hemispheric regions in temporal cortex, overlapping with the temporal voice area, while unfamiliar-voice MMNs stemmed from left superior temporal gyrus. These results suggest that the processing of a very famous voice relies on pre-attentive right temporal processing within the first 150 ms of the acoustic signal. The findings further our understanding of the neural dynamics underlying familiar voice processing.
Collapse
Affiliation(s)
- Paula Rinke
- Research Group Phonetics, Institute of German Linguistics, Philipps-University Marburg, Germany; Center for Mind, Brain & Behavior, Universities of Marburg & Gießen, Germany
| | - Tatjana Schmidt
- Center for Mind, Brain & Behavior, Universities of Marburg & Gießen, Germany; Faculté de biologie et de médecine, University of Lausanne, Switzerland
| | - Kjartan Beier
- Research Group Phonetics, Institute of German Linguistics, Philipps-University Marburg, Germany
| | - Ramona Kaul
- Research Group Phonetics, Institute of German Linguistics, Philipps-University Marburg, Germany
| | - Mathias Scharinger
- Research Group Phonetics, Institute of German Linguistics, Philipps-University Marburg, Germany; Research Center »Deutscher Sprachatlas«, Philipps-University Marburg, Germany; Center for Mind, Brain & Behavior, Universities of Marburg & Gießen, Germany.
| |
Collapse
|
34
|
Weerathunge HR, Alzamendi GA, Cler GJ, Guenther FH, Stepp CE, Zañartu M. LaDIVA: A neurocomputational model providing laryngeal motor control for speech acquisition and production. PLoS Comput Biol 2022; 18:e1010159. [PMID: 35737706 PMCID: PMC9258861 DOI: 10.1371/journal.pcbi.1010159] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 07/06/2022] [Accepted: 05/02/2022] [Indexed: 11/18/2022] Open
Abstract
Many voice disorders are the result of intricate neural and/or biomechanical impairments that are poorly understood. The limited knowledge of their etiological and pathophysiological mechanisms hampers effective clinical management. Behavioral studies have been used concurrently with computational models to better understand typical and pathological laryngeal motor control. Thus far, however, a unified computational framework that quantitatively integrates physiologically relevant models of phonation with the neural control of speech has not been developed. Here, we introduce LaDIVA, a novel neurocomputational model with physiologically based laryngeal motor control. We combined the DIVA model (an established neural network model of speech motor control) with the extended body-cover model (a physics-based vocal fold model). The resulting integrated model, LaDIVA, was validated by comparing its model simulations with behavioral responses to perturbations of auditory vocal fundamental frequency (fo) feedback in adults with typical speech. LaDIVA demonstrated capability to simulate different modes of laryngeal motor control, ranging from short-term (i.e., reflexive) and long-term (i.e., adaptive) auditory feedback paradigms, to generating prosodic contours in speech. Simulations showed that LaDIVA’s laryngeal motor control displays properties of motor equivalence, i.e., LaDIVA could robustly generate compensatory responses to reflexive vocal fo perturbations with varying initial laryngeal muscle activation levels leading to the same output. The model can also generate prosodic contours for studying laryngeal motor control in running speech. LaDIVA can expand the understanding of the physiology of human phonation to enable, for the first time, the investigation of causal effects of neural motor control in the fine structure of the vocal signal.
Collapse
Affiliation(s)
- Hasini R. Weerathunge
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
- * E-mail:
| | - Gabriel A. Alzamendi
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Argentina
| | - Gabriel J. Cler
- Department of Speech & Hearing Sciences, University of Washington, Seattle, Washington, United States of America
| | - Frank H. Guenther
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
- Department of Otolaryngology-Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, United States of America
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
35
|
Ning LH. Comparison of Involuntary and Volitional Responses to Pitch-Shifted Auditory Feedback: Evidence for Tone Speakers' Flexibility to Switch Between Opposing and Following Responses. J Speech Lang Hear Res 2022; 65:2160-2186. [PMID: 35537117 DOI: 10.1044/2022_jslhr-21-00597] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
PURPOSE Our audio-vocal system involves a negative feedback system that functions to correct for fundamental frequency (f 0) errors in production. Therefore, automatic and opposing responses appear when an unexpected change in voice pitch is present in auditory feedback. This study explores following responses to pitch perturbation in auditory feedback in tonal language speakers, which have been commonly overlooked or discarded by past research. We examine whether the number of response types (opposing vs. following) and their dynamic f 0 contours in tone word production vary as a function of instruction (involuntary ["to ignore"] vs. volitional ["to compensate"]). METHOD Twenty-four native speakers of Taiwanese Southern Min (TSM) produced three disyllabic TSM words while receiving pitch perturbation through headphones. The three disyllabic words were tsau55-im55 ("out of tune"; HH), kau33-uann33 ("exchange"; MM), and pan11-an11 ("handle a case"; LL) that carry an identical high-, mid-, or low-level tone. The participants were instructed either "to ignore" or "to compensate" for the pitch shifts. RESULTS Results from a Bayesian Poisson regression show that the number of opposing and following responses were split nearly 50-50 for the "ignore" condition and 55%-35% for the "compensate" condition. The simulation results indicate that the speakers were able to switch between the feedback and the feedforward mode during the testing. On the other hand, f 0 contour analyses using generalized additive models show that pitch-increasing responses (i.e., oppose to downshifts or follow upshifts) were significantly larger than pitch-decreasing responses (i.e., oppose to upshifts or follow downshifts) for the MM and LL words, but not for the HH word. CONCLUSIONS Our results provide support for the view that, for tone speakers, following responses are not uncommon. The magnitudes of pitch shift response trajectories have to do with the available pitch range for moving up or down in tone word production.
Collapse
Affiliation(s)
- Li-Hsin Ning
- Department of English, National Taiwan Normal University, Taipei City
| |
Collapse
|
36
|
O'Keeffe R, Shirazi SY, Mehrdad S, Crosby T, Johnson AM, Atashzar SF. Perilaryngeal-Cranial Functional Muscle Network Differentiates Vocal Tasks: A Multi-Channel sEMG Approach. IEEE Trans Biomed Eng 2022; 69:3678-3688. [PMID: 35594214 DOI: 10.1109/tbme.2022.3175948] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
OBJECTIVE Objective evaluation of physiological responses using non-invasive methods for the assessment of vocal performance and voice disorders has attracted great interest. This paper, for the first time, aims to implement and evaluate perilaryngeal-cranial functional muscle networks. The study investigates the variations in topographical characteristics of the network and the corresponding ability to differentiate vocal tasks. METHOD Twelve surface electromyography (sEMG) signals were collected bilaterally from six perilaryngeal and cranial muscles. Data were collected from eight subjects (four females) without a known history of voice disorders. The proposed muscle network is composed of pairwise coherence between sEMG recordings. The network metrics include (a) network degree and (b) weighted clustering coefficient (WCC). RESULTS The varied phonation tasks showed the median degree and WCC of the muscle network ascend monotonically, with a high effect size ( |rrb| ∼ 0.5). Pitch glide, singing, and speech tasks were significantly distinguishable using degree and WCC ( |rrb| ∼ 0.8). Also, pitch glide had the highest degree and WCC among all tasks (degree , WCC ). In comparison, classic spectrotemporal measures showed far less effectiveness (max |rrb|=0.12) in differentiating the vocal tasks. CONCLUSION Perilaryngeal-cranial functional muscle network was proposed in this paper. The study showed that the functional muscle network could robustly differentiate the vocal tasks while the classic assessment of muscle activation fails to differentiate. SIGNIFICANCE For the first time, we demonstrate the power of a perilaryngeal-cranial muscle network as a neurophysiological window to vocal performance. In addition, the study also discovers tasks with the highest network involvement, which may be utilized in the future to monitor voice disorders and rehabilitation.
Collapse
|
37
|
Abrams DA, Mistry PK, Baker AE, Padmanabhan A, Menon V. A Neurodevelopmental Shift in Reward Circuitry from Mother's to Nonfamilial Voices in Adolescence. J Neurosci 2022; 42:4164-4173. [PMID: 35483917 PMCID: PMC9121826 DOI: 10.1523/jneurosci.2018-21.2022] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 03/04/2022] [Accepted: 03/31/2022] [Indexed: 11/21/2022] Open
Abstract
The social worlds of young children primarily revolve around parents and caregivers, who play a key role in guiding children's social and cognitive development. However, a hallmark of adolescence is a shift in orientation toward nonfamilial social targets, an adaptive process that prepares adolescents for their independence. Little is known regarding neurobiological signatures underlying changes in adolescents' social orientation. Using functional brain imaging of human voice processing in children and adolescents (ages 7-16), we demonstrate distinct neural signatures for mother's voice and nonfamilial voices across child and adolescent development in reward and social valuation systems, instantiated in nucleus accumbens and ventromedial prefrontal cortex. While younger children showed greater activity in these brain systems for mother's voice compared with nonfamilial voices, older adolescents showed the opposite effect with increased activity for nonfamilial compared with mother's voice. Findings uncover a critical role for reward and social valuative brain systems in the pronounced changes in adolescents' orientation toward nonfamilial social targets. Our approach provides a template for examining developmental shifts in social reward and motivation in individuals with pronounced social impairments, including adolescents with autism.SIGNIFICANCE STATEMENT Children's social worlds undergo a transformation during adolescence. While socialization in young children revolves around parents and caregivers, adolescence is characterized by a shift in social orientation toward nonfamilial social partners. Here we show that this shift is reflected in neural activity measured from reward processing regions in response to brief vocal samples. When younger children hear their mother's voice, reward processing regions show greater activity compared with when they hear nonfamilial, unfamiliar voices. Strikingly, older adolescents show the opposite effect, with increased activity for nonfamilial compared with mother's voice. Findings identify the brain basis of adolescents' switch in social orientation toward nonfamilial social partners and provides a template for understanding neurodevelopment in clinical populations with social and communication difficulties.
Collapse
Affiliation(s)
| | | | | | | | - Vinod Menon
- Department of Psychiatry and Behavioral Sciences
- Department of Neurology and Neurological Sciences
- Stanford Neuroscience Institute, Stanford University School of Medicine, Stanford, California 94305
| |
Collapse
|
38
|
Awan SN, Shaikh MA, Desjardins M, Feinstein H, Abbott KV. The Effect of Microphone Frequency Response on Spectral and Cepstral Measures of Voice: An Examination of Low-Cost Electret Headset Microphones. Am J Speech Lang Pathol 2022; 31:959-973. [PMID: 35050724 PMCID: PMC9150670 DOI: 10.1044/2021_ajslp-21-00156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 08/12/2021] [Accepted: 10/11/2021] [Indexed: 06/14/2023]
Abstract
PURPOSE The purpose of this study was to establish the frequency response of a selection of low-cost headset microphones that could be given to subjects for remote voice recordings and to examine the effect of microphone type and frequency response on key acoustic measures related to voice quality obtained from speech and vowel samples. METHOD The frequency responses of three low-cost headset microphones were evaluated using pink noise generated via a head-and-torso model. Each of the headset microphones was then used to record a series of speech and vowel samples prerecorded from 24 speakers who represented a diversity of sex, age, fundamental frequency (F o), and voice quality types. Recordings were later analyzed for the following measures: smoothed cepstral peak prominence (CPP; dB), low versus high spectral ratio (L/H ratio; dB), CPP F o (Hz), and cepstral spectral index of dysphonia (CSID). RESULTS The frequency response of the microphones under test was observed to have nonsignificant effects on measures of the CPP and CPP F o, significant effects on the CSID in speech contexts, and strong and significant effects on the measure of spectral tilt (L/H ratio). However, the correlations between the various headset microphones and a reference precision microphone were excellent (rs > .90). CONCLUSIONS The headset microphones under test all showed the capability to track a wide range of diversity in the voice signal. Though the use of higher quality microphones that have demonstrated specifications is recommended for typical research and clinical purposes, low-cost electret microphones may be used to provide valid measures of voice, specifically when the same microphone and signal chain is used for the evaluation of pre- versus posttreatment change or intergroup comparisons.
Collapse
Affiliation(s)
- Shaheen N. Awan
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Mohsin A. Shaikh
- Department of Communication Sciences and Disorders, Bloomsburg University of Pennsylvania
| | - Maude Desjardins
- Department of Communication Sciences & Disorders, University of Delaware, Newark
| | - Hagar Feinstein
- Department of Communication Sciences & Disorders, University of Delaware, Newark
| | | |
Collapse
|
39
|
Fahmy S, Kan PF, Walentas Lewon J. The effects of theatre-based vocal empowerment on young Egyptian women's vocal and language characteristics. PLoS One 2022; 16:e0261294. [PMID: 34972115 PMCID: PMC8719750 DOI: 10.1371/journal.pone.0261294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Accepted: 11/30/2021] [Indexed: 11/18/2022] Open
Abstract
This study investigates the impact of a theatre-based vocal empowerment program on the vocal and language characteristics and the self-perceptions of young bilingual Egyptian women. The program used applied theatre, a dramatic practice that promotes civic action by utilizing improvisational techniques to engage participants in exploring solutions to self-identified community concerns. These techniques supported participants' pursuit of vocal empowerment: the ability to comfortably express their intended content with a clear audible voice, accompanied by the belief that what they had to say was worthwhile. The program was implemented in Alexandria and Aswan, two Egyptian cities in different regions of the country, with distinct socio-economic profiles. Thirty-six young women from Aswan and nineteen from Alexandria participated. The program was facilitated in Arabic, for 90 minutes per day over twelve consecutive days in 2018. Participants in both groups spoke Arabic as a home language and studied English in school settings but differed in their educational experiences and English proficiency. The vocal and language characteristics of each participant were tested in Arabic and English pre- and post- program using a spontaneous speech task and a reading aloud task. Their self-perceptions were evaluated through a vocal self-perception survey. Results indicated that participants responded differently in each city. In Alexandria, participants showed significant improvement in language skills (e.g., mean length of utterance). In contrast, participants in Aswan showed a significant change in fundamental frequency. Overall, the self-surveys indicated that all participants experienced an increased sense of confidence, a stronger belief in self-authorship, and an increased desire to voice their opinions clearly in public; however, there were subtle differences between the groups. In analyzing these results, we conclude that to design effective vocal empowerment outreach programs internationally, it is necessary to consider participants' cultural backgrounds, language diversity, and socio-economic status.
Collapse
Affiliation(s)
- Sarah Fahmy
- Department of Theatre and Dance, University of Colorado Boulder, Boulder, Colorado, United States of America
- * E-mail:
| | - Pui-Fong Kan
- Department of Speech, Language, & Hearing Sciences, University of Colorado Boulder, Boulder, Colorado, United States of America
| | - Jen Walentas Lewon
- Department of Speech, Language, & Hearing Sciences, University of Colorado Boulder, Boulder, Colorado, United States of America
| |
Collapse
|
40
|
Alzamendi GA, Peterson SD, Erath BD, Hillman RE, Zañartu M. Triangular body-cover model of the vocal folds with coordinated activation of the five intrinsic laryngeal muscles. J Acoust Soc Am 2022; 151:17. [PMID: 35105008 PMCID: PMC8727069 DOI: 10.1121/10.0009169] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 11/24/2021] [Accepted: 12/06/2021] [Indexed: 05/26/2023]
Abstract
Poor laryngeal muscle coordination that results in abnormal glottal posturing is believed to be a primary etiologic factor in common voice disorders such as non-phonotraumatic vocal hyperfunction. Abnormal activity of antagonistic laryngeal muscles is hypothesized to play a key role in the alteration of normal vocal fold biomechanics that results in the dysphonia associated with such disorders. Current low-order models of the vocal folds are unsatisfactory to test this hypothesis since they do not capture the co-contraction of antagonist laryngeal muscle pairs. To address this limitation, a self-sustained triangular body-cover model with full intrinsic muscle control is introduced. The proposed scheme shows good agreement with prior studies using finite element models, excised larynges, and clinical studies in sustained and time-varying vocal gestures. Simulations of vocal fold posturing obtained with distinct antagonistic muscle activation yield clear differences in kinematic, aerodynamic, and acoustic measures. The proposed tool is deemed sufficiently accurate and flexible for future comprehensive investigations of non-phonotraumatic vocal hyperfunction and other laryngeal motor control disorders.
Collapse
Affiliation(s)
- Gabriel A Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Entre Ríos 3100, Argentina
| | - Sean D Peterson
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada
| | - Byron D Erath
- Department of Mechanical and Aerospace Engineering, Clarkson University, Potsdam, New York 13699, USA
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| |
Collapse
|
41
|
Oles N, Darrach H, Landford W, Garza M, Twose C, Park CS, Tran P, Schechter LS, Lau B, Coon D. Gender Affirming Surgery: A Comprehensive, Systematic Review of All Peer-reviewed Literature and Methods of Assessing Patient-centered Outcomes (Part 1: Breast/Chest, Face, and Voice). Ann Surg 2022; 275:e52-e66. [PMID: 33443903 DOI: 10.1097/sla.0000000000004728] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE To perform the first systematic review of all available gender-affirming surgery (GAS) publications across all procedures to assess both outcomes reported in the literature and the methods used for outcome assessment. SUMMARY OF BACKGROUND DATA Rapidly increasing clinical volumes of gender-affirming surgeries have stimulated a growing need for high-quality clinical research. Although some procedures have been performed for decades, each individual procedure has limited data, necessitating synthesis of the entire literature to understand current knowledge and guide future research. METHODS A systematic review was performed following Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines to identify all outcomes measures in GAS cohorts, including PCOs, complications, and functional outcomes. Outcome data were pooled to assess currently reported complication, satisfaction, and other outcome rates. RESULTS Overall, 15,186 references were identified, 4162 papers advanced to abstract review, and 1826 underwent full-text review. After review, there were 406 GAS cohort publications. Of non-genitoplasty titles, 35 were mastectomy, 6 mammoplasty, 21 facial feminization, and 31 voice/cartilage. Although 59.1% of non-genitoplasty papers addressed PCOs in some form, only 4.3% used instruments partially-validated in transgender patients. Overall, data were reported heterogeneously and were biased towards high-volume centers. CONCLUSIONS This study represents the most comprehensive review of GAS literature. By aggregating all previously utilized measurement instruments, this study offers a foundation for discussions about current methodologic limitations and what dimensions must be included in assessing surgical success. We have assembled a comprehensive list of outcome instruments; this offers an ideal starting basis for emerging discussions between patients and providers about deficiencies which new, better instruments and metrics must address. The lack of consistent use of the same outcome measures and validated GAS-specific instruments represent the 2 primary barriers to high-quality research where improvement efforts should be focused.
Collapse
Affiliation(s)
- Norah Oles
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| | - Halley Darrach
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| | - Wilmina Landford
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| | - Matthew Garza
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| | - Claire Twose
- Welch Medical Library, Johns Hopkins University, Baltimore, Maryland
| | - Chanjun S Park
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| | - Phuong Tran
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| | | | - Brandyn Lau
- Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
- Russell H. Morgan Department of Radiology and Radiological Science, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Division of Health Sciences Informatics, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Armstrong Institute for Patient Safety, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Devin Coon
- Department of Plastic Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Center for Transgender Health, Johns Hopkins Medicine, Baltimore, Maryland
| |
Collapse
|
42
|
Waters S, Kanber E, Lavan N, Belyk M, Carey D, Cartei V, Lally C, Miquel M, McGettigan C. Singers show enhanced performance and neural representation of vocal imitation. Philos Trans R Soc Lond B Biol Sci 2021; 376:20200399. [PMID: 34719245 PMCID: PMC8558773 DOI: 10.1098/rstb.2020.0399] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/06/2021] [Indexed: 12/17/2022] Open
Abstract
Humans have a remarkable capacity to finely control the muscles of the larynx, via distinct patterns of cortical topography and innervation that may underpin our sophisticated vocal capabilities compared with non-human primates. Here, we investigated the behavioural and neural correlates of laryngeal control, and their relationship to vocal expertise, using an imitation task that required adjustments of larynx musculature during speech. Highly trained human singers and non-singer control participants modulated voice pitch and vocal tract length (VTL) to mimic auditory speech targets, while undergoing real-time anatomical scans of the vocal tract and functional scans of brain activity. Multivariate analyses of speech acoustics, larynx movements and brain activation data were used to quantify vocal modulation behaviour and to search for neural representations of the two modulated vocal parameters during the preparation and execution of speech. We found that singers showed more accurate task-relevant modulations of speech pitch and VTL (i.e. larynx height, as measured with vocal tract MRI) during speech imitation; this was accompanied by stronger representation of VTL within a region of the right somatosensory cortex. Our findings suggest a common neural basis for enhanced vocal control in speech and song. This article is part of the theme issue 'Voice modulation: from origin and mechanism to social impact (Part I)'.
Collapse
Affiliation(s)
- Sheena Waters
- Department of Psychology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Charterhouse Square, London EC1M 6BQ, UK
| | - Elise Kanber
- Department of Psychology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Speech, Hearing and Phonetic Sciences, University College London, 2 Wakefield Street, London WC1N 1PF, UK
| | - Nadine Lavan
- Speech, Hearing and Phonetic Sciences, University College London, 2 Wakefield Street, London WC1N 1PF, UK
- Department of Biological and Experimental Psychology, Queen Mary University of London, Mile End Road, Bethnal Green, London E1 4NS, UK
| | - Michel Belyk
- Speech, Hearing and Phonetic Sciences, University College London, 2 Wakefield Street, London WC1N 1PF, UK
| | - Daniel Carey
- Department of Psychology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Data & AI, Novartis Pharmaceuticals, Novartis Global Service Center, 203 Merrion Road, Dublin 4 D04 NN12, Ireland
| | - Valentina Cartei
- Equipe de Neuro-Ethologie Sensorielle (ENES), Centre de Recherche en Neurosciences de Lyon, Université de Lyon/Saint-Etienne, 21 rue du Docteur Paul Michelon, 42100 Saint-Etienne, France
- Department of Psychology, Institute of Education, Health and Social Sciences, University of Chichester, College Lane, Chichester, West Sussex PO19 6PE, UK
| | - Clare Lally
- Department of Psychology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Speech, Hearing and Phonetic Sciences, University College London, 2 Wakefield Street, London WC1N 1PF, UK
| | - Marc Miquel
- Department of Clinical Physics, Barts Health NHS Trust, London EC1A 7BE, UK
- William Harvey Research Institute, Queen Mary University of London, London EC1M 6BQ, UK
| | - Carolyn McGettigan
- Department of Psychology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Speech, Hearing and Phonetic Sciences, University College London, 2 Wakefield Street, London WC1N 1PF, UK
| |
Collapse
|
43
|
Abstract
The aim of the present literature review is to describe the influence of sex hormones on the human voice in physiological conditions. As a secondary sexual organ, the larynx is affected by sex hormones and may change considerably over the lifespan. In the current review, sex hormone-related voice modifications occurring during childhood, puberty, the menstrual cycle, pregnancy and senescence are described. The roles of sex hormones (including gonadotropins, testosterone, estrogen, androstenedione, dehydroepiandrosterone, and dehydroepiandrosterone-sulfate) underlying physiological voice changes are discussed, the main differences between males and females are explained and clinical implications are taken into account.
Collapse
Affiliation(s)
- Virginia Zamponi
- Endocrinology Unit, Department of Clinical and Molecular Medicine, Sant'Andrea Hospital, Sapienza University, Rome, Italy.
| | - Rossella Mazzilli
- Endocrinology Unit, Department of Clinical and Molecular Medicine, Sant'Andrea Hospital, Sapienza University, Rome, Italy
| | - Fernando Mazzilli
- Endocrinology Unit, Department of Clinical and Molecular Medicine, Sant'Andrea Hospital, Sapienza University, Rome, Italy
| | - Marco Fantini
- Head and Neck Oncology Unit, Candiolo Cancer Institute, FPO IRCCS, Candiolo, Turin, Italy
| |
Collapse
|
44
|
Abstract
In this study we investigated the effect of sex- and age-related differences in vocal fold length, thickness, and depth on voice production in a three-dimensional vocal fold model. The results showed that the cause-effect relationships between vocal fold physiology and voice production previously identified in an adult male-like vocal fold geometry remained qualitatively the same in vocal folds with geometry representative of adult females and children. We further showed that the often-observed differences in voice production between adult males, adult females, and children can be explained by differences in length and thickness. The lower F0, higher flow rate, larger vocal fold vibration amplitude, and higher sound pressure level (SPL) in adult males as compared to adult females and children can be explained by differences in vocal fold length. In contrast, the thickness effect dominated and contributed to the larger closed quotient of vocal fold vibration, larger normalized maximum flow declination rate, and lower H1-H2 in adult males as compared to adult females and children. The effect of differences in vocal fold depth was generally small. When targeting a specific SPL, adult males experienced a lower peak vocal fold contact pressure during phonation than adult females and children.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
45
|
Anderson BE, Sommerfeldt SD. Solving one-dimensional acoustic systems using the impedance translation theorem and equivalent circuits: A graduate level homework assignment. J Acoust Soc Am 2021; 150:4155. [PMID: 34972266 DOI: 10.1121/10.0008932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 11/11/2021] [Indexed: 06/14/2023]
Abstract
The natural frequency resonances and sound radiation from one-dimensional acoustic systems are of great interest in the study of musical instruments, human vocal tract effects on speech, automotive exhaust pipes, duct systems for temperature control in buildings, and more. The impedance translation theorem is an approach that may be used to solve for the input impedance and therefore the resonance frequencies of one-dimensional systems. Equivalent circuits offer another approach to solving one-dimensional systems, though with equivalent circuits you can also solve for the response at any location in the system, including the radiated sound pressure. At Brigham Young University, there are two graduate level courses that teach these two techniques. One of the most challenging and memorable homework assignments from these courses is based on using one of these techniques to analyze a particular acoustic system and compare its response with the real thing. This paper discusses the basics of these two techniques and applies them to an analysis of phonemes produced by altering the human vocal tract. Details about the homework assignments are also given.
Collapse
Affiliation(s)
- Brian E Anderson
- Acoustics Research Group, Department of Physics and Astronomy, Brigham Young University, Provo, Utah 84602, USA
| | - Scott D Sommerfeldt
- Acoustics Research Group, Department of Physics and Astronomy, Brigham Young University, Provo, Utah 84602, USA
| |
Collapse
|
46
|
Asghari SZ, Farashi S, Bashirian S, Jenabi E. Distinctive prosodic features of people with autism spectrum disorder: a systematic review and meta-analysis study. Sci Rep 2021; 11:23093. [PMID: 34845298 PMCID: PMC8630064 DOI: 10.1038/s41598-021-02487-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 11/16/2021] [Indexed: 12/26/2022] Open
Abstract
In this systematic review, we analyzed and evaluated the findings of studies on prosodic features of vocal productions of people with autism spectrum disorder (ASD) in order to recognize the statistically significant, most confirmed and reliable prosodic differences distinguishing people with ASD from typically developing individuals. Using suitable keywords, three major databases including Web of Science, PubMed and Scopus, were searched. The results for prosodic features such as mean pitch, pitch range and variability, speech rate, intensity and voice duration were extracted from eligible studies. The pooled standard mean difference between ASD and control groups was extracted or calculated. Using I2 statistic and Cochrane Q-test, between-study heterogeneity was evaluated. Furthermore, publication bias was assessed using funnel plot and its significance was evaluated using Egger's and Begg's tests. Thirty-nine eligible studies were retrieved (including 910 and 850 participants for ASD and control groups, respectively). This systematic review and meta-analysis showed that ASD group members had a significantly larger mean pitch (SMD = - 0.4, 95% CI [- 0.70, - 0.10]), larger pitch range (SMD = - 0.78, 95% CI [- 1.34, - 0.21]), longer voice duration (SMD = - 0.43, 95% CI [- 0.72, - 0.15]), and larger pitch variability (SMD = - 0.46, 95% CI [- 0.84, - 0.08]), compared with typically developing control group. However, no significant differences in pitch standard deviation, voice intensity and speech rate were found between groups. Chronological age of participants and voice elicitation tasks were two sources of between-study heterogeneity. Furthermore, no publication bias was observed during analyses (p > 0.05). Mean pitch, pitch range, pitch variability and voice duration were recognized as the prosodic features reliably distinguishing people with ASD from TD individuals.
Collapse
Affiliation(s)
| | - Sajjad Farashi
- Autism Spectrum Disorders Research Center, Hamadan University of Medical Sciences, Hamadan, Iran.
| | - Saeid Bashirian
- Department of Public Health, School of Health, Hamadan University of Medical Sciences, Hamadan, Iran.
| | - Ensiyeh Jenabi
- Autism Spectrum Disorders Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
| |
Collapse
|
47
|
Wang JJ, Huang TY, Wu CW, Lin YC, Tseng HY, Liu CH, Lu IC, Chang PY, Chen HC, Chen HY, Dionigi G, Chiang FY, Wang LF. Improving Voice Outcomes After Thyroid Surgery - Review of Safety Parameters for Using Energy-Based Devices Near the Recurrent Laryngeal Nerve. Front Endocrinol (Lausanne) 2021; 12:793431. [PMID: 34899616 PMCID: PMC8662988 DOI: 10.3389/fendo.2021.793431] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 11/08/2021] [Indexed: 11/13/2022] Open
Abstract
Technological advances in thyroid surgery have rapidly increased in recent decades. Specifically, recently developed energy-based devices (EBDs) enable simultaneous dissection and sealing tissue. EBDs have many advantages in thyroid surgery, such as reduced blood loss, lower rate of post-operative hypocalcemia, and shorter operation time. However, the rate of recurrent laryngeal nerve (RLN) injury during EBD use has shown statistically inconsistent. EBDs generate high temperature that can cause iatrogenic thermal injury to the RLN by direct or indirect thermal spread. This article reviews relevant medical literatures of conventional electrocauteries and different mechanisms of current EBDs, and compares two safety parameters: safe distance and cooling time. In general, conventional electrocautery generates higher temperature and wider thermal spread range, but when applying EBDs near the RLN adequate activation distance and cooling time are still required to avoid inadvertent thermal injury. To improve voice outcomes in the quality-of-life era, surgeons should observe safety parameters and follow the standard procedures when using EBDs near the RLN in thyroid surgery.
Collapse
Affiliation(s)
- Jia Joanna Wang
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Department of Otolaryngology-Head and Neck Surgery, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Tzu-Yen Huang
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Che-Wei Wu
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Center for Liquid Biopsy and Cohort Research, and Faculty of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Yi-Chu Lin
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Hsin-Yi Tseng
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Cheng-Hsin Liu
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Department of Otolaryngology-Head and Neck Surgery, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - I-Cheng Lu
- Department of Anesthesiology, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Pi-Ying Chang
- Department of Anesthesiology, Kaohsiung Municipal Tatung Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Hui-Chun Chen
- Department of Nursing, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan
| | - Hsiu-Ya Chen
- Department of Anesthesiology, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan
| | - Gianlorenzo Dionigi
- Division of General Surgery, Endocrine Surgery Section, Istituto Auxologico Italiano IRCCS, Milan, Italy
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
| | - Feng-Yu Chiang
- Department of Otolaryngology-Head and Neck Surgery, E-Da Hospital, Kaohsiung, Taiwan
- School of Medicine, College of Medicine, I-Shou University, Kaohsiung, Taiwan
| | - Ling-Feng Wang
- Department of Otorhinolaryngology-Head and Neck Surgery, International Thyroid Surgery Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| |
Collapse
|
48
|
Laeng B, Kuyateh S, Kelkar T. Substituting facial movements in singers changes the sounds of musical intervals. Sci Rep 2021; 11:22442. [PMID: 34789775 PMCID: PMC8599708 DOI: 10.1038/s41598-021-01797-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 10/26/2021] [Indexed: 11/18/2022] Open
Abstract
Cross-modal integration is ubiquitous within perception and, in humans, the McGurk effect demonstrates that seeing a person articulating speech can change what we hear into a new auditory percept. It remains unclear whether cross-modal integration of sight and sound generalizes to other visible vocal articulations like those made by singers. We surmise that perceptual integrative effects should involve music deeply, since there is ample indeterminacy and variability in its auditory signals. We show that switching videos of sung musical intervals changes systematically the estimated distance between two notes of a musical interval so that pairing the video of a smaller sung interval to a relatively larger auditory led to compression effects on rated intervals, whereas the reverse led to a stretching effect. In addition, after seeing a visually switched video of an equally-tempered sung interval and then hearing the same interval played on the piano, the two intervals were judged often different though they differed only in instrument. These findings reveal spontaneous, cross-modal, integration of vocal sounds and clearly indicate that strong integration of sound and sight can occur beyond the articulations of natural speech.
Collapse
Affiliation(s)
- Bruno Laeng
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Forskningsveien 3A, 1094 Blindern, 0317, Oslo, Norway.
- Department of Psychology, University of Oslo, Oslo, Norway.
| | - Sarjo Kuyateh
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Forskningsveien 3A, 1094 Blindern, 0317, Oslo, Norway
- Department of Psychology, University of Oslo, Oslo, Norway
| | - Tejaswinee Kelkar
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Forskningsveien 3A, 1094 Blindern, 0317, Oslo, Norway
- Department of Musicology, University of Oslo, Oslo, Norway
| |
Collapse
|
49
|
Abstract
Joint speech behaviours where speakers produce speech in unison are found in a variety of everyday settings, and have clinical relevance as a temporary fluency-enhancing technique for people who stutter. It is currently unknown whether such synchronisation of speech timing among two speakers is also accompanied by alignment in their vocal characteristics, for example in acoustic measures such as pitch. The current study investigated this by testing whether convergence in voice fundamental frequency (F0) between speakers could be demonstrated during synchronous speech. Sixty participants across two online experiments were audio recorded whilst reading a series of sentences, first on their own, and then in synchrony with another speaker (the accompanist) in a number of between-subject conditions. Experiment 1 demonstrated significant convergence in participants' F0 to a pre-recorded accompanist voice, in the form of both upward (high F0 accompanist condition) and downward (low and extra-low F0 accompanist conditions) changes in F0. Experiment 2 demonstrated that such convergence was not seen during a visual synchronous speech condition, in which participants spoke in synchrony with silent video recordings of the accompanist. An audiovisual condition in which participants were able to both see and hear the accompanist in pre-recorded videos did not result in greater convergence in F0 compared to synchronisation with the pre-recorded voice alone. These findings suggest the need for models of speech motor control to incorporate interactions between self- and other-speech feedback during speech production, and suggest a novel hypothesis for the mechanisms underlying the fluency-enhancing effects of synchronous speech in people who stutter.
Collapse
Affiliation(s)
- Abigail R. Bradshaw
- Department of Speech, Hearing & Phonetic Sciences, University College London, London, United Kingdom
| | - Carolyn McGettigan
- Department of Speech, Hearing & Phonetic Sciences, University College London, London, United Kingdom
| |
Collapse
|
50
|
Abstract
Organizations need both employee voice and managerial endorsement to ensure high-quality decision-making and achieve organizational effectiveness. However, a preponderance of voice research focuses on employee voice with little attention paid to voice endorsement. Building on the social persuasion theory of the elaboration likelihood model, we systematically examine the sender and receiver determinants of voice endorsement and how the interplay of those determinants affects voice endorsement. By empirically analyzing 168 paired samples, we find that issue-relevant information, i.e., voicer credibility, has a positive effect on voice endorsement and matters most when leaders have high felt obligation. The results also show that the peripheral cue used in the study, i.e., positive mood, has a positive effect on voice endorsement and matters most when leaders have low felt obligation or low cognitive flexibility. We discuss the contributions of these findings and highlight limitations and directions for future research.
Collapse
Affiliation(s)
- Xiaobo Li
- Department of Psychology and Behavioral Science, Zhejiang University, Hangzhou, China
- School of Business, Zhejiang University City College, Hangzhou, China
| | - Ting Wu
- School of Business, Zhejiang University City College, Hangzhou, China
- * E-mail:
| | - Jianhong Ma
- Department of Psychology and Behavioral Science, Zhejiang University, Hangzhou, China
| |
Collapse
|