1
|
Kirsch A, Gerstenberger C, Jakubaß B, Tschernitz M, Perkins JD, Groselj‐Strele A, Lanmüller H, Jarvis JC, Kniesburges S, Döllinger M, Gugatschka M. Bilateral Functional Electrical Stimulation for the Treatment of Presbyphonia in a Sheep Model. Laryngoscope 2024; 134:848-854. [PMID: 37597167 PMCID: PMC10952233 DOI: 10.1002/lary.30984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 07/31/2023] [Accepted: 08/03/2023] [Indexed: 08/21/2023]
Abstract
OBJECTIVES The aim of the study was to increase muscle volume and improve phonation characteristics of the aged ovine larynx by functional electrical stimulation (FES) using a minimally invasive surgical procedure. METHODS Stimulation electrodes were placed bilaterally near the terminal adduction branch of the recurrent laryngeal nerves (RLN). The electrodes were connected to battery powered pulse generators implanted subcutaneously at the neck region. Training patterns were programmed by an external programmer using a bidirectional radio frequency link. Training sessions were repeated automatically by the implant every other day for 1 week followed by every day for 8 weeks in the awake animal. Another group of animals were used as sham, with electrodes positioned but not connected to an implant. Outcome parameters included gene expression analysis, histological assessment of muscle fiber size, functional analysis, and volumetric measurements based on three-dimensional reconstructions of the entire thyroarytenoid muscle (TAM). RESULTS Increase in minimal muscle fiber diameter and an improvement in vocal efficiency were observed following FES, compared with sham animals. CONCLUSION This is the first study to demonstrate beneficial effects in the TAM of FES at molecular, histological, and functional levels. FES of the terminal branches of the RLN reversed the effects of age-related changes and improved vocal efficiency. LEVEL OF EVIDENCE NA Laryngoscope, 134:848-854, 2024.
Collapse
Affiliation(s)
- Andrijana Kirsch
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| | - Claus Gerstenberger
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| | - Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich‐Alexander‐Universität Erlangen‐NürnbergErlangenGermany
| | - Magdalena Tschernitz
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| | | | - Andrea Groselj‐Strele
- Core Facility Computational Bioanalytics, Center for Medical ResearchMedical University of GrazGrazAustria
| | - Hermann Lanmüller
- Center of Medical Physics and Biomedical EngineeringMedical University of ViennaViennaAustria
| | - Jonathan C. Jarvis
- School of Sport and Exercise SciencesLiverpool John Moores UniversityLiverpoolUK
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich‐Alexander‐Universität Erlangen‐NürnbergErlangenGermany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich‐Alexander‐Universität Erlangen‐NürnbergErlangenGermany
| | - Markus Gugatschka
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| |
Collapse
|
2
|
Näger C, Kniesburges S, Tur B, Schoder S, Becker S. An Investigation of Acoustic Back-Coupling in Human Phonation on a Synthetic Larynx Model. Bioengineering (Basel) 2023; 10:1343. [PMID: 38135934 PMCID: PMC10740801 DOI: 10.3390/bioengineering10121343] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 11/12/2023] [Accepted: 11/19/2023] [Indexed: 12/24/2023] Open
Abstract
In the human phonation process, acoustic standing waves in the vocal tract can influence the fluid flow through the glottis as well as vocal fold oscillation. To investigate the amount of acoustic back-coupling, the supraglottal flow field has been recorded via high-speed particle image velocimetry (PIV) in a synthetic larynx model for several configurations with different vocal tract lengths. Based on the obtained velocity fields, acoustic source terms were computed. Additionally, the sound radiation into the far field was recorded via microphone measurements and the vocal fold oscillation via high-speed camera recordings. The PIV measurements revealed that near a vocal tract resonance frequency fR, the vocal fold oscillation frequency fo (and therefore also the flow field's fundamental frequency) jumps onto fR. This is accompanied by a substantial relative increase in aeroacoustic sound generation efficiency. Furthermore, the measurements show that fo-fR-coupling increases vocal efficiency, signal-to-noise ratio, harmonics-to-noise ratio and cepstral peak prominence. At the same time, the glottal volume flow needed for stable vocal fold oscillation decreases strongly. All of this results in an improved voice quality and phonation efficiency so that a person phonating with fo-fR-coupling can phonate longer and with better voice quality.
Collapse
Affiliation(s)
- Christoph Näger
- Institute of Fluid Mechanics, Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 4, 91058 Erlangen, Germany;
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, University Hospital Erlangen, Medical School, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Bogac Tur
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, University Hospital Erlangen, Medical School, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Stefan Schoder
- Aeroacoustics and Vibroacoustics Group, Institute of Fundamentals and Theory in Electrical Engineering, Graz University of Technology, Inffeldgasse 16, 8010 Graz, Austria;
| | - Stefan Becker
- Institute of Fluid Mechanics, Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 4, 91058 Erlangen, Germany;
| |
Collapse
|
3
|
Cortés JP, Lin JZ, Marks KL, Espinoza VM, Ibarra EJ, Zañartu M, Hillman RE, Mehta DD. Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders. APPLIED SCIENCES (BASEL, SWITZERLAND) 2022; 12:10692. [PMID: 36777332 PMCID: PMC9910342 DOI: 10.3390/app122110692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.
Collapse
Affiliation(s)
- Juan P. Cortés
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Jon Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech, Language & Hearing Sciences Department, College of Health & Rehabilitation: Sargent College, Boston University, Boston, MA 02215, USA
| | | | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
4
|
Toles LE, Seidman AY, Hillman RE, Mehta DD. Clinical Utility of the Ratio of Sound Pressure Level to Subglottal Pressure in Patients Surgically Treated for Phonotraumatic Vocal Fold Lesions. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2778-2788. [PMID: 35914023 PMCID: PMC9911101 DOI: 10.1044/2022_jslhr-21-00658] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 04/07/2022] [Accepted: 05/02/2022] [Indexed: 06/15/2023]
Abstract
PURPOSE This study aimed to determine whether a simplified, and potentially more stable, acoustic-aerodynamic voice outcome ratio (ratio of sound pressure level [SPL] to subglottal pressure) is comparable to a traditional vocal efficiency measure (ratio of acoustic power to the product of average subglottal pressure and average phonatory airflow) in terms of the ability to detect change in vocal function following surgical removal of bilateral phonotraumatic lesions. METHOD Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 75 female patients who underwent surgical removal of bilateral phonotraumatic lesions. A 2 × 2 repeated-measures analysis of variance was conducted for each of three acoustic-aerodynamic voice outcome ratios-traditional vocal efficiency, an SPL-based ratio with both airflow and subglottal pressure, and a simplified SPL-based ratio with subglottal pressure only-to investigate the main effects of treatment stage (pre- and postsurgery), loudness condition (comfortable and loud), and their interaction. Post hoc paired samples t tests were conducted for statistically significant interactions. The within-subject variability of the measures was assessed using the coefficient of variation. RESULTS Although exhibiting an expected main effect of loudness (higher values in the loud condition), the traditional vocal efficiency ratio did not exhibit a main effect of treatment. For both SPL-based ratios, there were significant main effects of treatment stage (higher values postoperatively) and loudness condition (lower values in the loud condition). Within-subject, postoperative changes in the two SPL-based ratios moderately correlated with one another. The simplified ratio of SPL to subglottal pressure (without airflow) exhibited the least within-subject variability relative to the other two acoustic-aerodynamic ratios. CONCLUSIONS These findings indicate that SPL-based acoustic-aerodynamic voice outcome ratios increase significantly following the surgical removal of phonotraumatic vocal fold lesions. The simplified ratio of SPL to subglottal pressure exhibits the least variability and can be easily obtained without requiring the measurement of airflow.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- The University of Texas Southwestern Medical Center, Dallas
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
5
|
Vaiano T, Herbella FAM, Behlau M. PHARYNGEAL, UPPER ESOPHAGEAL SPHINCTERIC AND ESOPHAGEAL PRESSURES RESPONSES RELATED TO VOCAL TASKS AT THE LIGHT OF HIGH RESOLUTION MANOMETRY. ARQUIVOS DE GASTROENTEROLOGIA 2021; 58:296-301. [PMID: 34705962 DOI: 10.1590/s0004-2803.202100000-51] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Accepted: 03/05/2021] [Indexed: 01/11/2023]
Abstract
BACKGROUND High-resolution manometry (HRM) represents a potential tool for measuring pharyngoesophageal phonation pressures. OBJECTIVE This study aims to evaluate pharyngeal, esophageal upper sphincteric and esophageal pressures during different phonation tasks. METHODS 12 (six males, mean age 27 years) professional singers underwent HRM and produced four different vocal tasks at low, medium and high vocal loudness: vowel /ae/, ascending five note scale, word /hey/ and word /go/. Pressures were measured at pharynx, upper esophageal sphincter (UES) and esophagus. Visual analysis of the HRM topographic plots were performed. RESULTS Esophageal pressures are higher during vocalization than at rest. Pharyngeal and UES phonation pressures does not differ significantly from rest pressures. Visual analysis of the topographic plots showed an important UES pressure increasement during phonation. CONCLUSION HRM is a valuable tool for measuring pharyngoesophageal pressures during phonation. Esophageal pressures are higher during phonation than at rest and tend to increase with vocal loudness increment. The topographic plot provides additional data about phonatory mechanism physiology, especially at the UES region.
Collapse
Affiliation(s)
- Thays Vaiano
- Universidade Federal de São Paulo, Escola Paulista de Medicina, Departamento de Fonoaudiologia, São Paulo, SP, Brasil.,Centro de Estudos da Voz (CEV), São Paulo, SP, Brasil
| | - Fernando A M Herbella
- Universidade Federal de São Paulo, Escola Paulista de Medicina, Departamento de Cirurgia, São Paulo, SP, Brasil
| | - Mara Behlau
- Universidade Federal de São Paulo, Escola Paulista de Medicina, Departamento de Fonoaudiologia, São Paulo, SP, Brasil.,Centro de Estudos da Voz (CEV), São Paulo, SP, Brasil
| |
Collapse
|
6
|
Lulich SM, Patel RR. Semi-occluded vocal tract exercises in healthy young adults: Articulatory, acoustic, and aerodynamic measurements during phonation at threshold. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:3213. [PMID: 34241146 DOI: 10.1121/10.0004792] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 04/07/2021] [Indexed: 06/13/2023]
Abstract
Semi-occluded vocal tract exercises (SOVTEs) are increasingly popular as therapeutic exercises for patients with voice disorders. This popularity is reflected in the growing research literature, investigating the scientific principles underlying SOVTEs and their practical efficacy. This study examines several acoustic, articulatory, and aerodynamic variables before, during, and after short-duration (15 s) SOVTEs with a narrow tube in air. Participants were 20 healthy young adults, and all variables were measured at threshold phonation levels. Acoustic variables were measured with a microphone and a neck accelerometer, and include fundamental frequency, glottal open quotient, and vocal efficiency. Articulatory variables were measured with ultrasound, and include measures of the tongue tip, tongue dorsum, and posterior tongue height, and horizontal tongue length. Aerodynamic variables were measured with an intraoral pressure transducer and include subglottal, intraoral, and transglottal pressures. Lowering of the posterior tongue height and tongue dorsum height were observed with gender-specific small changes in the fundamental frequency, but there were no significant effects on the transglottal pressure or vocal efficiency. These findings suggest that the voices of healthy young adults already approach optimal performance, and the continued search for scientific evidence supporting SOVTEs should focus on populations with voice disorders.
Collapse
Affiliation(s)
- Steven M Lulich
- Department of Speech, Language and Hearing Sciences, Indiana University; 2631 East Discovery Parkway, Bloomington, Indiana 47408, USA
| | - Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University; 2631 East Discovery Parkway, Bloomington, Indiana 47408, USA
| |
Collapse
|
7
|
Falk S, Kniesburges S, Schoder S, Jakubaß B, Maurerlehner P, Echternach M, Kaltenbacher M, Döllinger M. 3D-FV-FE Aeroacoustic Larynx Model for Investigation of Functional Based Voice Disorders. Front Physiol 2021; 12:616985. [PMID: 33762964 PMCID: PMC7982522 DOI: 10.3389/fphys.2021.616985] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 02/09/2021] [Indexed: 12/02/2022] Open
Abstract
For the clinical analysis of underlying mechanisms of voice disorders, we developed a numerical aeroacoustic larynx model, called simVoice, that mimics commonly observed functional laryngeal disorders as glottal insufficiency and vibrational left-right asymmetries. The model is a combination of the Finite Volume (FV) CFD solver Star-CCM+ and the Finite Element (FE) aeroacoustic solver CFS++. simVoice models turbulence using Large Eddy Simulations (LES) and the acoustic wave propagation with the perturbed convective wave equation (PCWE). Its geometry corresponds to a simplified larynx and a vocal tract model representing the vowel /a/. The oscillations of the vocal folds are externally driven. In total, 10 configurations with different degrees of functional-based disorders were simulated and analyzed. The energy transfer between the glottal airflow and the vocal folds decreases with an increasing glottal insufficiency and potentially reflects the higher effort during speech for patients being concerned. This loss of energy transfer may also have an essential influence on the quality of the sound signal as expressed by decreasing sound pressure level (SPL), Cepstral Peak Prominence (CPP), and Vocal Efficiency (VE). Asymmetry in the vocal fold oscillations also reduces the quality of the sound signal. However, simVoice confirmed previous clinical and experimental observations that a high level of glottal insufficiency worsens the acoustic signal quality more than oscillatory left-right asymmetry. Both symptoms in combination will further reduce the quality of the sound signal. In summary, simVoice allows for detailed analysis of the origins of disordered voice production and hence fosters the further understanding of laryngeal physiology, including occurring dependencies. A current walltime of 10 h/cycle is, with a prospective increase in computing power, auspicious for a future clinical use of simVoice.
Collapse
Affiliation(s)
- Sebastian Falk
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Schoder
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Paul Maurerlehner
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Manfred Kaltenbacher
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
8
|
Marks KL, Lin JZ, Burns JA, Hron TA, Hillman RE, Mehta DD. Estimation of Subglottal Pressure From Neck Surface Vibration in Patients With Voice Disorders. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2202-2218. [PMID: 32610028 PMCID: PMC7838842 DOI: 10.1044/2020_jslhr-19-00409] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 03/12/2020] [Accepted: 04/15/2020] [Indexed: 05/15/2023]
Abstract
Purpose Given the established linear relationship between neck surface vibration magnitude and mean subglottal pressure (Ps) in vocally healthy speakers, the purpose of this study was to better understand the impact of the presence of a voice disorder on this baseline relationship. Method Data were obtained from participants with voice disorders representing a variety of glottal conditions, including phonotraumatic vocal hyperfunction, nonphonotraumatic vocal hyperfunction, and unilateral vocal fold paralysis. Participants were asked to repeat /p/-vowel syllable strings from loud-to-soft loudness levels in multiple vowel contexts (/pa/, /pi/, /pu/) and pitch levels (comfortable, higher than comfortable, lower than comfortable). Three statistical metrics were computed to analyze the regression line between neck surface accelerometer (ACC) signal magnitude and Ps within and across pitch, vowel, and voice disorder category: coefficient of determination (r 2), slope, and intercept. Three linear mixed-effects models were used to evaluate the impact of voice disorder category, pitch level, and vowel context on the relationship between ACC signal magnitude and Ps. Results The relationship between ACC signal magnitude and Ps was statistically different in patients with voice disorders than in vocally healthy controls; patients exhibited higher levels of Ps given similar values of ACC signal magnitude. Negligible effects were found for pitch condition within each voice disorder category, and negligible-to-small effects were found for vowel context. The mean of patient-specific r 2 values was .63, ranging from .13 to .92. Conclusions The baseline, linear relationship between ACC signal magnitude and Ps is affected by the presence of a voice disorder, with the relationship being participant-specific. Further work is needed to improve ACC-based prediction of Ps, across treatment, and during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - James A. Burns
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Tiffiny A. Hron
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|
9
|
Riede T, Olson CR. The vocal organ of hummingbirds shows convergence with songbirds. Sci Rep 2020; 10:2007. [PMID: 32029812 PMCID: PMC7005288 DOI: 10.1038/s41598-020-58843-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 01/09/2020] [Indexed: 11/09/2022] Open
Abstract
How sound is generated in the hummingbird syrinx is largely unknown despite their complex vocal behavior. To fill this gap, syrinx anatomy of four North American hummingbird species were investigated by histological dissection and contrast-enhanced microCT imaging, as well as measurement of vocalizations in a heliox atmosphere. The placement of the hummingbird syrinx is uniquely located in the neck rather than inside the thorax as in other birds, while the internal structure is bipartite with songbird-like anatomical features, including multiple pairs of intrinsic muscles, a robust tympanum and several accessory cartilages. Lateral labia and medial tympaniform membranes consist of an extracellular matrix containing hyaluronic acid, collagen fibers, but few elastic fibers. Their upper vocal tract, including the trachea, is shorter than predicted for their body size. There are between-species differences in syrinx measurements, despite similar overall morphology. In heliox, fundamental frequency is unchanged while upper-harmonic spectral content decrease in amplitude, indicating that syringeal sounds are produced by airflow-induced labia and membrane vibration. Our findings predict that hummingbirds have fine control of labia and membrane position in the syrinx; adaptations that set them apart from closely related swifts, yet shows convergence in their vocal organs with those of oscines.
Collapse
Affiliation(s)
- Tobias Riede
- Department of Physiology, College of Graduate Studies, Midwestern University, 19555 N 59th Ave, Glendale, AZ, 85308, United States
| | - Christopher R Olson
- Department of Physiology, College of Graduate Studies, Midwestern University, 19555 N 59th Ave, Glendale, AZ, 85308, United States.
| |
Collapse
|
10
|
Lin JZ, Espinoza VM, Marks KL, Zañartu M, Mehta DD. Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2020; 14:449-460. [PMID: 34079612 PMCID: PMC8168553 DOI: 10.1109/jstsp.2019.2959267] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Subglottal air pressure plays a major role in voice production and is a primary factor in controlling voice onset, offset, sound pressure level, glottal airflow, vocal fold collision pressures, and variations in fundamental frequency. Previous work has shown promise for the estimation of subglottal pressure from an unobtrusive miniature accelerometer sensor attached to the anterior base of the neck during typical modal voice production across multiple pitch and vowel contexts. This study expands on that work to incorporate additional accelerometer-based measures of vocal function to compensate for non-modal phonation characteristics and achieve an improved estimation of subglottal pressure. Subjects with normal voices repeated /p/-vowel syllable strings from loud-to-soft levels in multiple vowel contexts (/ɑ/, /i/, and /u/), pitch conditions (comfortable, lower than comfortable, higher than comfortable), and voice quality types (modal, breathy, strained, and rough). Subject-specific, stepwise regression models were constructed using root-mean-square (RMS) values of the accelerometer signal alone (baseline condition) and in combination with cepstral peak prominence, fundamental frequency, and glottal airflow measures derived using subglottal impedance-based inverse filtering. Five-fold cross-validation assessed the robustness of model performance using the root-mean-square error metric for each regression model. Each cross-validation fold exhibited up to a 25% decrease in prediction error when the model incorporated multidimensional aspects of the accelerometer signal compared with RMS-only models. Improved estimation of subglottal pressure for non-modal phonation was thus achievable, lending to future studies of subglottal pressure estimation in patients with voice disorders and in ambulatory voice recordings.
Collapse
Affiliation(s)
- Jon Z Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | | | - Katherine L Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa Maria, Valparaíso, Chile
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital-Harvard Medical School, Boston, MA 02114 USA
| |
Collapse
|
11
|
Marks KL, Lin JZ, Fox AB, Toles LE, Mehta DD. Impact of Nonmodal Phonation on Estimates of Subglottal Pressure From Neck-Surface Acceleration in Healthy Speakers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:3339-3358. [PMID: 31518510 PMCID: PMC6808343 DOI: 10.1044/2019_jslhr-s-19-0067] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The purpose of this study was to evaluate the effects of nonmodal phonation on estimates of subglottal pressure (Ps) derived from the magnitude of a neck-surface accelerometer (ACC) signal and to confirm previous findings regarding the impact of vowel contexts and pitch levels in a larger cohort of participants. Method Twenty-six vocally healthy participants (18 women, 8 men) were asked to produce a series of p-vowel syllables with descending loudness in 3 vowel contexts (/a/, /i/, and /u/), 3 pitch levels (comfortable, high, and low), and 4 elicited phonatory conditions (modal, breathy, strained, and rough). Estimates of Ps for each vowel segment were obtained by averaging the intraoral air pressure plateau before and after each segment. The root-mean-square magnitude of the neck-surface ACC signal was computed for each vowel segment. Three linear mixed-effects models were used to statistically assess the effects of vowel, pitch, and phonatory condition on the linear relationship (slope and intercept) between Ps and ACC signal magnitude. Results Results demonstrated statistically significant linear relationships between ACC signal magnitude and Ps within participants but with increased intercepts for the nonmodal phonatory conditions; slopes were affected to a lesser extent. Vowel and pitch contexts did not significantly affect the linear relationship between ACC signal magnitude and Ps. Conclusion The classic linear relationship between ACC signal magnitude and Ps is significantly affected when nonmodal phonation is produced by a speaker. Future work is warranted to further characterize nonmodal phonatory characteristics to improve the ACC-based prediction of Ps during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Annie B. Fox
- Center for Interprofessional Studies and Innovation, MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|
12
|
Gilman M, Maira C, Hapner ER. Airflow Patterns of Running Speech in Patients With Voice Disorders. J Voice 2019; 33:277-283. [DOI: 10.1016/j.jvoice.2017.12.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2017] [Accepted: 12/06/2017] [Indexed: 11/15/2022]
|
13
|
Titze IR. WHERE HAS ALL THE POWER GONE? ENERGY PRODUCTION AND LOSS IN VOCALIZATION. SPEECH COMMUNICATION 2018; 101:26-33. [PMID: 30555195 PMCID: PMC6292679 DOI: 10.1016/j.specom.2018.05.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Human voice production for speech is an inefficient process in terms of energy expended to produce acoustic output. A traditional measure of vocal efficiency relates acoustic power radiated from the mouth to aerodynamic power produced in the trachea. This efficiency ranges between 0.001 % and 1.0 % in speech-like vocalization. Simplified Navier-Stokes equations for non-steady compressible airflow from trachea to lips were used to calculate steady aerodynamic power, acoustic power, and combined total power at seven strategic locations along the airway. A portion of the airway was allowed to collapse to produce self-sustained oscillation for sound production. A conversion efficiency, defined as acoustic power generated in the glottis to aerodynamic power dissipated, was found to be on the order of 10%, but wall vibration, air viscosity, and kinetic pressure losses consumed almost all of that power. This sound, reflected back and forth in the airway, was dissipated at a level on the order of 99.9 %.
Collapse
Affiliation(s)
- Ingo R. Titze
- National Center for Voice and Speech, The University of Utah, Salt Lake City, UT
- Department of Communication Sciences and Disorders, The University of Iowa, Iowa City, IA
| |
Collapse
|
14
|
Mills RD, Rivedal S, DeMorett C, Maples G, Jiang JJ. Effects of Straw Phonation Through Tubes of Varied Lengths on Sustained Vowels in Normal-Voiced Participants. J Voice 2018. [DOI: 10.1016/j.jvoice.2017.05.015] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
15
|
Titze IR, Palaparthi A. Radiation efficiency for long-range vocal communication in mammals and birds. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:2813. [PMID: 29857705 PMCID: PMC5948106 DOI: 10.1121/1.5034768] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Revised: 04/09/2018] [Accepted: 04/11/2018] [Indexed: 06/01/2023]
Abstract
Long-distance vocal communication by birds and mammals, including humans, is facilitated largely by radiation efficiency from the mouth or beak. Here, this efficiency is defined and quantified. It depends on frequency content of vocalization, mouth opening, head and upper body geometry, and directionality. Each of these factors is described mathematically with a piston-in-a-sphere model. While this model is considered a classic, never before has the high frequency solution been applied in detail to vocalization. Results indicate that frequency content in the 1-50 kHz range can be radiated with nearly 100% efficiency if a reactance peak in the radiation impedance is utilized with adjustments of head size, mouth opening, and beam direction. Without these adjustments, radiation efficiency is generally below 1%, especially in human speech where a high fundamental frequency is a disadvantage for intelligibility. Thus, two distinct modes of vocal communication are identified, (1) short range with optimized information transfer and (2) long range with maximum efficiency for release of acoustic power.
Collapse
Affiliation(s)
- Ingo R Titze
- National Center for Voice and Speech, The University of Utah, 136 South Main Street, Suite 320, Salt Lake City, Utah 84101, USA
| | - Anil Palaparthi
- National Center for Voice and Speech, The University of Utah, 136 South Main Street, Suite 320, Salt Lake City, Utah 84101, USA
| |
Collapse
|
16
|
Fryd AS, Van Stan JH, Hillman RE, Mehta DD. Estimating Subglottal Pressure From Neck-Surface Acceleration During Normal Voice Production. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2016; 59:1335-1345. [PMID: 27959974 PMCID: PMC5399761 DOI: 10.1044/2016_jslhr-s-15-0430] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2015] [Revised: 03/10/2016] [Accepted: 05/06/2016] [Indexed: 05/10/2023]
Abstract
Purpose The purpose of this study was to evaluate the potential for estimating subglottal air pressure using a neck-surface accelerometer and to compare the accuracy of predicting subglottal air pressure relative to predicting acoustic sound pressure level (SPL). Method Indirect estimates of subglottal pressure (Psg') were obtained from 10 vocally healthy speakers during loud-to-soft repetitions of 3 different /p/-vowel gestures (/pa/, /pi/, /pu/) at 3 pitch levels in the modal register. Intraoral air pressure, neck-surface acceleration, and radiated acoustic pressure were recorded, and the root-mean-square amplitude of the acceleration signal was correlated with Psg' and SPL. Results The coefficient of determination between accelerometer level and Psg' was high when data were pooled from all vowel and pitch contexts for each participant (r2 = .68-.93). These relationships were stronger than corresponding relationships between accelerometer level and SPL (r2 = .46-.81). The average 95% prediction interval for estimating Psg' using accelerometer level was ±2.53 cm H2O, ranging from ±1.70 to ±3.74 cm H2O across participants. Conclusions Accelerometer signal amplitude correlated more strongly with Psg' than with SPL. Future work is warranted to investigate the robustness of the relationship in nonmodal voice qualities, individuals with voice disorders, and accelerometer-based ambulatory monitoring of subglottal pressure.
Collapse
Affiliation(s)
- Amanda S. Fryd
- Communication Sciences and Disorders, MGH Institute of Health Professions, Charlestown, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Jarrad H. Van Stan
- Communication Sciences and Disorders, MGH Institute of Health Professions, Charlestown, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Robert E. Hillman
- Communication Sciences and Disorders, MGH Institute of Health Professions, Charlestown, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
- Surgery & Health Sciences and Technology, Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Communication Sciences and Disorders, MGH Institute of Health Professions, Charlestown, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| |
Collapse
|