1
|
Švec JG, Kumar SP, Vencálek O, Ravichandran S, Lehoux S. Vocal Fold Kinematics and Convergent-Divergent Oscillatory Glottis: Basic Insights Using Mucosal Wave Modeling and Synthetic Kymograms. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2025; 68:1602-1617. [PMID: 40053877 DOI: 10.1044/2024_jslhr-24-00251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2025]
Abstract
PURPOSE Owing to mucosal waves, the oscillatory glottis is ideally expected to be convergent during opening and divergent during closing. However, this does not necessarily hold for voice disorders. Here, we pave the way for recognizing COnvergent-DIvergent (CODI) waveforms quantitatively and study the kinematic conditions in which they occur. METHOD We simulated 3,125 laryngoscopic glottal waveforms using a kinematic vocal fold (VF) model and synthetic kymograms. We independently varied the oscillatory amplitudes of the upper and lower VF margins, AU and AL (0.1 to 1.1 mm), vertical phase difference (VPD; 0° to 125°), glottal halfwidth HW (-0.05 to 1.2 mm), and prephonatory glottal convergence angle ψCVG (-15° to 35°) to simulate normal and disordered conditions. We introduced the upper and lower margin quotients, QU and QL, quantifying the proportion of time when the upper margin is at the glottal edge during the opening, and when the lower margin is at the glottal edge during the closing, respectively. A CODI waveform was defined as the case when QU = QL = 1. RESULTS The likelihood of obtaining the CODI waveform was highest when AU and AL were similar, ψCVG was close to 0, HW was below 0.45 mm, and VPD was larger than 50°. In 88% of the simulated cases, the waveforms did not fulfill the CODI conditions. In these cases, either the lower margin was hidden during some portion of the closing phase or the upper margin was not at the glottal edge during some portion of the opening phase. CONCLUSION The study provides the basis for a better understanding of the variability of glottal waveforms and the appearance of mucosal waves related to VF kinematics.
Collapse
Affiliation(s)
- Jan G Švec
- Voice Research Lab, Department of Experimental Physics, Faculty of Science, Palacký University, Olomouc, Czechia
| | - S Pravin Kumar
- Department of Biomedical Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, India
| | - Ondřej Vencálek
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czechia
| | | | - Sarah Lehoux
- Voice Research Lab, Department of Experimental Physics, Faculty of Science, Palacký University, Olomouc, Czechia
| |
Collapse
|
2
|
Titze IR, Maxfield L, Manternach B, Palaparthi A, Scherer R, Wang X, Zheng X, Xue Q. Pressure Distributions in Glottal Geometries With Multichannel Airflows. J Voice 2024:S0892-1997(24)00266-2. [PMID: 39289087 PMCID: PMC11910377 DOI: 10.1016/j.jvoice.2024.08.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 08/14/2024] [Accepted: 08/15/2024] [Indexed: 09/19/2024]
Abstract
OBJECTIVES Computer simulation of self-sustained oscillation of the vocal folds has been successful with application of simple Bernoulli-like driving pressures. As voice simulation is now applied to asymmetric vibration with complex mode structures that yield partial vocal fold contact, the driving pressures need refinement. STUDY DESIGN AND METHODOLOGY Two independent approaches were used to obtain pressure distributions. The first was a high-fidelity immersed-boundary method computation and the second was a series of pressure tap measurements on scaled-up physical models. Glottal geometries were based on normal surface modes of vibration. Samples are chosen from a large inventory of measured and calculated profiles. RESULTS Pressure distributions show the complexity that can exist in the driving forces on vocal fold surfaces. Qualitative similarity between computation and measurement was established for a variety of contact patterns, showing diverse pressure gradients in multiple directions. CONCLUSIONS Simplified Bernoulli approaches to glottal pressure distributions are defensible when a single flow channel is preserved in vocal fold oscillation. However, when there are contact islands that produce confluence or difluence of multiple airflow channels, the pressure gradients vary profoundly. Small quantitative differences were observed between measurement and calculation, primarily due to spatial sampling.
Collapse
Affiliation(s)
- Ingo R Titze
- Utah Center for Vocology, The University of Utah, Salt Lake City, Utah.
| | - Lynn Maxfield
- Utah Center for Vocology, The University of Utah, Salt Lake City, Utah
| | - Brian Manternach
- Utah Center for Vocology, The University of Utah, Salt Lake City, Utah
| | - Anil Palaparthi
- Utah Center for Vocology, The University of Utah, Salt Lake City, Utah
| | - Ronald Scherer
- Department of Communication Sciences and Disorders, Bowling Green State University, Bowling Green, Ohio
| | - Xiaojian Wang
- College of Engineering and Computing, The University of Maine, Orono, Maine
| | - Xudong Zheng
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, New York
| | - Qian Xue
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, New York
| |
Collapse
|
3
|
Perrine BL, Scherer RC. Using a vertical three-mass computational model of the vocal folds to match human phonation of three adult males. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:1505-1525. [PMID: 37695295 PMCID: PMC10497319 DOI: 10.1121/10.0020847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 08/10/2023] [Accepted: 08/19/2023] [Indexed: 09/12/2023]
Abstract
Computer models of phonation are used to study various parameters that are difficult to control, measure, and observe in human subjects. Imitating human phonation by varying the prephonatory conditions of computer models offers insight into the variations that occur across human phonatory production. In the present study, a vertical three-mass computer model of phonation [Perrine, Scherer, Fulcher, and Zhai (2020). J. Acoust. Soc. Am. 147, 1727-1737], driven by empirical pressures from a physical model of the vocal folds (model M5), with a vocal tract following the design of Ishizaka and Flanagan [(1972). Bell Sys. Tech. J. 51, 1233-1268] was used to match prolonged vowels produced by three male subjects using various pitch and loudness levels. The prephonatory conditions of tissue mass and tension, subglottal pressure, glottal diameter and angle, posterior glottal gap, false vocal fold gap, and vocal tract cross-sectional areas were varied in the model to match the model output with the fundamental frequency, alternating current airflow, direct current airflow, skewing quotient, open quotient, maximum flow negative derivative, and the first three formant frequencies from the human production. Parameters were matched between the model and human subjects with an average overall percent mismatch of 4.40% (standard deviation = 6.75%), suggesting a reasonable ability of the simple low dimensional model to mimic these variables.
Collapse
Affiliation(s)
- Brittany L Perrine
- Department of Communication Sciences and Disorders, Baylor University, One Bear Place #97332, Waco, Texas 76798, USA
| | - Ronald C Scherer
- Department of Communication Sciences and Disorders, Bowling Green State University, Ridge Street, Bowling Green, Ohio 43403, USA
| |
Collapse
|
4
|
van Mersbergen M, Marchetta J, Foti D, Pillow E, Dasgupta A, Cain C, Morvant S. Comparison of Aerosol Emissions during Specific Speech Tasks. J Voice 2023:S0892-1997(23)00156-X. [PMID: 37423794 DOI: 10.1016/j.jvoice.2023.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 05/09/2023] [Accepted: 05/09/2023] [Indexed: 07/11/2023]
Abstract
OBJECTIVES/HYPOTHESIS Recent investigations into the behavior of aerosolized emissions from the oral cavity have shown that particulate emissions do indeed occur during speech. To date, there is little information about the relative contribution of different speech sounds in producing particle emissions in a free field. This study compares airborne aerosol generation in participants producing isolated speech sounds: fricative consonants, plosive consonants, and vowel sounds. STUDY DESIGN Prospective, reversal experimental design, where each participant served as their own control and all participants were exposed to all stimuli. METHODS While participants produced isolated speech tasks, a planar beam of laser light, a high-speed camera, and image software calculated the number of particulates detected over time. This study compared airborne aerosols emitted by human participants at a distance of 2.54 cm between the laser sheet and the mouth. RESULTS Statistically significant increases in particulate count over ambient dust distribution for all speech sounds. When collapsed across loudness levels, emitted particles in vowel sounds were statistically greater than consonants, suggesting that mouth opening, as opposed to the place of vocal tract constriction or manner of sound production, might also be influential in the degree to which particulates become aerosolized during speech. CONCLUSIONS The results of this research will inform boundary conditions for computational models of aerosolized particulates during speech.
Collapse
Affiliation(s)
- Miriam van Mersbergen
- School of Communication Sciences and Disorders, University of Memphis, Memphis, Tennessee; Department of Otolaryngology, Head and Neck Surgery, University of Tennessee Health Sciences Center, Memphis, Tennessee; Institute for Intelligent Systems, University of Memphis, Tennessee.
| | - Jeffrey Marchetta
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Daniel Foti
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Eric Pillow
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Apratim Dasgupta
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Chandler Cain
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Stephen Morvant
- Department of Otolaryngology, Head and Neck Surgery, University of Tennessee Health Sciences Center, Memphis, Tennessee
| |
Collapse
|
5
|
Schoder S, Kraxberger F, Falk S, Wurzinger A, Roppert K, Kniesburges S, Döllinger M, Kaltenbacher M. Error detection and filtering of incompressible flow simulations for aeroacoustic predictions of human voice. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:1425. [PMID: 36182323 DOI: 10.1121/10.0013778] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 08/09/2022] [Indexed: 06/16/2023]
Abstract
The presented filtering technique is proposed to detect errors and correct outliers inside the acoustic sources, respectively, the first time derivative of the incompressible pressure obtained from large eddy simulations with prescribed vocal fold motion using overlay mesh methods. Regarding the perturbed convective wave equation, the time derivative of the incompressible pressure is the primary sound source in the human phonation process. However, the incompressible pressure can be erroneous and have outliers when fulfilling the divergence-free constraint of the velocity field. This error is primarily occurring for non-conserving prescribed vocal fold motions. Therefore, the method based on a continuous stationary random process was designed to detect rare events in the time derivative of the pressure. The detected events are then localized and treated by a defined window function to increase their probability. As a consequence, the data quality of the non-linearly filtered data is enhanced significantly. Furthermore, the proposed method can also be used to assess convergence of the aeroacoustic source terms, and detect regions and time intervals, which show a non-converging behavior by an impulse-like structure.
Collapse
Affiliation(s)
- Stefan Schoder
- Institute of Fundamentals and Theory in Electrical Engineering (IGTE), Graz University of Technology, Inffeldgasse 18, 8010 Graz, Austria
| | - Florian Kraxberger
- Institute of Fundamentals and Theory in Electrical Engineering (IGTE), Graz University of Technology, Inffeldgasse 18, 8010 Graz, Austria
| | - Sebastian Falk
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Waldstraße 1, 91054 Erlangen, Germany
| | - Andreas Wurzinger
- Institute of Fundamentals and Theory in Electrical Engineering (IGTE), Graz University of Technology, Inffeldgasse 18, 8010 Graz, Austria
| | - Klaus Roppert
- Institute of Fundamentals and Theory in Electrical Engineering (IGTE), Graz University of Technology, Inffeldgasse 18, 8010 Graz, Austria
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Waldstraße 1, 91054 Erlangen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Waldstraße 1, 91054 Erlangen, Germany
| | - Manfred Kaltenbacher
- Institute of Fundamentals and Theory in Electrical Engineering (IGTE), Graz University of Technology, Inffeldgasse 18, 8010 Graz, Austria
| |
Collapse
|
6
|
Zhou C, Zhang L, Wu Y, Zhang X, Wu D, Tao Z. Effects of Sulcus Vocalis Depth on Phonation in Three-Dimensional Fluid-Structure Interaction Laryngeal Models. Appl Bionics Biomech 2021; 2021:6662625. [PMID: 33897828 PMCID: PMC8052148 DOI: 10.1155/2021/6662625] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 03/19/2021] [Accepted: 03/29/2021] [Indexed: 11/29/2022] Open
Abstract
Sulcus vocalis is an indentation parallel to the edge of vocal fold, which may extend into the cover and ligament layer of the vocal fold or deeper. The effects of sulcus vocalis depth d on phonation and the vocal cord vibrations are investigated in this study. The three-dimensional laryngeal models were established for healthy vocal folds (0 mm) and different types of sulcus vocalis with the typical depth of 1 mm, 2 mm, and 3 mm. These models with fluid-structure interaction (FSI) are computed numerically by sequential coupling method, which includes an immersed boundary method (IBM) for modelling the glottal airflow, a finite-element method (FEM) for modelling vocal fold tissue. The results show that a deeper sulcus vocalis in the cover layer decreases the vibrating frequency of vocal folds and expands the prephonatory glottal half-width which increases the phonation threshold pressure. The larger sulcus vocalis depth makes vocal folds difficult to vibrate and phonate. The effects of sulcus vocalis depth suggest that the feature such as phonation threshold pressure could assist in the detection of healthy vocal folds and different types of sulcus vocalis.
Collapse
Affiliation(s)
- Changwei Zhou
- School of Optoelectronic Science and Engineering, Soochow University, Suzhou 215000, China
| | - Lili Zhang
- School of Optoelectronic Science and Engineering, Soochow University, Suzhou 215000, China
| | - Yuanbo Wu
- School of Optoelectronic Science and Engineering, Soochow University, Suzhou 215000, China
| | - Xiaojun Zhang
- School of Optoelectronic Science and Engineering, Soochow University, Suzhou 215000, China
| | - Di Wu
- School of Optoelectronic Science and Engineering, Soochow University, Suzhou 215000, China
| | - Zhi Tao
- School of Optoelectronic Science and Engineering, Soochow University, Suzhou 215000, China
| |
Collapse
|
7
|
Falk S, Kniesburges S, Schoder S, Jakubaß B, Maurerlehner P, Echternach M, Kaltenbacher M, Döllinger M. 3D-FV-FE Aeroacoustic Larynx Model for Investigation of Functional Based Voice Disorders. Front Physiol 2021; 12:616985. [PMID: 33762964 PMCID: PMC7982522 DOI: 10.3389/fphys.2021.616985] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 02/09/2021] [Indexed: 12/02/2022] Open
Abstract
For the clinical analysis of underlying mechanisms of voice disorders, we developed a numerical aeroacoustic larynx model, called simVoice, that mimics commonly observed functional laryngeal disorders as glottal insufficiency and vibrational left-right asymmetries. The model is a combination of the Finite Volume (FV) CFD solver Star-CCM+ and the Finite Element (FE) aeroacoustic solver CFS++. simVoice models turbulence using Large Eddy Simulations (LES) and the acoustic wave propagation with the perturbed convective wave equation (PCWE). Its geometry corresponds to a simplified larynx and a vocal tract model representing the vowel /a/. The oscillations of the vocal folds are externally driven. In total, 10 configurations with different degrees of functional-based disorders were simulated and analyzed. The energy transfer between the glottal airflow and the vocal folds decreases with an increasing glottal insufficiency and potentially reflects the higher effort during speech for patients being concerned. This loss of energy transfer may also have an essential influence on the quality of the sound signal as expressed by decreasing sound pressure level (SPL), Cepstral Peak Prominence (CPP), and Vocal Efficiency (VE). Asymmetry in the vocal fold oscillations also reduces the quality of the sound signal. However, simVoice confirmed previous clinical and experimental observations that a high level of glottal insufficiency worsens the acoustic signal quality more than oscillatory left-right asymmetry. Both symptoms in combination will further reduce the quality of the sound signal. In summary, simVoice allows for detailed analysis of the origins of disordered voice production and hence fosters the further understanding of laryngeal physiology, including occurring dependencies. A current walltime of 10 h/cycle is, with a prospective increase in computing power, auspicious for a future clinical use of simVoice.
Collapse
Affiliation(s)
- Sebastian Falk
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Schoder
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Paul Maurerlehner
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Manfred Kaltenbacher
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
8
|
Calvache C, Solaque L, Velasco A, Peñuela L. Biomechanical Models to Represent Vocal Physiology: A Systematic Review. J Voice 2021; 37:465.e1-465.e18. [PMID: 33678534 DOI: 10.1016/j.jvoice.2021.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/26/2021] [Accepted: 02/02/2021] [Indexed: 11/24/2022]
Abstract
Biomechanical modeling allows obtaining information on physical phenomena that cannot be directly observed. This study aims to review models that represent voice production. A systematic review of the literature was conducted using PubMed/Medline, SCOPUS, and IEEE Xplore databases. To select the papers, we used the protocol PRISMA Statement. A total of 53 publications were included in this review. This article considers a taxonomic classification of models found in the literature. We propose four categories in the taxonomy: (1) Models representing the Source (Vocal folds); (2) Models representing the Filter (Vocal Tract); (3) Models representing the Source - Filter Interaction; and (4) Models representing the Airflow - Source Interaction. We include a bibliographic analysis with the evolution of the publications per category. We provide an analysis of the number as well of publications in journals per year. Moreover, we present an analysis of the term occurrence and its frequency of usage, as found in the literature. In each category, different types of vocal production models are mentioned and analyzed. The models account for the analysis of evidence about aerodynamic, biomechanical, and acoustic phenomena and their correlation with the physiological processes involved in the production of the human voice. This review gives an insight into the state of the art related to the mathematical modeling of voice production, analyzed from the viewpoint of vocal physiology.
Collapse
Affiliation(s)
- Carlos Calvache
- Vocology Center, Bogotá, Colombia; Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia.
| | - Leonardo Solaque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Alexandra Velasco
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Lina Peñuela
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
9
|
Li Z, Chen Y, Chang S, Rousseau B, Luo H. A one-dimensional flow model enhanced by machine learning for simulation of vocal fold vibration. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1712. [PMID: 33765799 PMCID: PMC7954577 DOI: 10.1121/10.0003561] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 01/25/2021] [Accepted: 02/01/2021] [Indexed: 06/02/2023]
Abstract
A one-dimensional (1D) unsteady and viscous flow model that is derived from the momentum and mass conservation equations is described, and to enhance this physics-based model, a machine learning approach is used to determine the unknown modeling parameters. Specifically, an idealized larynx model is constructed and ten cases of three-dimensional (3D) fluid-structure interaction (FSI) simulations are performed. The flow data are then extracted to train the 1D flow model using a sparse identification approach for nonlinear dynamical systems. As a result of training, we obtain the analytical expressions for the entrance effect and pressure loss in the glottis, which are then incorporated in the flow model to conveniently handle different glottal shapes due to vocal fold vibration. We apply the enhanced 1D flow model in the FSI simulation of both idealized vocal fold geometries and subject-specific anatomical geometries reconstructed from the magnetic resonance imaging images of rabbits' larynges. The 1D flow model is evaluated in both of these setups and shown to have robust performance. Therefore, it provides a fast simulation tool that is superior to the previous 1D models.
Collapse
Affiliation(s)
- Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Ye Chen
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Siyuan Chang
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Bernard Rousseau
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania 15213, USA
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| |
Collapse
|
10
|
Schoder S, Weitz M, Maurerlehner P, Hauser A, Falk S, Kniesburges S, Döllinger M, Kaltenbacher M. Hybrid aeroacoustic approach for the efficient numerical simulation of human phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1179. [PMID: 32113301 DOI: 10.1121/10.0000785] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 02/05/2020] [Indexed: 06/10/2023]
Abstract
A hybrid aeroacoustic approach was developed for the efficient numerical computation of human phonation. In the first step, an incompressible flow simulation on a three-dimensional (3 D) computational grid, which is capable of resolving all relevant turbulent scales, is performed using STARCCM+ and finite volume method. In the second step, the acoustic source terms on the flow grid are computed and a conservative interpolation to the acoustic grid is performed. Finally, the perturbed convective wave equation is solved to obtain the acoustic field in 3 D with the finite element solver CFS++. Thereby, the conservative transformation of the acoustic sources from the flow grid to the acoustic grid is a key step to allow coarse acoustic grids without reducing accuracy. For this transformation, two different interpolation strategies are compared and grid convergence is assessed. Overall, 16 simulation setups are compared. The initial (267 000 degrees of freedom) and the optimized (21 265 degrees of freedom) simulation setup were validated by measurements of a synthetic larynx model. To conclude, the total computational time of the acoustic simulation is reduced by 95% compared to the initial simulation setup without a significant reduction of accuracy, being 7%, in the frequency range of interest.
Collapse
Affiliation(s)
- Stefan Schoder
- Institute of Mechanics and Mechatronics, TU Wien, Austria
| | - Michael Weitz
- Institute of Mechanics and Mechatronics, TU Wien, Austria
| | | | | | - Sebastian Falk
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Germany
| | | |
Collapse
|
11
|
Rios G, Morrison RJ, Song Y, Fernando SJ, Wootten C, Gelbard A, Luo H. Computational Fluid Dynamics Analysis of Surgical Approaches to Bilateral Vocal Fold Immobility. Laryngoscope 2019; 130:E57-E64. [PMID: 30883777 DOI: 10.1002/lary.27925] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 01/04/2019] [Accepted: 02/20/2019] [Indexed: 11/06/2022]
Abstract
OBJECTIVES Bilateral vocal fold immobility (BVFI) is a rare and life-threatening condition in which both vocal folds are fixed, resulting in airway obstruction associated with life-threatening respiratory compromise. Treatment of BVFI is largely surgical and remains an unsatisfactory compromise between voice, breathing, and swallowing. No comparisons between currently employed techniques currently exist. We sought to employ computational fluid dynamics (CFD) modeling to delineate the optimal surgical approach for BVFI. METHODS Utilizing clinical computed tomography of BVFI subjects, coupled with image analytics employing CFD models and subject pulmonary function data, we compared the airflow features in the baseline pathologic states and changes seen between endoscopic cordotomy, endoscopic suture lateralization, and posterior cricoid expansion. RESULTS CFD modeling demonstrated that the greatest airflow velocity occurs through the posterior glottis on inspiration and anterior glottis on expiration in both the normal condition and in BVFI. Glottic airflow velocity and resistance were significantly higher in the BVFI condition compared to normal. Geometric indices (cross-sectional area of airway) were lower in posterior cricoid expansion surgery when compared to alternate surgical approaches. CFD measures (airflow velocity and resistance) improved with all surgical approaches but were superior with posterior cricoid expansion. CONCLUSION CFD modeling can provide discrete, quantitative assessment of the airflow through the laryngeal inlet, and offers insights into the pathophysiology and changes that occur after surgery for BVFI. LEVEL OF EVIDENCE NA. Laryngoscope, 130:E57-E64, 2020.
Collapse
Affiliation(s)
- Gabriel Rios
- Department of Mechanical Engineering, School of Engineering, Vanderbilt University, Nashville, Tennessee
| | - Robert J Morrison
- Department of Otolaryngology, School of Medicine, Vanderbilt University, Nashville, Tennessee.,Department of Otolaryngology-Head & Neck Surgery, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, U.S.A
| | - Yi Song
- Department of Mechanical Engineering, School of Engineering, Vanderbilt University, Nashville, Tennessee
| | - Shanik J Fernando
- Department of Otolaryngology, School of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Christopher Wootten
- Department of Otolaryngology, School of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Alexander Gelbard
- Department of Otolaryngology, School of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Haoxiang Luo
- Department of Mechanical Engineering, School of Engineering, Vanderbilt University, Nashville, Tennessee
| |
Collapse
|
12
|
Gómez P, Schützenberger A, Semmler M, Döllinger M. Laryngeal Pressure Estimation With a Recurrent Neural Network. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2018; 7:2000111. [PMID: 30680252 PMCID: PMC6331197 DOI: 10.1109/jtehm.2018.2886021] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 10/24/2018] [Accepted: 11/30/2018] [Indexed: 11/24/2022]
Abstract
Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed ex vivo video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection.
Collapse
Affiliation(s)
- Pablo Gómez
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| | - Marion Semmler
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| |
Collapse
|