1
|
Inoue T, Shiozawa K, Matsumoto T, Kanaya M, Tokuda IT. Nonlinear dynamics and chaos in a vocal-ventricular fold system. CHAOS (WOODBURY, N.Y.) 2024; 34:023134. [PMID: 38386906 DOI: 10.1063/5.0155215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 01/27/2024] [Indexed: 02/24/2024]
Abstract
In humans, ventricular folds are located superiorly to the vocal folds. Under special circumstances such as voice pathology or singing, they vibrate together with the vocal folds to contribute to the production of voice. In the present study, experimental data measured from physical models of the vocal and ventricular folds were analyzed in the light of nonlinear dynamics. The physical models provide a useful experimental framework to study the biomechanics of human vocalizations. Of particular interest in this experiment are co-oscillations of the vocal and ventricular folds, occasionally accompanied by irregular dynamics. We show that such a system can be regarded as two coupled oscillators, which give rise to various cooperative behaviors such as synchronized oscillations with a 1:1 or 1:2 frequency ratio and desynchronized oscillations with torus or chaos. The insight gained from the view of nonlinear dynamics should be of significant use for the diagnosis of voice pathologies, such as ventricular fold dysphonia.
Collapse
Affiliation(s)
- Takumi Inoue
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Kota Shiozawa
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Takuma Matsumoto
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Mayuka Kanaya
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Isao T Tokuda
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| |
Collapse
|
2
|
Tseng WH, Chiu HL, Hsiao TY, Yang TL, Shih PJ. Identification and analysis of Nonlinear behaviors of vocal fold biomechanics during phonation to assess efficacy of surgery for benign laryngeal Diseases. Comput Biol Med 2024; 169:107946. [PMID: 38176211 DOI: 10.1016/j.compbiomed.2024.107946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 12/21/2023] [Accepted: 01/01/2024] [Indexed: 01/06/2024]
Abstract
BACKGROUND Current voice assessments focus on perceptive evaluation and acoustic analysis. The interaction of vocal tract pressure (PVT) and vocal fold (VF) vibrations are important for volume and pitch control. However, there are currently little non-invasive ways to measure PVT. Limited information has been provided by previous human trials, and interactions between PVT and VF vibrations and the potential clinical application remain unclear. Here, we propose a non-invasive method for monitoring the nonlinear characteristics of PVT and VF vibrations, analyze voices from pathological and healthy individuals, and evaluate treatment efficacy. METHOD Healthy volunteers and patients with benign laryngeal lesions were recruited for this study. PVT was estimated using an airflow interruption method, VF vibrational frequency was calculated from accelerometer signals, and nonlinear relationships between PVT and VF vibrations were analyzed. Results from healthy volunteers and patients, as well as pre- and post-operation for the patients, were compared. RESULTS For healthy volunteers, nonlinearity was exhibited as an initial increase and then prompt decrease in vibrational frequency at the end of phonation, coinciding with PVT equilibrating with the subglottal pressure upon airflow interruption. For patients, nonlinearity was present throughout the phonation period pre-operatively, but showed a similar trend to healthy volunteers post-operatively. CONCLUSION This novel method simultaneously monitors PVT and VF vibration and helps clarify the role of PVT. The results demonstrate differences in nonlinear characteristics between healthy volunteers and patients, and pre-/post-operation in patients. The method may serve as an analysis tool for clinicians to assess pathological phonation and treatment efficacy.
Collapse
Affiliation(s)
- Wen-Hsuan Tseng
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan; Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan
| | - Hsiang-Ling Chiu
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan
| | - Tzu-Yu Hsiao
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan
| | - Tsung-Lin Yang
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan; Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan; Research Center for Developmental Biology and Regenerative Medicine, National Taiwan University, Taipei, Taiwan
| | - Po-Jen Shih
- Department of Biomedical Engineering, National Taiwan University, Taipei, Taiwan.
| |
Collapse
|
3
|
de Oliveira Florencio V, Almeida AA, Balata P, Nascimento S, Brockmann-Bauser M, Lopes LW. Differences and Reliability of Linear and Nonlinear Acoustic Measures as a Function of Vocal Intensity in Individuals With Voice Disorders. J Voice 2023; 37:663-681. [PMID: 34116889 DOI: 10.1016/j.jvoice.2021.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 03/30/2021] [Accepted: 04/05/2021] [Indexed: 11/17/2022]
Abstract
PURPOSE Linear acoustic indices are significantly influenced by speaking voice intensity. The main aim of this work was to compare acoustic measures based on linear and nonlinear models in different speaking voice intensity levels and to analyze the reliability of those measures in different intensity levels in subjects with voice disorders. METHODS 435 samples from subjects (314 women, 121 men with a mean age of 41.07 ± 13.73) diagnosed with various voice disorders were used. In total, 17 acoustic measures were derived from the vowel /ɛ/ sustained at three intensity levels (soft, comfortable, and loud). Five were linear (standard deviation of the fundamental frequency (f0), jitter, shimmer, harmonics-to-noise ratio (HNR) and smoothed cepstral peak prominence (CPPS)), and twelve were nonlinear measures, namely correlation dimension (D2), correlation entropy (H2), first minimum of the mutual information function (FMMI), relative entropy (ENTR-R), largest Lyapunov exponent (Lyap), determinism (DET), transitivity, mean diagonal line length (Lmed), Shannon entropy (ENTR-S), mean length of vertical structures, also known as trapping time (TT), laminarity (LAM) and recurrence period density entropy (RPDE). Differences between speaking voice intensity levels were assessed by Friedman's test and Nemenyi as posthoc test. Intraclass correlation coefficient was used to investigate if each acoustic measure remains in agreement (reliability) between different voice intensity levels. RESULTS There were significant differences in all acoustic measures about vocal intensity level (P < 0.001). Intraclass correlation coefficient was very good for HNR (>0.61) and good for Lyap, DET, ENTR-S, Lmed, RPDE, and TT (0.41-0.60). CONCLUSIONS All acoustic measures varied as a function of vocal intensity in voice disordered adults, while this relation was different for linear and nonlinear measures. Only the measures HNR, Lyap, DET, ENTR-S, Lmed, RPDE and TT had an acceptable reliability between different voice intensity levels. Therefore, patient`s voice SPL should be controlled or indicated during acoustic vocal assessment.
Collapse
Affiliation(s)
- Vanessa de Oliveira Florencio
- Graduate Program in Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), João Pessoa, Paraíba, Brazil
| | - Anna Alice Almeida
- Department of Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), Graduate Program in Speech, Language and Hearing Sciences, UFPB, João Pessoa, Paraíba, Brazil
| | - Patrícia Balata
- Department of Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), Graduate Program in Speech, Language and Hearing Sciences, UFPB, João Pessoa, Paraíba, Brazil
| | - Silvana Nascimento
- Electrical Engineering Department, Instituto Federal de Educação, Ciência e Tecnologia (IFPB), João Pessoa, Paraíba, Brazil
| | - Meike Brockmann-Bauser
- Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Frauenklinikstrasse 24, 8091 Zurich, Switzerland and University of Zurich, Zurich, Switzerland
| | - Leonardo Wanderley Lopes
- Department of Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), Graduate Program in Speech, Language and Hearing Sciences, UFPB, João Pessoa, Paraíba, Brazil.
| |
Collapse
|
4
|
Zhao W, Singh R. Deriving Vocal Fold Oscillation Information from Recorded Voice Signals Using Models of Phonation. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1039. [PMID: 37509986 PMCID: PMC10378572 DOI: 10.3390/e25071039] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 07/03/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023]
Abstract
During phonation, the vocal folds exhibit a self-sustained oscillatory motion, which is influenced by the physical properties of the speaker's vocal folds and driven by the balance of bio-mechanical and aerodynamic forces across the glottis. Subtle changes in the speaker's physical state can affect voice production and alter these oscillatory patterns. Measuring these can be valuable in developing computational tools that analyze voice to infer the speaker's state. Traditionally, vocal fold oscillations (VFOs) are measured directly using physical devices in clinical settings. In this paper, we propose a novel analysis-by-synthesis approach that allows us to infer the VFOs directly from recorded speech signals on an individualized, speaker-by-speaker basis. The approach, called the ADLES-VFT algorithm, is proposed in the context of a joint model that combines a phonation model (with a glottal flow waveform as the output) and a vocal tract acoustic wave propagation model such that the output of the joint model is an estimated waveform. The ADLES-VFT algorithm is a forward-backward algorithm which minimizes the error between the recorded waveform and the output of this joint model to estimate its parameters. Once estimated, these parameter values are used in conjunction with a phonation model to obtain its solutions. Since the parameters correlate with the physical properties of the vocal folds of the speaker, model solutions obtained using them represent the individualized VFOs for each speaker. The approach is flexible and can be applied to various phonation models. In addition to presenting the methodology, we show how the VFOs can be quantified from a dynamical systems perspective for classification purposes. Mathematical derivations are provided in an appendix for better readability.
Collapse
Affiliation(s)
- Wayne Zhao
- Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Rita Singh
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
5
|
Liu B, Raj H, Klein L, Jiang JJ. Evaluating the Voice Type Component Distributions of Excised Larynx Phonations at Three Subglottal Pressures. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1447-1456. [PMID: 33887153 PMCID: PMC8608150 DOI: 10.1044/2021_jslhr-20-00429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 10/29/2020] [Accepted: 01/06/2021] [Indexed: 06/12/2023]
Abstract
Purpose The excised canine larynx provides an advantageous experimental framework in the study of voice physiology. In recent years, signal processing methods have been applied to analyze phonations in excised canine larynx experiments. However, phonations have a highly complex and nonstationary nature corresponding to different proportions of regular and chaotic signal elements. Current nonlinear dynamic methods that are used to assess the degree of irregularity in the voice fail to recognize the distribution of voice type components (VTCs). Method Based on measures of intrinsic dimension, this article presents a method to analyze the VTC distribution of phonations in excised canine larynx experiments. Thirty-nine phonation samples from 13 excised canine larynges at three different subglottal pressures were analyzed. Results Phonation produced with subglottal pressures above phonation instability pressure (PIP) and below phonation threshold pressure (PTP) resulted in high proportions of Voice Types 3 and 4, characterized by chaotic and noisy signals. Phonation produced with pressure between PTP and PIP contained mostly Type 1 voice, characterized by a regular and nearly periodic signal. Mean proportions of all VTCs varied significantly in comparisons of phonations produced with Sub-PTP and PTP as well as in comparisons of phonations produced with PTP and PIP. Conclusions Across all VTCs, the VTC profiles of normal and abnormal phonation differ significantly. Normal phonation is strongly associated with VTC1 (Voice Type Component 1), whereas abnormal phonation exhibits increased VTC4 (Voice Type Component 4). The study further demonstrates the ability of intrinsic dimension to successfully detect multiple voice types in an acoustic signal and highlights the need for expanded use of intrinsic dimension in human voice. Supplemental Material https://doi.org/10.23641/asha.14417585.
Collapse
Affiliation(s)
- Boquan Liu
- School of Humanities, Shanghai Jiao Tong University, China
- Division of Otolaryngology—Head & Neck Surgery, Department of Surgery, University of Wisconsin–Madison School of Medicine and Public Health
| | - Hayley Raj
- Division of Otolaryngology—Head & Neck Surgery, Department of Surgery, University of Wisconsin–Madison School of Medicine and Public Health
| | - Logan Klein
- Division of Otolaryngology—Head & Neck Surgery, Department of Surgery, University of Wisconsin–Madison School of Medicine and Public Health
| | - Jack J. Jiang
- Division of Otolaryngology—Head & Neck Surgery, Department of Surgery, University of Wisconsin–Madison School of Medicine and Public Health
| |
Collapse
|
6
|
Performance of Different Acoustic Measures to Discriminate Individuals With and Without Voice Disorders. J Voice 2020; 36:487-498. [PMID: 32798120 DOI: 10.1016/j.jvoice.2020.07.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 07/11/2020] [Accepted: 07/13/2020] [Indexed: 11/20/2022]
Abstract
The goal of this study is to compare and combine different acoustic features in discriminating subjects with and without voice disorders. A database of 484 adult patients participated in the research. All subjects recorded a sustained vowel /Ɛ/ and underwent a laryngoscopic examination of the larynx. From the results of the laryngeal examination performed by a physician and the auditory-perceptual judgment performed by a Speech-Language Pathologist, the subjects were allocated to the group with (n = 52) and without (n = 432) voice disorder. Four types of acoustic features were used: traditional measures, cepstral measures, nonlinear measures, and recurrence quantification measures. Recordings comprised the emission of the vowel /ε/. Quadratic discriminant analysis was used as classifier. Individual features in the context of traditional, cepstral, and recurrence quantification measures achieved an acceptable performance of ≥70%. Combination of measures improved the classifier performance. The best classification result (86.43% accuracy) was obtained by combining traditional linear and recurrence quantification measures. Results shown that Traditional, Cepstral, and recurrence quantification measures are promising features that capture meaningful information about voice production, which provides good classification performances. The findings of this study can be used to develop a computational tool for voice disorders diagnosis and monitoring.
Collapse
|
7
|
Lopes LW, Vieira VJD, Costa SLDNC, Correia SÉN, Behlau M. Effectiveness of Recurrence Quantification Measures in Discriminating Subjects With and Without Voice Disorders. J Voice 2020; 34:208-220. [DOI: 10.1016/j.jvoice.2018.09.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Revised: 09/05/2018] [Accepted: 09/06/2018] [Indexed: 10/28/2022]
|
8
|
Raine J, Pisanski K, Bond R, Simner J, Reby D. Human roars communicate upper-body strength more effectively than do screams or aggressive and distressed speech. PLoS One 2019; 14:e0213034. [PMID: 30830931 PMCID: PMC6398857 DOI: 10.1371/journal.pone.0213034] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Accepted: 02/13/2019] [Indexed: 11/27/2022] Open
Abstract
Despite widespread evidence that nonverbal components of human speech (e.g., voice pitch) communicate information about physical attributes of vocalizers and that listeners can judge traits such as strength and body size from speech, few studies have examined the communicative functions of human nonverbal vocalizations (such as roars, screams, grunts and laughs). Critically, no previous study has yet to examine the acoustic correlates of strength in nonverbal vocalisations, including roars, nor identified reliable vocal cues to strength in human speech. In addition to being less acoustically constrained than articulated speech, agonistic nonverbal vocalizations function primarily to express motivation and emotion, such as threat, and may therefore communicate strength and body size more effectively than speech. Here, we investigated acoustic cues to strength and size in roars compared to screams and speech sentences produced in both aggressive and distress contexts. Using playback experiments, we then tested whether listeners can reliably infer a vocalizer's actual strength and height from roars, screams, and valenced speech equivalents, and which acoustic features predicted listeners' judgments. While there were no consistent acoustic cues to strength in any vocal stimuli, listeners accurately judged inter-individual differences in strength, and did so most effectively from aggressive voice stimuli (roars and aggressive speech). In addition, listeners more accurately judged strength from roars than from aggressive speech. In contrast, listeners' judgments of height were most accurate for speech stimuli. These results support the prediction that vocalizers maximize impressions of physical strength in aggressive compared to distress contexts, and that inter-individual variation in strength may only be honestly communicated in vocalizations that function to communicate threat, particularly roars. Thus, in continuity with nonhuman mammals, the acoustic structure of human aggressive roars may have been selected to communicate, and to some extent exaggerate, functional cues to physical formidability.
Collapse
Affiliation(s)
- Jordan Raine
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
| | - Katarzyna Pisanski
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
- Equipe Neuro-Ethologie Sensorielle, ENES/Neuro-PSI CNRS UMR 9197, Bioacoustics Team, University of Lyon/Saint-Etienne, Saint-Etienne, France
| | - Rod Bond
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
| | - Julia Simner
- MULTISENSE Research Lab, University of Sussex, Brighton, United Kingdom
| | - David Reby
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
9
|
Croake DJ, Andreatta RD, Stemple JC. Descriptive Analysis of the Interactive Patterning of the Vocalization Subsystems in Healthy Participants: A Dynamic Systems Perspective. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:215-228. [PMID: 30950696 DOI: 10.1044/2018_jslhr-s-17-0466] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Purpose Normative data for many objective voice measures are routinely used in clinical voice assessment; however, normative data reflect vocal output, but not vocalization process. The underlying physiologic processes of healthy phonation have been shown to be nonlinear and thus are likely different across individuals. Dynamic systems theory postulates that performance behaviors emerge from the nonlinear interplay of multiple physiologic components and that certain patterns are preferred and loosely governed by the interactions of physiology, task, and environment. The purpose of this study was to descriptively characterize the interactive nature of the vocalization subsystem triad in subjects with healthy voices and to determine if differing subgroups could be delineated to better understand how healthy voicing is physiologically generated. Method Respiratory kinematic, aerodynamic, and acoustic formant data were obtained from 29 individuals with healthy voices (21 female and eight male). Multivariate analyses were used to descriptively characterize the interactions among the subsystems that contributed to healthy voicing. Results Group data revealed representative measures of the 3 subsystems to be generally within the boundaries of established normative data. Despite this, 3 distinct clusters were delineated that represented 3 subgroups of individuals with differing subsystem patterning. Seven of the 9 measured variables in this study were found to be significantly different across at least 1 of the 3 subgroups indicating differing physiologic processes across individuals. Conclusion Vocal output in healthy individuals appears to be generated by distinct and preferred physiologic processes that were represented by 3 subgroups indicating that the process of vocalization is different among individuals, but not entirely idiosyncratic. Possibilities for these differences are explored using the framework of dynamic systems theory and the dynamics of emergent behaviors. A revised physiologic model of phonation that accounts for differences within and among the vocalization subsystems is described. Supplemental Material https://doi.org/10.23641/asha.7616462.
Collapse
Affiliation(s)
- Daniel J Croake
- Department of Communication Sciences and Disorders, University of Kentucky, Lexington
| | - Richard D Andreatta
- Department of Communication Sciences and Disorders, University of Kentucky, Lexington
| | - Joseph C Stemple
- Department of Communication Sciences and Disorders, University of Kentucky, Lexington
| |
Collapse
|
10
|
Affiliation(s)
- Jordan Raine
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK
| | - Katarzyna Pisanski
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK
| | - Julia Simner
- MULTISENSE Research Lab, School of Psychology, University of Sussex, Brighton, UK
| | - David Reby
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK
| |
Collapse
|
11
|
Lopes LW, da Silva KE, da Silva Evangelista D, Almeida AA, Silva POC, Lucero J, Behlau M. Performance of Phonatory Deviation Diagrams in Synthesized Voice Analysis. Folia Phoniatr Logop 2018; 69:246-260. [PMID: 29719283 DOI: 10.1159/000487941] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 02/23/2018] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE To analyze the performance of a phonatory deviation diagram (PDD) in discriminating the presence and severity of voice deviation and the predominant voice quality of synthesized voices. METHOD A speech-language pathologist performed the auditory-perceptual analysis of the synthesized voice (n = 871). The PDD distribution of voice signals was analyzed according to area, quadrant, shape, and density. RESULTS Differences in signal distribution regarding the PDD area and quadrant were detected when differentiating the signals with and without voice deviation and with different predominant voice quality. Differences in signal distribution were found in all PDD parameters as a function of the severity of voice disorder. CONCLUSION The PDD area and quadrant can differentiate normal voices from deviant synthesized voices. There are differences in signal distribution in PDD area and quadrant as a function of the severity of voice disorder and the predominant voice quality. However, the PDD area and quadrant do not differentiate the signals as a function of severity of voice disorder and differentiated only the breathy and rough voices from the normal and strained voices. PDD density is able to differentiate only signals with moderate and severe deviation. PDD shape shows differences between signals with different severities of voice deviation.
Collapse
Affiliation(s)
| | | | | | - Anna Alice Almeida
- Department of Speech Therapy, Universidade Federal da Paraíba - UFPB, João Pessoa, Brazil
| | | | - Jorge Lucero
- Department of Computer Science, Universidade de Brasília - UNB, Brasília, Brazil
| | - Mara Behlau
- Universidade Federal de São Paulo - UNIFESP and Centro de Estudos da Voz - CEV, São Paulo, São Paulo, Brazil
| |
Collapse
|
12
|
Affiliation(s)
- ISAO T. TOKUDA
- Department of Mechanical Engineering, Ritsumeikan University, Kusatsu
| |
Collapse
|
13
|
Lopes LW, Batista Simões L, Delfino da Silva J, da Silva Evangelista D, da Nóbrega e Ugulino AC, Oliveira Costa Silva P, Jefferson Dias Vieira V. Accuracy of Acoustic Analysis Measurements in the Evaluation of Patients With Different Laryngeal Diagnoses. J Voice 2017; 31:382.e15-382.e26. [DOI: 10.1016/j.jvoice.2016.08.015] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Revised: 08/20/2016] [Accepted: 08/23/2016] [Indexed: 11/29/2022]
|
14
|
Huang N, Zhang Y, Calawerts W, Jiang JJ. Optimized Nonlinear Dynamic Analysis of Pathologic Voices With Laryngeal Paralysis Based on the Minimum Embedding Dimension. J Voice 2017; 31:249.e1-249.e7. [DOI: 10.1016/j.jvoice.2016.07.021] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2016] [Revised: 07/21/2016] [Accepted: 07/25/2016] [Indexed: 10/21/2022]
|
15
|
Yang J, Wang X, Krane M, Zhang LT. Fully-coupled aeroelastic simulation with fluid compressibility - For application to vocal fold vibration. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING 2017; 315:584-606. [PMID: 29527067 PMCID: PMC5841474 DOI: 10.1016/j.cma.2016.11.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
In this study, a fully-coupled fluid-structure interaction model is developed for studying dynamic interactions between compressible fluid and aeroelastic structures. The technique is built based on the modified Immersed Finite Element Method (mIFEM), a robust numerical technique to simulate fluid-structure interactions that has capabilities to simulate high Reynolds number flows and handles large density disparities between the fluid and the solid. For accurate assessment of this intricate dynamic process between compressible fluid, such as air and aeroelastic structures, we included in the model the fluid compressibility in an isentropic process and a solid contact model. The accuracy of the compressible fluid solver is verified by examining acoustic wave propagations in a closed and an open duct, respectively. The fully-coupled fluid-structure interaction model is then used to simulate and analyze vocal folds vibrations using compressible air interacting with vocal folds that are represented as layered viscoelastic structures. Using physiological geometric and parametric setup, we are able to obtain a self-sustained vocal fold vibration with a constant inflow pressure. Parametric studies are also performed to study the effects of lung pressure and vocal fold tissue stiffness in vocal folds vibrations. All the case studies produce expected airflow behavior and a sustained vibration, which provide verification and confidence in our future studies of realistic acoustical studies of the phonation process.
Collapse
Affiliation(s)
- Jubiao Yang
- Department of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, United States
| | - Xingshi Wang
- Department of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, United States
| | - Michael Krane
- Applied Research Lab, Pennsylvania State University, United States
| | - Lucy T. Zhang
- Department of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, United States
| |
Collapse
|
16
|
Zhang LT, Yang J. Evaluation of aerodynamic characteristics of a coupled fluid-structure system using generalized Bernoulli's principle: An application to vocal folds vibration. JOURNAL OF COUPLED SYSTEMS AND MULTISCALE DYNAMICS 2016. [PMID: 29527541 DOI: 10.1166/jcsmd.2016.1114] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
In this work we explore the aerodynamics flow characteristics of a coupled fluid-structure interaction system using a generalized Bernoulli equation derived directly from the Cauchy momentum equations. Unlike the conventional Bernoulli equation where incompressible, inviscid, and steady flow conditions are assumed, this generalized Bernoulli equation includes the contributions from compressibility, viscous, and unsteadiness, which could be essential in defining aerodynamic characteristics. The application of the derived Bernoulli's principle is on a fully-coupled fluid-structure interaction simulation of the vocal folds vibration. The coupled system is simulated using the immersed finite element method where compressible Navier-Stokes equations are used to describe the air and an elastic pliable structure to describe the vocal fold. The vibration of the vocal fold works to open and close the glottal flow. The aerodynamics flow characteristics are evaluated using the derived Bernoulli's principles for a vibration cycle in a carefully partitioned control volume based on the moving structure. The results agree very well to experimental observations, which validate the strategy and its use in other types of flow characteristics that involve coupled fluid-structure interactions.
Collapse
Affiliation(s)
- Lucy T Zhang
- JEC 2049, Department of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, 110 8th St., Troy, NY 12180, USA
| | - Jubiao Yang
- JEC 2049, Department of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, 110 8th St., Troy, NY 12180, USA
| |
Collapse
|
17
|
Quantifying the Subharmonic Mucosal Wave in Excised Larynges via Digital Kymography. J Voice 2016; 31:123.e7-123.e13. [PMID: 27105856 DOI: 10.1016/j.jvoice.2016.01.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Accepted: 01/20/2016] [Indexed: 11/21/2022]
Abstract
OBJECTIVES In this paper, a nonlinear least squares fitting method was proposed to quantify subharmonic mucosal waves. STUDY DESIGN AND METHODS Subharmonic mucosal waves from 10 excised canine larynges were recorded using digital kymography and analyzed using nonlinear least squares and linear least squares methods. Amplitudes of fundamental and subharmonic mucosal wave components of right-upper, right-lower, left-upper, and left-lower vocal fold lips were calculated. Lastly, phase differences of fundamental and subharmonic components of the left and right vocal folds were compared. RESULTS The results showed that the nonlinear least squares analysis method provides a more effective complement to the linear fitting method for subharmonic mucosal wave extraction. There was a significant difference in amplitudes between the subharmonic and the fundamental components of mucosal waves (P < 0.05). The phase differences of the fundamental and the subharmonic components of the right and left vocal folds were not significantly different. CONCLUSIONS The application of the nonlinear least squares analysis method in digital kymography is useful for the characterization of subharmonic mucosal waves.
Collapse
|
18
|
Robertson D, Zañartu M, Cook D. Comprehensive, Population-Based Sensitivity Analysis of a Two-Mass Vocal Fold Model. PLoS One 2016; 11:e0148309. [PMID: 26845452 PMCID: PMC4742229 DOI: 10.1371/journal.pone.0148309] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 01/15/2016] [Indexed: 01/31/2023] Open
Abstract
Previous vocal fold modeling studies have generally focused on generating detailed data regarding a narrow subset of possible model configurations. These studies can be interpreted to be the investigation of a single subject under one or more vocal conditions. In this study, a broad population-based sensitivity analysis is employed to examine the behavior of a virtual population of subjects and to identify trends between virtual individuals as opposed to investigating a single subject or model instance. Four different sensitivity analysis techniques were used in accomplishing this task. Influential relationships between model input parameters and model outputs were identified, and an exploration of the model’s parameter space was conducted. Results indicate that the behavior of the selected two-mass model is largely dominated by complex interactions, and that few input-output pairs have a consistent effect on the model. Results from the analysis can be used to increase the efficiency of optimization routines of reduced-order models used to investigate voice abnormalities. Results also demonstrate the types of challenges and difficulties to be expected when applying sensitivity analyses to more complex vocal fold models. Such challenges are discussed and recommendations are made for future studies.
Collapse
Affiliation(s)
- Daniel Robertson
- Division of Engineering, New York University–Abu Dhabi, Abu Dhabi, United Arab Emirates
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaiso, Chile
| | - Douglas Cook
- Division of Engineering, New York University–Abu Dhabi, Abu Dhabi, United Arab Emirates
- * E-mail:
| |
Collapse
|
19
|
Song Z, Zhang Y, Wei C, Wang X. Inducing rostrum interfacial waves by fluid-solid coupling in a Chinese river dolphin (Lipotesvexillifer). Phys Rev E 2016; 93:012411. [PMID: 26871105 DOI: 10.1103/physreve.93.012411] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Indexed: 06/05/2023]
Abstract
Through numerically solving the appropriate wave equations, propagation of biosonar signals in a Chinese river dolphin (baiji) was studied. The interfacial waves along the rostrum-tissue interfaces, including both compressional (longitudinal) and shear (transverse) waves in the solid rostrum through fluid-solid coupling were examined. The baiji's rostrum was found to effect acoustic beam formation not only as an interfacial wave generator but also as a sound reflector. The wave propagation patterns in the solid rostrum were found to significantly change the wave movement through the bone. Vibrations in the rostrum, expressed in solid displacement, initially increased but eventually decreased from posterior to anterior sides, indicating a complex physical process. Furthermore, the comparisons among seven cases, including the combination of (1) the rostrum, melon, and air sacs; (2) rostrum-air sacs; (3) rostrum-melon; (4) only rostrum; (5) air sacs-melon; (6) only air sacs; and (7) only melon revealed that the cases including the rostrum were better able to approach the complete system by inducing rostrum-tissue interfacial waves and reducing the differences in main beam angle and -3 dB beam width. The interfacial waves in the rostrum were considered complementary with reflection to determine the obbligato role of the rostrum in the baiji's biosonar emission. The far-field beams formed from complete fluid-solid models and non-fluid-solid models were compared to reveal the effects brought by the consideration of shear waves of the solid structures of the baiji. The results may provide useful information for further understanding the role of the rostrum in this odontocete species.
Collapse
Affiliation(s)
- Zhongchang Song
- Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, Xiamen University, Xiamen 361005, China
- College of Ocean and Earth Sciences, Xiamen University, Xiamen 361005, China
| | - Yu Zhang
- Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, Xiamen University, Xiamen 361005, China
- College of Ocean and Earth Sciences, Xiamen University, Xiamen 361005, China
- State Key Laboratory of Acoustics, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China
| | - Chong Wei
- Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, Xiamen University, Xiamen 361005, China
- College of Ocean and Earth Sciences, Xiamen University, Xiamen 361005, China
- Hawaii Institute of Marine Biology, University of Hawaii, 46-007 Lilipuna Road, Kaneohe, Hawaii 96744, USA
| | - Xianyan Wang
- Third Institute of Oceanography, State Oceanic Administration, Xiamen 361005, Fujian, China
| |
Collapse
|
20
|
Luque J, Luque B, Lacasa L. Scaling and universality in the human voice. J R Soc Interface 2015; 12:20141344. [PMID: 25694542 PMCID: PMC4387524 DOI: 10.1098/rsif.2014.1344] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 01/29/2015] [Indexed: 11/12/2022] Open
Abstract
Speech is a distinctive complex feature of human capabilities. In order to understand the physics underlying speech production, in this work, we empirically analyse the statistics of large human speech datasets ranging several languages. We first show that during speech, the energy is unevenly released and power-law distributed, reporting a universal robust Gutenberg-Richter-like law in speech. We further show that such 'earthquakes in speech' show temporal correlations, as the interevent statistics are again power-law distributed. As this feature takes place in the intraphoneme range, we conjecture that the process responsible for this complex phenomenon is not cognitive, but it resides in the physiological (mechanical) mechanisms of speech production. Moreover, we show that these waiting time distributions are scale invariant under a renormalization group transformation, suggesting that the process of speech generation is indeed operating close to a critical point. These results are put in contrast with current paradigms in speech processing, which point towards low dimensional deterministic chaos as the origin of nonlinear traits in speech fluctuations. As these latter fluctuations are indeed the aspects that humanize synthetic speech, these findings may have an impact in future speech synthesis technologies. Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.
Collapse
Affiliation(s)
- Jordi Luque
- Telefonica Research, Edificio Telefonica-Diagonal 00, Barcelona, Spain
| | - Bartolo Luque
- Departamento de Matemática Aplicada y Estadística, EIAE, Universidad Politécnica de Madrid, Madrid, Spain
| | - Lucas Lacasa
- School of Mathematical Sciences, Queen Mary University of London, Mile End Road, London E14NS, UK
| |
Collapse
|
21
|
Cordeiro GF, Montagnoli AN, Ubrig MT, Menezes MHM, Tsuji DH. Comparison of Tongue and Lip Trills with Phonation of the Sustained Vowel /<i>ε</i>/ Regarding the Periodicity of the Electroglottographic Waveform and the Amplitude of the Electroglottographic Signal. ACTA ACUST UNITED AC 2015. [DOI: 10.4236/oja.2015.54018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
22
|
Wei C, Zhang Y, Au WWL. Simulation of ultrasound beam formation of baiji (Lipotes vexillifer) with a finite element model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:423-429. [PMID: 24993226 DOI: 10.1121/1.4883597] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The baiji (Lipotes vexillifer) of the Yangtze River possesses a sophisticated biosonar system. In this study, a finite element approach was used to numerically investigate the propagation of acoustic waves through the head of the Yangtze River dolphin, which possesses an inhomogeneous and complex structure. The acoustic intensity distribution predicted from models with and without the melon and/or skull showed that the emitted sound beam was narrow and formed a highly directed acoustic beam, and the skull and melon significantly enhanced the directional characteristics of the emitted sound. Finally, for a short duration impulsive source, the emitted sound pressure distributions were also simulated at different propagation times. The results provide useful information for better understanding the operation of the biosonar system in this rare and perhaps extinct animal.
Collapse
Affiliation(s)
- Chong Wei
- Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, College of Oceanography and Environmental Science, Xiamen University, Xiamen, 361100, People's Republic of China
| | - Yu Zhang
- Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, Xiamen University, Xiping Building, Xiangan South Road, Xiamen, 361005, People's Republic of China
| | - Whitlow W L Au
- Hawaii Institute of Marine Biology, University of Hawaii, 46-007 Lilipuna Road, Kaneohe, Hawaii 96744
| |
Collapse
|
23
|
Lopes LW, Costa SLDNC, Costa WCDA, Correia SÉN, Vieira VJD. Acoustic assessment of the voices of children using nonlinear analysis: proposal for assessment and vocal monitoring. J Voice 2014; 28:565-73. [PMID: 24836362 DOI: 10.1016/j.jvoice.2014.02.013] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Accepted: 02/25/2014] [Indexed: 11/18/2022]
Abstract
OBJECTIVE To analyze the accuracy of recurrence measurements, both isolated and combined, to assess the intensity of vocal disorders in children. METHOD A total of 93 children of both sexes (48 girls and 45 boys), aged between 3 and 10 years, participated. The vocal-deviation intensity was evaluated by the consensus of three speech therapists from the pronunciation of vowel /ε/ using the visual analog scale. In the acoustic analysis, eight recurrence plot characteristics were evaluated and extracted with neighborhood radius values that maintained the recurrence rate at 1%, 2%, 3%, 4%, and 5%. The classification was performed using quadratic discriminant analysis applied for individual and combined measurements. The performance was evaluated by measuring the accuracy, which related the cases correctly classified to all the analyzed cases. RESULTS In the classification cases concerning individual measure performance, the trapping time and maximum length of the diagonal lines showed the best classification potential to discriminate between healthy and disturbed voices, with accuracy rates above 80%. In the healthy and mild deviation cases, the trend (TREND) measure was also relevant. For the mild versus moderate deviation classification, the best performance was obtained by the TREND measure (85.00% ± 7.64%). A gain was obtained in the classification rate when the measures of recurrence were combined, reaching an accuracy of 95.00% ± 5.00%, for discriminating between healthy voices and those with mild deviation. CONCLUSIONS The measures of recurrence, either alone or combined, may be useful in detecting healthy and disturbed voices and in differentiating the intensity of vocal disorders in children.
Collapse
Affiliation(s)
- Leonardo Wanderley Lopes
- Department of Speech and Language Pathology, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil.
| | | | | | - Suzete Élida Nóbrega Correia
- Electrical Engineering Department, Instituto Federal de Educação, Ciência e Tecnologia, João Pessoa, Paraíba, Brazil
| | | |
Collapse
|
24
|
Xue Q, Zheng X, Mittal R, Bielamowicz S. Computational study of effects of tension imbalance on phonation in a three-dimensional tubular larynx model. J Voice 2014; 28:411-9. [PMID: 24725589 DOI: 10.1016/j.jvoice.2013.12.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 12/23/2013] [Indexed: 10/25/2022]
Abstract
OBJECTIVES The present study explores the use of a continuum-based computational model to investigate the effect of left-right tension imbalance on vocal fold (VF) vibrations and glottal aerodynamics, as well as its implication on phonation. The study allows us to gain new insights into the underlying physical mechanism of irregularities induced by VF tension imbalance associated with unilateral cricothyroid muscle paralysis. METHODS A three-dimensional simulation of glottal flow and VF dynamics in a tubular laryngeal model with tension imbalance was conducted by using a coupled flow-structure interaction computational model. Tension imbalance was modeled by reducing by 20% the Young's modulus of one of the VFs, while holding VF length constant. Effects of tension imbalance on vibratory characteristic of the VFs and on the time-varying properties of glottal airflow as well as the aerodynamic energy transfer are comprehensively analyzed. RESULTS AND CONCLUSIONS The analysis demonstrates that the continuum-based biomechanical model can provide a good description of phonatory dynamics in tension imbalance conditions. It is found that although 20% tension imbalance does not have noticeable effects on the fundamental frequency, it does lead to a larger glottal flow leakage and asymmetric vibrations of the two VFs. A detailed analysis of the energy transfer suggests that the majority of the energy is consumed by the lateral motion of the VFs and the net energy transferred to the softer fold is less than the one transferred to the normal fold.
Collapse
Affiliation(s)
- Qian Xue
- Department of Mechanical Engineering, University of Maine, Orono, Maine
| | - Xudong Zheng
- Department of Mechanical Engineering, University of Maine, Orono, Maine.
| | - Rajat Mittal
- Department of Mechanical Engineering, Johns Hopkins University, Baltimore, Maryland
| | - Steven Bielamowicz
- Division of Otolaryngology, The George Washington University, Washington, District of Columbia
| |
Collapse
|
25
|
Ankışhan H, Yılmaz D. Comparison of SVM and ANFIS for snore related sounds classification by using the largest Lyapunov exponent and entropy. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2013; 2013:238937. [PMID: 24194786 PMCID: PMC3806117 DOI: 10.1155/2013/238937] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Revised: 08/13/2013] [Accepted: 08/15/2013] [Indexed: 11/17/2022]
Abstract
Snoring, which may be decisive for many diseases, is an important indicator especially for sleep disorders. In recent years, many studies have been performed on the snore related sounds (SRSs) due to producing useful results for detection of sleep apnea/hypopnea syndrome (SAHS). The first important step of these studies is the detection of snore from SRSs by using different time and frequency domain features. The SRSs have a complex nature that is originated from several physiological and physical conditions. The nonlinear characteristics of SRSs can be examined with chaos theory methods which are widely used to evaluate the biomedical signals and systems, recently. The aim of this study is to classify the SRSs as snore/breathing/silence by using the largest Lyapunov exponent (LLE) and entropy with multiclass support vector machines (SVMs) and adaptive network fuzzy inference system (ANFIS). Two different experiments were performed for different training and test data sets. Experimental results show that the multiclass SVMs can produce the better classification results than ANFIS with used nonlinear quantities. Additionally, these nonlinear features are carrying meaningful information for classifying SRSs and are able to be used for diagnosis of sleep disorders such as SAHS.
Collapse
Affiliation(s)
- Haydar Ankışhan
- Department of Biomedical Equipment Technology, Vocational School of Technology, Başkent University, 06810 Ankara, Turkey
| | - Derya Yılmaz
- Department of Electrical and Electronic Engineering, Faculty of Engineering, Başkent University, 06810 Ankara, Turkey
| |
Collapse
|
26
|
Yu X, Zhang Y. Ray chaos in an architectural acoustic semi-stadium system. CHAOS (WOODBURY, N.Y.) 2013; 23:013107. [PMID: 23556944 DOI: 10.1063/1.4772969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
The semi-stadium system is composed of a semicircular cap and a rectilinear platform. In this study, a dynamic model of the side, position, and angle variables is applied to investigate the acoustic ray chaos of the architectural semi-stadium system. The Lyapunov exponent is calculated in order to quantitatively describe ray instability. The model can be reduced to the semi-circular and rectilinear platform systems when the rectilinear length is sufficiently small and large. The quasi-rectilinear platform and the semicircular systems both produce regular trajectories with the maximal Lyapunov exponent approaching zero. Ray localizations, such as flutter-echo and sound focusing, are found in these two systems. However, the semi-stadium system produces chaotic ray behaviors with positive Lyapunov exponents and reduces ray localizations. Furthermore, as the rectilinear length increases, the scaling laws of the Lyapunov exponent of the semi-stadium system are revealed and compared with those of the stadium system. The results suggest the potential application of the proposed model to simulate chaotic dynamics of acoustic ray in architectural enclosed systems.
Collapse
Affiliation(s)
- Xiaojian Yu
- Department of Architecture, Xiamen University Tan Kah Kee College, Zhangzhou, Fujian 363105, China
| | | |
Collapse
|
27
|
Comparative Analysis of the Closed Quotient for Lip and Tongue Trills in Relation to the Sustained Vowel /ε/. J Voice 2012; 26:e17-22. [DOI: 10.1016/j.jvoice.2010.07.004] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2010] [Accepted: 07/09/2010] [Indexed: 11/22/2022]
|
28
|
Erath BD, Zañartu M, Peterson SD, Plesniak MW. Nonlinear vocal fold dynamics resulting from asymmetric fluid loading on a two-mass model of speech. CHAOS (WOODBURY, N.Y.) 2011; 21:033113. [PMID: 21974648 DOI: 10.1063/1.3615726] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Nonlinear vocal fold dynamics arising from asymmetric flow formations within the glottis are investigated using a two-mass model of speech with asymmetric vocal fold tensioning, representative of unilateral vocal fold paralysis. A refined theoretical boundary-layer flow solver is implemented to compute the intraglottal pressures, providing a more realistic description of the flow than the standard one-dimensional, inviscid Bernoulli flow solution. Vocal fold dynamics are investigated for subglottal pressures of 0.6 < p(s) < 1.5 kPa and tension asymmetries of 0.5 < Q < 0.8. As tension asymmetries become pronounced the asymmetric flow incites nonlinear behavior in the vocal fold dynamics at subglottal pressures that are associated with normal speech, behavior that is not captured with standard Bernoulli flow solvers. Regions of bifurcation, coexistence of solutions, and chaos are identified.
Collapse
Affiliation(s)
- Byron D Erath
- Department of Mechanical and Aerospace Engineering, The George Washington University, Washington DC 20052, USA.
| | | | | | | |
Collapse
|
29
|
Yang A, Stingl M, Berry DA, Lohscheller J, Voigt D, Eysholdt U, Dollinger M. Computation of physiological human vocal fold parameters by mathematical optimization of a biomechanical model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:948-64. [PMID: 21877808 PMCID: PMC3195891 DOI: 10.1121/1.3605551] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
With the use of an endoscopic, high-speed camera, vocal fold dynamics may be observed clinically during phonation. However, observation and subjective judgment alone may be insufficient for clinical diagnosis and documentation of improved vocal function, especially when the laryngeal disease lacks any clear morphological presentation. In this study, biomechanical parameters of the vocal folds are computed by adjusting the corresponding parameters of a three-dimensional model until the dynamics of both systems are similar. First, a mathematical optimization method is presented. Next, model parameters (such as pressure, tension and masses) are adjusted to reproduce vocal fold dynamics, and the deduced parameters are physiologically interpreted. Various combinations of global and local optimization techniques are attempted. Evaluation of the optimization procedure is performed using 50 synthetically generated data sets. The results show sufficient reliability, including 0.07 normalized error, 96% correlation, and 91% accuracy. The technique is also demonstrated on data from human hemilarynx experiments, in which a low normalized error (0.16) and high correlation (84%) values were achieved. In the future, this technique may be applied to clinical high-speed images, yielding objective measures with which to document improved vocal function of patients with voice disorders.
Collapse
Affiliation(s)
- Anxiong Yang
- Department of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Medical School, Erlangen, Germany.
| | | | | | | | | | | | | |
Collapse
|
30
|
Tao C, Regner MF, Zhang Y, Jiang JJ. Experimental and Theoretical Investigations of Phonation Threshold Pressure as a Function of Vocal Fold Elongation. ACTA ACUST UNITED AC 2011; 97:669-677. [PMID: 25530744 DOI: 10.3813/aaa.918446] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The relationship between the vocal fold elongation and the phonation threshold pressure (PTP) was experimentally and theoretically investigated. The PTP values of seventeen excised canine larynges with 0% to 15% bilateral vocal fold elongations in 5% elongation steps were measured using an excised larynx phonation system. It was found that twelve larynges exhibited a monotonic relationship between PTP and elongation; in these larynges, the 0% elongation condition had the lowest PTP. Five larynges exhibited a PTP minimum at 5% elongation. To provide a theoretical explanation of these phenomena, a two-mass model was modified to simulate vibration of the elongated vocal folds. Two pairs of longitudinal springs were used to represent the longitudinal elastin in the vocal folds. This model showed that when the vocal folds were elongated, the increased longitudinal tension would increase the PTP value and the increased vocal fold length would decrease the PTP value. The antagonistic effects contributed by these two factors were found to be able to cause either a monotonic or a non-monotonic relationship between PTP and elongation, which were consistent with experimental observations. Because PTP describes the ease of phonation, this study suggests that there may exist a nonzero optimal vocal fold elongation for the greatest ease for phonation in some larynges.
Collapse
Affiliation(s)
- Chao Tao
- Department of Surgery, Division of Otolaryngology - Head and Neck Surgery, University of Wisconsin, School of Medicine and Public Health, Madison, WI53792-7375, USA ; Key Laboratory of Modern Acoustics, Nanjing University, Nanjing 210093, China
| | - Michael F Regner
- Key Laboratory of Modern Acoustics, Nanjing University, Nanjing 210093, China
| | - Yu Zhang
- Key Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, Xiamen University, Xiamen Fujian 361005, China
| | - Jack J Jiang
- Department of Surgery, Division of Otolaryngology - Head and Neck Surgery, University of Wisconsin, School of Medicine and Public Health, Madison, WI53792-7375, USA
| |
Collapse
|
31
|
Mittal R, Zheng X, Bhardwaj R, Seo JH, Xue Q, Bielamowicz S. Toward a simulation-based tool for the treatment of vocal fold paralysis. Front Physiol 2011; 2:19. [PMID: 21556320 PMCID: PMC3089877 DOI: 10.3389/fphys.2011.00019] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Accepted: 04/13/2011] [Indexed: 11/13/2022] Open
Abstract
Advances in high-performance computing are enabling a new generation of software tools that employ computational modeling for surgical planning. Surgical management of laryngeal paralysis is one area where such computational tools could have a significant impact. The current paper describes a comprehensive effort to develop a software tool for planning medialization laryngoplasty where a prosthetic implant is inserted into the larynx in order to medialize the paralyzed vocal fold (VF). While this is one of the most common procedures used to restore voice in patients with VF paralysis, it has a relatively high revision rate, and the tool being developed is expected to improve surgical outcomes. This software tool models the biomechanics of airflow-induced vibration in the human larynx and incorporates sophisticated approaches for modeling the turbulent laryngeal flow, the complex dynamics of the VFs, as well as the production of voiced sound. The current paper describes the key elements of the modeling approach, presents computational results that demonstrate the utility of the approach and also describes some of the limitations and challenges.
Collapse
Affiliation(s)
- Rajat Mittal
- Department of Mechanical Engineering, Johns Hopkins UniversityBaltimore, MD, USA
- Institute for Computational Medicine, Johns Hopkins UniversityBaltimore, MD, USA
| | - Xudong Zheng
- Department of Mechanical Engineering, Johns Hopkins UniversityBaltimore, MD, USA
- Institute for Computational Medicine, Johns Hopkins UniversityBaltimore, MD, USA
| | - Rajneesh Bhardwaj
- Department of Mechanical Engineering, Johns Hopkins UniversityBaltimore, MD, USA
| | - Jung Hee Seo
- Department of Mechanical Engineering, Johns Hopkins UniversityBaltimore, MD, USA
| | - Qian Xue
- Department of Mechanical Engineering, Johns Hopkins UniversityBaltimore, MD, USA
| | - Steven Bielamowicz
- Division of Otolaryngology, George Washington UniversityWashington, DC, USA
| |
Collapse
|
32
|
Zañartu M, Mehta DD, Ho JC, Wodicka GR, Hillman RE. Observation and analysis of in vivo vocal fold tissue instabilities produced by nonlinear source-filter coupling: a case study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:326-39. [PMID: 21303014 PMCID: PMC3055289 DOI: 10.1121/1.3514536] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Different source-related factors can lead to vocal fold instabilities and bifurcations referred to as voice breaks. Nonlinear coupling in phonation suggests that changes in acoustic loading can also be responsible for this unstable behavior. However, no in vivo visualization of tissue motion during these acoustically induced instabilities has been reported. Simultaneous recordings of laryngeal high-speed videoendoscopy, acoustics, aerodynamics, electroglottography, and neck skin acceleration are obtained from a participant consistently exhibiting voice breaks during pitch glide maneuvers. Results suggest that acoustically induced and source-induced instabilities can be distinguished at the tissue level. Differences in vibratory patterns are described through kymography and phonovibrography; measures of glottal area, open/speed quotient, and amplitude/phase asymmetry; and empirical orthogonal function decomposition. Acoustically induced tissue instabilities appear abruptly and exhibit irregular vocal fold motion after the bifurcation point, whereas source-induced ones show a smoother transition. These observations are also reflected in the acoustic and acceleration signals. Added aperiodicity is observed after the acoustically induced break, and harmonic changes appear prior to the bifurcation for the source-induced break. Both types of breaks appear to be subcritical bifurcations due to the presence of hysteresis and amplitude changes after the frequency jumps. These results are consistent with previous studies and the nonlinear source-filter coupling theory.
Collapse
Affiliation(s)
- Matías Zañartu
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, Indiana 47907, USA.
| | | | | | | | | |
Collapse
|
33
|
Zhang Y, Regner MF, Jiang JJ. Theoretical modeling and experimental high-speed imaging of elongated vocal folds. IEEE Trans Biomed Eng 2010; 58:2725-31. [PMID: 21118763 DOI: 10.1109/tbme.2010.2095012] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
In this paper, the role of vocal fold elongation in governing glottal movement dynamics was theoretically and experimentally investigated. A theoretical model was first proposed to incorporate vocal fold elongation into the two-mass model. This model predicted the direct and nondirect components of the glottal time series as a function of vocal fold elongation. Furthermore, high-speed digital imaging was applied in excised larynx experiments to visualize vocal fold vibrations with variable vocal fold elongation from -10% to 50% and subglottal pressures of 18- and 24-cm H(2)O. Comparison between theoretical model simulations and experimental observations showed good agreement. A relative maximum was seen in the nondirect component of glottal area, suggesting that an optimal elongation could maximize the vocal fold vibratory power. However, sufficiently large vocal fold elongations caused the nondirect component to approach zero and the direct component to approach a constant. These results showed that vocal fold elongation plays an important role in governing the dynamics of glottal area movement and validated the applicability of the proposed theoretical model and high-speed imaging to investigate laryngeal activity.
Collapse
Affiliation(s)
- Yu Zhang
- Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, College of Oceanography and Environmental Science, Xiamen University, Xiamen 361005, China.
| | | | | |
Collapse
|
34
|
Hanson KP, Zhang Y, Jiang JJ. Ex vivo canine vocal fold lamina propria rehydration after varying dehydration levels. J Voice 2010; 25:657-62. [PMID: 20951551 DOI: 10.1016/j.jvoice.2010.06.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2010] [Accepted: 06/10/2010] [Indexed: 10/18/2022]
Abstract
OBJECTIVES To examine the recoverability of canine vocal fold (VF) lamina propria (LP) on rehydration from varying dehydration levels. STUDY DESIGN Open, controlled experimental trial. METHODS The VF LP was excised en bloc using a scalpel from 10 canine larynges, providing 20 tissue samples. The initial volume of each sample was measured. Ten samples were dehydrated to 30% by mass and the other 10 samples to 70%. Each sample was rehydrated in 0.9% saline until the mass stabilized. The liquid mass and volume fractions, liquid:solid mass and volume ratios, and the fractions of the original tissue masses and volumes were calculated. RESULTS All calculated parameters were significantly different between 30% and 70% dehydration recovery, with all parameters lesser in the 70% dehydration treatment group. Half of the tissue samples subjected to 30% dehydration fully recovered to their original volumes, whereas only one of the 10 samples subjected to 70% dehydration fully recovered its volume. CONCLUSIONS The level of attainable rehydration recovery of VF LP tissue in an ex vivo setting depends on the level of dehydration. The results correspond to the biphasic theory and may be used to help model the biomechanical and physiological properties of VF LP tissue during rehydration.
Collapse
Affiliation(s)
- Kevin P Hanson
- Department of Surgery, Division of Otolaryngology-Head and Neck Surgery, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, USA
| | | | | |
Collapse
|
35
|
Riede T, Goller F. Peripheral mechanisms for vocal production in birds - differences and similarities to human speech and singing. BRAIN AND LANGUAGE 2010; 115:69-80. [PMID: 20153887 PMCID: PMC2896990 DOI: 10.1016/j.bandl.2009.11.003] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2009] [Accepted: 11/03/2009] [Indexed: 05/28/2023]
Abstract
Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently controlled sound sources, which reside in a uniquely avian vocal organ, the syrinx. However, the physical sound generation mechanism in the syrinx shows strong analogies to that in the human larynx, such that both can be characterized as myoelastic-aerodynamic sound sources. Similarities include active adduction and abduction, oscillating tissue masses which modulate flow rate through the organ and a layered structure of the oscillating tissue masses giving rise to complex viscoelastic properties. Differences in the functional morphology of the sound producing system between birds and humans require specific motor control patterns. The songbird vocal apparatus is adapted for high speed, suggesting that temporal patterns and fast modulation of sound features are important in acoustic communication. Rapid respiratory patterns determine the coarse temporal structure of song and maintain gas exchange even during very long songs. The respiratory system also contributes to the fine control of airflow. Muscular control of the vocal organ regulates airflow and acoustic features. The upper vocal tract of birds filters the sounds generated in the syrinx, and filter properties are actively adjusted. Nonlinear source-filter interactions may also play a role. The unique morphology and biomechanical system for sound production in birds presents an interesting model for exploring parallels in control mechanisms that give rise to highly convergent physical patterns of sound generation. More comparative work should provide a rich source for our understanding of the evolution of complex sound producing systems.
Collapse
Affiliation(s)
- Tobias Riede
- Department of Biology and National Center for Voice and Speech, University of Utah, Salt Lake City, 84112, USA
| | | |
Collapse
|
36
|
Hanson KP, Zhang Y, Jiang JJ. Parameters quantifying dehydration in canine vocal fold lamina propria. Laryngoscope 2010; 120:1363-9. [PMID: 20564654 DOI: 10.1002/lary.20927] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
OBJECTIVES/HYPOTHESIS The goal of this study was to measure the solid and liquid volume and mass of canine vocal fold lamina propria tissue at varying dehydration levels, and to calculate parameters to test the biphasic theory of vocal fold physiology and biomechanics. STUDY DESIGN Open, controlled, experimental trial. METHODS The vocal fold lamina propria was dissected from 15 canine larynges, yielding 30 tissue samples. The initial volumes and masses of the tissue samples were measured. The masses of the tissue samples were then measured every 2 minutes during 30%, 50%, and 70% dehydration, with 10 samples subjected to each of the three treatments, followed by complete dehydration to yield the solid component of the tissue. The liquid mass and volume fractions and liquid:solid mass and volume ratios of the vocal fold lamina propria samples were calculated. RESULTS The liquid mass and volume fractions and liquid:solid mass and volume ratios were significantly different at each dehydration level, except for the liquid:solid volume ratios at 30% versus 50% dehydration. Linear regression analysis suggested that all of the solid and liquid parameters measured could be predicted by dehydration level based on inverse, linear relationships. CONCLUSIONS These results provide further experimental evidence supporting the biphasic theory and suggest that the extent of vocal fold lamina propria tissue dehydration may be quantified based on the biphasic model parameters.
Collapse
Affiliation(s)
- Kevin P Hanson
- Department of Surgery, Division of Otolaryngology-Head and Neck Surgery, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, USA
| | | | | |
Collapse
|
37
|
Xue Q, Mittal R, Zheng X, Bielamowicz S. A computational study of the effect of vocal-fold asymmetry on phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:818-27. [PMID: 20707451 PMCID: PMC2933257 DOI: 10.1121/1.3458839] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Unilateral laryngeal paralysis leads to tension imbalance and hence to asynchronous movements between the two vocal folds during phonation. In the current study, a computational model of phonation that couples a two-mass model of the vocal folds with a Navier-Stokes model of the glottal airflow, has been used to examine the dynamics of vocal fold configurations with tension imbalance and its implications for phonation. The simulations show that tension imbalance influences phonation onset, intensity as well as the fundamental phonation frequency. Distinct non-linear effects such as period-doubling bifurcation and preferential frequency selection are also observed.
Collapse
Affiliation(s)
- Q Xue
- Department of Mechanical Engineering, Johns Hopkins University, 126 Latrobe Hall, 3400 North Charles Street, Baltimore, Maryland 21218, USA.
| | | | | | | |
Collapse
|
38
|
Kimura M, Imagawa H, Nito T, Sakakibara KI, Chan RW, Tayama N. Arytenoid Adduction for Correcting Vocal Fold Asymmetry: High-Speed Imaging. Ann Otol Rhinol Laryngol 2010; 119:439-46. [DOI: 10.1177/000348941011900703] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Objectives We hypothesized that high-speed digital imaging provides a quantitative method to evaluate the effect of arytenoid adduction for the correction of asymmetric and irregular vocal fold vibration in unilateral vocal fold paralysis. Methods Six subjects with unilateral vocal fold paralysis participated in the study (4 male, 2 female; mean [±SD] age, 52.5 ± 21.3 years). Videokymographic and laryngotopographic methods for image analysis were performed for highspeed recordings of vocal fold vibration for visualizing the glottal vibratory patterns, and for quantifying the frequency of vibration of each vocal fold, respectively. Comparisons of the paralyzed and the normal vocal folds were made before and after arytenoid adduction. Results Analysis of the laryngotopographs revealed 2 distinct frequencies of vibration for the paralyzed and the contralateral vocal folds for all subjects before surgery. After arytenoid adduction, the vibration frequencies became identical or nearly identical in all subjects. Conclusions Asymmetric vibration in vocal fold paralysis was exemplified by differences in vibration frequency between the vocal folds. The present data showed that after arytenoid adduction the vibration frequencies and the vibratory patterns of the contralateral vocal folds approached symmetry. This surgical procedure could improve the functional symmetry of the larynx for phonation.
Collapse
Affiliation(s)
- Miwako Kimura
- Departments of Otolaryngology–Head and Neck Surgery, Tokyo, Japan
- University of Texas Southwestern Medical Center, Dallas, Texas, Department of Otolaryngology, International Medical Center of Japan, Tokyo, Japan
| | - Hiroshi Imagawa
- Department of Otorhinolaryngology–Head and Neck Surgery, University of Tokyo, Tokyo, Japan
| | - Takaharu Nito
- Department of Otorhinolaryngology–Head and Neck Surgery, University of Tokyo, Tokyo, Japan
| | - Ken-Ichi Sakakibara
- Department of Communication Disorders, Health Sciences University of Hokkaido, Hokkaido, Japan
| | - Roger W. Chan
- Departments of Otolaryngology–Head and Neck Surgery, Tokyo, Japan
- Biomedical Engineering, Tokyo, Japan
| | - Niro Tayama
- University of Texas Southwestern Medical Center, Dallas, Texas, Department of Otolaryngology, International Medical Center of Japan, Tokyo, Japan
- Department of Otorhinolaryngology–Head and Neck Surgery, University of Tokyo, Tokyo, Japan
| |
Collapse
|
39
|
Chai L, Sprecher AJ, Zhang Y, Liang Y, Chen H, Jiang JJ. Perturbation and nonlinear dynamic analysis of adult male smokers. J Voice 2010; 25:342-7. [PMID: 20472394 DOI: 10.1016/j.jvoice.2010.01.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2009] [Accepted: 01/11/2010] [Indexed: 11/16/2022]
Abstract
OBJECTIVE Smoking results in a voice change, and the perception by smokers of an abnormal voice may encourage quitting behavior. Moreover, a disordered voice is often the first sign of vocal pathology. Efforts to evaluate voice have focused on classical acoustic analysis; however, nonlinear dynamic analysis has been shown to be a reliable objective method for the evaluation of voice. We compare the discriminatory ability of these two methods when applied to normal and smokers' voices. STUDY DESIGN Prospective study. METHODS The study included 73 subjects, 36 nonsmokers and 37 smokers. A segment of sustained vowel production was obtained from each subject. Acoustic dimension and correlation dimension (D2) analyses were applied to the data. Results were compared with a Mann-Whitney rank sum test, logistic regression, and receiver operating characteristics (ROC) analysis. RESULTS D2 values for smokers were significantly higher than D2 values for nonsmokers (P<0.001). Jitter and shimmer analysis showed higher values for these parameters among smokers. Logistic regression indicated a higher predictive power with D2, and ROC analysis found no significant difference between the analysis methods. DISCUSSION This study indicated that D2 is highly sensitive to changes associated with smoking and has the potential to be implemented clinically as an indicator of abnormal voice. Further research could focus on using nonlinear dynamic analysis to create a normative database, producing standards for monitoring voice changes caused by cigarette smoking.
Collapse
Affiliation(s)
- Lingying Chai
- Shanghai EENT Hospital of Fudan University, Shanghai, China
| | | | | | | | | | | |
Collapse
|
40
|
Little MA, Costello DAE, Harries ML. Objective dysphonia quantification in vocal fold paralysis: comparing nonlinear with classical measures. J Voice 2009; 25:21-31. [PMID: 19900790 DOI: 10.1016/j.jvoice.2009.04.004] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2009] [Accepted: 04/20/2009] [Indexed: 11/30/2022]
Abstract
Clinical acoustic voice-recording analysis is usually performed using classical perturbation measures, including jitter, shimmer, and noise-to-harmonic ratios (NHRs). However, restrictive mathematical limitations of these measures prevent analysis for severely dysphonic voices. Previous studies of alternative nonlinear random measures addressed wide varieties of vocal pathologies. Here, we analyze a single vocal pathology cohort, testing the performance of these alternative measures alongside classical measures. We present voice analysis pre- and postoperatively in 17 patients with unilateral vocal fold paralysis (UVFP). The patients underwent standard medialization thyroplasty surgery, and the voices were analyzed using jitter, shimmer, NHR, nonlinear recurrence period density entropy (RPDE), detrended fluctuation analysis (DFA), and correlation dimension. In addition, we similarly analyzed 11 healthy controls. Systematizing the preanalysis editing of the recordings, we found that the novel measures were more stable and, hence, reliable than the classical measures on healthy controls. RPDE and jitter are sensitive to improvements pre- to postoperation. Shimmer, NHR, and DFA showed no significant change (P>0.05). All measures detect statistically significant and clinically important differences between controls and patients, both treated and untreated (P<0.001, area under curve [AUC]>0.7). Pre- to postoperation grade, roughness, breathiness, asthenia, and strain (GRBAS) ratings show statistically significant and clinically important improvement in overall dysphonia grade (G) (AUC=0.946, P<0.001). Recalculating AUCs from other study data, we compare these results in terms of clinical importance. We conclude that, when preanalysis editing is systematized, nonlinear random measures may be useful for monitoring UVFP-treatment effectiveness, and there may be applications to other forms of dysphonia.
Collapse
Affiliation(s)
- Max A Little
- Systems Analysis, Modeling and Prediction Group, University of Oxford, Oxford, United Kingdom
| | | | | |
Collapse
|
41
|
Zhang Y, Krausert CR, Kelly MP, Jiang JJ. Typing vocal fold vibratory patterns in excised larynx experiments via digital kymography. Ann Otol Rhinol Laryngol 2009; 118:598-605. [PMID: 19746760 DOI: 10.1177/000348940911800812] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
OBJECTIVES Signal typing is central to the understanding of vocal fold vibratory patterns. Digital kymography (DKG) allows the direct observation of vocal fold vibratory patterns, and therefore, using DKG for vibratory signal typing may provide a useful complement to traditional signal typing techniques. METHODS Video data collected from 20 larynges excised from mongrel dogs were observed with DKG in order to find examples of type 1 (nearly periodic), type 2 (subharmonic), and type 3 (aperiodic) vibratory patterns. The time series, frequency spectra, and correlation dimensions were calculated for each signal type. RESULTS The type 1 pattern showed a periodic time series of glottal edges and a discrete frequency spectrum. The type 2 vibratory pattern displayed a time series of alternating high- and low-amplitude waves and a frequency spectrum that included a subharmonic (F0/2) frequency component. Regular and symmetric vibratory patterns were observed in the type 1 and type 2 patterns. The type 3 vibratory pattern was characterized by an aperiodic time series of glottal edges, a broadband frequency spectrum, and irregular and asymmetric vibratory patterns. The correlation dimension estimates increased from type 1 to type 2 to type 3. CONCLUSIONS Imaging with DKG demonstrated an ability to assign a signal type to various laryngeal vibrations. Signal typing techniques utilizing direct observation of the vocal folds could be useful in determining valid methods for the analysis of vocal fold vibrations.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Surgery, Division of Otolaryngology-Head and Neck Surgery, University of Wisconsin-Madison School of Medicine and Public Health, Madison, Wisconsin, USA
| | | | | | | |
Collapse
|
42
|
Liquid accumulation in vibrating vocal fold tissue: a simplified model based on a fluid-saturated porous solid theory. J Voice 2009; 24:260-9. [PMID: 19660905 DOI: 10.1016/j.jvoice.2008.09.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2008] [Accepted: 09/22/2008] [Indexed: 11/20/2022]
Abstract
The human vocal fold is treated as a continuous, transversally isotropic, porous solid saturated with liquid. A set of mathematical equations, based on the theory of fluid-saturated porous solids, is developed to formulate the vibration of the vocal fold tissue. As the fluid-saturated porous tissue model degenerates to the continuous elastic tissue model when the relative movement of liquid in the porous tissue is ignored, it can be considered a more general description of vocal fold tissue than the continuous, elastic model. Using the fluid-saturated porous tissue model, the vibration of a bunch of one-dimensional fibers in the vocal fold is analytically solved based on the small-amplitude assumption. It is found that the vibration of the tissue will lead to the accumulation of excess liquid in the midmembranous vocal fold. The degree of liquid accumulation is positively proportional to the vibratory amplitude and frequency. The correspondence between the liquid distribution predicted by the porous tissue theory and the location of vocal nodules observed in clinical practice, provides theoretical evidence for the liquid accumulation hypothesis of vocal nodule formation (Jiang, Ph.D., dissertation, 1991, University of Iowa).
Collapse
|
43
|
Henriquez P, Alonso JB, Ferrer MA, Travieso CM, Godino-Llorente JI, Diaz-de-Maria F. Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics. ACTA ACUST UNITED AC 2009. [DOI: 10.1109/tasl.2009.2016734] [Citation(s) in RCA: 99] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
44
|
Welham NV, Montequin DW, Tateya I, Tateya T, Choi SH, Bless DM. A rat excised larynx model of vocal fold scar. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2009; 52:1008-20. [PMID: 19641079 PMCID: PMC2719832 DOI: 10.1044/1092-4388(2009/08-0049)] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
PURPOSE To develop and evaluate a rat excised larynx model for the measurement of acoustic, aerodynamic, and vocal fold vibratory changes resulting from vocal fold scar. METHOD Twenty-four 4-month-old male Sprague-Dawley rats were assigned to 1 of 4 experimental groups: chronic vocal fold scar, chronic vocal fold scar treated with 100-ng basic fibroblast growth factor (bFGF), chronic vocal fold scar treated with saline (sham treatment), and unscarred untreated control. Following tissue harvest, histological and immunohistochemical data were collected to confirm extracellular matrix alteration in the chronic scar group; acoustic, aerodynamic, and high-speed digital imaging data were collected using an excised larynx setup in all groups. Phonation threshold pressure (P(th)), glottal resistance (R(g)), glottal efficiency (E(g)), vibratory amplitude, and vibratory area were used as dependent variables. RESULTS Chronically scarred vocal folds were characterized by elevated collagen Types I and III and reduced hyaluronic acid abundance. Phonation was achieved, and data were collected from all control and bFGF-treated larynges; however, phonation was not achieved with 3 of 6 chronically scarred and 1 of 6 saline-treated larynges. Compared with control, the chronic scar group was characterized by elevated P(th), reduced E(g), and intralarynx vibratory amplitude and area asymmetry. The bFGF group was characterized by P(th) below control-group levels, E(g) comparable with control, and vocal fold vibratory amplitude and area symmetry comparable with control. The sham group was characterized by P(th) comparable with control, E(g) superior to control, and vocal fold vibratory amplitude and area symmetry comparable with control. CONCLUSIONS The excised larynx model reported here demonstrated robust deterioration across phonatory indices under the scar condition and sensitivity to treatment-induced change under the bFGF condition. The improvement observed under the sham condition may reflect unanticipated therapeutic benefit or artifact. This model holds promise as a tool for the functional characterization of biomechanical tissue changes resulting from vocal fold scar and the evaluation of experimental therapies.
Collapse
Affiliation(s)
- Nathan V Welham
- University of Wisconsin School of Medicine and Public Health, Madison, WI 53792, USA.
| | | | | | | | | | | |
Collapse
|
45
|
Luo H, Mittal R, Bielamowicz SA. Analysis of flow-structure interaction in the larynx during phonation using an immersed-boundary method. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 126:816-24. [PMID: 19640046 PMCID: PMC2730715 DOI: 10.1121/1.3158942] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
A recently developed immersed-boundary method is used to model the flow-structure interaction associated with the human phonation. The glottal airflow is modeled as a two-dimensional incompressible flow driven by a constant subglottal pressure, and the vocal folds are modeled as a pair of three-layered, two-dimensional, viscoelastic structures. Both the fluid dynamics and viscoelasticity are solved on fixed Cartesian grids using a sharp-interface immersed boundary method. It is found that the vibration mode and frequency of the vocal fold model are associated with the eigenmodes of the structures, and that the transition of the vibration mode takes place during onset of the sustained vibration. The computed glottal waveforms of the volume flux, velocity, and pressure are reasonably realistic. The glottal flow features an unsteady jet whose direction is deflected by the large-scale vortices in the supraglottal region. A detailed analysis of the flow and vocal fold vibrations is conducted in order to gain insights into the biomechanics of phonation.
Collapse
Affiliation(s)
- Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235, USA.
| | | | | |
Collapse
|
46
|
Tao C, Jiang JJ. Effects of mucosal loading on vocal fold vibration. CHAOS (WOODBURY, N.Y.) 2009; 19:023113. [PMID: 19566248 PMCID: PMC2832046 DOI: 10.1063/1.3120293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2008] [Accepted: 03/24/2009] [Indexed: 05/28/2023]
Abstract
A chain model was proposed in this study to examine the effects of mucosal loading on vocal fold vibration. Mucosal loading was defined as the loading caused by the interaction between the vocal folds and the surrounding tissue. In the proposed model, the vocal folds and the surrounding tissue were represented by a series of oscillators connected by a coupling spring. The lumped masses, springs, and dampers of the oscillators modeled the tissue properties of mass, stiffness, and viscosity, respectively. The coupling spring exemplified the tissue interactions. By numerically solving this chain model, the effects of mucosal loading on the phonation threshold pressure, phonation instability pressure, and energy distribution in a voice production system were studied. It was found that when mucosal loading is small, phonation threshold pressure increases with the damping constant R(r), the mass constant R(m), and the coupling constant R(mu) of mucosal loading but decreases with the stiffness constant R(k). Phonation instability pressure is also related to mucosal loading. It was found that phonation instability pressure increases with the coupling constant R(mu) but decreases with the stiffness constant R(k) of mucosal loading. Therefore, it was concluded that mucosal loading directly affects voice production.
Collapse
Affiliation(s)
- Chao Tao
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin 53792-7375, USA.
| | | |
Collapse
|
47
|
Phillips R, Zhang Y, Keuler M, Tao C, Jiang JJ. Measurement of liquid and solid component parameters in canine vocal fold lamina propria. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 125:2282-2287. [PMID: 19354403 PMCID: PMC2677265 DOI: 10.1121/1.3086276] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2008] [Revised: 01/29/2009] [Accepted: 01/30/2009] [Indexed: 05/27/2023]
Abstract
This study aimed to measure solid and liquid component parameters for canine vocal fold lamina propria tissue, as is consistent with the solid and liquid fraction parameters in the context of the biphasic theory. A liquid-displacement apparatus was developed and utilized to estimate volumes of small samples of tissue. Accuracy was determined by calibrations with an object of known mass and density (copper). The experimental apparatus was then used to determine the volume of eight tissue samples, followed by an apparently complete dehydration of the samples, yielding the dry or solid tissue. The mass and volume fractions of the liquid component were sufficiently higher than those of the solid component. These results represent preliminary experimental evidence for the biphasic composition (solid-liquid) of canine lamina propria tissue as predicted in the biphasic theory. This study presents an effective experimental method to estimate some of the biphasic model parameters, and may provide a valuable application in exploring the viscoelastic behaviors of vocal fold lamina propria tissue.
Collapse
Affiliation(s)
- Robert Phillips
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin-Medical School, Madison, Wisconsin 53792-7375, USA
| | | | | | | | | |
Collapse
|
48
|
Zheng X, Bielamowicz S, Luo H, Mittal R. A computational study of the effect of false vocal folds on glottal flow and vocal fold vibration during phonation. Ann Biomed Eng 2009; 37:625-42. [PMID: 19142730 DOI: 10.1007/s10439-008-9630-9] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2008] [Accepted: 12/22/2008] [Indexed: 11/27/2022]
Abstract
The false vocal folds are believed to be components of the acoustic filter that is responsible for shaping the voice. However, the effects of false vocal folds on the vocal fold vibration and the glottal aerodynamic during phonation remain unclear. This effect has implications for computational modeling of phonation as well as for understanding laryngeal pathologies such as glottal incompetence resulting from unilateral vocal fold paralysis. In this study, a high fidelity, two-dimensional computational model, which combines an immersed boundary method for the airflow and a continuum, finite-element method for the vocal folds, is used to examine the effect of the false vocal folds on flow-induced vibration (FIV) of the true vocal folds and the dynamics of the glottal jet. The model is notionally based on a laryngeal CT scan and employs realistic flow conditions and tissue properties. Results show that the false vocal folds potentially have a significant impact on phonation. The false vocal folds reduce the glottal flow impedance and increase the amplitude as well as the mean glottal jet velocity. The false vocal folds also enhance the intensity of the monopole acoustic sources in the glottis. A mechanism for reduction in flow impedance due to the false vocal folds is proposed.
Collapse
Affiliation(s)
- Xudong Zheng
- Department of Mechanical and Aerospace Engineering, The George Washington University, Suite T729, 801 22nd Street, Philips Hall, NW, Washington, D.C. 20052, USA
| | | | | | | |
Collapse
|
49
|
Zhang Y, Jiang JJ. Asymmetric spatiotemporal chaos induced by a polypoid mass in the excised larynx. CHAOS (WOODBURY, N.Y.) 2008; 18:043102. [PMID: 19123612 PMCID: PMC2688821 DOI: 10.1063/1.2988251] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2008] [Accepted: 08/23/2008] [Indexed: 05/24/2023]
Abstract
In this paper, asymmetric spatiotemporal chaos induced by a polypoid mass simulating the laryngeal pathology of a vocal polyp is experimentally observed using high-speed imaging in an excised larynx. Spatiotemporal analysis reveals that the normal vocal folds show spatiotemporal correlation and symmetry. Normal vocal fold vibrations are dominated mainly by the first vibratory eigenmode. However, pathological vocal folds with a polypoid mass show broken symmetry and spatiotemporal irregularity. The spatial correlation is decreased. The pathological vocal folds spread vibratory energy across a large number of eigenmodes and induce asymmetric spatiotemporal chaos. High-order eigenmodes show complicated dynamics. Spatiotemporal analysis provides a valuable biomedical application for investigating the spatiotemporal chaotic dynamics of pathological vocal fold systems with a polypoid mass and may represent a valuable clinical tool for the detection of laryngeal mass lesion using high-speed imaging.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin 53792-7375, USA.
| | | |
Collapse
|
50
|
Luo H, Mittal R, Zheng X, Bielamowicz SA, Walsh RJ, Hahn JK. An immersed-boundary method for flow-structure interaction in biological systems with application to phonation. JOURNAL OF COMPUTATIONAL PHYSICS 2008; 227:9303-9332. [PMID: 19936017 PMCID: PMC2701221 DOI: 10.1016/j.jcp.2008.05.001] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
A new numerical approach for modeling a class of flow-structure interaction problems typically encountered in biological systems is presented. In this approach, a previously developed, sharp-interface, immersed-boundary method for incompressible flows is used to model the fluid flow and a new, sharp-interface Cartesian grid, immersed boundary method is devised to solve the equations of linear viscoelasticity that governs the solid. The two solvers are coupled to model flow-structure interaction. This coupled solver has the advantage of simple grid generation and efficient computation on simple, single-block structured grids. The accuracy of the solid-mechanics solver is examined by applying it to a canonical problem. The solution methodology is then applied to the problem of laryngeal aerodynamics and vocal fold vibration during human phonation. This includes a three-dimensional eigen analysis for a multi-layered vocal fold prototype as well as two-dimensional, flow-induced vocal fold vibration in a modeled larynx. Several salient features of the aerodynamics as well as vocal-fold dynamics are presented.
Collapse
Affiliation(s)
- Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Pl., Nashville, TN 37235-1592
| | - Rajat Mittal
- Department of Mechanical and Aerospace Engineering, George Washington University, Washington, DC 20052
| | - Xudong Zheng
- Department of Mechanical and Aerospace Engineering, George Washington University, Washington, DC 20052
| | | | - Raymond J. Walsh
- Department of Anatomy and Cell Biology, George Washington University, Washington, DC 20052
| | - James K. Hahn
- Department of Computer Science, George Washington University, Washington, DC 20052
| |
Collapse
|