1
|
Leppävuori M, Lammentausta E, Peuna A, Bode MK, Jokelainen J, Ojala J, Nieminen MT. Characterizing Vocal Tract Dimensions in the Vocal Modes Using Magnetic Resonance Imaging. J Voice 2021; 35:804.e27-804.e42. [DOI: 10.1016/j.jvoice.2020.01.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 01/15/2020] [Accepted: 01/16/2020] [Indexed: 11/25/2022]
|
2
|
Saldías M, Laukkanen AM, Guzmán M, Miranda G, Stoney J, Alku P, Sundberg J. The Vocal Tract in Loud Twang-Like Singing While Producing High and Low Pitches. J Voice 2021; 35:807.e1-807.e23. [DOI: 10.1016/j.jvoice.2020.02.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 02/04/2020] [Accepted: 02/06/2020] [Indexed: 11/25/2022]
|
3
|
Palaparthi A, Titze IR. Analysis of Glottal Inverse Filtering in the Presence of Source-Filter Interaction. SPEECH COMMUNICATION 2020; 123:98-108. [PMID: 32921855 PMCID: PMC7485927 DOI: 10.1016/j.specom.2020.07.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The validity of glottal inverse filtering (GIF) to obtain a glottal flow waveform from radiated pressure signal in the presence and absence of source-filter interaction was studied systematically. A driven vocal fold surface model of vocal fold vibration was used to generate source signals. A one-dimensional wave reflection algorithm was used to solve for acoustic pressures in the vocal tract. Several test signals were generated with and without source-filter interaction at various fundamental frequencies and vowels. Linear Predictive Coding (LPC), Quasi Closed Phase (QCP), and Quadratic Programming (QPR) based algorithms, along with supraglottal impulse response, were used to inverse filter the radiated pressure signals to obtain the glottal flow pulses. The accuracy of each algorithm was tested for its recovery of maximum flow declination rate (MFDR), peak glottal flow, open phase ripple factor, closed phase ripple factor, and mean squared error. The algorithms were also tested for their absolute relative errors of the Normalized Amplitude Quotient, the Quasi-Open Quotient, and the Harmonic Richness Factor. The results indicated that the mean squared error decreased with increase in source-filter interaction level suggesting that the inverse filtering algorithms perform better in the presence of source-filter interaction. All glottal inverse filtering algorithms predicted the open phase ripple factor better than the closed phase ripple factor of a glottal flow waveform, irrespective of the source-filter interaction level. Major prediction errors occurred in the estimation of the closed phase ripple factor, MFDR, peak glottal flow, normalized amplitude quotient, and Quasi-Open Quotient. Feedback-related nonlinearity (source-filter interaction) affected the recovered signal primarily when f o was well below the first formant frequency of a vowel. The prediction error increased when f o was close to the first formant frequency due to the difficulty of estimating the precise value of resonance frequencies, which was exacerbated by nonlinear kinetic losses in the vocal tract.
Collapse
Affiliation(s)
- Anil Palaparthi
- National Center for Voice and Speech, The University of Utah, Salt Lake City, UT 84112, USA
- Department of Biomedical Engineering, The University of Utah, Salt Lake City, UT 84112, USA
| | - Ingo R. Titze
- National Center for Voice and Speech, The University of Utah, Salt Lake City, UT 84112, USA
- Department of Biomedical Engineering, The University of Utah, Salt Lake City, UT 84112, USA
| |
Collapse
|
4
|
Traser L, Birkholz P, Flügge TV, Kamberger R, Burdumy M, Richter B, Korvink JG, Echternach M. Relevance of the Implementation of Teeth in Three-Dimensional Vocal Tract Models. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:2379-2393. [PMID: 28898358 DOI: 10.1044/2017_jslhr-s-16-0395] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Accepted: 02/23/2017] [Indexed: 06/07/2023]
Abstract
PURPOSE Recently, efforts have been made to investigate the vocal tract using magnetic resonance imaging (MRI). Due to technical limitations, teeth were omitted in many previous studies on vocal tract acoustics. However, the knowledge of how teeth influence vocal tract acoustics might be important in order to estimate the necessity of implementing teeth in vocal tract models. The aim of this study was therefore to estimate the effect of teeth on vocal tract acoustics. METHOD The acoustic properties of 18 solid (3-dimensional printed) vocal tract models without teeth were compared to the same 18 models including teeth in terms of resonance frequencies (fRn). The fRn were obtained from the transfer functions of these models excited by white noise at the glottis level. The models were derived from MRI data of 2 trained singers performing 3 different vowel conditions (/i/, /a/, and /u/) in speech and low-pitched and high-pitched singing. RESULTS Depending on the oral configuration, models exhibiting side cavities or side branches were characterized by major changes in the transfer function when teeth were implemented via the introduction of pole-zero pairs. CONCLUSIONS To avoid errors in modeling, teeth should be included in 3-dimensional vocal tract models for acoustic evaluation. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.5386771.
Collapse
Affiliation(s)
- Louisa Traser
- Institute of Musicians' Medicine, Freiburg University Medical Center, Germany
- Department of Otolaryngology, Freiburg University Medical Center, Germany
- Faculty of Medicine, University of Freiburg, Germany
| | - Peter Birkholz
- Institute of Acoustics and Speech Communication, Technische Universität, Dresden, Germany
| | - Tabea Viktoria Flügge
- Faculty of Medicine, University of Freiburg, Germany
- Department of Craniomaxillofacial Surgery, Freiburg University Medical Center, Germany
| | - Robert Kamberger
- Laboratory of Simulation, Department of Microsystems Engineering-IMTEK, University of Freiburg, Germany
| | - Michael Burdumy
- Faculty of Medicine, University of Freiburg, Germany
- Department of Medical Physics, Radiology, Freiburg University Medical Center, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Germany
- Faculty of Medicine, University of Freiburg, Germany
| | - Jan Gerrit Korvink
- Institute of Microstructure Technology, Karlsruhe Institute of Technology, Eggenstein-Leopoldshafen, Germany
| | - Matthias Echternach
- Institute of Musicians' Medicine, Freiburg University Medical Center, Germany
- Faculty of Medicine, University of Freiburg, Germany
| |
Collapse
|
5
|
Bergeret-Cassagne H, Lazard DS, Lefort M, Hachi S, Leenhardt L, Menegaux F, Russ G, Trésallet C, Frouin F. Sonographic Dynamic Description of the Laryngeal Tract: Definition of Quantitative Measures to Characterize Vocal Fold Motion and Estimation of Their Normal Values. JOURNAL OF ULTRASOUND IN MEDICINE : OFFICIAL JOURNAL OF THE AMERICAN INSTITUTE OF ULTRASOUND IN MEDICINE 2017; 36:1037-1044. [PMID: 28072470 DOI: 10.7863/ultra.16.05014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Revised: 07/01/2016] [Indexed: 06/06/2023]
Abstract
Vocal fold motion was analyzed during free breathing using two-dimensional dynamic ultrasound imaging. Two cadavers were first analyzed to define easily identifiable landmarks. Motion of the laryngeal tract was then analyzed in an axial plane. Left and right arytenoids and thyroid cartilage were defined on images corresponding to abduction and adduction of the laryngeal tract. Associated area measurements were established for 50 healthy subjects. All area indices were significantly larger during abduction than adduction. Symmetry of motion was established by comparing each hemi-larynx, and mobility fractions were defined. Normal values of laryngeal motion during free breathing were thus established.
Collapse
Affiliation(s)
- Héloïse Bergeret-Cassagne
- Sorbonne Universités, UPMC Univ Paris 06, AP-HP, Hôpital Pitié-Salpêtrière, Department of General, Visceral and Endocrine Surgery, Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Inserm, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Diane S Lazard
- Sorbonne Universités, UPMC Univ Paris 06, AP-HP, Hôpital Pitié-Salpêtrière, Department of General, Visceral and Endocrine Surgery, Paris, France
- Institut Arthur Vernes, ENT Surgery, Paris, France
| | - Muriel Lefort
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Inserm, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Siham Hachi
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Inserm, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Laurence Leenhardt
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Inserm, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, AP-HP, Hôpital Pitié-Salpêtrière, Thyroid and Endocrine Tumors Department, Paris, France
| | - Fabrice Menegaux
- Sorbonne Universités, UPMC Univ Paris 06, AP-HP, Hôpital Pitié-Salpêtrière, Department of General, Visceral and Endocrine Surgery, Paris, France
| | - Gilles Russ
- Sorbonne Universités, UPMC Univ Paris 06, AP-HP, Hôpital Pitié-Salpêtrière, Thyroid and Endocrine Tumors Department, Paris, France
| | - Christophe Trésallet
- Sorbonne Universités, UPMC Univ Paris 06, AP-HP, Hôpital Pitié-Salpêtrière, Department of General, Visceral and Endocrine Surgery, Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Inserm, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Frédérique Frouin
- Inserm, CEA, Univ Paris-Sud, CNRS, Université Paris-Saclay, Laboratoire Imagerie Moléculaire In Vivo (IMIV), CEA/I2BM/Service Hospitalier Frédéric Joliot, Orsay, France
| |
Collapse
|
6
|
Macari AT, Ziade G, Turfe Z, Chidiac A, Alam E, Hamdan AL. Correlation Between the Position of the Hyoid Bone on Lateral Cephalographs and Formant Frequencies. J Voice 2015; 30:757.e21-757.e26. [PMID: 26604010 DOI: 10.1016/j.jvoice.2015.08.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 08/27/2015] [Indexed: 11/16/2022]
Abstract
OBJECTIVES The objective of this study is to examine the F1, F2, F3, and F4 during sustained vowels /ɑ/, /i/, /o/, /u/. STUDY DESIGN Prospective cross-sectional study. METHODS Fifty-two consecutive patients aged between 9 years and 38 years were invited to participate in this study. Linear measurements included linear vertical distance from the hyoid bone to the sella turcica (H-S); linear vertical distance from the hyoid bone to the posterior nasal spine (H-PNS); linear measure from the hyoid bone to the most anterior point of the cervical vertebra C3 (H-C3); and linear vertical distance from the hyoid bone to the mandibular plane (H-MP). RESULTS The results showed a moderate and statistically significant correlation between the average fundamental frequency for the vowel /ɑ/ and H-C3, H-S, and H-PNS and another moderate negative correlation between F3 and F4, and the vertical position of the hyoid bone H-C3 and H-S. For the vowel /i/, there was a moderate negative correlation between F1, F3, and F4 and H-S and also a moderate negative correlation between F3 and F4 and H-C3. For the vowel /o/, there was a moderate negative correlation between F4 and H-S and H-PNS. For the vowel /u/, only F4 correlated significantly with H-S. CONCLUSION There is a moderate correlation between the high formants, mostly F4, and the cephalo-caudal position of the hyoid bone.
Collapse
Affiliation(s)
- Anthony T Macari
- Department of Otolaryngology-Head & Neck Surgery, Orthodontics and Dentofacial Orthopedics Division, American University of Beirut Medical Center, Beirut, Lebanon
| | - Georges Ziade
- Department of Otolaryngology-Head & Neck Surgery, American University of Beirut Medical Center, Beirut, Lebanon
| | - Zaahir Turfe
- Michigan State University College of Human Medicine, East Lansing, Michigan
| | | | - Elie Alam
- Department of Otolaryngology-Head & Neck Surgery, American University of Beirut Medical Center, Beirut, Lebanon
| | - Abdul-Latif Hamdan
- Department of Otolaryngology-Head & Neck Surgery, American University of Beirut Medical Center, Beirut, Lebanon.
| |
Collapse
|
7
|
A Comparison of Different Methods to Generate Tooth Surface Models Without Applying Ionizing Radiation for Digital 3-Dimensional Image Fusion With Magnetic Resonance Imaging–Based Data of the Head and Neck Region. J Comput Assist Tomogr 2015; 39:882-9. [PMID: 26295193 DOI: 10.1097/rct.0000000000000293] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
8
|
Richardson K, Sussman JE, Stathopoulos ET, Huber JE. The effect of increased vocal intensity on interarticulator timing in speakers with Parkinson’s disease: a preliminary analysis. JOURNAL OF COMMUNICATION DISORDERS 2014; 52:44-64. [PMID: 25459460 PMCID: PMC4293242 DOI: 10.1016/j.jcomdis.2014.09.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Revised: 09/18/2014] [Accepted: 09/28/2014] [Indexed: 05/19/2023]
Abstract
PURPOSE The purpose of the current study was to investigate the effect of increased vocal intensity on interarticulator timing in individuals with Parkinson's disease (PD). METHODS Ten individuals with mild to moderate hypophonia, secondary to PD, were selected for study. Over an 8-week treatment period, multi-talker babble noise was presented monaurally to the individuals with PD during everyday communication contexts to elicit increased vocal intensity (Lombard effect). Outcome measures included sound pressure level (SPL), voice onset time (VOT), VOT ratio, percent voicing, and speech intelligibility. RESULTS Group and individual participant responses to the treatment are reported and discussed. Speakers with PD were shown to significantly increase SPL in response to treatment. Six of the 10 speakers showed improved temporal coordination between the laryngeal and supralaryngeal mechanisms (interarticulator timing) in response to treatment. Four of the 10 speakers, however, showed reduced laryngeal–supralaryngeal timing at the end of treatment. Group speech intelligibility scores were significantly higher post-treatment as compared to pre-treatment. CONCLUSIONS Voice treatment during everyday communication resulted in improved temporal coordination across the laryngeal and supralaryngeal mechanisms for the majority of speakers with PD and made them easier to understand. Further investigations are planned to explore individual differences in response to treatment. The identification of speaker-specific voicing and devoicing strategies is consistent with the heterogeneous nature of PD. LEARNING OUTCOMES Readers will be able to: 1. Describe the speech and voice characteristics of individuals with Parkinson's disease. 2. Define the Lombard effect. 3. Describe acoustic measures of voice onset time and percent voicing. 4. Describe the effect of voice treatment on voice onset time and percent voicing in individuals with Parkinson's disease.
Collapse
Affiliation(s)
- Kelly Richardson
- University at Buffalo, Department of Communicative Disorders & Sciences, 3435 Main St., 122 Cary Hall, Buffalo, NY 14214 USA, 1-716-829-5549
| | - Joan E. Sussman
- University at Buffalo, Department of Communicative Disorders & Sciences, 3435 Main St., 122 Cary Hall, Buffalo, NY 14214 USA, 1-716-829-5549
| | - Elaine T. Stathopoulos
- University at Buffalo, Department of Communicative Disorders & Sciences, 3435 Main St., 122 Cary Hall, Buffalo, NY 14214 USA, 1-716-829-5549
| | - Jessica E. Huber
- University at Buffalo, Department of Communicative Disorders & Sciences, 3435 Main St., 122 Cary Hall, Buffalo, NY 14214 USA, 1-716-829-5549
| |
Collapse
|
9
|
Waaramaa T, Palo P, Kankare E. Emotions in freely varying and mono-pitched vowels, acoustic and EGG analyses. LOGOP PHONIATR VOCO 2014; 40:156-70. [PMID: 24998780 DOI: 10.3109/14015439.2014.934277] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Vocal emotions are expressed either by speech or singing. The difference is that in singing the pitch is predetermined while in speech it may vary freely. It was of interest to study whether there were voice quality differences between freely varying and mono-pitched vowels expressed by professional actors. Given their profession, actors have to be able to express emotions both by speech and singing. Electroglottogram and acoustic analyses of emotional utterances embedded in expressions of freely varying vowels [a:], [i:], [u:] (96 samples) and mono-pitched protracted vowels (96 samples) were studied. Contact quotient (CQEGG) was calculated using 35%, 55%, and 80% threshold levels. Three different threshold levels were used in order to evaluate their effects on emotions. Genders were studied separately. The results suggested significant gender differences for CQEGG 80% threshold level. SPL, CQEGG, and F4 were used to convey emotions, but to a lesser degree, when F0 was predetermined. Moreover, females showed fewer significant variations than males. Both genders used more hypofunctional phonation type in mono-pitched utterances than in the expressions with freely varying pitch. The present material warrants further study of the interplay between CQEGG threshold levels and formant frequencies, and listening tests to investigate the perceptual value of the mono-pitched vowels in the communication of emotions.
Collapse
Affiliation(s)
- Teija Waaramaa
- a University of Tampere, School of Communication, Media and Theatre , Tampere , Finland
| | - Pertti Palo
- b Clinical Audiology, Speech and Language (CASL) Research Centre, Queen Margaret University , UK
| | - Elina Kankare
- c University of Tampere, Speech and Voice Research Laboratory, School of Education , Tampere , Finland.,d Ear and Oral Diseases, Department of Phoniatrics in Tampere University Hospital , Tampere , Finland
| |
Collapse
|
10
|
Cartei V, Reby D. Acting Gay: Male Actors Shift the Frequency Components of Their Voices Towards Female Values When Playing Homosexual Characters. JOURNAL OF NONVERBAL BEHAVIOR 2011. [DOI: 10.1007/s10919-011-0123-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
11
|
Echternach M, Sundberg J, Baumann T, Markl M, Richter B. Vocal tract area functions and formant frequencies in opera tenors' modal and falsetto registers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:3955-63. [PMID: 21682417 DOI: 10.1121/1.3589249] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
According to recent model investigations, vocal tract resonance is relevant to vocal registers. However, no experimental corroboration of this claim has been published so far. In the present investigation, ten professional tenors' vocal tract configurations were analyzed using MRI volumetry. All subjects produced a sustained tone on the pitch F4 (349 Hz) on the vowel /a/ (1) in modal and (2) in falsetto register. The area functions were estimated from the MRI data and their associated formant frequencies were calculated. In a second condition the same subjects repeated the same tasks in a sound treated room and their formant frequencies were estimated by means of inverse filtering. In both recordings similar formant frequencies were observed. Vocal tract shapes differed between modal and falsetto register. In modal as compared to falsetto the lip opening and the oral cavity were wider and the first formant frequency was higher. In this sense the presented results are in agreement with the claim that the formant frequencies differ between registers.
Collapse
Affiliation(s)
- Matthias Echternach
- Institut of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strasse 60, 79106 Freiburg, Germany
| | | | | | | | | |
Collapse
|
12
|
Echternach M, Traser L, Markl M, Richter B. Vocal tract configurations in male alto register functions. J Voice 2011; 25:670-7. [PMID: 21402469 DOI: 10.1016/j.jvoice.2010.09.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2010] [Accepted: 09/20/2010] [Indexed: 11/28/2022]
Abstract
PURPOSE Professional male altos (countertenors) mostly use a register function, which is considered to be derived from falsetto. However, the sound produced differs in professional altos compared with the modal register or falsetto of untrained voices. The aim of this study was to analyze differences of the vocal tract shapes in male alto register functions. MATERIAL AND METHODS Dynamic real-time magnetic resonance imaging of eight frames per second was used to analyze the vocal tract profile in seven professional male altos who sang on the vowel /a/, an ascending and descending scale from G3 (196 Hz) to E4 (330 Hz). The scale included their register transition from modal register to stage (counter) falsetto and naïve falsetto. RESULTS Register transitions from modal register to stage falsetto were associated with increased lip opening, jaw retraction, elevation and back positioning of the tongue, pharynx narrowing, uvula elevation, drop of larynx height, and tilting of the larynx. Differences between stage and naïve falsetto were found mostly with regard to lip opening and pharynx width. CONCLUSIONS The differences between the vocal tract configurations might have an impact on the acoustic characteristics observed in professional male alto register functions.
Collapse
Affiliation(s)
- Matthias Echternach
- Institute of Musicians' Medicine, Freiburg University Medical Center, Freiburg, Germany.
| | | | | | | |
Collapse
|
13
|
Echternach M, Sundberg J, Arndt S, Breyer T, Markl M, Schumacher M, Richter B. Vocal tract and register changes analysed by real-time MRI in male professional singers—a pilot study. LOGOP PHONIATR VOCO 2009; 33:67-73. [PMID: 18569645 DOI: 10.1080/14015430701875653] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
14
|
Nunes RB, Souza AMVD, Duprat ADC, Silva MADAE, Costa RC, Paulino JG. Vocal tract analysis in patients with vocal fold nodules, clefts and cysts. Braz J Otorhinolaryngol 2009; 75:188-92. [PMID: 19575103 PMCID: PMC9450662 DOI: 10.1016/s1808-8694(15)30777-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2007] [Accepted: 03/13/2008] [Indexed: 12/04/2022] Open
Abstract
The supraglottic plan represents an important dimension in vocal production, and its characterization is very important in the evaluation and treatment approach of dysphonic individuals. Aim to check if certain glottic configurations are related to specific adjustments in the vocal tract. To use nasal and laryngeal fibroscopy to assess the frequency of supraglottic vocal tract adjustments in dysphonic women with nodules, clefts and cysts. Methods We assessed 31 dysphonic women, with age ranging between 18 and 45 years, with vocal alteration and a diagnosis of nodules, middle-posterior cleft and cyst, and we carried out a summarized evaluation of the sensory-motor and oral systems and the patients were submitted to video-laryngostroboscopy and nasal and laryngeal fibroscopy. Three distinct groups were selected: patients with bilateral nodules, clefts and cysts, with similar glottic configuration. Their vocal tracts were visually analyzed through exams of nasal and laryngeal fibroscopy, by speech and hearing therapists and otorhinolaryngologists, checking the following parameters: supraglottic constriction, larynx vertical mobility, pharyngeal constriction and tongue mobility. The data was statistically described and treated. Results during visual analysis we did not find statistically significant differences which would separate the glottic alterations groups. Conclusion There was no correlation between supraglottic tract adjustments with any particular type of glottic alteration. These are individual behaviors that generate adjustments and justify the different vocal qualities in patients with the same type of laryngeal alteration.
Collapse
|
15
|
Hanayama EM, Camargo ZA, Tsuji DH, Rebelo Pinho SM. Metallic Voice: Physiological and Acoustic Features. J Voice 2009; 23:62-70. [PMID: 17324555 DOI: 10.1016/j.jvoice.2006.12.006] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2006] [Accepted: 12/18/2006] [Indexed: 10/23/2022]
Abstract
The metallic voice is usually confused with ring or nasality by singers and nontrained listeners, who are not used to perceptual vocal analysis. They believe a metallic voice results from a rise in fundamental frequency. A diagnostic error in this aspect may lead to lowering pitch, an incorrect procedure that could cause vocal overload and fatigue. The purpose of this article is to study the quality of metallic voice considering the correlation between information of the physiological and acoustic plans, based on a perceptive consensual assumption. Fiberscopic video pharyngolaryngoscopy was performed on 21 professional singers while speaking vowel [e]--in normal and metallic modes to observe muscular movements and structural changes of the velopharynx, pharynx, and larynx. Vocal samples captured simultaneously to the fiberscopic examination were acoustically analyzed. Frequency and amplitude of the first four formants (F(1), F(2), F(3), and F(4)) were extracted by means of linear predictor coefficients (LPC) spectrum and were statistically analyzed. Vocal tract adjustments such as velar lowering, pharyngeal wall narrowing, laryngeal rise, aryepiglottic, and lateral laryngeal constrictions were frequently found; there were no significant changes in frequency and amplitude of F(1) in the metallic voice; there were significant increases in amplitudes of F(2), F(3), and F(4) and in frequency for F(2); metallic voice perceived as louder was correlated to an increase in amplitude of F(3) and F(4). Physiological adjustments of velopharynx, pharynx, and larynx are combined in characterizing the metallic voice and can be acoustically related to changes in formant pattern.
Collapse
Affiliation(s)
- Eliana Midori Hanayama
- Center for Specialization in Clinical Speech Pathology and Audiology, CEFAC, São Paulo, Brazil.
| | | | | | | |
Collapse
|
16
|
Döllinger M, Berry DA, Montequin DW. The influence of epilarynx area on vocal fold dynamics. Otolaryngol Head Neck Surg 2006; 135:724-729. [PMID: 17071302 DOI: 10.1016/j.otohns.2006.04.007] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2005] [Accepted: 04/25/2006] [Indexed: 10/24/2022]
Abstract
OBJECTIVE This study investigated the influence of epilarynx area on an excised human vocal fold during phonation. STUDY DESIGN A hemilarynx set-up using an excised human larynx was used. An artificial vocal tract with an epilarynx tube of variable cross-sectional area was attached. High-speed imaging was performed and standard phonatory variables were measured. RESULTS Glottal airflow, fundamental frequency, and sound level increased as a function of subglottal pressure. A decrease in epilarynx area decreased phonation threshold pressure, glottal airflow, and vocal fold displacements and velocities. CONCLUSIONS Preliminary experimental results confirm that narrowing the epilarynx area facilitates phonation by decreasing phonation threshold pressure, presumably through impedance matching of the glottal source and vocal tract. SIGNIFICANCE As this phenomenon associated with epilarynx narrowing is further quantified and generalized, eventually new surgical alterations of the epilaryngeal structure may be suggested to facilitate phonation, in addition to standard phonosurgical procedures.
Collapse
Affiliation(s)
- Michael Döllinger
- Laryngeal Dynamics Laboratory, Division of Head & Neck Surgery, UCLA School of Medicine, Los Angeles, California.
| | | | | |
Collapse
|
17
|
Xue SA, Hao GJP, Mayo R. Volumetric measurements of vocal tracts for male speakers from different races. CLINICAL LINGUISTICS & PHONETICS 2006; 20:691-702. [PMID: 17342877 DOI: 10.1080/02699200500297716] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Research examining physiologic and acoustic characteristics of culturally diverse populations is sorely needed, but rarely reported. The major aim of this study was to quantify vocal tract dimensional parameters (oral length, oral volume, pharyngeal length, pharyngeal volume, total vocal tract length and total vocal tract volume) of adult male speakers from three different racial populations (White American, African American, and Chinese). It also attempted to investigate if volumetric differences in the speakers' vocal tracts, like length differences, would contribute to the acoustic characteristics of these speakers from different races. The findings of this study support the hypothesis that speakers from different races may have morphological differences in their vocal tract dimensions, and these morphological differences (especially volumetric differences) could be partially responsible for the formant frequency differences in a vowel sound void of specific language/dialectal impacts. The study has provided speech scientists, speech-language pathologists, linguists and other health professionals with a new and preliminary acoustic and physiological database for adult male speakers from these three different races.
Collapse
Affiliation(s)
- Steve An Xue
- Human Vocal Tract Lab, Department of Speech and Hearing Sciences, Portland State University, Portland, OR 97207-0751, USA.
| | | | | |
Collapse
|
18
|
Xue SA, Hao JG. Normative standards for vocal tract dimensions by race as measured by acoustic pharyngometry. J Voice 2006; 20:391-400. [PMID: 16243483 DOI: 10.1016/j.jvoice.2005.05.001] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/17/2005] [Indexed: 11/28/2022]
Abstract
SUMMARY Acoustic pharyngometry evaluates the geometry of the vocal tract with acoustic reflections and provides information about vocal tract cross-sectional area and volume from lip to the glottis. Variations in vocal tract diameters are needed for speech scientists to validate various acoustic models and for medical professionals since the advent of endoscopic surgical techniques. Race is known to be one of the most important factors affecting the oral and nasal structures. This study compared vocal tract dimensions of White American, African American, and Chinese male and female speakers. One hundred and twenty healthy adult subjects with equal numbers of men and women were divided among three races. Subjects were controlled for age, gender, height, and weight. Six dimensional parameters of the speakers' vocal tract cavities were measured with acoustic reflection technology (AR). Significant gender and race main effects were found in certain vocal tract dimensions. The findings of this study now provide speech scientists, speech-language pathologists, and other health professionals with a new anatomical database of vocal tract variations for adult speakers from three different races.
Collapse
Affiliation(s)
- Steve An Xue
- Department of Speech and Hearing Sciences, Human Vocal Tract Research Lab, Portland State University, Portland, OR 97207, USA.
| | | |
Collapse
|
19
|
Ogawa T, Enciso R, Shintaku WH, Clark GT. Evaluation of cross-section airway configuration of obstructive sleep apnea. ACTA ACUST UNITED AC 2006; 103:102-8. [PMID: 17178502 PMCID: PMC1868407 DOI: 10.1016/j.tripleo.2006.06.008] [Citation(s) in RCA: 163] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2005] [Revised: 06/01/2006] [Accepted: 06/02/2006] [Indexed: 10/24/2022]
Abstract
Upper airway imaging techniques can be useful to identify the exact location and nature of the obstruction in obstructive sleep apnea (OSA) patients. Ten OSA patients and 10 non-OSA control subjects were imaged using cone-beam computed tomography (NewTom QR-DVT9000) to compare their upper airway structure. The OSA subjects presented higher BMI (OSA: 29.5 +/- 9.05 kg/m(2); non-OSA: 23.1 +/- 3.05 kg/m(2) [P = .034]), lower total volume (mm(3)) of the airway (OSA: 4868.4 +/- 1863.9; non-OSA: 6051.7 +/- 1756.4 [P = .054]), statistically significantly smaller anterior-posterior dimension (mm) of the minimum cross-section segment (OSA: 4.6 +/- 1.2; non-OSA: 7.8 +/- 3.31 [P = .009]), and smaller minimum cross-section area (OSA: 45.8+/-17.5 mm(2); non-OSA: 146.9 +/- 111.7 mm(2) [P = .011]) positioned below the occlusal plane in 70% of the cases (OSA:7 out of 10; non-OSA: 5 out of 10 [P = .030]). The OSA group presented a concave or elliptic shaped airway and the non-OSA group presented a concave, round, or square shaped airway.
Collapse
Affiliation(s)
- Takumi Ogawa
- Visiting Scholar, Orofacial Pain/Oral Medicine Center, Division of Diagnostic Sciences, School of Dentistry, University of Southern California Los Angeles, CA 90089-0641
| | - Reyes Enciso
- Clinical Assistant Professor, Division of Craniofacial Sciences and Therapeutics, School of Dentistry, University of Southern California Los Angeles, CA 90089-0641
| | - Werner H. Shintaku
- Clinical Assistant Professor, Oral Radiology Clinic, Division of Diagnostic Sciences, School of Dentistry, University of Southern California Los Angeles, CA 90089-0641
| | - Glenn T. Clark
- Professor and Program Director, Orofacial Pain and Oral Medicine Center, Division of Diagnostic Sciences, School of Dentistry, University of Southern California Los Angeles, CA 90089-0641
| |
Collapse
|
20
|
Xue SA, Hao GJ. Changes in the human vocal tract due to aging and the acoustic correlates of speech production: a pilot study. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2003; 46:689-701. [PMID: 14696995 DOI: 10.1044/1092-4388(2003/054)] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
This investigation used a derivation of acoustic reflection (AR) technology to make cross-sectional measurements of changes due to aging in the oral and pharyngeal lumina of male and female speakers. The purpose of the study was to establish preliminary normative data for such changes and to obtain acoustic measurements of changes due to aging in the formant frequencies of selected spoken vowels and their long-term average spectra (LTAS) analysis. Thirty-eight young men and women and 38 elderly men and women were involved in the study. The oral and pharyngeal lumina of the participants were measured with AR technology, and their formant frequencies were analyzed using the Kay Elemetrics Computerized Speech Lab. The findings have delineated specific and similar patterns of aging changes in human vocal tract configurations in speakers of both genders. Namely, the oral cavity length and volume of elderly speakers increased significantly compared to their young cohorts. The total vocal tract volume of elderly speakers also showed a significant increment, whereas the total vocal tract length of elderly speakers did not differ significantly from their young cohorts. Elderly speakers of both genders also showed similar patterns of acoustic changes of speech production, that is, consistent lowering of formant frequencies (especially F1) across selected vowel productions. Although new research models are still needed to succinctly account for the speech acoustic changes of the elderly, especially for their specific patterns of human vocal tract dimensional changes, this study has innovatively applied the noninvasive and cost-effective AR technology to monitor age-related human oral and pharyngeal lumina changes that have direct consequences for speech production.
Collapse
|
21
|
Tom K, Titze IR. Vocal intensity in falsetto phonation of a countertenor: an analysis by synthesis approach. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2001; 110:1667-1676. [PMID: 11572375 DOI: 10.1121/1.1396331] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
An analysis by synthesis paradigm was implemented to model glottal airflow and vocal tract acoustics for the falsetto phonation of a trained countertenor. Changes in vocal intensity were measured as a function of subglottal pressure, open quotient of the time-varying glottal airflow pulse, and formant tuning. The contributions of laryngeal adduction (open quotient of the glottal flow pulse) and of formant tuning to intensity change were derived from modeled data. The findings were: (1) Subglottal pressure accounted for almost 90% of the variation in SPL in falsetto phonation. (2) The open quotient of the glottal flow pulse was remarkably constant in these falsetto phonations, and thus did not affect vocal intensity significantly. (3) Formant tuning occurred in two out of nine possibilities for the vowel /a/. These instances did not support the concept of systematic exploitation of formant tuning.
Collapse
Affiliation(s)
- K Tom
- Department of Speech Communication, California State University Fullerton, 92831, USA
| | | |
Collapse
|