1
|
Jelinger J, Perta K, Lee J, Wiksten N, Bae Y. Oropharyngeal and Aryepiglottic Narrowing for Twang: A Magnetic Resonance Imaging Study. J Voice 2024:S0892-1997(24)00192-9. [PMID: 38964963 DOI: 10.1016/j.jvoice.2024.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 06/18/2024] [Accepted: 06/19/2024] [Indexed: 07/06/2024]
Abstract
OBJECTIVE This study aimed to compare vocal tract configurations between speech and twang qualities. METHODS Magnetic resonance imaging data were acquired from five professional vocalists while producing the sustained vowel /i/. Width and area measurements were obtained from the axial (ie, transverse) images to evaluate oropharyngeal narrowing and aryepiglottic (AES) narrowing. RESULTS Four out of five participants exhibited a smaller vocal tract area for twang than for speech at the oropharyngeal level, with the extent of narrowing ranging from 18.8% to 49.6%. Only one participant showed a meaningful decrease in oropharyngeal anteroposterior (AP) width, while three participants showed meaningful decreases in oropharyngeal mediolateral (ML) width for twang compared to speech. At the AES level, all participants showed a smaller vocal tract area for twang than for speech, with the extent of narrowing ranging from 11.8% to 52.4%. Two participants exhibited meaningful decreases in AES AP width, while three participants showed meaningful decreases in AES ML width for twang compared to speech. CONCLUSIONS Axial imaging revealed oropharyngeal and AES narrowing associated with twang, more prominent in the ML than the AP dimension. Notable individual variations in the mechanism and degree of narrowing at the oropharyngeal and AES levels were observed. The degree of narrowing varied among participants, highlighting the complexity of physiological maneuvers involved in twang production. Future research is necessary to identify broader patterns in twang production for effective pedagogic and therapeutic applications.
Collapse
Affiliation(s)
- Jessica Jelinger
- Voice and Resonance Laboratory, The Ohio State University, Columbus, Ohio; Department of Speech and Hearing Science, The Ohio State University, Columbus, Ohio
| | - Karen Perta
- Department of Communication Sciences and Disorders, Elmhurst University, Elmhurst, Illinois
| | - Jennifer Lee
- Voice and Resonance Laboratory, The Ohio State University, Columbus, Ohio
| | - Nicole Wiksten
- Department of Speech and Hearing Science, The Ohio State University, Columbus, Ohio
| | - Youkyung Bae
- Voice and Resonance Laboratory, The Ohio State University, Columbus, Ohio; Department of Speech and Hearing Science, The Ohio State University, Columbus, Ohio.
| |
Collapse
|
2
|
Burk F, Traser L, Burdumy M, Richter B, Echternach M. Dynamic changes of vocal tract dimensions with sound pressure level during messa di vocea). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:3595-3603. [PMID: 38038612 DOI: 10.1121/10.0022582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 11/14/2023] [Indexed: 12/02/2023]
Abstract
The messa di voce (MdV), which consists of a continuous crescendo and subsequent decrescendo on one pitch is one of the more difficult exercises of the technical repertoire of Western classical singing. With rising lung pressure, regulatory adjustments both on the level of the glottis and the vocal tract are required to keep the pitch stable. The dynamic changes of vocal tract dimensions with the bidirectional variation of sound pressure level (SPL) during MdV were analyzed by two-dimensional real-time magnetic resonance imaging (25 frames/s) and synchronous audio recordings in 12 professional singer subjects. Close associations in the respective articulatory kinetics were found between SPL and lip opening, jaw opening, pharynx width, uvula elevation, and vertical larynx position. However, changes in vocal tract dimensions during plateaus of SPL suggest that perceived loudness could have been varied beyond the dimension of SPL. Further multimodal investigation, including the analysis of sound spectra, is needed for a better understanding of the role of vocal tract resonances in the control of vocal loudness in human phonation.
Collapse
Affiliation(s)
- Fabian Burk
- Department of Otorhinolaryngology and Plastic Surgery, SRH Wald-Klinikum Gera, Gera, Germany
- Institute of Musicians' Medicine, University Medical Center Freiburg, Freiburg im Breisgau, Germany
| | - Louisa Traser
- Institute of Musicians' Medicine, University Medical Center Freiburg, Freiburg im Breisgau, Germany
| | - Michael Burdumy
- Department of Radiology, Medical Physics, University Medical Center Freiburg, Freiburg im Breisgau, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, University Medical Center Freiburg, Freiburg im Breisgau, Germany
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| |
Collapse
|
3
|
Köberlein M, Birkholz P, Burdumy M, Richter B, Burk F, Traser L, Echternach M. Investigation of resonance strategies of high pitch singing sopranos using dynamic three-dimensional magnetic resonance imaging. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:4191. [PMID: 34972262 DOI: 10.1121/10.0008903] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 11/10/2021] [Indexed: 06/14/2023]
Abstract
Resonance-strategies with respect to vocal registers, i.e., frequency-ranges of uniform, demarcated voice quality, for the highest part of the female voice are still not completely understood. The first and second vocal tract resonances usually determine vowels. If the fundamental frequency exceeds the vowel-shaping resonance frequencies of speech, vocal tract resonances are tuned to voice source partials. It has not yet been clarified if such tuning is applicable for the entire voice-range, particularly for the top pitches. We investigated professional sopranos who regularly sing pitches above C6 (1047 Hz). Dynamic three-dimensional (3D) magnetic resonance imaging was used to calculate resonances for pitches from C5 (523 Hz) to C7 (2093 Hz) with different vowel configurations ([a:], [i:], [u:]), and different contexts (scales or octave jumps). A spectral analysis and an acoustic analysis of 3D-printed vocal tract models were conducted. The results suggest that there is no exclusive register-defining resonance-strategy. The intersection of fundamental frequency and first vocal tract resonance was not found to necessarily indicate a register shift. The articulators and the vocal tract resonances were either kept without significant adjustments, or the fR1:fo-tuning, wherein the first vocal tract resonance enhances the fundamental frequency, was applied until F6 (1396 Hz). An fR2:fo-tuning was not observed.
Collapse
Affiliation(s)
- Marie Köberlein
- Medical Faculty of the Albert-Ludwigs-University Freiburg, Freiburg Institute for Musicians' Medicine, University Medical Center Freiburg, University of Music Freiburg, Elsässer Straße 2m, 79110, Freiburg, Germany
| | - Peter Birkholz
- Institute of Acoustics and Speech Communication, Technische Universität Dresden, Germany
| | - Michael Burdumy
- Department of Medical Physics, Radiology, Freiburg University Medical Center, Germany
| | - Bernhard Richter
- Medical Faculty of the Albert-Ludwigs-University Freiburg, Freiburg Institute for Musicians' Medicine, University Medical Center Freiburg, University of Music Freiburg, Elsässer Straße 2m, 79110, Freiburg, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Louisa Traser
- Medical Faculty of the Albert-Ludwigs-University Freiburg, Freiburg Institute for Musicians' Medicine, University Medical Center Freiburg, University of Music Freiburg, Elsässer Straße 2m, 79110, Freiburg, Germany
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, University Hospital, LMU Munich, Germany
| |
Collapse
|
4
|
Lynn E, Narayanan SS, Lammert AC. Dark tone quality and vocal tract shaping in soprano song production: Insights from real-time MRI. JASA EXPRESS LETTERS 2021; 1:075202. [PMID: 34291230 PMCID: PMC8273971 DOI: 10.1121/10.0005109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 05/10/2021] [Indexed: 06/13/2023]
Abstract
Tone quality termed "dark" is an aesthetically important property of Western classical voice performance and has been associated with lowered formant frequencies, lowered larynx, and widened pharynx. The present study uses real-time magnetic resonance imaging with synchronous audio recordings to investigate dark tone quality in four professionally trained sopranos with enhanced ecological validity and a relatively complete view of the vocal tract. Findings differ from traditional accounts, indicating that labial narrowing may be the primary driver of dark tone quality across performers, while many other aspects of vocal tract shaping are shown to differ significantly in a performer-specific way.
Collapse
Affiliation(s)
- Elisabeth Lynn
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts 01690, USA
| | - Shrikanth S Narayanan
- Signal Analysis and Interpretation Laboratory, University of Southern California, Los Angeles, California 95616, USA , ,
| | - Adam C Lammert
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts 01690, USA
| |
Collapse
|
5
|
Echternach M, Herbst CT, Köberlein M, Story B, Döllinger M, Gellrich D. Are source-filter interactions detectable in classical singing during vowel glides? THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:4565. [PMID: 34241428 DOI: 10.1121/10.0005432] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 06/03/2021] [Indexed: 06/13/2023]
Abstract
In recent studies, it has been assumed that vocal tract formants (Fn) and the voice source could interact. However, there are only few studies analyzing this assumption in vivo. Here, the vowel transition /i/-/a/-/u/-/i/ of 12 professional classical singers (6 females, 6 males) when phonating on the pitch D4 [fundamental frequency (ƒo) ca. 294 Hz] were analyzed using transnasal high speed videoendoscopy (20.000 fps), electroglottography (EGG), and audio recordings. Fn data were calculated using a cepstral method. Source-filter interaction candidates (SFICs) were determined by (a) algorithmic detection of major intersections of Fn/nƒo and (b) perceptual assessment of the EGG signal. Although the open quotient showed some increase for the /i-a/ and /u-i/ transitions, there were no clear effects at the expected Fn/nƒo intersections. In contrast, ƒo adjustments and changes in the phonovibrogram occurred at perceptually derived SFICs, suggesting level-two interactions. In some cases, these were constituted by intersections between higher nƒo and Fn. The presented data partially corroborates that vowel transitions may result in level-two interactions also in professional singers. However, the lack of systematically detectable effects suggests either the absence of a strong interaction or existence of confounding factors, which may potentially counterbalance the level-two-interactions.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistrasse 15, Munich, 81377, Germany
| | - Christian T Herbst
- Antonio Salieri Department of Vocal Studies and Vocal Research in Music Education, University of Music and Performing Arts Vienna, Vienna, Austria
| | - Marie Köberlein
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistrasse 15, Munich, 81377, Germany
| | - Brad Story
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85718, USA
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School Waldstrasse 1, Erlangen, 91054, Germany
| | - Donata Gellrich
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistrasse 15, Munich, 81377, Germany
| |
Collapse
|
6
|
Perta K, Bae Y, Obert K. A pilot investigation of twang quality using magnetic resonance imaging. LOGOP PHONIATR VOCO 2020; 46:77-85. [DOI: 10.1080/14015439.2020.1757147] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Karen Perta
- Division of Social and Behavioral Sciences, Ohio State University, Columbus, OH, USA
| | - Youkyung Bae
- Division of Social and Behavioral Sciences, Speech and Hearing Science, Ohio State University, Columbus, OH, USA
| | - Kerrie Obert
- College of Medicine, Ohio State University, Columbus, OH, USA
| |
Collapse
|
7
|
Kim YC. Fast upper airway magnetic resonance imaging for assessment of speech production and sleep apnea. PRECISION AND FUTURE MEDICINE 2018. [DOI: 10.23838/pfm.2018.00100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
|
8
|
Vos RR, Murphy DT, Howard DM, Daffern H. Determining the Relevant Criteria for Three-dimensional Vocal Tract Characterization. J Voice 2018. [DOI: 10.1016/j.jvoice.2017.04.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
9
|
Echternach M, Burk F, Köberlein M, Selamtzis A, Döllinger M, Burdumy M, Richter B, Herbst CT. Laryngeal evidence for the first and second passaggio in professionally trained sopranos. PLoS One 2017; 12:e0175865. [PMID: 28467509 PMCID: PMC5414960 DOI: 10.1371/journal.pone.0175865] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 03/31/2017] [Indexed: 11/18/2022] Open
Abstract
Introduction Due to a lack of empirical data, the current understanding of the laryngeal mechanics in the passaggio regions (i.e., the fundamental frequency ranges where vocal registration events usually occur) of the female singing voice is still limited. Material and methods In this study the first and second passaggio regions of 10 professionally trained female classical soprano singers were analyzed. The sopranos performed pitch glides from A3 (ƒo = 220 Hz) to A4 (ƒo = 440 Hz) and from A4 (ƒo = 440 Hz) to A5 (ƒo = 880 Hz) on the vowel [iː]. Vocal fold vibration was assessed with trans-nasal high speed videoendoscopy at 20,000 fps, complemented by simultaneous electroglottographic (EGG) and acoustic recordings. Register breaks were perceptually rated by 12 voice experts. Voice stability was documented with the EGG-based sample entropy. Glottal opening and closing patterns during the passaggi were analyzed, supplemented with open quotient data extracted from the glottal area waveform. Results In both the first and the second passaggio, variations of vocal fold vibration patterns were found. Four distinct patterns emerged: smooth transitions with either increasing or decreasing durations of glottal closure, abrupt register transitions, and intermediate loss of vocal fold contact. Audible register transitions (in both the first and second passaggi) generally coincided with higher sample entropy values and higher open quotient variance through the respective passaggi. Conclusions Noteworthy vocal fold oscillatory registration events occur in both the first and the second passaggio even in professional sopranos. The respective transitions are hypothesized to be caused by either (a) a change of laryngeal biomechanical properties; or by (b) vocal tract resonance effects, constituting level 2 source-filter interactions.
Collapse
Affiliation(s)
- Matthias Echternach
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Fabian Burk
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Marie Köberlein
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Andreas Selamtzis
- Royal Technical University, Music Acoustics. Lindstedtsvägen 24, Stockholm, Sweden
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Medical School, Waldstrasse 1, Erlangen, Germany
| | - Michael Burdumy
- Department of Medical Physics, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Christian Thomas Herbst
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, Vienna, Austria
- * E-mail:
| |
Collapse
|
10
|
Töger J, Sorensen T, Somandepalli K, Toutios A, Lingala SG, Narayanan S, Nayak K. Test-retest repeatability of human speech biomarkers from static and real-time dynamic magnetic resonance imaging. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:3323. [PMID: 28599561 PMCID: PMC5436977 DOI: 10.1121/1.4983081] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Static anatomical and real-time dynamic magnetic resonance imaging (RT-MRI) of the upper airway is a valuable method for studying speech production in research and clinical settings. The test-retest repeatability of quantitative imaging biomarkers is an important parameter, since it limits the effect sizes and intragroup differences that can be studied. Therefore, this study aims to present a framework for determining the test-retest repeatability of quantitative speech biomarkers from static MRI and RT-MRI, and apply the framework to healthy volunteers. Subjects (n = 8, 4 females, 4 males) are imaged in two scans on the same day, including static images and dynamic RT-MRI of speech tasks. The inter-study agreement is quantified using intraclass correlation coefficient (ICC) and mean within-subject standard deviation (σe). Inter-study agreement is strong to very strong for static measures (ICC: min/median/max 0.71/0.89/0.98, σe: 0.90/2.20/6.72 mm), poor to strong for dynamic RT-MRI measures of articulator motion range (ICC: 0.26/0.75/0.90, σe: 1.6/2.5/3.6 mm), and poor to very strong for velocities (ICC: 0.21/0.56/0.93, σe: 2.2/4.4/16.7 cm/s). In conclusion, this study characterizes repeatability of static and dynamic MRI-derived speech biomarkers using state-of-the-art imaging. The introduced framework can be used to guide future development of speech biomarkers. Test-retest MRI data are provided free for research use.
Collapse
Affiliation(s)
- Johannes Töger
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Tanner Sorensen
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Krishna Somandepalli
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Asterios Toutios
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Sajan Goud Lingala
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Krishna Nayak
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| |
Collapse
|
11
|
Burdumy M, Traser L, Burk F, Richter B, Echternach M, Korvink JG, Hennig J, Zaitsev M. One-second MRI of a three-dimensional vocal tract to measure dynamic articulator modifications. J Magn Reson Imaging 2016; 46:94-101. [PMID: 27943448 DOI: 10.1002/jmri.25561] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Accepted: 11/08/2016] [Indexed: 11/10/2022] Open
Abstract
PURPOSE To enable three-dimensional (3D) vocal tract imaging of dynamic singing or speech tasks at voxel sizes of 1.6 × 1.6 × 1.3 mm3 at 1.3 s per image. MATERIALS AND METHODS A Stack-of-Stars method was implemented and enhanced to allow for fast and efficient k-space sampling of the box-shaped vocal tract using a 3 Tesla MRI system. Images were reconstructed using an off-line image reconstruction using compressed sensing theory, leading to the abovementioned spatial and temporal resolutions. To validate spatial resolution, a phantom with holes of defined sizes was measured. The applicability of the imaging method was validated in an eight-subject study of amateur singers that were required to sustain phonation at a constant pitch, past their comfortable expiratory level. A segmentation of the vocal tract over all phonation time steps was done for one subject. Anatomical distances (larynx position and pharynx width) were calculated and compared for all subjects. RESULTS Analysis of the phantom study revealed that the imaging method could provide at least 1.6 mm isotropic resolution. Visual inspection of the segmented vocal tract during phonation showed modifications of the lips, tongue, and larynx position in all three dimensions. The mean larynx position per subject amounted to 52-85 mm, deviating up to 5% over phonation time. Parameter pharynx width was 32-181 mm2 on average per subject, deviating up to 16% over phonation time. Visual inspection of the parameter course revealed no common compensation strategy for long sustained phonation. CONCLUSION The results of both phantom and in vivo measurements show the applicability of the fast 3D imaging method for voice research and indicate that modifications in all three dimensions can be observed and quantified. LEVEL OF EVIDENCE 2 Technical Efficacy: Stage 1 J. MAGN. RESON. IMAGING 2017;46:94-101.
Collapse
Affiliation(s)
- Michael Burdumy
- University Medical Center Freiburg, Department of Radiology, Medical Physics, Freiburg, Germany.,University Medical Center Freiburg, Institute of Musicians' Medicine, Freiburg, Germany
| | - Louisa Traser
- University Medical Center Freiburg, Institute of Musicians' Medicine, Freiburg, Germany.,Department of Oto-Rhino-Laryngology, Head and Neck Surgery, University Medical Center, Freiburg, Germany
| | - Fabian Burk
- University Medical Center Freiburg, Institute of Musicians' Medicine, Freiburg, Germany
| | - Bernhard Richter
- University Medical Center Freiburg, Institute of Musicians' Medicine, Freiburg, Germany
| | - Matthias Echternach
- University Medical Center Freiburg, Institute of Musicians' Medicine, Freiburg, Germany
| | - Jan G Korvink
- Institute of Microstructure Technology, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Jürgen Hennig
- University Medical Center Freiburg, Department of Radiology, Medical Physics, Freiburg, Germany
| | - Maxim Zaitsev
- University Medical Center Freiburg, Department of Radiology, Medical Physics, Freiburg, Germany
| |
Collapse
|
12
|
Echternach M, Burk F, Burdumy M, Traser L, Richter B. Morphometric Differences of Vocal Tract Articulators in Different Loudness Conditions in Singing. PLoS One 2016; 11:e0153792. [PMID: 27096935 PMCID: PMC4838265 DOI: 10.1371/journal.pone.0153792] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Accepted: 04/04/2016] [Indexed: 11/30/2022] Open
Abstract
Introduction Dynamic MRI analysis of phonation has gathered interest in voice and speech physiology. However, there are limited data addressing the extent to which articulation is dependent on loudness. Material and Methods 12 professional singer subjects of different voice classifications were analysed concerning the vocal tract profiles recorded with dynamic real-time MRI with 25fps in different pitch and loudness conditions. The subjects were asked to sing ascending scales on the vowel /a/ in three loudness conditions (comfortable = mf, very soft = pp, very loud = ff, respectively). Furthermore, fundamental frequency and sound pressure level were analysed from the simultaneously recorded optical audio signal after noise cancellation. Results The data show articulatory differences with respect to changes of both pitch and loudness. Here, lip opening and pharynx width were increased. While the vertical larynx position was rising with pitch it was lower for greater loudness. Especially, the lip opening and pharynx width were more strongly correlated with the sound pressure level than with pitch. Conclusion For the vowel /a/ loudness has an effect on articulation during singing which should be considered when articulatory vocal tract data are interpreted.
Collapse
Affiliation(s)
- Matthias Echternach
- Institute of Musicians’ Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
- * E-mail:
| | - Fabian Burk
- Institute of Musicians’ Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| | - Michael Burdumy
- Institute of Musicians’ Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
- Department of Medical Physics, Radiology, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| | - Louisa Traser
- Institute of Musicians’ Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
- Department of Otorhinolaryngology, Freiburg University Medical Center, Kilianstr. 5, 79106 Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians’ Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| |
Collapse
|
13
|
Lingala SG, Zhu Y, Kim YC, Toutios A, Narayanan S, Nayak KS. A fast and flexible MRI system for the study of dynamic vocal tract shaping. Magn Reson Med 2016; 77:112-125. [PMID: 26778178 DOI: 10.1002/mrm.26090] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Revised: 11/06/2015] [Accepted: 11/24/2015] [Indexed: 11/07/2022]
Abstract
PURPOSE The aim of this work was to develop and evaluate an MRI-based system for study of dynamic vocal tract shaping during speech production, which provides high spatial and temporal resolution. METHODS The proposed system utilizes (a) custom eight-channel upper airway coils that have high sensitivity to upper airway regions of interest, (b) two-dimensional golden angle spiral gradient echo acquisition, (c) on-the-fly view-sharing reconstruction, and (d) off-line temporal finite difference constrained reconstruction. The system also provides simultaneous noise-cancelled and temporally aligned audio. The system is evaluated in 3 healthy volunteers, and 1 tongue cancer patient, with a broad range of speech tasks. RESULTS We report spatiotemporal resolutions of 2.4 × 2.4 mm2 every 12 ms for single-slice imaging, and 2.4 × 2.4 mm2 every 36 ms for three-slice imaging, which reflects roughly 7-fold acceleration over Nyquist sampling. This system demonstrates improved temporal fidelity in capturing rapid vocal tract shaping for tasks, such as producing consonant clusters in speech, and beat-boxing sounds. Novel acoustic-articulatory analysis was also demonstrated. CONCLUSION A synergistic combination of custom coils, spiral acquisitions, and constrained reconstruction enables visualization of rapid speech with high spatiotemporal resolution in multiple planes. Magn Reson Med 77:112-125, 2017. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Sajan Goud Lingala
- Electrical Engineering, University of Southern California, Los Angeles, CA
| | - Yinghua Zhu
- Electrical Engineering, University of Southern California, Los Angeles, CA
| | | | - Asterios Toutios
- Electrical Engineering, University of Southern California, Los Angeles, CA
| | | | - Krishna S Nayak
- Electrical Engineering, University of Southern California, Los Angeles, CA
| |
Collapse
|
14
|
Echternach M, Birkholz P, Traser L, Flügge TV, Kamberger R, Burk F, Burdumy M, Richter B. Articulation and vocal tract acoustics at soprano subject's high fundamental frequencies. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:2586-2595. [PMID: 25994691 DOI: 10.1121/1.4919356] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The role of the vocal tract for phonation at very high soprano fundamental frequencies (F0s) is not yet understood in detail. In this investigation, two experiments were carried out with a single professional high soprano subject. First, using two dimensional (2D) dynamic real-time magnetic resonance imaging (MRI) (24 fps) midsagittal and coronal vocal tract shapes were analyzed while the subject sang a scale from Bb5 (932 Hz) to G6 (1568 Hz). In a second experiment, volumetric vocal tract MRI data were recorded from sustained phonations (13 s) for the pitches C6 (1047 Hz) and G6 (1568 Hz). Formant frequencies were measured in physical models created by 3D printing, and calculated from area functions obtained from the 3D vocal tract shapes. The data showed that there were only minor modifications of the vocal tract shape. These changes involved a decrease of the piriform sinus as well as small changes of tongue position. Formant frequencies did not exhibit major differences between C6 and G6 for F1 and F3, respectively. Only F2 was slightly raised for G6. For G6, however, F2 is not excited by any voice source partial. Therefore, this investigation was not able to confirm that the analyzed professional soprano subject adjusted formants to voice source partials for the analyzed F0s.
Collapse
Affiliation(s)
- Matthias Echternach
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| | - Peter Birkholz
- Institute of Acoustics and Speech Communication, Technische Universität Dresden, Dresden, 01062 Dresden, Germany
| | - Louisa Traser
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| | - Tabea V Flügge
- Department of Craniomaxillofacial Surgery, Freiburg University Medical Center, Hugstetterstr. 55, 79106 Freiburg, Germany
| | - Robert Kamberger
- Laboratory of Simulation, Department of Microsystems Engineering-IMTEK, University of Freiburg, Georges-Köhler-Allee 102, 79110 Freiburg, Germany
| | - Fabian Burk
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| | - Michael Burdumy
- Department of Radiology, Medical Physics, Freiburg University Medical Center, Breisacher Str. 60a, 79106 Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| |
Collapse
|
15
|
Echternach M, Dippold S, Richter B. High-speed imaging using rigid laryngoscopy for the analysis of register transitions in professional operatic tenors. LOGOP PHONIATR VOCO 2014; 41:1-8. [DOI: 10.3109/14015439.2014.936499] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
16
|
Vocal Tract Configurations in Tenors' Passaggio in Different Vowel Conditions—A Real-Time Magnetic Resonance Imaging Study. J Voice 2014; 28:262.e1-262.e8. [DOI: 10.1016/j.jvoice.2013.10.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2013] [Accepted: 10/11/2013] [Indexed: 11/18/2022]
|
17
|
Dynamic real-time magnetic resonance imaging for the analysis of voice physiology. Curr Opin Otolaryngol Head Neck Surg 2013; 20:450-7. [PMID: 23086261 DOI: 10.1097/moo.0b013e3283585f87] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
PURPOSE OF REVIEW For a number of years, it has been possible to use dynamic real-time magnetic resonance imaging (MRI) to analyse the dynamic processes which occur in the human body. In the fields of laryngology and phoniatrics, such dynamic processes are found not only in swallowing, but also in voice and speech production. This article aims to present an overview of how the use of MRI might add to our current understanding of the dynamic processes involved in voice production. RECENT FINDINGS It is shown that up to now the analysis of vocal fold oscillations has been limited by MRI's relatively low sampling rate of up to 50 Hz. Nevertheless, more detailed analysis does seem possible with regard to the modulation of the power source and vocal tract. SUMMARY Dynamic real-time MRI offers a great opportunity for the analysis of voice production in all stages of the voice production system.
Collapse
|
18
|
Traser L, Burdumy M, Richter B, Vicari M, Echternach M. The Effect of Supine and Upright Position on Vocal Tract Configurations During Singing—A Comparative Study in Professional Tenors. J Voice 2013; 27:141-8. [DOI: 10.1016/j.jvoice.2012.11.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Accepted: 11/13/2012] [Indexed: 10/27/2022]
|
19
|
Rua Ventura SM, Freitas DRS, Ramos IMA, Tavares JMR. Morphologic Differences in the Vocal Tract Resonance Cavities of Voice Professionals: An MRI-Based Study. J Voice 2013; 27:132-40. [DOI: 10.1016/j.jvoice.2012.11.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Accepted: 11/30/2012] [Indexed: 11/25/2022]
|