1
|
Elemans CPH, Jiang W, Jensen MH, Pichler H, Mussman BR, Nattestad J, Wahlberg M, Zheng X, Xue Q, Fitch WT. Evolutionary novelties underlie sound production in baleen whales. Nature 2024; 627:123-129. [PMID: 38383781 DOI: 10.1038/s41586-024-07080-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/16/2024] [Indexed: 02/23/2024]
Abstract
Baleen whales (mysticetes) use vocalizations to mediate their complex social and reproductive behaviours in vast, opaque marine environments1. Adapting to an obligate aquatic lifestyle demanded fundamental physiological changes to efficiently produce sound, including laryngeal specializations2-4. Whereas toothed whales (odontocetes) evolved a nasal vocal organ5, mysticetes have been thought to use the larynx for sound production1,6-8. However, there has been no direct demonstration that the mysticete larynx can phonate, or if it does, how it produces the great diversity of mysticete sounds9. Here we combine experiments on the excised larynx of three mysticete species with detailed anatomy and computational models to show that mysticetes evolved unique laryngeal structures for sound production. These structures allow some of the largest animals that ever lived to efficiently produce frequency-modulated, low-frequency calls. Furthermore, we show that this phonation mechanism is likely to be ancestral to all mysticetes and shares its fundamental physical basis with most terrestrial mammals, including humans10, birds11, and their closest relatives, odontocetes5. However, these laryngeal structures set insurmountable physiological limits to the frequency range and depth of their vocalizations, preventing them from escaping anthropogenic vessel noise12,13 and communicating at great depths14, thereby greatly reducing their active communication range.
Collapse
Affiliation(s)
- Coen P H Elemans
- Sound Communication and Behaviour Group, Department of Biology, University of Southern Denmark, Odense, Denmark.
| | - Weili Jiang
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, NY, USA
| | - Mikkel H Jensen
- Sound Communication and Behaviour Group, Department of Biology, University of Southern Denmark, Odense, Denmark
| | - Helena Pichler
- Department of Behavioral and Cognitive Biology, Faculty of Life Sciences, University of Vienna, Vienna, Austria
| | - Bo R Mussman
- Department of Radiology, Odense University Hospital, Odense, Denmark
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Jacob Nattestad
- Department of Radiology, Odense University Hospital, Odense, Denmark
| | - Magnus Wahlberg
- Sound Communication and Behaviour Group, Department of Biology, University of Southern Denmark, Odense, Denmark
| | - Xudong Zheng
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, NY, USA
| | - Qian Xue
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, NY, USA
| | - W Tecumseh Fitch
- Department of Behavioral and Cognitive Biology, Faculty of Life Sciences, University of Vienna, Vienna, Austria.
- Vienna Cognitive Science Hub, University of Vienna, Vienna, Austria.
| |
Collapse
|
2
|
Werner R, Fuchs S, Trouvain J, Kürbis S, Möbius B, Birkholz P. Acoustics of Breath Noises in Human Speech: Descriptive and Three-Dimensional Modeling Approaches. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-15. [PMID: 37971432 DOI: 10.1044/2023_jslhr-23-00112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2023]
Abstract
PURPOSE Breathing is ubiquitous in speech production, crucial for structuring speech, and a potential diagnostic indicator for respiratory diseases. However, the acoustic characteristics of speech breathing remain underresearched. This work aims to characterize the spectral properties of human inhalation noises in a large speaker sample and explore their potential similarities with speech sounds. Speech sounds are mostly realized with egressive airflow. To account for this, we investigated the effect of airflow direction (inhalation vs. exhalation) on acoustic properties of certain vocal tract (VT) configurations. METHOD To characterize human inhalation, we describe spectra of breath noises produced by human speakers from two data sets comprising 34 female and 100 male participants. To investigate the effect of airflow direction, three-dimensional-printed VT models of a male and a female speaker with static VT configurations of four vowels and four fricatives were used. An airstream was directed through these VT configurations in both directions, and their spectral consequences were analyzed. RESULTS For human inhalations, we found spectra with a decreasing slope and several weak peaks below 3 kHz. These peaks show moderate (female) to strong (male) overlap with resonances found for participants inhaling with a VT configuration of a central vowel. Results for the VT models suggest that airflow direction is crucial for spectral properties of sibilants, /ç/, and /i:/, but not the other sounds we investigated. Inhalation noise is most similar to /ə/ where airflow direction does not play a role. CONCLUSIONS Inhalation is realized on ingressive airflow, and inhalation noises have specific resonance properties that are most similar to /ə/ but occur without phonation. Airflow direction does not play a role in this specific VT configuration, but subglottal resonances may do. For future work, we suggest investigating the articulation of speech breathing and link it to current work on pause postures. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.24520585.
Collapse
Affiliation(s)
- Raphael Werner
- Department of Language Science and Technology, Saarland University, Saarbrücken, Germany
| | - Susanne Fuchs
- Leibniz-Centre General Linguistics (ZAS), Berlin, Germany
| | - Jürgen Trouvain
- Department of Language Science and Technology, Saarland University, Saarbrücken, Germany
| | - Steffen Kürbis
- Institute of Acoustics and Speech Communication, Technische Universität Dresden, Germany
| | - Bernd Möbius
- Department of Language Science and Technology, Saarland University, Saarbrücken, Germany
| | - Peter Birkholz
- Institute of Acoustics and Speech Communication, Technische Universität Dresden, Germany
| |
Collapse
|
3
|
Deshpande G, Schuller BW, Deshpande P, Joshi AR. Automatic Breathing Pattern Analysis from Reading-Speech Signals. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38083041 DOI: 10.1109/embc40787.2023.10340727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
As the speech production mechanism is related to the breathing process, speech signals and breathing patterns impact each other. Breathing patterns are the physiological signals which help in understanding the psychological, physiological and cognitive states of an individual. Capturing such patterns relies on the availability of equipment such as respiratory belts, which are costly and uncomfortable to wear for long duration. In this paper, we attempt to extract the breathing patterns from speech signals, which are easily available and can be recorded using a smartphone's microphone. In the presented work, simultaneous speech and breath signals are captured from 100 Indians of the age group 20 to 25 years while they read a phonetically balanced passage in English language. We have identified five distinct breathing templates; following two broad speech-breath categories, exhibited by the speakers while they read the same passage. For one of the two categories, the time domain features with regression network can extract the breathing patterns from speech with a Pearson correlation coefficient of 0.70. By computational modelling, we distinguish these two breathing categories from speech with a classification accuracy of 79%.
Collapse
|
4
|
Patel RR, Sandage MJ, Golzarri-Arroyo L. High-Speed Videoendoscopic and Acoustic Characteristics of Inspiratory Phonation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:1192-1207. [PMID: 36917802 DOI: 10.1044/2022_jslhr-22-00502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
PURPOSE Given the importance of inspiratory phonation for assessment of vocal fold structure, the aim of this investigation was to evaluate and describe the vocal fold vibratory characteristics of inspiratory phonation using high-speed videoendoscopy in healthy volunteers. The study also examined the empirical relationship between cepstral peak prominence (CPP) and glottal area waveform measurements derived from simultaneous high-speed videoendoscopy and audio recordings. METHOD Vocally healthy adults (33 women, 28 men) volunteered for this investigation and completed high-speed videoendoscopic assessment of vocal fold function for two trials of an expiratory/inspiratory phonation task at normal pitch and normal loudness. Twelve glottal area waveform measures and acoustic CPP values were extracted for analyses. RESULTS Inspiratory phonation resulted in shorter closing time, longer duration of the opening phase, and faster closing phase velocity compared to expiratory phonation. Sex differences were elucidated. CPP changes for inspiratory phonation were predicted by changes in the glottal area index and waveform symmetry index, whereas changes in CPP during expiratory phonation were predicted by changes in asymmetry quotient, glottal area index, and amplitude periodicity. CONCLUSIONS Vocal fold vibratory differences were identified for inspiratory phonation when compared to expiratory phonation, the latter of which has been studied more extensively. This investigation provides important basic inspiratory phonation data to better understand laryngeal physiology in vivo and provides a basic model from which to further study inspiratory phonation in a larger population representing a broader age range. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.22223812.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Mary J Sandage
- Department of Speech, Language & Hearing Sciences, Auburn University, AL
| | | |
Collapse
|
5
|
Hasegawa H, Nakagawa T, Noguchi K, Tokuda IT. Experimental Study on Inspiratory Phonation Using Physical Model of the Vocal Folds. J Voice 2022:S0892-1997(22)00023-6. [PMID: 35227554 DOI: 10.1016/j.jvoice.2022.01.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 01/25/2022] [Accepted: 01/26/2022] [Indexed: 11/21/2022]
Abstract
In inspiratory phonation, the air is inhaled from the mouth. The inhaled air passes through the glottis towards the lungs, thereby inducing the vocal fold vibrations. Such phonation takes place in various situations such as sighs, laughter, and crying. To characterize the inspiratory phonation, an experimental study was carried out using a physical model of the vocal folds. By reversing the direction of the airflow that passed through the vocal fold model, the inspiratory phonation was experimentally realized and compared with the normal expiratory phonation. Our experiments revealed that the phonation threshold pressures as well as the volume flow rates decreased under the inspiratory condition. Accordingly, the vocal efficiency was increased. The fundamental frequency was also increased under the inspiratory condition. The kymograms showed that phase of the upper edge of the vocal fold advanced that of the lower edge under the inspiratory phonation. A mathematical model of the vocal folds was further constructed to elucidate these experiments. Except for few aspects, our experimental findings are in good agreement with the preceding studies on inspiratory phonation (e.g., reversed propagation of the mucosal waves observed in a singer, increased pitches in human subjects, and use of inspiratory phonation in speech therapy).
Collapse
Affiliation(s)
- Hiroto Hasegawa
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Takumi Nakagawa
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Kohei Noguchi
- Department of Mechanical Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Isao T Tokuda
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan.
| |
Collapse
|
6
|
Anikin A, Reby D. Ingressive phonation conveys arousal in human nonverbal vocalizations. BIOACOUSTICS 2022. [DOI: 10.1080/09524622.2022.2039295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Affiliation(s)
- Andrey Anikin
- Division of Cognitive Science, Lund University, Lund, Sweden
- Enes Sensory Neuro-Ethology Lab, Crnl, Jean Monnet University of Saint Étienne, St-Étienne, France
| | - David Reby
- Enes Sensory Neuro-Ethology Lab, Crnl, Jean Monnet University of Saint Étienne, St-Étienne, France
| |
Collapse
|
7
|
Patel RR, Ternström S. Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2977-2995. [PMID: 34319772 DOI: 10.1044/2021_jslhr-20-00717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Purpose The purpose of this study is to identify the extent to which various measurements of contacting parameters differ between children and adults during habitual range and overlap vocal frequency/intensity, using voice map-based assessment of noninvasive electroglottography (EGG). Method EGG voice maps were analyzed from 26 adults (22-45 years) and 22 children (4-8 years) during connected speech and vowel /a/ over the habitual range and the overlap vocal frequency/intensity from the voice range profile task on the vowel /a/. Mean and standard deviations of contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were obtained. Group differences were evaluated using the linear mixed model analysis for the habitual range connected speech and the vowel, whereas analysis of covariance was conducted for the overlap vocal frequency/intensity from the voice range profile task. Presence of a "knee" on the EGG wave shape was determined by visual inspection of the presence of convexity along the decontacting slope of the EGG pulse and the presence of the second derivative zero-crossing. Results The contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were significantly different in children compared to (a) adult males for habitual range and (b) adult males and adult females for the overlap vocal frequency/intensity. None of the children had a "knee" on the decontacting slope of the EGG slope. Conclusion EGG parameters of contact quotient by integration, normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and absence of a "knee" on the decontacting slope characterize the wave shape differences between children and adults, whereas the normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and presence of a "knee" on the downward pulse slope characterize the wave shape differences between adult males and adult females. Supplemental Material https://doi.org/10.23641/asha.15057345.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Sten Ternström
- Division of Speech, Music, and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
8
|
Patel RR, Sandage MJ, Kluess H, Plexico LW. High-Speed Characterization of Vocal Fold Vibrations in Normally Cycling and Postmenopausal Women: Randomized Double-Blind Analyses. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1869-1888. [PMID: 33971105 PMCID: PMC8740695 DOI: 10.1044/2021_jslhr-20-00706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The aim of this study was to examine the influence of menstrual cycle phases (follicular, ovulatory, luteal, and ischemic) and hormone levels (estradiol, testosterone, progesterone, and neuropeptide Y) on vocal fold vibrations in reproductive and postmenopausal women. Method Glottal area waveforms were extracted from high-speed videoendoscopy during sustained phonation, inhalation phonation, and voice onset/offset in the reproductive (n = 15) and postmenopausal (n = 13) groups. Linear mixed-model analysis was conducted to evaluate hormone levels and high-speed videoendoscopy outcome variables between the reproductive and postmenopausal groups. In the reproductive group, simple linear regression and multiple regression were conducted to determine the effects of hormones on the dependent variables. Results Group differences between reproductive and postmenopausal women were identified for stiffness index, oscillatory onset time, and oscillatory offset time. Neuropeptide Y hormone in the ischemic phase significantly predicted changes in the reproductive group for some dependent variables; however, the relationship varied for sustained phonation and inhalation phonation. Conclusion These findings provide preliminary evidence that vocal fold vibrations in the reproductive group are different predominantly in the ischemic phase due to neuropeptide Y changes.
Collapse
Affiliation(s)
- Rita R. Patel
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington
| | - Mary J. Sandage
- Department of Speech, Language, and Hearing Sciences, Auburn University, AL
| | | | - Laura W. Plexico
- Department of Speech, Language, and Hearing Sciences, Auburn University, AL
| |
Collapse
|
9
|
Wermke K, Sereschk N, May V, Salinger V, Sanchez MR, Shehata-Dieler W, Wirbelauer J. The Vocalist in the Crib: the Flexibility of Respiratory Behaviour During Crying in Healthy Neonates. J Voice 2021; 35:94-103. [DOI: 10.1016/j.jvoice.2019.07.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 07/06/2019] [Accepted: 07/08/2019] [Indexed: 11/26/2022]
|
10
|
Fornhammar L, Sundberg J, Fuchs M, Pieper L. Measuring Voice Effects of Vibrato-Free and Ingressive Singing: A Study of Phonation Threshold Pressures. J Voice 2020; 36:479-486. [PMID: 33071148 DOI: 10.1016/j.jvoice.2020.07.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 07/17/2020] [Accepted: 07/21/2020] [Indexed: 11/28/2022]
Abstract
BACKGROUND Phonation threshold pressure (PTP), showing the lowest subglottal pressure producing vocal fold vibration, has been found useful for documenting various effects of phonatory conditions. The need for such documentation is relevant also to the teaching of singing, particularly in view of vocal demands raised in some contemporary as well as early music compositions. The aim of the present study was to test the usefulness of PTP measurement for evaluating phonatory effects of vibrato-free and ingressive singing in professional singers. METHODS PTP was measured at a middle, a high and a low pitch in two female and two male singers before and after recording voice range profiles (i) in habitual technique, ie, with vibrato, (ii) in vibrato-free, and (iii) in ingressive phonation. Effects on vocal fold status were examined by videolaryngostroboscopy. RESULTS After careful instruction of the singers, no problems were found in applying the PTP method. In some singers videolaryngostroboscopy showed effects after the experiment, eg, in terms of increased mucus and more complete glottal closure. After ingressive phonation PTP increased substantially at high pitch in one singer but changed marginally in the other singers. CONCLUSION The method seems useful for assessing and interpreting effects of singing in different styles and as a part of voice diagnostics. Therefore, it seems worthwhile to automatize PTP measurement.
Collapse
Affiliation(s)
- Lisa Fornhammar
- MuTri Doctoral School, University of the Arts Helsinki, Finland; Universität Mozarteum Salzburg, Austria; Hochschule für Musik und Theater Leipzig, Felix Mendelssohn Bartholdy, Germany.
| | - Johan Sundberg
- Department of Speech Music Hearing, KTH, Stockholm, Sweden; University College of Music Education Stockholm, Sweden
| | - Michael Fuchs
- Division of Phoniatrics and Audiology, University of Leipzig, Germany; Hochschule für Musik und Theater Leipzig, Felix Mendelssohn Bartholdy, Germany
| | - Lennart Pieper
- Division of Phoniatrics and Audiology, University of Leipzig, Germany
| |
Collapse
|
11
|
Electroglottography – An Update. J Voice 2020; 34:503-526. [DOI: 10.1016/j.jvoice.2018.12.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2018] [Revised: 12/27/2018] [Accepted: 12/28/2018] [Indexed: 11/21/2022]
|
12
|
Paolillo NP, Carrozza L, Osio M, Rosa E, Scalabrin M. Inspiratory Vocal Fry: Anatomical and Physiological Aspects, Application in Speech Therapy, Vocal Pedagogy and Singing. A Pilot study. J Voice 2019; 35:394-399. [PMID: 31708370 DOI: 10.1016/j.jvoice.2019.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Revised: 10/04/2019] [Accepted: 10/04/2019] [Indexed: 10/25/2022]
Abstract
OBJECTIVE Inspiratory Vocal Fry (IVF) is the voice production during inspiration of a sound with vocal fry perceptual characteristics. The existing scientific literature shows a lack of studies on it. The aim of the study is to highlight anatomical and physiological characteristics of IVF, to assess its effects on spoken and singing voice, to confirm the potential usefulness in speech therapy and vocal pedagogy. METHODS Thirty-two healthy subjects (17 male and 15 female) underwent videolaryngostroboscopy to assess the degree of false vocal folds adduction, pharyngeal wall contraction, and degree of vocal folds stretching in different types of phonation: expiratory and inspiratory phonation, Expiratory Vocal Fry (EVF) and IVF. All these parameters were evaluated by a group of three speech therapists and one phoniatrician not belonging to the research group. In addition, for each subject an electroglottography was performed for all the types of phonation previously mentioned, highlighting Contact Quotient (CQ) and Closing/Closed Quotient (CCQ). Three subjects underwent electromyography for a preliminary study of the muscle activation in IVF. RESULTS False vocal folds adduction (P value = 0.000005) and pharyngeal wall contraction (P value = 0.001155) were significantly reduced in IVF compared to EVF; on the contrary, vocal folds stretching was significantly higher in IVF (P value = 0.000031). Electroglottographic CQ was significantly higher in IVF compared to EVF (P value = 0.019592) and the other types of phonation. Similar results were obtained considering CCQ, as IVF values for this parameter was significantly higher compared to EVF (P value = 0.013062) and expiratory phonation (P value = 0.001324). As regards electromyography, medial thyroarytenoid (TA) motor units were more recruited in IVF, while lateral TA motor units were more recruited in EVF. According to our results, IVF is characterized by higher elastic tension due to a reduced hypertonic contraction of TA muscle and a higher contraction of cricothyroid muscle. Electroglottographic results showed a wider vibratory cycle with an improved massaging effect on vocal folds mucosa. electromyography preliminary analysis confirmed our findings. CONCLUSION IVF could be an excellent and useful exercise to reduce muscular hypertonic tension and to facilitate mucosal elasticity. It could be potentially applied in speech therapy approach to dysfunctional and organic dysphonias, post-surgical treatment, in pedagogy and practice of artistic voice.
Collapse
Affiliation(s)
| | | | - Maurizio Osio
- University of Milan, Italy; Department of Neurology, Ospedale Sacco of Milan
| | | | | |
Collapse
|
13
|
Selamtzis A, Ternström S, Richter B, Burk F, Köberlein M, Echternach M. A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3275. [PMID: 30599695 DOI: 10.1121/1.5066456] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 10/14/2018] [Indexed: 06/09/2023]
Abstract
This study compares the use of electroglottograms (EGGs) and glottal area waveforms (GAWs) to study phonation in different vibratory states as produced by professionally trained singers. Six western classical tenors were asked to phonate pitch glides from modal to falsetto phonation, or from modal to their stage voice above the passaggio (SVaP). For each pitch glide the sample entropy (SampEn) of the EGG signal was calculated to detect the occurrence of phonatory instabilities and establish a "ground truth" for the performed phonation type. The cycles before the maximum SampEn were labeled as modal, and the cycles after the peak were labeled as either falsetto, or SVaP. Three automatic categorizations of vibratory state were performed using clustering: one based only on the EGG, one based on the GAW, and one based on their combination. The error rate (clustering vs ground truth) was, on average, lower than 10% for all of the three settings, revealing no special advantage of the GAW over EGG, and vice versa. Modal voice cycles exhibited a larger contact quotient, larger normalized derivative peak ratio, and lower rise time, compared to SVaP and falsetto. The GAW-based normalized maximum area declination rate was larger in SVaP compared to modal voice.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Bernard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology, University Medical Center Schleswig-Holstein, Arnold-Heller-Straße 3, 24105 Kiel, Germany
| | - Marie Köberlein
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Matthias Echternach
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, Munich University, Campus Großhadern, Marchioninistraße 15, Munich, 81377, Germany
| |
Collapse
|
14
|
Wermke K, Haschemi AA, Hesse V, Robb MP. Inspiratory Phonation in Baby Voice. J Voice 2018; 32:185-191. [DOI: 10.1016/j.jvoice.2017.04.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 04/12/2017] [Indexed: 11/17/2022]
|
15
|
Moerman M, Vanhecke F, Van Assche L, Vercruysse J. Vocal Tract Morphology in Inhaling Singing: Characteristics During Vowel Production-A Case Study in a Professional Singer. J Voice 2017; 32:643.e17-643.e23. [PMID: 28886973 DOI: 10.1016/j.jvoice.2017.08.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 08/01/2017] [Accepted: 08/01/2017] [Indexed: 11/16/2022]
Abstract
BACKGROUND A professional singer produced various vowels on a comfortable loudness and pitch in an inspiratory and expiratory phonation manner. The present study investigates the morphological differences and tries to find a link with the acoustical characteristics. OBJECTIVES/HYPOTHESIS We hypothesize that features, constantly present over all vowels, characterize inhaling phonation and that the formant frequencies reflect the morphological findings. STUDY DESIGN A prospective case study was carried out. METHODS A female singer uttered the vowels /a/, /e/, /i/, /o/, and /u/ in a supine position under magnetic resonance imaging, on a comfortable loudness and pitch, in both inhaling and exhaling manner. The exact same parameters as in previous reports were measured (1-3). Acoustical analysis was performed with Praat. RESULTS Wilcoxon directional testing demonstrates a statistically significant difference in (1) the distance between the lips, (2) the antero-posterior tongue diameter, (3) the distance between the lips and the tip of the tongue, (4) the distance between the epiglottis and the posterior pharyngeal wall, (5) the narrowing of the subglottic space, and (6) the oropharyngeal and the hypopharyngeal areas. Acoustical analysis reveals slightly more noise and irregularity during reverse phonation. The central frequency of F0 and F1 is identical, whereas that of F2 and F3 increases, and that of F4 varies. CONCLUSIONS A smaller mouth opening, a narrowing of the subglottic space, a larger supralaryngeal inlet, and a smaller antero-posterior tongue diameter can be considered as morphological characteristics for reverse phonation. Acoustically, reverse phonation discretely contains more noise and perturbation. The formant frequency distribution concurs with a mouth narrowing and pharyngeal widening during inhaling.
Collapse
Affiliation(s)
- Mieke Moerman
- Department of ENT/Phoniatrics/Head and Neck Surgery, AZ Maria Middelares, Ghent, Belgium.
| | - Françoise Vanhecke
- Institute for Psychoacustics and Electronic Music (IPEM), Ghent University, Ghent, Belgium
| | | | | |
Collapse
|
16
|
Investigation of the Immediate Effects of Humming on Vocal Fold Vibration Irregularity Using Electroglottography and High-speed Laryngoscopy in Patients With Organic Voice Disorders. J Voice 2017; 31:48-56. [DOI: 10.1016/j.jvoice.2016.03.010] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 03/17/2016] [Indexed: 11/22/2022]
|
17
|
Vanhecke F, Lebacq J, Moerman M, Manfredi C, Raes GW, DeJonckere PH. Physiology and Acoustics of Inspiratory Phonation. J Voice 2015; 30:769.e9-769.e18. [PMID: 26706750 DOI: 10.1016/j.jvoice.2015.11.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2015] [Accepted: 11/10/2015] [Indexed: 10/22/2022]
Abstract
INTRODUCTION Inspiratory phonation (IP) means phonating with inspiratory airflow. Some vocalists remarkably master this technique, to such an extent that it offers new dramatic, aesthetic, and functional possibilities in singing specific contemporary music. The present study aims to a better understanding of the physiological backgrounds of IP. MATERIAL AND METHODS A total of 51 inhaling utterances were compared with 61 exhaling utterances in a professional soprano highly skilled in inhaling singing, by means of high-speed single-line scanning and advanced acoustic analysis. Ranges of intensity and Fo were kept similar. RESULTS The main differences are: (1) an inversion of the mucosal wave, (2) a smaller closed quotient in IP, (3) a larger opening/closing quotient in IP with the additional difference that in IP, the quotient is larger than 1 (opening slower than closing), whereas it is less than 1 in expiratory mode (opening faster than closing), (4) a larger vocal-fold excursion in IP, (5) higher values of adaptive normalized noise energy in IP, and (6) a steeper slope of harmonic peaks in IP. However, jitter values are similar (within normal range), as well as damping ratios and central formant frequencies. The two voicing modes cannot be differentiated by blind listening. CONCLUSION The basic physiological mechanisms are comparable in both voicing modes, although with specific differences. IP is actually to be considered as an "extended vocal technique," a term applied to vocalization in art music, which falls outside of traditional classical singing styles, but with remarkable possibilities in skilled vocalists.
Collapse
Affiliation(s)
- Françoise Vanhecke
- IPEM-Department of Musicology, School of Arts, University College, Ghent, Belgium
| | - Jean Lebacq
- Neurosciences Institute, University of Louvain B-1200 Brussels, Belgium
| | - Mieke Moerman
- AZ Maria Middelares, Kortrijksesteenweg 1026, 9000 Ghent, Belgium
| | - Claudia Manfredi
- Università degli Studi di Firenze, Department of Information Engineering, Via S. Marta 3, 50139 Firenze, Italy
| | | | - Philippe H DeJonckere
- IPEM-Department of Musicology, School of Arts, University College, Ghent, Belgium; Neurosciences, University of Leuven and Federal Institute of Occupational Diseases, Brussels, Belgium..
| |
Collapse
|
18
|
Moerman M, Vanhecke F, Van Assche L, Vercruysse J, Daemers K, Leman M. Vocal Tract Morphology in Inhaling Singing: An MRI-Based Study. J Voice 2015; 30:466-71. [PMID: 26122925 DOI: 10.1016/j.jvoice.2015.05.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2015] [Accepted: 05/06/2015] [Indexed: 11/30/2022]
Abstract
OBJECTIVES/HYPOTHESIS Inhaling singing is a recently developed singing technique explored by the soprano singer Françoise Vanhecke. It is based on an inspiratory airflow instead of an expiratory airflow. This article describes the anatomical structural differences of the vocal tract between inhaling and exhaling singing. We hypothesize that the vocal tract alters significantly in inhaling singing, especially concerning the configuration of the anatomical structures in the oral cavity and the subglottal region. STUDY DESIGN This is a prospective study. METHODS A professional singer (F.V.) performed sustained tones from F5 chromatically rising up to Bb5 on the vowel /a/. Vocal tract anatomy is assessed by magnetic resonance imaging. RESULTS Wilcoxon directional testing demonstrates (1) that the vocal tract volume above the glottal region does not differ statistically in contrast to the subglottal region and (2) significant changes in the configuration of the tongue, the upright position of the epiglottis, the length of the floor of mouth, and the distance between the teeth. CONCLUSIONS The narrowing of the subglottis is considered to be secondary to suction forces used in the inhaling singing technique. The changes in the anatomical structures above the vocal folds possibly suggest a valve-like function controlling the air inlet together with the regulator function of the resonator capacities of the vocal tract.
Collapse
Affiliation(s)
- Mieke Moerman
- Department for ENT/Phoniatrics and Head and Neck Surgery & Department of Radiology, AZ Maria Middelares, Ghent, Belgium.
| | | | - Lieven Van Assche
- Department for ENT/Phoniatrics and Head and Neck Surgery & Department of Radiology, AZ Maria Middelares, Ghent, Belgium
| | - Johan Vercruysse
- Department for ENT/Phoniatrics and Head and Neck Surgery & Department of Radiology, AZ Maria Middelares, Ghent, Belgium
| | - Kristin Daemers
- Department for ENT/Phoniatrics and Head and Neck Surgery & Department of Radiology, AZ Maria Middelares, Ghent, Belgium
| | - Marc Leman
- IPEM-Department of Musicology, Ghent University, Ghent, Belgium
| |
Collapse
|
19
|
Automatic detection of the expiratory and inspiratory phases in newborn cry signals. Biomed Signal Process Control 2015. [DOI: 10.1016/j.bspc.2015.03.007] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
20
|
Selamtzis A, Ternström S. Analysis of vibratory states in phonation using spectral features of the electroglottographic signal. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:2773-2783. [PMID: 25373977 DOI: 10.1121/1.4896466] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The vocal folds can oscillate in several different ways, manifest to practitioners and clinicians as "registers" or "mechanisms," of which the two most often considered are modal voice and falsetto voice. Here these will be taken as instances of different "vibratory states," i.e., distinct quasi-stationary patterns of vibration of the vocal folds. State transitions are common in biomechanical nonlinear oscillators, and they are often abrupt and impossible to predict exactly. Therefore, vibratory states are a source of confounding variation, for instance when acquiring a voice range profile (VRP). In the quest for a state-based, non-invasive VRP, a semi-automatic method based on the short-term spectrum of the electroglottographic (EGG) signal was developed. The method identifies rapid vibratory state transitions, such as the modal-falsetto switch, and clusters the EGG data based on their similarities in the relative levels and phases of the lower frequency components. Productions of known modal and falsetto voice were accurately clustered by a Gaussian mixture model. When mapped into the VRP, this EGG-based clustering revealed connected regions of different vibratory sub-regimes in both modal and falsetto.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden
| |
Collapse
|
21
|
Immediate Effects of Humming on Computed Electroglottographic Parameters in Patients With Muscle Tension Dysphonia. J Voice 2014; 28:733-41. [DOI: 10.1016/j.jvoice.2014.02.004] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 02/10/2014] [Indexed: 11/23/2022]
|
22
|
Statistical Analysis of the Reliability of Acoustic and Electroglottographic Perturbation Parameters for the Detection of Vocal Roughness. J Voice 2014; 28:263.e9-263.e16. [DOI: 10.1016/j.jvoice.2013.07.005] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2013] [Accepted: 07/15/2013] [Indexed: 11/22/2022]
|
23
|
The Effect of Gender on Measures of Electroglottographic Contact Quotient. J Voice 2013; 27:433-40. [DOI: 10.1016/j.jvoice.2013.03.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Accepted: 03/20/2013] [Indexed: 11/20/2022]
|
24
|
Baken R, Švec JG. Letter to the Editor. J Voice 2012; 26:676.e9-11; author reply 676.e13-9. [DOI: 10.1016/j.jvoice.2011.10.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2011] [Accepted: 10/05/2011] [Indexed: 11/16/2022]
|
25
|
Hosokawa K, Yoshida M, Yoshii T, Takenaka Y, Hashimoto M, Ogawa M, Inohara H. Effectiveness of the Computed Analysis of Electroglottographic Signals in Muscle Tension Dysphonia. Folia Phoniatr Logop 2012; 64:145-50. [PMID: 22965009 DOI: 10.1159/000342146] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- K Hosokawa
- Department of Otorhinolaryngology, Head and Neck Surgery, Osaka University Graduate School of Medicine, Toyonaka-Heisei Hospital, Osaka, Japan.
| | | | | | | | | | | | | |
Collapse
|
26
|
Aucouturier JJ, Nonaka Y, Katahira K, Okanoya K. Segmentation of expiratory and inspiratory sounds in baby cry audio recordings using hidden Markov models. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:2969-77. [PMID: 22087925 DOI: 10.1121/1.3641377] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
The paper describes an application of machine learning techniques to identify expiratory and inspiration phases from the audio recording of human baby cries. Crying episodes were recorded from 14 infants, spanning four vocalization contexts in their first 12 months of age; recordings from three individuals were annotated manually to identify expiratory and inspiratory sounds and used as training examples to segment automatically the recordings of the other 11 individuals. The proposed algorithm uses a hidden Markov model architecture, in which state likelihoods are estimated either with Gaussian mixture models or by converting the classification decisions of a support vector machine. The algorithm yields up to 95% classification precision (86% average), and its ability generalizes over different babies, different ages, and vocalization contexts. The technique offers an opportunity to quantify expiration duration, count the crying rate, and other time-related characteristics of baby crying for screening, diagnosis, and research purposes over large populations of infants.
Collapse
Affiliation(s)
- Jean-Julien Aucouturier
- Department of Computer and Information Science, Temple University, Japan Campus, 2-8-12 Minami Azabu, Minato-ku, Tokyo 106-0047, Japan.
| | | | | | | |
Collapse
|
27
|
Recurrent neural networks in computer-based clinical decision support for laryngopathies: an experimental study. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2011; 2011:289398. [PMID: 22007195 PMCID: PMC3189461 DOI: 10.1155/2011/289398] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2011] [Accepted: 07/12/2011] [Indexed: 12/03/2022]
Abstract
The main goal of this paper is to give the basis for creating a computer-based clinical decision support (CDS)
system for laryngopathies. One of approaches which can be used in the proposed CDS is based on the speech signal analysis using recurrent neural networks (RNNs). RNNs can be used for pattern recognition in time series data due to their ability of memorizing some information from the past. The Elman networks (ENs) are a classical representative of RNNs. To improve learning ability of ENs, we may modify and combine them with another kind of RNNs, namely, with the Jordan networks. The modified Elman-Jordan networks (EJNs) manifest a faster and more exact achievement of the target pattern. Validation experiments were carried out on speech signals of patients from the control group and with two kinds of laryngopathies.
Collapse
|
28
|
Interarticulator timing control during inspiratory phonation. J Voice 2010; 25:319-25. [PMID: 20346622 DOI: 10.1016/j.jvoice.2010.01.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2009] [Accepted: 01/05/2010] [Indexed: 11/24/2022]
Abstract
OBJECTIVE The ability of Cantonese speakers in producing aspirated and unaspirated stops and stops at different places of articulation using expiratory phonation (EP) and inspiratory phonation (IP) was compared. Interarticulator timing during stop production using EP and IP was examined. Voice onset time (VOT) associated with EP and IP stops was compared with stop identification scores by naïve listeners. SUBJECTS AND METHODS Aspirated and unaspirated voiceless stops (/p(h), t(h), k(h), p, t, k/) followed by the vowel /a/ were produced by 15 male and 15 female Cantonese speakers using EP and IP. VOT values were measured and isolated speech samples of stop productions were identified by 10 naïve listeners. Percent correct identification of stops was obtained from the 10 listeners. RESULTS Perceptual data showed that production of IP stops were associated with reduced accuracy in stop identification, with predominant errors in aspiration perception. Acoustic analysis showed that IP stops were generally produced with significantly shorter VOT than their EP counterparts. In addition, effect of place of articulation on VOT was also found for both IP and EP stops, notably with velar stops being associated with significantly longer VOT values than bilabial and alveolar stops. CONCLUSIONS The findings that IP stops were produced with shorter VOT as compared with EP stops imply that the articulatory-phonatory coordination during IP was not the same as that during EP, causing a discrepancy in the timing control between articulators.
Collapse
|
29
|
Winkler R, Sendlmeier W. EGG open quotient in aging voices—changes with increasing chronological age and its perception. LOGOP PHONIATR VOCO 2009; 31:51-6. [PMID: 16754276 DOI: 10.1080/14015430500445534] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
This paper presents the results of open quotient (OQ) measurements in electroglottographic (EGG) signals of young (18-30 years) and elderly (60-82 years) male and female speakers. The paper further presents quantitative results of the relation between the EGG OQ and the perception of a speaker's age. Higgins and Saxman found a decreased EGG OQ with increased age for females, while the EGG OQ increased for males as the speaker's age increased in sustained vowel material 1. Although laryngeal degeneration due to increased age seems to occur to a lesser extent in females, the significant decrease of the OQ in elderly female voices could not be explained in terms of age-related physiological changes. Linville found increased spectral amplitudes in the region of F0 for the elderly (obtained by long-term average spectra (LTAS) measurements of read speech material), independent of gender, which could be indirectly interpreted as an increasing OQ 3. We measured the EGG OQ, not only for sustained vowels but also in vowels taken from isolated words and read speech material. To analyse the relation between breathiness in terms of an increased EGG OQ and the mean perceived age per stimulus, a perception test was carried out, in which listeners were asked to estimate speaker's age based on sustained /a/-vowels varying in vocal effort (soft-normal-loud) during production. 1) The decreased EGG OQ for elderly females originally found by Higgins and Saxman 1 is not apparent in our data for sustained /a/-vowels; for males, however, we also found an increased EGG OQ for the elderly speakers. 2) In addition, an increased EGG OQ for the group of elderly in comparison to the younger males occurs for the unstressed syllable of the word material. 3) Our results show a strong positive relation between perceived age and EGG OQ in male vowel stimuli. Regarding 2), depending on the speech task at least a male speaker's voice gets more breathy as age increases. Considering 3), increased breathiness may contribute to the listener's perception of increased age.
Collapse
Affiliation(s)
- Ralf Winkler
- Department of Speech Communication and Phonetics, Technical University, Berlin, Germany.
| | | |
Collapse
|
30
|
Lowell SY, Barkmeier-Kraemer JM, Hoit JD, Story BH. Respiratory and laryngeal function during spontaneous speaking in teachers with voice disorders. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2008; 51:333-49. [PMID: 18367681 DOI: 10.1044/1092-4388(2008/025)] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
PURPOSE To determine if respiratory and laryngeal function during spontaneous speaking were different for teachers with voice disorders compared with teachers without voice problems. METHOD Eighteen teachers, 9 with and 9 without voice disorders, were included in this study. Respiratory function was measured with magnetometry, and laryngeal function was measured with electroglottography during 3 spontaneous speaking tasks: a simulated teaching task at a typical loudness level, a simulated teaching task at an increased loudness level, and a conversational speaking task. Electroglottography measures were also obtained for 3 structured speaking tasks: a paragraph reading task, a sustained vowel, and a maximum phonation time vowel. RESULTS Teachers with voice disorders started and ended their breath groups at significantly smaller lung volumes than teachers without voice problems during teaching-related speaking tasks; however, there were no between-group differences in laryngeal measures. Task-related differences were found on several respiratory measures and on one laryngeal measure. CONCLUSIONS These findings suggest that teachers with voice disorders used different speech breathing strategies than teachers without voice problems. Implications for clinical management of teachers with voice disorders are discussed.
Collapse
Affiliation(s)
- Soren Y Lowell
- National Institute of Neurological Disorders and Stroke, Laryngeal and Speech Section, Bethesda, MD 20892, USA.
| | | | | | | |
Collapse
|
31
|
Finger LS, Cielo CA. Aspectos fisiológicos e clínicos da técnica fonoterapêutica de fonação reversa. ACTA ACUST UNITED AC 2007. [DOI: 10.1590/s0034-72992007000200020] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A fonação reversa é a produção de voz durante a inspiração, realizada espontaneamente em situações como o suspiro. OBJETIVO: Realizar uma revisão da literatura, descrevendo achados relacionados à utilização da fonação reversa na prática clínica, à anatomofisiologia de sua produção e seus efeitos no trato vocal, e às indicações e contra-indicações da técnica para os distúrbios e para o aperfeiçoamento da voz. RESULTADOS: Foram encontrados relatos de mudanças significativas no trato vocal durante a produção da fonação reversa, como o relaxamento dos ventrículos, o afastamento das pregas ventriculares, o aumento da freqüência fundamental, o movimento inverso da onda mucosa, além da facilitação do estudo dinâmico da laringe, quando associada à endoscopia, o que possibilita melhor definição da localização das lesões nas camadas da lâmina própria da mucosa das pregas vocais. CONCLUSÃO: Existem poucos estudos que descrevem o comportamento laríngeo durante a fonação reversa e, para que essa técnica utilizada de forma mais precisa e objetiva, acredita-se que ainda devem ser realizados estudos que visem a comprovar sua eficácia na prática clínica.
Collapse
|
32
|
Finger LS, Cielo CA. Reverse phonation -physiologic and clinical aspects of this speech voice therapy modality. Braz J Otorhinolaryngol 2007; 73:271-7. [PMID: 17589738 PMCID: PMC9450581 DOI: 10.1016/s1808-8694(15)31077-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2006] [Accepted: 07/20/2006] [Indexed: 11/24/2022] Open
Abstract
UNLABELLED Reverse phonation is the voice production during inspiration, accomplished spontaneously in situations such as when a person sighs. AIM to do a literature review, describing discoveries related to the use of the reverse phonation in the clinical practice, the anatomy and physiology of its production and its effects in vocal treatments; and moreover, indications and problems of the technique for speech disorders treatment and voice enhancement. RESULTS there were reports of significant changes in vocal treatment during with the use of reverse phonation: ventricular distention, ventricular folds separation, increase in the fundamental frequency, mucous wave inverse movement; and it also facilitates the dynamic study of the larynx when associated with endoscopy, making it possible to have a better definition of lesion localization in vocal folds superficial lamina propria layers. CONCLUSION There are few studies describing larynx behavior during reverse phonation and, for this technique to be used in a more precise and objective way, more studies are necessary in order to prove its effectiveness in practical matters.
Collapse
|
33
|
Higgins MB, Schulte L. Gender differences in vocal fold contact computed from electroglottographic signals: the influence of measurement criteria. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1865-1871. [PMID: 12002870 DOI: 10.1121/1.1456517] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
EGGW is a phonatory parameter that can be derived from electroglottographic (EGG) signals and used to infer the relative degree of vocal fold contact. Vocal fold models predict that men will exhibit medial bulging of their vocal folds during phonation but women will not. These models lead us to expect gender differences in the magnitude of EGGW. Nevertheless, significant gender differences in EGGW for adults with normal voices have not been documented in previous studies when EGGW was computed from criterion lines placed at 25%-40% of the amplitude of the uninverted EGG wave form. We hypothesized that EGGW would better reflect gender differences in vocal fold adductory patterns if EGGW was computed from portions of the wave form that were associated with more vocal fold contact. EGGW was measured for seven men and seven women with normal voices. When EGGW was computed from segments of the wave form that were associated with relatively greater vocal fold contact (i.e., using criterion levels of > or = 55%), findings were consistent with the gender-specific adductory patterns that have been proposed from vocal fold models. Guidelines for appropriate placement of criterion lines when computing EGGW are discussed.
Collapse
|
34
|
Robb MP, Chen Y, Gilbert HR, Lerman JW. Acoustic comparison of vowel articulation in normal and reverse phonation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2001; 44:118-127. [PMID: 11218096 DOI: 10.1044/1092-4388(2001/011)] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Acoustic characteristics of the vowels /i,u,a/ produced by adult females and males during normal phonation were compared with the same vowels produced on deliberate ingressive airflow (i.e., "reverse" phonation). Results of the analysis revealed the average fundamental frequency (F0) of reverse phonation to be significantly higher than the corresponding normal phonations. There were no significant differences noted in the vocal tract resonance (F1 and F2 frequency) values for /i/ during normal and reverse phonation. However, the F1 values for /a/ were significantly lower, and the F2 values for /u/ significantly higher, during reverse phonation. The results are discussed with regard to differences in the articulatory control of the speech mechanism during reverse phonation as compared to normal expiratory phonation. Also discussed are the implications of using reverse phonation as a voice management technique.
Collapse
Affiliation(s)
- M P Robb
- Department of Communication Sciences, University of Connecticut, Storrs, USA.
| | | | | | | |
Collapse
|
35
|
Abstract
Inspiratory phonation (IP) is the production of voice as air is taken into the lungs. Although IP is promoted as a laryngeal assessment and voice treatment technique, it has been described quantitatively in very few speakers. This study quantified changes in laryngeal adduction, fundamental frequency, and intensity during IP relative to expiratory phonation (EP). We hypothesized that IP would increase laryngeal abduction and fundamental frequency. The experiment was a within-subjects, repeated measures design with each subject serving as her own control. Participants were 10 females (ages 19-50 years) who underwent simultaneous transoral videostrobolaryngoscopy and acoustic voice recording. We found that membranous vocal fold contact decreased significantly during IP relative to EP, while the trends for change of ventricular fold squeeze during IP varied across individuals. Vocal fundamental frequency increased significantly during IP relative to EP, but intensity did not vary consistently across conditions. Without teaching or coaching, changes that occurred during IP did not carry over to EP produced immediately following IP within the same respiratory cycle.
Collapse
Affiliation(s)
- C L Kelly
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois, USA
| | | |
Collapse
|