1
|
Iob NA, He L, Ternström S, Cai H, Brockmann-Bauser M. Effects of Speech Characteristics on Electroglottographic and Instrumental Acoustic Voice Analysis Metrics in Women With Structural Dysphonia Before and After Treatment. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:1660-1681. [PMID: 38758676 DOI: 10.1044/2024_jslhr-23-00253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2024]
Abstract
PURPOSE Literature suggests a dependency of the acoustic metrics, smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR), on human voice loudness and fundamental frequency (F0). Even though this has been explained with different oscillatory patterns of the vocal folds, so far, it has not been specifically investigated. In the present work, the influence of three elicitation levels, calibrated sound pressure level (SPL), F0 and vowel on the electroglottographic (EGG) and time-differentiated EGG (dEGG) metrics hybrid open quotient (OQ), dEGG OQ and peak dEGG, as well as on the acoustic metrics CPPS and HNR, was examined, and their suitability for voice assessment was evaluated. METHOD In a retrospective study, 29 women with a mean age of 25 years (± 8.9, range: 18-53) diagnosed with structural vocal fold pathologies were examined before and after voice therapy or phonosurgery. Both acoustic and EGG signals were recorded simultaneously during the phonation of the sustained vowels /ɑ/, /i/, and /u/ at three elicited levels of loudness (soft/comfortable/loud) and unconstrained F0 conditions. RESULTS A linear mixed-model analysis showed a significant effect of elicitation effort levels on peak dEGG, HNR, and CPPS (all p < .01). Calibrated SPL significantly influenced HNR and CPPS (both p < .01). Furthermore, F0 had a significant effect on peak dEGG and CPPS (p < .0001). All metrics showed significant changes with regard to vowel (all p < .05). However, the treatment had no effect on the examined metrics, regardless of the treatment type (surgery vs. voice therapy). CONCLUSIONS The value of the investigated metrics for voice assessment purposes when sampled without sufficient control of SPL and F0 is limited, in that they are significantly influenced by the phonatory context, be it speech or elicited sustained vowels. Future studies should explore the diagnostic value of new data collation approaches such as voice mapping, which take SPL and F0 effects into account.
Collapse
Affiliation(s)
- Naomi Anna Iob
- Division of Phoniatrics and Speech Pathology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Switzerland
| | - Lei He
- Division of Phoniatrics and Speech Pathology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Switzerland
- Department of Computational Linguistics, University of Zurich, Switzerland
| | - Sten Ternström
- Division of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Huanchen Cai
- Division of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Meike Brockmann-Bauser
- Division of Phoniatrics and Speech Pathology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Switzerland
| |
Collapse
|
2
|
Codino J, Jackson-Menaldi MC, Rubin A, Torres ME. Automated Quantification of Inflection Events in The Electroglottographic Signal. J Voice 2023; 37:640-647. [PMID: 34162494 DOI: 10.1016/j.jvoice.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 05/01/2021] [Accepted: 05/10/2021] [Indexed: 11/26/2022]
Affiliation(s)
- Juliana Codino
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose and Throat Center, MI, USA
| | - María Cristina Jackson-Menaldi
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - Adam Rubin
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - María Eugenia Torres
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| |
Collapse
|
3
|
Donati E, Chousidis C, Ribeiro HDM, Russo N. Classification of Speaking and Singing Voices Using Bioimpedance Measurements and Deep Learning. J Voice 2023:S0892-1997(23)00120-0. [PMID: 37156686 DOI: 10.1016/j.jvoice.2023.03.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/29/2023] [Accepted: 03/29/2023] [Indexed: 05/10/2023]
Abstract
The acts of speaking and singing are different phenomena displaying distinct characteristics. The classification and distinction of these voice acts is vastly approached utilizing voice audio recordings and microphones. The use of audio recordings, however, can become challenging and computationally expensive due to the complexity of the voice signal. The research presented in this paper seeks to address this issue by implementing a deep learning classifier of speaking and singing voices based on bioimpedance measurement in replacement of audio recordings. In addition, the proposed research aims to develop a real-time voice act classification for the integration with voice-to-MIDI conversion. For such purposes, a system was designed, implemented, and tested using electroglottographic signals, Mel Frequency Cepstral Coefficients, and a deep neural network. The lack of datasets for the training of the model was tackled by creating a dedicated dataset 7200 bioimpedance measurement of both singing and speaking. The use of bioimpedance measurements allows to deliver high classification accuracy whilst keeping low computational needs for both preprocessing and classification. These characteristics, in turn, allows a fast deployment of the system for near-real-time applications. After the training, the system was broadly tested achieving a testing accuracy of 92% to 94%.
Collapse
Affiliation(s)
- Eugenio Donati
- School of Computing and Engineering, University of West London, London, UK.
| | - Christos Chousidis
- Department of Music and Media, Institute of Sound Recording, University of Surrey, Guildford, UK
| | | | - Nicola Russo
- School of Computing and Engineering, University of West London, London, UK
| |
Collapse
|
4
|
Angelakis E, Kotsani N, Georgaki A. Towards a Singing Voice Multi-Sensor Analysis Tool: System Design, and Assessment Based on Vocal Breathiness. SENSORS 2021; 21:s21238006. [PMID: 34884019 PMCID: PMC8659512 DOI: 10.3390/s21238006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 11/14/2021] [Accepted: 11/19/2021] [Indexed: 11/16/2022]
Abstract
Singing voice is a human quality that requires the precise coordination of numerous kinetic functions and results in a perceptually variable auditory outcome. The use of multi-sensor systems can facilitate the study of correlations between the vocal mechanism kinetic functions and the voice output. This is directly relevant to vocal education, rehabilitation, and prevention of vocal health issues in educators; professionals; and students of singing, music, and acting. In this work, we present the initial design of a modular multi-sensor system for singing voice analysis, and describe its first assessment experiment on the ‘vocal breathiness’ qualitative characteristic. A system case study with two professional singers was conducted, utilizing signals from four sensors. Participants sung a protocol of vocal trials in various degrees of intended vocal breathiness. Their (i) vocal output, (ii) phonatory function, and (iii) respiratory behavior-per-condition were recorded through a condenser microphone (CM), an Electroglottograph (EGG), and thoracic and abdominal respiratory effort transducers (RET), respectively. Participants’ individual respiratory management strategies were studied through qualitative analysis of RET data. Microphone audio samples breathiness degree was rated perceptually, and correlation analysis was performed between sample ratings and parameters extracted from CM and EGG data. Smoothed Cepstral Peak Prominence (CPPS) and vocal folds’ Open Quotient (OQ), as computed with the Howard method (HOQ), demonstrated the higher correlation coefficients, when analyzed individually. DECOM method-computed OQ (DOQ) was also examined. Interestingly, the correlation coefficient of pitch difference between estimates from CM and EGG signals appeared to be (based on the Pearson correlation coefficient) statistically insignificant (a result that warrants investigation in larger populations). The study of multi-variate models revealed even higher correlation coefficients. Models studied were the Acoustic Breathiness Index (ABI) and the proposed multiple regression model CDH (CPPS, DOQ, and HOQ), which was attempted in order to combine analysis results from microphone and EGG signals. The model combination of ABI and the proposed CDH appeared to yield the highest correlation with perceptual breathiness ratings. Study results suggest potential for the use of a completed system version in vocal pedagogy and research, as the case study indicated system practicality, a number of pertinent correlations, and introduced topics with further research possibilities.
Collapse
|
5
|
Patel RR, Ternström S. Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2977-2995. [PMID: 34319772 DOI: 10.1044/2021_jslhr-20-00717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Purpose The purpose of this study is to identify the extent to which various measurements of contacting parameters differ between children and adults during habitual range and overlap vocal frequency/intensity, using voice map-based assessment of noninvasive electroglottography (EGG). Method EGG voice maps were analyzed from 26 adults (22-45 years) and 22 children (4-8 years) during connected speech and vowel /a/ over the habitual range and the overlap vocal frequency/intensity from the voice range profile task on the vowel /a/. Mean and standard deviations of contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were obtained. Group differences were evaluated using the linear mixed model analysis for the habitual range connected speech and the vowel, whereas analysis of covariance was conducted for the overlap vocal frequency/intensity from the voice range profile task. Presence of a "knee" on the EGG wave shape was determined by visual inspection of the presence of convexity along the decontacting slope of the EGG pulse and the presence of the second derivative zero-crossing. Results The contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were significantly different in children compared to (a) adult males for habitual range and (b) adult males and adult females for the overlap vocal frequency/intensity. None of the children had a "knee" on the decontacting slope of the EGG slope. Conclusion EGG parameters of contact quotient by integration, normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and absence of a "knee" on the decontacting slope characterize the wave shape differences between children and adults, whereas the normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and presence of a "knee" on the downward pulse slope characterize the wave shape differences between adult males and adult females. Supplemental Material https://doi.org/10.23641/asha.15057345.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Sten Ternström
- Division of Speech, Music, and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
6
|
Lã FM, Ternström S. Flow ball-assisted voice training: Immediate effects on vocal fold contacting. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2020.102064] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
7
|
Electroglottography – An Update. J Voice 2020; 34:503-526. [DOI: 10.1016/j.jvoice.2018.12.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2018] [Revised: 12/27/2018] [Accepted: 12/28/2018] [Indexed: 11/21/2022]
|
8
|
Ternström S. Normalized time-domain parameters for electroglottographic waveforms. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:EL65. [PMID: 31370590 DOI: 10.1121/1.5117174] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 06/28/2019] [Indexed: 06/10/2023]
Abstract
The electroglottographic waveform is of interest for characterizing phonation non-invasively. Existing parameterizations tend to give disparate results because they rely on somewhat arbitrary thresholds and/or contacting events. It is shown that neither are needed for formulating a normalized contact quotient and a normalized peak derivative. A heuristic combination of the two resolves also the ambiguity of a moderate contact quotient, with regard to vocal fold contacting being firm versus weak or absent. As preliminaries, schemes for electroglottography signal preconditioning and time-domain period detection are described that improve somewhat on similar methods. The algorithms are simple and compute quickly.
Collapse
Affiliation(s)
- Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm,
| |
Collapse
|
9
|
Selamtzis A, Ternström S, Richter B, Burk F, Köberlein M, Echternach M. A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3275. [PMID: 30599695 DOI: 10.1121/1.5066456] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 10/14/2018] [Indexed: 06/09/2023]
Abstract
This study compares the use of electroglottograms (EGGs) and glottal area waveforms (GAWs) to study phonation in different vibratory states as produced by professionally trained singers. Six western classical tenors were asked to phonate pitch glides from modal to falsetto phonation, or from modal to their stage voice above the passaggio (SVaP). For each pitch glide the sample entropy (SampEn) of the EGG signal was calculated to detect the occurrence of phonatory instabilities and establish a "ground truth" for the performed phonation type. The cycles before the maximum SampEn were labeled as modal, and the cycles after the peak were labeled as either falsetto, or SVaP. Three automatic categorizations of vibratory state were performed using clustering: one based only on the EGG, one based on the GAW, and one based on their combination. The error rate (clustering vs ground truth) was, on average, lower than 10% for all of the three settings, revealing no special advantage of the GAW over EGG, and vice versa. Modal voice cycles exhibited a larger contact quotient, larger normalized derivative peak ratio, and lower rise time, compared to SVaP and falsetto. The GAW-based normalized maximum area declination rate was larger in SVaP compared to modal voice.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Bernard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology, University Medical Center Schleswig-Holstein, Arnold-Heller-Straße 3, 24105 Kiel, Germany
| | - Marie Köberlein
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Matthias Echternach
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, Munich University, Campus Großhadern, Marchioninistraße 15, Munich, 81377, Germany
| |
Collapse
|
10
|
Ternström S, D'Amario S, Selamtzis A. Effects of the Lung Volume on the Electroglottographic Waveform in Trained Female Singers. J Voice 2018; 34:485.e1-485.e21. [PMID: 30337119 DOI: 10.1016/j.jvoice.2018.09.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 09/04/2018] [Accepted: 09/06/2018] [Indexed: 11/25/2022]
Abstract
OBJECTIVES To determine if in singing there is an effect of lung volume on the electroglottographic waveform, and if so, how it varies over the voice range. STUDY DESIGN Eight trained female singers sang the tune "Frère Jacques" in 18 conditions: three phonetic contexts, three dynamic levels, and high or low lung volume. Conditions were randomized and replicated. METHODS The audio and EGG signals were recorded in synchrony with signals tracking respiration and vertical larynx position. The first 10 Fourier descriptors of every EGG cycle were computed. These spectral data were clustered statistically, and the clusters were mapped by color into a voice range profile display, thus visualizing the EGG waveform changes under the influence of fo and SPL. The rank correlations and effect sizes of the relationships between relative lung volume and several adduction-related EGG wave shape metrics were similarly rendered on a color scale, in voice range profile-style 'voice maps.' RESULTS In most subjects, EGG waveforms varied considerably over the voice range. Within subjects, reproducibility was high, not only across the replications, but also across the phonetic contexts. The EGG waveforms were quite individual, as was the nature of the EGG shape variation across the range. EGG metrics were significantly correlated to changes in lung volume, in parts of the range of the song, and in most subjects. However, the effect sizes of the relative lung volume were generally much smaller than the effects of fo and SPL, and the relationships always varied, even changing polarity from one part of the range to another. CONCLUSIONS Most subjects exhibited small, reproducible effects of the relative lung volume on the EGG waveform. Some hypothesized influences of tracheal pull were seen, mostly at the lowest SPLs. The effects were however highly variable, both across the moderately wide fo-SPL range and across subjects. Different singers may be applying different techniques and compensatory behaviors with changing lung volume. The outcomes emphasize the importance of making observations over a substantial part of the voice range, and not only of phonations sustained at a few fundamental frequencies and sound levels.
Collapse
Affiliation(s)
- Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden.
| | - Sara D'Amario
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden; Audio Lab, Department of Electronic Engineering, University of York, Heslington, United Kingdom
| | - Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
11
|
Smith SL, Titze IR. Vocal fold contact patterns based on normal modes of vibration. J Biomech 2018; 73:177-184. [PMID: 29680310 DOI: 10.1016/j.jbiomech.2018.04.011] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Revised: 03/27/2018] [Accepted: 04/01/2018] [Indexed: 11/17/2022]
Abstract
The fluid-structure interaction and energy transfer from respiratory airflow to self-sustained vocal fold oscillation continues to be a topic of interest in vocal fold research. Vocal fold vibration is driven by pressures on the vocal fold surface, which are determined by the shape of the glottis and the contact between vocal folds. Characterization of three-dimensional glottal shapes and contact patterns can lead to increased understanding of normal and abnormal physiology of the voice, as well as to development of improved vocal fold models, but a large inventory of shapes has not been directly studied previously. This study aimed to take an initial step toward characterizing vocal fold contact patterns systematically. Vocal fold motion and contact was modeled based on normal mode vibration, as it has been shown that vocal fold vibration can be almost entirely described by only the few lowest order vibrational modes. Symmetric and asymmetric combinations of the four lowest normal modes of vibration were superimposed on left and right vocal fold medial surfaces, for each of three prephonatory glottal configurations, according to a surface wave approach. Contact patterns were generated from the interaction of modal shapes at 16 normalized phases during the vibratory cycle. Eight major contact patterns were identified and characterized by the shape of the flow channel, with the following descriptors assigned: convergent, divergent, convergent-divergent, uniform, split, merged, island, and multichannel. Each of the contact patterns and its variation are described, and future work and applications are discussed.
Collapse
Affiliation(s)
- Simeon L Smith
- The National Center for Voice and Speech, The University of Utah, 136 South Main Street, Suite 320, Salt Lake City, UT 84101, United States.
| | - Ingo R Titze
- The National Center for Voice and Speech, The University of Utah, 136 South Main Street, Suite 320, Salt Lake City, UT 84101, United States
| |
Collapse
|
12
|
Krasnodębska P, Szkiełkowska A, Miaśkiewicz B, Włodarczyk E, Domeracka-Kołodziej A, Skarżyński H. Objective measurement of mucosal wave parameters in diagnosing benign lesions of the vocal folds. LOGOP PHONIATR VOCO 2018; 44:73-78. [PMID: 29318925 DOI: 10.1080/14015439.2017.1402950] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
INTRODUCTION The diagnostic procedure of phonation is dominated by subjective assessment tools. It seems reasonable to seek methods of quantitative glottal cycle assessment. OBJECTIVE The aim of our study was the analysis of open quotients (OQ) of the glottis. METHODS One hundred and twenty-four people were included in the study. Methodology was based on tools available in everyday phoniatrics practice - laryngovideostroboscopy (LVS) and electroglottography (EGG). There were statistically significant differences between control and studied group. Vocal fold polyps, nodules and edema influence glottal function in a different manner, what can be illustrated by objective glottal function parameters. Establishing Videostroboscopic Open Quotient values from three parts of glottis and Electroglottographic Quasi Open Quotient (QOQ) value, can help in dividing patients with benign lesions of vocal folds according to the type of disease. RESULTS AND CONCLUSIONS Measurement of the open quotient from three parts of the glottis helps to differentially diagnose and localize glottal vocal fold lesions. Videostroboscopic Open Quotient and Electroglottographic QOQ values can be used to quantify the glottal cycle. Videostroboscopic Open Quotient, Electroglottographic QOQ and their ratio varies depending on the type of organic dysphonia.
Collapse
Affiliation(s)
- Paulina Krasnodębska
- a Audiology and Phoniatrics Clinic , Institute of Physiology and Pathology of Hearing , Warsaw , Poland
| | - Agata Szkiełkowska
- a Audiology and Phoniatrics Clinic , Institute of Physiology and Pathology of Hearing , Warsaw , Poland.,b Audiology and Phoniatrics Faculty , Fryderyk Chopin University of Music , Warsaw , Poland
| | - Beata Miaśkiewicz
- a Audiology and Phoniatrics Clinic , Institute of Physiology and Pathology of Hearing , Warsaw , Poland
| | - Elżbieta Włodarczyk
- a Audiology and Phoniatrics Clinic , Institute of Physiology and Pathology of Hearing , Warsaw , Poland
| | - Anna Domeracka-Kołodziej
- a Audiology and Phoniatrics Clinic , Institute of Physiology and Pathology of Hearing , Warsaw , Poland
| | - Henryk Skarżyński
- a Audiology and Phoniatrics Clinic , Institute of Physiology and Pathology of Hearing , Warsaw , Poland.,b Audiology and Phoniatrics Faculty , Fryderyk Chopin University of Music , Warsaw , Poland
| |
Collapse
|
13
|
HERBST CHRISTIANT, DUNN JACOBC. Non-invasive documentation of primate voice production using electroglottography. ANTHROPOL SCI 2018. [DOI: 10.1537/ase.180201] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
| | - JACOB C. DUNN
- Department of Animal and Environmental Biology, Faculty of Science & Technology, Anglia Ruskin University, Cambridge
- Division of Biological Anthropology, University of Cambridge, Cambridge
| |
Collapse
|
14
|
Ong Tan KG. Contact Quotient of Female Singers Singing Four Pitches for Five Vowels in Normal and Pressed Phonations. J Voice 2017; 31:645.e15-645.e22. [DOI: 10.1016/j.jvoice.2017.02.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Accepted: 02/22/2017] [Indexed: 11/15/2022]
|
15
|
Macerata A, Nacci A, Manti M, Cianchetti M, Matteucci J, Romeo SO, Fattori B, Berrettini S, Laschi C, Ursino F. Evaluation of the Electroglottographic signal variability by amplitude-speed combined analysis. Biomed Signal Process Control 2017. [DOI: 10.1016/j.bspc.2016.10.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
16
|
Comparison of parametrization methods of electroglottographic and inverse filtered acoustic speech pressure signals in distinguishing between phonation types. Biomed Signal Process Control 2017. [DOI: 10.1016/j.bspc.2017.04.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
17
|
Selamtzis A, Ternström S. Investigation of the relationship between electroglottogram waveform, fundamental frequency, and sound pressure level using clustering. J Voice 2017; 31:393-400. [DOI: 10.1016/j.jvoice.2016.11.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2016] [Revised: 11/01/2016] [Accepted: 11/02/2016] [Indexed: 10/20/2022]
|
18
|
Herbst CT, Schutte HK, Bowling DL, Svec JG. Comparing Chalk With Cheese—The EGG Contact Quotient Is Only a Limited Surrogate of the Closed Quotient. J Voice 2017; 31:401-409. [DOI: 10.1016/j.jvoice.2016.11.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 11/06/2016] [Accepted: 11/08/2016] [Indexed: 10/20/2022]
|
19
|
Mayr A. Investigating the Voce Faringea: Physiological and Acoustic Characteristics of the Bel Canto Tenor's Forgotten Singing Practice. J Voice 2017; 31:255.e13-255.e23. [PMID: 27430862 DOI: 10.1016/j.jvoice.2016.06.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Revised: 06/17/2016] [Accepted: 06/21/2016] [Indexed: 10/21/2022]
Abstract
Several historical sources from the first half of the 19th century mention a distinct third register mechanism particular to tenor voices of that period. This so-called voce faringea-often described as an "intermediate" register-is a virtually forgotten historical singing practice used to extend the upper range of the voice, where the singer modifies falsetto, typically a weak and often feminine sound, into a more powerful, tenor-like vocal quality. Based on an evaluation of historical voice register theories, training strategies, and the sound ideals of the historical period, an informed discussion of that technique is developed. For this study, acoustic and electroglottographic signals for tones produced on the vowel /a/ by a professional tenor/countertenor in different vocal register mechanisms-voce faringea, falsetto, chest register, and mezza voce-were recorded using the VoceVista system. Analysis of the electroglottography (EGG) and audio data revealed specific characteristics of the voce faringea with regard to both the laryngeal mechanism and the sound spectrum, including high EGG contact quotient and low speed quotient values. EGG pulses were skewed significantly to the left and displayed a distinct knee shape during the de-contacting phase of the vocal folds, which consequently indicates a vibration with a clear mucosal wave. The long-term average spectrum and power spectrum exposed a considerable amplification and dislocation of F2 in the direction of high frequencies, thus boosting the third harmonic and showing a strong concentration of acoustic energy in the area of the singer's formant cluster.
Collapse
Affiliation(s)
- Alexander Mayr
- University for Music and Performing Arts, Vienna, Austria.
| |
Collapse
|
20
|
Automated Electroglottographic Inflection Events Detection. A Pilot Study. J Voice 2016; 30:768.e1-768.e10. [DOI: 10.1016/j.jvoice.2015.10.020] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Accepted: 10/29/2015] [Indexed: 11/20/2022]
|
21
|
Hampala V, Garcia M, Švec JG, Scherer RC, Herbst CT. Relationship Between the Electroglottographic Signal and Vocal Fold Contact Area. J Voice 2016; 30:161-71. [DOI: 10.1016/j.jvoice.2015.03.018] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2015] [Accepted: 03/30/2015] [Indexed: 11/24/2022]
|
22
|
Awan SN, Krauss AR, Herbst CT. An Examination of the Relationship Between Electroglottographic Contact Quotient, Electroglottographic Decontacting Phase Profile, and Acoustical Spectral Moments. J Voice 2015; 29:519-29. [DOI: 10.1016/j.jvoice.2014.10.016] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Accepted: 10/23/2014] [Indexed: 10/23/2022]
|
23
|
Selamtzis A, Ternström S. Analysis of vibratory states in phonation using spectral features of the electroglottographic signal. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:2773-2783. [PMID: 25373977 DOI: 10.1121/1.4896466] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The vocal folds can oscillate in several different ways, manifest to practitioners and clinicians as "registers" or "mechanisms," of which the two most often considered are modal voice and falsetto voice. Here these will be taken as instances of different "vibratory states," i.e., distinct quasi-stationary patterns of vibration of the vocal folds. State transitions are common in biomechanical nonlinear oscillators, and they are often abrupt and impossible to predict exactly. Therefore, vibratory states are a source of confounding variation, for instance when acquiring a voice range profile (VRP). In the quest for a state-based, non-invasive VRP, a semi-automatic method based on the short-term spectrum of the electroglottographic (EGG) signal was developed. The method identifies rapid vibratory state transitions, such as the modal-falsetto switch, and clusters the EGG data based on their similarities in the relative levels and phases of the lower frequency components. Productions of known modal and falsetto voice were accurately clustered by a Gaussian mixture model. When mapped into the VRP, this EGG-based clustering revealed connected regions of different vibratory sub-regimes in both modal and falsetto.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden
| |
Collapse
|
24
|
Kuang J, Keating P. Vocal fold vibratory patterns in tense versus lax phonation contrasts. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:2784-2797. [PMID: 25373978 DOI: 10.1121/1.4896462] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
This study explores the vocal fold contact patterns of one type of phonation contrast--the tense vs lax phonation contrasts of three Yi (Loloish) languages. These contrasts are interesting because neither phonation category is very different from modal voice, and because both phonations are largely independent of the languages' tonal contrasts. Electroglottographic (EGG) recordings were made in the field, and traditional EGG measures were derived. These showed many small but significant differences between the phonations, with tense phonation having greater contact quotients and briefer but slower changes in contact. Functional data analysis was then applied to entire EGG pulse shapes. The resulting first principal component was found to be mostly strongly related to the phonation contrasts, and correlated with almost all the traditional EGG measures. Unlike the traditional measures, however, this component also seems to capture differences in abruptness of contact. Furthermore, previously collected perceptual responses from native speakers of one of the languages correlated better with this component than with any other EGG measure or any acoustic measure. The differences between these tense and lax phonations are not large, but apparently they are consistent enough, and perceptually robust enough, to support this linguistic contrast.
Collapse
Affiliation(s)
- Jianjing Kuang
- Department of Linguistics, University of Pennsylvania, 619 Williams Hall, Philadelphia, Pennsylvania 19104
| | - Patricia Keating
- Department of Linguistics, University of California, Los Angeles, Los Angeles, California 90095-1543
| |
Collapse
|
25
|
Waaramaa T, Palo P, Kankare E. Emotions in freely varying and mono-pitched vowels, acoustic and EGG analyses. LOGOP PHONIATR VOCO 2014; 40:156-70. [PMID: 24998780 DOI: 10.3109/14015439.2014.934277] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Vocal emotions are expressed either by speech or singing. The difference is that in singing the pitch is predetermined while in speech it may vary freely. It was of interest to study whether there were voice quality differences between freely varying and mono-pitched vowels expressed by professional actors. Given their profession, actors have to be able to express emotions both by speech and singing. Electroglottogram and acoustic analyses of emotional utterances embedded in expressions of freely varying vowels [a:], [i:], [u:] (96 samples) and mono-pitched protracted vowels (96 samples) were studied. Contact quotient (CQEGG) was calculated using 35%, 55%, and 80% threshold levels. Three different threshold levels were used in order to evaluate their effects on emotions. Genders were studied separately. The results suggested significant gender differences for CQEGG 80% threshold level. SPL, CQEGG, and F4 were used to convey emotions, but to a lesser degree, when F0 was predetermined. Moreover, females showed fewer significant variations than males. Both genders used more hypofunctional phonation type in mono-pitched utterances than in the expressions with freely varying pitch. The present material warrants further study of the interplay between CQEGG threshold levels and formant frequencies, and listening tests to investigate the perceptual value of the mono-pitched vowels in the communication of emotions.
Collapse
Affiliation(s)
- Teija Waaramaa
- a University of Tampere, School of Communication, Media and Theatre , Tampere , Finland
| | - Pertti Palo
- b Clinical Audiology, Speech and Language (CASL) Research Centre, Queen Margaret University , UK
| | - Elina Kankare
- c University of Tampere, Speech and Voice Research Laboratory, School of Education , Tampere , Finland.,d Ear and Oral Diseases, Department of Phoniatrics in Tampere University Hospital , Tampere , Finland
| |
Collapse
|
26
|
Dong L, Sundberg J, Kong J. Formant and voice source properties in two male Kunqu Opera roles: a pilot study. Folia Phoniatr Logop 2014; 65:294-302. [PMID: 24902631 DOI: 10.1159/000360760] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE This investigation analyzes flow glottogram and electroglottogram (EGG) parameters as well as the relationship between formant frequencies and partials in two male Kunqu Opera roles, Colorful face (CF) and Old man (OM). PARTICIPANTS AND METHODS Four male professional Kunqu Opera singers volunteered as participants, 2 singers for each role. Using inverse filtering of the audio signal flow glottogram parameters and formant frequencies were measured in each note of scales. Two EGG parameters, contact quotient (CoQ) and speed quotient, were measured. RESULTS Formant tuning was observed only in 1 of the OM singers and appeared in a pitch range lower than the passaggio range of Western male opera singers. Both the CF and the OM role singers showed high CoQ values and low values of the normalized amplitude quotient in singing. For 3 of the 4 singers CoQ and the level difference between the first and second partials showed a positive and a negative correlation with fundamental frequency (F0), respectively. CONCLUSIONS Formant tuning may be applied by a singer of the OM role, and both CF and OM role singers may use a rather pressed type of phonation, CF singers more than OM singers in the lower part of the pitch range. Most singers increased glottal adduction with rising F0.
Collapse
Affiliation(s)
- Li Dong
- Department of Chinese Language and Literature, Peking University, Beijing, China
| | | | | |
Collapse
|
27
|
Herbst CT, Lohscheller J, Švec JG, Henrich N, Weissengruber G, Fitch WT. Glottal opening and closing events investigated by electroglottography and super-high-speed video recordings. J Exp Biol 2014; 217:955-63. [DOI: 10.1242/jeb.093203] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Previous research has suggested that the peaks in the first derivative (dEGG) of the electroglottographic (EGG) signal are good approximate indicators of the events of glottal opening and closing. These findings were based on high-speed video (HSV) recordings with frame rates 10 times lower than the sampling frequencies of the corresponding EGG data. The present study attempts to corroborate these previous findings, utilizing super-HSV recordings. The HSV and EGG recordings (sampled at 27 and 44 kHz, respectively) of an excised canine larynx phonation were synchronized by an external TTL signal to within 0.037 ms. Data were analyzed by means of glottovibrograms, digital kymograms, the glottal area waveform and the vocal fold contact length (VFCL), a new parameter representing the time-varying degree of ‘zippering’ closure along the anterior–posterior (A–P) glottal axis. The temporal offsets between glottal events (depicted in the HSV recordings) and dEGG peaks in the opening and closing phase of glottal vibration ranged from 0.02 to 0.61 ms, amounting to 0.24–10.88% of the respective glottal cycle durations. All dEGG double peaks coincided with vibratory A–P phase differences. In two out of the three analyzed video sequences, peaks in the first derivative of the VFCL coincided with dEGG peaks, again co-occurring with A–P phase differences. The findings suggest that dEGG peaks do not always coincide with the events of glottal closure and initial opening. Vocal fold contacting and de-contacting do not occur at infinitesimally small instants of time, but extend over a certain interval, particularly under the influence of A–P phase differences.
Collapse
Affiliation(s)
- Christian T. Herbst
- Voice Research Laboratory, Department of Biophysics, Faculty of Science, Palacký University Olomouc, tr. 17. Listopadu 12, 771 46 Olomouc, Czech Republic
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria
| | - Jörg Lohscheller
- University of Applied Sciences, Department of Computer Science, Schneidershof, 54293 Trier, Germany
| | - Jan G. Švec
- Voice Research Laboratory, Department of Biophysics, Faculty of Science, Palacký University Olomouc, tr. 17. Listopadu 12, 771 46 Olomouc, Czech Republic
| | - Nathalie Henrich
- GIPSA-lab, CNRS, Grenoble INP, Grenoble University, 11 rue des Mathématiques – BP 46, 38402 Saint Martin d'Hères cedex, France
| | - Gerald Weissengruber
- University of Veterinary Medicine Vienna, Institute for Anatomy, Histology and Embryology, Veterinärplatz 1, 1210 Vienna, Austria
| | - W. Tecumseh Fitch
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria
| |
Collapse
|
28
|
|
29
|
Kankare E, Liu D, Laukkanen AM, Geneid A. EGG and Acoustic Analyses of Different Voice Samples: Comparison between Perceptual Evaluation and Voice Activity and Participation Profile. Folia Phoniatr Logop 2013; 65:98-104. [DOI: 10.1159/000354076] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
30
|
|
31
|
Kankare E, Laukkanen AM, Ilomäki I, Miettinen A, Pylkkänen T. Electroglottographic contact quotient in different phonation types using different amplitude threshold levels. LOGOP PHONIATR VOCO 2012; 37:127-32. [DOI: 10.3109/14015439.2012.664656] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
32
|
Kankare E, Laukkanen AM. Quasi-output-cost-ratio, perceived voice quality, and subjective evaluation in female kindergarten teachers. LOGOP PHONIATR VOCO 2012; 37:62-8. [DOI: 10.3109/14015439.2012.660500] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
33
|
Mecke AC, Sundberg J, Granqvist S, Echternach M. Comparing closed quotient in children singers' voices as measured by high-speed-imaging, electroglottography, and inverse filtering. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:435-441. [PMID: 22280605 DOI: 10.1121/1.3662061] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The closed quotient, i.e., the ratio between the closed phase and the period, is commonly studied in voice research. However, the term may refer to measures derived from different methods, such as inverse filtering, electroglottography or high-speed digital imaging (HSDI). This investigation compares closed quotient data measured by these three methods in two boy singers. Each singer produced sustained tones on two different pitches and a glissando. Audio, electroglottographic signal (EGG), and HSDI were recorded simultaneously. The audio signal was inverse filtered by means of the decap program; the closed phase was defined as the flat minimum portion of the flow glottogram. Glottal area was automatically measured in the high speed images by the built-in camera software, and the closed phase was defined as the flat minimum portion of the area-signal. The EGG-signal was analyzed in four different ways using the matlab open quotient interface. The closed quotient data taken from the EGG were found to be considerably higher than those obtained from inverse filtering. Also, substantial differences were found between the closed quotient derived from HSDI and those derived from inverse filtering. The findings illustrate the importance of distinguishing between these quotients.
Collapse
Affiliation(s)
- Ann-Christine Mecke
- Hochschule für Musik und Theater Felix Mendelssohn Bartholdy Leipzig, Dittrichring 21, D-04109 Leipzig, Germany.
| | | | | | | |
Collapse
|
34
|
Perturbation Measurements in Untrained Male Voices' Transitions From Modal to Falsetto Register. J Voice 2011; 25:663-9. [DOI: 10.1016/j.jvoice.2010.01.013] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Accepted: 01/18/2010] [Indexed: 11/23/2022]
|
35
|
Mourão AM, Bassi IB, Gama ACC. Avaliação eletroglotográfica de mulheres disfônicas com lesão de massa. REVISTA CEFAC 2011. [DOI: 10.1590/s1516-18462011005000097] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
OBJETIVO: comparar os resultados da avaliação eletroglotográfica entre mulheres adultas com laudo otorrinolaringológico de nódulos vocais e presença de fenda glótica com mulheres sem alterações laríngeas. MÉTODO: trata-se de 20 mulheres, sendo 10 com exame otorrinolaringológico de nódulo vocal, associado à fenda glótica, e 10 com vozes avaliadas na análise perceptivo-auditiva como neutras e avaliação otorrinolaringológica sem alteração laríngea. A faixa etária variou de 27 a 55 anos, com média de 37,60 e 36,00 para grupo estudo e controle, respectivamente. O material de voz foi a emissão habitual da vogal sustentada /a/. Foram extraídas as medidas de quociente de contato, frequência fundamental, jitter, bem como a classificação e interpretação dos tipos de ondas e presença do joelho eletroglotográfico. A análise estatística foi realizada pelo teste Exato de Fisher e T de Student. RESULTADOS: todos os traçados foram classificados como tipo II (100%). Não foi observada diferença na presença ou ausência de joelho entre grupos. A média do quociente de contato foi de 42,52 e 45,56 para grupo estudo e controle, respectivamente. Apesar do menor valor para o grupo estudado, não houve diferença significante entre grupos. Houve relação estatisticamente significante entre grupos no que se refere à frequência fundamental e jitter. A média da frequência fundamental foi 193,18 e 211,69 e o valor médio de jitter foi 1,21 e 2,9 para o grupo estudo e controle, respectivamente. CONCLUSÃO: mulheres com lesão de massa nas pregas vocais apresentam maiores valores de jitter e menores valores para frequência fundamental eletroglotográfica.
Collapse
|
36
|
Herbst CT, Qiu Q, Schutte HK, Švec JG. Membranous and cartilaginous vocal fold adduction in singing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:2253-62. [PMID: 21476680 DOI: 10.1121/1.3552874] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
While vocal fold adduction is an important parameter in speech, relatively little has been known on the adjustment of the vocal fold adduction in singing. This study investigates the possibility of separate adjustments of cartilaginous and membranous vocal fold adduction in singing. Six female and seven male subjects, singers and non-singers, were asked to imitate an instructor in producing four phonation types: "aBducted falsetto" (FaB), "aDducted falsetto" (FaD), "aBducted Chest" (CaB), and "aDducted Chest" (CaD). The phonations were evaluated using videostroboscopy, videokymography (VKG), electroglottography (EGG), and audio recordings. All the subjects showed less posterior (cartilaginous) vocal fold adduction in phonation types FaB and CaB than in FaD and CaD, and less membranous vocal fold adduction (smaller closed quotient) in FaB and FaD than in CaB and CaD. The findings indicate that the exercises enabled the singers to separately manipulate (a) cartilaginous adduction and (b) membranous medialization of the glottis though vocal fold bulging. Membranous adduction (monitored via videokymographic closed quotient) was influenced by both membranous medialization and cartilaginous adduction. Individual control over these types of vocal fold adjustments allows singers to create different vocal timbres.
Collapse
Affiliation(s)
- Christian T Herbst
- Department of Biophysics, Faculty of Science, Palacký University Olomouc, 17 listopadu 12, 771 46 Olomouc, Czech Republic.
| | | | | | | |
Collapse
|
37
|
Herbst CT, Fitch WTS, Svec JG. Electroglottographic wavegrams: a technique for visualizing vocal fold dynamics noninvasively. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3070-3078. [PMID: 21110602 DOI: 10.1121/1.3493423] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
A method for analyzing and displaying electroglottographic (EGG) signals (and their first derivative, DEGG) is introduced: the electroglottographic wavegram ("wavegram" hereafter). To construct a wavegram, the time-varying fundamental frequency is measured and consecutive individual glottal cycles are identified. Each cycle is locally normalized in duration and amplitude, the signal values are encoded by color intensity and the cycles are concatenated to display the entire voice sample in a single image, similar as in sound spectrography. The wavegram provides an intuitive means for quickly assessing vocal fold contact phenomena and their variation over time. Variations in vocal fold contact appear here as a sequence of events rather than single phenomena, taking place over a certain period of time, and changing with pitch, loudness and register. Multiple DEGG peaks are revealed in wavegrams to behave systematically, indicating subtle changes of vocal fold oscillatory regime. As such, EGG wavegrams promise to reveal more information on vocal fold contacting and de-contacting events than previous methods.
Collapse
Affiliation(s)
- Christian T Herbst
- Laboratory of Biophysics, Department of Experimental Physics, Faculty of Science, Palacký University Olomouc, tř 17 listopadu 12, 771 46 Olomouc, Czech Republic.
| | | | | |
Collapse
|
38
|
Daffern H, Howard DM. Voice source comparison between modern singers of early music and opera. LOGOP PHONIATR VOCO 2010; 35:68-73. [DOI: 10.3109/14015439.2010.482861] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
39
|
Using Electroglottographic Real-Time Feedback to Control Posterior Glottal Adduction during Phonation. J Voice 2010; 24:72-85. [DOI: 10.1016/j.jvoice.2008.06.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2008] [Accepted: 06/09/2008] [Indexed: 11/18/2022]
|
40
|
Herbst CT, Ternström S, Svec JG. Investigation of four distinct glottal configurations in classical singing--a pilot study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 125:EL104-9. [PMID: 19275279 DOI: 10.1121/1.3057860] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2008] [Accepted: 11/25/2008] [Indexed: 05/24/2023]
Abstract
This study investigates four qualities of singing voice in a classically trained baritone: "naive falsetto," "countertenor falsetto," "lyrical chest" and "full chest." Laryngeal configuration and vocal fold behavior in these qualities were studied using laryngeal videostroboscopy, videokymography, electroglottography, and sound spectrography. The data suggest that the four voice qualities were produced by independently manipulating mainly two laryngeal parameters: (1) the adduction of the arytenoid cartilages and (2) the thickening of the vocal folds. An independent control of the posterior adductory muscles versus the vocalis muscle is considered to be the physiological basis for achieving these singing voice qualities.
Collapse
Affiliation(s)
- Christian T Herbst
- Department of Experimental Physics, Faculty of Science, Laboratory of Biophysics, Palacky University Olomouc, Olomouc, Czech Republic.
| | | | | |
Collapse
|
41
|
Howard DM, Svec JG. A comparison between LPV and a sister journal in the field of phoniatrics and logopedics. LOGOP PHONIATR VOCO 2008; 33:162-7. [PMID: 19051096 DOI: 10.1080/14015430802638620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
42
|
Orlikoff RF. Voice Production during a Weightlifting and Support Task. Folia Phoniatr Logop 2008; 60:188-94. [DOI: 10.1159/000128277] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|