1
|
Melle N, Gallego C, Lahoz-Bengoechea JM, Nieva S. Differential spectral characteristics of the Spanish fricative /s/ in the articulation of individuals with dysarthria and apraxia of speech. JOURNAL OF COMMUNICATION DISORDERS 2024; 109:106428. [PMID: 38744198 DOI: 10.1016/j.jcomdis.2024.106428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 04/11/2024] [Accepted: 04/30/2024] [Indexed: 05/16/2024]
Abstract
PURPOSE This study examines whether there are differences in the speech of speakers with dysarthria, speakers with apraxia and healthy speakers in spectral acoustic measures during production of the central-peninsular Spanish alveolar sibilant fricative /s/. METHOD To this end, production of the sibilant was analyzed in 20 subjects with dysarthria, 8 with apraxia of speech and 28 healthy speakers. Participants produced 12 sV(C) words. The variables compared across groups were the fricative's spectral amplitude difference (AmpD) and spectral moments in the temporal midpoint of fricative execution. RESULTS The results indicate that individuals with dysarthria can be distinguished from healthy speakers in terms of the spectral characteristics AmpD, standard deviation (SD), center of gravity (CoG) and skewness, the last two in context with unrounded vowel, while no differences in kurtosis were detected. Participants with AoS group differ significantly from healthy speaker group in AmpD, SD and CoG and Kurtosis, the first one followed unrounded vowel and the latter two followed by rounded vowels. In addition, speakers with apraxia of speech group returned significant differences with respect to speakers with dysarthria group in AmpD, CoG and skewness. CONCLUSIONS The differences found between the groups in the measures studied as a function of the type of vowel context could provide insights into the distinctive manifestations of motor speech disorders, contributing to the differential diagnosis between apraxia and dysarthria in motor control processes.
Collapse
Affiliation(s)
- Natalia Melle
- Dept. of Experimental Psychology, Cognitive processes and Speech Therapy, Faculty of Psychology, Complutense University of Madrid, Madrid, Spain.
| | - Carlos Gallego
- Dept. of Experimental Psychology, Cognitive processes and Speech Therapy, Faculty of Psychology, Complutense University of Madrid, Madrid, Spain
| | - José María Lahoz-Bengoechea
- Dept. of Spanish Linguistics and Literary Theory, Faculty of Philology, Complutense University of Madrid, Madrid, Spain
| | - Silvia Nieva
- Dept. of Experimental Psychology, Cognitive processes and Speech Therapy, Faculty of Psychology, Complutense University of Madrid, Madrid, Spain
| |
Collapse
|
2
|
Yu ACL, McAllister R, Mularoni N, To CKS. Brief Report: Atypical Temporal Sensitivity in Coarticulation in Autism: Evidence from Sibilant-Vowel Interaction in Cantonese. J Autism Dev Disord 2024:10.1007/s10803-024-06258-w. [PMID: 38431693 DOI: 10.1007/s10803-024-06258-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/19/2024] [Indexed: 03/05/2024]
Abstract
PURPOSE Atypicalities in the prosodic aspects of speech are commonly considered in clinical assessments of autism. While there is an increasing number of studies using objective measures to assess prosodic deficits, such studies have primarily focused on the intonational and rhythmic aspects of prosody. Little is known about prosodic deficits that are reflected at the segmental level, despite the strong connection between prosody and segmental realization. This study examines the nature of sibilant-vowel coarticulation among male adult native speakers of Cantonese with autism and those without. METHODS Fifteen Cantonese-speaking autistic (ASD) adults (mean age = 25 years) and 23 neuro-typical (NT) adults (mean age = 20 years) participated. Each participant read aloud 42 syllables with a sibilant onset in carrier phrase. Spectral means and variance, skewness and kurtosis were measured, and regressed by vocalic rounding (rounded vs. unrounded), cohort (ASD vs. NT), sibilant duration, and articulation rate. RESULTS While neurotypical participants exhibit sibilant-vowel coarticulation that are sensitive to variation in sibilant duration, autistic participants show no sensitivity to segmental temporal changes. CONCLUSION These findings point to the potential for atypicalities in prosody-segment interaction as an important characteristic of autistic speech.
Collapse
Affiliation(s)
| | | | | | - Carol K S To
- The University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
3
|
Sfakianaki A, Nicolaidis K, Kafentzis GP. Temporal, spectral and amplitude characteristics of the Greek fricative /s/ in hearing-impaired and normal-hearing speech. CLINICAL LINGUISTICS & PHONETICS 2024:1-27. [PMID: 38271713 DOI: 10.1080/02699206.2023.2301308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 12/29/2023] [Indexed: 01/27/2024]
Abstract
Fricatives, and especially sibilants, are very frequently misarticulated by speakers with hearing loss. Misarticulations can result in phonemic contrast weakening or loss, compromising intelligibility. The present study focuses on the examination of acoustic characteristics of the Greek alveolar fricative /s/, an articulatorily demanding sound, produced by young adult speakers with profound hearing impairment and with normal hearing. An array of variables was examined using mixed-effects and random forest models aiming to assess the effectiveness of various measures in differentiating hearing-impaired and normal-hearing /s/ production. Significant differences were found in spectral and amplitude measures, but not in temporal measures. In hearing-impaired speech, spectral slope and RMS amplitude had significantly lower values, indicating a more distributed spectrum, suggestive of decreased flow velocity through the fricative constriction. Also, a trend for concentration of energy at lower frequencies was observed suggesting more posterior fricative articulation than normal. Moreover, measures capturing the variation of frequency and amplitude over time revealed different patterns of sibilance development across time than normal, denoting the production of a less well-formed or less sibilant /s/ by speakers with hearing impairment. The investigation of contextual effects on /s/ in hearing-impaired speech showed increased spectral variance, negative skewness and lower kurtosis in the labial (rounded) context /u/ in relation to the nonlabial contexts /i/ and /a/, indicating a more diffuse, less compact spectrum with concentration at high frequencies. Findings are discussed in relation to previous literature on fricative production by speakers with hearing impairment and normal hearing in Greek and other languages.
Collapse
Affiliation(s)
- Anna Sfakianaki
- Department of Philology, University of Ioannina, Ioannina, Greece
- Computer Science Department, University of Crete, Heraklion, Greece
| | - Katerina Nicolaidis
- Department of Theoretical and Applied Linguistics, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | | |
Collapse
|
4
|
Houle N, Lerario MP, Levi SV. Spectral analysis of strident fricatives in cisgender and transfeminine speakersa). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:3089-3100. [PMID: 37962405 PMCID: PMC10651311 DOI: 10.1121/10.0022387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 08/24/2023] [Accepted: 10/10/2023] [Indexed: 11/15/2023]
Abstract
The spectral features of /s/ and /ʃ/ carry important sociophonetic information regarding a speaker's gender. Often, gender is misclassified as a binary of male or female, but this excludes people who may identify as transgender or nonbinary. In this study, we use a more expansive definition of gender to investigate the acoustics (duration and spectral moments) of /s/ and /ʃ/ across cisgender men, cisgender women, and transfeminine speakers in voiced and whispered speech and the relationship between spectral measures and transfeminine gender expression. We examined /s/ and /ʃ/ productions in words from 35 speakers (11 cisgender men, 17 cisgender women, 7 transfeminine speakers) and 34 speakers (11 cisgender men, 15 cisgender women, 8 transfeminine speakers), respectively. In general, /s/ and /ʃ/ center of gravity was highest in productions by cisgender women, followed by transfeminine speakers, and then cisgender men speakers. There were no other gender-related differences. Within transfeminine speakers, /s/ and /ʃ/ center of gravity and skewness were not related to the time proportion expressing their feminine spectrum gender or their Trans Women Voice Questionnaire scores. Taken together, the acoustics of /s/ and /ʃ/ may signal gender group identification but may not account for within-gender variation in transfeminine gender expression.
Collapse
Affiliation(s)
- Nichole Houle
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | | | - Susannah V Levi
- Department of Communicative Sciences and Disorders, New York University, New York, New York 10012, USA
| |
Collapse
|
5
|
Shadle CH, Chen WR, Koenig LL, Preston JL. Refining and extending measures for fricative spectra, with special attention to the high-frequency rangea). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:1932-1944. [PMID: 37768114 PMCID: PMC10540850 DOI: 10.1121/10.0021075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 08/04/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023]
Abstract
Fricatives have noise sources that are filtered by the vocal tract and that typically possess energy over a much broader range of frequencies than observed for vowels and sonorant consonants. This paper introduces and refines fricative measurements that were designed to reflect underlying articulatory and aerodynamic conditions These show differences in the pattern of high-frequency energy for sibilants vs non-sibilants, voiced vs voiceless fricatives, and non-sibilants differing in place of articulation. The results confirm the utility of a spectral peak measure (FM) and low-mid frequency amplitude difference (AmpD) for sibilants. Using a higher-frequency range for defining FM for female voices for alveolars is justified; a still higher range was considered and rejected. High-frequency maximum amplitude (Fh) and amplitude difference between low- and higher-frequency regions (AmpRange) capture /f-θ/ differences in English and the dynamic amplitude range over the entire spectrum. For this dataset, with spectral information up to 15 kHz, a new measure, HighLevelD, was more effective than previously used LevelD and Slope in showing changes over time within the frication. Finally, isolated words and connected speech differ. This work contributes improved measures of fricative spectra and demonstrates the necessity of including high-frequency energy in those measures.
Collapse
Affiliation(s)
- Christine H Shadle
- Yale Child Study Center, School of Medicine, Yale University, New Haven, Connecticut 06519, USA
| | - Wei-Rong Chen
- Yale Child Study Center, School of Medicine, Yale University, New Haven, Connecticut 06519, USA
| | - Laura L Koenig
- Yale Child Study Center, School of Medicine, Yale University, New Haven, Connecticut 06519, USA
| | - Jonathan L Preston
- Department of Communication Sciences and Disorders, Syracuse University, Syracuse, New York 13244, USA
| |
Collapse
|
6
|
Hamza Y, Farhadi A, Schwarz DM, McDonough JM, Carney LH. Representations of fricatives in subcortical model responses: Comparisons with human consonant perception. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:602-618. [PMID: 37535429 PMCID: PMC10550336 DOI: 10.1121/10.0020536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 07/11/2023] [Accepted: 07/13/2023] [Indexed: 08/05/2023]
Abstract
Fricatives are obstruent sound contrasts made by airflow constrictions in the vocal tract that produce turbulence across the constriction or at a site downstream from the constriction. Fricatives exhibit significant intra/intersubject and contextual variability. Yet, fricatives are perceived with high accuracy. The current study investigated modeled neural responses to fricatives in the auditory nerve (AN) and inferior colliculus (IC) with the hypothesis that response profiles across populations of neurons provide robust correlates to consonant perception. Stimuli were 270 intervocalic fricatives (10 speakers × 9 fricatives × 3 utterances). Computational model response profiles had characteristic frequencies that were log-spaced from 125 Hz to 8 or 20 kHz to explore the impact of high-frequency responses. Confusion matrices generated by k-nearest-neighbor subspace classifiers were based on the profiles of average rates across characteristic frequencies as feature vectors. Model confusion matrices were compared with published behavioral data. The modeled AN and IC neural responses provided better predictions of behavioral accuracy than the stimulus spectra, and IC showed better accuracy than AN. Behavioral fricative accuracy was explained by modeled neural response profiles, whereas confusions were only partially explained. Extended frequencies improved accuracy based on the model IC, corroborating the importance of extended high frequencies in speech perception.
Collapse
Affiliation(s)
- Yasmeen Hamza
- Department of Biomedical Engineering, University of Rochester, Rochester, New York 14627, USA
| | - Afagh Farhadi
- Department of Electrical and Computer Engineering, University of Rochester, Rochester, New York 14627, USA
| | - Douglas M Schwarz
- Depts. of Neuroscience and Biomedical Engineering, University of Rochester, Rochester, New York 14627, USA
| | - Joyce M McDonough
- Department of Linguistics, University of Rochester, Rochester, New York 14627, USA
| | - Laurel H Carney
- Depts. of Biomedical Engineering, Neuroscience, and Electrical and Computer Engineering, University of Rochester, Rochester, New York 14627, USA
| |
Collapse
|
7
|
Cox SR, Huang T, Chen WR, Ng ML. An acoustic study of Cantonese alaryngeal speech in different speaking conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:2973. [PMID: 37212513 PMCID: PMC10205142 DOI: 10.1121/10.0019471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 04/30/2023] [Accepted: 05/02/2023] [Indexed: 05/23/2023]
Abstract
Esophageal (ES) speech, tracheoesophageal (TE) speech, and the electrolarynx (EL) are common methods of communication following the removal of the larynx. Our recent study demonstrated that intelligibility may increase for Cantonese alaryngeal speakers using clear speech (CS) compared to their everyday "habitual speech" (HS), but the reasoning is still unclear [Hui, Cox, Huang, Chen, and Ng (2022). Folia Phoniatr. Logop. 74, 103-111]. The purpose of this study was to assess the acoustic characteristics of vowels and tones produced by Cantonese alaryngeal speakers using HS and CS. Thirty-one alaryngeal speakers (9 EL, 10 ES, and 12 TE speakers) read The North Wind and the Sun passage in HS and CS. Vowel formants, vowel space area (VSA), speaking rate, pitch, and intensity were examined, and their relationship to intelligibility were evaluated. Statistical models suggest that larger VSAs significantly improved intelligibility, but slower speaking rate did not. Vowel and tonal contrasts did not differ between HS and CS for all three groups, but the amount of information encoded in fundamental frequency and intensity differences between high and low tones positively correlated with intelligibility for TE and ES groups, respectively. Continued research is needed to understand the effects of different speaking conditions toward improving acoustic and perceptual characteristics of Cantonese alaryngeal speech.
Collapse
Affiliation(s)
- Steven R Cox
- Department of Communication Sciences and Disorders, Adelphi University, Garden City, New York 11530, USA
| | - Ting Huang
- Haskins Laboratories, New Haven, Connecticut 06511, USA
| | - Wei-Rong Chen
- Haskins Laboratories, New Haven, Connecticut 06511, USA
| | - Manwa L Ng
- Speech Science Laboratory, Faculty of Education, University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
8
|
Ulrich N, Pellegrino F, Allassonnière-Tang M. Intra- and inter-speaker variation in eight Russian fricativesa). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:2285. [PMID: 37092935 DOI: 10.1121/10.0017827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 03/26/2023] [Indexed: 05/03/2023]
Abstract
Acoustic variation is central to the study of speaker characterization. In this respect, specific phonemic classes such as vowels have been particularly studied, compared to fricatives. Fricatives exhibit important aperiodic energy, which can extend over a high-frequency range beyond that conventionally considered in phonetic analyses, often limited up to 12 kHz. We adopt here an extended frequency range up to 20.05 kHz to study a corpus of 15 812 fricatives produced by 59 speakers in Russian, a language offering a rich inventory of fricatives. We extracted two sets of parameters: the first is composed of 11 parameters derived from the frequency spectrum and duration (acoustic set) while the second is composed of 13 mel frequency cepstral coefficients (MFCCs). As a first step, we implemented machine learning methods to evaluate the potential of each set to predict gender and speaker identity. We show that gender can be predicted with a good performance by the acoustic set and even more so by MFCCs (accuracy of 0.72 and 0.88, respectively). MFCCs also predict individuals to some extent (accuracy = 0.64) unlike the acoustic set. In a second step, we provide a detailed analysis of the observed intra- and inter-speaker acoustic variation.
Collapse
Affiliation(s)
- Natalja Ulrich
- Laboratoire Dynamique Du Langage (DDL) UMR 5596, CNRS/Université Lyon 2, Lyon, France
| | - François Pellegrino
- Laboratoire Dynamique Du Langage (DDL) UMR 5596, CNRS/Université Lyon 2, Lyon, France
| | - Marc Allassonnière-Tang
- Lab Ecological-Anthropology, Unité Mixte de Recherche 7206, National Museum of Natural History, Paris, France
| |
Collapse
|
9
|
Oliver S, Keyser MMB, Jhingree S, Bocklage C, Lathrop H, Giduz N, Moss K, Blakey G, White R, Turvey T, Mielke J, Zajac D, Jacox LA. Impacts of anterior-posterior jaw disproportions on speech of dentofacial disharmony patients. Eur J Orthod 2023; 45:1-10. [PMID: 36308520 PMCID: PMC9912703 DOI: 10.1093/ejo/cjac057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND/OBJECTIVES Articulation problems impact communication, development, and quality of life, and are diagnosed in 73-87% of patients with Class II Dentofacial Disharmony (DFD). We evaluated whether differences exist in stop (/t/ or/k/), fricative (/s/ or/ʃ/), and affricate (/tʃ/) consonant sounds of Class II DFD subjects, and whether extent of malocclusion correlates with severity of speech distortion. We hypothesized that Class II patients display milder distortions than Class III and anterior open bite (AOB), as Class II patients can posture into a Class I occlusion. MATERIALS/METHODS Audio and orthodontic records were collected from DFD patients (N = 53-Class II, 102-Class III, 72-Controls) who were pursuing orthodontics and orthognathic surgery. A speech pathologist perceptually scored speech. Acoustic differences in recordings were measured using Spectral Moment Analysis. RESULTS When Class II subjects were compared to controls, significant differences were found for the centroid frequency (M1) of the /s/ sound and the spectral spread (M2) of /t/, /tʃ/, and /s/ sounds, with pairwise significance for controls relative to Class II AOB and all Class II subjects. Class II AOB subjects had higher M1 and M2 values than patients with Class II closed bites and Class I controls for most sounds. When comparing across anterior-posterior (AP) groups, differences exist between controls, Class II and III DFD subjects for M1 of /t/, /tʃ/, and/ʃ/ and M2 for /t/, /tʃ/, /s/, and /ʃ/ sounds. Using linear regression, correlations between Class II and III severity and spectral measures were found for /t/ and /tʃ/ sounds. CONCLUSIONS/IMPLICATIONS Class II and III patients have a higher prevalence of qualitative distortions and spectral changes in consonants compared to controls, but Class II spectral shifts are smaller and affect fewer sounds than in Class III and AOB cohorts. Linear correlations between AP discrepancy and spectral change suggest causation and that treatment may improve articulation problems.
Collapse
Affiliation(s)
- Steven Oliver
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Mary Morgan Bitler Keyser
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Samantha Jhingree
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Clare Bocklage
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Hillary Lathrop
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Natalie Giduz
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Kevin Moss
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - George Blakey
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Raymond White
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Timothy Turvey
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Jeff Mielke
- North Carolina State University, English Department, Tompkins Hall, Raleigh, NC, USA
| | - David Zajac
- Division of Craniofacial and Surgical Care, Speech Pathology Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
| | - Laura Anne Jacox
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, Chapel Hill, NC, USA
- UNC Adams School of Dentistry, 385 S Columbia St, Chapel Hill, NC 27599, USA
| |
Collapse
|
10
|
Kim H, Gurevich N. Positional asymmetries in consonant production and intelligibility in dysarthric speech. CLINICAL LINGUISTICS & PHONETICS 2023; 37:125-142. [PMID: 34955080 DOI: 10.1080/02699206.2021.2019312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 12/09/2021] [Accepted: 12/10/2021] [Indexed: 06/14/2023]
Abstract
Consonant production errors are common in dysarthric speech, but not all consonants are affected to the same extent. Currently, only limited knowledge exists regarding whether different positional allophones are affected to varying degrees in dysarthric speech. The aim of this study was to investigate the effect of consonants' position-in-words on consonant production accuracy and their relevance to speech intelligibility. To this end, the percentage of correctly articulated consonants was analyzed with respect to position-in-words, manner of articulation, and speakers' overall intelligibility in dysarthric speech secondary to cerebral palsy. Results revealed that consonants were generally more accurate in initial positions than other positions, and when they were singletons rather than clusters. However, fricatives, a category commonly noted for frequent misarticulations, exhibited no significant positional effects, indicating that fricatives were affected to a similar degree across all word positions. In addition, positional asymmetry manifested to a greater degree as intelligibility decreased. Finally, the strength of the relationship between consonant production accuracy and intelligibility varied depending on positions-in-words, with strong correlations in the medial and final cluster positions but no significant correlation for fricatives in the initial and final singleton positions. The correlation was markedly low for the initial consonants, possibly due to the resistance of initial consonants to production errors. The positional asymmetry found in this study necessitates more research on non-initial consonants and clusters for their potential in improving consonant production and speech intelligibility as a part of dysarthria management.
Collapse
Affiliation(s)
- Heejin Kim
- Linguistics, University of Illinois, Urbana-Champaign, Illinois, USA
| | - Naomi Gurevich
- Communication Sciences and Disorders, Purdue University Fort Wayne, Fort Wayne, Indiana, USA
| |
Collapse
|
11
|
Shadle CH. Alternatives to moments for characterizing fricatives: Reconsidering Forrest et al. (1988). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:1412. [PMID: 36859163 DOI: 10.1121/10.0017231] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
Means of characterizing acoustic signals of fricatives with a few parameters have long been sought. When Forrest, Weismer, Milenkovic, and Dougall [(1988) J. Acoust. Soc. Am. 84, 115-123] described their system of treating spectra as probability density functions and computing the first four spectral moments, others quickly adopted their clearly described method, although it did not distinguish /f/ and /θ/. Various problems with their method are described, including the lack of spectral averaging, the necessity of normalizing the amplitude, and correlation between pairs of moments. Even when these issues are rectified by alternative methods, the fact remains that moments are not ideal descriptors because they can only describe departures from the shape of a normal Gaussian distribution. Fricative spectra, particularly of non-sibilants, are often quite dissimilar in shape from Gaussians. Furthermore, shape descriptors do not lend themselves to direct inferences about the production variables that caused the acoustic effects. Here, alternative parameters are defined, it is shown how to adapt them to specific experimental conditions, and tests of efficacy are proposed. These parameters are strongly linked to the articulatory and aerodynamic variables that underlie fricative production.
Collapse
|
12
|
Wikse Barrow C, Włodarczak M, Thörn L, Heldner M. Static and dynamic spectral characteristics of Swedish voiceless fricatives. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:2588. [PMID: 36456287 DOI: 10.1121/10.0014947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 10/05/2022] [Indexed: 06/17/2023]
Abstract
Descriptions of the acoustic characteristics of Swedish voiceless fricatives are scarce and are limited to static measures derived from the speech of a small number of speakers. The current study provides an updated acoustic description of the static (spectral, temporal, and intensity) characteristics of word-initial voiceless fricatives in Central Standard Swedish. In addition, temporal variation of spectral centre of gravity is modelled using a generalized additive mixed model. Results show that fricatives were differentiated in terms of spectral properties, duration, and intensity level, such that sibilant fricatives were generally longer and more intense than non-sibilant fricatives. Spectral centre of gravity differentiated between all places of articulation apart from labio-dental /f/. Gender differences were found for centre of gravity in /s/ but overall, sex/gender differences were small. Dynamic analyses revealed differences in curvature as well as overall level of spectral centre of gravity across the duration of the fricative, associated with place of articulation and mediated by vowel context, fricative duration, and speaker specific patterns. The results from the present study are valuable for future cross-linguistic research, and as reference for investigations concerning children's acquisition of Swedish voiceless fricatives.
Collapse
Affiliation(s)
| | | | - Lisa Thörn
- Department of Linguistics, Stockholm University, Stockholm, Sweden
| | - Mattias Heldner
- Department of Linguistics, Stockholm University, Stockholm, Sweden
| |
Collapse
|
13
|
Kharlamov V, Brenner D, Tucker BV. Temporal and spectral characteristics of conversational versus read fricatives in American English. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:2073. [PMID: 36319243 DOI: 10.1121/10.0014420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
The present study compares the production of fricatives in conversational versus read speech in American English. The goal is to examine which parameters contribute to the identification of fricatives across the two speech styles. The study surveys over 162 000 fricative tokens from the Buckeye Corpus [Pitt, Johnson, Hume, Kiesling, and Raymond (2005). Speech Commun. 45, 89-95] and the TIMIT Corpus [Zue and Seneff (1996). Recent Research towards Advanced Man-Machine Interface through Spoken Language (Elsevier, Amsterdam, the Netherlands), pp. 515-525]. A total of 18 different temporal and spectral measures are tested, including segment duration, preceding and following phone duration, spectral moments (at onset, midpoint, and/or offset), spectral peak frequency, etc. Results show that segment duration and midpoint spectral moments make the most prominent contribution to the categorization of fricatives for both speech styles. Spectral measures are more important for conversational speech, whereas duration plays a greater role for read speech. At the same time, the magnitude of the differences across speech styles is often low and many of the observed effects may be attributable to methodological differences across the corpora. Results may indicate that reduction of fricatives in conversational speech is more limited compared to the reduction of other types of speech sounds, such as plosives.
Collapse
Affiliation(s)
- Viktor Kharlamov
- Department of Languages, Linguistics, and Comparative Literature, Florida Atlantic University, Boca Raton, Florida 33431, USA
| | | | - Benjamin V Tucker
- Department of Linguistics, University of Alberta, Edmonton, Alberta T6G 2E7, Canada
| |
Collapse
|
14
|
Rong P, Hansen O, Heidrick L. Relationship between rate-elicited changes in muscular-kinematic control strategies and acoustic performance in individuals with ALS-A multimodal investigation. JOURNAL OF COMMUNICATION DISORDERS 2022; 99:106253. [PMID: 36007484 DOI: 10.1016/j.jcomdis.2022.106253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 06/15/2023]
Abstract
INTRODUCTION As a key control variable, duration has been long suspected to mediate the organization of speech motor control strategies, which has management implications for neuromotor speech disorders. This study aimed to experimentally delineate the role of duration in organizing speech motor control in neurologically healthy and impaired speakers using a voluntary speaking rate manipulation paradigm. METHODS Thirteen individuals with amyotrophic lateral sclerosis (ALS) and 10 healthy controls performed a sentence reading task three times, first at their habitual rate, then at a slower rate. A multimodal approach combining surface electromyography, kinematic, and acoustic technologies was used to record jaw muscle activities, jaw kinematics, and speech acoustics. Six muscular-kinematic features were extracted and factor-analyzed to characterize the organization of the mandibular control hierarchy. Five acoustic features were extracted, measuring the spectrotemporal properties of the diphthong /ɑɪ/ and the plosives /t/ and /k/. RESULTS The muscular-kinematic features converged into two interpretable latent factors, reflecting the level and cohesiveness/flexibility of mandibular control, respectively. Voluntary rate reduction led to a trend toward (1) finer, less cohesive, and more flexible mandibular control, and (2) increased range and decreased transition slope of the diphthong formants, across neurologically healthy and impaired groups. Differential correlations were found between the rate-elicited changes in mandibular control and acoustic performance for neurologically healthy and impaired speakers. CONCLUSIONS The results provided empirical evidence for the long-suspected but previously unsubstantiated role of duration in (re)organizing speech motor control strategies. The rate-elicited reorganization of muscular-kinematic control contributed to the acoustic performance of healthy speakers, in ways consistent with theoretical predictions. Such contributions were less consistent in impaired speakers, implying the complex nature of speaking rate reduction in ALS, possibly reflecting an interplay of disease-related constraints and volitional duration control. This information may help to stratify and identify candidates for the rate manipulation therapy.
Collapse
Affiliation(s)
- Panying Rong
- Department of Speech-Language-Hearing: Sciences & Disorders, University of Kansas, Lawrence KS, USA.
| | - Olivia Hansen
- Department of Speech-Language-Hearing: Sciences & Disorders, University of Kansas, Lawrence KS, USA; Department of Hearing & Speech, University of Kansas Medical Center, Kansas City, KS, USA
| | - Lindsey Heidrick
- Department of Hearing & Speech, University of Kansas Medical Center, Kansas City, KS, USA
| |
Collapse
|
15
|
Xu M, Shao J, Ding H, Wang L. The effect of aging on identification of Mandarin consonants in normal and whisper registers. Front Psychol 2022; 13:962242. [PMID: 36033000 PMCID: PMC9413086 DOI: 10.3389/fpsyg.2022.962242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 07/21/2022] [Indexed: 11/17/2022] Open
Abstract
Consonant perception in older adults has been widely explored in recent years. However, how aging affects the identification of Mandarin consonants, especially in whispered condition, are under studied. Mandarin consonants and whispering speech have unique features, which may result in different identification difficulties for older adults. The current study investigated older adults' identification of Mandarin consonants in phonated and whispered conditions in comparison with the performance of younger adults. It was found that in phonated condition, older adults showed the lowest accuracy for affricatives and fricatives owing to their insensitivity to high-frequency information. The lower accuracy of affricatives and plosives was largely attributed to the difficulty in recognizing articulatory places. Identifying aspirated plosives was much more difficult than unaspirated counterparts for older adults. In whispered condition, the recognition of voiced consonants and aspirated-unaspirated distinction became challenging, especially for older adults. Contrary to the expectation, some consonants became easier to be recognized in the whispered condition, i.e., /ph, tɕh, x/. These findings enrich our understanding of how aging affects consonant identification in different languages and less ideal conditions. It also suggests that the listener's ability, language uniqueness, and characteristics of distorted speech should be all taken into consideration when investigating speech perception in adverse conditions.
Collapse
Affiliation(s)
- Min Xu
- Institute of Corpus Studies and Applications, Shanghai International Studies University, Shanghai, China
| | - Jing Shao
- Department of English Language and Literature, Hong Kong Baptist University, Kowloon Tong, Hong Kong SAR, China
- *Correspondence: Jing Shao
| | - Hongwei Ding
- Speech-Language-Hearing Center, School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
| | - Lan Wang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| |
Collapse
|
16
|
Jhingree S, Xie Y, Bocklage C, Giduz N, Moss K, Zajac D, Jacox LA. Validating Spectral Moment Analysis as a Quantitative Measure of Speech Distortions in Speakers With Class III Malocclusions. PERSPECTIVES OF THE ASHA SPECIAL INTEREST GROUPS 2022; 7:728-740. [PMID: 35875421 PMCID: PMC9302712 DOI: 10.1044/2022_persp-21-00315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
PURPOSE This study examined the relationship between judged speech sound distortions and spectral moment metrics in speakers with Class III malocclusion. METHODS A quantitative online survey was distributed to 30 speech specialists (clinicians and/or students) and 100 lay listeners to judge the clarity of the sounds /s/, /ʃ/, /t/ and /k/ using a visual analog scale (VAS) from recordings of 11 Class III (underbite) Dentofacial Disharmony (DFD) patients and eight Class I controls. Patients and controls were grouped according to high, moderate, and low /s/-/ʃ/ first spectral moment differences. A linear mixed model was used to analyze the data. RESULTS VAS scale ratings increased as a function of decreasing spectral contrast for both groups of listeners. VAS ratings of speech specialists were more homogenous than lay listeners, and speech specialists rated distortions as less severe than lay listeners. CONCLUSIONS Recordings of Class III DFD patients with low /s/-/ʃ/ first spectral moment differences were scored by listeners as having increased VAS scale ratings, indicative of more significant perceived speech-sound distortions. Spectral moment analysis appears to be a promising approach for characterizing speech of DFD patients and other craniofacial disorders.
Collapse
Affiliation(s)
- Samantha Jhingree
- Orthodontics, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina, 201 Brauer Hall, CB 7450, 25799-7450, USA
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, CB #7455, Chapel Hill, NC 27599, USA
| | - Yushan Xie
- Orthodontics, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina, 201 Brauer Hall, CB 7450, 25799-7450, USA
| | - Clare Bocklage
- Orthodontics, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina, 201 Brauer Hall, CB 7450, 25799-7450, USA
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, CB #7455, Chapel Hill, NC 27599, USA
| | - Natalie Giduz
- UNC School of Social Work, Tate-Turner-Kuralt Building, 325 Pittsboro Street, CB #3550, Chapel Hill, NC 27599-3550 USA
| | - Kevin Moss
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, CB #7455, Chapel Hill, NC 27599, USA
| | - David Zajac
- Division of Craniofacial and Surgical Care, Craniofacial Center, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Laura Anne Jacox
- Orthodontics, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina, 201 Brauer Hall, CB 7450, 25799-7450, USA
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, CB #7455, Chapel Hill, NC 27599, USA
| |
Collapse
|
17
|
Rao D, Shaw J, Dockum R. Data on acoustic phonetic properties of non-coronal fricatives in monosyllabic words of Zhongjiang Chinese. Data Brief 2022; 42:108062. [PMID: 35356315 PMCID: PMC8958547 DOI: 10.1016/j.dib.2022.108062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/09/2022] [Accepted: 03/10/2022] [Indexed: 11/05/2022] Open
Abstract
The data reported in this article are non-coronal fricative measurements from 10 (5 male; 5 female) native speakers of Zhongjiang Chinese. Each speaker produced 10 repetitions of 90 monosyllabic words beginning with either a velar fricative, /x/, or a labial-dental fricative, /f/. The measurements reported include spectral properties often used to characterize fricative variation, including: spectrum center of gravity (CoG), spectrum standard deviation (SD), spectrum skew, spectrum kurtosis, maximum amplitude frequency, and maximum amplitude. These measurements are compared across two data filtering conditions: a high pass filter condition, in which a 300Hz high pass filter was applied to the data before spectral measurements were calculated, and a no filter condition. The 90 monosyllabic words include the target fricatives in different phonetic environments. Target words include some that historically derive from different fricatives and show variation across regional varieties of Mandarin Chinese. Subsets of the target materials enable several closely matched comparisons of items. We describe measurements across the whole dataset, comparing as well the effect that filtering has on the measurements. The data also include a CSV file with measurements of each token, which enables comparison of phonetic contexts, lexical effects and individual differences in fricative variation beyond those described here. For further discussion of the data, please refer to the full length article entitled “The role of gestural timing in non-coronal fricative mergers in Southwestern Mandarin: acoustic evidence from a dialect island. Journal of Phonetics” [6].
Collapse
|
18
|
Keyser MMB, Lathrop H, Jhingree S, Giduz N, Bocklage C, Couldwell S, Oliver S, Moss K, Frazier-Bowers S, Phillips C, Turvey T, Blakey G, White R, White RP, McMichael DL, Zajac D, Mielke J, Jacox LA. Impacts of Skeletal Anterior Open Bite Malocclusion on Speech. FACE (THOUSAND OAKS, CALIF.) 2022; 3:339-349. [PMID: 35903399 PMCID: PMC9328410 DOI: 10.1177/27325016221082229] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
INTRODUCTION Articulation problems are seen in 80-90% of dentofacial deformity (DFD) subjects compared with 5% of the general population, impacting communication and quality of life, but the causal link is unclear. We hypothesize there are both qualitative (perceptual) and quantitative (spectral) differences in properties of stop (/t/ or /k/), fricative (/s/ or /∫/), and affricate (/t∫/) consonant sounds and that severity of anterior open bite (AOB) jaw disharmonies correlates with degree of speech abnormality. METHODS To test our hypotheses, surgical orthodontic records and audio recordings were collected from DFD patients (n=39 AOB, 62 controls). A speech pathologist evaluated subjects and recordings were analyzed using spectral moment analysis (SMA) to measure sound frequency distortions. RESULTS Perceptually, there is a higher prevalence of auditory and visual speech distortions in AOB DFD patients when compared to controls. Quantitatively, a significant (p<0.01) increase in the centroid frequency (M1) was seen in the /k/, /t/, /t∫/, and /s/ sounds of AOB subjects compared to the controls. Using linear regression, correlations between AOB skeletal severity and spectral distortion were found for /k/ and /t/ sounds. CONCLUSIONS A higher prevalence of qualitative distortion and significant quantitative spectral distortions in consonant sounds were seen in AOB patients compared to controls. Additionally, severity of skeletal AOB is correlated with degree of distortion for consonant sounds. These findings provide insight into how the surgical and/or orthodontic treatment of AOB may impact speech.
Collapse
Affiliation(s)
- Mary Morgan Bitler Keyser
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Hillary Lathrop
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Samantha Jhingree
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Natalie Giduz
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Clare Bocklage
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Sandrine Couldwell
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Steven Oliver
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Kevin Moss
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Sylvia Frazier-Bowers
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Ceib Phillips
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Timothy Turvey
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - George Blakey
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Ray White
- Division of Craniofacial and Surgical Care, Oral Surgery Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | | | | | - David Zajac
- Division of Craniofacial and Surgical Care, Speech Pathology Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| | - Jeff Mielke
- North Carolina State University, English Department, Tompkins Hall, Raleigh, NC, 27695, USA
| | - Laura Anne Jacox
- Division of Craniofacial and Surgical Care, Orthodontics Group, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, 270 Brauer Hall, CB#270, Chapel Hill, NC 25799-7450, USA
| |
Collapse
|
19
|
Lathrop-Marshall H, Keyser MMB, Jhingree S, Giduz N, Bocklage C, Couldwell S, Edwards H, Glesener T, Moss K, Frazier-Bowers S, Phillips C, Turvey T, Blakey G, White R, Mielke J, Zajac D, Jacox LA. Orthognathic speech pathology: impacts of Class III malocclusion on speech. Eur J Orthod 2022; 44:340-351. [PMID: 34562076 PMCID: PMC9127721 DOI: 10.1093/ejo/cjab067] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
INTRODUCTION Patients with dentofacial disharmonies (DFDs) seek orthodontic care and orthognathic surgery to address issues with mastication, esthetics, and speech. Speech distortions are seen 18 times more frequently in Class III DFD patients than the general population, with unclear causality. We hypothesize there are significant differences in spectral properties of stop (/t/ or /k/), fricative (/s/ or /ʃ/), and affricate (/tʃ/) consonants and that severity of Class III disharmony correlates with the degree of speech abnormality. METHODS To understand how jaw disharmonies influence speech, orthodontic records and audio recordings were collected from Class III surgical candidates and reference subjects (n = 102 Class III, 62 controls). A speech pathologist evaluated subjects and recordings were quantitatively analysed by Spectral Moment Analysis for frequency distortions. RESULTS A majority of Class III subjects exhibit speech distortions. A significant increase in the centroid frequency (M1) and spectral spread (M2) was seen in several consonants of Class III subjects compared to controls. Using regression analysis, correlations between Class III skeletal severity (assessed by cephalometric measures) and spectral distortion were found for /t/ and /k/ phones. CONCLUSIONS Class III DFD patients have a higher prevalence of articulation errors and significant spectral distortions in consonants relative to controls. This is the first demonstration that severity of malocclusion is quantitatively correlated with the degree of speech distortion for consonants, suggesting causation. These findings offer insight into the complex relationship between craniofacial structures and speech distortions.
Collapse
Affiliation(s)
- Hillary Lathrop-Marshall
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Mary Morgan B Keyser
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Samantha Jhingree
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Natalie Giduz
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Clare Bocklage
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Sandrine Couldwell
- Division of Craniofacial and Surgical Care, Oral Surgery, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Haley Edwards
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Tim Glesener
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Kevin Moss
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Sylvia Frazier-Bowers
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Ceib Phillips
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Timothy Turvey
- Division of Craniofacial and Surgical Care, Oral Surgery, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - George Blakey
- Division of Craniofacial and Surgical Care, Oral Surgery, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Ray White
- Division of Craniofacial and Surgical Care, Oral Surgery, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Jeff Mielke
- Department of English, North Carolina State University, Raleigh, USA
| | - David Zajac
- Division of Craniofacial and Surgical Care, Craniofacial Center, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| | - Laura A Jacox
- Division of Craniofacial and Surgical Care, Orthodontics, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina, Chapel Hill, USA
| |
Collapse
|
20
|
Hussain Q, Kochetov A. Acoustic classification of coronal stops of Eastern Punjabi. PHONETICA 2021; 79:77-110. [PMID: 34963204 DOI: 10.1515/phon-2021-2015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Punjabi is an Indo-Aryan language which contrasts a rich set of coronal stops at dental and retroflex places of articulation across three laryngeal configurations. Moreover, all these stops occur contrastively in various positions (word-initially, -medially, and -finally). The goal of this study is to investigate how various coronal place and laryngeal contrasts are distinguished acoustically both within and across word positions. A number of temporal and spectral correlates were examined in data from 13 speakers of Eastern Punjabi: Voice Onset Time, release and closure durations, fundamental frequency, F1-F3 formants, spectral center of gravity and standard deviation, H1*-H2*, and cepstral peak prominence. The findings indicated that higher formants and spectral measures were most important for the classification of place contrasts across word positions, whereas laryngeal contrasts were reliably distinguished by durational and voice quality measures. Word-medially and -finally, F2 and F3 of the preceding vowels played a key role in distinguishing the dental and retroflex stops, while spectral noise measures were more important word-initially. The findings of this study contribute to a better understanding of factors involved in the maintenance of typologically rare and phonetically complex sets of place and laryngeal contrasts in the coronal stops of Indo-Aryan languages.
Collapse
Affiliation(s)
- Qandeel Hussain
- Department of Linguistics, University of Toronto, Toronto, ON, Canada
| | - Alexei Kochetov
- Department of Linguistics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
21
|
Zajac DJ, Whitt H, Baylis A, Tourian M, Garcia K. Alveolar Backing in 3-Year-Old Children with and without Repaired Cleft Palate: Preliminary Findings Related to Cleft Type and History of Otitis Media. PERSPECTIVES OF THE ASHA SPECIAL INTEREST GROUPS 2021; 6:1889-1899. [PMID: 35224200 PMCID: PMC8865393 DOI: 10.1044/2021_persp-21-00085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
OBJECTIVE The purpose of this preliminary study was to determine if cleft type and/or history of otitis media with effusion (OM) contribute to backing of /t/ and/or /s/ in young children with and without repaired cleft palate. METHOD Participants were 39 children (mean age=36 months, range 34 to 41). Ten children had repaired unilateral cleft lip and palate (CLP), nine had repaired cleft palate only (CP), twelve had no clefts but histories of OM, and eight were typically-developing (TD) without clefts or OM history. All children were video and audio recorded during administration of the Goldman-Fristoe Test of Articulation-3 (GFTA-3). Standard scores of articulation, frequency of alveolar backing, and first spectral moments of the /t/-/k/ and /s/-/ʃ/ phonetic contrasts were obtained. RESULTS Children with CLP had lower GFTA-3 scores than both TD (p=.012) and OM (p=.001) groups. Fisher's Exact test showed that significantly more children with CLP backed alveolar targets, mostly /s/, than children with CP (p=.020). Children with CLP also had (a) reduced /t/-/k/ spectral difference compared to TD children (p=.016) and (b) reduced /s/-/ʃ/ spectral difference compared to both children with CP (p=.010) and children with OM (p=.018). Children with OM had reduced /t/-/k/ spectral difference compared to TD children (p=.009). CONCLUSIONS Cleft type contributes to alveolar backing and reduced spectral contrast of /s/-/ʃ/ in 3-year-old children with repaired cleft palate. History of OM affects spectral contrast of /t/-/k/ in non-cleft children. Etiology and clinical implications of alveolar backing are discussed.
Collapse
Affiliation(s)
- David J. Zajac
- Craniofacial Center, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina at Chapel Hill
| | - Hannah Whitt
- Craniofacial Center, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina at Chapel Hill, NC
| | - Adriane Baylis
- Section of Plastic and Reconstructive Surgery, Nationwide Children’s Hospital, Department of Plastic Surgery, The Ohio State University College of Medicine, Columbus, OH
| | - Maura Tourian
- Craniofacial Center, Division of Craniofacial and Surgical Care, Adams School of Dentistry, University of North Carolina at Chapel Hill, NC
| | | |
Collapse
|
22
|
Bolyanatz M, Brogan FD. Acoustic differences between Chilean and Salvadoran Spanish /s/. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:2446. [PMID: 34717508 DOI: 10.1121/10.0006443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 09/07/2021] [Indexed: 06/13/2023]
Abstract
This paper examines seven acoustic properties of /s/ in Chilean Spanish (CS) and Salvadoran Spanish (ES). Acoustic measurements from 36 speakers, balanced for gender and dialect, reveal several cross-dialectal differences: CS /s/ was significantly longer than ES /s/ in coda position and had a higher spectral center of gravity. Two findings were mitigated by gender: women in both dialects were more likely to voice /s/, but the magnitude of the difference in voicing between male and female speakers was greater in CS. There were no differences in relative intensity in CS, while in ES, /s/ used by males has significantly lower relative intensity than /s/ used by females. No dialect differences were found for variance, kurtosis, or skewness. While both CS and ES are frequently collocated under the umbrella of "/s/ weakening dialects," our results show that tokens of non-deleted /s/ are acoustically distinct. These findings suggest that shorter duration and lower spectral energy in ES could be the result of a looser or backer constriction of /s/ and may have implications for work examining cross-dialectal differences in /s/ lenition at the phonetics/phonology interface.
Collapse
Affiliation(s)
- Mariška Bolyanatz
- Department of Spanish and French Studies, Occidental College, Los Angeles, California 90041, USA
| | - Franny D Brogan
- Linguistics and Cognitive Science Department, Pomona College, Claremont, California 91711, USA
| |
Collapse
|
23
|
Ulrich N, Allassonnière-Tang M, Pellegrino F, Dediu D. Identifying the Russian voiceless non-palatalized fricatives /f/, /s/, and /ʃ/ from acoustic cues using machine learning. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:1806. [PMID: 34598630 DOI: 10.1121/10.0005950] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 08/05/2021] [Indexed: 06/13/2023]
Abstract
This paper shows that machine learning techniques are very successful at classifying the Russian voiceless non-palatalized fricatives [f], [s], and [ʃ] using a small set of acoustic cues. From a data sample of 6320 tokens of read sentences produced by 40 participants, temporal and spectral measurements are extracted from the full sound, the noise duration, and the middle 30 ms windows. Furthermore, 13 mel-frequency cepstral coefficients (MFCCs) are computed from the middle 30 ms window. Classifiers based on single decision trees, random forests, support vector machines, and neural networks are trained and tested to distinguish between these three fricatives. The results demonstrate that, first, the three acoustic cue extraction techniques are similar in terms of classification accuracy (93% and 99%) but that the spectral measurements extracted from the full frication noise duration result in slightly better accuracy. Second, the center of gravity and the spectral spread are sufficient for the classification of [f], [s], and [ʃ] irrespective of contextual and speaker variation. Third, MFCCs show a marginally higher predictive power over spectral cues (<2%). This suggests that both sets of measures provide sufficient information for the classification of these fricatives and their choice depends on the particular research question or application.
Collapse
Affiliation(s)
- Natalja Ulrich
- Laboratoire Dynamique Du Langage (DDL) UMR 5596, CNRS/Université Lyon 2, Lyon, France
| | | | - François Pellegrino
- Laboratoire Dynamique Du Langage (DDL) UMR 5596, CNRS/Université Lyon 2, Lyon, France
| | - Dan Dediu
- Laboratoire Dynamique Du Langage (DDL) UMR 5596, CNRS/Université Lyon 2, Lyon, France
| |
Collapse
|
24
|
Ostrand R, Chodroff E. It's alignment all the way down, but not all the way up: Speakers align on some features but not others within a dialogue. JOURNAL OF PHONETICS 2021; 88:101074. [PMID: 34366499 PMCID: PMC8345023 DOI: 10.1016/j.wocn.2021.101074] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
During conversation, speakers modulate characteristics of their production to match their interlocutors' characteristics. This behavior is known as alignment. Speakers align at many linguistic levels, including the syntactic, lexical, and phonetic levels. As a result, alignment is often treated as a unitary phenomenon, in which evidence of alignment on one feature is cast as alignment of the entire linguistic level. This experiment investigates whether alignment can occur at some levels but not others, and on some features but not others, within a given dialogue. Participants interacted with two experimenters with highly contrasting acoustic-phonetic and syntactic profiles. The experimenters each described sets of pictures using a consistent acoustic-phonetic and syntactic profile; the participants then described new pictures to each experimenter individually. Alignment was measured as the degree to which subjects matched their current listener's speech (vs. their non-listener's) on each of several individual acoustic-phonetic and syntactic features. Additionally, a holistic measure of phonetic alignment was assessed using 323 acoustic-phonetic features analyzed jointly in a machine learning classifier. Although participants did not align on several individual spectral-phonetic or syntactic features, they did align on individual temporal-phonetic features and as measured by the holistic acoustic-phonetic profile. Thus, alignment can simultaneously occur at some levels but not others within a given dialogue, and is not a single phenomenon but rather a constellation of loosely-related effects. These findings suggest that the mechanism underlying alignment is not a primitive, automatic priming mechanism but rather guided by communicative or social factors.
Collapse
Affiliation(s)
- Rachel Ostrand
- IBM Research, Yorktown Heights, NY, USA
- Corresponding author. (R. Ostrand)
| | - Eleanor Chodroff
- Department of Language and Linguistic Science, University of York, Heslington, York, United Kingdom
| |
Collapse
|
25
|
Barbier G, Merzouki R, Bal M, Baum SR, Shiller DM. Visual feedback of the tongue influences speech adaptation to a physical modification of the oral cavity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:718. [PMID: 34470311 DOI: 10.1121/10.0005520] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 06/15/2021] [Indexed: 06/13/2023]
Abstract
Studies examining sensorimotor adaptation of speech to changing sensory conditions have demonstrated a central role for both auditory and somatosensory feedback in speech motor learning. The potential influence of visual feedback of oral articulators, which is not typically available during speech production but may nonetheless enhance oral motor control, remains poorly understood. The present study explores the influence of ultrasound visual feedback of the tongue on adaptation of speech production (focusing on the sound /s/) to a physical perturbation of the oral articulators (prosthesis altering the shape of the hard palate). Two visual feedback groups were tested that differed in the two-dimensional plane being imaged (coronal or sagittal) during practice producing /s/ words, along with a no-visual-feedback control group. Participants in the coronal condition were found to adapt their speech production across a broader range of acoustic spectral moments and syllable contexts than the no-feedback controls. In contrast, the sagittal group showed reduced adaptation compared to no-feedback controls. The results indicate that real-time visual feedback of the tongue is spontaneously integrated during speech motor adaptation, with effects that can enhance or interfere with oral motor learning depending on compatibility of the visual articulatory information with requirements of the speaking task.
Collapse
Affiliation(s)
- Guillaume Barbier
- École d'Orthophonie et d'Audiologie, Université de Montréal, Case Postale 6128, Succursale Centre-Ville, Montréal, Québec H3C 3J7, Canada
| | - Ryme Merzouki
- École d'Orthophonie et d'Audiologie, Université de Montréal, Case Postale 6128, Succursale Centre-Ville, Montréal, Québec H3C 3J7, Canada
| | - Mathilde Bal
- École d'Orthophonie et d'Audiologie, Université de Montréal, Case Postale 6128, Succursale Centre-Ville, Montréal, Québec H3C 3J7, Canada
| | - Shari R Baum
- School of Communication Sciences and Disorders, McGill University, 2001 McGill College Avenue, Suite 800, Montréal, Québec H3A 1G1, Canada
| | - Douglas M Shiller
- École d'Orthophonie et d'Audiologie, Université de Montréal, Case Postale 6128, Succursale Centre-Ville, Montréal, Québec H3C 3J7, Canada
| |
Collapse
|
26
|
Advances in Parkinson's Disease detection and assessment using voice and speech: A review of the articulatory and phonatory aspects. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102418] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
27
|
Höhle B, Fritzsche T, Boll-Avetisyan N, Hullebus M, Gafos A. Respect the surroundings: Effects of phonetic context variability on infants' learning of minimal pairs. JASA EXPRESS LETTERS 2021; 1:024401. [PMID: 36154034 DOI: 10.1121/10.0003574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Fourteen-month-olds' ability to distinguish a just learned word, /buːk/, from its minimally different word, /duːk/, was assessed under two pre-exposure conditions: one where /b, d/-initial forms occurred in a varying vowel context and another where the vowel was fixed but the final consonant varied. Infants in the experiments benefited from the variable vowel but not from the variable final consonant context, suggesting that vowel variability but not all kinds of variability are beneficial. These results are discussed in the context of time-honored observations on the vowel-dependent nature of place of articulation cues for consonants.
Collapse
Affiliation(s)
- Barbara Höhle
- University of Potsdam, Cognitive Sciences, Department of Linguistics, Karl-Liebknecht-Strasse 24-25, D-14476 Potsdam, Germany , , , ,
| | - Tom Fritzsche
- University of Potsdam, Cognitive Sciences, Department of Linguistics, Karl-Liebknecht-Strasse 24-25, D-14476 Potsdam, Germany , , , ,
| | - Natalie Boll-Avetisyan
- University of Potsdam, Cognitive Sciences, Department of Linguistics, Karl-Liebknecht-Strasse 24-25, D-14476 Potsdam, Germany , , , ,
| | - Marc Hullebus
- University of Potsdam, Cognitive Sciences, Department of Linguistics, Karl-Liebknecht-Strasse 24-25, D-14476 Potsdam, Germany , , , ,
| | - Adamantios Gafos
- University of Potsdam, Cognitive Sciences, Department of Linguistics, Karl-Liebknecht-Strasse 24-25, D-14476 Potsdam, Germany , , , ,
| |
Collapse
|
28
|
Chiu C, Wei PC, Noguchi M, Yamane N. Sibilant Fricative Merging in Taiwan Mandarin: An Investigation of Tongue Postures using Ultrasound Imaging. LANGUAGE AND SPEECH 2020; 63:877-897. [PMID: 31888408 PMCID: PMC7891883 DOI: 10.1177/0023830919896386] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In Taiwan Mandarin, retroflex [ʂ] is allegedly merging with dental [s], reducing the traditional three-way contrast between sibilant fricatives (i.e., dental [s]-retroflex [ʂ]-alveopalatal [ɕ]) to a two-way contrast. Most of the literature on the observed merging focuses on the acoustic properties and perceptual identification of the sibilants, whereas much less attention has been drawn to the articulatory evidence accounting for the aforementioned sibilant merging. The current study employed ultrasound imaging techniques to uncover the tongue postures for the three sibilant fricatives [s, ʂ, ɕ] in Taiwan Mandarin occurring before vowels [a], [ɨ], and [o]. Results revealed varying classes of the [s-ʂ] merger: complete merging (overlap), no merging (non-overlap), and context-dependent merging (context-dependent overlap, which only occurred before [a]). The observed [s-ʂ] merger was also confirmed by the perceptual identification by trained phoneticians. Center of gravity (CoG), a reliable spectral moment of identifying different sibilant fricatives, was also measured to reflect the articulatory-acoustic correspondence. Results showed that the [s-ʂ] merger varies across speakers and may also be conditioned by vowel contexts and that articulatory mergers may not be entirely reflected in CoG values, suggesting that auxiliary articulatory gestures may be employed to maintain the acoustic contrast.
Collapse
Affiliation(s)
- Chenhao Chiu
- Graduate Institute of Linguistics, National Taiwan University, Taiwan
| | | | - Masaki Noguchi
- Department of Linguistics, University of British Columbia, Canada
| | - Noriko Yamane
- Graduate School of Integrated Arts and Sciences, Hiroshima University, Japan
| |
Collapse
|
29
|
Themistocleous C, Eckerström M, Kokkinakis D. Voice quality and speech fluency distinguish individuals with Mild Cognitive Impairment from Healthy Controls. PLoS One 2020; 15:e0236009. [PMID: 32658934 PMCID: PMC7357785 DOI: 10.1371/journal.pone.0236009] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 06/27/2020] [Indexed: 11/19/2022] Open
Abstract
Mild Cognitive Impairment (MCI) is a syndrome characterized by cognitive decline greater than expected for an individual's age and education level. This study aims to determine whether voice quality and speech fluency distinguish patients with MCI from healthy individuals to improve diagnosis of patients with MCI. We analyzed recordings of the Cookie Theft picture description task produced by 26 patients with MCI and 29 healthy controls from Sweden and calculated measures of voice quality and speech fluency. The results show that patients with MCI differ significantly from HC with respect to acoustic aspects of voice quality, namely H1-A3, cepstral peak prominence, center of gravity, and shimmer; and speech fluency, namely articulation rate and averaged speaking time. The method proposed along with the obtainability of connected speech productions can enable quick and easy analysis of speech fluency and voice quality, providing accessible and objective diagnostic markers of patients with MCI.
Collapse
Affiliation(s)
| | - Marie Eckerström
- Department of Psychiatry and Neurochemistry, University of Gothenburg, Gothenburg, Sweden
| | - Dimitrios Kokkinakis
- Department of Swedish, University of Gothenburg, Gothenburg, Sweden
- Center of Ageing and Health—AgeCap, University of Gothenburg, Gothenburg, Sweden
| |
Collapse
|
30
|
Grandon B, Vilain A. Development of fricative production in French-speaking school-aged children using cochlear implants and children with normal hearing. JOURNAL OF COMMUNICATION DISORDERS 2020; 86:105996. [PMID: 32485648 DOI: 10.1016/j.jcomdis.2020.105996] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 03/27/2020] [Accepted: 03/29/2020] [Indexed: 06/11/2023]
Abstract
In the course of productive phonological development, fricatives are among the last speech sounds to emerge and to be mastered by children, probably because of the high degree of articulatory precision they require or because of difficulties with their perception. Children with cochlear implants (CI) face additional difficulties with fricative perception, since high spectral frequency components are shown to be especially difficult to perceive with a cochlear implant. Studying fricative production in children with CIs allows to study how the partial transmission of speech sounds by cochlear implants influences children's speech production, and therefore to explore how perceptual abilities influence the late stages of phonological development. This acoustic study focuses on fricative production at three places of articulation (i.e., /f/, /s/ and /ʃ/), comparing productions by two groups of children (20 children with normal hearing (NH) vs. 13 children with CIs, all aged 5;7 to 10;7 years), and taking into account their consistency in coarticulation and the stability of their production across two different tasks (word-repetition and picture-naming). Statistical analyses were carried out by means of linear mixed-effect models. The results show that while both groups produce /ʃ/ with similar acoustic characteristics, between-group differences are found for /f/ and /s/. Furthermore, effects of consonant-vowel coarticulation are found for children with NH, and are absent for children with CIs. Effects of chronological age are only found for children with CIs (production in older children with CIs nearing that of children with NH). Our study shows that the development of fricative production of five- to 11-year-old children with CIs is affected by the children's hearing abilities and late access to auditory information. These limitations however do not prevent the children from eventually reaching a consistency similar to that of children with NH, as suggested by the fact that their production is still evolving during that age span. The results also show that the acquisition of coarticulation strategies can be impeded by degraded or delayed access to audio.
Collapse
Affiliation(s)
- Bénédicte Grandon
- Université Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, France.
| | - Anne Vilain
- Université Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, France; Institut Universitaire de France, France
| |
Collapse
|
31
|
Chodroff E, Wilson C. Acoustic-phonetic and auditory mechanisms of adaptation in the perception of sibilant fricatives. Atten Percept Psychophys 2020; 82:2027-2048. [PMID: 31875314 PMCID: PMC7297833 DOI: 10.3758/s13414-019-01894-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Listeners are highly proficient at adapting to contextual variation when perceiving speech. In the present study, we examined the effects of brief speech and nonspeech contexts on the perception of sibilant fricatives. We explored three theoretically motivated accounts of contextual adaptation, based on phonetic cue calibration, phonetic covariation, and auditory contrast. Under the cue calibration account, listeners adapt by estimating a talker-specific average for each phonetic cue or dimension; under the cue covariation account, listeners adapt by exploiting consistencies in how the realization of speech sounds varies across talkers; under the auditory contrast account, adaptation results from (partial) masking of spectral components that are shared by adjacent stimuli. The spectral center of gravity, a phonetic cue to fricative identity, was manipulated for several types of context sound: /z/-initial syllables, /v/-initial syllables, and white noise matched in long-term average spectrum (LTAS) to the /z/-initial stimuli. Listeners' perception of the /s/-/ʃ/ contrast was significantly influenced by /z/-initial syllables and LTAS-matched white noise stimuli, but not by /v/-initial syllables. No significant difference in adaptation was observed between exposure to /z/-initial syllables and matched white noise stimuli, and speech did not have a considerable advantage over noise when the two were presented consecutively within a context. The pattern of findings is most consistent with the auditory contrast account of short-term perceptual adaptation. The cue covariation account makes accurate predictions for speech contexts, but not for nonspeech contexts or for the absence of a speech-versus-nonspeech difference.
Collapse
Affiliation(s)
- Eleanor Chodroff
- Department of Language and Linguistic Science, University of York, Heslington, York, YO10 5DD, UK.
| | - Colin Wilson
- Department of Cognitive Science, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD, 21218, USA
| |
Collapse
|
32
|
Buyuknacar GB, Gulec A. Correlation between the cephalometric measurements and acoustic properties of /s/ sound in Turkish. J Appl Oral Sci 2020; 28:e20190399. [PMID: 32348443 PMCID: PMC7185985 DOI: 10.1590/1678-7757-2019-0399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 12/19/2019] [Indexed: 12/05/2022] Open
Abstract
Objectives To evaluate the acoustic properties of the /s/ sound in individuals with different occlusion types and to investigate relationships between these properties and cephalometric measurements. Methodology Sixty patients were divided into three groups based on malocclusion. Group 1 included 20 patients (mean age: 14.85±2.01 years) with Class I skeletal and dental relationships. Group 2 included 20 patients (mean age: 13.49±1.78 years) with Class II skeletal and dental relationships. Group 3 included 20 patients (mean age: 12.46±2.62 years) with Class III skeletal and dental relationships. Cephalometric tracings were obtained from cephalometric radiographs. All included patients were native speakers of Turkish. The /s/ sound was selected for center of gravity analysis. Correlations between cephalometric values and acoustic parameters were also investigated. Results The center of gravity of the /s/ sound had the lowest value in Group 2 (p<0.05). For the /s/ sound in Group 3, moderate positive correlations were found between center of gravity and Sella-Nasion to Gonion-Gnathion angle (p<0.05, r=0.444) Lower incisor to Nasion-B point (p<0.023, r=0.505), and Lower incisor to Nasion-B point angle (p<0.034; r=0.476). No correlation was found in other cephalometric measurements. Conclusions The /s/ sound was affected by malocclusion due to the changing place of articulation. Therefore, referral to an orthodontist for malocclusion treatment especially patients with class III in the early period is suggested for producing acoustically ideal sound.
Collapse
Affiliation(s)
| | - Aysegül Gulec
- Department of Orthodontics, Gaziantep University, Gaziantep, Turkey
| |
Collapse
|
33
|
Reilly KJ. Vowel and Sibilant Production in Noise: Effects of Noise Frequency and Phonological Similarity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1002-1017. [PMID: 32293944 PMCID: PMC7242990 DOI: 10.1044/2020_jslhr-19-00345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Revised: 11/20/2019] [Accepted: 12/30/2019] [Indexed: 06/11/2023]
Abstract
Purpose This study investigated vowel and sibilant productions in noise to determine whether responses to noise (a) are sensitive to the spectral characteristics of the noise signal and (b) are modulated by the contribution of vowel or sibilant contrasts to word discrimination. Method Vowel and sibilant productions were elicited during serial recall of three-word sequences that were produced in quiet or during exposure to speaker-specific noise signals. These signals either masked a speaker's productions of the sibilants /s/ and /ʃ/ or their productions of the vowels /a/ and /æ/. The contribution of the vowel and sibilant contrasts to word discrimination in a sequence was manipulated by varying the number of times that the target sibilant and vowel pairs occurred in the same word position in each sequence. Results Spectral noise effects were observed for both sibilants and vowels: Responses to noise were larger and/or involved to more acoustic features when the noise signal masked the acoustic characteristics of that phoneme class. Word discrimination effects were limited and consisted of only small increases in vowel duration. Interaction effects between noise and similarity indicated that the phonological similarity of sequences containing both sibilants and/or both vowels influenced articulation in ways not related to speech clarity. Conclusion The findings of this study indicate that sensorimotor control of speech exhibits some sensitivity to noise spectral characteristics. However, productions of sibilants and vowels were not sensitive to their importance in discriminating the words in a sequence. In addition, phonological similarity effects were observed that likely reflected processing demands related to the recall and sequencing of high-similarity words.
Collapse
Affiliation(s)
- Kevin J. Reilly
- Department of Audiology and Speech Pathology, College of Health Professions, University of Tennessee Health Science Center, Memphis
| |
Collapse
|
34
|
Galle ME, Klein-Packard J, Schreiber K, McMurray B. What Are You Waiting For? Real-Time Integration of Cues for Fricatives Suggests Encapsulated Auditory Memory. Cogn Sci 2020; 43. [PMID: 30648798 DOI: 10.1111/cogs.12700] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2017] [Revised: 10/15/2018] [Accepted: 10/25/2018] [Indexed: 11/30/2022]
Abstract
Speech unfolds over time, and the cues for even a single phoneme are rarely available simultaneously. Consequently, to recognize a single phoneme, listeners must integrate material over several hundred milliseconds. Prior work contrasts two accounts: (a) a memory buffer account in which listeners accumulate auditory information in memory and only access higher level representations (i.e., lexical representations) when sufficient information has arrived; and (b) an immediate integration scheme in which lexical representations can be partially activated on the basis of early cues and then updated when more information arises. These studies have uniformly shown evidence for immediate integration for a variety of phonetic distinctions. We attempted to extend this to fricatives, a class of speech sounds which requires not only temporal integration of asynchronous cues (the frication, followed by the formant transitions 150-350 ms later), but also integration across different frequency bands and compensation for contextual factors like coarticulation. Eye movements in the visual world paradigm showed clear evidence for a memory buffer. Results were replicated in five experiments, ruling out methodological factors and tying the release of the buffer to the onset of the vowel. These findings support a general auditory account for speech by suggesting that the acoustic nature of particular speech sounds may have large effects on how they are processed. It also has major implications for theories of auditory and speech perception by raising the possibility of an encapsulated memory buffer in early auditory processing.
Collapse
Affiliation(s)
- Marcus E Galle
- Department of Psychological and Brain Sciences, University of Iowa
| | | | | | - Bob McMurray
- Department of Psychological and Brain Sciences, University of Iowa.,Department of Communication Sciences and Disorders, University of Iowa.,Department of Linguistics, University of Iowa.,Department of Otolaryngology, University of Iowa
| |
Collapse
|
35
|
Hussain Q, Mielke J. An acoustic and articulatory study of laryngeal and place contrasts of Kalasha (Indo-Aryan, Dardic). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:2873. [PMID: 32359277 DOI: 10.1121/10.0000999] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 11/19/2019] [Indexed: 06/11/2023]
Abstract
The Northwestern group of Indo-Aryan (Dardic) languages is generally known to have undergone consonantal shift, which resulted in the loss of voiced aspirated (VDA) stops and affricates of Sanskrit. Kalasha, an endangered Dardic language spoken in Chitral (Northern Pakistan), still preserves the Old Indo-Aryan four-way laryngeal system. The current study examines acoustic and articulatory correlates of Kalasha's four-way laryngeal contrast across places and manners of articulation, using lingual ultrasound-imaging and several acoustic measures. The analysis included the standard acoustic [voice onset time (VOT), after prevoicing interval (API), fundamental frequency onset, first four spectral moments] and articulatory (smoothing spline analysis of variance) measures, which capture laryngeal, place, and manner differences in consonants. The results showed that VOT reliably differentiated the four-way laryngeal contrast of Kalasha. VDA stops and affricates are characterized by shorter voicing lead VOT, higher API, and lower fundamental frequency onset than their voiced unaspirated (VDUA) counterparts. However, the first four spectral moments did not distinguish the two VDUA and VDA stop series. The tongue root retraction distinguishes the voiceless stops and affricates from the voiced ones.
Collapse
Affiliation(s)
- Qandeel Hussain
- Department of English (Linguistics Program), North Carolina State University, Campus Box 8105, Raleigh, North Carolina 27695, USA
| | - Jeff Mielke
- Department of English (Linguistics Program), North Carolina State University, Campus Box 8105, Raleigh, North Carolina 27695, USA
| |
Collapse
|
36
|
Barbier G, Baum SR, Ménard L, Shiller DM. Sensorimotor adaptation across the speech production workspace in response to a palatal perturbation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1163. [PMID: 32113266 DOI: 10.1121/10.0000672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 01/15/2020] [Indexed: 06/10/2023]
Abstract
Talkers have been shown to adapt the production of multiple vowel sounds simultaneously in response to altered auditory feedback. The present study extends this work by exploring the adaptation of speech production to a physical alteration of the vocal tract involving a palatal prosthesis that impacts both somatosensory and auditory feedback during the production of a range of consonants and vowels. Acoustic and kinematic measures of the tongue were used to examine the impact of the physical perturbation across the various speech sounds, and to assess learned changes following 20 min of speech practice involving the production of complex, variable sentences. As in prior studies, acoustic analyses showed perturbation and adaptation effects primarily for sounds directly involving interaction with the palate. Analyses of tongue kinematics, however, revealed systematic, robust effects of the perturbation and subsequent motor learning across the full range of speech sounds. The results indicate that speakers are able to reconfigure oral motor patterns during the production of multiple speech sounds spanning the articulatory workspace following a physical alteration of the vocal tract.
Collapse
Affiliation(s)
- Guillaume Barbier
- École d'orthophonie et d'audiologie, Université de Montréal, Case Postale 6128, Succursale Centre-Ville, Montréal, Québec, H3C 3J7, Canada
| | - Shari R Baum
- School of Communication Sciences and Disorders, McGill University, 2001 McGill College Avenue, Suite 800, Montréal, Québec, H3A 1G1, Canada
| | - Lucie Ménard
- Département de linguistique, Université du Québec à Montréal, Case Postale 8888, Succursale Centre-Ville, Montréal, Québec, H3C 3P8, Canada
| | - Douglas M Shiller
- École d'orthophonie et d'audiologie, Université de Montréal, Case Postale 6128, Succursale Centre-Ville, Montréal, Québec, H3C 3J7, Canada
| |
Collapse
|
37
|
Khwaileh FA, Flipsen P, Hammouri HM, Alzoubi FQ. Acoustic characteristics of Arabic pharyngealized obstruents in children with cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:893. [PMID: 31472526 DOI: 10.1121/1.5119355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Accepted: 07/10/2019] [Indexed: 06/10/2023]
Abstract
Speech production of children with cochlear implants (CIs) is generally characterized by low intelligibility and reduced phoneme accuracy. However, limited research investigated their speech production using acoustic measures. The current study examined voice onset time (VOT) for pharyngealized plosives [t,d], and spectral moments and noise duration for pharyngealized fricatives [s,ð] produced by Arabic speaking children with CIs. Productions from children with CIs were compared with both chronological age-matched and hearing experience-matched normal hearing children. Results showed that children with CIs exhibited difficulty producing distinct VOTs between plosives and produced different spectral patterns of both fricatives relative to both comparison groups; however, they were able to produce an acoustic distinction between both fricatives. Children with CIs produced the fricatives with lower spectral mean and higher skewness and kurtosis. The sources for inter-group differences in the acoustic measures appeared to be due in part to limitations in the quality of auditory input provided by CIs as well as reduced motor experience in speech production. Results suggest that VOT and spectral moments are sensitive to changes in perceived sound quality. Spectral moments analysis appears to give details on subtle aspects of fricative production at the phonetic level beyond that available using perceptual judgments.
Collapse
Affiliation(s)
- Fadwa A Khwaileh
- Division of Speech and Hearing, Department of Rehabilitation Sciences, Faculty of Applied Medical Sciences, Jordan University of Science and Technology, Irbid 22110, Jordan
| | - Peter Flipsen
- School of Communication Sciences and Disorders, Pacific University, 2043 College Way, Forest Grove, Oregon 97116, USA
| | - Hanan M Hammouri
- Department of Mathematics and Statistics, Faculty of Sciences and Arts, Jordan University of Science and Technology, Irbid 22110, Jordan
| | - Firas Q Alzoubi
- Division of Otorhinolaryngology, Faculty of Medicine, Jordan University of Science and Technology, Irbid 22110, Jordan
| |
Collapse
|
38
|
Liu CT, Chen LM, Lin YC, Cheng CY, Lin YC. Fricative productions of Mandarin-speaking children with cerebral palsy: the case of five-year-olds. CLINICAL LINGUISTICS & PHONETICS 2019; 34:256-270. [PMID: 31364876 DOI: 10.1080/02699206.2019.1636137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 06/14/2019] [Accepted: 06/21/2019] [Indexed: 06/10/2023]
Abstract
This study aimed at improving the understanding of speech characteristics of fricatives produced by five-year-old Mandarin-acquiring children with cerebral palsy (CP). Productions from nine CP children and nine gender-and-age-matched typically developing (TD) children were collected and analyzed. Results from transcription indicated that the CP group had lower production accuracy rates for all the five fricatives in Mandarin Chinese. Additionally, when the CP children failed to articulate the target fricative segments, they tended to delete them or convert them into non-continuant segments. Results from acoustic analyses indicated that the M2 values of the labiodental [f] and the M1 and M2 values of the alveolar [s] were higher among the CP children. The experimental results revealed that: (1) Observable differences were available once the age of the groups was properly controlled and acoustical measurements were adopted; (2) the lack of finer-grained speech motor control abilities among CP children were reflected in the M1 and M2 values; (3) for segments at the anterior places, the clinical group failed to extend the articulatory gestures to the desirable positions. It is suggested that future studies focusing on different age groups and children with different native languages would help to approach the nature of articulatory barriers among individuals with CP.
Collapse
Affiliation(s)
- Chin-Ting Liu
- Department of Foreign Language, R.O.C. Naval Academy, Kaohsiung, Taiwan
| | - Li-Mei Chen
- Department of Foreign Languages & Literature, National Cheng Kung University, Tainan, Taiwan
| | - Yu-Ching Lin
- Department of Physical Medicine and Rehabilitation, College of Medicine, National Cheng Kung University, Tainan, Taiwan
- Department of Physical Medicine and Rehabilitation,National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | - Ching-Yi Cheng
- Department of Physical Medicine and Rehabilitation,National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | - Yung-Chieh Lin
- Department of Pediatrics, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
39
|
Yoo H, Buder EH, Bowman DD, Bidelman GM, Oller DK. Acoustic Correlates and Adult Perceptions of Distress in Infant Speech-Like Vocalizations and Cries. Front Psychol 2019; 10:1154. [PMID: 31191389 PMCID: PMC6548812 DOI: 10.3389/fpsyg.2019.01154] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 05/02/2019] [Indexed: 11/13/2022] Open
Abstract
Prior research has not evaluated acoustic features contributing to perception of human infant vocal distress or lack thereof on a continuum. The present research evaluates perception of infant vocalizations along a continuum ranging from the most prototypical intensely distressful cry sounds ("wails") to the most prototypical of infant sounds that typically express no distress (non-distress "vocants"). Wails are deemed little if at all related to speech while vocants are taken to be clear precursors to speech. We selected prototypical exemplars of utterances representing the whole continuum from 0 and 1 month-olds. In this initial study of the continuum, our goals are to determine (1) listener agreement on level of vocal distress across the continuum, (2) acoustic parameters predicting ratings of distress, (3) the extent to which individual listeners maintain or change their acoustic criteria for distress judgments across the study, (4) the extent to which different listeners use similar or different acoustic criteria to make judgments, and (5) the role of short-term experience among the listeners in judgments of infant vocalization distress. Results indicated that (1) both inter-rater and intra-rater listener agreement on degree of vocal distress was high, (2) the best predictors of vocal distress were number of vibratory regimes within utterances, utterance duration, spectral ratio (spectral concentration) in vibratory regimes within utterances, and mean pitch, (3) individual listeners significantly modified their acoustic criteria for distress judgments across the 10 trial blocks, (4) different listeners, while showing overall similarities in ratings of the 42 stimuli, also showed significant differences in acoustic criteria used in assigning the ratings of vocal distress, and (5) listeners who were both experienced and inexperienced in infant vocalizations coding showed high agreement in rating level of distress, but differed in the extent to which they relied on the different acoustic cues in making the ratings. The study provides clearer characterization of vocal distress expression in infants based on acoustic parameters and a new perspective on active adult perception of infant vocalizations. The results also highlight the importance of vibratory regime segmentation and analysis in acoustically based research on infant vocalizations and their perception.
Collapse
Affiliation(s)
- Hyunjoo Yoo
- Department of Communicative Disorders, The University of Alabama, Tuscaloosa, AL, United States
| | - Eugene H. Buder
- School of Communication Sciences and Disorders, The University of Memphis, Memphis, TN, United States
- Institute for Intelligent Systems, The University of Memphis, Memphis, TN, United States
| | - Dale D. Bowman
- Institute for Intelligent Systems, The University of Memphis, Memphis, TN, United States
- Department of Mathematical Sciences, The University of Memphis, Memphis, TN, United States
| | - Gavin M. Bidelman
- School of Communication Sciences and Disorders, The University of Memphis, Memphis, TN, United States
- Institute for Intelligent Systems, The University of Memphis, Memphis, TN, United States
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, United States
| | - D. Kimbrough Oller
- School of Communication Sciences and Disorders, The University of Memphis, Memphis, TN, United States
- Institute for Intelligent Systems, The University of Memphis, Memphis, TN, United States
- Konrad Lorenz Institute for Evolution and Cognition Research, Klosterneuburg, Austria
| |
Collapse
|
40
|
Alexander JM. The S-SH Confusion Test and the Effects of Frequency Lowering. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:1486-1505. [PMID: 31063023 DOI: 10.1044/2018_jslhr-h-18-0267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Purpose Frequency lowering in hearing aids can cause listeners to perceive [s] as [ʃ]. The S-SH Confusion Test, which consists of 66 minimal word pairs spoken by 6 female talkers, was designed to help clinicians and researchers document these negative side effects. This study's purpose was to use this new test to evaluate the hypothesis that these confusions will increase to the extent that low frequencies are altered. Method Twenty-one listeners with normal hearing were each tested on 7 conditions. Three were control conditions that were low-pass filtered at 3.3, 5.0, and 9.1 kHz. Four conditions were processed with nonlinear frequency compression (NFC): 2 had a 3.3-kHz maximum audible output frequency (MAOF), with a start frequency (SF) of 1.6 or 2.2 kHz; 2 had a 5.0-kHz MAOF, with an SF of 1.6 or 4.0 kHz. Listeners' responses were analyzed using concepts from signal detection theory. Response times were also collected as a measure of cognitive processing. Results Overall, [s] for [ʃ] confusions were minimal. As predicted, [ʃ] for [s] confusions increased for NFC conditions with a lower versus higher MAOF and with a lower versus higher SF. Response times for trials with correct [s] responses were shortest for the 9.1-kHz control and increased for the 5.0- and 3.3-kHz controls. NFC response times were also significantly longer as MAOF and SF decreased. The NFC condition with the highest MAOF and SF had statistically shorter response times than its control condition, indicating that, under some circumstances, NFC may ease cognitive processing. Conclusions Large differences in the S-SH Confusion Test across frequency-lowering conditions show that it can be used to document a major negative side effect associated with frequency lowering. Smaller but significant differences in response times for correct [s] trials indicate that NFC can help or hinder cognitive processing, depending on its settings.
Collapse
Affiliation(s)
- Joshua M Alexander
- Department of Speech, Language, and Hearing Sciences, Purdue University,West Lafayette, Indiana
| |
Collapse
|
41
|
Schreiber KE, McMurray B. Listeners can anticipate future segments before they identify the current one. Atten Percept Psychophys 2019; 81:1147-1166. [PMID: 31087271 PMCID: PMC6688751 DOI: 10.3758/s13414-019-01712-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Speech unfolds rapidly over time, and the information necessary to recognize even a single phoneme may not be available simultaneously. Consequently, listeners must both integrate prior acoustic cues and anticipate future segments. Prior work on stop consonants and vowels suggests that listeners integrate asynchronous cues by partially activating lexical entries as soon as any information is available, and then updating this when later cues arrive. However, a recent study suggests that for the voiceless sibilant fricatives (/s/ and /ʃ/), listeners wait to initiate lexical access until all cues have arrived at the onset of the vowel. Sibilants also contain coarticulatory cues that could be used to anticipate the vowel upcoming. However, given these results, it is unclear if listeners could use them fast enough to speed vowel recognition. The current study examines anticipation by asking when listeners use coarticulatory information in the frication to predict the upcoming vowel. A visual world paradigm experiment found that listeners do not wait: they anticipate the vowel immediately from the onset of the frication, even as they wait several hundred milliseconds to identify the fricative. This finding suggests listeners do not strictly process phonemes in the order that they appear; rather the dynamics of language processing may be largely internal and only loosely coupled to the dynamics of the input.
Collapse
Affiliation(s)
- Kayleen E Schreiber
- Interdisciplinary Graduate Program in Neuroscience, University of Iowa, Iowa City, IA, USA
| | - Bob McMurray
- Department of Psychological and Brain Sciences, Department of Communication Sciences and Disorders, Department of Linguistics, University of Iowa, W311 SSH, Iowa City, IA, 52242, USA.
| |
Collapse
|
42
|
Lowenstein JH, Nittrouer S. Perception-Production Links in Children's Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:853-867. [PMID: 30986136 PMCID: PMC6802887 DOI: 10.1044/2018_jslhr-s-18-0178] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 09/25/2018] [Accepted: 12/13/2018] [Indexed: 06/02/2023]
Abstract
Purpose Child phonologists have long been interested in how tightly speech input constrains the speech production capacities of young children, and the question acquires clinical significance when children with hearing loss are considered. Children with sensorineural hearing loss often show differences in the spectral and temporal structures of their speech production, compared to children with normal hearing. The current study was designed to investigate the extent to which this problem can be explained by signal degradation. Method Ten 5-year-olds with normal hearing were recorded imitating 120 three-syllable nonwords presented in unprocessed form and as noise-vocoded signals. Target segments consisted of fricatives, stops, and vowels. Several measures were made: 2 duration measures (voice onset time and fricative length) and 4 spectral measures involving 2 segments (1st and 3rd moments of fricatives and 1st and 2nd formant frequencies for the point vowels). Results All spectral measures were affected by signal degradation, with vowel production showing the largest effects. Although a change in voice onset time was observed with vocoded signals for /d/, voicing category was not affected. Fricative duration remained constant. Conclusions Results support the hypothesis that quality of the input signal constrains the speech production capacities of young children. Consequently, it can be concluded that the production problems of children with hearing loss-including those with cochlear implants-can be explained to some extent by the degradation in the signal they hear. However, experience with both speech perception and production likely plays a role as well.
Collapse
Affiliation(s)
- Joanna H. Lowenstein
- Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville
| | - Susan Nittrouer
- Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville
| |
Collapse
|
43
|
Cibelli E. Training Non-Native Consonant Production with Perceptual and Articulatory Cues. PHONETICA 2019; 77:1-28. [PMID: 30836370 DOI: 10.1159/000495728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 11/24/2018] [Indexed: 06/09/2023]
Abstract
BACKGROUND/AIMS Adult learners often struggle to produce novel phonemes in a second language and lack clear articulatory targets. This study investigates the combined efficacy of perceptual and articulatory training, the latter involving explicit instruction about tongue position and laryngeal control, for the production of non-native phonemes. METHODS Native English speakers were trained on a series of Hindi coronal stop consonants, with production assessed before, during, and after training sessions, on the basis of acoustic cues to place of articulation and voicing. RESULTS Improvement in production was most apparent during artic u-latory training, when cues to target articulation were available to learners. Some improvements were maintained after training was concluded. CONCLUSION Articu-latory training can contribute useful cues to pronunciation for early learners. Improvement in acquisition of targets varies in stability across learners and targets.
Collapse
Affiliation(s)
- Emily Cibelli
- Department of Linguistics, Northwestern University, Evanston, Illinois, USA,
| |
Collapse
|
44
|
Hussain Q, Shinohara S. Partial devoicing of voiced geminate stops in Tokyo Japanese. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:149. [PMID: 30710917 DOI: 10.1121/1.5078605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 10/25/2018] [Indexed: 06/09/2023]
Abstract
Tokyo Japanese has a constraint against voiced geminate stops in its native lexicon. The present study investigates whether recently introduced word-medial voiced geminate stops [C1V1C(C)2V2] are differentiated from voiceless geminates and voiced singletons in terms of duration, voicing during closure, and spectral moments of stop release bursts. The findings suggest that the voiceless and voiced singleton stops were clearly differentiated by C2 duration. In contrast, C2 duration of the voiceless and voiced geminate stops was not significantly different. The devoicing of the word-medial stops was not only observed in voiced geminates, but voiced singletons also showed devoicing. The duration of the preceding vowel (V1) distinguished the voicing contrast in both singleton and geminate stops. The first four spectral moments of C2 stop release bursts did not distinguish the length and voicing contrasts in stops. These results indicate that, although word-medial voiced geminate stops are fully or partially devoiced, the Tokyo Japanese speakers lengthen the preceding vowels (V1) to maintain a voicing contrast. Production patterns of the voiced geminates are considered in relation to marginal or intermediate phonological contrast.
Collapse
Affiliation(s)
- Qandeel Hussain
- Department of Linguistics, ARC Centre of Excellence in Cognition and its Disorders, Macquarie University, Sydney, 16 University Avenue, Australian Hearing Hub, Balaclava Road, North Ryde, NSW 2109, Australia
| | - Shigeko Shinohara
- Laboratoire de Phonétique et Phonologie (UMR7018, Centre National de la Recherche Scientifique - Sorbonne Nouvelle), 19 rue des Bernardins, Paris, 75005, France
| |
Collapse
|
45
|
Pittman AL, Daliri A, Meadows L. Vocal Biomarkers of Mild-to-Moderate Hearing Loss in Children and Adults: Voiceless Sibilants. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:2814-2826. [PMID: 30458528 DOI: 10.1044/2018_jslhr-h-17-0460] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 07/08/2018] [Indexed: 06/09/2023]
Abstract
PURPOSE The purpose of this study was to determine if an objective measure of speech production could serve as a vocal biomarker for the effects of high-frequency hearing loss on speech perception. It was hypothesized that production of voiceless sibilants is governed sufficiently by auditory feedback that high-frequency hearing loss results in subtle but significant shifts in the spectral characteristics of these sibilants. METHOD Sibilant production was examined in individuals with mild to moderately severe congenital (22 children; 8-17 years old) and acquired (23 adults; 55-80 years old) hearing losses. Measures of hearing level (pure-tone average thresholds at 4 and 8 kHz), speech perception (detection of nonsense words within sentences), and speech production (spectral center of gravity [COG] for /s/ and /ʃ/) were obtained in unaided and aided conditions. RESULTS For both children and adults, detection of nonsense words increased significantly as hearing thresholds improved. Spectral COG for /ʃ/ was unaffected by hearing loss in both listening conditions, whereas the spectral COG for /s/ significantly decreased as high-frequency hearing loss increased. The distance in spectral COG between /s/ and /ʃ/ decreased significantly with increasing hearing level. COG distance significantly predicted nonsense-word detection in children but not in adults. CONCLUSIONS At least one aspect of speech production (voiceless sibilants) is measurably affected by high-frequency hearing loss and is related to speech perception in children. Speech production did not predict speech perception in adults, suggesting a more complex relationship between auditory feedback and feedforward mechanisms with age. Even so, these results suggest that this vocal biomarker may be useful for identifying the presence of high-frequency hearing loss in adults and children and for predicting the impact of hearing loss in children.
Collapse
Affiliation(s)
- Andrea L Pittman
- Department of Speech and Hearing Science, Arizona State University, Tempe
| | - Ayoub Daliri
- Department of Speech and Hearing Science, Arizona State University, Tempe
| | - Lauren Meadows
- Department of Speech and Hearing Science, Arizona State University, Tempe
| |
Collapse
|
46
|
Zharkova N, Hardcastle WJ, Gibbon FE. The dynamics of voiceless sibilant fricative production in children between 7 and 13 years old: An ultrasound and acoustic study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:1454. [PMID: 30424626 DOI: 10.1121/1.5053585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 08/23/2018] [Indexed: 06/09/2023]
Abstract
This study reports on dynamic tongue shape and spectral characteristics of sibilant fricatives /s/ and /ʃ/ in Scottish English speaking children aged between 7 and 13 years old. The sequences /əCa/ and /əCi/ were produced by 40 children, with ten participants in each age group, and two-year intervals between successive groups. Productions of the same sequences by ten adults were used for comparison with the children's data. Quantitative dynamic analyses were carried out on spectral information and on ultrasound imaging data on tongue shape. All age groups differentiated between the two consonants in the fricative centroid and in tongue shape. Vowel-on-consonant effects showed consonant-specific patterns across age groups without a consistent increase or decrease in the extent of coarticulation with increasing age. The extent of discriminability between the two fricatives increased with age on both acoustic and articulatory measures. Younger speakers were generally more variable than older speakers. Complementary findings from the centroid and tongue shape measures suggest that age-related differences are due to the ongoing maturation of controlling the tongue in coordination with other articulators, particularly the jaw, throughout childhood.
Collapse
Affiliation(s)
- Natalia Zharkova
- Clinical Audiology, Speech and Language Research Centre, Queen Margaret University, Queen Margaret University Drive, Musselburgh, East Lothian, EH21 6UU, United Kingdom
| | - William J Hardcastle
- Clinical Audiology, Speech and Language Research Centre, Queen Margaret University, Queen Margaret University Drive, Musselburgh, East Lothian, EH21 6UU, United Kingdom
| | - Fiona E Gibbon
- Department of Speech and Hearing Sciences, University College Cork, Brookfield Health Sciences Complex, College Road, Cork, Ireland
| |
Collapse
|
47
|
Johnson AA, Reidy PF, Edwards JR. Quantifying robustness of the /t/-/k/ contrast using a single, static spectral feature. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:EL105. [PMID: 30180689 PMCID: PMC6086688 DOI: 10.1121/1.5049702] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2018] [Revised: 07/19/2018] [Accepted: 07/22/2018] [Indexed: 06/08/2023]
Abstract
Dynamic spectral shape features accurately classify /t/ and /k/ productions across speakers and contexts. This paper shows that word-initial /t/ and /k/ tokens produced by 21 adults can be differentiated using a single, static spectral feature when spectral energy concentration is considered relative to expectations within a given speaker and vowel context. Centroid and peak frequency-calculated from both acoustic and psychoacoustic spectra-were compared to determine whether one feature could reliably differentiate /t/ and /k/, and, if so, which feature best differentiated them. Centroid frequency from both acoustic and psychoacoustic spectra accurately classified productions of /t/ and /k/.
Collapse
Affiliation(s)
- Allison A Johnson
- Department of Hearing and Speech Sciences, University of Maryland-College Park, 0121 Taliaferro Hall, Chapel Drive, College Park, Maryland 20724, USA
| | - Patrick F Reidy
- Callier Center for Communication Disorders, University of Texas at Dallas, 1966 Inwood Road, Dallas, Texas 75235, USA
| | - Jan R Edwards
- Department of Hearing and Speech Sciences, University of Maryland-College Park, 0121 Taliaferro Hall, Chapel Drive, College Park, Maryland 20724, USA
| |
Collapse
|
48
|
Redmon C, Jongman A. Source characteristics of voiceless dorsal fricatives. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:242. [PMID: 30075652 DOI: 10.1121/1.5045345] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Aerodynamic and acoustic data on voiceless dorsal fricatives [x/χ] in Arabic, Persian, and Spanish were recorded to measure the extent to which such productions involve trilling of the uvula, thus exhibiting a sound source which, contrary to assumptions for voiceless fricatives, is mixed rather than aperiodic. Oscillation in airflow indicative of uvular vibration was present more often than not in Arabic (63%) and Persian (75%), while Spanish dorsal fricatives were more commonly produced with unimodal flow indicative of an aperiodic source. When present, uvular vibration frequencies averaged 68 Hz in Arabic and 67 Hz in Persian. Rates of uvular vibration were highly variable, however, and ranged between 40 and 116 Hz, with oscillatory periods averaging 4-5 cycles in duration, with a range of 1-12. The effect of these source characteristics on dorsal fricative acoustics was to significantly skew the spectral shape parameters (M1-M4) commonly used to characterize properties of the anterior filter; however, spectral peak frequency was found to be stable to changes in source characteristics, suggesting the occurrence of trilled tokens is not due to velar-uvular allophony, but rather is more fundamental to dorsal fricative production.
Collapse
Affiliation(s)
- Charles Redmon
- Department of Linguistics, University of Kansas, 1541 Lilac Lane, Lawrence, Kansas 66045, USA
| | - Allard Jongman
- Department of Linguistics, University of Kansas, 1541 Lilac Lane, Lawrence, Kansas 66045, USA
| |
Collapse
|
49
|
Netelenbos N, Gibb RL, Li F, Gonzalez CLR. Articulation Speaks to Executive Function: An Investigation in 4- to 6-Year-Olds. Front Psychol 2018; 9:172. [PMID: 29535655 PMCID: PMC5834422 DOI: 10.3389/fpsyg.2018.00172] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 02/01/2018] [Indexed: 11/13/2022] Open
Abstract
Executive function (EF) and language learning play a prominent role in early childhood development. Empirical research continues to point to a concurrent relation between these two faculties. What has been given little attention, however, is the association between EF and speech articulation abilities in children. This study investigated this relation in children aged 4-6 years. Significant correlations indicated that children with better EF [via parental report of the Behavior Rating Inventory of Executive Function (BRIEF) inventory] exhibited stronger speech sound production abilities in the articulation of the "s" and "sh" sounds. Furthermore, regression analyses revealed that the Global Executive Composite (GEC) of EF as measured by the BRIEF, served as a predictor for speech sound proficiency and that speech sound proficiency served as a predictor for the GEC. Together, these results demonstrate the imbricated nature of EF and speech sound production while bearing theoretical and practical implications. From a theoretical standpoint, the close link between EF and speech articulation may indicate a common ontogenetic pathway. From a practical perspective, the results suggest that children with speech difficulties could be at higher risk for EF deficits.
Collapse
Affiliation(s)
- Nicole Netelenbos
- The Brain in Action Laboratory, University of Lethbridge, Lethbridge, AB, Canada
| | - Robbin L. Gibb
- Canadian Centre for Behavioural Neuroscience, University of Lethbridge, Lethbridge, AB, Canada
| | - Fangfang Li
- Department of Psychology, University of Lethbridge, Lethbridge, AB, Canada
| | | |
Collapse
|
50
|
Reidy PF, Kristensen K, Winn MB, Litovsky RY, Edwards JR. The Acoustics of Word-Initial Fricatives and Their Effect on Word-Level Intelligibility in Children With Bilateral Cochlear Implants. Ear Hear 2018; 38:42-56. [PMID: 27556521 PMCID: PMC5161607 DOI: 10.1097/aud.0000000000000349] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVES Previous research has found that relative to their peers with normal hearing (NH), children with cochlear implants (CIs) produce the sibilant fricatives /s/ and /∫/ less accurately and with less subphonemic acoustic contrast. The present study sought to further investigate these differences across groups in two ways. First, subphonemic acoustic properties were investigated in terms of dynamic acoustic features that indexed more than just the contrast between /s/ and /∫/. Second, the authors investigated whether such differences in subphonemic acoustic contrast between sibilant fricatives affected the intelligibility of sibilant-initial single word productions by children with CIs and their peers with NH. DESIGN In experiment 1, productions of /s/ and /∫/ in word-initial prevocalic contexts were elicited from 22 children with bilateral CIs (aged 4 to 7 years) who had at least 2 years of CI experience and from 22 chronological age-matched peers with NH. Acoustic features were measured from 17 points across the fricatives: peak frequency was measured to index the place of articulation contrast; spectral variance and amplitude drop were measured to index the degree of sibilance. These acoustic trajectories were fitted with growth-curve models to analyze time-varying spectral change. In experiment 2, phonemically accurate word productions that were elicited in experiment 1 were embedded within four-talker babble and played to 80 adult listeners with NH. Listeners were asked to repeat the words, and their accuracy rate was used as a measure of the intelligibility of the word productions. Regression analyses were run to test which acoustic properties measured in experiment 1 predicted the intelligibility scores from experiment 2. RESULTS The peak frequency trajectories indicated that the children with CIs produced less acoustic contrast between /s/ and /∫/. Group differences were observed in terms of the dynamic aspects (i.e., the trajectory shapes) of the acoustic properties. In the productions by children with CIs, the peak frequency and the amplitude drop trajectories were shallower, and the spectral variance trajectories were more asymmetric, exhibiting greater increases in variance (i.e., reduced sibilance) near the fricative-vowel boundary. The listeners' responses to the word productions indicated that when produced by children with CIs, /∫/-initial words were significantly more intelligible than /s/-initial words. However, when produced by children with NH, /s/-initial words and /∫/-initial words were equally intelligible. Intelligibility was partially predicted from the acoustic properties (Cox & Snell pseudo-R > 0.190), and the significant predictors were predominantly dynamic, rather than static, ones. CONCLUSIONS Productions from children with CIs differed from those produced by age-matched NH controls in terms of their subphonemic acoustic properties. The intelligibility of sibilant-initial single-word productions by children with CIs is sensitive to the place of articulation of the initial consonant (/∫/-initial words were more intelligible than /s/-initial words), but productions by children with NH were equally intelligible across both places of articulation. Therefore, children with CIs still exhibit differential production abilities for sibilant fricatives at an age when their NH peers do not.
Collapse
Affiliation(s)
- Patrick F. Reidy
- Waisman Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Communication Sciences and Disorders, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Kayla Kristensen
- Waisman Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Communication Sciences and Disorders, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Matthew B. Winn
- Waisman Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Surgery, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Speech & Hearing Sciences, University of Washington, Seattle, Washington, USA
| | - Ruth Y. Litovsky
- Waisman Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Communication Sciences and Disorders, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Surgery, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Jan R. Edwards
- Waisman Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Communication Sciences and Disorders, University of Wisconsin—Madison, Madison, Wisconsin, USA
| |
Collapse
|