1
|
Narayana S, Parsons MB, Zhang W, Franklin C, Schiller K, Choudhri AF, Fox PT, LeDoux MS, Cannito M. Mapping typical and hypokinetic dysarthric speech production network using a connected speech paradigm in functional MRI. NEUROIMAGE-CLINICAL 2020; 27:102285. [PMID: 32521476 PMCID: PMC7284131 DOI: 10.1016/j.nicl.2020.102285] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2020] [Revised: 05/13/2020] [Accepted: 05/17/2020] [Indexed: 12/18/2022]
Abstract
We developed a task paradigm whereby subjects spoke aloud while minimizing head motion during functional MRI (fMRI) in order to better understand the neural circuitry involved in motor speech disorders due to dysfunction of the central nervous system. To validate our overt continuous speech paradigm, we mapped the speech production network (SPN) in typical speakers (n = 19, 10 females) and speakers with hypokinetic dysarthria as a manifestation of Parkinson disease (HKD; n = 21, 8 females) in fMRI. We then compared it with the SPN derived during overt speech production by 15O-water PET in the same group of typical speakers and another HKD cohort (n = 10, 2 females). The fMRI overt connected speech paradigm did not result in excessive motion artifacts and successfully identified the same brain areas demonstrated in the PET studies in the two cohorts. The SPN derived in fMRI demonstrated significant spatial overlap with the corresponding PET derived maps (typical speakers: r = 0.52; speakers with HKD: r = 0.43) and identified the components of the neural circuit of speech production belonging to the feedforward and feedback subsystems. The fMRI study in speakers with HKD identified significantly decreased activity in critical feedforward (bilateral dorsal premotor and motor cortices) and feedback (auditory and somatosensory areas) subsystems replicating previous PET study findings in this cohort. These results demonstrate that the overt connected speech paradigm is feasible during fMRI and can accurately localize the neural substrates of typical and disordered speech production. Our fMRI paradigm should prove useful for study of motor speech and voice disorders, including stuttering, apraxia of speech, dysarthria, and spasmodic dysphonia.
Collapse
Affiliation(s)
- Shalini Narayana
- Department of Pediatrics, Division of Pediatric Neurology, University of Tennessee Health Science Center, Memphis, TN 38103, USA; Neuroscience Institute, Le Bonheur Children's Hospital, Memphis, TN 38103, USA; Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN 38103, USA.
| | - Megan B Parsons
- School of Communication Sciences and Disorders, University of Memphis, Memphis, TN 38152, USA
| | - Wei Zhang
- Research Imaging Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Crystal Franklin
- Research Imaging Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Katherine Schiller
- Department of Pediatrics, Division of Pediatric Neurology, University of Tennessee Health Science Center, Memphis, TN 38103, USA
| | - Asim F Choudhri
- Neuroscience Institute, Le Bonheur Children's Hospital, Memphis, TN 38103, USA; Department of Radiology, Division of Neuroradiology, University of Tennessee Health Science Center, Memphis, TN 38103, USA
| | - Peter T Fox
- Research Imaging Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Mark S LeDoux
- Veracity Neuroscience LLC, Memphis, TN 38157, USA; Department of Psychology and School of Health Studies, University of Memphis, Memphis, TN 38152, USA
| | - Michael Cannito
- Department of Communicative Disorders, University of Louisiana at Lafayette, USA
| |
Collapse
|
2
|
Montgomery A, Reed PE, Crass KA, Hubbard HI, Stith J. The effects of measurement error and vowel selection on the locus equation measure of coarticulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:2747-2750. [PMID: 25373974 DOI: 10.1121/1.4896460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Effects on the slope of introducing error in the F2 Hz values in locus equations (LEs) and of using fewer than ten vowels were investigated. For each of the initial consonants /b, d, g/, 2000 simulated sets were generated using Monte Carlo techniques. The sets were altered with 50, 100, or 200 Hz error being randomly applied to each F2 value within a set. Selected vowels were then removed from the sets and the effects on the slopes were measured. Results suggest that the LE slopes are generally resistant to error and reduced number of vowels. Effects of adding 50 Hz of random error to the F2 values in sets using eight or ten vowels were minimal, yielding a mean absolute change in slope less than 0.07.
Collapse
Affiliation(s)
- Allen Montgomery
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208
| | - Paul E Reed
- Linguistics Program, University of South Carolina, Columbia, South Carolina 29208
| | - Kimberlee A Crass
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208
| | - H Isabel Hubbard
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208
| | - Joanna Stith
- Listen Foundation, Greenwood Village, Colorado 80111
| |
Collapse
|
3
|
Sussman HM, Byrd CT, Guitar B. The integrity of anticipatory coarticulation in fluent and non-fluent tokens of adults who stutter. CLINICAL LINGUISTICS & PHONETICS 2011; 25:169-186. [PMID: 21080828 DOI: 10.3109/02699206.2010.517896] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
This article analysed the acoustic structure of voiced stop + vowel sequences in a group of persons who stutter (PWS). This phonetic unit was chosen because successful production is highly dependent on the differential tweaking of right-to-left anticipatory coarticulation as a function of stop place. Thus, essential elements of both speech motor planning and execution can be parsimoniously assessed. Five adult PWS read three passages 3 times in a randomised order. These passages contained an overabundance of words beginning with initial [bV], [dV] and [gV] sequences. Digital audio and visual recordings were analysed to first identify fluent and stuttered target words, which were then spectrally analysed to yield locus equation (LE) regression plots. The slope of the LE regression function directly indexes the coarticulatory extent of the vowel's influence on the preceding stop consonant. The PWS revealed LE parameters falling within the normal ranges based on previously documented data obtained from fluent speakers. Theoretical considerations of possible underlying factors responsible for stuttering disfluencies are discussed relevant to these findings.
Collapse
Affiliation(s)
- Harvey M Sussman
- Department of Linguistics, University of Texas at Austin, Austin, TX 78712, USA.
| | | | | |
Collapse
|
4
|
Alwan A, Jiang J, Chen W. Perception of Place of Articulation for Plosives and Fricatives in Noise. SPEECH COMMUNICATION 2011; 53:195-209. [PMID: 21499546 PMCID: PMC3076800 DOI: 10.1016/j.specom.2010.09.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
This study aims at uncovering perceptually-relevant acoustic cues for the labial versus alveolar place of articulation distinction in syllable-initial plosives {/b/,/d/,/p/,/t/} and fricatives {/f/,/s/,/v/,/z/} in noise. Speech materials consisted of naturally-spoken consonant-vowel (CV) syllables from four talkers where the vowel was one of {/a/,/i/,/u/}. Acoustic analyses using logistic regression show that formant frequency measurements, relative spectral amplitude measurements, and burst/noise durations are generally reliable cues for labial/alveolar classification. In a subsequent perceptual experiment, each pair of syllables with the labial/alveolar distinction (e.g., /ba,da/) was presented to listeners in various levels of signal-to-noise-ratio (SNR) in a 2-AFC task. A threshold SNR was obtained for each syllable pair using sigmoid fitting of the percent correct scores. Results show that the perception of the labial/alveolar distinction in noise depends on the manner of articulation, the vowel context, and interaction between voicing and manner of articulation. Correlation analyses of the acoustic measurements and threshold SNRs show that formant frequency measurements (such as F1 and F2 onset frequencies and F2 and F3 frequency changes) become increasingly important for the perception of labial/alveolar distinctions as the SNR degrades.
Collapse
Affiliation(s)
- Abeer Alwan
- Corresponding author (Abeer Alwan), (Jintao Jiang)
| | - Jintao Jiang
- Corresponding author (Abeer Alwan), (Jintao Jiang)
| | | |
Collapse
|
5
|
Whiteside SP, Rixon E. Speech characteristics of monozygotic twins and a same-sex sibling: an acoustic case study of coarticulation patterns in read speech. PHONETICA 2003; 60:273-297. [PMID: 15004495 DOI: 10.1159/000076377] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2003] [Accepted: 12/23/2003] [Indexed: 05/24/2023]
Abstract
This case study reports on an acoustic investigation of the motor speech characteristics of a set of young adult male monozygotic (MZ) twins and compares them to those of an age- and sex-matched sibling who participated in the study 2 years later to match for demographic factors. Coarticulation patterns were investigated from read samples of consonant-vowel sequences in monosyllabic words containing a variety of consonants and vowels. This was done by examining F(2) vowel onsets and F(2) vowel targets, plotted as F(2) locus equations. Data were processed for between-sibling differences using a number of statistical tests. Results indicated that the MZ twins displayed F(2) parameters and coarticulation patterns which were more similar than those of their age- and sex-matched sibling. The results of this case study therefore suggest that acoustic phonetic parameters used to index coarticulation patterns have the potential to profile some of the similarities and differences in the speech characteristics of genetically related individuals.
Collapse
Affiliation(s)
- S P Whiteside
- Department of Human Communication Sciences, University of Sheffield, Sheffield S10 2TA, United Kingdom.
| | | |
Collapse
|
6
|
Nijland L, Maassen B, van der Meulen S. Evidence of motor programming deficits in children diagnosed with DAS. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2003; 46:437-450. [PMID: 14700384 DOI: 10.1044/1092-4388(2003/036)] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
In this study the hypothesis of motor programming involvement in developmental apraxia of speech (DAS) was investigated by studying articulatory compensation. Five children with DAS and 5 normally speaking children (age 5;0 [years;months] to 6;10), and 6 adult women produced utterances in a normal speaking condition and in a bite-block condition in which the mandible was kept in a fixed position. Throughout the utterances, the course of the second formant was used to determine articulatory compensation and the effect of the bite block on anticipatory coarticulation. Results showed that the bite-block condition in normally speaking children, like in adult women, did not affect the extent of anticipatory coarticulation. In the speech of children with DAS, the bite block had large effects on coarticulatory patterns and on vowel quality, which, contrary to expectations, had improved. These results are interpreted as a clear demonstration of deficient motor programming in DAS.
Collapse
Affiliation(s)
- Lian Nijland
- Department of Pediatric Neurology, University Medical Center Nijmegen, Nijmegen, The Netherlands.
| | | | | |
Collapse
|
7
|
Subramanian A, Yairi E, Amir O. Second formant transitions in fluent speech of persistent and recovered preschool children who stutter. JOURNAL OF COMMUNICATION DISORDERS 2003; 36:59-75. [PMID: 12493638 DOI: 10.1016/s0021-9924(02)00135-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
UNLABELLED This study investigated frequency change and duration of the second formant (F2) transitions in perceptually fluent speech samples recorded close to stuttering onset in preschool age children. Comparisons were made among 10 children known to eventually persist in stuttering, 10 who eventually recovered from stuttering, and 10 normally fluent controls. All were enrolled in the longitudinal Stuttering Research Project at the University of Illinois. Subjects fluently repeated standard experimental sentences. The same 36 perceptually fluent target segments (syllables embedded in words) from each subject's repeated sentences were analyzed. The syllables were divided into three phonetic categories based on their initial consonant: bilabial, alveolar, and velar placement. The frequency change and duration of F2 transitions were analyzed for each of the target CV segments. F2 transition onset and offset frequencies and their interval (duration) were measured for each utterance. Data indicate that near stuttering onset, children whose stuttering eventually persisted demonstrated significantly smaller frequency change than that of the recovered group. It is suggested that the F2 transitions should continue to be investigated as a possible predictor of stuttering pathways. LEARNING OUTCOMES (1) Readers will learn about studies regarding second formant transition related to stuttering. (2) Readers will learn about differences between children who persist in stuttering and those who recover from stuttering. (3) Readers will learn about research concerned with early identification of risk criteria in persistent stuttering.
Collapse
Affiliation(s)
- Anu Subramanian
- University of Illinois, Speech and Hearing Science, 901 S. Sixth Street, Champaign, IL 61820, USA
| | | | | |
Collapse
|
8
|
Kardach J, Wincowski R, Metz DE, Schiavetti N, Whitehead RL, Hillenbrand J. Preservation of place and manner cues during simultaneous communication: a spectral moments perspective. JOURNAL OF COMMUNICATION DISORDERS 2002; 35:533-542. [PMID: 12443051 DOI: 10.1016/s0021-9924(02)00121-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
UNLABELLED Spectral moments, which describe the distribution of frequencies in a spectrum, were used to investigate the preservation of acoustic cues to intelligibility of speech produced during simultaneous communication (SC) in relation to acoustic cues produced when speaking alone. The spectral moment data obtained from speech alone (SA) were comparable to those spectral moment data reported by Jongman, Wayland, and Wong (2000) and Nittrouer (1995). The spectral moments obtained from speech produced during SC were statistically indistinguishable from those obtained during SA, indicating no measurable degradation of obstruent spectral acoustic cues during SC. EDUCATIONAL OBJECTIVES As a result of this activity, the participant will be able to (1) describe SC; (2) explain the role of SC in communication with children who are deaf; (3) describe the first, third, and fourth spectral moments of obstruent consonants; and (4) identify spectral moment patterns in speech produced during SC.
Collapse
Affiliation(s)
- Jill Kardach
- Department of Communicative Disorders & Sciences, State University of New York, Geneseo 14454, USA
| | | | | | | | | | | |
Collapse
|
9
|
Tabain M. Voiceless consonants and locus equations: a comparison with electropalatographic data on coarticulation. PHONETICA 2002; 59:20-37. [PMID: 11961419 DOI: 10.1159/000056203] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The appropriateness of using locus equations in a study of voiceless consonants is examined. The consonants investigated are the stops /t k/ and the fricatives /theta s integral/. The slope value of the locus equation, indicating degree of coarticulation in the CV syllable, is compared with electropalatographic (EPG) data on coarticulation. It is shown that, overall, there is a very poor correlation between locus equation and EPG data as regards coarticulation. It is also shown that more accurate locus equation results - in terms of their correlation with EPG data - are obtained for stop consonants when F2 onset is sampled at stop release, rather than at the onset of voicing for the vowel. Finally, results for the voiceless consonants are compared with results for their homorganic voiced counterparts. Whilst there is no significant difference between voiced and voiceless consonants in the EPG data, there is a significant difference between voiced and voiceless consonants in the locus equation data. These results suggest that locus equations cannot provide invariant cues for stop and fricative place of articulation across the voiced-voiceless distinction.
Collapse
Affiliation(s)
- Marija Tabain
- Speech, Hearing and Language Research Centre, Department of Linguistics, Macquarie University, Sydney, Australia.
| |
Collapse
|
10
|
Sussman HM. Representation of phonological categories: a functional role for auditory columns. BRAIN AND LANGUAGE 2002; 80:1-13. [PMID: 11817886 DOI: 10.1006/brln.2001.2489] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
A speculative model for representing phonological categories for stop place of articulation is offered. The model is analogous to neural columns in both inferotemporal cortex in the macaque for achieving constancy in object recognition and in the inferior colliculus of the barn owl for resolving ambiguous phase disparities for sound localization. Hypothetical auditory columns for encoding the vowel context-induced variability of F2 transitions are described using acoustic data from locus equation scatterplots of human speech. Such analyses serve to absorb the allophonic variability inherent in coarticulated speech utterances.
Collapse
Affiliation(s)
- Harvey M Sussman
- Department of Linguistics, University of Texas, Austin, TX 78746, USA.
| |
Collapse
|
11
|
Nelson MA, Hodge MM. Effects of facial paralysis and audiovisual information on stop place identification. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2000; 43:158-171. [PMID: 10668659 DOI: 10.1044/jslhr.4301.158] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
This study investigated how listeners' perceptions of bilabial and lingua-alveolar voiced stops in auditory (A) and audiovisual (AV) presentation modes were influenced by articulatory function in a girl with bilateral facial paralysis (BFP) and a girl with normal facial movement (NFM). The Fuzzy Logic Model of Perception (FLMP) was used to make predictions about listeners' identifications of stop place based on assumptions about the nature (clear, ambiguous, or conflicting) of the A or AV cues produced by each child during /b/ and /d/ CV syllables. As predicted, (a) listeners' identification scores for NFM were very high and reliable, regardless of presentation mode or stop place, (b) listeners' identification scores for BFP were high for lingua-alveolar place, regardless of presentation mode, but more variable and less reliable than for NFM; significantly lower (overall at a chance level) for bilabial place in the A mode; and lowest for bilabial place in the AV mode. Conflicting visual cues for stop place for BFP's productions of /bV/ syllables influenced listeners' perceptions, resulting in most of her bilabial syllables being misidentified in the AV mode. F2 locus equations for each child's /bV/ and /dV/ syllables showed patterns similar to those reported by previous investigators, but with less differentiation between stop place for BFP than NFM. These acoustic results corresponded to the perceptual results obtained. (That is, when presented with only auditory information, on average, listeners perceived BFP's target /b/ syllables to be near the boundary between /b/ and /d/.)
Collapse
Affiliation(s)
- M A Nelson
- Department of Speech Pathology and Audiology, University of Alberta, Edmonton, Canada
| | | |
Collapse
|
12
|
Hertrich I, Ackermann H. Temporal and spectral aspects of coarticulation in ataxic dysarthria: an acoustic analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 1999; 42:367-381. [PMID: 10229453 DOI: 10.1044/jslhr.4202.367] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
In order to analyze the impact of cerebellar disorders on temporal and spectral aspects of coarticulation, 9 individuals with cerebellar dysfunction and 9 controls were asked to produce test sentences comprising a target vowel (V = [a], [i], or [u]) within a schwa-t-V-t-schwa environment. The control speakers were investigated both at their habitual speech tempo and under a slow speaking condition. The squared distances between averaged FFT spectra served as a quantitative estimate of target-induced coarticulation, a method that can be applied to consonants as well as vowels, and which avoids the shortcomings of formant analysis. In order to test the significance of coarticulation effects at the level of individual speakers and to obtain F values as a further measure of the strength of coarticulation, multivariate tests of target effects were performed, with the first 6 principal components derived from the spectra of each speaker. First, inconsistent patterns of anticipatory vowel-to-vowel (W) interactions emerged across individuals, and neither significant group differences nor any effects of speech rate could be detected. The underlying control mechanisms thus seem to be segment-linked subject-specific specifications of the pretarget vowel, uncompromised by cerebellar dysfunction. Second, all participants exhibited highly significant anticipatory [t(h)]-to-vowel (CV) coarticulation. This effect was slightly smaller in the cerebellar group than in the control group, which can be at least partially explained by reduced spectral distances among the 3 target vowels. Speech rate did not influence the CV effects of the control group. As concerns temporal aspects of coarticulation, no significant group differences emerged in terms of length adjustments of the pretarget consonant to the intrinsic duration of the target vowel. Third, ataxic speakers showed a tendency toward enlarged perseverative vowel-to-[t(h)] (VC) and W effects if their slow speech rate was taken into account. Retentive coarticulation turned out to be similar in slow ataxic speakers and in fast-speaking controls. However, significant attenuation of these effects emerged in the latter group under the condition of decreased speech tempo. In summary, these results corroborate the suggestion of different mechanisms of gestural overlap in the temporal domain: Whereas perseverative coarticulation, presumably, reflects biomechanical or motor constraints, anticipation seems to represent higher level phonetic processing.
Collapse
Affiliation(s)
- I Hertrich
- Department of Neurology, University of Tübingen, Germany.
| | | |
Collapse
|
13
|
Tjaden K, Weismer G. Speaking-rate-induced variability in F2 trajectories. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 1998; 41:976-989. [PMID: 9771622 DOI: 10.1044/jslhr.4105.976] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
This study examined speaking-rate-induced spectral and temporal variability of F2 formant trajectories for target words produced in a carrier phrase at speaking rates ranging from fast to slow. F2 onset frequency measured at the first glottal pulse following the stop consonant release in target words was used to quantify the extent to which adjacent consonantal and vocalic gestures overlapped; F2 target frequency was operationally defined as the first occurrence of a frequency minimum or maximum following F2 onset frequency. Regression analyses indicated 70% of functions relating F2 onset and vowel duration were statistically significant. The strength of the effect was variable, however, and the direction of significant functions often differed from that predicted by a simple model of overlapping, sliding gestures. Results of a partial correlation analysis examining interrelationships among F2 onset, F2 target frequency, and vowel duration across the speaking rate range indicated that covariation of F2 target with vowel duration may obscure the relationship between F2 onset and vowel duration across rate. The results further suggested that a sliding based model of acoustic variability associated with speaking rate change only partially accounts for the present data, and that such a view accounts for some speakers' data better than others.
Collapse
Affiliation(s)
- K Tjaden
- Department of Communicative Disorders and Sciences, State University of New York at Buffalo, 14260, USA
| | | |
Collapse
|
14
|
Sussman HM, Dalston E, Gumbert S. The effect of speaking style on a locus equation characterization of stop place of articulation. PHONETICA 1998; 55:204-225. [PMID: 9933780 DOI: 10.1159/000028433] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Locus equations were employed to assess the phonetic stability and distinctiveness of stop place categories in reduced speech. Twenty-two speakers produced stop consonant + vowel utterances in citation and spontaneous speech. Coarticulatory increases in hypoarticulated speech were documented only for /dV/ and [gV] productions in front vowel contexts. Coarticulatory extents for /bV/ and [gV] in back vowel contexts remained stable across style changes. Discriminant analyses showed equivalent levels of correct classification across speaking styles. CV reduction was quantified by use of Euclidean distances separating stop place categories. Despite sensitivity of locus equation parameters to articulatory differences encountered in informal speech, stop place categories still maintained a clear separability when plotted in a higher-order slope x y-intercept acoustic space.
Collapse
Affiliation(s)
- H M Sussman
- Department of Linguistics, University of TexasAustin, Tex., USA.
| | | | | |
Collapse
|
15
|
Guion SG. The role of perception in the sound change of velar palatalization. PHONETICA 1998; 55:18-52. [PMID: 9693343 DOI: 10.1159/000028423] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Voiceless velar stops may become palatoalveolar affricates before front vowels. This sound change is not only one of the most common types of palatalization, but is a very common sound change in the world's languages. Nevertheless, we do not have an adequate understanding of how this sound change takes place. Three experiments reported here test the hypothesis that velar palatalization is the result of listeners' on-line perceptual reanalysis of fast rate speech. It is shown that velars before front vowels are both acoustically and perceptually similar to palatoalveolars. This supports the proposal that velar palatalization is perceptually conditioned.
Collapse
Affiliation(s)
- S G Guion
- The University of Texas at Austin, Department of Linguistics, Austin, Tex., USA.
| |
Collapse
|
16
|
Brancazio L, Fowler CA. On the relevance of locus equations for production and perception of stop consonants. PERCEPTION & PSYCHOPHYSICS 1998; 60:24-50. [PMID: 9503910 DOI: 10.3758/bf03211916] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
We examined the possible relevance of locus equations to human production and perception of stop consonants. The orderly output constraint (OOC) of Sussman, Fruchter, and Cable (1995) claims that humans have evolved to produce speech such that F2 at consonant release and F2 at vowel midpoint are linearly related for consonants so that developing perceptual systems can form representations in an F2ons-by-F2vowel space. The theory claims that this relationship described by locus equations can distinguish consonants, and that the linearity of locus equations is captured in neural representations and is thus perceptually relevant. We investigated these claims by testing how closely locus equations reflect the production and perception of stop consonants. In Experiment 1, we induced speakers to change their locus equation slope and intercept parameters systematically, but found that consonants remained distinctive in slope-by-intercept space. In Experiment 2, we presented stop-consonant syllables with their bursts removed to listeners, and compared their classification error matrices with the predictions of a model using locus equation prototypes and with those of an exemplar-based model that uses F2ons and F2vowel, but not locus equations. Both models failed to account for a large proportion of the variance in listeners' responses; the locus equation model was no better in its predictions than the exemplar model. These findings are discussed in the context of the OOC.
Collapse
Affiliation(s)
- L Brancazio
- Haskins Laboratories, New Haven, CT 06511, USA.
| | | |
Collapse
|
17
|
Fruchter D, Sussman HM. The perceptual relevance of locus equations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1997; 102:2997-3008. [PMID: 9373987 DOI: 10.1121/1.421012] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Identification curves were estimated for the English consonants /b,d,g/ using five-formant CV synthetic stimuli comprehensively sampling the F2 onset-F2 vowel acoustic space in the vicinity of /b,d,g/ locus equations [H. Sussman et al., J. Acoust. Soc. Am. 90, 1309-1325 (1991)]. The stimuli included 10 English monophthongal vowel contexts, 11 levels of F2 onset per vowel, and 3 levels of F3 onset orthogonally varied with the F2 variables (10 vowels x 11 F2 onsets x 3 F3 onsets = 330 stimuli). After brief training, each of six subjects, three male and three female, was presented eight trials of each of the stimuli, one or two trials per day over a period of several days. Systems of identification curves were visualized as identification surfaces situated in locus equation acoustic space and were overlaid with acoustic data from five male speakers in order to judge the degree of correspondence between perception and acoustic data. A chi square analysis was also performed in order to quantify the correspondence between the observed perception data and expected frequencies derived from the acoustic data. The results, when interpreted in terms of a dominance hierarchy hypothesis, strongly indicate F2 onset and F2 vowel, in combination, serve as important cues for stop consonant place of articulation.
Collapse
Affiliation(s)
- D Fruchter
- Department of Linguistics, University of Texas, Austin 78712, USA
| | | |
Collapse
|
18
|
Nearey TM. Speech perception as pattern recognition. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1997; 101:3241-3254. [PMID: 9193041 DOI: 10.1121/1.418290] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
This work provides theoretical and empirical arguments in favor of an approach to phonetics that is called double-weak. It is so called because it assumes relatively weak constraints both on the articulatory gestures and on the auditory patterns that map phonological elements. This approach views speech production and perception as distinct but cooperative systems. Like the motor theory of speech perception, double-weak theory accepts that phonological units are modified by context in ways that are important to perception. It further agrees that many aspects of such context dependency have their origin in natural articulatory processes. However, double-weak theory sides with proponents of auditory theories of phonetics by accepting that the real-time objects of perception are well-defined auditory patterns. Because speakers find ways to "orderly" output conditions" (Sussman et al., 1995), listeners are able to successfully decode speech using relatively simple pattern-recognition mechanisms. It is suggested that this situation has arisen through a stylization of gestural patterns to accommodate real-time limits of the perceptual system. Results from a new perceptual experiment, involving a four-dimensional stimulus continuum and a 10-category/hVC/response set, are shown to be largely compatible with this framework.
Collapse
Affiliation(s)
- T M Nearey
- Department of Linguistics, University of Alberta, Edmonton, Canada
| |
Collapse
|
19
|
Sussman HM, Bessell N, Dalston E, Majors T. An investigation of stop place of articulation as a function of syllable position: a locus equation perspective. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1997; 101:2826-2838. [PMID: 9165737 DOI: 10.1121/1.418567] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Locus equations were employed to phonetically describe stop place categories as a function of syllable-initial, -medial, and -final position. Ten speakers, five male and five female, produced a total of 2700 CVC and 4500 VCV utterances that were acoustically analyzed to obtain F2 onset, F2 vowel, and F2 offset frequencies for locus equation regression analyses. In general, degree of coarticulation, as indexed by locus equation slope, was reduced for post-vocalic (VC) stops relative to pre-vocalic stops (pooled data from initial and medial positions), but significant differences were observed as a function of stop consonant. All stops showed significantly reduced R2 values and increased standard errors of estimate for VC relative to CV productions. Separability of stop place categories in a higher-order slope X y-intercept acoustic space also diminished for VC vs CV stop productions. The degradation of classic locus equation form (high correlation and linearity) for VC relative to CV productions was attributed to greater articulatory precision in the production of pre-vocalic compared to post-vocalic stops. This greater articulatory precision was interpreted as reflecting a greater need to normalize vowel context-induced variability of the F2 transition for syllable onset relative to final stops. The decline in acoustic lawfulness of syllable-final stops is discussed in terms of coarticulatory interactions and expected perceptual correlates.
Collapse
Affiliation(s)
- H M Sussman
- Department of Linguistics, University of Texas, Austin 78712, USA.
| | | | | | | |
Collapse
|
20
|
Sussman HM, Shore J. Locus equations as phonetic descriptors of consonantal place of articulation. PERCEPTION & PSYCHOPHYSICS 1996; 58:936-46. [PMID: 8768188 DOI: 10.3758/bf03205495] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
This experiment tested whether locus equation coefficients, slope, and y-intercept could serve as indices of place of articulation for obstruents sharing the same place of articulation across different manner classes. Locus equations for 22 speakers were derived from CV/t/ words with initial voiced stop /d/, voiceless aspirated stop /t/, nasal /n/, voiced fricative /z/, and voiceless fricative /s/ preceding 10 vowel contexts. Post hoc tests revealed /d/ = /z/ = /n/ for slope means. Voiced /d/ and voiceless /t/ were also equivalent when F2 transition onset measurement points were equated. Scatterplots of locus equation coefficients revealed three nonoverlapping and distinct clusters when the diverse coronal group was compared with labials and velars. A discriminant analysis using slope and y-intercept as predictors successfully categorized all five coronals into one alveolar group with 87.1% accuracy. The collective results support the contention that locus equations can serve as effective phonetic descriptors of consonant place of articulation across manner classes.
Collapse
Affiliation(s)
- H M Sussman
- Department of Linguistics, University of Texas, Austin 78712, USA.
| | | |
Collapse
|
21
|
Sussman HM, Minifie FD, Buder EH, Stoel-Gammon C, Smith J. Consonant-vowel interdependencies in babbling and early words: preliminary examination of a locus equation approach. JOURNAL OF SPEECH AND HEARING RESEARCH 1996; 39:424-433. [PMID: 8729930 DOI: 10.1044/jshr.3902.424] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Consonant-vowel productions at two distinct stages of language development were studied in a single female child. At 12 months canonical babbling syllables (N = 144) identified by a panel of listeners as comprising [bV], [dV], and [gv] tokens were acoustically analyzed by measuring F2 transition onset and F2 midvowel frequencies and plotting their relationship as locus equations for each stop category. A regression analysis performed on these scatterplots revealed differential slopes and y-intercepts as a function of stop place. The same analysis was performed 9 months later on CV utterances (N = 243) produced as syllable-initial segments of real words by the same child. Whereas labial and velar locus equation parameters moved toward more adult-like values, alveolar slope and y-intercept moved away from adult values and more in the direction of decreased coarticulation between vowel and consonant. There was greater scatter of data points around the regression line for production of words compared to babbling. These results are compared to locus equations obtained from 3-5-year-olds and adults. Locus equations appear to be useful as an empirical developmental probe to document how CV productions gradually approach adult categorical standards.
Collapse
Affiliation(s)
- H M Sussman
- Department of Communication Sciences & Disorders, University of Texas at Austin 78712, USA
| | | | | | | | | |
Collapse
|