1
Correcting the record: Phonetic potential of primate vocal tracts and the legacy of Philip Lieberman (1934-2022). Am J Primatol 2024:e23637. [PMID: 38741274 DOI: 10.1002/ajp.23637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2023] [Revised: 04/24/2024] [Accepted: 04/27/2024] [Indexed: 05/16/2024]
Abstract
The phonetic potential of nonhuman primate vocal tracts has been the subject of considerable contention in recent literature. Here, the work of Philip Lieberman (1934-2022) is considered at length, and two research papers (both purported challenges to Lieberman's theoretical work) and a review of Lieberman's scientific legacy are critically examined. I argue that various aspects of Lieberman's research have been consistently misinterpreted in the literature. A paper by Fitch et al. overestimates the would-be "speech-ready" capacities of a rhesus macaque, and the data presented nonetheless support Lieberman's principal position: that nonhuman primates cannot articulate the full extent of human speech sounds. The suggestion that no vocal anatomical evolution was necessary for the evolution of human speech (as spoken by all normally developing humans) is not supported by phonetic or anatomical data. The second challenge, by Boë et al., attributes vowel-like qualities of baboon calls to articulatory capacities based on audio data; I argue that such "protovocalic" properties likely result from disparate articulatory maneuvers compared to human speakers. A review of Lieberman's scientific legacy by Boë et al. ascribes a view of speech evolution (which the authors term "laryngeal descent theory") to Lieberman, which contradicts his writings. The present article documents a pattern of incorrect interpretations of Lieberman's theoretical work in recent literature. Finally, the apparent trend of vowel-like formant dispersions in great ape vocalization literature is discussed with regard to Lieberman's theoretical work. The review concludes that the "Lieberman account" of primate vocal tract phonetic capacities remains supported by research: the ready articulation of fully human speech reflects species-unique anatomy.
2
Playing With Fire Compounds: The Tonal Accents of Compounds in (North) Norwegian Preschoolers' Role-Play Register. LANGUAGE AND SPEECH 2024; 67:113-139. [PMID: 37113109 DOI: 10.1177/00238309231161289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/19/2023]
Abstract
Prosodic features are some of the most salient features of dialect variation in Norway. It is therefore no wonder that the switch in prosodic systems is what is first recognized by caretakers and scholars when Norwegian children code-switch to something resembling the dialect of the capital (henceforth Urban East Norwegian, UEN) in role-play. With a focus on the system of lexical tonal accents, this paper investigates the spontaneous speech of North Norwegian children engaging in peer social role-play. By investigating F0 contours extracted from a corpus of spontaneous peer play, and comparing them with elicited baseline reference contours, this paper makes the case that children fail to apply the target tonal accent consistent with UEN in compounds in role-play, although the production of tonal accents otherwise seems to be phonetically target-like for UEN. In other words, they perform in accordance with UEN phonetics, but not UEN morpho-phonology.
3
Foreign language acquisition of perceptually similar segments: evidence from Lower Sorbian. OPEN RESEARCH EUROPE 2024; 3:56. [PMID: 38532923 PMCID: PMC10964000 DOI: 10.12688/openreseurope.14895.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 02/09/2024] [Indexed: 03/28/2024]
Abstract
Lower Sorbian is a moribund language spoken in Eastern Germany that features a three-way sibilant contrast, /s, ʂ, ɕ/. The vast majority of L1 speakers are above eighty years of age and virtually no young Sorbians learn Lower Sorbian as their first language. There are language revitalization programs in place, but this means that virtually all Lower Sorbian speakers are L2 learners whose first language is German. German, as opposed to Lower Sorbian, has a two-way sibilant contrast, /s, ʃ/. So, Lower Sorbian learners need to acquire a perceptually similar sibilant contrast, /ʂ, ɕ/, that commonly assimilates with a single L1 segment, /ʃ/. The two-to-one assimilation makes acquisition difficult. In this project, I examine the acquisition of the three-way sibilant contrast using ultrasound technology. The ultrasound data revealed that learners in the contemporary context do not produce a distinction between /ʂ, ɕ/ and only learners at an advanced level who had significant exposure to L1 speakers have acquired a three-way sibilant distinction. The findings are put into the context of models of L2 acquisition and generalized implications for foreign language acquisition are discussed.
4
Exploring the performance of automatic speaker recognition using twin speech and deep learning-based artificial neural networks. Front Artif Intell 2024; 7:1287877. [PMID: 38405218 PMCID: PMC10885345 DOI: 10.3389/frai.2024.1287877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Accepted: 01/23/2024] [Indexed: 02/27/2024] Open
Abstract
This study assessed the influence of speaker similarity and sample length on the performance of an automatic speaker recognition (ASR) system utilizing the SpeechBrain toolkit. The dataset comprised recordings from 20 male identical twin speakers engaged in spontaneous dialogues and interviews. Performance evaluations involved comparing identical twins, all speakers in the dataset (including twin pairs), and all speakers excluding twin pairs. Speech samples, ranging from 5 to 30 s, were assessed using equal error rates (EER) and log-likelihood-ratio cost (Cllr) values. Results highlight the substantial challenge posed by identical twins to the ASR system, leading to a decrease in overall speaker recognition accuracy. Furthermore, analyses based on longer speech samples outperformed those using shorter samples. As sample length increased, standard deviation values for both intra- and inter-speaker similarity scores decreased, indicating reduced variability in estimating speaker similarity/dissimilarity levels in longer speech stretches compared to shorter ones. The study also uncovered varying degrees of likeness among identical twins, with certain pairs presenting a greater challenge for ASR systems. These outcomes align with prior research and are discussed within the context of relevant literature.
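For readers unfamiliar with the two metrics named in this abstract, the following is a minimal sketch of how an equal error rate and a Cllr value are commonly computed from comparison scores. It is not the study's SpeechBrain pipeline; the function names `cllr` and `equal_error_rate` and the toy score lists are hypothetical, and the EER is only approximated by scanning thresholds at the observed scores.

```python
import math


def cllr(same_lrs, diff_lrs):
    """Log-likelihood-ratio cost from likelihood ratios (plain LRs, not logs).

    same_lrs: LRs from same-speaker comparisons.
    diff_lrs: LRs from different-speaker comparisons.
    Lower is better; a system that always outputs LR = 1 scores exactly 1.
    """
    same_term = sum(math.log2(1 + 1 / lr) for lr in same_lrs) / len(same_lrs)
    diff_term = sum(math.log2(1 + lr) for lr in diff_lrs) / len(diff_lrs)
    return 0.5 * (same_term + diff_term)


def equal_error_rate(same_scores, diff_scores):
    """Approximate EER: the error rate where false accepts equal false rejects.

    Scans every observed score as a candidate threshold and returns the
    mean of the two error rates at the threshold where they are closest.
    """
    best_gap, eer = float("inf"), None
    for threshold in sorted(set(same_scores) | set(diff_scores)):
        # False rejection: a same-speaker pair scored below the threshold.
        frr = sum(s < threshold for s in same_scores) / len(same_scores)
        # False acceptance: a different-speaker pair scored at or above it.
        far = sum(s >= threshold for s in diff_scores) / len(diff_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy scores (hypothetical): well-separated vs. overlapping distributions.
separated = equal_error_rate([0.8, 0.9], [0.1, 0.2])
overlapping = equal_error_rate([0.4, 0.9], [0.1, 0.6])
uninformative = cllr([1.0], [1.0])
```

Both metrics are error measures, so lower values indicate better speaker discrimination; this matches the abstract's reading that twin comparisons, which raise both values, are harder for the system.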
5
Effects of Personal Protective Equipment on Speech Acoustics. SISLI ETFAL HASTANESI TIP BULTENI 2023; 57:434-439. [PMID: 37900335 PMCID: PMC10600612 DOI: 10.14744/semb.2023.22556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2023] [Revised: 05/22/2023] [Accepted: 06/08/2023] [Indexed: 10/31/2023]
Abstract
Objectives: The transmission of severe acute respiratory syndrome coronavirus-2 occurs primarily through droplets, which highlights the importance of protecting the oral, nasal, and conjunctival mucosas using personal protective equipment (PPE). The use of PPE can lead to communication difficulties between healthcare workers and patients. This study aimed to investigate changes in the acoustic parameters of speech sounds when different types of PPE are used. Methods: A cross-sectional study was conducted, enrolling 18 healthy male and female participants. They were instructed to produce a sustained [ɑː] vowel for at least 3 s to estimate voice quality. In addition, all Turkish vowels were produced for a minimum of 200 ms. Finally, three Turkish fricative consonants ([f], [s], and [ʃ]) were produced in a consonant/vowel/consonant format with different vowel contexts within a carrier sentence. Recordings were repeated under the following conditions: no PPE, surgical mask, N99 mask, face shield, surgical mask + face shield, and N99 mask + face shield. All recordings were subjected to analysis. Results: Frequency perturbation parameters did not show significant differences. However, formant values differed significantly. In males, the differences were significant for all vowels except [u] in the first formant (F1), all except [ɔ] and [u] in the second formant (F2), all except [ɛ] and [ɔ] in the third formant (F3), and only [i] in the fourth formant (F4). In females, they were significant for all vowels except [i] in F1, all except [u] in F2, all vowels in F3, and all except [u] and [ɯ] in F4. Spectral moment values exhibited significance in both groups. Conclusion: The use of different types of PPE resulted in variations in speech acoustic features. These findings may be attributed to the filtering effects of PPE on specific frequencies and the potential chamber effect in front of the face. Understanding the impact of PPE on speech acoustics contributes to addressing communication challenges in healthcare settings.
6
Nasal patency and mandibular movement: clinical application in prosthodontics. GENERAL DENTISTRY 2023; 71:30-33. [PMID: 37595080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/20/2023]
Abstract
This case report describes a patient with a primary concern of persistent mandibular deviation during speech who experienced clinically significant improvement (mandibular movement without deviation) after improvements to nasal resistance. At the initial consultation, temporary placement of a nasal valve dilator immediately eliminated the patient's mandibular deviation during speech, indicating the need for referral to an otolaryngologist. The patient was also provided with a dental appliance to address secondary concerns of temporomandibular joint noises and cervicofacial pain. Although the dental treatment provided some relief, the patient's mandibular deviation during speech did not resolve until after nasal surgery was completed. This case illustrates the importance of nasal resistance and nasal patency, and their effects, for obtaining a reproducible mandibular position.
7
Benefits of a professional development course on transcription for practising speech-language pathologists. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023:1-13. [PMID: 37395343 DOI: 10.1080/17549507.2023.2206069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/04/2023]
Abstract
Purpose: Transcription of speech sounds is a fundamental skill used by speech-language pathologists. Little is known about the impact of professional development courses on transcription accuracy and confidence. This study explored speech-language pathologists' use and perceptions of transcription and the effect of a professional development course on their transcription accuracy and confidence. Method: A quasi-experimental, one-group pretest-posttest design was used. Twenty-two Australian speech-language pathologists working with children with speech sound disorders participated in the course. Participants transcribed single words and completed a survey about confidence, perceptions, and the use of transcription at both time points. Result: The number of participants who reported feeling confident about using transcription significantly increased from 36.84% pre-training to 68.42% post-training. Transcription accuracy of phonemes based on point-to-point accuracy was high pre-training (88.97%) and did not significantly improve. Participants identified strategies to maintain their transcription skills. Conclusion: This study suggests speech-language pathologists transcribe single words in typical speech with high accuracy using broad transcription, and that participating in a transcription professional development course increases their transcription confidence. Further research is needed to explore different delivery methods of professional development, the impact of professional development on transcription accuracy of disordered speech, and the long-term impacts of professional development on transcription accuracy and confidence.
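As an illustration of the point-to-point accuracy measure mentioned in this abstract, here is a minimal sketch that scores a transcription against a reference, one phoneme position at a time. It assumes the two sequences have already been aligned to equal length; the function name `point_to_point_accuracy` and the example phoneme lists are hypothetical and are not taken from the study.

```python
def point_to_point_accuracy(reference, transcription):
    """Percentage of positions where the transcribed phoneme matches the reference.

    Both arguments are sequences of phoneme symbols that are assumed to be
    aligned already, so position i in one corresponds to position i in the other.
    """
    if len(reference) != len(transcription):
        raise ValueError("sequences must be aligned to equal length")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(ref == got for ref, got in zip(reference, transcription))
    return 100.0 * matches / len(reference)


# Hypothetical example: one vowel transcribed differently out of three phonemes.
reference = ["k", "æ", "t"]
transcription = ["k", "a", "t"]
score = point_to_point_accuracy(reference, transcription)
```

In practice, insertions and deletions mean the sequences need an alignment step before scoring; that step is left out of this sketch.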
8
Apraxia of Speech in the Spontaneous Speech of Nonfluent/Agrammatic Primary Progressive Aphasia. J Alzheimers Dis Rep 2023; 7:589-604. [PMID: 37313492 PMCID: PMC10259074 DOI: 10.3233/adr-220089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2022] [Accepted: 05/13/2023] [Indexed: 06/15/2023] Open
Abstract
Background: Apraxia of speech (AOS) is a core feature of nonfluent/agrammatic primary progressive aphasia (naPPA), but its precise characteristics and the prevalence of AOS features in spontaneous speech are debated. Objective: To assess the frequency of features of AOS in the spontaneous, connected speech of individuals with naPPA and to evaluate whether these features are associated with an underlying motor disorder such as corticobasal syndrome or progressive supranuclear palsy. Methods: We examined features of AOS in 30 patients with naPPA using a picture description task. We compared these patients to 22 individuals with behavioral variant frontotemporal dementia and 30 healthy controls. Each speech sample was evaluated perceptually for lengthened speech segments and quantitatively for speech sound distortions, pauses between and within words, and articulatory groping. We compared subgroups of naPPA with and without at least two features of AOS to assess the possible contribution of a motor impairment to speech production deficits. Results: naPPA patients produced both speech sound distortions and other speech sound errors. Speech segmentation was found in 27/30 (90%) of individuals. Distortions were identified in 8/30 (27%) of individuals, and other speech sound errors occurred in 18/30 (60%) of individuals. Frequent articulatory groping was observed in 6/30 (20%) of individuals. Lengthened segments were observed rarely. There were no differences in the frequencies of AOS features among naPPA subgroups as a function of extrapyramidal disease. Conclusion: Features of AOS occur with varying frequency in the spontaneous speech of individuals with naPPA, independently of an underlying motor disorder.
9
Influence of transparency and opacity on the spelling of fricative phonemes. Codas 2023; 35:e20210212. [PMID: 37283397 PMCID: PMC10266796 DOI: 10.1590/2317-1782/20232021212pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/08/2021] [Accepted: 04/07/2022] [Indexed: 06/08/2023] Open
Abstract
PURPOSE (1) To verify to what extent the occurrence of possible errors is influenced by the relationship (opaque/transparent) between fricative phonemes and the graphemes with which they can be spelled; (2) to verify whether the types of relationship differ among the phonemes that share graphemic relationships. METHODS We analyzed 750 textual productions from children in the first year of Elementary School (ES), and conducted a survey of the frequency of correct answers and errors in all fricative phonemes of Brazilian Portuguese (BP). RESULTS Errors were more frequent in the group of phonemes with opaque spelling than in the group of phonemes with transparent spelling. In the first group, the errors showed a non-symmetrical behavior, since they varied according to the possibilities of graphemes for each phoneme. In the second group, the errors showed a symmetrical behavior. CONCLUSION Given the non-symmetry in the errors of the phonemes of the first group and the symmetry of those of the second group, our results point to a gradation in the occurrence of errors, which varies as a function of the transparency and degree of opacity in the relations between phonemes and graphemes of the same class.
10
Aerosols, airflow, and more: examining the interaction of speech and the physical environment. Front Psychol 2023; 14:1184054. [PMID: 37255523 PMCID: PMC10225543 DOI: 10.3389/fpsyg.2023.1184054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2023] [Accepted: 04/28/2023] [Indexed: 06/01/2023] Open
Abstract
We describe ongoing efforts to better understand the interaction of spoken languages and their physical environments. We begin by briefly surveying research suggesting that languages evolve in ways that are influenced by the physical characteristics of their environments; however, the primary focus is on the converse issue: how speech affects the physical environment. We discuss the speech-based production of airflow and aerosol particles that are buoyant in ambient air, based on some of the results in the literature. Most critically, we demonstrate a novel method used to capture aerosol, airflow, and acoustic data simultaneously. This method captures airflow data via a pneumotachograph and aerosol data via an electrical particle impactor. The data are collected underneath a laminar flow hood while participants breathe pure air, thereby eliminating background aerosol particles and isolating those produced during speech. Given the capabilities of the electrical particle impactor, which has not previously been used to analyze speech-based aerosols, the method allows for the detection of aerosol particles at temporal and physical resolutions exceeding those evident in the literature, even enabling the isolation of the role of individual sound types in the production of aerosols. The aerosols detected via this method range in size from 70 nanometers to 10 micrometers in diameter. Such aerosol particles are capable of hosting airborne pathogens. We discuss how this approach could ultimately yield data that are relevant to airborne disease transmission and offer preliminary results that illustrate such relevance. The method described can help uncover the actual articulatory gestures that generate aerosol emissions, as exemplified here through a discussion focused on plosive aspiration and vocal cord vibration. The results we describe illustrate in new ways the unseen and unheard ways in which spoken languages interact with their physical environments.
11
On the speaker discriminatory power asymmetry regarding acoustic-phonetic parameters and the impact of speaking style. Front Psychol 2023; 14:1101187. [PMID: 37138997 PMCID: PMC10150585 DOI: 10.3389/fpsyg.2023.1101187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2022] [Accepted: 03/29/2023] [Indexed: 05/05/2023] Open
Abstract
This study aimed to assess what we refer to as the speaker discriminatory power asymmetry and its forensic implications in comparisons performed in different speaking styles: spontaneous dialogues vs. interviews. We also addressed the impact of data sampling on speaker discriminatory performance for different acoustic-phonetic estimates. The participants were 20 male speakers of Brazilian Portuguese from the same dialectal area. The speech material consisted of spontaneous telephone conversations between familiar individuals, and interviews conducted between each individual participant and the researcher. Nine acoustic-phonetic parameters were chosen for the comparisons, spanning from temporal and melodic to spectral acoustic-phonetic estimates. Ultimately, an analysis based on the combination of different parameters was also conducted. Two speaker discriminatory metrics were examined: log-likelihood-ratio cost (Cllr) and Equal Error Rate (EER) values. A general speaker discriminatory trend was suggested when assessing the parameters individually. Parameters pertaining to the temporal acoustic-phonetic class showed the weakest performance in terms of speaker contrasting power, as evidenced by the relatively higher Cllr and EER values. Moreover, from the set of acoustic parameters assessed, spectral parameters, mainly high formant frequencies, i.e., F3 and F4, were the best performing in terms of speaker discrimination, showing the lowest EER and Cllr scores. The results appear to suggest a speaker discriminatory power asymmetry concerning parameters from different acoustic-phonetic classes, in which temporal parameters tended to present a lower discriminatory power. The speaking style mismatch also seemed to considerably impact the speaker comparison task, by undermining the overall discriminatory performance. A statistical model based on the combination of different acoustic-phonetic estimates was found to perform best in this case. Finally, data sampling has proven to be of crucial relevance for the reliability of discriminatory power assessment.
12
Students' Awareness of the Role of Phonetics in Construction of Removable Dental Prostheses: A Questionnaire-Based Cross-Sectional Study. Dent J (Basel) 2022; 10:dj10120227. [PMID: 36547043 PMCID: PMC9776968 DOI: 10.3390/dj10120227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/30/2022] [Revised: 11/11/2022] [Accepted: 11/14/2022] [Indexed: 12/02/2022] Open
Abstract
Phonetics plays a major role in the fabrication of prostheses. This study aimed to assess the knowledge of students regarding the role of phonetics in denture fabrication and to improve the educational process and the clinical application. The study was conducted at the College of Dentistry, Imam Abdulrahman Bin Faisal University, and involved a survey of 344 dental students and interns. The questionnaire contained 20 questions and was divided into three sections: general knowledge, clinical correlations, and clinical evaluations. The data were collected and analyzed statistically using independent t-tests, one-way ANOVA, and Tukey’s post hoc tests. The response rate was 100%. Male and female students only differed significantly in terms of their scores for answers to general knowledge questions, with females achieving better results (p = 0.023). General knowledge varied significantly between fourth-year students and all other levels (p < 0.001), and fifth-year students and interns (p = 0.027). The clinical correlations varied significantly between fourth-year students and interns (p = 0.01), whereas the clinical evaluations varied between all the academic years and interns (fourth-year, p < 0.001; fifth-year, p = 0.003; and sixth-year, p = 0.017). The interns obtained the highest scores in all sections. There was a lack of awareness among dental students of some aspects of the role of phonetics in denture fabrication. The study highlights the deficiencies that need to be addressed and the need for adjustments to the curriculum related to removable prosthodontics in order to improve the knowledge of students regarding the role of speech in denture fabrication.
13
The Relationship between Non-Native Perception and Phonological Patterning of Implosive Consonants. LANGUAGE AND SPEECH 2022:238309221132495. [PMID: 36440824 DOI: 10.1177/00238309221132495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/16/2023]
Abstract
This study uses non-native perception data to examine the relationship between perceived phonetic similarity of segments and their phonological patterning. Segments that are phonetically similar to one another are anticipated to pattern together phonologically, and segments that share articulatory or acoustic properties are also expected to be perceived as similar. What is not yet clear is whether segments that pattern together phonologically are perceived as similar. This study addresses this question by examining how L1 English listeners and L1 Guébie listeners perceive non-native implosive consonants compared with plosives and sonorants. English does not have contrastive implosives, whereas Guébie has a bilabial implosive. The bilabial implosive phonologically patterns with sonorants in Guébie, to the exclusion of obstruents. Two perception experiments show English listeners make more perceptual categorization errors between implosives and voiced plosives than Guébie listeners do, but both listener groups are more likely to classify implosives as similar to voiced plosives than to sonorants. The results also show that Guébie listeners are better at categorizing non-native implosive consonants (i.e., alveolar implosives) than English listeners, showing that listeners are able to extend features or gestures from their L1 to non-native implosive consonants. The results of these experiments suggest a cross-linguistic perceptual similarity hierarchy of implosives compared with other segments that is not affected by L1 phonological patterning.
14
Phonetic Cues in Auditory Identification of Bulgarian, Czech, Polish, and Russian Language of Origin. LANGUAGE AND SPEECH 2022:238309221119098. [PMID: 36047062 DOI: 10.1177/00238309221119098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023]
Abstract
This work presents the results of an auditory language of origin identification experiment. Disyllabic and trisyllabic logatomes were recorded by speakers of Bulgarian, Czech, Polish, and Russian, and presented to L1 speakers of the abovementioned Slavic languages. The goals of the test were to verify the ability of lay listeners to recognize the linguistic origin of speakers, based on spoken samples with limited segmental and suprasegmental information, and to correlate the signal features with the subjects' performance. It was found that position of word stress is not an important predictor in language recognition. However, inherent vowel characteristics such as duration and vowel space computed by means of Pillai scores correlate with subjects' performance. Both the linguistic profile and the familiarity with closely related languages also appear to be relevant predictors of listeners' performance. Finally, the information-theoretic notion of surprisal applied to regular cross-linguistic sound correspondences was correlated with recognition scores; however, the correlations did not reach the threshold of statistical significance. We conclude that auditory identification of linguistic origin by lay persons, native speakers of closely related languages, is possible even when exposed to limited segmental information, which can serve as a cue in the identification of linguistic origin.
15
A Japanese 4-year-old with protracted phonological development: the challenge of coronals. CLINICAL LINGUISTICS & PHONETICS 2022; 36:657-669. [PMID: 35253563 DOI: 10.1080/02699206.2022.2029944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/02/2021] [Revised: 11/22/2021] [Accepted: 01/10/2022] [Indexed: 06/14/2023]
Abstract
This study examines the phonology of a Japanese four-year-old with mildly protracted phonological development (PPD) as a contribution to a special crosslinguistic issue presenting individual profiles in PPD within the framework of constraint-based nonlinear phonology. Although the child's word structure and vowels were well-established, certain consonant classes presented challenges. Coronal anterior obstruents often showed posteriorization (backing): dorsal stops replaced coronal stops, and with some exceptions, alveolopalatal affricates replaced anterior fricatives and affricates. The feature [+continuant] was also not yet established: palatal and bilabial fricatives and /h/ were either deleted or replaced with glottal stop; and non-anterior affricates replaced coronal fricatives. If affricates are analyzed as a sequence of [-continuant]-[+continuant], they were possible transitional elements from non-continuants to continuants. The profile culminates with suggestions for intervention based on the nonlinear phonological analysis, consistent with other papers in this special issue.
16
Electropalatography (EPG) activities in Japan and the impact of the COVID-19 pandemic on EPG research and therapy: A report of presentations at the 7th EPG Symposium. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2022; 57:906-917. [PMID: 35307940 PMCID: PMC9111328 DOI: 10.1111/1460-6984.12720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2021] [Accepted: 02/28/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND At the 7th Electropalatography Symposium in Japan, held online on 24 January 2021, a few speakers were invited to talk about how the COVID-19 pandemic had impacted their research and/or speech therapy that involved the use of electropalatography (EPG) as well as the procedures adopted in order to continue their work in a safe manner. The information on protective measures when using instrumental techniques in speech research and therapy may be useful for colleagues in research and the clinic. AIMS The primary aims are: (1) to find out whether there are any published recommendations regarding protective measures for using EPG in research and clinic settings; (2) to discuss the impact of the pandemic and the corresponding restrictions and general protective measures directed (or advised) by local government and professional bodies at each stage of EPG work; and (3) to share experiences in using modified procedures for face-to-face EPG therapy sessions and combined EPG teletherapy. In addition, a brief overview of EPG and a summary of EPG research and clinical activities in Japan presented by one of the symposium organizers at the symposium are included. METHODS & PROCEDURES A review of the literature regarding protective measures recommended for using EPG for speech assessment and treatment or research, supplemented by a discussion of our own experiences. MAIN CONTRIBUTION The literature review showed that there are no guidelines regarding protective measures for using EPG, but there is some advice regarding speech recording using microphones. Most published articles related to speech and language therapy (SLT) service during COVID-19 are about telepractice or general clinical guidelines for face-to-face speech therapy sessions. The protective measures for using EPG, developed from the general guidelines recommended by local government and professional bodies (e.g., using visors, transparent acrylic board), were described. Using EPG in telepractice was discussed as well. CONCLUSIONS It has been challenging to continue EPG research and therapy during the pandemic. In order to deal with this crisis, available knowledge regarding infection control and recommendations from local government and professional bodies were applied to design methods and procedures that allowed EPG research and therapy to continue. WHAT THIS PAPER ADDS What is already known on the subject: There are general protective measures recommended by local government and professional bodies regarding speech therapy sessions (e.g., using personal protective equipment (PPE), social distancing), but little is known about the measures for using instrumental techniques in speech research and therapy, particularly EPG. The equipment of each instrumental technique is different, so measures that are appropriate for one may not be suitable for others. Hence, specific recommendations are needed for EPG. What this paper adds to existing knowledge: This paper provides pointers to information about recommendations regarding protective measures for speech research and therapy, supplemented with suggestions specific to EPG provided by experienced users based on actual experience. What are the potential or actual clinical implications of this work? In evaluating the impact of the COVID-19 pandemic on EPG research and therapy, an analytical approach was taken to break down the steps involved in carrying out those activities, and the challenges we faced and the possible alternatives for completing the tasks were discussed. A similar approach can be applied to evaluate other aspects of speech therapy service.
|
17
|
EPG research and therapy: further developments. CLINICAL LINGUISTICS & PHONETICS 2022:1-21. [PMID: 35652593 DOI: 10.1080/02699206.2022.2080588] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 10/19/2021] [Revised: 05/13/2022] [Accepted: 05/13/2022] [Indexed: 06/15/2023]
Abstract
Electropalatography (EPG) has been used for the past 50 years to study the patterns of contact between the tongue and the palate during speech production in typical speakers and those with speech disorders due to different causes. At the 7th EPG Symposium in Japan, held online on 24 January 2021 (see: https://epg-research.sakura.ne.jp/), a panel of invited experts discussed their views regarding further developments and application of the technique. This paper provides a summary of this discussion. EPG offers information on articulation which cannot be replaced by other instrumental measures of speech. Identified areas for further hardware development are thinner EPG plates, better dental and palatal coverage, wireless connectivity, and sensors that provide additional articulatory information (e.g. tongue pressure, tongue-palate distance). EPG can serve as a resource for teaching speech disorders and phonetics. Furthermore, EPG therapy can be combined with telepractice in the speech therapy of clients with speech disorders.
|
18
|
Identifying segmental and prosodic errors associated with the increasing word length effect in acquired apraxia of speech. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2022; 24:294-306. [PMID: 35473426 DOI: 10.1080/17549507.2022.2061593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/14/2023]
Abstract
Purpose: Individuals with stroke-related apraxia of speech (AOS) plus aphasia tend to produce more speech errors with increasing word length. The Words of Increasing Length task (WIL) uses a 3-point scale to score word accuracy but penalises for error types that can arise either from language or motor impairment, reducing the test's sensitivity and specificity. The purpose here was to identify error types explaining variance in the WIL score, and those associated with AOS and word length. Method: Speech errors were perceptually identified on the WIL task for 51 Australian English-speaking adults with stroke-related aphasia, 25 with concomitant AOS. Multiple regression and linear mixed effects modelling were applied. Result: Variance in WIL scores was best explained with four error types: consonant additions, incorrect number of syllables, false starts and consonant substitutions/distortions. False starts were significantly associated with AOS diagnosis. Incorrect number of syllables, consonant omissions, false starts, and lexical stress errors increased in frequency for longer words and, while the interaction with diagnosis did not reach significance, the effect appeared driven by the AOS group. Conclusion: Findings provide further support for using polysyllabic word production to assess apraxic speech. The WIL task has limitations that may bias patients' performance and clinicians' perceptual evaluation. Data provide valuable information for designing a more sensitive diagnostic protocol for AOS.
|
19
|
Rethinking the phonetics of baby-talk: Differences across Canada and Vanuatu in the articulation of mothers' speech to infants. Dev Sci 2021; 25:e13180. [PMID: 34633716 DOI: 10.1111/desc.13180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/28/2021] [Revised: 09/04/2021] [Accepted: 09/15/2021] [Indexed: 11/29/2022]
Abstract
Infant-directed speech (IDS) is phonetically distinct from adult-directed speech (ADS): it is typically considered to have special prosody (such as higher pitch and slower speaking rates) as well as unique speech sound properties, for example, more breathy, hyperarticulated, and/or variable consonant and vowel articulation. These phonetic features are widely observed in the IDS of caregivers from urbanized contexts who speak a handful of very well-researched languages. Yet studies with more diverse socio-cultural and linguistic samples show that this "typical" IDS prosody is not consistently observed across cultures. We extended cross-cultural work by examining IDS speech segment articulation, which, like prosody, is also thought to be a characteristic phonetic feature of IDS that might aid speech and language development. Here we asked whether IDS vowels have different articulatory features compared to ADS vowels in two distinct linguistic and socio-cultural contexts: urban English-speaking Canadian mothers, and rural Lenakel- and Southwest Tanna-speaking ni-Vanuatu mothers (n = 57, 20-46 years of age). Replicating prior work, Canadian mothers had more variable vowels in IDS compared to ADS, but also did not show clear register differences for breathiness or hyperarticulation. Vowels spoken by ni-Vanuatu mothers showed very distinct articulatory tendencies, using less variable (and less breathy) IDS vowels. Along with other work showing diversity in IDS phonetics across populations, this paper suggests that any understanding of how IDS might aid speech and language development is best examined through a culturally and linguistically specific lens.
|
20
|
Paradigmatic Relations Interact During the Production of Complex Words: Evidence From Variable Plurals in Dutch. Front Psychol 2021; 12:720017. [PMID: 34539520 PMCID: PMC8442732 DOI: 10.3389/fpsyg.2021.720017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/03/2021] [Accepted: 07/29/2021] [Indexed: 12/04/2022]
Abstract
A growing body of work in psycholinguistics suggests that morphological relations between word forms affect the processing of complex words. Previous studies have usually focused on a particular type of paradigmatic relation, for example the relation between paradigm members, or the relation between alternative forms filling a particular paradigm cell. However, potential interactions between different types of paradigmatic relations have remained relatively unexplored. This paper presents two corpus studies of variable plurals in Dutch to test hypotheses about potentially interacting paradigmatic effects. The first study shows that generalization across noun paradigms predicts the distribution of plural variants, and that this effect is diminished for paradigms in which the plural variants are more likely to have a strong representation in the mental lexicon. The second study demonstrates that the pronunciation of a target plural variant is affected by coactivation of the alternative variant, resulting in shorter segmental durations. This effect is dependent on the representational strength of the alternative plural variant. In sum, by exploring interactions between different types of paradigmatic relations, this paper provides evidence that storage of morphologically complex words may affect the role of generalization and coactivation during production.
|
21
|
Language and Learner Specific Influences on the Emergence of Consonantal Place and Manner Features. Front Psychol 2021; 12:646713. [PMID: 34603114 PMCID: PMC8484525 DOI: 10.3389/fpsyg.2021.646713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/28/2020] [Accepted: 06/16/2021] [Indexed: 11/13/2022]
Abstract
This article focuses on the emergence of consonantal place and manner feature categories in the speech of first language learners. Starting with an overview of current representational approaches to phonology, we take the position that only models that allow for the emergence of phonological categories at all levels of phonological representation (from sub-segmental properties of speech sounds all the way to word forms represented within the child's lexicon) can account for the data. We begin with a cross-linguistic survey of the acquisition of rhotic consonants. We show that the types of substitutions affecting different rhotics cross-linguistically can be predicted from two main observations: the phonetic characteristics of these rhotics and the larger system of categories displayed by each language. We then turn to a peculiar pattern of labial substitution for coronal continuants in the speech of a German learner. Building on previous literature on the topic, we attribute the emergence of this pattern to distributional properties of the child's developing lexicon. Together, these observations suggest that our understanding of phonological emergence must involve a consideration of multiple, potentially interacting levels of phonetic and phonological representation.
|
22
|
Echoes of L1 Syllable Structure in L2 Phoneme Recognition. Front Psychol 2021; 12:515237. [PMID: 34354620 PMCID: PMC8329372 DOI: 10.3389/fpsyg.2021.515237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/27/2019] [Accepted: 03/23/2021] [Indexed: 11/13/2022]
Abstract
Learning to move from auditory signals to phonemic categories is a crucial component of first, second, and multilingual language acquisition. In L1 and simultaneous multilingual acquisition, learners build up phonological knowledge to structure their perception within a language. For sequential multilinguals, this knowledge may support or interfere with acquiring language-specific representations for a new phonemic categorization system. Syllable structure is a part of this phonological knowledge, and language-specific syllabification preferences influence language acquisition, including early word segmentation. As a result, we expect to see language-specific syllable structure influencing speech perception as well. Initial evidence of an effect appears in Ali et al. (2011), who argued that cross-linguistic differences in McGurk fusion within a syllable reflected listeners’ language-specific syllabification preferences. Building on a framework from Cho and McQueen (2006), we argue that this could reflect the Phonological-Superiority Hypothesis (differences in L1 syllabification preferences make some syllabic positions harder to classify than others) or the Phonetic-Superiority Hypothesis (the acoustic qualities of speech sounds in some positions make it difficult to perceive unfamiliar sounds). However, their design does not distinguish between these two hypotheses. The current research study extends the work of Ali et al. (2011) by testing Japanese, and adding audio-only and congruent audio-visual stimuli to test the effects of syllabification preferences beyond just McGurk fusion. Eighteen native English speakers and 18 native Japanese speakers were asked to transcribe nonsense words in an artificial language. English allows stop consonants in syllable codas while Japanese heavily restricts them, but both groups showed similar patterns of McGurk fusion in stop codas. This is inconsistent with the Phonological-Superiority Hypothesis. However, when visual information was added, the phonetic influences on transcription accuracy largely disappeared. This is inconsistent with the Phonetic-Superiority Hypothesis. We argue from these results that neither acoustic informativity nor interference of a listener’s phonological knowledge is superior, and sketch a cognitively inspired rational cue integration framework as a third hypothesis to explain how L1 phonological knowledge affects L2 perception.
|
23
|
The vocal tract as a time machine: inferences about past speech and language from the anatomy of the speech organs. Philos Trans R Soc Lond B Biol Sci 2021; 376:20200192. [PMID: 33745306 PMCID: PMC8059537 DOI: 10.1098/rstb.2020.0192] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Accepted: 11/05/2020] [Indexed: 12/14/2022]
Abstract
While speech and language do not fossilize, they still leave traces that can be extracted and interpreted. Here, we suggest that the shape of the hard structures of the vocal tract may also allow inferences about the speech of long-gone humans. These build on recent experimental and modelling studies, showing that there is extensive variation between individuals in the precise shape of the vocal tract, and that this variation affects speech and language. In particular, we show that detailed anatomical information concerning two components of the vocal tract (the lower jaw and the hard palate) can be extracted and digitized from the osteological remains of three historical populations from The Netherlands, and can be used to conduct three-dimensional biomechanical simulations of vowel production. We could recover the signatures of inter-individual variation between these vowels, in acoustics and articulation. While 'proof-of-concept', this study suggests that older and less well-preserved remains could be used to draw inferences about historic and prehistoric languages. Moreover, it forces us to clarify the meaning and use of the uniformitarian principle in linguistics, and to consider the wider context of language use, including the anatomy, physiology and cognition of the speakers. This article is part of the theme issue 'Reconstructing prehistoric languages'.
|
24
|
Lexical analyses of the function and phonology of Papuan Malay word stress. PHONETICA 2021; 78:141-168. [PMID: 33892529 DOI: 10.1515/phon-2021-2003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2020] [Accepted: 03/23/2021] [Indexed: 06/12/2023]
Abstract
The existence of word stress in Indonesian languages has been controversial. Recent acoustic analyses of Papuan Malay suggest that this language has word stress, counter to other studies and unlike closely related languages. The current study further investigates Papuan Malay by means of lexical (non-acoustic) analyses of two different aspects of word stress. In particular, this paper reports two distribution analyses of a word corpus, 1) investigating the extent to which stress patterns may help word recognition and 2) exploring the phonological factors that predict the distribution of stress patterns. The facilitating role of stress patterns in word recognition was investigated in a lexical analysis of word embeddings. The results show that Papuan Malay word stress (potentially) helps to disambiguate words. As for stress predictors, a random forest analysis investigated the effect of multiple morpho-phonological factors on stress placement. It was found that the mid vowels /ɛ/ and /ɔ/ play a central role in stress placement, refining the conclusions of previous work that mainly focused on /ɛ/. The current study confirms that non-acoustic research on stress can complement acoustic research in important ways. Crucially, the combined findings on stress in Papuan Malay so far give rise to an integrated perspective to word stress, in which phonetic, phonological and cognitive factors are considered.
|
25
|
Extracting Phonetic Features From Natural Classes: A Mismatch Negativity Study of Mandarin Chinese Retroflex Consonants. Front Hum Neurosci 2021; 15:609898. [PMID: 33841113 PMCID: PMC8029992 DOI: 10.3389/fnhum.2021.609898] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/24/2020] [Accepted: 02/23/2021] [Indexed: 11/13/2022]
Abstract
How speech sounds are represented in the brain is not fully understood. The mismatch negativity (MMN) has proven to be a powerful tool in this regard. The MMN event-related potential is elicited by a deviant stimulus embedded within a series of repeating standard stimuli. Listeners construct auditory memory representations of these standards despite acoustic variability. In most designs that test speech sounds, however, this variation is typically intra-category: All standards belong to the same phonetic category. In the current paper, inter-category variation is presented in the standards. These standards vary in manner of articulation but share a common phonetic feature. In the standard retroflex experimental block, Mandarin Chinese speaking participants are presented with a series of "standard" consonants that share the feature [retroflex], interrupted by infrequent non-retroflex deviants. In the non-retroflex standard experimental block, non-retroflex standards are interrupted by infrequent retroflex deviants. The within-block MMN was calculated, as was the identity MMN (iMMN) to account for intrinsic differences in responses to the stimuli. We only observed a within-block MMN to the non-retroflex deviant embedded in the standard retroflex block. This suggests that listeners extract [retroflex] despite significant inter-category variation. In the non-retroflex standard block, because there is little on which to base a coherent auditory memory representation, no within-block MMN was observed. The iMMN to the retroflex was observed in a late time-window at centro-parieto-occipital electrode sites instead of fronto-central electrodes, where the MMN is typically observed, potentially reflecting the increased difficulty posed by the added variation in the standards. In short, participants can construct auditory memory representations despite significant acoustic and inter-category phonological variation so long as a shared phonetic feature binds them together.
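The two difference-wave measures named in this abstract can be illustrated with a minimal Python sketch. It assumes the conventional definitions (within-block MMN: deviant minus the standard of the same block; identity MMN: deviant minus the same stimulus category presented as the standard in the other block). The function names, the toy voltages, and the 100-200 ms window are hypothetical and are not taken from the paper, whose actual analysis pipeline is not described in the abstract.

```python
from statistics import fmean


def average_erp(epochs):
    """Average single-trial epochs (lists of equal length) into one ERP."""
    return [fmean(samples) for samples in zip(*epochs)]


def difference_wave(deviant_erp, reference_erp):
    """Point-by-point deviant-minus-reference difference wave."""
    return [d - r for d, r in zip(deviant_erp, reference_erp)]


def mean_amplitude(wave, times_ms, start_ms, end_ms):
    """Mean amplitude of a wave inside a latency window (inclusive)."""
    window = [v for v, t in zip(wave, times_ms) if start_ms <= t <= end_ms]
    return fmean(window)


# Toy data (hypothetical, in microvolts): 2 trials x 4 samples per condition.
times_ms = [0, 100, 200, 300]
retroflex_block = {
    "standard": [[0.0, 1.0, 1.0, 0.0], [0.0, 1.0, 1.0, 0.0]],   # retroflex standards
    "deviant": [[0.0, 0.0, -1.0, 0.0], [0.0, 0.0, -3.0, 0.0]],  # non-retroflex deviants
}
nonretroflex_block = {
    "standard": [[0.0, 0.5, 0.0, 0.0], [0.0, 0.5, 0.0, 0.0]],   # non-retroflex standards
    "deviant": [[0.0, 1.0, 0.5, 0.0], [0.0, 1.0, 0.5, 0.0]],    # retroflex deviants
}

# ERP to the non-retroflex deviant in the retroflex-standard block.
deviant_erp = average_erp(retroflex_block["deviant"])

# Within-block MMN: deviant minus the standard from the SAME block.
within_block_mmn = difference_wave(
    deviant_erp, average_erp(retroflex_block["standard"])
)

# Identity MMN: deviant minus the same stimulus category presented
# as the standard in the OTHER block.
identity_mmn = difference_wave(
    deviant_erp, average_erp(nonretroflex_block["standard"])
)

print(mean_amplitude(within_block_mmn, times_ms, 100, 200))
print(mean_amplitude(identity_mmn, times_ms, 100, 200))
```

The identity comparison holds the stimulus category constant across the two waveforms, which is why the abstract describes it as accounting for intrinsic differences in responses to the stimuli.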
|
26
|
The Effects of L1 English Constraints on the Acquisition of the L2 Spanish Alveopalatal Nasal. Front Psychol 2021; 12:640354. [PMID: 33658966 PMCID: PMC7919851 DOI: 10.3389/fpsyg.2021.640354] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/11/2020] [Accepted: 01/21/2021] [Indexed: 11/21/2022]
Abstract
This study examines whether L1 English/L2 Spanish learners at different proficiency levels acquire a novel L2 phoneme, the Spanish palatal nasal /ɲ/. While alveolar /n/ is part of the Spanish and English inventories, /ɲ/, which consists of a tautosyllabic palatal nasal+glide element, is not. This crosslinguistic disparity presents potential difficulty for L1 English speakers due to L1 segmental and phonotactic constraints; the closest English approximation is the heterosyllabic sequence /nj/ (e.g., “canyon” /kænjən/ ['kʰæn.jən], cf. Spanish cañón “canyon” /kaɲon/ [ka.'ɲon]). With these crosslinguistic differences in mind, we ask: (1a) Do L1 English learners of L2 Spanish produce acoustically distinct Spanish /n/ and /ɲ/ and (1b) Does the distinction of /n/ and /ɲ/ vary by proficiency? In the case that learners distinguish /n/ and /ɲ/, the second question investigates the acoustic quality of /ɲ/ to determine (2a) if learners' L2 representation patterns with that of an L1 Spanish representation or if learners rely on an L1 representation (here, English /nj/) and (2b) if the acoustic quality of L2 Spanish /ɲ/ varies as a function of proficiency. Beginner (n = 9) and advanced (n = 8) L1 English/L2 Spanish speakers and a comparison group of 10 L1 Spanish/L2 English speakers completed delayed repetition tasks in which disyllabic nonce words were produced in a carrier phrase. English critical items contained an intervocalic heterosyllabic /nj/ sequence (e.g., ['pʰan.jə]); Spanish critical items consisted of items with either intervocalic onset /ɲ/ (e.g., ['xa.ɲa]) or /n/ ['xa.na]. We measured duration and formant contours of the following vocalic portion as acoustic indices of the /n/ ~ /ɲ/ and /ɲ/ ~ /nj/ distinctions. Results show that, while L2 Spanish learners produce an acoustically distinct /n/ ~ /ɲ/ contrast even at a low level of proficiency, the beginners produce an intermediate /ɲ/ that falls acoustically between their English /nj/ and the L1 Spanish /ɲ/ while the advanced learners' Spanish /ɲ/ and English /nj/ appear to be in the process of equivalence classification. We discuss these outcomes as they relate to the robustness of L1 phonological constraints in late L2 acquisition coupled with the role of perceptual cues, functional load, and questions of intelligibility.
|
27
|
Validity and Reliability of the New Chinese Version of the Frontal Assessment Battery-Phonemic. J Alzheimers Dis 2021; 80:371-381. [PMID: 33554904 DOI: 10.3233/jad-201028] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/15/2022]
Abstract
BACKGROUND Alzheimer's disease dementia (ADD) is an important health problem in the world. OBJECTIVE The present study investigated the validity and reliability of a new version of the Frontal Assessment Battery (FAB) named the FAB-phonemic (FAB-P). METHODS A total of 76 patients with ADD, 107 patients with amnestic mild cognitive impairment (aMCI), 37 patients with non-amnestic MCI (naMCI), and 123 healthy controls were included in this study. All participants were evaluated with the FAB-P and the cognitive assessments according to a standard procedure. RESULTS The global FAB-P scores in patients with ADD were lower than those of patients with aMCI, patients with naMCI, and healthy controls (p < 0.001). Patients with aMCI performed worse than healthy controls (p < 0.001). The interrater reliability, test-retest reliability, and Cronbach's alpha coefficient for the FAB-P were 0.997, 0.819, and 0.736, respectively. The test could distinguish the patients with mild ADD, aMCI, and naMCI from healthy controls with classification accuracy of 89.4%, 70.9%, and 61.6%, respectively. It could also discriminate between the patients with ADD and aMCI, between those with ADD and naMCI, and between those with aMCI and naMCI with classification accuracy of 73.8%, 83.9%, and 58.0%, respectively. The regression analysis revealed that the Montreal Cognitive Assessment and the Stroop Color Word Test Part C had the greatest contribution to FAB-P score variance. CONCLUSION The FAB-P is a valid and reliable tool for evaluating frontal lobe function and can effectively discriminate ADD, aMCI, and naMCI.
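For readers unfamiliar with the internal-consistency statistic reported above, the following is a minimal Python sketch of the standard Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), where k is the number of items. The function name and the score matrix are hypothetical illustrations, not the study's data or code.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for rows of per-participant item scores."""
    n_items = len(scores[0])
    # Sample variance of each item (column) across participants.
    item_variances = [variance(column) for column in zip(*scores)]
    # Sample variance of each participant's total score.
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: 4 participants x 3 items (not data from the study).
toy_scores = [
    [3, 3, 2],
    [2, 2, 2],
    [1, 0, 1],
    [3, 2, 3],
]
print(round(cronbach_alpha(toy_scores), 3))
```

Alpha approaches 1 when items vary together across participants, and falls toward 0 when item scores are unrelated.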
|
28
|
Infants' discrimination of consonant contrasts in the presence and absence of talker variability. INFANCY 2021; 26:84-103. [PMID: 33063948 PMCID: PMC9794002 DOI: 10.1111/infa.12371] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 12/23/2019] [Revised: 09/17/2020] [Accepted: 09/18/2020] [Indexed: 12/30/2022]
Abstract
To learn speech-sound categories, infants must identify the acoustic dimensions that differentiate categories and selectively attend to them as opposed to irrelevant dimensions. Variability on irrelevant acoustic dimensions can aid formation of robust categories in infants through adults in tasks such as word learning (e.g., Rost and McMurray, 2009) or speech-sound learning (e.g., Lively et al., 1993). At the same time, variability sometimes overwhelms learners, interfering with learning and processing. Two prior studies (Kuhl & Miller, 1982; Jusczyk, Pisoni, & Mullennix, 1992) found that irrelevant variability sometimes impaired early sound discrimination. We asked whether variability would impair or facilitate discrimination for older infants, comparing 7.5-month-old infants' discrimination of an early acquired native contrast, /p/ vs. /b/ (in the word forms /pIm/ vs. /bIm/), in Experiment 1, with an acoustically subtle, non-native contrast, /n/ vs. /ŋ/ (in /nIm/ vs. /ŋIm/), in Experiment 2. Words were spoken by one or four talkers. Infants discriminated the native but not the non-native contrast, and there were no significant effects of talker condition. We discuss implications for theories of phonological learning and avenues for future research.
|
29
|
The versatility of creaky phonation: Segmental, prosodic, and sociolinguistic uses in the world's languages. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2020; 12:e1547. [PMID: 33015958 DOI: 10.1002/wcs.1547] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 02/04/2020] [Revised: 09/11/2020] [Accepted: 09/15/2020] [Indexed: 11/11/2022]
Abstract
Creaky phonation (also known as creaky voice, vocal fry, laryngealization, or glottalization) is a voice quality that refers to shortened and thickened vocal folds that vibrate at a low and quasi-regular fundamental frequency with a long period of damping. Cross-linguistically, creaky phonation can span either short or long domains. When implemented on individual vowels or consonants (as in Zapotec or Montana Salish), it can signal phonemic contrast with other voice qualities, or it can be an additional acoustic cue to enhance other contrasts, such as tone (as in Mandarin or Cantonese). Another segmental use of creaky phonation in many languages is as a variant of glottal stop. Creaky phonation can also be implemented as a prosodic element that signals the end of a phrase (as in English or Mandarin), or indicates relinquishing a conversational turn (as in Finnish). It can also express meaning in a social interaction, such as irritation (in Vietnamese). Lastly, creaky phonation can be deployed as a sociolinguistic marker to establish identities, convey affect, or distinguish one speech group from another within the same language. In some social circumstances, such as the perception that young women use creaky phonation at greater rates than men do, it can be evaluated negatively by listeners. As creaky phonation can be combined with linguistic elements at various levels and is easily perceptible, it has taken on a remarkable number of roles in our linguistic repertoires. This article is categorized under: Linguistics > Language in Mind and Brain.
|
30
|
Abstract
The unique biomechanical and functional constraints on human speech make it a promising area for research investigating modular control of movement. The present article illustrates how a modular control approach to speech can provide insights relevant to understanding both motor control and observed variation across languages. We specifically explore the robust typological finding that languages produce different degrees of labial constriction using distinct muscle groupings and concomitantly distinct lip postures. Research has suggested that these lip postures exploit biomechanical regions of nonlinearity between neural activation and movement, also known as quantal regions, to allow movement goals to be realized despite variable activation signals. We present two sets of computer simulations showing that these labial postures can be generated under the assumption of modular control and that the corresponding modules are biomechanically robust: first to variation in the activation levels of participating muscles, and second to interference from surrounding muscles. These results provide support for the hypothesis that biomechanical robustness is an important factor in selecting the muscle groupings used for speech movements and provide insight into the neurological control of speech movements and how biomechanical and functional constraints govern the emergence of speech motor modules. We anticipate that future experimental work guided by biomechanical simulation results will provide new insights into the neural organization of speech movements. NEW & NOTEWORTHY This article provides additional evidence that speech motor control is organized in a modular fashion and that biomechanics constrain the kinds of motor modules that may emerge. It also suggests that speech can be a fruitful domain for the study of modularity and that a better understanding of speech motor modules will be useful for speech research. Finally, it suggests that biomechanical modeling can serve as a useful complement to experimental work when studying modularity.
|
31
|
The effects of prematurity and socioeconomic deprivation on early speech perception: A story of two different delays. Dev Sci 2020; 24:e13020. [PMID: 32687657 DOI: 10.1111/desc.13020] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 12/23/2019] [Revised: 06/01/2020] [Accepted: 07/11/2020] [Indexed: 11/30/2022]
Abstract
There is evidence showing that both maturational and environmental factors can affect later language development. On the one hand, preterm birth has been found to increase the risk of deficits in the preschool and school years. Preterm children show poorer auditory discrimination, reading difficulties, poor vocabulary, less complex expressive language and lower receptive understanding than their matched controls. On the other hand, socioeconomic status (SES) indicators (i.e., income, education and occupation) have been found to be strongly related to linguistic abilities during the preschool and school years. However, there is very little information about how these factors result in lower linguistic abilities. The present study addresses this issue. To do so, we investigated early speech perception in full-term and preterm infants from families classed as high or low SES. Seventy-six infants were followed longitudinally at 7.5, 9, 10.5 and 12 months of age. At each test point, three studies explored infants' phonetic, prosodic and phonotactic development respectively. Results showed no significant differences between the phonetic or the phonotactic development of the preterm and the full-term infants. However, a time-lag between preterm and full-term developmental timing for prosody was found. Socioeconomic status did not have a significant effect on prosodic development. Nonetheless, phonetic and phonotactic development was affected by SES: infants from lower-SES families showed phonetic discrimination of a non-native contrast and a preference for high-probability sequences later than their more advantaged peers. Overall, these results suggest that different constraints apply to the acquisition of different phonological subcomponents.
|
32
|
Neural Representation of Articulable and Inarticulable Novel Sound Contrasts: The Role of the Dorsal Stream. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2020; 1:339-364. [PMID: 35784619 PMCID: PMC9248853 DOI: 10.1162/nol_a_00016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2019] [Accepted: 05/23/2020] [Indexed: 06/15/2023]
Abstract
The extent to which articulatory information embedded in incoming speech contributes to the formation of new perceptual categories for speech sounds has been a matter of discourse for decades. It has been theorized that the acquisition of new speech sound categories requires a network of sensory and speech motor cortical areas (the "dorsal stream") to successfully integrate auditory and articulatory information. However, it is possible that these brain regions are not sensitive specifically to articulatory information, but instead are sensitive to the abstract phonological categories being learned. We tested this hypothesis by training participants over the course of several days on an articulable non-native speech contrast and acoustically matched inarticulable nonspeech analogues. After reaching comparable levels of proficiency with the two sets of stimuli, activation was measured in fMRI as participants passively listened to both sound types. Decoding of category membership for the articulable speech contrast alone revealed a series of left and right hemisphere regions outside of the dorsal stream that have previously been implicated in the emergence of non-native speech sound categories, while no regions could successfully decode the inarticulable nonspeech contrast. Although activation patterns in the left inferior frontal gyrus, the middle temporal gyrus, and the supplementary motor area provided better information for decoding articulable (speech) sounds compared to the inarticulable (sine wave) sounds, the finding that dorsal stream regions do not emerge as good decoders of the articulable contrast alone suggests that other factors, including the strength and structure of the emerging speech categories, are more likely drivers of dorsal stream activation for novel sound learning.
|
33
|
Patient satisfaction with esthetics, phonetics, and function following implant-supported fixed restorative treatment in the esthetic zone: A systematic review. J ESTHET RESTOR DENT 2020; 32:662-672. [PMID: 32715619 DOI: 10.1111/jerd.12625] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 04/10/2020] [Revised: 05/21/2020] [Accepted: 06/24/2020] [Indexed: 12/16/2022]
Abstract
OBJECTIVE To determine patient satisfaction with esthetics, phonetics, and function following implant-supported fixed restorative treatment in the esthetic zone by measuring the Oral Health Related Quality of Life (OHRQoL). MATERIALS AND METHODS This systematic review follows the "Preferred reporting items for systematic review and meta-analysis protocols" (PRISMA-P) 2015 statement. Studies were searched in the databases Ovid, PubMed, Web of Science, Scopus, and the Cochrane Library. The quality of the studies included in the review was scored using the GRADE system. The impact of the findings was analyzed by calculating effect size, and standardization of results across different OHRQoL measurements was achieved by calculating the percentage equivalent. RESULTS A total of 13 studies were selected to be included in this systematic review after application of the inclusion criteria. A total of six studies recorded pre- and post-treatment OHRQoL results, while the remaining seven studies provided only post-treatment results. CONCLUSIONS This review concluded that implant-supported fixed restorations in the esthetic zone have an overall positive impact on OHRQoL. However, patient satisfaction with this treatment decreased as the number of missing teeth replaced by implants increased. CLINICAL SIGNIFICANCE When implant-supported fixed restorations are being planned in the esthetic zone, dentists need to consider the patient's perception and the subsequent impact of these restorations on the patient's quality of life. Clinicians can be assured that patient OHRQoL will increase; however, clinicians should also keep in mind the importance of the gingival frame. Furthermore, they should be aware of the challenges when planning cases with larger numbers of implants in the esthetic zone, as this could lead to a lowering of a patient's QoL.
|
34
|
Phonetic-phonological performance of typical younger and aged adults from Brazil's capital city. Dement Neuropsychol 2020; 14:308-314. [PMID: 32973984 PMCID: PMC7500818 DOI: 10.1590/1980-57642020dn14-030012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/03/2020] [Accepted: 05/04/2020] [Indexed: 11/21/2022] Open
Abstract
Given the sociodemographic diversity in Brazil, it is fundamental to understand the speech performance of a sample from the Brazilian capital. The repetition task can assess phonological and motor-phonetic planning. Previous studies found phonological-phonetic performance of speakers to be associated with education, age, and other demographic factors. OBJECTIVES To compare the phonetic-phonological performance for speech of younger and aged adults in the capital of Brazil, Federal District (FD); to compare FD performance against national normative means based on São Paulo; to determine the association of phonetic-phonological agility with sociodemographic, cognitive, and neuropsychiatric variables for the sample. METHODS Cross-sectional study. A total of 60 volunteers from the FD, comprising 30 older adults and 30 younger ones, were stratified by education into two subgroups: 2‒7 years and ≥8 years of education. Data on age, educational level, and socioeconomic status were collected. The Verbal Agility subtest of the Boston Diagnostic Aphasia Examination was applied to assess phonetic-phonological performance. RESULTS No statistically significant difference in performance for verbal agility was found between aged and younger adults from the FD. There was a statistically significant difference in the phonetic-phonological performance of the FD sample compared with the Brazilian normative mean values. Cognitive and socioeconomic variables were associated with verbal agility. CONCLUSIONS In the capital of Brazil, economic status, age, education, and cognitive variables were associated with verbal agility performance, despite there being no difference in phonetic-phonological performance between younger and aged adult groups. Regional differences in phonetic-phonological performance were also evident.
|
35
|
Abstract
Depression is a serious problem for many older adults but is too often undetected by the person, family or providers. Although vocal patterns have been successfully used to detect and predict depression in adults aged 18 to 65 years, no studies to date have included older adults. The study purpose was to determine whether vocal patterns associated with clinical depression in younger people also signify depression in older adults. An observational, repeated measures design was used to enroll 46 volunteer older adults who completed a semi-structured interview composed of the 9-item Patient Health Questionnaire (PHQ-9) depression scale and selected speech measures. Recorded interviews were analysed by machine learning algorithms to evaluate whether vocal patterns may predict presence of depression in older adults. In this study, using the PHQ-9 and a supervised machine learning algorithm, high and low depression scores were accurately predicted between 86% and 92% of the time. Change in raw PHQ-9 scores between interview cycles was predicted within 1.17 points. These results provide strong and promising evidence that vocal patterns can be used effectively to detect clinical depression in adults who are 65 years and older.
|
36
|
Timing Evidence for Symbolic Phonological Representations and Phonology-Extrinsic Timing in Speech Production. Front Psychol 2020; 10:2952. [PMID: 32038364 PMCID: PMC6993048 DOI: 10.3389/fpsyg.2019.02952] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 04/30/2019] [Accepted: 12/12/2019] [Indexed: 11/16/2022] Open
Abstract
The goals of this paper are (1) to discuss the key features of existing articulatory models of speech production that govern their approaches to timing, along with advantages and disadvantages of each, and (2) to evaluate these features in terms of several pieces of evidence from both the speech and nonspeech motor control literature. This evidence includes greater timing precision at movement endpoints compared to other parts of movements, suggesting the separate control of the timing of movement endpoints compared to other parts of movement. This endpoint timing precision challenges models in which all parts of a movement trajectory are controlled by the same equation of motion, but supports models in which (a) abstract, symbolic phonological representations map onto spatial and temporal characteristics of the part(s) of movement most closely related to the goal of producing a planned set of acoustic cues to signal the phonological contrast (often the endpoint), (b) movements are coordinated primarily based on the goal-related part of movement, and (c) speakers give priority to the accurate implementation of the part(s) of movement most closely related to the phonological goals. In addition, this paper presents three types of evidence for phonology-extrinsic timing, suggesting that surface duration requirements are represented during speech production. Phonology-extrinsic timing is also supported by greater timing variability for repetitions of longer intervals, assumed to be due to noise in a general-purpose (and phonology-extrinsic) timekeeping process. The evidence appears to be incompatible with models that have a unified Phonology/Phonetics Component, that do not represent the surface timing of phonetic events, and do not represent, specify and track timing by general-purpose timekeeping mechanisms. 
Taken together, this evidence supports an alternative approach to modeling speech production that is based on symbolic phonological representations and general-purpose, phonology-extrinsic, timekeeping mechanisms, rather than on spatio-temporal phonological representations and phonology-specific timing mechanisms. Thus, the evidence suggests that models in that alternative framework should be developed, so they can be tested with the same rigor as have models based on spatio-temporal phonological representations with phonology-intrinsic timing.
|
37
|
Only When It Feels Good: Specific Cat Vocalizations Other Than Meowing. Animals (Basel) 2019; 9:ani9110878. [PMID: 31671749 PMCID: PMC6912413 DOI: 10.3390/ani9110878] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 08/31/2019] [Revised: 09/30/2019] [Accepted: 10/14/2019] [Indexed: 11/16/2022] Open
Abstract
Our objective was to identify and characterize the types of vocalization other than meowing (VOM) in two contexts, a pleasant and an aversive situation, and to study the effect of the sex of the animal. A total of 74 cats (32 tom cats and 42 queens) living in the city of Curitiba, Brazil, participated in the study; in total, 68 (29 tom cats and 39 queens) were divided into two groups according to the stimulus they were exposed to: either a pleasant situation (PS), when they were offered a snack, or an aversive situation (AS), with the simulation of a car transport event. The other six animals (three tom cats and three queens) participated in both situations. Only the PS group presented VOM; of the 40 PS animals, 14 presented VOM, mostly acknowledgment or trill and squeak. No correlation was observed between vocalization and cat sex (p = 0.08; Pearson's Chi-Square). Results show that VOM is exclusively associated with positive situations, suggesting that these vocalizations may be relevant for understanding the valence of cat emotional state. Further studies are warranted to advance knowledge on other VOMs and on the generalization of our findings to other situations.
|
38
|
A cross-linguistic examination of toddlers' interpretation of vowel duration. INFANCY 2019; 24:300-317. [PMID: 31576195 PMCID: PMC6771292 DOI: 10.1111/infa.12280] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 12/15/2017] [Accepted: 12/02/2018] [Indexed: 11/30/2022]
Abstract
Languages differ in their phonological use of vowel duration. For the child, learning how duration contributes to lexical contrast is complicated because segmental duration is implicated in many different linguistic distinctions. Using a language-guided looking task, we measured English and Dutch 21-month-olds' recognition of familiar words with normal or manipulated vowel durations. Dutch but not English learners were affected by duration changes, even though distributions of short and long vowels in both languages are similar, and English uses vowel duration as a cue to (for example) consonant coda voicing. Additionally, we found that word recognition in Dutch toddlers was affected by shortening but not lengthening of vowels, matching an asymmetry also found in Dutch adults. Considering the subtlety of the crosslinguistic difference in the input, and the complexity of duration as a phonetic feature, our results suggest a strong capacity for phonetic analysis in children before their second birthday.
|
39
|
Durational Evidence That Tokyo Japanese Vowel Devoicing Is Not Gradient Reduction. Front Psychol 2019; 10:821. [PMID: 31040809 PMCID: PMC6476939 DOI: 10.3389/fpsyg.2019.00821] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 01/26/2019] [Accepted: 03/27/2019] [Indexed: 11/24/2022] Open
Abstract
A central question in the Japanese high vowel devoicing literature concerns whether vowels are devoiced through a categorical process or via gradient reduction. Examining how vowel height and consonantal voicing condition phrase-internal CV duration in a corpus of spontaneous Tokyo Japanese, it was found that CVs containing high vowels are substantially shorter before voiceless consonants, whilst non-high vowels do not exhibit comparable shortening. This quantitative difference between CV durations suggests a controlled temporal compression of the CV, consistent with views that Japanese vowel devoicing is produced through a categorical process targeting high vowels preceding voiceless consonants, and supports previous observations made of elicited productions.
|
40
|
Questioning the role of lexical contrastiveness in phonological development: Converging evidence from perception and production studies. CANADIAN JOURNAL OF LINGUISTICS. LA REVUE CANADIENNE DE LINGUISTIQUE 2018; 63:580-608. [PMID: 35179525 PMCID: PMC8849088 DOI: 10.1017/cnj.2018.12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/14/2023]
Abstract
In this article, we address relations between lexical and phonological development, with an emphasis on the notion of phonological contrast. We begin with an overview of the literature on word learning and on infant speech perception. Among other results, we report on studies showing that toddlers' perceptual abilities do not correlate with the development of phonological contrasts within their lexicons. We then engage in a systematic comparison between the lexical development of two child learners of English and their acquisition of consonants in syllable onsets. We establish a developmental timeline for each child's onset consonant system, which we compare to the types of phonological contrasts that are present in their expressive vocabularies at each relevant milestone. Like the earlier studies, ours also fails to return tangible parallels between the two areas of development. The data instead suggest that patterns of phonological development are best described in terms of the segmental categories they involve, in relative independence from measures of contrastiveness within the learners' lexicons.
|
41
|
Cortical Measures of Phoneme-Level Speech Encoding Correlate with the Perceived Clarity of Natural Speech. eNeuro 2018; 5:eN-NWR-0084-18. [PMID: 29662947 PMCID: PMC5900464 DOI: 10.1523/eneuro.0084-18.2018] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Received: 03/01/2018] [Accepted: 03/02/2018] [Indexed: 01/22/2023] Open
Abstract
In real-world environments, humans comprehend speech by actively integrating prior knowledge (P) and expectations with sensory input. Recent studies have revealed effects of prior information in temporal and frontal cortical areas and have suggested that these effects are underpinned by enhanced encoding of speech-specific features, rather than a broad enhancement or suppression of cortical activity. However, in terms of the specific hierarchical stages of processing involved in speech comprehension, the effects of integrating bottom-up sensory responses and top-down predictions are still unclear. In addition, it is unclear whether the predictability that comes with prior information may differentially affect speech encoding relative to the perceptual enhancement that comes with that prediction. One way to investigate these issues is through examining the impact of P on indices of cortical tracking of continuous speech features. Here, we did this by presenting participants with degraded speech sentences that either were or were not preceded by a clear recording of the same sentences while recording non-invasive electroencephalography (EEG). We assessed the impact of prior information on an isolated index of cortical tracking that reflected phoneme-level processing. Our findings suggest the possibility that prior information affects the early encoding of natural speech in a dual manner. Firstly, the availability of prior information, as hypothesized, enhanced the perceived clarity of degraded speech, which was positively correlated with changes in phoneme-level encoding across subjects. In addition, P induced an overall reduction of this cortical measure, which we interpret as resulting from the increase in predictability.
|
42
|
Seventeenth-Century 'double writing' schemes, and a 1676 letter in the phonetic script and real character of John Wilkins. NOTES AND RECORDS OF THE ROYAL SOCIETY OF LONDON 2018; 72:7-23. [PMID: 31390391 PMCID: PMC5906426 DOI: 10.1098/rsnr.2017.0041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/10/2023]
Abstract
Royal Society Classified Papers XVI contains a letter written in not one but two seemingly mysterious scripts. As a result, this letter has remained until now effectively illegible, and has been miscatalogued. These scripts are rare examples of the written forms devised by John Wilkins to accompany his proposals for an artificial language, published under the auspices of the Royal Society in 1668. This article therefore first correctly identifies and decodes this letter, which is shown to be from the Somersetshire clergyman Andrew Paschall to Robert Hooke in London in 1676, and then surveys other surviving texts written in Wilkins's scripts or language. Finally the article addresses the contents of the letter, namely its author's attempt to build a workable double writing device, in effect an early 'pantograph'. Designs for such instruments had been much touted in the 1650s, and the complex history of such proposals is unravelled properly for the first time.
|
43
|
Abstract
We present here a musical approach to speech melody, one that takes advantage of the intervallic precision made possible with musical notation. Current phonetic and phonological approaches to speech melody either assign localized pitch targets that impoverish the acoustic details of the pitch contours and/or merely highlight a few salient points of pitch change, ignoring all the rest of the syllables. We present here an alternative model using musical notation, which has the advantage of representing the pitch of all syllables in a sentence as well as permitting a specification of the intervallic excursions among syllables and the potential for group averaging of pitch use across speakers. We tested the validity of this approach by recording native speakers of Canadian English reading unfamiliar test items aloud, spanning from single words to full sentences containing multiple intonational phrases. The fundamental-frequency trajectories of the recorded items were converted from hertz into semitones, averaged across speakers, and transcribed into musical scores of relative pitch. Doing so allowed us to quantify local and global pitch-changes associated with declarative, imperative, and interrogative sentences, and to explore the melodic dynamics of these sentence types. Our basic observation is that speech is atonal. The use of a musical score ultimately has the potential to combine speech rhythm and melody into a unified representation of speech prosody, an important analytical feature that is not found in any current linguistic approach to prosody.
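The hertz-to-semitone conversion and cross-speaker averaging described in this abstract can be illustrated with a minimal sketch. It uses the standard equal-tempered relation of 12 semitones per octave; the choice of each speaker's geometric-mean f0 as the reference, and the function names, are assumptions made for illustration and are not taken from the paper.

```python
import math

def hz_to_semitones(f_hz, ref_hz):
    """Interval in semitones between f_hz and ref_hz (12 semitones per octave)."""
    if f_hz <= 0 or ref_hz <= 0:
        raise ValueError("frequencies must be positive")
    return 12.0 * math.log2(f_hz / ref_hz)

def speaker_contour(f0_hz):
    """Per-syllable f0 values (Hz) expressed in semitones relative to the
    speaker's own geometric-mean f0 (an assumed reference, see lead-in)."""
    ref = math.exp(sum(math.log(f) for f in f0_hz) / len(f0_hz))
    return [hz_to_semitones(f, ref) for f in f0_hz]

def average_contours(contours):
    """Average semitone contours syllable by syllable across speakers.
    All contours are assumed to have the same number of syllables."""
    return [sum(vals) / len(vals) for vals in zip(*contours)]

# Two hypothetical speakers an octave apart producing the same melodic shape.
low_voice = speaker_contour([110.0, 220.0])
high_voice = speaker_contour([220.0, 440.0])
group_mean = average_contours([low_voice, high_voice])
```

Because each contour is expressed relative to the speaker's own reference, voices in different registers yield comparable relative-pitch values before averaging.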
|
44
|
The role of linguistic experience in the processing of probabilistic information in production. LANGUAGE, COGNITION AND NEUROSCIENCE 2018; 33:211-226. [PMID: 29399595 PMCID: PMC5793886 DOI: 10.1080/23273798.2017.1375129] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 05/09/2023]
Abstract
Speakers track the probability that a word will occur in a particular context and utilize this information during phonetic processing. For example, content words that have high probability within a discourse tend to be realized with reduced acoustic/articulatory properties. Such probabilistic information may influence L1 and L2 speech processing in distinct ways (reflecting differences in linguistic experience across groups and the overall difficulty of L2 speech processing). To examine this issue, L1 and L2 speakers performed a referential communication task, describing sequences of simple actions. The two groups of speakers showed similar effects of discourse-dependent probabilistic information on production, suggesting that L2 speakers can successfully track discourse-dependent probabilities and use such information to modulate phonetic processing.
|
45
|
Practice makes perfect? The pedagogic value of online independent phonetic transcription practice for speech and language therapy students. CLINICAL LINGUISTICS & PHONETICS 2017; 32:249-266. [PMID: 28857633 DOI: 10.1080/02699206.2017.1350882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/07/2023]
Abstract
Accuracy of phonetic transcription is a core skill for speech and language therapists (SLTs) worldwide (Howard & Heselwood, 2002). The current study investigates the value of weekly independent online phonetic transcription tasks to support development of this skill in year one SLT students. Using a mixed methods observational design, students enrolled in a year one phonetics module completed 10 weekly homework activities in phonetic transcription on a stand-alone tutorial site (WebFon (Bates, Matthews & Eagles, 2010)) and 5 weekly online quizzes (the 'Ulster Set' (Titterington, unpublished)). Student engagement with WebFon was measured in terms of the number of responses made to 'sparks' on the University's Virtual Learning Environment Discussion Board. Measures of phonetic transcription accuracy were obtained for the 'Ulster Set' and for a stand-alone piece of coursework at the end of the module. Qualitative feedback about experience with the online learning was gathered via questionnaire. A positive significant association was found between student engagement with WebFon and performance in the 'Ulster Set', and between performance in the 'Ulster Set' and final coursework. Students valued both online independent learning resources as each supported different learning needs. However, student compliance with WebFon was significantly lower than with the 'Ulster Set'. Motivators and inhibitors to engagement with the online resources were investigated, identifying what best maximised engagement. These results indicate that while 'independent' online learning can support development of phonetic transcription skills, the activities must be carefully managed and constructively aligned to assessment, providing the level of valence necessary to ensure effective engagement.
|
46
|
Abstract
This essay outlines novel ways of communicating with patients by altering semantics, syntax, word use, or sounds. Language is viewed as a tool for coping with problems rather than a medium with which to mirror external reality or internal human nature. This view of language emerges from a pragmatic critique of truth. The broader goal of this essay is to weave together the philosophy of pragmatism, especially as it has been articulated by Richard Rorty, with the theory and practice of psychoanalysis. Clinical case examples are discussed.
|
47
|
Abstract
This study offers evidence for an environmental effect on languages while relying on continuous linguistic and continuous ecological variables. Evidence is presented for a positive association between the typical ambient humidity of a language’s native locale and that language’s degree of reliance on vowels. The vowel-usage rates of over 4000 language varieties were obtained, and several methods were employed to test whether these usage rates are associated with ambient humidity. The results of these methods are generally consistent with the notion that reduced ambient humidity eventually yields a reduced reliance of languages on vowels, when compared to consonants. The analysis controls simultaneously for linguistic phylogeny and contact between languages. The results dovetail with previous work, based on binned data, suggesting that consonantal phonemes are more common in some ecologies. In addition to being based on continuous data and a larger data sample, however, these findings are tied to experimental research suggesting that dry air affects the behavior of the larynx by yielding increased phonatory effort. The results of this study are also consistent with previous work suggesting an interaction of aridity and tonality. The data presented here suggest that languages may evolve, like the communication systems of other species, in ways that are influenced subtly by ecological factors. It is stressed that more work is required, however, to explore this association and to establish a causal relationship between ambient air characteristics and the development of languages.
|
48
|
Mapping the Speech Code: Cortical Responses Linking the Perception and Production of Vowels. Front Hum Neurosci 2017; 11:161. [PMID: 28439232 PMCID: PMC5383703 DOI: 10.3389/fnhum.2017.00161] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 10/17/2016] [Accepted: 03/17/2017] [Indexed: 11/13/2022] Open
Abstract
The acoustic realization of speech is constrained by the physical mechanisms by which it is produced. Yet for speech perception, the degree to which listeners utilize experience derived from speech production has long been debated. In the present study, we examined how sensorimotor adaptation during production may affect perception, and how this relationship may be reflected in early vs. late electrophysiological responses. Participants first performed a baseline speech production task, followed by a vowel categorization task during which EEG responses were recorded. In a subsequent speech production task, half the participants received shifted auditory feedback, leading most to alter their articulations. This was followed by a second, post-training vowel categorization task. We compared changes in vowel production to both behavioral and electrophysiological changes in vowel perception. No differences in phonetic categorization were observed between groups receiving altered or unaltered feedback. However, exploratory analyses revealed correlations between vocal motor behavior and phonetic categorization. EEG analyses revealed correlations between vocal motor behavior and cortical responses in both early and late time windows. These results suggest that participants' recent production behavior influenced subsequent vowel perception. We suggest that the change in perception can be best characterized as a mapping of acoustics onto articulation.
|
49
|
Phonetic analysis during treatment with rapid maxillary expander. Orthod Craniofac Res 2017; 20:21-29. [PMID: 28102014 DOI: 10.1111/ocr.12136] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Accepted: 11/27/2016] [Indexed: 11/29/2022]
Abstract
OBJECTIVES To investigate possible changes and/or device-related impairments in phonetic habits produced by rapid maxillary expansion (RME). MATERIALS AND METHODS Thirty-five patients scheduled for RME were divided into two groups: Group A (banded two-arm Hyrax) and Group B (banded four-arm Hyrax). Speech samples were collected at six time points, before, during and after RME removal. Acoustical analysis was performed using PRAAT and BioVoice analysis tools. Ten volunteers completed a questionnaire on the acceptability of the patients' speech. Maxillary dimensions and palatal volume were measured on dental casts before and after expansion using a digital gauge. RESULTS Voice analysis showed an increase in the peak frequency of fricative consonants (/s/,/ʃ/) after expansion, whereas there was no change of formant frequencies of palatal consonants (/ɲ/,/ʎ/). Vowel /i/ displayed a lowering of the first formant frequency, and an increase in the second and third formant frequencies. After bonding, Group B showed both a greater reduction in the peak frequency of fricatives and a greater increase in the formant frequencies of palatal consonants than Group A. CONCLUSION Rapid maxillary expansion causes a slight phonetic change in the acoustical parameters of both consonants and vowels. The two-arm Hyrax caused less speech impairment than the four-arm Hyrax during the treatment.
|
50
|
Abstract
Infants struggle to apply earlier-demonstrated sound-discrimination abilities to later word learning, attending to non-contrastive acoustic dimensions (e.g., Hay et al., 2015), and not always to contrastive dimensions (e.g., Stager & Werker, 1997). One hint about the nature of infants' difficulties comes from the observation that input from multiple talkers can improve word learning (Rost & McMurray, 2009). This may be because, when a single talker says both of the to-be-learned words, consistent talker's-voice characteristics make the acoustics of the two words more overlapping (Apfelbaum & McMurray, 2011). Here, we test that notion. We taught 14-month-old infants two similar-sounding words in the Switch habituation paradigm. The same amount of overall talker variability was present as in prior multiple-talker experiments, but male and female talkers said different words, creating a gender-word correlation. Under an acoustic-similarity account, correlated talker gender should help to separate words acoustically and facilitate learning. Instead, we found that correlated talker gender impaired learning of word-object pairings compared with uncorrelated talker gender (even when gender-word pairings were always maintained in test), casting doubt on one account of the beneficial effects of talker variability. We discuss several alternate potential explanations for this effect.
|