1
|
Wang C, Xu Y, Zhang J. Functional timing or rhythmical timing, or both? A corpus study of English and Mandarin duration. Front Psychol 2023; 13:869049. [PMID: 36743611 PMCID: PMC9895962 DOI: 10.3389/fpsyg.2022.869049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 12/19/2022] [Indexed: 01/22/2023] Open
Abstract
It has been long held that languages of the world are divided into rhythm classes so that they are either stress-timed, syllable-timed or mora-timed. It is also known for a long time that duration serves various informational functions in speech. But it is unclear whether these two kinds of uses of duration are complementary to each other, or they are actually one and the same. There has been much empirical research that raises questions about the rhythm class hypothesis due to lack of evidence of the suggested isochrony in any language. Yet the alleged cross-language rhythm classification is still widely taken for granted and continues to be researched. Here we conducted a corpus study of English, an archetype of a stress-timed language, and Mandarin, an alleged syllable-timed language, to look for evidence of at least a tendency toward isochrony when much of the informational use of duration is controlled for. We examined the relationship between segment and syllable duration and the relationship of syllable and phrase duration in the two languages. The results show that in English syllables are largely incompressible to allow stress-timing because segment duration is inflexible to allow variable syllable duration beyond its functional use. Surprisingly, Mandarin does show a small tendency toward both equal syllable duration and equal phrase duration. Additionally, the duration of pre-boundary syllables in English increases linearly with break index, whereas in Mandarin, the duration increase stops after break index 2, which is accompanied by the insertion of silent pauses. We conclude, therefore, timing and duration in speech are predominantly used for encoding information rather being controlled by a rhythmic principle, and the residual equal-duration tendency in the two languages examined here show exactly the opposite patterns from the predictions of the rhythm class hypothesis.
Collapse
Affiliation(s)
- Chengxia Wang
- Department of Speech, Hearing, and Phonetic Sciences, University College London, London, United Kingdom,*Correspondence: Chengxia Wang, ✉
| | - Yi Xu
- Department of Speech, Hearing, and Phonetic Sciences, University College London, London, United Kingdom
| | - Jinsong Zhang
- School of Information Science, Beijing Language and Culture University, Beijing, China
| |
Collapse
|
2
|
Peter V, van Ommen S, Kalashnikova M, Mazuka R, Nazzi T, Burnham D. Language specificity in cortical tracking of speech rhythm at the mora, syllable, and foot levels. Sci Rep 2022; 12:13477. [PMID: 35931787 PMCID: PMC9356059 DOI: 10.1038/s41598-022-17401-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 07/25/2022] [Indexed: 11/29/2022] Open
Abstract
Recent research shows that adults’ neural oscillations track the rhythm of the speech signal. However, the extent to which this tracking is driven by the acoustics of the signal, or by language-specific processing remains unknown. Here adult native listeners of three rhythmically different languages (English, French, Japanese) were compared on their cortical tracking of speech envelopes synthesized in their three native languages, which allowed for coding at each of the three language’s dominant rhythmic unit, respectively the foot (2.5 Hz), syllable (5 Hz), or mora (10 Hz) level. The three language groups were also tested with a sequence in a non-native language, Polish, and a non-speech vocoded equivalent, to investigate possible differential speech/nonspeech processing. The results first showed that cortical tracking was most prominent at 5 Hz (syllable rate) for all three groups, but the French listeners showed enhanced tracking at 5 Hz compared to the English and the Japanese groups. Second, across groups, there were no differences in responses for speech versus non-speech at 5 Hz (syllable rate), but there was better tracking for speech than for non-speech at 10 Hz (not the syllable rate). Together these results provide evidence for both language-general and language-specific influences on cortical tracking.
Collapse
Affiliation(s)
- Varghese Peter
- MARCS Institute for Brain Behaviour and Development, Western Sydney University, Penrith, NSW, Australia. .,School of Health and Behavioural Sciences, University of the Sunshine Coast, Sippy Downs, Australia.
| | - Sandrien van Ommen
- Integrative Neuroscience and Cognition Center, CNRS-Université Paris Cité, Paris, France.,Neurosciences Fondamentales, University of Geneva, Geneva, Switzerland
| | - Marina Kalashnikova
- MARCS Institute for Brain Behaviour and Development, Western Sydney University, Penrith, NSW, Australia.,BCBL, Basque Center on Cognition, Brain and Language, San Sebastian, Guipuzcoa, Spain.,IKERBASQUE, Basque Foundation for Science, Bilbao, Bizcaya, Spain
| | - Reiko Mazuka
- Laboratory for Language Development, RIKEN Center for Brain Science, Saitama, Japan.,Department of Psychology and Neuroscience, Duke University, Durham, NC, USA
| | - Thierry Nazzi
- Integrative Neuroscience and Cognition Center, CNRS-Université Paris Cité, Paris, France
| | - Denis Burnham
- MARCS Institute for Brain Behaviour and Development, Western Sydney University, Penrith, NSW, Australia
| |
Collapse
|
3
|
Iwamoto K, Kikuchi H, Mazuka R. Speech rate development in Japanese-speaking children and proficiency in mora-timed rhythm. J Exp Child Psychol 2022; 220:105411. [DOI: 10.1016/j.jecp.2022.105411] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Revised: 02/10/2022] [Accepted: 02/16/2022] [Indexed: 10/18/2022]
|
4
|
Sato K, Genda J, Minabe R, Taniguchi T. Characteristics of Japanese Electrolaryngeal Speech Produced by Untrained Speakers: An Observational Study Involving Healthy Volunteers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3786-3793. [PMID: 34546765 DOI: 10.1044/2021_jslhr-21-00069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Purpose The aim of this study was to investigate the characteristics of electrolaryngeal (EL) speech among untrained speakers to aid in its effective introduction and to identify syllables and words that are easy or difficult to pronounce. Method A total of 21 healthy individuals who had never used an EL were included. The participants were briefed, and tests comprising 100 Japanese syllables and 50 single words were conducted to evaluate EL speech intelligibility. A trained speaker was defined as a certified speech-language pathologist who underwent EL training for 3 months. A 5-point electrolarynx effectivity score (EES) was used for the subjective assessment of EL. Results The median (interquartile range) intelligibility scores of the untrained and trained groups were 24.0% (20.0%-34.0%) and 40.0% (36.0%-45.0%) for syllables and 48.0% (38.0%-60.0%) and 88.0% (82.0%-90.0%) for words, respectively. The intelligibility scores for syllables and words were higher in the trained group than those in the untrained group. Only two syllable subgroups (/m/ and /w/) had > 80% correct answers among untrained speakers. A total of 14 syllable subgroups (/k, kʲ, s, ɕ, t, t͡ɕ, ts, ɲ, h, ç, ɸ, p, pʲ, and a/), a number of which contained voiceless consonants, had < 40% correct answers among both speaker groups. A greater number of morae were associated with higher intelligibility scores. An EES of 4, indicating that the EL was effective, was the most frequent score. Conclusions It was difficult for untrained speakers to produce intelligible speech using an EL. Syllables, including voiceless consonants, were difficult to pronounce using an EL. Longer words with a greater number of morae were more intelligible, even for untrained EL speakers. Supplemental Material https://doi.org/10.23641/asha.16632622.
Collapse
Affiliation(s)
- Koji Sato
- Intensive Care Unit, Kanazawa University Hospital, Japan
| | - Junji Genda
- Department of Rehabilitation, Kanazawa University Hospital, Japan
| | - Ryoya Minabe
- Department of Rehabilitation, Kanazawa University Hospital, Japan
| | - Takumi Taniguchi
- Intensive Care Unit, Kanazawa University Hospital, Japan
- Department of Anesthesiology and Intensive Care Medicine, Graduate School of Medical Sciences, Kanazawa University, Japan
| |
Collapse
|
5
|
Hirose K, Asano K, Sakaguchi M, Nagao A, Nakahira M, Doi N, Kobayashi T, Hyodo M. Post-treatment clinical course following botulinum toxin injection therapy for adductor spasmodic dysphonia: Analysis of data from a placebo-controlled, randomized, double-blinded clinical trial in Japan. Laryngoscope Investig Otolaryngol 2021; 6:1088-1095. [PMID: 34667852 PMCID: PMC8513418 DOI: 10.1002/lio2.669] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 08/22/2021] [Accepted: 09/15/2021] [Indexed: 11/08/2022] Open
Abstract
OBJECTIVE Botulinum toxin (BT) therapy is a first-line treatment for spasmodic dysphonia (SD). However, a detailed chronological course and clinical factors that affect the therapeutic effect have been vague. In this study, we analyzed the data from our placebo-controlled, randomized, double-blinded parallel-group comparison/open-label clinical trial of BT (Botox) to clarify these. METHODS A total of 22 patients with abductor SD (ADSD) were enrolled. The female-to-male ratio was 20:2 with a mean age of 40.0 ± 10.3 years and a median duration of symptoms of 7.5 years. The therapeutic effect was evaluated based on the change in the number of aberrant morae (phonemes), GRBAS scale, Voice Handicap Index (VHI), and Visual Analogue Scale (VAS). RESULTS The change in the number of aberrant morae peaked at 2 weeks and lasted for 12 weeks in the BT group with significance (P < .01) compared to the placebo group. Objective improvement (number of aberrant morae and [S] element in GRBAS) preceded subjective improvement (VHI and VAS). The change in number of aberrant morae and VHI showed a significant correlation (P < .01). The changes in the number of aberrant morae, VHI, and VAS in younger subjects were greater than in older subjects. Patients who presented with post-treatment breathy hoarseness or dysphagia showed better therapeutic effects. CONCLUSIONS BT therapy was effective for ADSD based on both objective and subjective assessments. Improvements in subjective parameters were delayed compared to objective measures due to post-treatment breathy hoarseness. However, this adverse event was believed to reflect the treatment effect. LEVEL OF EVIDENCE 1b.
Collapse
Affiliation(s)
- Kahori Hirose
- Department of Otolaryngology‐Head and Neck SurgeryKochi Medical SchoolNankokuKochiJapan
| | - Kento Asano
- Integrated Center for Advanced Medical Technologies, Kochi Medical School HospitalNankokuKochiJapan
- Department of Medical InnovationOsaka University HospitalSuitaOsakaJapan
| | - Masahiko Sakaguchi
- Integrated Center for Advanced Medical Technologies, Kochi Medical School HospitalNankokuKochiJapan
- Department of Engineering InformaticsOsaka Electro‐Communication UniversityNeyagawaOsakaJapan
| | - Asuka Nagao
- Department of Otolaryngology‐Head and Neck SurgeryKochi Medical SchoolNankokuKochiJapan
| | - Maya Nakahira
- Rehabilitation DepartmentKochi Medical School HospitalNankokuKochiJapan
| | - Nao Doi
- Rehabilitation DepartmentKochi Medical School HospitalNankokuKochiJapan
| | - Taisuke Kobayashi
- Department of Otolaryngology‐Head and Neck SurgeryKochi Medical SchoolNankokuKochiJapan
| | - Masamitsu Hyodo
- Department of Otolaryngology‐Head and Neck SurgeryKochi Medical SchoolNankokuKochiJapan
| |
Collapse
|
6
|
Hyodo M, Nagao A, Asano K, Sakaguchi M, Mizoguchi K, Omori K, Tada Y, Hatakeyama H, Oridate N, Naito K, Iwata Y, Shinomiya H, Hara H, Sanuki T, Yumoto E. Botulinum toxin injection into the intrinsic laryngeal muscles to treat spasmodic dysphonia: A multicenter, placebo-controlled, randomized, double-blinded, parallel-group comparison/open-label clinical trial. Eur J Neurol 2021; 28:1548-1556. [PMID: 33393175 PMCID: PMC8248427 DOI: 10.1111/ene.14714] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 12/29/2020] [Accepted: 12/30/2020] [Indexed: 11/30/2022]
Abstract
Background and purpose Botulinum toxin (BT) injection into the laryngeal muscles has been a standard treatment for spasmodic dysphonia (SD). However, few high‐quality clinical studies have appeared, and BT is used off‐label in most countries. Methods We performed a multicenter, placebo‐controlled, randomized, double‐blinded, parallel‐group comparison/open‐label clinical trial to obtain approval for BT (Botox) therapy in Japan. Twenty‐four patients (22 with adductor SD and two with abductor SD) were enrolled. The primary end point was the change in the number of aberrant morae (phonemes) at 4 weeks after drug injection. The secondary end points included the change in the number of aberrant morae, GRBAS scale, Voice Handicap Index (VHI), and visual analog scale (VAS) over the entire study period. Results In the adductor SD group, the number of aberrant morae at 4 weeks after injection was reduced by 7.0 ± 2.30 (mean ± SE) in the BT group and 0.2 ± 0.46 in the placebo group (p = 0.0148). The improvement persisted for 12 weeks following BT injections. The strain element in GRBAS scale significantly reduced at 2 weeks after BT treatment. The VHI and VAS scores as subjective parameters also improved. In the abductor SD group, one patient responded to treatment. Adverse events included breathy hoarseness (77.3%) and aspiration when drinking (40.9%) but were mild and resolved in 4 weeks. Conclusions Botulinum toxin injection was safe and efficacious for the treatment of SD. Based on these results, BT injection therapy was approved as an SD treatment in Japan.
Collapse
Affiliation(s)
- Masamitsu Hyodo
- Department of Otolaryngology-Head and Neck Surgery, Kochi University, Nankoku, Japan
| | - Asuka Nagao
- Department of Otolaryngology-Head and Neck Surgery, Kochi University, Nankoku, Japan
| | - Kento Asano
- Department of Medical Innovation, Osaka University Hospital, Suita, Japan
| | - Masahiko Sakaguchi
- Department of Engineering Informatics, Osaka Electro Communication University, Neyagawa, Japan
| | - Kenji Mizoguchi
- Department of Otolaryngology-Head and Neck Surgery, Hokkaido University, Sapporo, Japan
| | - Koichi Omori
- Department of Otolaryngology-Head and Neck Surgery, Kyoto University, Kyoto, Japan
| | - Yasuhiro Tada
- Department of Otolaryngology, Fukushima Medical University, Fukushima, Japan
| | | | - Nobuhiko Oridate
- Department of Otolaryngology, Yokohama City University, Yokohama, Japan
| | - Kensei Naito
- Academy of Nursing, Fujita Health University, Toyoake, Japan
| | - Yoshihiro Iwata
- Department of Otolaryngology-Head and Neck Surgery, Fujita Health University, Toyoake, Japan
| | - Hirotaka Shinomiya
- Department of Otolaryngology-Head and Neck Surgery, Kobe University, Kobe
| | - Hirotaka Hara
- Department of Otolaryngology, Kawasaki Medical School, Kurashiki, Japan
| | - Tetsuji Sanuki
- Department of Otolaryngology, Nagoya City University, Nagoya, Japan
| | - Eiji Yumoto
- Department of Otolaryngology, Kumamoto University, Kumamoto, Japan
| |
Collapse
|
7
|
Shaw JA, Kawahara S. Effects of Surprisal and Entropy on Vowel Duration in Japanese. LANGUAGE AND SPEECH 2019; 62:80-114. [PMID: 29105604 DOI: 10.1177/0023830917737331] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Research on English and other languages has shown that syllables and words that contain more information tend to be produced with longer duration. This research is evolving into a general thesis that speakers articulate linguistic units with more information more robustly. While this hypothesis seems plausible from the perspective of communicative efficiency, previous support for it has come mainly from English and some other Indo-European languages. Moreover, most previous studies focus on global effects, such as the interaction of word duration and sentential/semantic predictability. The current study is focused at the level of phonotactics, exploring the effects of local predictability on vowel duration in Japanese, using the Corpus of Spontaneous Japanese. To examine gradient consonant-vowel phonotactics within a consonant-vowel-mora, consonant-conditioned Surprisal and Shannon Entropy were calculated, and their effects on vowel duration were examined, together with other linguistic factors that are known from previous research to affect vowel duration. Results show significant effects of both Surprisal and Entropy, as well as notable interactions with vowel length and vowel quality. The effect of Entropy is stronger on peripheral vowels than on central vowels. Surprisal has a stronger positive effect on short vowels than on long vowels. We interpret the main patterns and the interactions by conceptualizing Surprisal as an index of motor fluency and Entropy as an index of competition in vowel selection.
Collapse
|
8
|
Minai U, Gustafson K, Fiorentino R, Jongman A, Sereno J. Fetal rhythm-based language discrimination: a biomagnetometry study. Neuroreport 2018; 28:561-564. [PMID: 28538518 DOI: 10.1097/wnr.0000000000000794] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Using fetal biomagnetometry, this study measured changes in fetal heart rate to assess discrimination of two rhythmically different languages (English and Japanese). Two-minute passages in English and Japanese were read by the same female bilingual speaker. Twenty-four mother-fetus pairs (mean gestational age=35.5 weeks) participated. Fetal magnetocardiography was recorded while the participants were presented first with passage 1, a passage in English, and then, following an 18 min interval, with passage 2, either a different passage in English (English-English condition: N=12) or in Japanese (English-Japanese condition: N=12). The fetal magnetocardiogram was reconstructed following independent components analysis decomposition. The mean interbeat intervals were calculated for a 30 s baseline interval directly preceding each passage and for the first 30 s of each passage. We then subtracted the mean interbeat interval of the 30 s baseline interval from that of the first 30 s interval, yielding an interbeat interval change value for each passage. A significant interaction between condition and passage indicated that the English-Japanese condition elicited a more robust interbeat interval change for passage 2 (novelty phase) than for passage 1 (familiarity phase), reflecting a faster heart rate during passage 2, whereas the English-English condition did not. This effect indicates that fetuses are sensitive to the change in language from English to Japanese. These findings provide the first evidence for fetal language discrimination as assessed by fetal biomagnetometry and support the hypothesis that rhythm constitutes a prenatally available building block in language acquisition.
Collapse
Affiliation(s)
- Utako Minai
- aDepartment of Linguistics, University of Kansas, Lawrence bHoglund Brain Imaging Center, University of Kansas Medical Center, Kansas City, Kansas, USA
| | | | | | | | | |
Collapse
|
9
|
Whang J. Recoverability-driven coarticulation: Acoustic evidence from Japanese high vowel devoicing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:1159. [PMID: 29495746 DOI: 10.1121/1.5024893] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
High vowel devoicing in Japanese, where /i, u/ in a C1VC2 sequence devoice when both C1 and C2 are voiceless, has been studied extensively, but factors that contribute to the devoiced vowels' likelihood of complete deletion is still debated. This study examines the effects of phonotactic predictability on the deletion of devoiced vowels. Native Tokyo Japanese speakers (N = 22) were recorded in a sound-attenuated booth reading sentences containing lexical stimuli. C1 of the stimuli were /k, ʃ/, after which either high vowel can occur, and /ʧ, ϕ, s, ç/, after which only one of the two occurs. C2 was always a stop. C1 duration and center of gravity (COG), the amplitude weighted mean of frequencies present in a signal, were measured. Duration results show that devoicing lengthens only non-fricatives, while it has either no effect or a shortening effect on fricatives. COG results show that coarticulatory effects of devoiced vowels are evident in /k, ʃ/ but not in /ʧ, ϕ, s, ç/. Devoiced high vowels, therefore, seem to be more likely to delete when the vowel is phonotactically predictable than when it is unpredictable.
Collapse
Affiliation(s)
- James Whang
- Department of Linguistics, New York University, 10 Washington Place, New York, New York 10003, USA
| |
Collapse
|
10
|
Kawahara S. Durational compensation within a CV mora in spontaneous Japanese: Evidence from the Corpus of Spontaneous Japanese. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:EL143. [PMID: 28764476 DOI: 10.1121/1.4994674] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Previous experimental studies showed that in Japanese, vowels are longer after shorter onset consonants; there is durational compensation within a CV-mora. In order to address whether this compensation occurs in natural speech, this study re-examines this observation using the Corpus of Spontaneous Japanese. The results, which are based on more than 200 000 CV-mora tokens, show that there is a negative correlation between the onset consonant and the following vowel in terms of their duration. The statistical significance of this negative correlation is assessed by a traditional correlation analysis as well as a bootstrap resampling analysis, which both show that it is unlikely that the observed compensation effect occurred by chance. The compensation is not perfect, however, suggesting that it is a stochastic tendency rather than an absolute principle. This paper closes with a discussion of potential factors that may interact with the durational compensation effect.
Collapse
Affiliation(s)
- Shigeto Kawahara
- The Institute of Cultural and Linguistic Studies, Keio University, 2-15-45 Mita, Minato-ku, Tokyo, Japan
| |
Collapse
|
11
|
Frost RLA, Monaghan P, Tatsumi T. Domain-general mechanisms for speech segmentation: The role of duration information in language learning. J Exp Psychol Hum Percept Perform 2016; 43:466-476. [PMID: 27893268 PMCID: PMC5327892 DOI: 10.1037/xhp0000325] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Speech segmentation is supported by multiple sources of information that may either inform language processing specifically, or serve learning more broadly. The Iambic/Trochaic Law (ITL), where increased duration indicates the end of a group and increased emphasis indicates the beginning of a group, has been proposed as a domain-general mechanism that also applies to language. However, language background has been suggested to modulate use of the ITL, meaning that these perceptual grouping preferences may instead be a consequence of language exposure. To distinguish between these accounts, we exposed native-English and native-Japanese listeners to sequences of speech (Experiment 1) and nonspeech stimuli (Experiment 2), and examined segmentation using a 2AFC task. Duration was manipulated over 3 conditions: sequences contained either an initial-item duration increase, or a final-item duration increase, or items of uniform duration. In Experiment 1, language background did not affect the use of duration as a cue for segmenting speech in a structured artificial language. In Experiment 2, the same results were found for grouping structured sequences of visual shapes. The results are consistent with proposals that duration information draws upon a domain-general mechanism that can apply to the special case of language acquisition. This study shows that adults prefer to group both sequences of shapes and sequences of speech with the final item as the one that has the longest duration. This suggests that final-item duration increase is a helpful cue for perceptual grouping in multiple domains—not just language processing. By testing native speakers of languages that use duration differently, we also show that this grouping preference does not arise from language experience.
Collapse
|
12
|
Katayama T. Effect of Phonotactic Constraints on Second Language Speech Processing. Iperception 2016; 6:2041669515615714. [PMID: 27551360 PMCID: PMC4975118 DOI: 10.1177/2041669515615714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
In this study, we examined whether phonotactic constraints of the first language affect speech processing by Japanese learners of English and whether their proficiency of the second language influences it. Native English speakers and second language speakers with a high level of language proficiency and those with a low level took part in a monitoring task. They were given two kinds of sound stimuli as target syllables (i.e., consonant–vowel and consonant–vowel–consonant) and were asked to detect them in lists of words that have stress on the first or second syllable (e.g., biscuit and beside). The results showed that both stress and phonotactics facilitated segmentation strategies by the three groups. The Japanese groups did not rely on either phonotactics or morae to segment the target syllables. They rather used stress to detect the target syllables in the English words, which is a different segmentation strategy from their first language. This study showed that phonotactic constraints did not interfere with second language processing by native Japanese speakers and provided evidence that second language speakers use the segmentation strategy that is used by native speakers of the target language.
Collapse
Affiliation(s)
- Tamami Katayama
- Department of Life Science, Prefectural University of Hiroshima, Japan
| |
Collapse
|
13
|
Nakayama M, Kinoshita S, Verdonschot RG. The Emergence of a Phoneme-Sized Unit in L2 Speech Production: Evidence from Japanese-English Bilinguals. Front Psychol 2016; 7:175. [PMID: 26941669 PMCID: PMC4763048 DOI: 10.3389/fpsyg.2016.00175] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Accepted: 01/29/2016] [Indexed: 11/13/2022] Open
Abstract
Recent research has revealed that the way phonology is constructed during word production differs across languages. Dutch and English native speakers are suggested to incrementally insert phonemes into a metrical frame, whereas Mandarin Chinese speakers use syllables and Japanese speakers use a unit called the mora (often a CV cluster such as "ka" or "ki"). The present study is concerned with the question how bilinguals construct phonology in their L2 when the phonological unit size differs from the unit in their L1. Japanese-English bilinguals of varying proficiency read aloud English words preceded by masked primes that overlapped in just the onset (e.g., bark-BENCH) or the onset plus vowel corresponding to the mora-sized unit (e.g., bell-BENCH). Low-proficient Japanese-English bilinguals showed CV priming but did not show onset priming, indicating that they use their L1 phonological unit when reading L2 English words. In contrast, high-proficient Japanese-English bilinguals showed significant onset priming. The size of the onset priming effect was correlated with the length of time spent in English-speaking countries, which suggests that extensive exposure to L2 phonology may play a key role in the emergence of a language-specific phonological unit in L2 word production.
Collapse
Affiliation(s)
- Mariko Nakayama
- Faculty of Letters, Arts and Sciences, Waseda University Tokyo, Japan
| | - Sachiko Kinoshita
- ARC Centre of Excellence in Cognition and its Disorders, SydneyNSW, Australia; Department of Psychology, Macquarie University, SydneyNSW, Australia
| | | |
Collapse
|
14
|
Lu S, Chen L, Yamada J. Taiwanese Learners’ Durational Patterns for Japanese Obstruent Geminates/Singletons and Adjacent Vowels 1. Percept Mot Skills 2016; 122:67-75. [DOI: 10.1177/0031512516630018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
This study measured the durations of Japanese obstruent geminates/singletons and their adjacent vowels (V1 and V2) embedded in words repeatedly spoken by four intermediate-level Taiwanese learners of Japanese. The results indicated that the four learners were more or less at a native-like level in terms of geminate lengthening, but not in terms of the V1/V2 duration ratio. These findings suggest that for Chinese learners of Japanese, the learning of Japanese geminates consists of two stages: geminate/singleton phonemic contrast learning, followed by V1/V2 phonetic contrast learning. It is suggested that explicit instruction may be effective at the second stage.
Collapse
Affiliation(s)
- Shihying Lu
- Graduate School of Integrated Arts and Sciences, Hiroshima University, Japan
| | - Lijen Chen
- Department of Applied Japanese Linguistics, Ming Chuan University, Taiwan
| | - Jun Yamada
- Graduate School of Integrated Arts and Sciences, Hiroshima University, Japan
| |
Collapse
|
15
|
Roncaglia-Denissen MP, Schmidt-Kassow M, Heine A, Vuust P, Kotz SA. Enhanced musical rhythmic perception in Turkish early and late learners of German. Front Psychol 2013; 4:645. [PMID: 24065946 PMCID: PMC3778315 DOI: 10.3389/fpsyg.2013.00645] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 08/20/2013] [Indexed: 11/13/2022] Open
Abstract
As language rhythm relies partly on general acoustic properties, such as intensity and duration, mastering two languages with distinct rhythmic properties (i.e., stress position) may enhance musical rhythm perception. We investigated whether competence in a second language (L2) with different rhythmic properties than a L1 affects musical rhythm aptitude. Turkish early (TELG) and late learners (TLLG) of German were compared to German late L2 learners of English (GLE) regarding their musical rhythmic aptitude. While Turkish and German present distinct linguistic rhythm and metric properties, German and English are rather similar in this regard. To account for inter-individual differences, we measured participants' short-term and working memory (WM) capacity, melodic aptitude, and time they spent listening to music. Both groups of Turkish L2 learners of German perceived rhythmic variations significantly better than German L2 learners of English. No differences were found between early and late learners' performance. Our findings suggest that mastering two languages with different rhythmic properties enhances musical rhythm perception, providing further evidence of shared cognitive resources between language and music.
Collapse
Affiliation(s)
- M Paula Roncaglia-Denissen
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences Leipzig, Germany
| | | | | | | | | |
Collapse
|
16
|
Hirata Y, Amano S. Production of single and geminate stops in Japanese three- and four-mora words. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:1614-1625. [PMID: 22978890 DOI: 10.1121/1.4730975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
This study examined the durational structure of single and geminate stop distinction produced in three- and four-mora words of Japanese, (C(1))V(1)(C(2))C(2)V(2)X [(C(2))C(2) = the contrasting consonants; X = a CV mora, the moraic nasal, or a long vowel as part of V(2)]. The questions addressed were how factors such as speaking rate, segmental variability, and moraic composition of words affected the stop quantity distinction in words longer than well-studied disyllabic words, and whether there exists an invariant parameter that classified these two stop categories. Results showed that all of those factors systematically affected the duration of the contrasting stop closure, the unit of [(C(1))V(1)(C(2))C(2)V(2)], and the entire three- and four-mora words. However, the durational units of moras and words were well-structured, and the ratio of the contrasting stop closure to the [(C(1))V(1)(C(2))C(2)V(2)] unit, as well as the ratio of the closure to the entire word, were found to be invariant in indicating the stop quantity distinction. These results support the theory of relational acoustic invariance [Pickett et al., Phonetica 56, 135-157 (1999)] on the part of production. Furthermore, the results provide insight into different versions of Japanese mora hypothesis [Han, The Study of Sounds 10, 65-80 (1962); Port et al., J. Acoust. Soc. Am. 81(5), 1574-1585 (1987)], which have been under debate for five decades.
Collapse
Affiliation(s)
- Yukari Hirata
- Colgate University, Department of East Asian Languages and Literatures, 13 Oak Drive, Hamilton, New York 13346, USA.
| | | |
Collapse
|
17
|
Arvaniti A. The usefulness of metrics in the quantification of speech rhythm. JOURNAL OF PHONETICS 2012; 40:351-373. [PMID: 0 DOI: 10.1016/j.wocn.2012.02.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
|
18
|
Amano S, Hirata Y. Perception and production boundaries between single and geminate stops in Japanese. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:2049-2058. [PMID: 20968375 DOI: 10.1121/1.3458847] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
The theory of relational acoustic invariance [Pickett, E. R., et al. (1999). Phonetica 56, 135-157] was tested with the Japanese stop quantity distinction in disyllables spoken at various rates. The questions were whether the perceptual boundary between the two phonemic categories of single and geminate stops is invariant across rates, and whether there is a close correspondence between the perception and production boundaries. The durational ratio of stop closure to word (where the "word" was defined as disyllables) was previously found to be an invariant parameter that classified the two categories in production, but the present study found that this ratio varied with different speaking rates in perception. However, regression and discriminant analyses of perception and production data showed that treating stop closure as a function of word duration with an intercept term represented the perception and production boundaries very well. This result indicated that the durational ratio of adjusted stop closure (i.e., closure with an added constant) to the word was invariant and distinguished the two phonemic categories clearly. Taken together, the results support the relational acoustic invariance theory, and help refine the theory with regard to exactly what form 'invariance' can take.
Collapse
Affiliation(s)
- Shigeaki Amano
- Faculty of Human Informatics, Aichi Shukutoku University, Japan.
| | | |
Collapse
|
19
|
Takeichi H, Koyama S, Terao A, Takeuchi F, Toyosawa Y, Murohashi H. Comprehension of degraded speech sounds with m-sequence modulation: an fMRI study. Neuroimage 2009; 49:2697-706. [PMID: 19878726 DOI: 10.1016/j.neuroimage.2009.10.063] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2009] [Revised: 10/16/2009] [Accepted: 10/21/2009] [Indexed: 11/28/2022] Open
Abstract
In a recent electroencephalography (EEG) study (Takeichi et al., 2007a), we developed a new technique for assessing speech comprehension using speech degraded by m-sequence modulation and found a correlation peak with a 400-ms delay. This peak depended on the comprehensibility of the modulated speech sounds. Here we report the results of a functional magnetic resonance imaging (fMRI) experiment comparable to our previous EEG experiment. We examined brain areas related to verbal comprehension of the modulated speech sound to examine which neural system processes this modulated speech. A non-integer, alternating-block factorial design was used with 23 Japanese-speaking participants, with time reversal and m-sequence modulation as factors. A main effect of time reversal was found in the left temporal cortex along the superior temporal sulcus (BA21 and BA39), left precentral gyrus (BA6) and right inferior temporal gyrus (BA21). A main effect of modulation was found in the left postcentral gyrus (BA43) and the right medial frontal gyri (BA6) as an increase by modulation and in the left temporal cortex (BA21, 39), parahippocampal gyrus (BA34), posterior cingulate (BA23), caudate and thalamus and right superior temporal gyrus (BA38) as a decrease by modulation. An interaction effect associated specifically with non-modulated speech was found in the left frontal gyrus (BA47), left occipital cortex in the cuneus (BA18), left precuneus (BA7, 31), right precuneus (BA31) and right thalamus (forward>reverse). The other interaction effect associated specifically with modulation of speech sound was found in the inferior frontal gyrus in the opercular area (BA44) (forward>reverse). Estimated scalp projection of the component correlation function (Cao et al., 2002) for the corresponding EEG data (Takeichi et al., 2007a, showed leftward dominance. Hence, activities in the superior temporal sulcus (BA21 and BA39), which are commonly observed for speech processing, as well as left precentral gyrus (BA6) and left inferior frontal gyrus in the opercular area (BA44) is suggested to contribute to the comprehension-related EEG signal.
Collapse
Affiliation(s)
- Hiroshige Takeichi
- Laboratory for Mathematical Neuroscience, Brain Science Institute, RIKEN, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
| | | | | | | | | | | |
Collapse
|
20
|
Löfqvist A. Tongue movement kinematics in long and short Japanese consonants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 122:512-18. [PMID: 17614508 PMCID: PMC2827771 DOI: 10.1121/1.2735102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
This paper examines tongue movements in stop and fricative consonants where the duration of the oral closure/constriction for the consonant is varied for linguistic purposes. Native speakers of Japanese served as subjects. The linguistic material consisted of Japanese word pairs that only differed in the duration of the lingual consonant, which was either long or short. Recordings were made of tongue movements using a magnetometer system. Results show a robust difference in closure duration between the long and short consonants. Overall, the path of the tongue movement during the consonant was longer for the long than for the short consonant. All speakers decreased the speed of the tongue movement during the long consonant. These adjustments in tongue movements were most likely made to maintain the contact between the tongue and the palate for the closure and constriction.
Collapse
|
21
|
Kureta Y, Fushimi T, Tatsumi IF. The functional unit in phonological encoding: evidence for moraic representation in native Japanese speakers. J Exp Psychol Learn Mem Cogn 2006; 32:1102-19. [PMID: 16938049 DOI: 10.1037/0278-7393.32.5.1102] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Speech production studies have shown that the phonological form of a word is made up of phonemic segments in stress-timed languages (e.g., Dutch) and of syllables in syllable-timed languages (e.g., Chinese). To clarify the functional unit of mora-timed languages, the authors asked native Japanese speakers to perform an implicit priming task (A. S. Meyer, 1990, 1991). In Experiment 1, participants could speed up their production latencies when initial consonant and vowel (CV) of a target word were known in advance but failed to do so when the vowel was unknown. In Experiment 2, prior knowledge of the consonant and glide (Cj) produced no significant priming effect. However, in Experiment 3, significant effects were found for the consonant-vowel coupled with a nasal coda (CVN) and the consonant with a diphthong (CVV), compared with the consonant-vowel alone (CV). These results suggest that the implicit priming effects for Japanese are closely related to the CV-C and CV-V structure, called the mora. The authors discuss cross-linguistic differences in the phonological representation involved in phonological encoding, within current theories of word production.
Collapse
Affiliation(s)
- Yoichi Kureta
- Language, Cognition, and Brain Research Group, Tokyo Metropolitan Institute of Gerontology, Tokyo, Japan.
| | | | | |
Collapse
|
22
|
Hirata Y, Whiton J. Effects of speaking rate on the single/geminate stop distinction in Japanese. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:1647-60. [PMID: 16240824 DOI: 10.1121/1.2000807] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
This study addressed an issue in the theory of acoustic invariance. The question was whether an invariant acoustic property exists for distinguishing Japanese single and geminate voiceless stops across rates and speakers. Four native Japanese speakers produced disyllabic words with single and geminate stops (e.g., /kako/ and /kak:o/) spoken in a carrier sentence at three speaking rates. Durations of sentences, words, stop closures, vowels preceding the contrasting stops, and voice onset times were measured. Ratios of geminate to single stop closures, geminate words to singleton words, closures to preceding vowels, and closures to words were calculated. The effect of rate on closure duration was to yield overlap between the singleton and geminate categories, and to lengthen geminate closures more than single closures as rate decreased. However, the ratio of geminate to single closure duration was unaffected by rate. Furthermore, the ratio of closure to word duration (0.35 as an optimal boundary) best classified all singleton and geminate tokens with 95.7%-98% accuracy. Thus, in spite of overlap in absolute closure duration between single and geminate voiceless stops, there is a relationally invariant measure that divides the two phonemic categories across rates and speakers, supporting the theory of relational acoustic invariance. co ustical Society of America.
Collapse
Affiliation(s)
- Yukari Hirata
- Department of East Asian Languages and Literatures, Colgate University, Hamilton, New York 13346, USA.
| | | |
Collapse
|
23
|
Tamaoka K, Makioka S. Frequency of occurrence for units of phonemes, morae, and syllables appearing in a lexical corpus of a Japanese newspaper. ACTA ACUST UNITED AC 2004; 36:531-47. [PMID: 15641442 DOI: 10.3758/bf03195600] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
On the basis of the lexical corpus created by Amano and Kondo (2000), using the Asahi newspaper, the present study provides frequencies of occurrence for units of Japanese phonemes, morae, and syllables. Among the five vowels, /a/ (23.42%), /i/ (21.54%), /u/ (23.47%), and /o/ (20.63%) showed similar frequency rates, whereas /e/ (10.94%) was less frequent. Among the 12 consonants, /k/ (17.24%), /t/ (15.53%), and /r/ (13.11%) were used often, whereas /p/ (0.60%) and /b/ (2.43%) appeared far less frequently. Among the contracted sounds, /sj/ (36.44%) showed the highest frequency, whereas /mj/ (0.27%) rarely appeared. Among the five long vowels, /aR/ (34.4%) was used most frequently, whereas /uR/ (12.11%) was not used so often. The special sound /N/ appeared very frequently in Japanese. The syllable combination /k/+V+/N/ (19.91%) appeared most frequently among syllabic combinations with the nasal /N/. The geminate (or voiceless obstruent) /Q/, when placed before the four consonants /p/, /t/, /k/, and /s/, appeared 98.87% of the time, but the remaining 1.13% did not follow the definition. The special sounds /R/, /N/, and /Q/ seem to appear very frequently in Japanese, suggesting that they are not special in terms of frequency counts. The present study further calculated frequencies for the 33 newly and officially listed morae/syllables, which are used particularly for describing alphabetic loanwords. In addition, the top 20 bi-mora frequency combinations are reported. Files of frequency indexes may be downloaded from the Psychonomic Society Web archive at http://www.psychonomic.org/archive/.
Collapse
Affiliation(s)
- Katsuo Tamaoka
- International Student Center, Hiroshima University, Hiroshima, Japan.
| | | |
Collapse
|
24
|
Menning H, Imaizumi S, Zwitserlood P, Pantev C. Plasticity of the human auditory cortex induced by discrimination learning of non-native, mora-timed contrasts of the Japanese language. Learn Mem 2002; 9:253-67. [PMID: 12359835 PMCID: PMC187135 DOI: 10.1101/lm.49402] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
In this magnetoencephalographic (MEG) study, we examined with high temporal resolution the traces of learning in the speech-dominant left-hemispheric auditory cortex as a function of newly trained mora-timing. In Japanese, the "mora" is a temporal unit that divides words into almost isochronous segments (e.g., na-ka-mu-ra and to-o-kyo-o each comprises four mora). Changes in the brain responses of a group of German and Japanese subjects to differences in the mora structure of Japanese words were compared. German subjects performed a discrimination training in 10 sessions of 1.5 h each day. They learned to discriminate Japanese pairs of words (in a consonant, anni-ani; and a vowel, kiyo-kyo, condition), where the second word was shortened by one mora in eight steps of 15 msec each. A significant increase in learning performance, as reflected by behavioral measures, was observed, accompanied by a significant increase of the amplitude of the Mismatch Negativity Field (MMF). The German subjects' hit rate for detecting durational deviants increased by up to 35%. Reaction times and MMF latencies decreased significantly across training sessions. Japanese subjects showed a more sensitive MMF to smaller differences. Thus, even in young adults, perceptual learning of non-native mora-timing occurs rapidly and deeply. The enhanced behavioral and neurophysiological sensitivity found after training indicates a strong relationship between learning and (plastic) changes in the cortical substrate.
Collapse
Affiliation(s)
- Hans Menning
- Center for Biomagnetism, Institute of Experimental Audiology, Münster, Germany.
| | | | | | | |
Collapse
|