1
|
Niu Z, Li Z, Zhang Y, Wang Y, Wang R. Perceptions of social class through Chinese speech cues. Acta Psychol (Amst) 2024; 247:104324. [PMID: 38761753 DOI: 10.1016/j.actpsy.2024.104324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 04/19/2024] [Accepted: 05/15/2024] [Indexed: 05/20/2024] Open
Abstract
Speech is a complex auditory signal that contains multiple layers of linguistic and non-linguistic structure, it contains both linguistic and social class information. Perceivers are exquisitely sensitive to this layered structure and extract not only linguistic properties, but also indexical characteristics that provide information about individual talkers and groups of talkers. Social class information involves inferring the speaker's social class or forming an impression of their social status based on their speech. Previous research on social class perception in speech has primarily focused on English, with relatively little research on Chinese. This study examines social class perception in Chinese speech. Study 1 employed class judgment and evaluation tasks with a subjective social class scale as the main measure to examine whether listeners could infer class information from Chinese speech and how their own class background influenced their perception. The results of Study 1 showed that subjects could accurately discriminate between speakers' social classes, but there may be a response bias that overestimates lower-class speakers as upper-class speakers. Study 2 focused on whether the speech of different classes of speakers actually differed on a number of indicators. It was found that the speech of higher class speakers was perceived to be more standardised, more pleasant to listen to and less accent-intensive. Overall, listeners can perceive class information from Chinese speech; different classes of Chinese speech do contain different levels of indexical information. In Chinese language societies, individuals can also judge their class information through the speech, which is consistent with the relevant research results in English.
Collapse
Affiliation(s)
- Zhengkai Niu
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, & Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China
| | - Zilong Li
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, & Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China
| | - Yan Zhang
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, & Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China
| | - Yunhan Wang
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, & Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China
| | - Ruiming Wang
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, & Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China.
| |
Collapse
|
2
|
Calhoun S, Yan M, Salanoa H, Taupi F, Kruse Va'ai E. Focus Effects on Immediate and Delayed Recognition of Referents in Samoan. LANGUAGE AND SPEECH 2023; 66:175-201. [PMID: 35638438 DOI: 10.1177/00238309221101396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
This paper looks at the effect of focus-marking on the immediate and delayed recognition of referents in Samoan. Focus-marking on a word can imply the presence of alternatives to that word which are relevant to the interpretation of the utterance. Consistent with this, psycholinguistic evidence is growing that alternatives to focus-marked words are selectively activated in the immediate processing of an utterance and longer term memory. However, most of this research is on Western Germanic languages which primarily use prosodic prominence to mark focus. We explore this in two experiments using immediate and delayed probe recognition tasks in the under-studied language Samoan, which primarily uses syntactic focus-marking. Participants heard short narratives ending in a critical sentence in which the object word was either focused or not, using a cleft-like construction. In the first experiment, probe recognition, alternatives to the object word which were either mentioned or unmentioned in the narrative were responded to more slowly if the object was focus-marked. In the second experiment, delayed recognition, participants were faster to correctly recognize mentioned alternatives, and slower to reject unmentioned, if the object was focus-marked. Both results are consistent with immediate and longer-term activation of focus alternatives. There was no significant effect of focus-marking on recognition of the object word itself.
Collapse
Affiliation(s)
| | - Mengzhu Yan
- Huazhong University of Science and Technology, China
| | | | | | - Emma Kruse Va'ai
- Victoria University of Wellington, New Zealand; National University of Samoa, Samoa
| |
Collapse
|
3
|
Ji Y, Hu Y, Jiang X. Segmental and suprasegmental encoding of speaker confidence in Wuxi dialect vowels. Front Psychol 2022; 13:1028106. [PMID: 36578688 PMCID: PMC9791101 DOI: 10.3389/fpsyg.2022.1028106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 11/15/2022] [Indexed: 12/14/2022] Open
Abstract
Introduction Wuxi dialect is a variation of Wu dialect spoken in eastern China and is characterized by a rich tonal system. Compared with standard Mandarin speakers, those of Wuxi dialect as their mother tongue can be more efficient in varying vocal cues to encode communicative meanings in speech communication. While literature has demonstrated that speakers encode high vs. low confidence in global prosodic cues at the sentence level, it is unknown how speakers' intended confidence is encoded at a more local, phonetic level. This study aimed to explore the effects of speakers' intended confidence on both prosodic and formant features of vowels in two lexical tones (the flat tone and the contour tone) of Wuxi dialect. Methods Words of a single vowel were spoken in confident, unconfident, or neutral tone of voice by native Wuxi dialect speakers using a standard elicitation procedure. Linear-mixed effects modeling and parametric bootstrapping testing were performed. Results The results showed that (1) the speakers raised both F1 and F2 in the confident level (compared with the neutral-intending expression). Additionally, F1 can distinguish between the confident and unconfident expressions; (2) Compared with the neutral-intending expression, the speakers raised mean f0, had a greater variation of f0 and prolonged pronunciation time in the unconfident level while they raised mean intensity, had a greater variation of intensity and prolonged pronunciation time in the confident level. (3) The speakers modulated mean f0 and mean intensity to a larger extent on the flat tone than the contour tone to differentiate between levels of confidence in the voice, while they modulated f0 and intensity range more only on the contour tone. Discussion These findings shed new light on the mechanisms of segmental and suprasegmental encoding of speaker confidence and lack of confidence at the vowel level, highlighting the interplay of lexical tone and vocal expression in speech communication.
Collapse
|
4
|
Yan M, Calhoun S, Warren P. The Role of Prominence in Activating Focused Words and Their Alternatives in Mandarin: Evidence from Lexical Priming and Recognition Memory. LANGUAGE AND SPEECH 2022:238309221126108. [PMID: 36239607 DOI: 10.1177/00238309221126108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
When a sentence is produced with contrastive prosodic prominence, the word that carries the prominence becomes more salient, and alternatives to that word are usually implied. In processing, this implies that focused words and their alternatives should be more strongly activated. Previous research on focus processing has primarily been confined to Germanic languages. The current paper reports on two experiments investigating the role of prosodic prominence in immediate (Experiment 1) and long-term processing (Experiment 2) of focused words and focus alternatives in Mandarin. Prosodic prominence was effective in activating focused words and their alternatives. In the memory task, this facilitation effect was only found toward the beginning of the experiment. We attribute this difference to task-related adaptive use of prosodic prominence in utterance processing. This research sheds light on whether, when, and how listeners use prosodic prominence to identify important information and to evoke alternatives during sentence comprehension.
Collapse
Affiliation(s)
- Mengzhu Yan
- School of Foreign Languages, Huazhong University of Science and Technology, China
| | | | - Paul Warren
- School of Linguistics and Applied Language Studies, Te Herenga Waka-Victoria University of Wellington, New Zealand
| |
Collapse
|
5
|
Ip MHK, Cutler A. In Search of Salience: Focus Detection in the Speech of Different Talkers. LANGUAGE AND SPEECH 2022; 65:650-680. [PMID: 34841933 DOI: 10.1177/00238309211046029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Many different prosodic cues can help listeners predict upcoming speech. However, no research to date has assessed listeners' processing of preceding prosody from different speakers. The present experiments examine (1) whether individual speakers (of the same language variety) are likely to vary in their production of preceding prosody; (2) to the extent that there is talker variability, whether listeners are flexible enough to use any prosodic cues signaled by the individual speaker; and (3) whether types of prosodic cues (e.g., F0 versus duration) vary in informativeness. Using a phoneme-detection task, we examined whether listeners can entrain to different combinations of preceding prosodic cues to predict where focus will fall in an utterance. We used unsynthesized sentences recorded by four female native speakers of Australian English who happened to have used different preceding cues to produce sentences with prosodic focus: a combination of pre-focus overall duration cues, F0 and intensity (mean, maximum, range), and longer pre-target interval before the focused word onset (Speaker 1), only mean F0 cues, mean and maximum intensity, and longer pre-target interval (Speaker 2), only pre-target interval duration (Speaker 3), and only pre-focus overall duration and maximum intensity (Speaker 4). Listeners could entrain to almost every speaker's cues (the exception being Speaker 4's use of only pre-focus overall duration and maximum intensity), and could use whatever cues were available even when one of the cue sources was rendered uninformative. Our findings demonstrate both speaker variability and listener flexibility in the processing of prosodic focus.
Collapse
Affiliation(s)
- Martin Ho Kwan Ip
- The MARCS Institute, Western Sydney University, Australia
- ARC Centre of Excellence for the Dynamics of Language, Australia
| | - Anne Cutler
- The MARCS Institute, Western Sydney University, Australia
- ARC Centre of Excellence for the Dynamics of Language, Australia
| |
Collapse
|
6
|
Zahner-Ritter K, Zhao T, Einfeldt M, Braun B. How experience with tone in the native language affects the L2 acquisition of pitch accents. Front Psychol 2022; 13:903879. [PMID: 36059723 PMCID: PMC9437707 DOI: 10.3389/fpsyg.2022.903879] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 07/19/2022] [Indexed: 11/25/2022] Open
Abstract
This paper tested the ability of Mandarin learners of German, whose native language has lexical tone, to imitate pitch accent contrasts in German, an intonation language. In intonation languages, pitch accents do not convey lexical information; also, pitch accents are sparser than lexical tones as they only associate with prominent words in the utterance. We compared two kinds of German pitch-accent contrasts: (1) a “non-merger” contrast, which Mandarin listeners perceive as different and (2) a “merger” contrast, which sounds more similar to Mandarin listeners. Speakers of a tone language are generally very sensitive to pitch. Hypothesis 1 (H1) therefore stated that Mandarin learners produce the two kinds of contrasts similarly to native German speakers. However, the documented sensitivity to tonal contrasts, at the expense of processing phrase-level intonational contrasts, may generally hinder target-like production of intonational pitch accents in the L2 (Hypothesis 2, H2). Finally, cross-linguistic influence (CLI) predicts a difference in the realization of these two contrasts as well as improvement with higher proficiency (Hypothesis 3, H3). We used a delayed imitation paradigm, which is well-suited for assessing L2-phonetics and -phonology because it does not necessitate access to intonational meaning. We investigated the imitation of three kinds of accents, which were associated with the sentence-final noun in short wh-questions (e.g., Wer malt denn Mandalas, lit: “Who draws PRT mandalas?” “Who likes drawing mandalas?”). In Experiment 1, 28 native speakers of Mandarin participated (14 low- and 14 high-proficient). The learners’ productions of the two kinds of contrasts were analyzed using General Additive Mixed Models to evaluate differences in pitch accent contrasts over time, in comparison to the productions of native German participants from an earlier study in our lab. Results showed a more pronounced realization of the non-merger contrast compared to German natives and a less distinct realization of the merger contrast, with beneficial effects of proficiency, lending support to H3. Experiment 2 tested low-proficient Italian learners of German (whose L1 is an intonation language) to contextualize the Mandarin data and further investigate CLI. Italian learners realized the non-merger contrast more target-like than Mandarin learners, lending additional support to CLI (H3).
Collapse
Affiliation(s)
- Katharina Zahner-Ritter
- Department of Phonetics, University of Trier, Trier, Germany
- *Correspondence: Katharina Zahner-Ritter,
| | - Tianyi Zhao
- Department of Linguistics, University of Konstanz, Konstanz, Germany
| | - Marieke Einfeldt
- Department of Linguistics, University of Konstanz, Konstanz, Germany
| | - Bettina Braun
- Department of Linguistics, University of Konstanz, Konstanz, Germany
| |
Collapse
|
7
|
Xu Y, Shen R. Aesthetic evaluation of Chinese calligraphy: a cross-cultural comparative study. CURRENT PSYCHOLOGY 2022. [DOI: 10.1007/s12144-022-03390-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
8
|
Guan CQ, Meng W, Morett LM, Fraundorf SH. Mapping Pitch Accents to Memory Representations in Spoken Discourse Among Chinese Learners of English: Effects of L2 Proficiency and Working Memory. Front Psychol 2022; 13:870152. [PMID: 35664143 PMCID: PMC9161639 DOI: 10.3389/fpsyg.2022.870152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Accepted: 03/09/2022] [Indexed: 11/14/2022] Open
Abstract
We examined L2 learners' interpretation of pitch accent cues in discourse memory and how these effects vary with proficiency and working memory (WM). One hundred sixty-eight L1-Chinese participants learning L2-English listened to recorded discourses containing pairs of contrastive alternatives and then took a later recognition memory test. Their language proficiency and WM were measured through standard tests and the participants were categorized into low, medium, advanced, and high advanced language proficiency groups. We analyzed recognition memory task performance using signal detection theory to tease apart response bias (an overall tendency to affirm memory probes) from sensitivity (the ability to discern whether a specific probe statement is true). The results showed a benefit of contrastive L + H* pitch accents in rejecting probes referring to items unmentioned in a discourse, but not contrastive alternatives themselves. More proficient participants also showed more accurate memory for the discourses overall, as well as a reduced overall bias to affirm the presented statements as true. Meanwhile, that the benefit of L + H* accents in rejecting either contrast probes or unmentioned probes was modulated for people with greater working memory. Participants with higher WM were quite sure that it did not exist in the memory trace as this part of discourse wasn't mentioned. The results support a contrast-uncertainty hypothesis, in which comprehenders recall the contrast set but fail to distinguish which is the correct item. Further, these effects were influenced by proficiency and by working memory, suggesting they reflect incomplete mapping between pitch accent and discourse representation.
Collapse
Affiliation(s)
- Connie Qun Guan
- School of Foreign Studies, Beijing Language and Culture University, Beijing, China
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Wanjin Meng
- China National Institute of Education Sciences, Beijing, China
| | - Laura M. Morett
- Department of Educational Studies in Psychology, Research Methodology, and Counseling, University of Alabama, Tuscaloosa, AL, United States
| | - Scott H. Fraundorf
- Department of Psychology and Learning Research and Development Center, University of Pittsburgh, Pittsburgh, PA, United States
| |
Collapse
|
9
|
Tamaoka K, Ji Y. Facilitation of processing darenimo 'any/everyone' negative Japanese sentences using prosodic entrainment. PHONETICA 2022; 79:45-75. [PMID: 35320639 DOI: 10.1515/phon-2022-2016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The present study examined how prosody affects Japanese speakers' processing of the polarity item darenimo 'any/everyone'. Upward (LHHH pitch) and downward (HLLL pitch) prosody for darenimo associates with negative and positive polarity, respectively. In Study 1, a corpus search showed that darenimo is more often associated with negative than positive polarity. In Study 2, subjective acceptability judgments indicated that darenimo is also more likely to be perceived as acceptable by native Japanese speakers when used with negative polarity. In line with Study 2, Study 3 showed that upward prosody with negative polarity was more accurately and quickly processed than was downward prosody with either positive or negative polarity. These three studies showed a one-sided distribution of upward prosody with negative polarity, and further indicated that only upward prosody facilitates listeners' processing of negation. Early heightened pitch of darenimo provides a cue to predict an ending negation -nai in the head-final Japanese language, resulting in faster speed and higher accuracy for the processing of negative sentences (i.e., a facilitation effect) compared to their corresponding affirmative sentences.
Collapse
Affiliation(s)
- Katsuo Tamaoka
- School of Foreign Languages, Hunan University, Lushan Road (S), Yuelu District, Changsha, Hunan Province, 410082, China
- Graduate School of Humanities , Nagoya University , Furo-cho, Chikusa-ku , Nagoya-shi, Aichi-ken, Japan
| | - Yuanyuan Ji
- Graduate School of Humanities , Nagoya University , Furo-cho, Chikusa-ku , Nagoya-shi, Aichi-ken, Japan
| |
Collapse
|
10
|
Tang K, Shaw JA. Prosody leaks into the memories of words. Cognition 2021; 210:104601. [PMID: 33508575 DOI: 10.1016/j.cognition.2021.104601] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 01/11/2021] [Accepted: 01/11/2021] [Indexed: 11/30/2022]
Abstract
The average predictability (aka informativity) of a word in context has been shown to condition word duration (Seyfarth, 2014). All else being equal, words that tend to occur in more predictable environments are shorter than words that tend to occur in less predictable environments. One account of the informativity effect on duration is that the acoustic details of probabilistic reduction are stored as part of a word's mental representation. Other research has argued that predictability effects are tied to prosodic structure in integral ways. With the aim of assessing a potential prosodic basis for informativity effects in speech production, this study extends past work in two directions; it investigated informativity effects in another large language, Mandarin Chinese, and broadened the study beyond word duration to additional acoustic dimensions, pitch and intensity, known to index prosodic prominence. The acoustic information of content words was extracted from a large telephone conversation speech corpus with over 400,000 tokens and 6000 word types spoken by 1655 individuals and analyzed for the effect of informativity using frequency statistics estimated from a 431 million word subtitle corpus. Results indicated that words with low informativity have shorter durations, replicating the effect found in English. In addition, informativity had significant effects on maximum pitch and intensity, two phonetic dimensions related to prosodic prominence. Extending this interpretation, these results suggest that predictability is closely linked to prosodic prominence, and that the lexical representation of a word includes phonetic details associated with its average prosodic prominence in discourse. In other words, the lexicon absorbs prosodic influences on speech production.
Collapse
Affiliation(s)
- Kevin Tang
- Department of Linguistics, University of Florida, Gainesville, FL 32611-5454, USA.
| | - Jason A Shaw
- Department of Linguistics, Yale University, New Haven, CT 06520, USA.
| |
Collapse
|