1
|
Chen H, Xu X, Wang T. Assessing lexical ambiguity of simplified Chinese characters: Plurality and relatedness of character meanings. Q J Exp Psychol (Hove) 2024; 77:677-693. [PMID: 37198743 DOI: 10.1177/17470218231178787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Lexical ambiguity is pervasive among Chinese characters as many of them are polysemantic, with one orthographic form carrying unrelated meanings, related meanings, or sometimes both unrelated and related meanings. A large-scale database with ambiguity measures for simplified Chinese characters has yet to be developed, which could greatly benefit psycholinguistic research on the Chinese language or cross-language comparisons. This article reports two sets of ratings by native speakers, the perceived number of meanings (pNoM) for 4,363 characters and the perceived relatedness of meanings (pRoM) for a subset of 1,053 characters. These rating-based ambiguity measures capture the representational nuance about a character's meanings stored in average native speakers' mental lexicon, which tends to be obscured by dictionary- and corpus-based ambiguity measures. Consequently, they each account for a reliable portion of variance in the efficiency of character processing, above and beyond the effects of character frequency, age of acquisition, and other types of ambiguity measures. Theoretical and empirical implications with regard to the plurality and the relatedness of character meanings, the two focal aspects of debate on lexical ambiguity, are discussed.
Collapse
Affiliation(s)
- Huilin Chen
- School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
| | - Xu Xu
- School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
| | - Tianqi Wang
- School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
- Speech Science Laboratory, The University of Hong Kong, Hong Kong, China
| |
Collapse
|
2
|
Feng C, Damian MF, Qu Q. A dataset of behavioral measures on Chinese word production in picture naming. Sci Data 2024; 11:185. [PMID: 38341458 PMCID: PMC10858894 DOI: 10.1038/s41597-024-03022-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 01/30/2024] [Indexed: 02/12/2024] Open
Abstract
Most studies of language production have been conducted with speakers of alphabetic languages, but relatively little research has examined languages with non-alphabetic scripts, such as Chinese. Moreover, most work on language word production has investigated phonological output processing (i.e., speaking), whereas comparatively little research has focused on orthographic output, such as writing and typing. Work on non-alphabetic languages and/or written production is particularly promising, given that it speaks to universalities vs. specificity in terms of architectures and mechanisms underlying language processing across all world languages and modalities. The current article reports a dataset of word production in Chinese with spoken and written responses, which includes reaction times of 193,851 trials of naming 403 pictures obtained from 667 participants across 23 Chinese word production experiments. All data were collected in the same experimental environment and from participants with relatively homogenous characteristics, using the same protocols and parameters. The dataset enables researchers to explore how Chinese speakers produce spoken and/or written words, and to identify language-specific features underlying word production.
Collapse
Affiliation(s)
- Chen Feng
- Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Markus F Damian
- School of Psychological Science, University of Bristol, Bristol, United Kingdom
| | - Qingqing Qu
- Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China.
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
3
|
Peng C, Xu X, Bao Z. Sentiment annotations for 3827 simplified Chinese characters. Behav Res Methods 2024; 56:651-666. [PMID: 36754941 DOI: 10.3758/s13428-023-02068-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/13/2023] [Indexed: 02/10/2023]
Abstract
Sentiment analysis in Chinese natural language processing has been largely based on words annotated with sentiment categories or scores. Characters, however, are the basic orthographic, phonological, and in most cases, semantic units in the Chinese language. This study collected sentiment annotations for 3827 characters. The ratings demonstrated high levels of reliability, and were validated through a comparison with the ratings of some characters' word equivalents reported in a previous norming study. Relations with other lexico-semantic variables and character processing efficiency were investigated. Furthermore, analyses of the association between constituent character valence and word valence revealed semantic compositionality and sentiment fusion characteristic of larger Chinese linguistic units. These ratings for characters, expanding current Chinese sentiment lexicons, can be utilized for the purposes of more precise stimuli assessment in research on Chinese character processing and more efficient sentiment analysis equipped with annotations of single-character words.
Collapse
Affiliation(s)
- Cheng Peng
- Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Xu Xu
- Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China.
| | - Zhen Bao
- Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| |
Collapse
|
4
|
Thierfelder P, Cai ZG, Huang S, Lin H. The Chinese lexicon of deaf readers: A database of character decisions and a comparison between deaf and hearing readers. Behav Res Methods 2023:10.3758/s13428-023-02305-z. [PMID: 38114882 DOI: 10.3758/s13428-023-02305-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2023] [Indexed: 12/21/2023]
Abstract
We present a psycholinguistic study investigating lexical effects on simplified Chinese character recognition by deaf readers. Prior research suggests that deaf readers exhibit efficient orthographic processing and decreased reliance on speech-based phonology in word recognition compared to hearing readers. In this large-scale character decision study (25 participants, each evaluating 2500 real characters and 2500 pseudo-characters), we analyzed various factors influencing character recognition accuracy and speed in deaf readers. Deaf participants demonstrated greater accuracy and faster recognition when characters were more frequent, were acquired earlier, had more strokes, displayed higher orthographic complexity, were more imageable in reference, or were less concrete in reference. Comparison with a previous study of hearing readers revealed that the facilitative effect of frequency on character decision accuracy was stronger for deaf readers than hearing readers. The effect of orthographic-phonological regularity differed significantly for the two groups, indicating that deaf readers rely more on orthographic structure and less on phonological information during character recognition. Notably, increased stroke counts (i.e., higher orthographic complexity) hindered hearing readers but facilitated recognition processes in deaf readers, suggesting that deaf readers excel at recognizing characters based on orthographic structure. The database generated from this large-scale character decision study offers a valuable resource for further research and practical applications in deaf education and literacy.
Collapse
Affiliation(s)
- Philip Thierfelder
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Sha Tin, N.T., Hong Kong, SAR
| | - Zhenguang G Cai
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Sha Tin, N.T., Hong Kong, SAR.
| | - Shuting Huang
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Sha Tin, N.T., Hong Kong, SAR
| | - Hao Lin
- Shanghai International Studies University, 550 Dalian Road(W), Shanghai, People's Republic of China.
| |
Collapse
|
5
|
Wang T, Xu X, Xie X, Ng ML. Probing Lexical Ambiguity in Chinese Characters via Their Word Formations: Convergence of Perceived and Computed Metrics. Cogn Sci 2023; 47:e13379. [PMID: 37988245 DOI: 10.1111/cogs.13379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/23/2023]
Abstract
Lexical ambiguity is pervasive in language, and the nature of the representations of an ambiguous word's multiple meanings is yet to be fully understood. With a special focus on Chinese characters, the present study first established that native speaker's perception about a character's number of meanings was heavily influenced by the availability of its distinct word formations, while whether these meanings would be perceived to be closely related was driven by further conceptual analysis. These notions were operationalized as two computed metrics, which assessed the degree of dispersion across individual word formations and the degree of propinquity across clusters of word formations, respectively, in a distributional semantic space. The observed correlations between the computed and the perceived metrics indicated that the utility of word formations to tap into meaning representations of Chinese characters was indeed cognitively plausible. The results have demonstrated the extent to which distributional semantics could inform about meaning representations of Chinese characters, which has theoretical implications for the representation of ambiguous words more generally.
Collapse
Affiliation(s)
- Tianqi Wang
- School of Foreign Languages, Shanghai Jiao Tong University
- Speech Science Laboratory, The University of Hong Kong
| | - Xu Xu
- School of Foreign Languages, Shanghai Jiao Tong University
| | - Xurong Xie
- Beijing Key Lab of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences
| | | |
Collapse
|
6
|
Su IF, Yum YN, Lau DKY. Hong Kong Chinese character psycholinguistic norms: ratings of 4376 single Chinese characters on semantic radical transparency, age-of-acquisition, familiarity, imageability, and concreteness. Behav Res Methods 2023; 55:2989-3008. [PMID: 36002627 PMCID: PMC10558066 DOI: 10.3758/s13428-022-01928-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/10/2022] [Indexed: 11/08/2022]
Abstract
Several norms of psycholinguistic features of Chinese characters exist in Mandarin Chinese, but only a few are available in Cantonese or in the traditional script, and none includes semantic radical transparency ratings. This study presents subjective ratings of age-of-acquisition (AoA), familiarity, imageability, concreteness, and semantic radical transparency in 4376 Chinese characters. The single Chinese characters were rated individually on the five dimensions by 20 native Cantonese speakers in Hong Kong to form the Hong Kong Chinese Character Psycholinguistic Norms (HKCCPN). The split-half reliability and intra-class correlations testified to the high internal reliability of the ratings. Their convergent and discriminant patterns in relations to other psycholinguistic measures echoed previous findings reported on Chinese. There were high correlations for semantic radical transparency, imageability and concreteness, and moderate-to-high correlations for AoA and familiarity among subsets of items that had been collected in previous studies. Concurrent validity analyses showed convergence in predicting behavioral response times in various tasks (lexical decision, naming, and writing-to-dictation) when compared with other Chinese character databases. High predictive validity was shown in writing-to-dictation data from an independent sample of 20 native Cantonese speakers. Several objective psycholinguistic measures (character frequency, stroke number, number of words formed, number of homophones and number of meanings) were included in this database to facilitate its use. These new ratings extend the currently available norms in language and reading research in Cantonese Chinese for researchers, clinicians, and educators, as well as provide them with a wider choice of stimuli.
Collapse
Affiliation(s)
| | - Yen Na Yum
- Department of Special Education and Counselling, The Education University of Hong Kong, 10 Lo Ping Road, NT Ting Kok, Hong Kong
| | - Dustin Kai-Yan Lau
- Department of Bilingual and Chinese Studies, Hong Kong Polytechnic University, Hung Hom, Hong Kong
| |
Collapse
|
7
|
Zhang M, Liu Z, Botezatu MR, Dang Q, Yuan Q, Han J, Liu L, Guo T. A large-scale database of Chinese characters and words collected from elementary school textbooks. Behav Res Methods 2023:10.3758/s13428-023-02214-1. [PMID: 37620745 DOI: 10.3758/s13428-023-02214-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/30/2023] [Indexed: 08/26/2023]
Abstract
Lexical databases are essential tools for studies on language processing and acquisition. Most previous Chinese lexical databases have focused on materials for adults, yet little is known about reading materials for children and how lexical properties from these materials affect children's reading comprehension. In the present study, we provided the first large database of 2999 Chinese characters and 2182 words collected from the official textbooks recently issued by the Ministry of Education (MOE) of the People's Republic of China for most elementary schools in Mainland China, as well as norms from both school-aged children and adults. The database incorporates key orthographic, phonological, and semantic factors from these lexical units. A word-naming task was used to investigate the effects of these factors in character and word processing in both adults and children. The results suggest that: (1) as the grade level increases, visual complexity of those characters and words increases whereas semantic richness and frequency decreases; (2) the effects of lexical predictors on processing both characters and words vary across children and adults; (3) the effect of age of acquisition shows different patterns on character and word-naming performance. The database is available on Open Science Framework (OSF) ( https://osf.io/ynk8c/?view_only=5186bd68549340bd923e9b6531d2c820 ) for future studies on Chinese language development.
Collapse
Affiliation(s)
- Man Zhang
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
| | - Zeping Liu
- Department of East Asian Languages and Cultures, Indiana University, Bloomington, IN, USA
| | - Mona Roxana Botezatu
- Department of Speech, Language and Hearing Sciences, University of Missouri, Columbia, MO, USA
| | - Qinpu Dang
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
| | - Qiming Yuan
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
| | - Jinzhuo Han
- Chinese Language and Culture College, Beijing Normal University, Beijing, 100875, China
| | - Li Liu
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
| | - Taomei Guo
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
8
|
Imageability ratings for 10,426 Chinese two-character words and their contribution to lexical processing. CURRENT PSYCHOLOGY 2022. [DOI: 10.1007/s12144-022-03404-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|