1
|
Shafaei-Bajestan E, Moradipour-Tari M, Uhrig P, Baayen RH. The pluralization palette: unveiling semantic clusters in English nominal pluralization through distributional semantics. MORPHOLOGY (DORDRECHT, NETHERLANDS) 2024; 34:369-413. [PMID: 39540059 PMCID: PMC11554741 DOI: 10.1007/s11525-024-09428-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 06/11/2024] [Indexed: 11/16/2024]
Abstract
Using distributional semantics, we show that English nominal pluralization exhibits semantic clusters. For instance, the change in semantic space from singulars to plurals differs depending on whether a word denotes, e.g., a fruit, or an animal. Languages with extensive noun classes such as Swahili and Kiowa distinguish between these kind of words in their morphology. In English, even though not marked morphologically, plural semantics actually also varies by semantic class. A semantically informed method, CosClassAvg, is introduced that is compared to two other methods, one implementing a fixed shift from singular to plural, and one creating plural vectors from singular vectors using a linear mapping (FRACSS). Compared to FRACSS, CosClassAvg predicted plural vectors that were more similar to the corpus-extracted plural vectors in terms of vector length, but somewhat less similar in terms of orientation. Both FRACSS and CosClassAvg outperform the method using a fixed shift vector to create plural vectors, which does not do justice to the intricacies of English plural semantics. A computational modeling study revealed that the observed difference between the plural semantics generated by these three methods carries over to how well a computational model of the listener can understand previously unencountered plural forms. Among all methods, CosClassAvg provides a good balance for the trade-off between productivity (being able to understand novel plural forms) and faithfulness to corpus-extracted plural vectors (i.e., understanding the particulars of the meaning of a given plural form). Supplementary Information The online version contains supplementary material available at 10.1007/s11525-024-09428-9.
Collapse
Affiliation(s)
- Elnaz Shafaei-Bajestan
- Department of General and Computational Linguistics, University of Tübingen, Wilhelmstraße 19, Tübingen, 72074 Baden-Württemberg Germany
| | - Masoumeh Moradipour-Tari
- Department of General and Computational Linguistics, University of Tübingen, Wilhelmstraße 19, Tübingen, 72074 Baden-Württemberg Germany
| | - Peter Uhrig
- Department of English and American Studies, Friedrich-Alexander-Universität Erlangen-Nürnberg, Bismarckstraße 1, Erlangen, 91054 Bayern Germany
| | - R. Harald Baayen
- Department of General and Computational Linguistics, University of Tübingen, Wilhelmstraße 19, Tübingen, 72074 Baden-Württemberg Germany
| |
Collapse
|
2
|
Wei Y, Niu Y, Taft M, Carreiras M. Morphological decomposition in Chinese compound word recognition: Electrophysiological evidence. BRAIN AND LANGUAGE 2023; 241:105267. [PMID: 37121022 DOI: 10.1016/j.bandl.2023.105267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Revised: 04/06/2023] [Accepted: 04/15/2023] [Indexed: 05/22/2023]
Abstract
The present study examined the effect of both morphological complexity and semantic transparency in Chinese compound word recognition. Using a visual lexical decision task, our electrophysiological results showed that transparent and opaque compounds induced stronger Left Anterior Negativity (LAN) than monomorphemic words. This result suggests that Chinese compounds might be decomposed into their constituent morphemes at the lemma level, whereas monomorphemic words are accessed as a whole-word lemma directly from the form level. In addition, transparent and opaque compounds produced a similar N400 as each other, suggesting that transparency did not show an effect on the involvement of constituent morphemes during access to the whole-word lemma. Two behavioral experiments additionally showed similar patterns to the EEG results. These findings support morphological decomposition for compounds at the lemma level as proposed by the full-parsing model, and no evidence is found to support the role of transparency during Chinese compound word recognition.
Collapse
Affiliation(s)
- Yanjun Wei
- Key Laboratory of the Cognitive Science of Language (Beijing Language and Culture University), Ministry of Education, China; Center for the Cognitive Science of Language, Beijing Language and Culture University, 100083 Beijing, China; Basque Center on Cognition, Brain and Language, 20009 Donostia-San Sebastián, Spain.
| | - Ying Niu
- Center for the Cognitive Science of Language, Beijing Language and Culture University, 100083 Beijing, China
| | - Marcus Taft
- Center for the Cognitive Science of Language, Beijing Language and Culture University, 100083 Beijing, China; School of Psychology, UNSW Sydney, NSW 2052, Australia
| | - Manuel Carreiras
- Basque Center on Cognition, Brain and Language, 20009 Donostia-San Sebastián, Spain; Departamento de Lengua Vasca y Comunicación, University of the Basque Country (UPV/EHU), 48940 Leioa, Spain; Ikerbasque, Basque Foundation for Science, 48013 Bilbao, Spain
| |
Collapse
|
3
|
Opitz A, Bordag D, Furgoni A. Mental Representation of Word Family Structure: The Case of German Infinitives, Conversion Nouns and Other Morphologically Related Forms. Front Psychol 2022; 13:910849. [PMID: 35967669 PMCID: PMC9363782 DOI: 10.3389/fpsyg.2022.910849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 06/21/2022] [Indexed: 11/13/2022] Open
Abstract
This study investigates how two non-finite forms, infinitives and conversion nouns, are represented in the mind of L1 and L2 speakers and what is their relationship to other members of the corresponding word family. German native speakers and proficient German learners with Czech as L1 participated in four overt priming experiments involving a grammatical judgement task. We investigated the relationship between infinitives (Experiment 1) and conversion nouns (Experiment 2) and formally identical verbal or noun forms. We further focussed on the relationship between conversion nouns and regular nominal derivation forms with two derivational suffixes: -er and -ung (Experiments 3 and 4). Our results show that the two non-finite forms differ in their relations to other members of a word family and do not constitute a special class of non-finites as suggested in previous literature. While German infinitives seem to be closer related to finite verbal forms, conversion nouns behave in the same way as other regular nominal derivatives within the same word family. As for the German L1 and L2 contrast, no significant difference in the mental representation of the examined forms was found. This finding suggests that with respect to the explored phenomena, proficient learners rely on the same linguistic organisation as L1 speakers.
Collapse
Affiliation(s)
- Andreas Opitz
- Herder-Institut, Leipzig University, Leipzig, Germany
| | - Denisa Bordag
- Herder-Institut, Leipzig University, Leipzig, Germany
- University of Haifa, Haifa, Israel
| | - Alberto Furgoni
- Herder-Institut, Leipzig University, Leipzig, Germany
- BCBL-Basque Center on Cognition, Brain and Language, Donostia-San Sebastian, Spain
| |
Collapse
|
4
|
From decomposition to distributed theories of morphological processing in reading. Psychon Bull Rev 2022; 29:1673-1702. [PMID: 35595965 DOI: 10.3758/s13423-022-02086-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/06/2022] [Indexed: 11/08/2022]
Abstract
The morphological structure of complex words impacts how they are processed during visual word recognition. This impact varies over the course of reading acquisition and for different languages and writing systems. Many theories of morphological processing rely on a decomposition mechanism, in which words are decomposed into explicit representations of their constituent morphemes. In distributed accounts, in contrast, morphological sensitivity arises from the tuning of finer-grained representations to useful statistical regularities in the form-to-meaning mapping, without the need for explicit morpheme representations. In this theoretically guided review, we summarize research into the mechanisms of morphological processing, and discuss findings within the context of decomposition and distributed accounts. Although many findings fit within a decomposition model of morphological processing, we suggest that the full range of results is more naturally explained by a distributed approach, and discuss additional benefits of adopting this perspective.
Collapse
|
5
|
Ciaccio LA, Burchert F, Semenza C. Derivational Morphology in Agrammatic Aphasia: A Comparison Between Prefixed and Suffixed Words. Front Psychol 2020; 11:1070. [PMID: 32547456 PMCID: PMC7274032 DOI: 10.3389/fpsyg.2020.01070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 04/27/2020] [Indexed: 11/26/2022] Open
Abstract
Although a relatively large number of studies on acquired language impairments have tested the case of derivational morphology, none of these have specifically investigated whether there are differences in how prefixed and suffixed derived words are impaired. Based on linguistic and psycholinguistic considerations on prefixed and suffixed derived words, differences in how these two types of derivations are processed, and consequently impaired, are predicted. In the present study, we investigated the errors produced in reading aloud simple, prefixed, and suffixed words by three German individuals with agrammatic aphasia (NN, LG, SA). We found that, while NN and LG produced similar numbers of errors with prefixed and suffixed words, SA showed a selective impairment for prefixed words. Furthermore, NN and SA produced more errors specifically involving the affix with prefixed words than with suffixed words. We discuss our findings in terms of relative position of stem and affix in prefixed and suffixed words, as well as in terms of specific properties of prefixes and suffixes.
Collapse
Affiliation(s)
- Laura Anna Ciaccio
- Potsdam Research Institute for Multilingualism, University of Potsdam, Potsdam, Germany
| | - Frank Burchert
- Linguistics Department, University of Potsdam, Potsdam, Germany
| | - Carlo Semenza
- Department of Neuroscience (Padua Neuroscience Center), University of Padua, Padua, Italy.,IRCCS San Camillo Hospital Foundation, Neuropsychology Unit, Lido-Venice, Italy
| |
Collapse
|
6
|
Semantic transparency effects in German compounds: A large dataset and multiple-task investigation. Behav Res Methods 2020; 52:1208-1224. [PMID: 32052353 DOI: 10.3758/s13428-019-01311-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In the present study, we provide a comprehensive analysis and a multi-dimensional dataset of semantic transparency measures for 1810 German compound words. Compound words are considered semantically transparent when the contribution of the constituents' meaning to the compound meaning is clear (as in airport), but the degree of semantic transparency varies between compounds (compare strawberry or sandman). Our dataset includes both compositional and relatedness-based semantic transparency measures, also differentiated by constituents. The measures are obtained from a computational and fully implemented semantic model based on distributional semantics. We validate the measures using data from four behavioral experiments: Explicit transparency ratings, two different lexical decision tasks using different nonwords, and an eye-tracking study. We demonstrate that different semantic effects emerge in different behavioral tasks, which can only be captured using a multi-dimensional approach to semantic transparency. We further provide the semantic transparency measures derived from the model for a dataset of 40,475 additional German compounds, as well as for 2061 novel German compounds.
Collapse
|
7
|
Günther F, Rinaldi L, Marelli M. Vector-Space Models of Semantic Representation From a Cognitive Perspective: A Discussion of Common Misconceptions. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2019; 14:1006-1033. [DOI: 10.1177/1745691619861372] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Models that represent meaning as high-dimensional numerical vectors—such as latent semantic analysis (LSA), hyperspace analogue to language (HAL), bound encoding of the aggregate language environment (BEAGLE), topic models, global vectors (GloVe), and word2vec—have been introduced as extremely powerful machine-learning proxies for human semantic representations and have seen an explosive rise in popularity over the past 2 decades. However, despite their considerable advancements and spread in the cognitive sciences, one can observe problems associated with the adequate presentation and understanding of some of their features. Indeed, when these models are examined from a cognitive perspective, a number of unfounded arguments tend to appear in the psychological literature. In this article, we review the most common of these arguments and discuss (a) what exactly these models represent at the implementational level and their plausibility as a cognitive theory, (b) how they deal with various aspects of meaning such as polysemy or compositionality, and (c) how they relate to the debate on embodied and grounded cognition. We identify common misconceptions that arise as a result of incomplete descriptions, outdated arguments, and unclear distinctions between theory and implementation of the models. We clarify and amend these points to provide a theoretical basis for future research and discussions on vector models of semantic representation.
Collapse
Affiliation(s)
- Fritz Günther
- Department of Psychology, University of Milano–Bicocca
| | - Luca Rinaldi
- Department of Psychology, University of Milano–Bicocca
- NeuroMI, Milan Center for Neuroscience, Milan, Italy
| | - Marco Marelli
- Department of Psychology, University of Milano–Bicocca
- NeuroMI, Milan Center for Neuroscience, Milan, Italy
| |
Collapse
|