1
|
Arnon I, Kirby S. Cultural evolution creates the statistical structure of language. Sci Rep 2024; 14:5255. [PMID: 38438558 PMCID: PMC10912608 DOI: 10.1038/s41598-024-56152-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 03/01/2024] [Indexed: 03/06/2024] Open
Abstract
Human language is unique in its structure: language is made up of parts that can be recombined in a productive way. The parts are not given but have to be discovered by learners exposed to unsegmented wholes. Across languages, the frequency distribution of those parts follows a power law. Both statistical properties-having parts and having them follow a particular distribution-facilitate learning, yet their origin is still poorly understood. Where do the parts come from and why do they follow a particular frequency distribution? Here, we show how these two core properties emerge from the process of cultural evolution with whole-to-part learning. We use an experimental analog of cultural transmission in which participants copy sets of non-linguistic sequences produced by a previous participant: This design allows us to ask if parts will emerge purely under pressure for the system to be learnable, even without meanings to convey. We show that parts emerge from initially unsegmented sequences, that their distribution becomes closer to a power law over generations, and, importantly, that these properties make the sets of sequences more learnable. We argue that these two core statistical properties of language emerge culturally both as a cause and effect of greater learnability.
Collapse
Affiliation(s)
- Inbal Arnon
- Psychology Department, Hebrew University of Jerusalem, Jerusalem, Israel.
| | - Simon Kirby
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
2
|
Lavi-Rotbain O, Arnon I. Zipfian Distributions in Child-Directed Speech. Open Mind (Camb) 2023; 7:1-30. [PMID: 36891353 PMCID: PMC9987348 DOI: 10.1162/opmi_a_00070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Accepted: 11/30/2022] [Indexed: 12/23/2022] Open
Abstract
Across languages, word frequency and rank follow a power law relation, forming a distribution known as the Zipfian distribution. There is growing experimental evidence that this well-studied phenomenon may be beneficial for language learning. However, most investigations of word distributions in natural language have focused on adult-to-adult speech: Zipf's law has not been thoroughly evaluated in child-directed speech (CDS) across languages. If Zipfian distributions facilitate learning, they should also be found in CDS. At the same time, several unique properties of CDS may result in a less skewed distribution. Here, we examine the frequency distribution of words in CDS in three studies. We first show that CDS is Zipfian across 15 languages from seven language families. We then show that CDS is Zipfian from early on (six-months) and across development for five languages with sufficient longitudinal data. Finally, we show that the distribution holds across different parts of speech: Nouns, verbs, adjectives and prepositions follow a Zipfian distribution. Together, the results show that the input children hear is skewed in a particular way from early on, providing necessary (but not sufficient) support for the postulated learning advantage of such skew. They highlight the need to study skewed learning environments experimentally.
Collapse
Affiliation(s)
- Ori Lavi-Rotbain
- The Edmond and Lilly Safra Center for Brain Sciences, Hebrew University, Jerusalem, Israel
| | - Inbal Arnon
- Department of Psychology, Hebrew University, Jerusalem, Israel
| |
Collapse
|
3
|
Tal S, Smith K, Culbertson J, Grossman E, Arnon I. The Impact of Information Structure on the Emergence of Differential Object Marking: An Experimental Study. Cogn Sci 2022; 46:e13119. [PMID: 35297091 PMCID: PMC9286624 DOI: 10.1111/cogs.13119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 02/03/2022] [Accepted: 02/04/2022] [Indexed: 12/04/2022]
Abstract
Many languages exhibit differential object marking (DOM), where only certain types of grammatical objects are marked with morphological cases. Traditionally, it has been claimed that DOM arises as a way to prevent ambiguity by marking objects that might otherwise be mistaken for subjects (e.g., animate objects). While some recent experimental work supports this account, research on language typology suggests at least one alternative hypothesis. In particular, DOM may instead arise as a way of marking objects that are atypical from the point of view of information structure. According to this account, rather than being marked to avoid ambiguity, objects are marked when they are given (already familiar in the discourse) rather than new. Here, we experimentally investigate this hypothesis using two artificial language learning experiments. We find that information structure impacts participants’ object marking, but in an indirect way: atypical information structure leads to a change in word order, which then triggers increased object marking. Interestingly, this staged process of change is compatible with documented cases of DOM emergence. We argue that this process is driven by two cognitive tendencies. First, a tendency to place discourse given information before new information, and second, a tendency to mark noncanonical word order. Taken together, our findings provide corroborating evidence for the role of information structure in the emergence of DOM systems.
Collapse
Affiliation(s)
- Shira Tal
- Department of Cognitive Sciences, The Hebrew University of Jerusalem.,Centre for Language Evolution, School of Philosophy, Psychology and Language Sciences, University of Edinburgh
| | - Kenny Smith
- Centre for Language Evolution, School of Philosophy, Psychology and Language Sciences, University of Edinburgh
| | - Jennifer Culbertson
- Centre for Language Evolution, School of Philosophy, Psychology and Language Sciences, University of Edinburgh
| | - Eitan Grossman
- Department of Linguistics, The Hebrew University of Jerusalem
| | - Inbal Arnon
- Department of Psychology, The Hebrew University of Jerusalem
| |
Collapse
|
4
|
Abu-Zhaya R, Arnon I, Borovsky A. Do Children Use Multi-Word Information in Real-Time Sentence Comprehension? Cogn Sci 2022; 46:e13111. [PMID: 35297085 DOI: 10.1111/cogs.13111] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 01/14/2022] [Accepted: 01/17/2022] [Indexed: 11/29/2022]
Abstract
Meaning in language emerges from multiple words, and children are sensitive to multi-word frequency from infancy. While children successfully use cues from single words to generate linguistic predictions, it is less clear whether and how they use multi-word sequences to guide real-time language processing and whether they form predictions on the basis of multi-word information or pairwise associations. We address these questions in two visual-world eye-tracking experiments with 5- to 8-year-old children. In Experiment 1, we asked whether children generate more robust predictions for the sentence-final object of highly frequent sequences (e.g., "Throw the ball"), compared to less frequent sequences (e.g., "Throw the book"). We further examined if gaze patterns reflect event knowledge or phrasal frequency by comparing the processing of phrases that have the same event structure but differ in multi-word content (e.g., "Brush your teeth" vs. "Brush her teeth"). In the second study, we employed a training paradigm to ask if children are capable of generating predictio.ns from novel multi-word associations while controlling for the overall frequency of the sequences. While the results of Experiment 1 suggested that children primarily relied on event associations to generate real-time predictions, those of Experiment 2 showed that the same children were able to use recurring novel multi-word sequences to generate real-time linguistic predictions. Together, these findings suggest that children can draw on multi-word information to generate linguistic predictions, in a context-dependent fashion, and highlight the need to account for the influence of multi-word sequences in models of language processing.
Collapse
Affiliation(s)
| | - Inbal Arnon
- Department of Psychology, The Hebrew University of Jerusalem
| | - Arielle Borovsky
- Department of Speech, Language, and Hearing Sciences, Purdue University
| |
Collapse
|
5
|
Lavi-Rotbain O, Arnon I. The learnability consequences of Zipfian distributions in language. Cognition 2022; 223:105038. [PMID: 35123219 DOI: 10.1016/j.cognition.2022.105038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 01/15/2022] [Accepted: 01/21/2022] [Indexed: 11/03/2022]
Abstract
While the languages of the world differ in many respects, they share certain commonalties, which can provide insight on our shared cognition. Here, we explore the learnability consequences of one of the striking commonalities between languages. Across languages, word frequencies follow a Zipfian distribution, showing a power law relation between a word's frequency and its rank. While their source in language has been studied extensively, less work has explored the learnability consequences of such distributions for language learners. We propose that the greater predictability of words in this distribution (relative to less skewed distributions) can facilitate word segmentation, a crucial aspect of early language acquisition. To explore this, we quantify word predictability using unigram entropy, assess it across languages using naturalistic corpora of child-directed speech and then ask whether similar unigram predictability facilitates word segmentation in the lab. We find similar unigram entropy in child-directed speech across 15 languages. We then use an auditory word segmentation task to show that the unigram predictability levels found in natural language are uniquely facilitative for word segmentation for both children and adults. These findings illustrate the facilitative impact of skewed input distributions on learning and raise questions about the possible role of cognitive pressures in the prevalence of Zipfian distributions in language.
Collapse
Affiliation(s)
- Ori Lavi-Rotbain
- The Edmond and Lilly Safra Center for Brain Sciences, Hebrew University, Israel.
| | - Inbal Arnon
- Department of Psychology, Hebrew University, Israel
| |
Collapse
|
6
|
Ambridge B, Doherty L, Maitreyee R, Tatsumi T, Zicherman S, Mateo Pedro P, Kawakami A, Bidgood A, Pye C, Narasimhan B, Arnon I, Bekman D, Efrati A, Fabiola Can Pixabaj S, Marroquín Pelíz M, Julajuj Mendoza M, Samanta S, Campbell S, McCauley S, Berman R, Misra Sharma D, Bhaya Nair R, Fukumura K. Testing a computational model of causative overgeneralizations: Child judgment and production data from English, Hebrew, Hindi, Japanese and K'iche'. Open Res Eur 2022; 1:1. [PMID: 37645154 PMCID: PMC10446094 DOI: 10.12688/openreseurope.13008.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 12/21/2021] [Indexed: 08/31/2023]
Abstract
How do language learners avoid the production of verb argument structure overgeneralization errors ( *The clown laughed the man c.f. The clown made the man laugh), while retaining the ability to apply such generalizations productively when appropriate? This question has long been seen as one that is both particularly central to acquisition research and particularly challenging. Focussing on causative overgeneralization errors of this type, a previous study reported a computational model that learns, on the basis of corpus data and human-derived verb-semantic-feature ratings, to predict adults' by-verb preferences for less- versus more-transparent causative forms (e.g., * The clown laughed the man vs The clown made the man laugh) across English, Hebrew, Hindi, Japanese and K'iche Mayan. Here, we tested the ability of this model (and an expanded version with multiple hidden layers) to explain binary grammaticality judgment data from children aged 4;0-5;0, and elicited-production data from children aged 4;0-5;0 and 5;6-6;6 ( N=48 per language). In general, the model successfully simulated both children's judgment and production data, with correlations of r=0.5-0.6 and r=0.75-0.85, respectively, and also generalized to unseen verbs. Importantly, learners of all five languages showed some evidence of making the types of overgeneralization errors - in both judgments and production - previously observed in naturalistic studies of English (e.g., *I'm dancing it). Together with previous findings, the present study demonstrates that a simple learning model can explain (a) adults' continuous judgment data, (b) children's binary judgment data and (c) children's production data (with no training of these datasets), and therefore constitutes a plausible mechanistic account of the acquisition of verbs' argument structure restrictions.
Collapse
Affiliation(s)
- Ben Ambridge
- University of Liverpool, Liverpool, UK
- ESRC International Centre for Language and Communicative Development (LuCiD), Liverpool, UK
| | | | | | | | | | | | | | | | | | | | - Inbal Arnon
- Hebrew University of Jerusalem, Jerusalem, Israel
| | - Dani Bekman
- Hebrew University of Jerusalem, Jerusalem, Israel
| | - Amir Efrati
- Hebrew University of Jerusalem, Jerusalem, Israel
| | | | | | | | - Soumitra Samanta
- University of Liverpool, Liverpool, UK
- ESRC International Centre for Language and Communicative Development (LuCiD), Liverpool, UK
| | | | | | | | | | | | | |
Collapse
|
7
|
Abstract
The study of language acquisition has a long and contentious history: researchers disagree on what drives this process, the relevant data, and the interesting questions. Here, I outline the Starting Big approach to language learning, which emphasizes the role of multiword units in language, and of coarse-to-fine processes in learning. I outline core predictions and supporting evidence. In short, the approach argues that multiword units are integral building blocks in language; that such units can facilitate mastery of semantically opaque relations between words; and that adults rely on them less than children, which can explain (some of) their difficulty in learning a second language. The Starting Big approach is a theory of how children learn language, how language is represented, and how to explain differences between first and second language learning. I discuss the learning and processing models at the heart of the approach and their cross-linguistic implications.
Collapse
Affiliation(s)
- Inbal Arnon
- Psychology Department, Hebrew University, Jerusalem
| |
Collapse
|
8
|
Johnson T, Siegelman N, Arnon I. Individual Differences in Learning Abilities Impact Structure Addition: Better Learners Create More Structured Languages. Cogn Sci 2021; 44:e12877. [PMID: 32737928 DOI: 10.1111/cogs.12877] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 05/24/2020] [Accepted: 06/17/2020] [Indexed: 11/28/2022]
Abstract
Over the last decade, iterated learning studies have provided compelling evidence for the claim that linguistic structure can emerge from non-structured input, through the process of transmission. However, it is unclear whether individuals differ in their tendency to add structure, an issue with implications for understanding who are the agents of change. Here, we identify and test two contrasting predictions: The first sees learning as a pre-requisite for structure addition, and predicts a positive correlation between learning accuracy and structure addition, whereas the second maintains that it is those learners who struggle with learning and reproducing their input who add structure to it. This prediction is hard to test in standard iterated learning paradigms since each learner is exposed to a different input, and since structure and accuracy are computed using the same test items. Here, we test these contrasting predictions in two experiments using a one-generation artificial language learning paradigm designed to provide independent measures of learning accuracy and structure addition. Adults (N = 48 in each study) were exposed to a semi-regular language (with probabilistic structure) and had to learn it: Learning was assessed using seen items, whereas structure addition was calculated over unseen items. In both studies, we find a strong positive correlation between individuals' ability to learn the language and their tendency to add structure to it: Better learners also produced more structured languages. These findings suggest a strong link between learning and generalization. We discuss the implications of these findings for iterated language models and theories of language change more generally.
Collapse
Affiliation(s)
- Tamar Johnson
- Centre for Language Evolution, University of Edinburgh.,Department of Psychology, The Hebrew University of Jerusalem
| | - Noam Siegelman
- Department of Psychology, The Hebrew University of Jerusalem.,Haskins Laboratories
| | - Inbal Arnon
- Department of Psychology, The Hebrew University of Jerusalem
| |
Collapse
|
9
|
Abstract
Multiword units play an important role in language learning and use. It was proposed that learning from such units can facilitate mastery of certain grammatical relations, and that children and adults differ in their use of multiword units during learning, contributing to their varying language-learning trajectories. Accordingly, adults learn gender agreement better when encouraged to learn from multiword units. Previous work has not examined two core predictions of this proposal: (1) that children also benefit from initial exposure to multiword units, and (2) that their learning patterns reflect a greater reliance on multiword units compared to adults. We test both predictions using an artificial-language. As predicted, both children and adults benefit from early exposure to multiword units. In addition, when exposed to unsegmented input - adults show better learning of nouns compared to article-noun pairings, but children do not, a pattern consistent with adults' predicted tendency to focus less on multiword units.
Collapse
Affiliation(s)
- Naomi Havron
- Hebrew University of Jerusalem, Israel
- Department of Psychology, ENS, EHESS, CNRS, PSL University, France
| | | |
Collapse
|
10
|
Skarabela B, Ota M, O'Connor R, Arnon I. 'Clap your hands' or 'take your hands'? One-year-olds distinguish between frequent and infrequent multiword phrases. Cognition 2021; 211:104612. [PMID: 33578095 DOI: 10.1016/j.cognition.2021.104612] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 01/16/2021] [Accepted: 01/21/2021] [Indexed: 10/22/2022]
Abstract
Although words are often described as the basic building blocks of language, there is growing evidence that multiword sequences also play an integral role in language learning and processing. It is not known, however, whether children become sensitive to multiword information at an age when they are still building knowledge of individual words. Using a central fixation paradigm, the present study examined whether infants between 11 and 12 months (N = 36) distinguish between three-word sequences (trigrams) with similar substring frequencies but different multiword frequency in infant-directed speech (e.g., high frequency: 'clap your hands' vs. low frequency: 'take your hands'). Infants looked significantly longer at frequent trigrams compared to infrequent ones. This provides the first evidence that infants at the cusp of one-word production are already sensitive to the frequency of multiword sequences, and suggests they represent linguistic units of varying sizes from early on, raising the need to evaluate knowledge of both words and larger sequences during development.
Collapse
Affiliation(s)
- Barbora Skarabela
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, UK.
| | - Mitsuhiko Ota
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, UK
| | - Rosie O'Connor
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, UK
| | - Inbal Arnon
- Department of Psychology, Hebrew University of Jerusalem, Israel
| |
Collapse
|
11
|
Lavi-Rotbain O, Arnon I. Visual statistical learning is facilitated in Zipfian distributions. Cognition 2020; 206:104492. [PMID: 33157380 DOI: 10.1016/j.cognition.2020.104492] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 10/12/2020] [Accepted: 10/20/2020] [Indexed: 10/23/2022]
Abstract
Humans can extract co-occurrence regularities from their environment, and use them for learning. This statistical learning ability (SL) has been studied extensively as a way to explain how we learn the structure of our environment. These investigations have illustrated the impact of various distributional properties on learning. However, almost all SL studies present the regularities to be learned in uniform frequency distributions where each unit (e.g., image triplet) appears the same number of times: While the regularities themselves are informative, the appearance of the units cannot be predicted. In contrast, real-world learning environments, including the words children hear and the objects they see, are not uniform. Recent research shows that word segmentation is facilitated in a skewed (Zipfian) distribution. Here, we examine the domain-generality of the effect and ask if visual SL is also facilitated in a Zipfian distribution. We use an existing database to show that object combinations have a skewed distribution in children's environment. We then show that children and adults showed better learning in a Zipfian distribution compared to a uniform one, overall, and for low-frequency triplets. These results illustrate the facilitative impact of skewed distributions on learning across modality and age; suggest that the use of uniform distributions may underestimate performance; and point to the possible learnability advantage of such distributions in the real-world.
Collapse
Affiliation(s)
- Ori Lavi-Rotbain
- The Edmond and Lilly Safra Center for Brain Sciences, Hebrew University, Israel.
| | - Inbal Arnon
- Department of Psychology, Hebrew University, Israel
| |
Collapse
|
12
|
Ambridge B, Maitreyee R, Tatsumi T, Doherty L, Zicherman S, Pedro PM, Bannard C, Samanta S, McCauley S, Arnon I, Bekman D, Efrati A, Berman R, Narasimhan B, Sharma DM, Nair RB, Fukumura K, Campbell S, Pye C, Pixabaj SFC, Pelíz MM, Mendoza MJ. The crosslinguistic acquisition of sentence structure: Computational modeling and grammaticality judgments from adult and child speakers of English, Japanese, Hindi, Hebrew and K'iche'. Cognition 2020; 202:104310. [PMID: 32623135 PMCID: PMC7397526 DOI: 10.1016/j.cognition.2020.104310] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 03/31/2020] [Accepted: 04/13/2020] [Indexed: 11/16/2022]
Abstract
This preregistered study tested three theoretical proposals for how children form productive yet restricted linguistic generalizations, avoiding errors such as *The clown laughed the man, across three age groups (5-6 years, 9-10 years, adults) and five languages (English, Japanese, Hindi, Hebrew and K'iche'). Participants rated, on a five-point scale, correct and ungrammatical sentences describing events of causation (e.g., *Someone laughed the man; Someone made the man laugh; Someone broke the truck; ?Someone made the truck break). The verb-semantics hypothesis predicts that, for all languages, by-verb differences in acceptability ratings will be predicted by the extent to which the causing and caused event (e.g., amusing and laughing) merge conceptually into a single event (as rated by separate groups of adult participants). The entrenchment and preemption hypotheses predict, for all languages, that by-verb differences in acceptability ratings will be predicted by, respectively, the verb's relative overall frequency, and frequency in nearly-synonymous constructions (e.g., X made Y laugh for *Someone laughed the man). Analysis using mixed effects models revealed that entrenchment/preemption effects (which could not be distinguished due to collinearity) were observed for all age groups and all languages except K'iche', which suffered from a thin corpus and showed only preemption sporadically. All languages showed effects of event-merge semantics, except K'iche' which showed only effects of supplementary semantic predictors. We end by presenting a computational model which successfully simulates this pattern of results in a single discriminative-learning mechanism, achieving by-verb correlations of around r = 0.75 with human judgment data.
Collapse
Affiliation(s)
- Ben Ambridge
- University of Liverpool, United Kingdom of Great Britain and Northern Ireland; ESRC International Centre for Language and Communicative Development (LuCiD).
| | - Ramya Maitreyee
- University of Liverpool, United Kingdom of Great Britain and Northern Ireland
| | | | - Laura Doherty
- University of Liverpool, United Kingdom of Great Britain and Northern Ireland
| | | | | | - Colin Bannard
- University of Liverpool, United Kingdom of Great Britain and Northern Ireland
| | - Soumitra Samanta
- University of Liverpool, United Kingdom of Great Britain and Northern Ireland; ESRC International Centre for Language and Communicative Development (LuCiD)
| | | | | | | | | | | | | | | | | | - Kumiko Fukumura
- University of Stirling, United Kingdom of Great Britain and Northern Ireland
| | | | - Clifton Pye
- University of Kansas, United States of America
| | | | | | | |
Collapse
|
13
|
Culbertson J, Franck J, Braquet G, Barrera Navarro M, Arnon I. A learning bias for word order harmony: Evidence from speakers of non-harmonic languages. Cognition 2020; 204:104392. [PMID: 32673786 DOI: 10.1016/j.cognition.2020.104392] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 06/14/2020] [Accepted: 06/26/2020] [Indexed: 10/23/2022]
Abstract
Word order harmony describes the tendency, found across the world's languages, to consistently order syntactic heads relative to dependents. It is one of the most well-known and well-studied typological universals. Almost since it was first noted by Greenberg (1963), there has been disagreement about what role, if any, the cognitive system plays in driving harmony. Recently, a series of studies using artificial language learning experiments reported that harmonic noun phrase word orders were preferred over non-harmonic orders by English-speaking adults and children (Culbertson et al., 2012; Culbertson & Newport, 2015, 2017). However, this evidence is potentially confounded by the fact that English is itself a harmonic language (Goldberg, 2013). Here we sought to extend the results from these studies by exploring whether learners who have substantial experience with a non-harmonic language still showed a bias for harmonic patterns during learning. We found that monolingual French- and Hebrew-speaking children, whose language has a non-harmonic noun phrase order (N Adj, Num N) nevertheless preferred harmonic patterns when learning an artificial language. We also found evidence for a harmony bias across several populations of adult learners, although this interacted in complex ways with their L2 experience. Our results suggest that transfer from the L1 cannot explain the preference for harmony found in previous studies. Moreover, they provide the strongest evidence yet that a cognitive bias for harmony is a plausible candidate for shaping linguistic typology.
Collapse
Affiliation(s)
- Jennifer Culbertson
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, United Kingdom.
| | - Julie Franck
- Department of Psychology, University of Geneva, Switzerland
| | - Guillaume Braquet
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, United Kingdom
| | - Magda Barrera Navarro
- School of Philosophy, Psychology and Language Sciences, University of Edinburgh, United Kingdom
| | - Inbal Arnon
- Department of Psychology, Hebrew University of Jerusalem, Israel
| |
Collapse
|
14
|
Arnon I. Statistical Learning, Implicit Learning, and First Language Acquisition: A Critical Evaluation of Two Developmental Predictions. Top Cogn Sci 2019; 11:504-519. [PMID: 31056836 DOI: 10.1111/tops.12428] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 03/22/2019] [Accepted: 04/04/2019] [Indexed: 01/23/2023]
Abstract
The role of distributional information in language learning, and learning more generally, has been studied extensively in both the statistical learning and the implicit learning literatures. Despite the similarity in research questions, the two literatures have remained largely separate. Here, we draw on findings from the two traditions to critically evaluate two developmental predictions that are central to both. The first is the question of age invariance: Does learning improve during development or is it fully developed in infancy? The combined findings suggest that both implicit and statistical learning improve during childhood, contra the age invariance prediction. This raises questions about the role of implicit statistical learning (ISL) in explaining the age-related deterioration in language learning: Children's better language learning abilities cannot be attributed to their improved distributional learning skills. The second issue we examine is the predictive relation to language outcomes: Does variation in learning predict variation in language outcomes? While there is evidence for such links, there is concern in both research traditions about the reliability of the tasks used with children. We present data suggesting that commonly used statistical learning measures may not capture stable individual differences in children, undermining their utility for assessing the link to language outcomes in developmental samples. The evaluation of both predictions highlights the empirical parallels between the implicit and statistical learning literatures, and the need to better integrate their developmental investigation. We go on to discuss several of the open challenges facing the study of ISL during development.
Collapse
Affiliation(s)
- Inbal Arnon
- Department of Psychology, Hebrew University of Jerusalem
| |
Collapse
|
15
|
Havron N, Raviv L, Arnon I. Literate and preliterate children show different learning patterns in an artificial language learning task. J Cult Cogn Sci 2018. [DOI: 10.1007/s41809-018-0015-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
16
|
Abstract
Socio-economic status (SES) impacts the amount and type of input children hear in ways that have developmental consequences. Here, we examine the effect of SES on the use of variation sets (successive utterances with partial self-repetitions) in child-directed speech (CDS). Variation sets have been found to facilitate language learning, but have been studied only in higher-SES groups. Here, we examine their use in naturalistic speech in two languages (Hebrew and English) for both low- and high-SES caregivers. We find that variation sets are more frequent in the input of high-SES caregivers in both languages, indicating that SES also impacts structural properties of CDS.
Collapse
Affiliation(s)
- Shira Tal
- Hebrew University of Jerusalem,Cognitive Science,Mount Scopus,Jerusalem 9190501,Israel
| | - Inbal Arnon
- Hebrew University of Jerusalem,Cognitive Science,Mount Scopus,Jerusalem 9190501,Israel
| |
Collapse
|
17
|
Shufaniya A, Arnon I. Statistical Learning Is Not Age‐Invariant During Childhood: Performance Improves With Age Across Modality. Cogn Sci 2018; 42:3100-3115. [DOI: 10.1111/cogs.12692] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Revised: 08/17/2018] [Accepted: 08/30/2018] [Indexed: 10/28/2022]
Affiliation(s)
- Amir Shufaniya
- Department of Psychology The Hebrew University of Jerusalem
| | - Inbal Arnon
- Department of Psychology The Hebrew University of Jerusalem
| |
Collapse
|
18
|
Abstract
Can emergent literacy impact the size of the linguistic units children attend to? We examined children's ability to segment multiword sequences before and after they learned to read, in order to disentangle the effect of literacy and age on segmentation. We found that early readers were better at segmenting multiword units (after controlling for age, cognitive, and linguistic variables), and that improvement in literacy skills between the two sessions predicted improvement in segmentation abilities. Together, these findings suggest that literacy acquisition, rather than age, enhanced segmentation. We discuss implications for models of language learning.
Collapse
|
19
|
Lavi-Rotbain O, Arnon I. Developmental Differences Between Children and Adults in the Use of Visual Cues for Segmentation. Cogn Sci 2017; 42 Suppl 2:606-620. [DOI: 10.1111/cogs.12528] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Revised: 05/20/2017] [Accepted: 06/22/2017] [Indexed: 11/28/2022]
Affiliation(s)
- Ori Lavi-Rotbain
- The Edmond and Lilly Safra Center for Brain Sciences; Hebrew University
| | | |
Collapse
|
20
|
Raviv L, Arnon I. The developmental trajectory of children's auditory and visual statistical learning abilities: modality-based differences in the effect of age. Dev Sci 2017; 21:e12593. [DOI: 10.1111/desc.12593] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 05/25/2017] [Indexed: 11/27/2022]
Affiliation(s)
- Limor Raviv
- Psychology of Language Department; Max Planck Institute for Psycholinguistics; Nijmegen The Netherlands
- Department of Cognitive Science; Hebrew University; Jerusalem Israel
| | - Inbal Arnon
- Department of Psychology; Hebrew University; Jerusalem Israel
| |
Collapse
|
21
|
Abstract
Why are children better language learners than adults despite being worse at a range of other cognitive tasks? Here, we explore the role of multiword sequences in explaining L1-L2 differences in learning. In particular, we propose that children and adults differ in their reliance on such multiword units (MWUs) in learning, and that this difference affects learning strategies and outcomes, and leads to difficulty in learning certain grammatical relations. In the first part, we review recent findings that suggest that MWUs play a facilitative role in learning. We then discuss the implications of these findings for L1-L2 differences: We hypothesize that adults are both less likely to extract MWUs and less capable of benefiting from them in the process of learning. In the next section, we draw on psycholinguistic, developmental, and computational findings to support these predictions. We end with a discussion of the relation between this proposal and other accounts of L1-L2 difficulty.
Collapse
Affiliation(s)
| | - Morten H Christiansen
- Department of Psychology, Cornell University and School of Communication and Culture, Aarhus University
| |
Collapse
|
22
|
Christiansen MH, Arnon I. More Than Words: The Role of Multiword Sequences in Language Learning and Use. Top Cogn Sci 2017; 9:542-551. [PMID: 28503906 DOI: 10.1111/tops.12274] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Accepted: 03/10/2017] [Indexed: 12/01/2022]
Abstract
The ability to convey our thoughts using an infinite number of linguistic expressions is one of the hallmarks of human language. Understanding the nature of the psychological mechanisms and representations that give rise to this unique productivity is a fundamental goal for the cognitive sciences. A long-standing hypothesis is that single words and rules form the basic building blocks of linguistic productivity, with multiword sequences being treated as units only in peripheral cases such as idioms. The new millennium, however, has seen a shift toward construing multiword linguistic units not as linguistic rarities, but as important building blocks for language acquisition and processing. This shift-which originated within theoretical approaches that emphasize language learning and use-has far-reaching implications for theories of language representation, processing, and acquisition. Incorporating multiword units as integral building blocks blurs the distinction between grammar and lexicon; calls for models of production and comprehension that can accommodate and give rise to the effect of multiword information on processing; and highlights the importance of such units to learning. In this special topic, we bring together cutting-edge work on multiword sequences in theoretical linguistics, first-language acquisition, psycholinguistics, computational modeling, and second-language learning to present a comprehensive overview of the prominence and importance of such units in language, their possible role in explaining differences between first- and second-language learning, and the challenges the combined findings pose for theories of language.
Collapse
Affiliation(s)
- Morten H Christiansen
- Department of Psychology, Cornell University.,Centre for Interacting Minds & School of Communication and Culture, Aarhus University
| | - Inbal Arnon
- Department of Psychology, Hebrew University of Jerusalem
| |
Collapse
|
23
|
Costa A, Foucart A, Arnon I, Aparici M, Apesteguia J. Corrigendum to ‘ “Piensa” twice: On the foreign language effect in decision making’ [Cognition 130 (2) (2014) 236–254]. Cognition 2015. [DOI: 10.1016/j.cognition.2015.05.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
24
|
Arnon I. What can frequency effects tell us about the building blocks and mechanisms of language learning? J Child Lang 2015; 42:274-322. [PMID: 25644409 DOI: 10.1017/s0305000914000610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
|
25
|
Costa A, Foucart A, Arnon I, Aparici M, Apesteguia J. “Piensa” twice: On the foreign language effect in decision making. Cognition 2014; 130:236-54. [DOI: 10.1016/j.cognition.2013.11.010] [Citation(s) in RCA: 104] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2013] [Revised: 11/14/2013] [Accepted: 11/14/2013] [Indexed: 10/25/2022]
|
26
|
Abstract
There is mounting evidence that language users are sensitive to the distributional properties of multi-word sequences. Such findings expand the range of information speakers are sensitive to and call for processing models that can represent larger chains of relations. In the current paper we investigate the effect of multi-word statistics on phonetic duration using a combination of experimental and corpus-based research. We ask (a) if phonetic duration is affected by multi-word frequency in both elicited and spontaneous speech, and (b) if syntactic constituency modulates the effect. We show that phonetic durations are reduced in higher frequency sequences, regardless of constituency: duration is shorter for more frequent sequences within and across syntactic boundaries. The effects are not reducible to the frequency of the individual words or substrings. These findings open up a novel set of questions about the interaction between surface distributions and higher order properties, and the resulting need (or lack thereof) to incorporate higher order properties into processing models.
Collapse
Affiliation(s)
- Inbal Arnon
- Psychology Department, University of Haifa, Israel.
| | | |
Collapse
|
27
|
de Marneffe MC, Grimm S, Arnon I, Kirby S, Bresnan J. A statistical model of the grammatical choices in child production of dative sentences. ACTA ACUST UNITED AC 2012. [DOI: 10.1080/01690965.2010.542651] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
28
|
Arnon I, Ramscar M. Granularity and the acquisition of grammatical gender: how order-of-acquisition affects what gets learned. Cognition 2011; 122:292-305. [PMID: 22169657 DOI: 10.1016/j.cognition.2011.10.009] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2010] [Revised: 08/04/2011] [Accepted: 10/14/2011] [Indexed: 11/24/2022]
Abstract
Why do adult language learners typically fail to acquire second languages with native proficiency? Does prior linguistic experience influence the size of the "units" adults attend to in learning, and if so, how does this influence what gets learned? Here, we examine these questions in relation to grammatical gender, which adult learners almost invariably struggle to master. We present a model of learning that predicts that exposure to smaller units (such as nouns) before exposure to larger linguistic units (such as sentences) can critically impair learning about predictive relations between units: such as that between a noun and its article. This prediction is then confirmed by a study of adult participants learning grammatical gender in an artificial language. Adults learned both nouns and their articles better when they were first heard nouns used in context with their articles prior to hearing the nouns individually, compared with learners who first heard the nouns in isolation, prior to hearing them used in context. In the light of these results, we discuss the role gender appears to play in language, the importance of meaning in artificial grammar learning, and the implications of this work for the structure of L2-training.
Collapse
Affiliation(s)
- Inbal Arnon
- Psychology Department, University of Haifa, Haifa, Israel.
| | | |
Collapse
|
29
|
Hofmeister P, Jaeger TF, Arnon I, Sag IA, Snider N. The source ambiguity problem: Distinguishing the effects of grammar and processing on acceptability judgments. ACTA ACUST UNITED AC 2011; 28:48-87. [PMID: 23539204 DOI: 10.1080/01690965.2011.572401] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Abstract
Judgments of linguistic unacceptability may theoretically arise from either grammatical deviance or significant processing difficulty. Acceptability data are thus naturally ambiguous in theories that explicitly distinguish formal and functional constraints. Here, we consider this source ambiguity problem in the context of Superiority effects: the dispreference for ordering a wh-phrase in front of a syntactically "superior" wh-phrase in multiple wh-questions, e.g. What did who buy? More specifically, we consider the acceptability contrast between such examples and so-called D-linked examples, e.g. Which toys did which parents buy? Evidence from acceptability and self-paced reading experiments demonstrates that (i) judgments and processing times for Superiority violations vary in parallel, as determined by the kind of wh-phrases they contain, (ii) judgments increase with exposure while processing times decrease, (iii) reading times are highly predictive of acceptability judgments for the same items, and (iv) the effects of the complexity of the wh-phrases combine in both acceptability judgments and reading times. This evidence supports the conclusion that D-linking effects are likely reducible to independently motivated cognitive mechanisms whose effects emerge in a wide range of sentence contexts. This in turn suggests that Superiority effects, in general, may owe their character to differential processing difficulty.
Collapse
Affiliation(s)
- Philip Hofmeister
- Center for Research in Language, University of California-San Diego, La Jolla, CA, Telephone: (650)-387-6641
| | | | | | | | | |
Collapse
|
30
|
Abstract
Children find object relative clauses difficult. They show poor comprehension that lags behind production into their fifth year. This finding has shaped models of relative clause acquisition, with appeals to processing heuristics or syntactic preferences to explain why object relatives are more difficult than subject relatives. Two studies here suggest that children (age 4 ; 6) do not find all object relatives difficult: a corpus study shows that children most often hear and produce object relatives with pronominal subjects. But they are most often tested on ones with lexical-NP subjects (e.g. The nurse that the girl is drawing). When tested on object relatives with pronominal subjects (e.g. The nurse that I am drawing), similar to those they actually hear and produce, Hebrew speakers aged 4 ; 6 show good comprehension (85% accuracy) that matches their production ability. This suggests a different path of relative clause acquisition, one that is sensitive to fine-grained distributional information.
Collapse
Affiliation(s)
- Inbal Arnon
- Linguistics Department, Margaret Jacks Hall, Stanford University, CA 94305, USA.
| |
Collapse
|
31
|
Koren D, Arnon I, Klein E. Long term course of chronic posttraumatic stress disorder in traffic accident victims: a three-year prospective follow-up study. Behav Res Ther 2001; 39:1449-58. [PMID: 11758702 DOI: 10.1016/s0005-7967(01)00025-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The purpose of the present study was to gather prospective longitudinal data on the long-term course and outcome of chronic posttraumatic stress disorder (PTSD). The target population for this study was 74 injured traffic accident victims who had been previously followed-up for one year after the trauma. Nineteen of the original 24 PTSD subjects (79%) and 39 of the original 50 Non-PTSD subjects (78%) were available for this study, which took place during the fourth year after the accident. Our results show that 10 (53%) of the 19 patients with PTSD at one-year still suffered from PTSD after another two-year follow-up interval, while 9 recovered from PTSD during this follow-up period. Only 2 of the 39 without PTSD at one year developed delayed onset PTSD. The best predictor of recovery from chronic PTSD was the initial level of posttraumatic reaction immediately after the accident. These results demonstrate that spontaneous recovery from PTSD can occur even among patients who are currently considered chronic. Severity of initial reaction to the trauma appears to be a major risk factor for non-remitting chronic PTSD.
Collapse
Affiliation(s)
- D Koren
- Department of Psychiatry, Rambam Medical Center, B. Rappaport Faculty of Medicine Technion I.I.T., Haifa, Israel
| | | | | |
Collapse
|
32
|
Abstract
OBJECTIVE This study was designed to assess the natural course of posttraumatic symptoms formation, as well as the degree to which acute stress reactions predict later posttraumatic stress disorder (PTSD) in injured traffic accident victims. METHOD A prospective, 1-year follow-up study was carried out on 74 injured traffic accident victims and a comparison group of 19 patients who were hospitalized for elective orthopedic surgery. Participants were interviewed within the first week following the accident, and follow-up interviews were performed 1, 3, 6, and 12 months after the accident. At 12 months, a structured clinical interview was administered to determine a formal DSM-III-R diagnosis of PTSD. RESULTS Twenty-four (32%) of the 74 traffic accident victims, but none of the 19 comparison subjects, met DSM-III-R criteria for PTSD at 1 year. Traffic accident victims who developed PTSD had higher levels of premorbid and comorbid psychopathology. Levels of posttraumatic symptoms were significantly higher from the outset in the subjects who developed PTSD and worsened progressively over the first 3 months, in contrast to subjects without PTSD, who manifested gradual amelioration of symptoms during this time. Existence of posttraumatic symptoms immediately after the accident was a better predictor of later PTSD than was accident or injury severity. CONCLUSIONS In this study, a significant portion of injured traffic accident victims manifested PTSD 1 year after the event. The development of PTSD at 1 year can be predicted as early as 1 week after the accident on the basis of the existence and severity of early PTSD-related symptoms. However, the first 3 months following the accident appear to be the critical period for the development of PTSD.
Collapse
Affiliation(s)
- D Koren
- Department of Psychiatry of the Rambam Medical Center and the B. Rappaport Faculty of Medicine, Technion, I.I.T., Haifa, Israel
| | | | | |
Collapse
|
33
|
Fox S, Arnon I. Motion sickness and anxiety. Aviat Space Environ Med 1988; 59:728-33. [PMID: 3178620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Ninety-four Israeli pilot trainees completed a battery of anxiety related questionnaires: Taylor Manifest Anxiety Scale, EPQ, 16PF, and Spielberger's State and Trait Anxiety Scores. Self reports and flight instructor observations of motion sickness symptoms were collected after initial flights. No significant correlations were found between these two sources. Anxiety scores derived from the battery of anxiety questionnaires were correlated with self reports of motion sickness but not with instructor observations. Discussion focused on the potential limitations of external observers in assessing motion sickness, the convergence of anxiety assessments, and the overlap between anxiety and motion sickness symptoms. Methodological and practical implications conclude the review.
Collapse
Affiliation(s)
- S Fox
- Department of Psychology, Bar-Ilan University, Ramat-Gan, Israel
| | | |
Collapse
|