Reference Citation Analysis

For: Endress AD, Johnson SP. When forgetting fosters learning: A neural network model for statistical learning. Cognition 2021;213:104621. [PMID: 33608130 DOI: 10.1016/j.cognition.2021.104621] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 06/20/2020] [Revised: 12/19/2020] [Accepted: 01/28/2021] [Indexed: 11/28/2022]

Cited by Other Article(s)

Endress AD, de Seyssel M. The specificity of sequential statistical learning: Statistical learning accumulates predictive information from unstructured input but is dissociable from (declarative) memory for words. Cognition 2025;261:106130. [PMID: 40250103 DOI: 10.1016/j.cognition.2025.106130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/14/2025] [Accepted: 03/24/2025] [Indexed: 04/20/2025]

Abstract

Learning statistical regularities from the environment is ubiquitous across domains and species. It might support the earliest stages of language acquisition, especially identifying and learning words from fluent speech (i.e., word-segmentation). But how do the statistical learning mechanisms involved in word-segmentation interact with the memory mechanisms needed to remember words, and with the learning situations where words need to be learned? Through computational modeling, we first show that earlier results purportedly supporting memory-based theories of statistical learning can be reproduced by memory-less Hebbian learning mechanisms. We then show that, in a memory recall task after exposure to continuous, statistically structured speech sequences, participants track the statistical structure of the speech sequences and are thus sensitive to probable syllable transitions. However, they hardly remember any items at all, with 82% producing no high-probability items. Among the 30% of participants producing (correct) high- or (incorrect) low-probability items, half produced high-probability items and half low-probability items, even while preferring high-probability items in a recognition test. Only discrete familiarization sequences with isolated words yield memories of actual items. Turning to how specific learning situations affect statistical learning, we show that it predominantly operates in continuous speech sequences like those used in earlier experiments, but not in discrete chunk sequences likely more characteristic of early language acquisition. Taken together, these results suggest that statistical learning might be specialized to accumulate distributional information, but that it is dissociable from the (declarative) memory mechanisms needed to acquire words and does not allow learners to identify probable word boundaries.

Jiao L, Ma M, He P, Geng X, Liu X, Liu F, Ma W, Yang S, Hou B, Tang X. Brain-Inspired Learning, Perception, and Cognition: A Comprehensive Review. IEEE Transactions on Neural Networks and Learning Systems 2025;36:5921-5941. [PMID: 38809737 DOI: 10.1109/tnnls.2024.3401711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/31/2024]

Pinto Arata L, Ordonez Magro L, Ramisch C, Grainger J, Rey A. The dynamics of multiword sequence extraction. Q J Exp Psychol (Hove) 2024;77:2439-2462. [PMID: 38247195 DOI: 10.1177/17470218241228548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/23/2024]

Verosky NJ. Associative Learning of an Unnormalized Successor Representation. Neural Comput 2024;36:1410-1423. [PMID: 38776964 DOI: 10.1162/neco_a_01675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/04/2023] [Accepted: 03/13/2024] [Indexed: 05/25/2024]

Benjamin L, Sablé-Meyer M, Fló A, Dehaene-Lambertz G, Al Roumi F. Long-Horizon Associative Learning Explains Human Sensitivity to Statistical and Network Structures in Auditory Sequences. J Neurosci 2024;44:e1369232024. [PMID: 38408873 PMCID: PMC10993028 DOI: 10.1523/jneurosci.1369-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2023] [Revised: 01/16/2024] [Accepted: 02/07/2024] [Indexed: 02/28/2024] Open

Tosatto L, Fagot J, Nemeth D, Rey A. Chunking as a function of sequence length. Anim Cogn 2024;28:2. [PMID: 38429566 PMCID: PMC11671558 DOI: 10.1007/s10071-024-01835-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/25/2022] [Revised: 09/10/2023] [Accepted: 11/01/2023] [Indexed: 03/03/2024]

Endress AD. Hebbian learning can explain rhythmic neural entrainment to statistical regularities. Dev Sci 2024:e13487. [PMID: 38372153 DOI: 10.1111/desc.13487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Revised: 12/26/2023] [Accepted: 01/29/2024] [Indexed: 02/20/2024]

Abstract

In many domains, learners extract recurring units from continuous sequences. For example, in unknown languages, fluent speech is perceived as a continuous signal. Learners need to extract the underlying words from this continuous signal and then memorize them. One prominent candidate mechanism is statistical learning, whereby learners track how predictive syllables (or other items) are of one another. Syllables within the same word predict each other better than syllables straddling word boundaries. But does statistical learning lead to memories of the underlying words, or just to pairwise associations among syllables? Electrophysiological results provide the strongest evidence for the memory view. Electrophysiological responses can be time-locked to statistical word boundaries (e.g., N400s) and show rhythmic activity with a periodicity of word durations. Here, I reproduce such results with a simple Hebbian network. When exposed to statistically structured syllable sequences (and when the underlying words are not excessively long), the network activation is rhythmic with the periodicity of a word duration and activation maxima on word-final syllables. This is because word-final syllables receive more excitation from earlier syllables with which they are associated than less predictable syllables that occur earlier in words. The network is also sensitive to information whose electrophysiological correlates were used to support the encoding of ordinal positions within words. Hebbian learning can thus explain rhythmic neural activity in statistical learning tasks without any memory representations of words. Learners might thus need to rely on cues beyond statistical associations to learn the words of their native language.

RESEARCH HIGHLIGHTS: Statistical learning may be utilized to identify recurring units in continuous sequences (e.g., words in fluent speech) but may not generate explicit memory for words. Exposure to statistically structured sequences leads to rhythmic activity with a period of the duration of the underlying units (e.g., words). I show that a memory-less Hebbian network model can reproduce this rhythmic neural activity as well as putative encodings of ordinal positions observed in earlier research. Direct tests are needed to establish whether statistical learning leads to declarative memories for words.

Sherman BE, Turk-Browne NB, Goldfarb EV. Multiple Memory Subsystems: Reconsidering Memory in the Mind and Brain. Perspectives on Psychological Science 2024;19:103-125. [PMID: 37390333 PMCID: PMC10756937 DOI: 10.1177/17456916231179146] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Indexed: 07/02/2023]

Yeaton J, Tosatto L, Fagot J, Grainger J, Rey A. Simple questions on simple associations: regularity extraction in non-human primates. Learn Behav 2023;51:392-401. [PMID: 37284936 PMCID: PMC10716064 DOI: 10.3758/s13420-023-00579-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 02/28/2023] [Indexed: 06/08/2023]

Endress AD, Johnson SP. Hebbian, correlational learning provides a memory-less mechanism for Statistical Learning irrespective of implementational choices: Reply to Tovar and Westermann (2022). Cognition 2023;230:105290. [PMID: 36240613 DOI: 10.1016/j.cognition.2022.105290] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/12/2022] [Revised: 08/30/2022] [Accepted: 09/17/2022] [Indexed: 11/07/2022]

Sherman BE, Graves KN, Huberdeau DM, Quraishi IH, Damisah EC, Turk-Browne NB. Temporal Dynamics of Competition between Statistical Learning and Episodic Memory in Intracranial Recordings of Human Visual Cortex. J Neurosci 2022;42:9053-9068. [PMID: 36344264 PMCID: PMC9732826 DOI: 10.1523/jneurosci.0708-22.2022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/07/2022] [Revised: 10/10/2022] [Accepted: 10/13/2022] [Indexed: 11/09/2022] Open

Abstract

The function of long-term memory is not just to reminisce about the past, but also to make predictions that help us behave appropriately and efficiently in the future. This predictive function of memory provides a new perspective on the classic question from memory research of why we remember some things but not others. If prediction is a key outcome of memory, then the extent to which an item generates a prediction signifies that this information already exists in memory and need not be encoded. We tested this principle using human intracranial EEG as a time-resolved method to quantify prediction in visual cortex during a statistical learning task and link the strength of these predictions to subsequent episodic memory behavior. Epilepsy patients of both sexes viewed rapid streams of scenes, some of which contained regularities that allowed the category of the next scene to be predicted. We verified that statistical learning occurred using neural frequency tagging and measured category prediction with multivariate pattern analysis. Although neural prediction was robust overall, this was driven entirely by predictive items that were subsequently forgotten. Such interference provides a mechanism by which prediction can regulate memory formation to prioritize encoding of information that could help learn new predictive relationships.

SIGNIFICANCE STATEMENT: When faced with a new experience, we are rarely at a loss for what to do. Rather, because many aspects of the world are stable over time, we rely on past experiences to generate expectations that guide behavior. Here we show that these expectations during a new experience come at the expense of memory for that experience. From intracranial recordings of visual cortex, we decoded what humans expected to see next in a series of photographs based on patterns of neural activity. Photographs that generated strong neural expectations were more likely to be forgotten in a later behavioral memory test. Prioritizing the storage of experiences that currently lead to weak expectations could help improve these expectations in future encounters.

Tosatto L, Bonafos G, Melmi JB, Rey A. Detecting non-adjacent dependencies is the exception rather than the rule. PLoS One 2022;17:e0270580. [PMID: 35834512 PMCID: PMC9282578 DOI: 10.1371/journal.pone.0270580] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/30/2021] [Accepted: 06/14/2022] [Indexed: 11/24/2022] Open

Fiser J, Lengyel G. Statistical Learning in Vision. Annu Rev Vis Sci 2022;8:265-290. [PMID: 35727961 DOI: 10.1146/annurev-vision-100720-103343] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 11/09/2022]

Rey A, Fagot J, Mathy F, Lazartigues L, Tosatto L, Bonafos G, Freyermuth JM, Lavigne F. Learning Higher-Order Transitional Probabilities in Nonhuman Primates. Cogn Sci 2022;46:e13121. [PMID: 35363923 DOI: 10.1111/cogs.13121] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 10/23/2021] [Revised: 02/16/2022] [Accepted: 02/17/2022] [Indexed: 11/29/2022]

Fló A, Benjamin L, Palu M, Dehaene-Lambertz G. Sleeping neonates track transitional probabilities in speech but only retain the first syllable of words. Sci Rep 2022;12:4391. [PMID: 35292694 PMCID: PMC8924158 DOI: 10.1038/s41598-022-08411-w] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 09/04/2021] [Accepted: 02/25/2022] [Indexed: 12/15/2022] Open

Verosky NJ, Morgan E. Pitches that Wire Together Fire Together: Scale Degree Associations Across Time Predict Melodic Expectations. Cogn Sci 2021;45:e13037. [PMID: 34606140 DOI: 10.1111/cogs.13037] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 12/01/2021] [Revised: 07/22/2021] [Accepted: 07/23/2021] [Indexed: 11/29/2022]

Abstract

The ongoing generation of expectations is fundamental to listeners' experience of music, but research into types of statistical information that listeners extract from musical melodies has tended to emphasize transition probabilities and n-grams, with limited consideration given to other types of statistical learning that may be relevant. Temporal associations between scale degrees represent a different type of information present in musical melodies that can be learned from musical corpora using expectation networks, a computationally simple method based on activation and decay. Expectation networks infer the expectation of encountering one scale degree followed in the near (but not necessarily immediate) future by another given scale degree, with previous work suggesting that scale degree associations learned by expectation networks better predict listener ratings of pitch similarity than transition probabilities. The current work outlines how these learned scale degree associations can be combined to predict melodic continuations and tests the resulting predictions on a dataset of listener responses to a musical cloze task previously used to compare two other models of melodic expectation, a variable-order Markov model (IDyOM) and Temperley's music-theoretically motivated model. Under multinomial logistic regression, all three models explain significant unique variance in human melodic expectations, with coefficient estimates highest for expectation networks. These results suggest that generalized scale degree associations informed by both adjacent and nonadjacent relationships between melodic notes influence listeners' melodic predictions above and beyond n-gram context, highlighting the need to consider a broader range of statistical learning processes that may underlie listeners' expectations for upcoming musical events.