1
|
Russin J, Pavlick E, Frank MJ. CURRICULUM EFFECTS AND COMPOSITIONALITY EMERGE WITH IN-CONTEXT LEARNING IN NEURAL NETWORKS. ARXIV 2024:arXiv:2402.08674v3. [PMID: 38410645 PMCID: PMC10896373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 02/28/2024]
Abstract
Human learning embodies a striking duality: sometimes, we appear capable of following logical, compositional rules and benefit from structured curricula (e.g., in formal education), while other times, we rely on an incremental approach or trial-and-error, learning better from curricula that are unstructured or randomly interleaved. Influential psychological theories explain this seemingly disparate behavioral evidence by positing two qualitatively different learning systems-one for rapid, rule-based inferences and another for slow, incremental adaptation. It remains unclear how to reconcile such theories with neural networks, which learn via incremental weight updates and are thus a natural model for the latter type of learning, but are not obviously compatible with the former. However, recent evidence suggests that both metalearning neural networks and large language models are capable of "in-context learning" (ICL)-the ability to flexibly grasp the structure of a new task from a few examples given at inference time. Here, we show that networks capable of ICL can reproduce human-like learning and compositional behavior on rule-governed tasks, while at the same time replicating human behavioral phenomena in tasks lacking rule-like structure via their usual in-weight learning (IWL). Our work shows how emergent ICL can equip neural networks with fundamentally different learning properties than those traditionally attributed to them, and that these can coexist with the properties of their native IWL, thus offering a novel perspective on dual-process theories and human cognitive flexibility.
Collapse
Affiliation(s)
- Jacob Russin
- Department of Computer Science, Department of Cognitive and Psychological Sciences, Brown University
| | | | - Michael J Frank
- Department of Cognitive and Psychological Sciences, Carney Institute for Brain Science, Brown University
| |
Collapse
|
2
|
Moskovitz T, Miller KJ, Sahani M, Botvinick MM. Understanding dual process cognition via the minimum description length principle. PLoS Comput Biol 2024; 20:e1012383. [PMID: 39423224 PMCID: PMC11534269 DOI: 10.1371/journal.pcbi.1012383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 11/04/2024] [Accepted: 08/01/2024] [Indexed: 10/21/2024] Open
Abstract
Dual-process theories play a central role in both psychology and neuroscience, figuring prominently in domains ranging from executive control to reward-based learning to judgment and decision making. In each of these domains, two mechanisms appear to operate concurrently, one relatively high in computational complexity, the other relatively simple. Why is neural information processing organized in this way? We propose an answer to this question based on the notion of compression. The key insight is that dual-process structure can enhance adaptive behavior by allowing an agent to minimize the description length of its own behavior. We apply a single model based on this observation to findings from research on executive control, reward-based learning, and judgment and decision making, showing that seemingly diverse dual-process phenomena can be understood as domain-specific consequences of a single underlying set of computational principles.
Collapse
Affiliation(s)
- Ted Moskovitz
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Google DeepMind, London, United Kingdom
| | - Kevin J. Miller
- Google DeepMind, London, United Kingdom
- Department of Ophthalmology, University College London, London, United Kingdom
| | - Maneesh Sahani
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
| | - Matthew M. Botvinick
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Google DeepMind, London, United Kingdom
| |
Collapse
|
3
|
Russin J, Zolfaghar M, Park SA, Boorman E, O'Reilly RC. A Neural Network Model of Continual Learning with Cognitive Control. COGSCI ... ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY. COGNITIVE SCIENCE SOCIETY (U.S.). CONFERENCE 2022; 44:1064-1071. [PMID: 37223441 PMCID: PMC10205096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Neural networks struggle in continual learning settings from catastrophic forgetting: when trials are blocked, new learning can overwrite the learning from previous blocks. Humans learn effectively in these settings, in some cases even showing an advantage of blocking, suggesting the brain contains mechanisms to overcome this problem. Here, we build on previous work and show that neural networks equipped with a mechanism for cognitive control do not exhibit catastrophic forgetting when trials are blocked. We further show an advantage of blocking over interleaving when there is a bias for active maintenance in the control signal, implying a tradeoff between maintenance and the strength of control. Analyses of map-like representations learned by the networks provided additional insights into these mechanisms. Our work highlights the potential of cognitive control to aid continual learning in neural networks, and offers an explanation for the advantage of blocking that has been observed in humans.
Collapse
Affiliation(s)
- Jacob Russin
- Dept. of Psychology, UC Davis
- Center for Neuroscience, UC Davis
| | - Maryam Zolfaghar
- Dept. of Computer Science, UC Davis
- Center for Neuroscience, UC Davis
| | | | - Erie Boorman
- Dept. of Psychology, UC Davis
- Center for Mind and Brain, UC Davis
| | - Randall C O'Reilly
- Dept. of Psychology, UC Davis
- Dept. of Computer Science, UC Davis
- Center for Neuroscience, UC Davis
| |
Collapse
|
4
|
Wood W, Mazar A, Neal DT. Habits and Goals in Human Behavior: Separate but Interacting Systems. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2021; 17:590-605. [PMID: 34283681 DOI: 10.1177/1745691621994226] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
People automatically repeat behaviors that were frequently rewarded in the past in a given context. Such repetition is commonly attributed to habit, or associations in memory between a context and a response. Once habits form, contexts directly activate the response in mind. An opposing view is that habitual behaviors depend on goals. However, we show that this view is challenged by the goal independence of habits across the fields of social and health psychology, behavioral neuroscience, animal learning, and computational modeling. It also is challenged by direct tests revealing that habits do not depend on implicit goals. Furthermore, we show that two features of habit memory-rapid activation of specific responses and resistance to change-explain the different conditions under which people act on habit versus persuing goals. Finally, we tested these features with a novel secondary analysis of action-slip data. We found that habitual responses are activated regardless of goals, but they can be performed in concert with goal pursuit.
Collapse
Affiliation(s)
- Wendy Wood
- Department of Psychology, University of Southern California.,Marshall School of Business, University of Southern California
| | - Asaf Mazar
- Department of Psychology, University of Southern California
| | - David T Neal
- Catalyst Behavioral Sciences, Coral Gables, Florida.,Center for Advanced Hindsight, Duke University
| |
Collapse
|
5
|
Russin J, Zolfaghar M, Park SA, Boorman E, O'Reilly RC. Complementary Structure-Learning Neural Networks for Relational Reasoning. COGSCI ... ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY. COGNITIVE SCIENCE SOCIETY (U.S.). CONFERENCE 2021; 2021:1560-1566. [PMID: 34617073 PMCID: PMC8491570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The neural mechanisms supporting flexible relational inferences, especially in novel situations, are a major focus of current research. In the complementary learning systems framework, pattern separation in the hippocampus allows rapid learning in novel environments, while slower learning in neocortex accumulates small weight changes to extract systematic structure from well-learned environments. In this work, we adapt this framework to a task from a recent fMRI experiment where novel transitive inferences must be made according to implicit relational structure. We show that computational models capturing the basic cognitive properties of these two systems can explain relational transitive inferences in both familiar and novel environments, and reproduce key phenomena observed in the fMRI experiment.
Collapse
Affiliation(s)
- Jacob Russin
- Department of Psychology, UC Davis
- Center for Neuroscience, UC Davis
| | - Maryam Zolfaghar
- Department of Computer Science, UC Davis
- Center for Neuroscience, UC Davis
| | | | - Erie Boorman
- Department of Psychology, UC Davis
- Center for Mind and Brain, UC Davis
| | - Randall C O'Reilly
- Department of Psychology, UC Davis
- Department of Computer Science, UC Davis
- Center for Mind and Brain, UC Davis
- Center for Neuroscience, UC Davis
| |
Collapse
|
6
|
Herd S, Krueger K, Nair A, Mollick J, O'Reilly R. Neural Mechanisms of Human Decision-Making. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2021; 21:35-57. [PMID: 33409958 DOI: 10.3758/s13415-020-00842-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 09/28/2020] [Indexed: 11/08/2022]
Abstract
We present a theory and neural network model of the neural mechanisms underlying human decision-making. We propose a detailed model of the interaction between brain regions, under a proposer-predictor-actor-critic framework. This theory is based on detailed animal data and theories of action-selection. Those theories are adapted to serial operation to bridge levels of analysis and explain human decision-making. Task-relevant areas of cortex propose a candidate plan using fast, model-free, parallel neural computations. Other areas of cortex and medial temporal lobe can then predict likely outcomes of that plan in this situation. This optional prediction- (or model-) based computation can produce better accuracy and generalization, at the expense of speed. Next, linked regions of basal ganglia act to accept or reject the proposed plan based on its reward history in similar contexts. If that plan is rejected, the process repeats to consider a new option. The reward-prediction system acts as a critic to determine the value of the outcome relative to expectations and produce dopamine as a training signal for cortex and basal ganglia. By operating sequentially and hierarchically, the same mechanisms previously proposed for animal action-selection could explain the most complex human plans and decisions. We discuss explanations of model-based decisions, habitization, and risky behavior based on the computational model.
Collapse
Affiliation(s)
- Seth Herd
- eCortex, Inc., Boulder, CO, USA.
- University of Colorado, Boulder, CO, USA.
| | - Kai Krueger
- eCortex, Inc., Boulder, CO, USA
- University of Colorado, Boulder, CO, USA
| | - Ananta Nair
- eCortex, Inc., Boulder, CO, USA
- University of Colorado, Boulder, CO, USA
| | - Jessica Mollick
- eCortex, Inc., Boulder, CO, USA
- University of Colorado, Boulder, CO, USA
- Yale University, New Haven, CT, USA
| | - Randall O'Reilly
- eCortex, Inc., Boulder, CO, USA
- University of Colorado, Boulder, CO, USA
- University of California, Davis, Davis, CA, USA
| |
Collapse
|