1
Garagnani M, Vieider FM. Economic Consequences of Numerical Adaptation. Psychol Sci 2025:9567976251339195. [PMID: 40392760 DOI: 10.1177/09567976251339195] [Indexed: 05/22/2025] Open
Abstract
Resource constraints in neural information processing imply that numerical discriminability optimally adapts to the frequency of numerical magnitudes in a decision maker's environment. Here, we tested the economic consequences of efficient numerical range adaptation in representative samples of the United Kingdom and Japan (N = 2,309) and in a replication in Austria and Hungary (N = 607). We exploited natural variation in currency units and combined it with an orthogonal variation in experimental currency units to detect the effect of habitual versus nonhabitual numerical ranges on the incidence of errors in decisions under risk. The results highlight the direct economic importance of numerical adaptation, thus calling into question standard assumptions that choice quantities are perceived without noise.
Affiliation(s)
- Michele Garagnani
  - Centre for Brain, Mind, and Markets, Department of Finance, University of Melbourne
- Ferdinand M Vieider
  - Risk, Insurance, and Savings Laboratory, Department of Economics, Ghent University
2
Schulze C, Aka A, Bartels DM, Bucher SF, Embrey JR, Gureckis TM, Häubl G, Ho MK, Krajbich I, Moore AK, Oettingen G, Ongchoco JDK, Oprea R, Reinholtz N, Newell BR. A timeline of cognitive costs in decision-making. Trends Cogn Sci 2025:S1364-6613(25)00083-X. [PMID: 40393899 DOI: 10.1016/j.tics.2025.04.004] [Received: 09/30/2024] [Revised: 04/02/2025] [Accepted: 04/02/2025] [Indexed: 05/22/2025]
Abstract
Recent research from economics, psychology, cognitive science, computer science, and marketing is increasingly interested in the idea that people face cognitive costs when making decisions. Reviewing and synthesizing this research, we develop a framework of cognitive costs that organizes concepts along a temporal dimension and maps out when costs occur in the decision-making process and how they impact decisions. Our unifying framework broadens the scope of research on cognitive costs to a wider timeline of cognitive processing. We identify implications and recommendations emerging from our framework for intervening on behavior to tackle some of the most pressing issues of our day, from improving health and saving decisions to mitigating the consequences of climate change.
Affiliation(s)
- Christin Schulze
  - School of Psychology, University of New South Wales, Sydney, NSW, Australia
- Ada Aka
  - Stanford Graduate School of Business, Stanford, CA, USA
- Daniel M Bartels
  - University of Chicago, Booth School of Business, Chicago, IL, USA
- Stefan F Bucher
  - University of Cambridge, Faculty of Economics, Cambridge, UK; Massachusetts Institute of Technology, Sloan School of Management, Cambridge, MA, USA; Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- Jake R Embrey
  - School of Psychology, University of New South Wales, Sydney, NSW, Australia; University of Chicago, Booth School of Business, Chicago, IL, USA
- Todd M Gureckis
  - New York University, Department of Psychology, New York, NY, USA
- Gerald Häubl
  - University of Alberta, School of Business, Edmonton, AB, Canada
- Mark K Ho
  - Stevens Institute of Technology, Department of Computer Science, Hoboken, NJ, USA
- Ian Krajbich
  - University of California Los Angeles, Department of Psychology, Los Angeles, CA, USA
- Alexander K Moore
  - University of Illinois Chicago, Department of Marketing, Chicago, IL, USA
- Joan D K Ongchoco
  - University of British Columbia, Department of Psychology, Vancouver, BC, Canada
- Ryan Oprea
  - University of California Santa Barbara, Department of Economics, Santa Barbara, CA, USA
- Nicholas Reinholtz
  - University of Colorado Boulder, Leeds School of Business, Boulder, CO, USA
- Ben R Newell
  - School of Psychology, University of New South Wales, Sydney, NSW, Australia; Institute for Climate Risk & Response, University of New South Wales, Sydney, NSW, Australia
3
D’Amato L, Lancia GL, Pezzulo G. The geometry of efficient codes: How rate-distortion trade-offs distort the latent representations of generative models. PLoS Comput Biol 2025; 21:e1012952. [PMID: 40354307 PMCID: PMC12068621 DOI: 10.1371/journal.pcbi.1012952] [Received: 06/11/2024] [Accepted: 03/11/2025] [Indexed: 05/14/2025] Open
Abstract
Living organisms rely on internal models of the world to act adaptively. These models, because of resource limitations, cannot encode every detail and hence need to compress information. From a cognitive standpoint, information compression can manifest as a distortion of latent representations, resulting in the emergence of representations that may not accurately reflect the external world or its geometry. Rate-distortion theory formalizes the optimal way to compress information while minimizing such distortions, by considering factors such as capacity limitations, the frequency and the utility of stimuli. However, while this theory explains why the above factors distort latent representations, it does not specify which specific distortions they produce. To address this question, here we investigate how rate-distortion trade-offs shape the latent representations of images in generative models, specifically Beta Variational Autoencoders (β-VAEs), under varying constraints of model capacity, data distributions, and task objectives. By systematically exploring these factors, we identify three primary distortions in latent representations: prototypization, specialization, and orthogonalization. These distortions emerge as signatures of information compression, reflecting the model's adaptation to capacity limitations, data imbalances, and task demands. Additionally, our findings demonstrate that these distortions can coexist, giving rise to a rich landscape of latent spaces, whose geometry could differ significantly across generative models subject to different constraints. Our findings help explain how the normative constraints of rate-distortion theory shape the geometry of latent representations of generative models of artificial systems and living organisms.
Affiliation(s)
- Leo D’Amato
  - Department of Control and Computer Engineering, Polytechnic University of Turin, Turin, Italy
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
- Gian Luca Lancia
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
  - Department of Psychology, Sapienza University of Rome, Rome, Italy
- Giovanni Pezzulo
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
4
Schulz L, Streicher Y, Schulz E, Bhui R, Dayan P. Mechanisms of mistrust: A Bayesian account of misinformation learning. PLoS Comput Biol 2025; 21:e1012814. [PMID: 40367148 PMCID: PMC12077715 DOI: 10.1371/journal.pcbi.1012814] [Received: 10/31/2023] [Accepted: 01/21/2025] [Indexed: 05/16/2025] Open
Abstract
From the intimate realm of personal interactions to the sprawling arena of political discourse, discerning the trustworthy from the dubious is crucial. Here, we present a novel behavioral task and accompanying Bayesian models that allow us to study key aspects of this learning process in a tightly controlled setting. In our task, participants are confronted with several different types of (mis-)information sources, ranging from ones that lie to ones with biased reporting, and have to learn these attributes under varying degrees of feedback. We formalize inference in this setting as a doubly Bayesian learning process where agents simultaneously learn about the ground truth as well as the qualities of an information source reporting on this ground truth. Our model and detailed analyses reveal how participants can generally follow Bayesian learning dynamics, highlighting a basic human ability to learn about diverse information sources. This learning is also reflected in explicit trust reports about the sources. We additionally show how participants approached the inference problem with priors that held sources to be helpful. Finally, when outside feedback was noisier, participants still learned along Bayesian lines but struggled to pick up on biases in information. Our work pins down computationally the generally impressive human ability to learn the trustworthiness of information sources while revealing minor fault lines when it comes to noisier environments and news sources with a slant.
Affiliation(s)
- Lion Schulz
  - Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- Eric Schulz
  - Max Planck Institute for Biological Cybernetics, Tübingen, Germany
  - Helmholtz Institute for Human-Centered AI, Helmholtz Munich, Munich, Germany
- Rahul Bhui
  - Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
  - Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Peter Dayan
  - Max Planck Institute for Biological Cybernetics, Tübingen, Germany
  - University of Tübingen, Tübingen, Germany
5
Simon D, Read SJ. Toward a General Framework of Biased Reasoning: Coherence-Based Reasoning. Perspectives on Psychological Science 2025; 20:421-459. [PMID: 37983541 DOI: 10.1177/17456916231204579] [Indexed: 11/22/2023]
Abstract
A considerable amount of experimental research has been devoted to uncovering biased forms of reasoning. Notwithstanding the richness and overall empirical soundness of the bias research, the field can be described as disjointed, incomplete, and undertheorized. In this article, we seek to address this disconnect by offering "coherence-based reasoning" as a parsimonious theoretical framework that explains a sizable number of important deviations from normative forms of reasoning. Represented in connectionist networks and processed through constraint-satisfaction processing, coherence-based reasoning serves as a ubiquitous, essential, and overwhelmingly adaptive apparatus in people's mental toolbox. This adaptive process, however, can readily be overrun by bias when the network is dominated by nodes or links that are incorrect, overweighted, or otherwise nonnormative. We apply this framework to explain a variety of well-established biased forms of reasoning, including confirmation bias, the halo effect, stereotype spillovers, hindsight bias, motivated reasoning, emotion-driven reasoning, ideological reasoning, and more.
Affiliation(s)
- Dan Simon
  - Gould School of Law, University of Southern California
  - Department of Psychology, University of Southern California
- Stephen J Read
  - Department of Psychology, University of Southern California
6
Sundh J, Millroth P, Collsiöö A, Juslin P. Enriching Psychological Research by Exploring the Source and Nature of Noise. Perspectives on Psychological Science 2025; 20:540-554. [PMID: 40035538 PMCID: PMC12065948 DOI: 10.1177/17456916241235889] [Indexed: 03/05/2025]
Abstract
In psychological research, noise is often considered a nuisance that obscures rather than contributes information. This simplification overlooks that noise can be informative and that by exploring the nature of the noise one can often draw additional conclusions concerning the underlying psychological processes. It is arguably only in recent years that the mainstream of researchers has taken this idea to heart and demonstrated that it can lead to breakthroughs in the understanding of human behavior. The aim of this special section is to showcase some of the ways in which systematic exploration of noise can be achieved and how it can enrich psychological research. In this introductory article, we introduce the idea of treating noise as endogenous as opposed to exogenous to the theoretical and statistical models of psychological phenomena. We then contribute a historical review of the role of noise in psychological research, including discussions of previous endogenous treatments of noise in the literature. As an illustration, we describe our own research on the precise/not precise model and show how noise distributions can be used to delineate analytic and intuitive modes of reasoning. Finally, we briefly introduce the other contributions to this special section.
7
Mitra A, Srinivasan N, Srivastava N. Present-Focused Behavior as a Rational Adaptation to Precarity. Open Mind (Camb) 2025; 9:452-474. [PMID: 40212053 PMCID: PMC11984791 DOI: 10.1162/opmi_a_00195] [Received: 08/06/2024] [Accepted: 02/08/2025] [Indexed: 04/13/2025] Open
Abstract
Inter-temporal impulsivity has been implicated in several theoretical explanations of the self-reinforcing nature of low socioeconomic status (SES). However, how exactly this interaction transpires is yet to be identified. We hypothesize that impulsivity arises from planning failures due to unpredictable resource demands, and people learn to adapt to this by being present-focused. We tested this hypothesis across three studies using a novel paradigm in which participants used a farming simulator and chose crops with different risk and time preferences. We found that participants' revealed time preferences adaptively shortened when they faced resource shocks and expanded in the absence of such shocks. We also found greater shrinkage of temporal horizons when these shocks were unpredictable rather than predictable. Our work shows that irrationality need not be invoked to explain the occurrence of present-bias in low SES individuals, and that such behavior may simply be a rational adaptation to the environmental demands of planning under precarity.
Affiliation(s)
- Arjun Mitra
  - Department of Cognitive Science, Indian Institute of Technology Kanpur and Centre of Behavioral and Cognitive Sciences, University of Allahabad
- Narayanan Srinivasan
  - Department of Cognitive Science, Indian Institute of Technology Kanpur and Centre of Behavioral and Cognitive Sciences, University of Allahabad
- Nisheeth Srivastava
  - Departments of Cognitive Science and Computer Science & Engineering, Indian Institute of Technology Kanpur
8
Eluchans M, Lancia GL, Maselli A, D’Alessandro M, Gordon JR, Pezzulo G. Adaptive planning depth in human problem-solving. Royal Society Open Science 2025; 12:241161. [PMID: 40206860 PMCID: PMC11978448 DOI: 10.1098/rsos.241161] [Received: 07/13/2024] [Revised: 12/20/2024] [Accepted: 03/05/2025] [Indexed: 04/11/2025]
Abstract
We humans are capable of solving challenging planning problems, but the range of adaptive strategies that we use to address them is not yet fully characterized. Here, we designed a series of problem-solving tasks that require planning at different depths. After systematically comparing the performance of participants and planning models, we found that when facing problems that require planning to a certain number of subgoals (from 1 to 8), participants make adaptive use of their cognitive resources: namely, they tend to select an initial plan having the minimum required depth, rather than selecting the same depth for all problems. These results support the view of problem-solving as a bounded rational process, which adapts costly cognitive resources to task demands.
Affiliation(s)
- Mattia Eluchans
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
  - Sapienza University of Rome, Roma, Lazio, Italy
- Gian Luca Lancia
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
  - Sapienza University of Rome, Roma, Lazio, Italy
- Antonella Maselli
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
  - Department of Biomedical and Dental Sciences and Morphofunctional Imaging, University of Messina, Messina, Italy
- Marco D’Alessandro
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
- Giovanni Pezzulo
  - Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
9
Harantová V, Mazanec J. Generation Z's Shopping Behavior in Second-Hand Brick-and-Mortar Stores: Emotions, Gender Dynamics, and Environmental Awareness. Behav Sci (Basel) 2025; 15:413. [PMID: 40282035 PMCID: PMC12023958 DOI: 10.3390/bs15040413] [Received: 02/02/2025] [Revised: 03/13/2025] [Accepted: 03/19/2025] [Indexed: 04/29/2025] Open
Abstract
This study investigates the shopping behavior of Generation Z towards second-hand clothing in Slovakia, focusing on in-store experiences and their relationship with emotions, gender, and environmental awareness. Data were collected from 340 respondents through an online survey conducted between November 2024 and January 2025. The results indicate that feelings such as authenticity, fun, and interest in finding fashionable items are significantly associated with gender. Across all five dimensions, women perceive second-hand clothing shopping more positively than men. The biggest difference between the sexes is that women find this shopping more fun, enjoyable, and authentic. Men tend to be slightly more skeptical in their evaluation, with the lowest average score (2.65) on the question of whether shopping is "fun". The study also reveals a strong correlation between the shopping experience and consumer attitudes. Individuals with prior experience in buying second-hand clothing exhibit greater environmental awareness, a stronger emotional connection with clothing, and a higher likelihood of participating in clothing swap events. Conversely, those without experience often harbor prejudices related to hygiene and perceive second-hand shopping as time-consuming and inconvenient. These findings highlight the importance of in-store experiences and the role of emotions in shaping consumer behavior towards second-hand clothing. The results have implications for retailers and policymakers seeking to promote sustainable consumption practices and enhance the appeal of the second-hand clothing market.
Affiliation(s)
- Jaroslav Mazanec
  - Department of Quantitative Methods and Economic Informatics, Faculty of Operation and Economics of Transport and Communications, University of Zilina, 010 26 Zilina, Slovakia
10
Bruckner R, Heekeren HR, Nassar MR. Understanding learning through uncertainty and bias. Communications Psychology 2025; 3:24. [PMID: 39948273 PMCID: PMC11825852 DOI: 10.1038/s44271-025-00203-y] [Received: 10/15/2024] [Accepted: 01/28/2025] [Indexed: 02/16/2025]
Abstract
Learning allows humans and other animals to make predictions about the environment that facilitate adaptive behavior. Casting learning as predictive inference can shed light on normative cognitive mechanisms that improve predictions under uncertainty. Drawing on normative learning models, we illustrate how learning should be adjusted to different sources of uncertainty, including perceptual uncertainty, risk, and uncertainty due to environmental changes. Such models explain many hallmarks of human learning in terms of specific statistical considerations that come into play when updating predictions under uncertainty. However, humans also display systematic learning biases that deviate from normative models, as studied in computational psychiatry. Some biases can be explained as normative inference conditioned on inaccurate prior assumptions about the environment, while others reflect approximations to Bayesian inference aimed at reducing cognitive demands. These biases offer insights into cognitive mechanisms underlying learning and how they might go awry in psychiatric illness.
Affiliation(s)
- Rasmus Bruckner
  - Department of Education and Psychology, Freie Universität Berlin, Berlin, Germany
  - Institute of Psychology, University of Hamburg, Hamburg, Germany
- Hauke R Heekeren
  - Department of Education and Psychology, Freie Universität Berlin, Berlin, Germany
  - Executive University Board, University of Hamburg, Hamburg, Germany
- Matthew R Nassar
  - Robert J. & Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI, USA
  - Department of Neuroscience, Brown University, Providence, RI, USA
11
Bari BA, Gershman SJ. The Value of Non-Instrumental Information in Anxiety: Insights from a Resource-Rational Model of Planning. Computational Psychiatry (Cambridge, Mass.) 2025; 9:63-75. [PMID: 39959564 PMCID: PMC11827562 DOI: 10.5334/cpsy.124] [Received: 05/29/2024] [Accepted: 01/19/2025] [Indexed: 02/18/2025]
Abstract
Anxiety is intimately related to the desire for information and, under some accounts, thought to arise from the intolerance of uncertainty. Here, we seek to test this hypothesis by studying the relationship between trait anxiety and the willingness to pay for non-instrumental information (i.e., information that reveals whether an event will happen but cannot be used to change the outcome). We model behavior with a resource-rational model of planning, according to which non-instrumental information is useful for planning ahead, but paying for this information only makes sense if the anticipated benefits of planning outweigh the cognitive and financial costs. We find a bidirectional effect of trait anxiety factors on information seeking behavior: those with high trait somatic anxiety exhibit a stronger preference for non-instrumental information, whereas those with high trait negative affect exhibit a weaker preference. By fitting the resource-rational model, we find that this divergent desire for information arises from the utility of obtaining information for future planning (increased in somatic anxiety, decreased in negative affect). Our findings lend support to the intolerance of uncertainty hypothesis in somatic anxiety and highlight the importance of studying anxiety as a multifactorial construct.
Affiliation(s)
- Bilal A. Bari
  - Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
  - McLean Hospital, Harvard Medical School, Belmont, MA, USA
- Samuel J. Gershman
  - Department of Psychology and Center for Brain Science, Harvard University, Cambridge, MA, USA
  - Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, MA, USA
12
Lu YL, Lu YF, Ren X, Zhang H. Exploring the bounded rationality in human decision anomalies through an assemblable computational framework. Cogn Psychol 2025; 156:101713. [PMID: 39813936 DOI: 10.1016/j.cogpsych.2025.101713] [Received: 04/23/2024] [Revised: 12/18/2024] [Accepted: 01/06/2025] [Indexed: 01/18/2025]
Abstract
Some seemingly irrational decision behaviors (anomalies), once seen as flaws in human cognition, have recently received explanations from a rational perspective. The basic idea is that the brain has limited cognitive resources to process the quantities (e.g., value, probability, time, etc.) required for decision making, with specific biases arising as byproducts of the resource allocation that is optimized for the environment. While appealing for providing normative accounts, the existing resource-rational models have limitations such as inconsistent assumptions across models, a focus on optimization for one specific aspect of the environment, and limited coverage of decision anomalies. One challenging anomaly is the peanuts effect, a pervasive phenomenon in decision-making under risk that implies an interdependence between the processing of value and probability. To extend the resource rationality approach to explain the peanuts effect, here we develop a computational framework, the Assemblable Resource-Rational Modules (ARRM), that integrates ideas from different lines of boundedly-rational decision models as freely assembled modules. The framework can accommodate the joint functioning of multiple environmental factors, and allow new models to be built and tested along with the existing ones, potentially opening a wider range of decision phenomena to bounded rationality modeling. For one new and three published datasets that cover two different task paradigms and both the gain and loss domains, our boundedly-rational models reproduce two characteristic features of the peanuts effect and outperform previous models in fitting human decision behaviors.
Affiliation(s)
- Yi-Long Lu
  - School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, China
- Yang-Fan Lu
  - Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- Xiangjuan Ren
  - School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China; PKU-IDG/McGovern Institute for Brain Research, Peking University, Beijing, China; Max Planck Research Group NeuroCode, Max Planck Institute for Human Development, Berlin, Germany
- Hang Zhang
  - School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China; PKU-IDG/McGovern Institute for Brain Research, Peking University, Beijing, China; State Key Laboratory of General Artificial Intelligence, Peking University, Beijing, China; Chinese Institute for Brain Research, Beijing, China
13
Butz MV, Mittenbühler M, Schwöbel S, Achimova A, Gumbsch C, Otte S, Kiebel S. Contextualizing predictive minds. Neurosci Biobehav Rev 2025; 168:105948. [PMID: 39580009 DOI: 10.1016/j.neubiorev.2024.105948] [Received: 09/15/2023] [Revised: 09/13/2024] [Accepted: 11/16/2024] [Indexed: 11/25/2024]
Abstract
The structure of human memory seems to be optimized for efficient prediction, planning, and behavior. We propose that these capacities rely on a tripartite structure of memory that includes concepts, events, and contexts: three layers that constitute the mental world model. We suggest that the mechanism that critically increases adaptivity and flexibility is the tendency to contextualize. This tendency promotes local, context-encoding abstractions, which focus event- and concept-based planning and inference processes on the task and situation at hand. As a result, cognitive contextualization offers a solution to the frame problem, that is, the need to select relevant features of the environment from the rich stream of sensorimotor signals. We draw evidence for our proposal from developmental psychology and neuroscience. Adopting a computational stance, we present evidence from cognitive modeling research which suggests that context sensitivity is a feature that is critical for maximizing the efficiency of cognitive processes. Finally, we turn to recent deep-learning architectures which independently demonstrate how context-sensitive memory can emerge in a self-organized learning system constrained by cognitively-inspired inductive biases.
Affiliation(s)
- Martin V Butz
  - Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
- Maximilian Mittenbühler
  - Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
- Sarah Schwöbel
  - Cognitive Computational Neuroscience, Faculty of Psychology, TU Dresden, School of Science, Dresden 01062, Germany
- Asya Achimova
  - Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
- Christian Gumbsch
  - Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany; Chair of Cognitive and Clinical Neuroscience, Faculty of Psychology, TU Dresden, Dresden 01069, Germany
- Sebastian Otte
  - Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany; Adaptive AI Lab, Institute of Robotics and Cognitive Systems, University of Lübeck, Ratzeburger Allee 160, Lübeck 23562, Germany
- Stefan Kiebel
  - Cognitive Computational Neuroscience, Faculty of Psychology, TU Dresden, School of Science, Dresden 01062, Germany
14
Schmid G, Gottwald S, Braun DA. Bounded Rational Decision Networks With Belief Propagation. Neural Comput 2024; 37:76-127. [PMID: 39383021 DOI: 10.1162/neco_a_01719] [Received: 12/22/2023] [Accepted: 07/08/2024] [Indexed: 10/11/2024]
Abstract
Complex information processing systems that are capable of a wide variety of tasks, such as the human brain, are composed of specialized units that collaborate and communicate with each other. An important property of such information processing networks is locality: there is no single global unit controlling the modules, but information is exchanged locally. Here, we consider a decision-theoretic approach to study networks of bounded rational decision makers that are allowed to specialize and communicate with each other. In contrast to previous work that has focused on feedforward communication between decision-making agents, we consider cyclical information processing paths allowing for back-and-forth communication. We adapt message-passing algorithms to suit this purpose, essentially allowing for local information flow between units and thus enabling circular dependency structures. We provide examples that show how repeated communication can increase performance given that each unit's information processing capability is limited and that decision-making systems with too few or too many connections and feedback loops achieve suboptimal utility.
Affiliation(s)
- Gerrit Schmid
- Ulm University Institute of Neuroinformatics, 89081 Ulm, Germany
| | | | - Daniel A Braun
- Ulm University Institute of Neuroinformatics, 89081 Ulm, Germany
| |
|
15
|
Orticio E, Meyer M, Kidd C. Exposure to detectable inaccuracies makes children more diligent fact-checkers of novel claims. Nat Hum Behav 2024; 8:2322-2329. [PMID: 39390098 DOI: 10.1038/s41562-024-01992-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/25/2023] [Accepted: 08/25/2024] [Indexed: 10/12/2024]
Abstract
How do children decide when to believe a claim? Here we show that children fact-check claims more and are better able to catch misinformation when they have been exposed to detectable inaccuracies. In two experiments (N = 122), 4-7-year-old children exposed to falsity (as opposed to all true information) sampled more evidence before verifying a test claim in a novel domain. Children's evidentiary standards were graded: fact-checking increased with higher proportions of false statements heard during exposure. A simulation suggests that children's behaviour is adaptive, because increased fact-checking in more dubious environments supports the discovery of potential misinformation. Importantly, children were least diligent at fact-checking a new claim when all prior information was true, suggesting that sanitizing children's informational environments may inadvertently dampen their natural scepticism. Instead, these findings support the counterintuitive possibility that exposing children to some nonsense may scaffold vigilance towards more subtle misinformation in the future.
Affiliation(s)
- Evan Orticio
- Department of Psychology, University of California, Berkeley, Berkeley, CA, USA
| | - Martin Meyer
- Department of Psychology, University of California, Berkeley, Berkeley, CA, USA
- Department of Psychology, Yale University, New Haven, CT, USA
| | - Celeste Kidd
- Department of Psychology, University of California, Berkeley, Berkeley, CA, USA
| |
|
16
|
Witt A, Toyokawa W, Lala KN, Gaissmaier W, Wu CM. Humans flexibly integrate social information despite interindividual differences in reward. Proc Natl Acad Sci U S A 2024; 121:e2404928121. [PMID: 39302964 PMCID: PMC11441569 DOI: 10.1073/pnas.2404928121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/14/2024] [Accepted: 08/19/2024] [Indexed: 09/22/2024] Open
Abstract
There has been much progress in understanding human social learning, including recent studies integrating social information into the reinforcement learning framework. Yet previous studies often assume identical payoffs between observer and demonstrator, overlooking the diversity of social information in real-world interactions. We address this gap by introducing a socially correlated bandit task that accommodates payoff differences among participants, allowing for the study of social learning under more realistic conditions. Our Social Generalization (SG) model, tested through evolutionary simulations and two online experiments, outperforms existing models by incorporating social information into the generalization process, but treating it as noisier than individual observations. Our findings suggest that human social learning is more flexible than previously believed, with the SG model indicating a potential resource-rational trade-off where social learning partially replaces individual exploration. This research highlights the flexibility of humans' social learning, allowing us to integrate social information from others with different preferences, skills, or goals.
Affiliation(s)
- Alexandra Witt
- Human and Machine Cognition Lab, University of Tübingen, Tübingen 72074, Germany
| | - Wataru Toyokawa
- Social Psychology and Decision Sciences, Department of Psychology, University of Konstanz, Konstanz 78464, Germany
- Computational Group Dynamics Unit, RIKEN Center for Brain Science, RIKEN, Wako 351-0198, Japan
| | - Kevin N. Lala
- School of Biology, University of St Andrews, St Andrews KY16 9AJ, United Kingdom
| | - Wolfgang Gaissmaier
- Social Psychology and Decision Sciences, Department of Psychology, University of Konstanz, Konstanz 78464, Germany
| | - Charley M. Wu
- Human and Machine Cognition Lab, University of Tübingen, Tübingen 72074, Germany
| |
|
17
|
Alon N, Schulz L, Bell V, Moutoussis M, Dayan P, Barnby JM. (Mal)adaptive Mentalizing in the Cognitive Hierarchy, and Its Link to Paranoia. Computational Psychiatry (Cambridge, Mass.) 2024; 8:159-177. [PMID: 39280241 PMCID: PMC11396085 DOI: 10.5334/cpsy.117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/20/2024] [Accepted: 08/29/2024] [Indexed: 09/18/2024]
Abstract
Humans need to be on their toes when interacting with competitive others to avoid being taken advantage of. Too much caution out of context can, however, be detrimental and produce false beliefs of intended harm. Here, we offer a formal account of this phenomenon through the lens of Theory of Mind. We simulate agents of different depths of mentalizing within a simple game theoretic paradigm and show how, if aligned well, deep recursive mentalization gives rise to both successful deception as well as reasonable skepticism. However, we also show that if a self is mentalizing too deeply - hyper-mentalizing - false beliefs arise that a partner is trying to trick them maliciously, resulting in a material loss to the self. Importantly, we show that this is only true when hypermentalizing agents believe observed actions are generated intentionally. This theory offers a potential cognitive mechanism for suspiciousness, paranoia, and conspiratorial ideation. Rather than a deficit in Theory of Mind, paranoia may arise from the application of overly strategic thinking to ingenuous behaviour.
Author Summary
Interacting competitively requires vigilance to avoid deception. However, excessive caution can have adverse effects, stemming from false beliefs of intentional harm. So far there is no formal cognitive account of what may cause this suspiciousness. Here we present an examination of this phenomenon through the lens of Theory of Mind - the cognitive ability to consider the beliefs, intentions, and desires of others. By simulating interacting computer agents we illustrate how well-aligned agents can give rise to successful deception and justified skepticism. Crucially, we also reveal that overly cautious agents develop false beliefs that an ingenuous partner is attempting malicious trickery, leading to tangible losses. As well as formally defining a plausible mechanism for suspiciousness, paranoia, and conspiratorial thinking, our theory indicates that rather than a deficit in Theory of Mind, paranoia may involve an over-application of strategy to genuine behaviour.
Affiliation(s)
- Nitay Alon
- Department of Computer Science, The Hebrew University of Jerusalem, Jerusalem, Israel
- Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| | - Lion Schulz
- Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| | - Vaughan Bell
- Clinical, Educational, and Health Psychology, University College London, United Kingdom
| | - Michael Moutoussis
- Department of Imaging Neuroscience, University College London, London, United Kingdom
| | - Peter Dayan
- Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Joseph M Barnby
- Department of Psychology, Royal Holloway University of London, London, United Kingdom
- School of Psychiatry and Clinical Neuroscience, The University of Western Australia, Australia
| |
|
18
|
Schütt HH, Kim D, Ma WJ. Reward prediction error neurons implement an efficient code for reward. Nat Neurosci 2024; 27:1333-1339. [PMID: 38898182 DOI: 10.1038/s41593-024-01671-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2022] [Accepted: 04/29/2024] [Indexed: 06/21/2024]
Abstract
We use efficient coding principles borrowed from sensory neuroscience to derive the optimal neural population to encode a reward distribution. We show that the responses of dopaminergic reward prediction error neurons in mouse and macaque are similar to those of the efficient code in the following ways: the neurons have a broad distribution of midpoints covering the reward distribution; neurons with higher thresholds have higher gains, more convex tuning functions and lower slopes; and their slope is higher when the reward distribution is narrower. Furthermore, we derive learning rules that converge to the efficient code. The learning rule for the position of the neuron on the reward axis closely resembles distributional reinforcement learning. Thus, reward prediction error neuron responses may be optimized to broadcast an efficient reward signal, forming a connection between efficient coding and reinforcement learning, two of the most successful theories in computational neuroscience.
Affiliation(s)
- Heiko H Schütt
- Center for Neural Science and Department of Psychology, New York University, New York, NY, USA
- Department of Behavioural and Cognitive Sciences, Université du Luxembourg, Esch-Belval, Luxembourg
| | - Dongjae Kim
- Center for Neural Science and Department of Psychology, New York University, New York, NY, USA
- Department of AI-Based Convergence, Dankook University, Yongin, Republic of Korea
| | - Wei Ji Ma
- Center for Neural Science and Department of Psychology, New York University, New York, NY, USA
| |
|
19
|
Mason A, Ludvig EA, Spetch ML, Madan CR. Rare and extreme outcomes in risky choice. Psychon Bull Rev 2024; 31:1301-1308. [PMID: 37973763 PMCID: PMC11192811 DOI: 10.3758/s13423-023-02415-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 10/21/2023] [Indexed: 11/19/2023]
Abstract
Many real-world decisions involving rare events also involve extreme outcomes. Despite this confluence, decisions-from-experience research has only examined the impact of rarity and extremity in isolation. With rare events, people typically choose as if they underestimate the probability of a rare outcome happening. Separately, people typically overestimate the probability of an extreme outcome happening. Here, for the first time, we examine the confluence of these two biases in decisions-from-experience. In a between-groups behavioural experiment, we examine people's risk preferences for rare extreme outcomes and for rare non-extreme outcomes. When outcomes are both rare and extreme, people's risk preferences shift away from traditional risk patterns for rare events: they show reduced underweighting for events that are both rare and extreme. We simulate these results using a small-sample model of decision-making that accounts for both the underweighting of rare events and the overweighting of extreme events. These separable influences on risk preferences suggest that to understand real-world risk for rare events we must also consider the extremity of the outcomes.
Affiliation(s)
- Alice Mason
- Department of Psychology, University of Bath, Bath, United Kingdom
- Department of Psychology, University of Warwick, Coventry, UK
| | - Elliot A Ludvig
- Department of Psychology, University of Warwick, Coventry, UK
| | - Marcia L Spetch
- Department of Psychology, University of Alberta, Edmonton, Alberta, Canada
| | | |
|
20
|
Malloy T, Gonzalez C. Applying Generative Artificial Intelligence to cognitive models of decision making. Front Psychol 2024; 15:1387948. [PMID: 38765837 PMCID: PMC11100990 DOI: 10.3389/fpsyg.2024.1387948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2024] [Accepted: 04/12/2024] [Indexed: 05/22/2024] Open
Abstract
Introduction: Generative Artificial Intelligence has made significant impacts in many fields, including computational cognitive modeling of decision making, although these applications have not yet been theoretically related to each other. This work introduces a categorization of applications of Generative Artificial Intelligence to cognitive models of decision making. Methods: This categorization is used to compare the existing literature and to provide insight into the design of an ablation study to evaluate our proposed model in three experimental paradigms. These experiments used for model comparison involve modeling human learning and decision making based on both visual information and natural language, in tasks that vary in realism and complexity. This comparison of applications takes as its basis Instance-Based Learning Theory, a theory of experiential decision making from which many models have emerged and been applied to a variety of domains and applications. Results: The best performing model from the ablation we performed used a generative model to both create memory representations as well as predict participant actions. The results of this comparison demonstrate the importance of generative models in both forming memories and predicting actions in decision-modeling research. Discussion: In this work, we present a model that integrates generative and cognitive models, using a variety of stimuli, applications, and training methods. These results can provide guidelines for cognitive modelers and decision making researchers interested in integrating Generative AI into their methods.
Affiliation(s)
- Tyler Malloy
- Dynamic Decision Making Laboratory, Department of Social and Decision Sciences, Dietrich College, Carnegie Mellon University, Pittsburgh, PA, United States
| | | |
|
21
|
Arumugam D, Ho MK, Goodman ND, Van Roy B. Bayesian Reinforcement Learning With Limited Cognitive Load. Open Mind (Camb) 2024; 8:395-438. [PMID: 38665544 PMCID: PMC11045037 DOI: 10.1162/opmi_a_00132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2023] [Accepted: 02/16/2024] [Indexed: 04/28/2024] Open
Abstract
All biological and artificial agents must act given limits on their ability to acquire and process information. As such, a general theory of adaptive behavior should be able to account for the complex interactions between an agent's learning history, decisions, and capacity constraints. Recent work in computer science has begun to clarify the principles that shape these dynamics by bridging ideas from reinforcement learning, Bayesian decision-making, and rate-distortion theory. This body of work provides an account of capacity-limited Bayesian reinforcement learning, a unifying normative framework for modeling the effect of processing constraints on learning and action selection. Here, we provide an accessible review of recent algorithms and theoretical results in this setting, paying special attention to how these ideas can be applied to studying questions in the cognitive and behavioral sciences.
Affiliation(s)
| | - Mark K. Ho
- Center for Data Science, New York University
| | - Noah D. Goodman
- Department of Computer Science, Stanford University
- Department of Psychology, Stanford University
| | - Benjamin Van Roy
- Department of Electrical Engineering, Stanford University
- Department of Management Science & Engineering, Stanford University
| |
|
22
|
Lai L, Gershman SJ. Human decision making balances reward maximization and policy compression. PLoS Comput Biol 2024; 20:e1012057. [PMID: 38669280 PMCID: PMC11078408 DOI: 10.1371/journal.pcbi.1012057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/27/2023] [Revised: 05/08/2024] [Accepted: 04/08/2024] [Indexed: 04/28/2024] Open
Abstract
Policy compression is a computational framework that describes how capacity-limited agents trade reward for simpler action policies to reduce cognitive cost. In this study, we present behavioral evidence that humans prefer simpler policies, as predicted by a capacity-limited reinforcement learning model. Across a set of tasks, we find that people exploit structure in the relationships between states, actions, and rewards to "compress" their policies. In particular, compressed policies are systematically biased towards actions with high marginal probability, thereby discarding some state information. This bias is greater when there is redundancy in the reward-maximizing action policy across states, and increases with memory load. These results could not be explained qualitatively or quantitatively by models that did not make use of policy compression under a capacity limit. We also confirmed the prediction that time pressure should further reduce policy complexity and increase action bias, based on the hypothesis that actions are selected via time-dependent decoding of a compressed code. These findings contribute to a deeper understanding of how humans adapt their decision-making strategies under cognitive resource constraints.
Affiliation(s)
- Lucy Lai
- Program in Neuroscience, Harvard University, Cambridge, Massachusetts, United States of America
- Theoretical Sciences Visiting Program, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Samuel J. Gershman
- Department of Psychology and Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
| |
|
23
|
Schulz L, Bhui R. Political reinforcement learners. Trends Cogn Sci 2024; 28:210-222. [PMID: 38195364 DOI: 10.1016/j.tics.2023.12.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Revised: 12/09/2023] [Accepted: 12/11/2023] [Indexed: 01/11/2024]
Abstract
Politics can seem home to the most calculating and yet least rational elements of humanity. How might we systematically characterize this spectrum of political cognition? Here, we propose reinforcement learning (RL) as a unified framework to dissect the political mind. RL describes how agents algorithmically navigate complex and uncertain domains like politics. Through this computational lens, we outline three routes to political differences, stemming from variability in agents' conceptions of a problem, the cognitive operations applied to solve the problem, or the backdrop of information available from the environment. A computational vantage on maladies of the political mind offers enhanced precision in assessing their causes, consequences, and cures.
Affiliation(s)
- Lion Schulz
- Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max-Planck-Ring 8-14, 72076 Tübingen, Germany
| | - Rahul Bhui
- Sloan School of Management and Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
|
24
|
Colas JT, O’Doherty JP, Grafton ST. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts. PLoS Comput Biol 2024; 20:e1011950. [PMID: 38552190 PMCID: PMC10980507 DOI: 10.1371/journal.pcbi.1011950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Accepted: 02/26/2024] [Indexed: 04/01/2024] Open
Abstract
Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning. With systematic comparison and falsification of computational models, human choices were tested for signatures of parallel modules representing not only an enhanced form of generalized reinforcement learning but also action bias and hysteresis. We found evidence for substantial differences in bias and hysteresis across participants, even comparable in magnitude to the individual differences in learning. Individuals who did not learn well revealed the greatest biases, but those who did learn accurately were also significantly biased. The direction of hysteresis varied among individuals as repetition or, more commonly, alternation biases persisting from multiple previous actions. Considering that these actions were button presses with trivial motor demands, the idiosyncratic forces biasing sequences of action choices were robust enough to suggest ubiquity across individuals and across tasks requiring various actions. In light of how bias and hysteresis function as a heuristic for efficient control that adapts to uncertainty or low motivation by minimizing the cost of effort, these phenomena broaden the consilient theory of a mixture of experts to encompass a mixture of expert and nonexpert controllers of behavior.
Affiliation(s)
- Jaron T. Colas
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
| | - John P. O’Doherty
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
| | - Scott T. Grafton
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
| |
|
25
|
Polanía R, Burdakov D, Hare TA. Rationality, preferences, and emotions with biological constraints: it all starts from our senses. Trends Cogn Sci 2024; 28:264-277. [PMID: 38341322 DOI: 10.1016/j.tics.2024.01.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 12/28/2022] [Revised: 01/10/2024] [Accepted: 01/11/2024] [Indexed: 02/12/2024]
Abstract
Is the role of our sensory systems to represent the physical world as accurately as possible? If so, are our preferences and emotions, often deemed irrational, decoupled from these 'ground-truth' sensory experiences? We show why the answer to both questions is 'no'. Brain function is metabolically costly, and the brain loses some fraction of the information that it encodes and transmits. Therefore, if brains maximize objective functions that increase the fitness of their species, they should adapt to the objective-maximizing rules of the environment at the earliest stages of sensory processing. Consequently, observed 'irrationalities', preferences, and emotions stem from the necessity for our early sensory systems to adapt and process information while considering the metabolic costs and internal states of the organism.
Affiliation(s)
- Rafael Polanía
- Decision Neuroscience Laboratory, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
| | - Denis Burdakov
- Neurobehavioral Dynamics Laboratory, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
| | - Todd A Hare
- Zurich Center for Neuroeconomics, Department of Economics, University of Zurich, Zurich, Switzerland
| |
|
26
|
Olschewski S, Scheibehenne B. What's in a sample? Epistemic uncertainty and metacognitive awareness in risk taking. Cogn Psychol 2024; 149:101642. [PMID: 38401485 DOI: 10.1016/j.cogpsych.2024.101642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2023] [Revised: 02/01/2024] [Accepted: 02/13/2024] [Indexed: 02/26/2024]
Abstract
In a fundamentally uncertain world, sound information processing is a prerequisite for effective behavior. Given that information processing is subject to inevitable cognitive imprecision, decision makers should adapt to this imprecision and to the resulting epistemic uncertainty when taking risks. We tested this metacognitive ability in two experiments in which participants estimated the expected value of different number distributions from sequential samples and then bet on their own estimation accuracy. Results show that estimates were imprecise, and this imprecision increased with higher distributional standard deviations. Importantly, participants adapted their risk-taking behavior to this imprecision and hence deviated from the predictions of Bayesian models of uncertainty that assume perfect integration of information. To explain these results, we developed a computational model that combines Bayesian updating with a metacognitive awareness of cognitive imprecision in the integration of information. Modeling results were robust to the inclusion of an empirical measure of participants' perceived variability. In sum, we show that cognitive imprecision is crucial to understanding risk taking in decisions from experience. The results further demonstrate the importance of metacognitive awareness as a cognitive building block for adaptive behavior under (partial) uncertainty.
Affiliation(s)
- Sebastian Olschewski
- Department of Psychology, University of Basel, Switzerland; Warwick Business School, University of Warwick, United Kingdom.
| | | |
|
27
|
Cushman F. Computational Social Psychology. Annu Rev Psychol 2024; 75:625-652. [PMID: 37540891 DOI: 10.1146/annurev-psych-021323-040420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/06/2023]
Abstract
Social psychologists attempt to explain how we interact by appealing to basic principles of how we think. To make good on this ambition, they are increasingly relying on an interconnected set of formal tools that model inference, attribution, value-guided decision making, and multi-agent interactions. By reviewing progress in each of these areas and highlighting the connections between them, we can better appreciate the structure of social thought and behavior, while also coming to understand when, why, and how formal tools can be useful for social psychologists.
Affiliation(s)
- Fiery Cushman
- Department of Psychology, Harvard University, Cambridge, Massachusetts, USA
| |
|
28
|
Fernandez-Velasco P, Spiers HJ. Wayfinding across ocean and tundra: what traditional cultures teach us about navigation. Trends Cogn Sci 2024; 28:56-71. [PMID: 37798182 DOI: 10.1016/j.tics.2023.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/30/2023] [Revised: 09/15/2023] [Accepted: 09/15/2023] [Indexed: 10/07/2023]
Abstract
Research on human navigation by psychologists and neuroscientists has come mainly from a limited range of environments and participants inhabiting western countries. By contrast, numerous anthropological accounts illustrate the diverse ways in which cultures adapt to their surrounding environment to navigate. Here, we provide an overview of these studies and relate them to cognitive science research. The diversity of cues in traditional navigation is much higher and multimodal compared with navigation experiments in the laboratory. It typically involves an integrated system of methods, drawing on a detailed understanding of the environmental cues, specific tools, and forms part of a broader cultural system. We highlight recent methodological developments for measuring navigation skill and modelling behaviour that will aid future research into how culture and environment shape human navigation.
Affiliation(s)
- Pablo Fernandez-Velasco
- Institute of Behavioural Neuroscience, Department of Experimental Psychology, Division of Psychology and Language Sciences, University College London, London, UK; Department of Philosophy, University of York, York, UK.
| | - Hugo J Spiers
- Institute of Behavioural Neuroscience, Department of Experimental Psychology, Division of Psychology and Language Sciences, University College London, London, UK.
| |
|
29
|
Futrell R. An Information-Theoretic Account of Availability Effects in Language Production. Top Cogn Sci 2024; 16:38-53. [PMID: 38145974 DOI: 10.1111/tops.12716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/25/2023] [Revised: 11/30/2023] [Accepted: 12/01/2023] [Indexed: 12/27/2023]
Abstract
I present a computational-level model of language production in terms of a combination of information theory and control theory in which words are chosen incrementally in order to maximize communicative value subject to an information-theoretic capacity constraint. The theory generally predicts a tradeoff between ease of production and communicative accuracy. I apply the theory to two cases of apparent availability effects in language production, in which words are selected on the basis of their accessibility to a speaker who has not yet perfectly planned the rest of the utterance. Using corpus data on English relative clause complementizer dropping and experimental data on Mandarin noun classifier choice, I show that the theory reproduces the observed phenomena, providing an alternative account to Uniform Information Density and a promising general model of language production which is tightly linked to emerging theories in computational neuroscience.
Affiliation(s)
- Richard Futrell
- Department of Language Science, University of California, Irvine
| |
|
30
|
Giron AP, Ciranka S, Schulz E, van den Bos W, Ruggeri A, Meder B, Wu CM. Developmental changes in exploration resemble stochastic optimization. Nat Hum Behav 2023; 7:1955-1967. [PMID: 37591981 PMCID: PMC10663152 DOI: 10.1038/s41562-023-01662-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 11/11/2022] [Accepted: 06/21/2023] [Indexed: 08/19/2023]
Abstract
Human development is often described as a 'cooling off' process, analogous to stochastic optimization algorithms that implement a gradual reduction in randomness over time. Yet there is ambiguity in how to interpret this analogy, due to a lack of concrete empirical comparisons. Using data from n = 281 participants ages 5 to 55, we show that cooling off does not only apply to the single dimension of randomness. Rather, human development resembles an optimization process of multiple learning parameters, for example, reward generalization, uncertainty-directed exploration and random temperature. Rapid changes in parameters occur during childhood, but these changes plateau and converge to efficient values in adulthood. We show that while the developmental trajectory of human parameters is strikingly similar to several stochastic optimization algorithms, there are important differences in convergence. None of the optimization algorithms tested were able to discover reliably better regions of the strategy space than adult participants on this task.
Affiliation(s)
- Anna P Giron
- Human and Machine Cognition Lab, University of Tübingen, Tübingen, Germany
- Attention and Affect Lab, University of Tübingen, Tübingen, Germany
| | - Simon Ciranka
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, Berlin, Germany
| | - Eric Schulz
- MPRG Computational Principles of Intelligence, Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| | - Wouter van den Bos
- Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands
- Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, the Netherlands
| | - Azzurra Ruggeri
- MPRG iSearch, Max Planck Institute for Human Development, Berlin, Germany
- School of Social Sciences and Technology, Technical University Munich, Munich, Germany
- Central European University, Vienna, Austria
| | - Björn Meder
- MPRG iSearch, Max Planck Institute for Human Development, Berlin, Germany
- Institute for Mind, Brain and Behavior, Health and Medical University, Potsdam, Germany
| | - Charley M Wu
- Human and Machine Cognition Lab, University of Tübingen, Tübingen, Germany
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
| |
|
31
|
Chen S, Futrell R, Mahowald K. An information-theoretic approach to the typology of spatial demonstratives. Cognition 2023; 240:105505. [PMID: 37598582 DOI: 10.1016/j.cognition.2023.105505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/30/2022] [Revised: 05/26/2023] [Accepted: 05/28/2023] [Indexed: 08/22/2023]
Abstract
We explore systems of spatial deictic words (such as 'here' and 'there') from the perspective of communicative efficiency using typological data from over 200 languages (Nintemann et al., 2020). We argue from an information-theoretic perspective that spatial deictic systems balance informativity and complexity in the sense of the Information Bottleneck (Zaslavsky et al., 2018). We find that under an appropriate choice of cost function and need probability over meanings, among all the 21,146 theoretically possible spatial deictic systems, those adopted by real languages lie near an efficient frontier of informativity and complexity. Moreover, we find that the conditions that the need probability and the cost function need to satisfy for this result are consistent with the cognitive science literature on spatial cognition, especially regarding the source-goal asymmetry. We further show that the typological data are better explained by introducing a notion of consistency into the Information Bottleneck framework, which is jointly optimized along with informativity and complexity.
Affiliation(s)
- Sihan Chen
- Department of Brain and Cognitive Sciences, MIT, United States of America
| | - Richard Futrell
- Department of Language Science, University of California, Irvine, United States of America
| | - Kyle Mahowald
- Department of Linguistics, The University of Texas at Austin, United States of America
| |
|
32
|
Burrell M, Pastor-Bernier A, Schultz W. Worth the Work? Monkeys Discount Rewards by a Subjective Adapting Effort Cost. J Neurosci 2023; 43:6796-6806. [PMID: 37625854 PMCID: PMC10552939 DOI: 10.1523/jneurosci.0115-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/20/2023] [Revised: 05/25/2023] [Accepted: 05/30/2023] [Indexed: 08/27/2023] Open
Abstract
All life must solve how to allocate limited energy resources to maximize benefits from scarce opportunities. Economic theory posits decision makers optimize choice by maximizing the subjective benefit (utility) of reward minus the subjective cost (disutility) of the required effort. While successful in many settings, this model does not fully account for how experience can alter reward-effort trade-offs. Here, we test how well the subtractive model of effort disutility explains the behavior of two male nonhuman primates (Macaca mulatta) in a binary choice task in which reward quantity and physical effort to obtain were varied. Applying random utility modeling to independently estimate reward utility and effort disutility, we show the subtractive effort model better explains out-of-sample choice behavior when compared with parabolic and exponential effort discounting. Furthermore, we demonstrate that effort disutility depends on previous experience of effort: in analogy to work from behavioral labor economics, we develop a model of reference-dependent effort disutility to explain the increased willingness to expend effort following previous experience of effortful options in a session. The result of this analysis suggests that monkeys discount reward by an effort cost that is measured relative to an expected effort learned from previous trials. When this subjective cost of effort, a function of context and experience, is accounted for, trial-by-trial choices can be explained by the subtractive cost model of effort. Therefore, in searching for net utility signals that may underpin effort-based decision-making in the brain, careful measurement of subjective effort costs is an essential first step.
SIGNIFICANCE STATEMENT: All decision-makers need to consider how much effort they need to expend when evaluating potential options. Economic theories suggest that the optimal way to choose is by cost-benefit analysis of reward against effort. To be able to do this efficiently over many decision contexts, this needs to be done flexibly, with appropriate adaptation to context and experience. Therefore, in aiming to understand how this might be achieved in the brain, it is important to first carefully measure the subjective cost of effort. Here, we show monkeys make reward-effort cost-benefit decisions, subtracting the subjective cost of effort from the subjective value of rewards. Moreover, the subjective cost of effort is dependent on the monkeys' experience of effort in previous trials.
Affiliation(s)
- Mark Burrell
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Alexandre Pastor-Bernier
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| |
|
33
|
Futrell R. Information-theoretic principles in incremental language production. Proc Natl Acad Sci U S A 2023; 120:e2220593120. [PMID: 37725652 PMCID: PMC10523564 DOI: 10.1073/pnas.2220593120] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/06/2022] [Accepted: 07/22/2023] [Indexed: 09/21/2023] Open
Abstract
I apply a recently emerging perspective on the complexity of action selection, the rate-distortion theory of control, to provide a computational-level model of errors and difficulties in human language production, which is grounded in information theory and control theory. Language production is cast as the sequential selection of actions to achieve a communicative goal subject to a capacity constraint on cognitive control. In a series of calculations, simulations, corpus analyses, and comparisons to experimental data, I show that the model directly predicts some of the major known qualitative and quantitative phenomena in language production, including semantic interference and predictability effects in word choice; accessibility-based ("easy-first") production preferences in word order alternations; and the existence and distribution of disfluencies including filled pauses, corrections, and false starts. I connect the rate-distortion view to existing models of human language production, to probabilistic models of semantics and pragmatics, and to proposals for controlled language generation in the machine learning and reinforcement learning literature.
Affiliation(s)
- Richard Futrell
- Department of Language Science, University of California, Irvine, CA 92617
| |
|
34
|
Chu J, Schulz LE. Not Playing by the Rules: Exploratory Play, Rational Action, and Efficient Search. Open Mind (Camb) 2023; 7:294-317. [PMID: 37416069 PMCID: PMC10320825 DOI: 10.1162/opmi_a_00076] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/03/2022] [Accepted: 02/06/2023] [Indexed: 07/08/2023] Open
Abstract
Recent studies suggest children's exploratory play is consistent with formal accounts of rational learning. Here we focus on the tension between this view and a nearly ubiquitous feature of human play: In play, people subvert normal utility functions, incurring seemingly unnecessary costs to achieve arbitrary rewards. We show that four- and five-year-old children not only infer playful behavior from observed violations of rational action (Experiment 1), but themselves take on unnecessary costs during both retrieval (Experiment 2) and search (Experiments 3A-B) tasks, despite acting efficiently in non-playful, instrumental contexts. We discuss the value of such apparently utility-violating behavior and why it might serve learning in the long run.
Affiliation(s)
- Junyi Chu
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA
| | - Laura E. Schulz
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA
| |
|
35
|
Masís J, Chapman T, Rhee JY, Cox DD, Saxe AM. Strategically managing learning during perceptual decision making. eLife 2023; 12:e64978. [PMID: 36786427 PMCID: PMC9928425 DOI: 10.7554/elife.64978] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 11/18/2020] [Accepted: 01/15/2023] [Indexed: 02/15/2023] Open
Abstract
Making optimal decisions in the face of noise requires balancing short-term speed and accuracy. But a theory of optimality should account for the fact that short-term speed can influence long-term accuracy through learning. Here, we demonstrate that long-term learning is an important dynamical dimension of the speed-accuracy trade-off. We study learning trajectories in rats and formally characterize these dynamics in a theory expressed as both a recurrent neural network and an analytical extension of the drift-diffusion model that learns over time. The model reveals that choosing suboptimal response times to learn faster sacrifices immediate reward, but can lead to greater total reward. We empirically verify predictions of the theory, including a relationship between stimulus exposure and learning speed, and a modulation of reaction time by future learning prospects. We find that rats' strategies approximately maximize total reward over the full learning epoch, suggesting cognitive control over the learning process.
Affiliation(s)
- Javier Masís
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, United States
- Center for Brain Science, Harvard University, Cambridge, United States
| | - Travis Chapman
- Center for Brain Science, Harvard University, Cambridge, United States
| | - Juliana Y Rhee
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, United States
- Center for Brain Science, Harvard University, Cambridge, United States
| | - David D Cox
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, United States
- Center for Brain Science, Harvard University, Cambridge, United States
| | - Andrew M Saxe
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| |
|
36
|
Heald JB, Lengyel M, Wolpert DM. Contextual inference in learning and memory. Trends Cogn Sci 2023; 27:43-64. [PMID: 36435674 PMCID: PMC9789331 DOI: 10.1016/j.tics.2022.10.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Received: 05/24/2022] [Revised: 10/11/2022] [Accepted: 10/12/2022] [Indexed: 11/25/2022]
Abstract
Context is widely regarded as a major determinant of learning and memory across numerous domains, including classical and instrumental conditioning, episodic memory, economic decision-making, and motor learning. However, studies across these domains remain disconnected due to the lack of a unifying framework formalizing the concept of context and its role in learning. Here, we develop a unified vernacular allowing direct comparisons between different domains of contextual learning. This leads to a Bayesian model positing that context is unobserved and needs to be inferred. Contextual inference then controls the creation, expression, and updating of memories. This theoretical approach reveals two distinct components that underlie adaptation, proper and apparent learning, respectively referring to the creation and updating of memories versus time-varying adjustments in their expression. We review a number of extensions of the basic Bayesian model that allow it to account for increasingly complex forms of contextual learning.
Affiliation(s)
- James B Heald
- Department of Neuroscience, Columbia University, New York, NY 10027, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA.
| | - Máté Lengyel
- Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, UK; Center for Cognitive Computation, Department of Cognitive Science, Central European University, Budapest, Hungary.
| | - Daniel M Wolpert
- Department of Neuroscience, Columbia University, New York, NY 10027, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA; Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, UK.
| |
|
37
|
Lancia GL, Eluchans M, D’Alessandro M, Spiers HJ, Pezzulo G. Humans account for cognitive costs when finding shortcuts: An information-theoretic analysis of navigation. PLoS Comput Biol 2023; 19:e1010829. [PMID: 36608145 PMCID: PMC9851521 DOI: 10.1371/journal.pcbi.1010829] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 08/22/2022] [Revised: 01/19/2023] [Accepted: 12/19/2022] [Indexed: 01/09/2023] Open
Abstract
When faced with navigating back somewhere we have been before we might either retrace our steps or seek a shorter path. Both choices have costs. Here, we ask whether it is possible to characterize formally the choice of navigational plans as a bounded rational process that trades off the quality of the plan (e.g., its length) and the cognitive cost required to find and implement it. We analyze the navigation strategies of two groups of people that are firstly trained to follow a "default policy" taking a route in a virtual maze and then asked to navigate to various known goal destinations, either in the way they want ("Go To Goal") or by taking novel shortcuts ("Take Shortcut"). We address these wayfinding problems using InfoRL: an information-theoretic approach that formalizes the cognitive cost of devising a navigational plan, as the informational cost to deviate from a well-learned route (the "default policy"). In InfoRL, optimality refers to finding the best trade-off between route length and the amount of control information required to find it. We report five main findings. First, the navigational strategies automatically identified by InfoRL correspond closely to different routes (optimal or suboptimal) in the virtual reality map, which were annotated by hand in previous research. Second, people deliberate more in places where the value of investing cognitive resources (i.e., relevant goal information) is greater. Third, compared to the group of people who receive the "Go To Goal" instruction, those who receive the "Take Shortcut" instruction find shorter but less optimal solutions, reflecting the intrinsic difficulty of finding optimal shortcuts. Fourth, those who receive the "Go To Goal" instruction modulate flexibly their cognitive resources, depending on the benefits of finding the shortcut. Finally, we found a surprising amount of variability in the choice of navigational strategies and resource investment across participants. 
Taken together, these results illustrate the benefits of using InfoRL to address navigational planning problems from a bounded rational perspective.
Affiliation(s)
- Gian Luca Lancia
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
- University of Rome “La Sapienza”, Rome, Italy
| | - Mattia Eluchans
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
- University of Rome “La Sapienza”, Rome, Italy
| | - Marco D’Alessandro
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| | - Hugo J. Spiers
- Institute of Behavioural Neuroscience, Department of Experimental Psychology, Division of Psychology and Language Sciences, University College London, United Kingdom
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| |
|
38
|
Appelhoff S, Hertwig R, Spitzer B. EEG-representational geometries and psychometric distortions in approximate numerical judgment. PLoS Comput Biol 2022; 18:e1010747. [PMID: 36469506 PMCID: PMC9754589 DOI: 10.1371/journal.pcbi.1010747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/02/2022] [Revised: 12/15/2022] [Accepted: 11/18/2022] [Indexed: 12/12/2022] Open
Abstract
When judging the average value of sample stimuli (e.g., numbers) people tend to either over- or underweight extreme sample values, depending on task context. In a context of overweighting, recent work has shown that extreme sample values were overly represented also in neural signals, in terms of an anti-compressed geometry of number samples in multivariate electroencephalography (EEG) patterns. Here, we asked whether neural representational geometries may also reflect a relative underweighting of extreme values (i.e., compression) which has been observed behaviorally in a great variety of tasks. We used a simple experimental manipulation (instructions to average a single-stream or to compare dual-streams of samples) to induce compression or anti-compression in behavior when participants judged rapid number sequences. Model-based representational similarity analysis (RSA) replicated the previous finding of neural anti-compression in the dual-stream task, but failed to provide evidence for neural compression in the single-stream task, despite the evidence for compression in behavior. Instead, the results indicated enhanced neural processing of extreme values in either task, regardless of whether extremes were over- or underweighted in subsequent behavioral choice. We further observed more general differences in the neural representation of the sample information between the two tasks. Together, our results indicate a mismatch between sample-level EEG geometries and behavior, which raises new questions about the origin of common psychometric distortions, such as diminishing sensitivity for larger values.
Affiliation(s)
- Stefan Appelhoff
- Research Group Adaptive Memory and Decision Making, Max Planck Institute for Human Development, Berlin, Germany
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
- Max Planck Dahlem Campus of Cognition, Max Planck Institute for Human Development, Berlin, Germany
| | - Ralph Hertwig
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
| | - Bernhard Spitzer
- Research Group Adaptive Memory and Decision Making, Max Planck Institute for Human Development, Berlin, Germany
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
- Max Planck Dahlem Campus of Cognition, Max Planck Institute for Human Development, Berlin, Germany
| |
|
39
|
Éltető N, Nemeth D, Janacsek K, Dayan P. Tracking human skill learning with a hierarchical Bayesian sequence model. PLoS Comput Biol 2022; 18:e1009866. [PMID: 36449550 PMCID: PMC9744313 DOI: 10.1371/journal.pcbi.1009866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/26/2022] [Revised: 12/12/2022] [Accepted: 10/31/2022] [Indexed: 12/03/2022] Open
Abstract
Humans can implicitly learn complex perceptuo-motor skills over the course of large numbers of trials. This likely depends on our becoming better able to take advantage of ever richer and temporally deeper predictive relationships in the environment. Here, we offer a novel characterization of this process, fitting a non-parametric, hierarchical Bayesian sequence model to the reaction times of human participants' responses over ten sessions, each comprising thousands of trials, in a serial reaction time task involving higher-order dependencies. The model, adapted from the domain of language, forgetfully updates trial-by-trial, and seamlessly combines predictive information from shorter and longer windows onto past events, weighing the windows proportionally to their predictive power. As the model implies a posterior over window depths, we were able to determine how, and how many, previous sequence elements influenced individual participants' internal predictions, and how this changed with practice. Already in the first session, the model showed that participants had begun to rely on two previous elements (i.e., trigrams), thereby successfully adapting to the most prominent higher-order structure in the task. The extent to which local statistical fluctuations in trigram frequency influenced participants' responses waned over subsequent sessions, as participants forgot the trigrams less and evidenced skilled performance. By the eighth session, a subset of participants shifted their prior further to consider a context deeper than two previous elements. Finally, participants showed resistance to interference and slow forgetting of the old sequence when it was changed in the final sessions. Model parameters for individual participants covaried appropriately with independent measures of working memory and error characteristics. 
In sum, the model offers the first principled account of the adaptive complexity and nuanced dynamics of humans' internal sequence representations during long-term implicit skill learning.
Affiliation(s)
- Noémi Éltető
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| | - Dezső Nemeth
- Lyon Neuroscience Research Center, Université de Lyon, Lyon, France
- Institute of Psychology, ELTE Eötvös Loránd University, Budapest, Hungary
- Institute of Cognitive Neuroscience and Psychology, Research Centre for Natural Sciences, Budapest, Hungary
| | - Karolina Janacsek
- Institute of Psychology, ELTE Eötvös Loránd University, Budapest, Hungary
- Centre for Thinking and Learning, Institute for Lifecourse Development, University of Greenwich, London, United Kingdom
| | - Peter Dayan
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- University of Tübingen, Tübingen, Germany
| |
|
40
|
Schach S, Lindner A, Braun DA. Bounded rational decision-making models suggest capacity-limited concurrent motor planning in human posterior parietal and frontal cortex. PLoS Comput Biol 2022; 18:e1010585. [PMID: 36227842 PMCID: PMC9560147 DOI: 10.1371/journal.pcbi.1010585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/12/2022] [Accepted: 09/18/2022] [Indexed: 11/05/2022] Open
Abstract
While traditional theories of sensorimotor processing have often assumed a serial decision-making pipeline, more recent approaches have suggested that multiple actions may be planned concurrently and vie for execution. Evidence for the latter almost exclusively stems from electrophysiological studies in posterior parietal and premotor cortex of monkeys. Here we study concurrent prospective motor planning in humans by recording functional magnetic resonance imaging (fMRI) during a delayed response task engaging movement sequences towards multiple potential targets. We find that also in human posterior parietal and premotor cortex delay activity modulates both with sequence complexity and the number of potential targets. We tested the hypothesis that this modulation is best explained by concurrent prospective planning as opposed to the mere maintenance of potential targets in memory. We devise a bounded rationality model with information constraints that optimally assigns information resources for planning and memory for this task and determine predicted information profiles according to the two hypotheses. When regressing delay activity on these model predictions, we find that the concurrent prospective planning strategy provides a significantly better explanation of the fMRI-signal modulations. Moreover, we find that concurrent prospective planning is more costly and thus limited for most subjects, as expressed by the best fitting information capacities. We conclude that bounded rational decision-making models allow relating both behavior and neural representations to utilitarian task descriptions based on bounded optimal information-processing assumptions.
When the future is uncertain, it can be beneficial to concurrently plan several action possibilities in advance. Electrophysiological research found evidence in monkeys that brain regions in posterior parietal and premotor cortex are indeed capable of planning several actions in parallel. We now used fMRI to study brain activity in these brain regions in humans. For our analyses we applied bounded rationality models that optimally assign information resources to fMRI activity in a complex motor planning task. We find that theoretical information costs of concurrent prospective planning explained fMRI activity profiles significantly better than assuming alternative memory-based strategies. Moreover, exploiting the model allowed us to quantify the individual capacity limit for concurrent planning and to relate these individual limits to both subjects’ behavior and to their neural representations of planning.
Affiliation(s)
- Sonja Schach
- Institute of Neural Information Processing, University of Ulm, Ulm, Germany
| | - Axel Lindner
- Tübingen Center for Mental Health, Department of Psychiatry and Psychotherapy, University of Tübingen, Tübingen, Germany
- Centre of Neurology, Division of Neuropsychology, Hertie-Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany
| | | |
|
41
|
Levari DE. Range-frequency effects can explain and eliminate prevalence-induced concept change. Cognition 2022; 226:105196. [DOI: 10.1016/j.cognition.2022.105196] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 08/27/2021] [Revised: 05/27/2022] [Accepted: 05/31/2022] [Indexed: 11/25/2022]
|
42
|
Treadway MT, Salamone JD. Vigor, Effort-Related Aspects of Motivation and Anhedonia. Curr Top Behav Neurosci 2022; 58:325-353. [PMID: 35505057 DOI: 10.1007/7854_2022_355] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Indexed: 01/06/2023]
Abstract
In this chapter we provide an overview of the pharmacological and circuit mechanisms that determine the willingness to expend effort in pursuit of rewards. A particular focus will be on the role of the mesolimbic dopamine system, as well as the contributing roles of limbic and cortical brain areas involved in the evaluation, selection, and invigoration of goal-directed actions. We begin with a review of preclinical studies, which have provided key insights into the brain systems that are necessary and sufficient for effort-based decision-making and have characterized novel compounds that enhance selection of high-effort activities. Next, we summarize translational studies identifying and expanding this circuitry in humans. Finally, we discuss the relevance of this work for understanding common motivational impairments as part of the broader anhedonia symptom domain associated with mental illness, and the identification of new treatment targets within this circuitry to improve motivation and effort-expenditure.
Affiliation(s)
| | - John D Salamone
- Department of Psychological Sciences, University of Connecticut, Storrs, CT, USA
| |
|
43
|
Eissa TL, Gold JI, Josić K, Kilpatrick ZP. Suboptimal human inference can invert the bias-variance trade-off for decisions with asymmetric evidence. PLoS Comput Biol 2022; 18:e1010323. [PMID: 35853038 PMCID: PMC9337699 DOI: 10.1371/journal.pcbi.1010323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2021] [Revised: 07/29/2022] [Accepted: 06/22/2022] [Indexed: 11/18/2022] Open
Abstract
Solutions to challenging inference problems are often subject to a fundamental trade-off between: 1) bias (being systematically wrong) that is minimized with complex inference strategies, and 2) variance (being oversensitive to uncertain observations) that is minimized with simple inference strategies. However, this trade-off is based on the assumption that the strategies being considered are optimal for their given complexity and thus has unclear relevance to forms of inference based on suboptimal strategies. We examined inference problems applied to rare, asymmetrically available evidence, which a large population of human subjects solved using a diverse set of strategies that varied in form and complexity. In general, subjects using more complex strategies tended to have lower bias and variance, but with a dependence on the form of strategy that reflected an inversion of the classic bias-variance trade-off: subjects who used more complex, but imperfect, Bayesian-like strategies tended to have lower variance but higher bias because of incorrect tuning to latent task features, whereas subjects who used simpler heuristic strategies tended to have higher variance because they operated more directly on the observed samples but lower, near-normative bias. Our results help define new principles that govern individual differences in behavior that depends on rare-event inference and, more generally, about the information-processing trade-offs that can be sensitive to not just the complexity, but also the optimality, of the inference process.
People use diverse strategies to make inferences about the world around them, often based on limited evidence. Such inference strategies may be simple but prone to systematic errors or more complex and accurate, but such trends need not always be the rule. We modeled and measured how human participants made rare-event decisions in a preregistered, online study. The participants tended to use suboptimal decision strategies that reflected an inversion of the classic bias-variance trade-off: some used complex, nearly normative strategies with mistuned evidence weights that corresponded to relatively high choice biases but lower choice variance, whereas others used simpler heuristic strategies that corresponded to lower biases but higher variance. These relationships illustrate structure in suboptimality that can be used to identify systematic sources of human errors.
Affiliation(s)
- Tahra L. Eissa
- Department of Applied Mathematics, University of Colorado Boulder, Boulder, Colorado, United States of America
| | - Joshua I. Gold
- Department of Neuroscience, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Krešimir Josić
- Department of Mathematics, University of Houston, Houston, Texas, United States of America
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Zachary P. Kilpatrick
- Department of Applied Mathematics, University of Colorado Boulder, Boulder, Colorado, United States of America
- Institute of Cognitive Science, University of Colorado Boulder, Boulder, Colorado, United States of America
| |
|
44
|
Abstract
Motivation is key for performance in domains such as work, sport, and learning. Research has established that motivation and the willingness to invest effort generally increase as a function of reward. However, this view struggles to explain some empirical observations. For example, in the domain of sport, athletes sometimes appear to lose motivation when playing against weak opponents, despite objective rewards being high. This and similar evidence highlight the role of subjective value in motivation and effort allocation. To capture this, we advance a novel theory and computational model in which motivation and effort allocation arise from reference-based evaluation processes. Our proposal argues that motivation (and the ensuing willingness to exert effort) stems from subjective value, which in turn depends on one's standards about performance and on the confidence about these standards. In a series of simulations, we show that the model explains puzzling motivational dynamics and associated feelings. Crucially, the model identifies realistic standards (i.e., those matching one's own actual ability) as those more beneficial for motivation and performance. On this basis, the model establishes a normative solution to the problem of optimal allocation of effort, analogous to the optimal allocation of neural and computational resources as in efficient coding.
|
45
|
Colombo B, Iannello P. The combined effect of music-induced emotions and neuromodulation on economic decision making: a tDCS study. Journal of Cognitive Psychology 2022. [DOI: 10.1080/20445911.2022.2084546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]
Affiliation(s)
- Barbara Colombo
- Behavioral Neuroscience Lab, Champlain College, Burlington, VT, USA
| | - Paola Iannello
- Psychology Department, Catholic University of the Sacred Heart, Milan, Italy
| |
|
46
|
Attaallah B, Petitet P, Slavkova E, Turner V, Saleh Y, Manohar SG, Husain M. Hyperreactivity to uncertainty is a key feature of subjective cognitive impairment. eLife 2022; 11:75834. [PMID: 35536752 PMCID: PMC9197396 DOI: 10.7554/elife.75834] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/24/2021] [Accepted: 05/09/2022] [Indexed: 11/26/2022] Open
Abstract
With an increasingly ageing global population, more people are presenting with concerns about their cognitive function, but not all have an underlying neurodegenerative diagnosis. Subjective cognitive impairment (SCI) is a common condition describing self-reported deficits in cognition without objective evidence of cognitive impairment. Many individuals with SCI suffer from depression and anxiety, which have been hypothesised to account for their cognitive complaints. Despite this association between SCI and affective features, the cognitive and brain mechanisms underlying SCI are poorly understood. Here, we show that people with SCI are hyperreactive to uncertainty and that this might be a key mechanism accounting for their affective burden. Twenty-seven individuals with SCI performed an information sampling task, where they could actively gather information prior to decisions. Across different conditions, SCI participants sampled faster and obtained more information than matched controls to resolve uncertainty. Remarkably, despite their ‘urgent’ sampling behaviour, SCI participants were able to maintain their efficiency. Hyperreactivity to uncertainty indexed by this sampling behaviour correlated with the severity of affective burden including depression and anxiety. Analysis of MRI resting functional connectivity revealed that SCI participants had stronger insular-hippocampal connectivity compared to controls, which also correlated with faster sampling. These results suggest that altered uncertainty processing is a key mechanism underlying the psycho-cognitive manifestations in SCI and implicate a specific brain network target for future treatment.
Affiliation(s)
- Bahaaeddin Attaallah
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
| | - Pierre Petitet
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Elista Slavkova
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Vicky Turner
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
| | - Youssuf Saleh
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
| | - Sanjay G Manohar
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
| | - Masud Husain
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| |
|
47
|
Time pressure changes how people explore and respond to uncertainty. Sci Rep 2022; 12:4122. [PMID: 35260717 PMCID: PMC8904509 DOI: 10.1038/s41598-022-07901-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 08/18/2021] [Accepted: 02/28/2022] [Indexed: 12/25/2022] Open
Abstract
How does time pressure influence exploration and decision-making? We investigated this question with several four-armed bandit tasks manipulating (within subjects) expected reward, uncertainty, and time pressure (limited vs. unlimited). With limited time, people have less opportunity to perform costly computations, thus shifting the cost-benefit balance of different exploration strategies. Through behavioral, reinforcement learning (RL), reaction time (RT), and evidence accumulation analyses, we show that time pressure changes how people explore and respond to uncertainty. Specifically, participants reduced their uncertainty-directed exploration under time pressure, were less value-directed, and repeated choices more often. Since our analyses relate uncertainty to slower responses and dampened evidence accumulation (i.e., drift rates), this demonstrates a resource-rational shift towards simpler, lower-cost strategies under time pressure. These results shed light on how people adapt their exploration and decision-making strategies to externally imposed cognitive constraints.
|
48
|
Grujic N, Brus J, Burdakov D, Polania R. Rational inattention in mice. Science Advances 2022; 8:eabj8935. [PMID: 35245128 PMCID: PMC8896787 DOI: 10.1126/sciadv.abj8935] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 05/08/2023]
Abstract
Behavior exhibited by humans and other organisms is generally inconsistent and biased and, thus, is often labeled irrational. However, the origins of this seemingly suboptimal behavior remain elusive. We developed a behavioral task and normative framework to reveal how organisms should allocate their limited processing resources such that sensory precision and its related metabolic investment are balanced to guarantee maximal utility. We found that mice act as rational inattentive agents by adaptively allocating their sensory resources in a way that maximizes reward consumption in previously unexperienced stimulus-reward association environments. Unexpectedly, perception of commonly occurring stimuli was relatively imprecise; however, this apparent statistical fallacy implies "awareness" and efficient adaptation to their neurocognitive limitations. Arousal systems carry reward distribution information of sensory signals, and distributional reinforcement learning mechanisms regulate sensory precision via top-down normalization. These findings reveal how organisms efficiently perceive and adapt to previously unexperienced environmental contexts within the constraints imposed by neurobiology.
Affiliation(s)
- Nikola Grujic
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Neuroscience Center Zürich, Zurich, Switzerland
| | - Jeroen Brus
- Neuroscience Center Zürich, Zurich, Switzerland
- Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
| | - Denis Burdakov
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Neuroscience Center Zürich, Zurich, Switzerland
- Corresponding author. (R.P.); (D.B.)
| | - Rafael Polania
- Neuroscience Center Zürich, Zurich, Switzerland
- Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Corresponding author. (R.P.); (D.B.)
| |
|
49
|
Fine JM, Hayden BY. The whole prefrontal cortex is premotor cortex. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200524. [PMID: 34957853 PMCID: PMC8710885 DOI: 10.1098/rstb.2020.0524] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Received: 06/08/2021] [Accepted: 10/01/2021] [Indexed: 11/12/2022] Open
Abstract
We propose that the entirety of the prefrontal cortex (PFC) can be seen as fundamentally premotor in nature. By this, we mean that the PFC consists of an action abstraction hierarchy whose core function is the potentiation and depotentiation of possible action plans at different levels of granularity. We argue that the apex of the hierarchy should revolve around the process of goal-selection, which we posit is inherently a form of optimization over action abstraction. Anatomical and functional evidence supports the idea that this hierarchy originates on the orbital surface of the brain and extends dorsally to motor cortex. Accordingly, our viewpoint positions the orbitofrontal cortex in a key role in the optimization of goal-selection policies, and suggests that its other proposed roles are aspects of this more general function. Our proposed perspective will reframe outstanding questions, open up new areas of inquiry and align theories of prefrontal function with evolutionary principles. This article is part of the theme issue 'Systems neuroscience through the lens of evolutionary theory'.
Affiliation(s)
- Justin M. Fine
- Department of Neuroscience, Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Benjamin Y. Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| |
|
50
|
Foucault C, Meyniel F. Gated recurrence enables simple and accurate sequence prediction in stochastic, changing, and structured environments. eLife 2021; 10:71801. [PMID: 34854377 PMCID: PMC8735865 DOI: 10.7554/elife.71801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/30/2021] [Accepted: 12/01/2021] [Indexed: 11/13/2022] Open
Abstract
From decision making to perception to language, predicting what is coming next is crucial. It is also challenging in stochastic, changing, and structured environments; yet the brain makes accurate predictions in many situations. What computational architecture could enable this feat? Bayesian inference makes optimal predictions but is prohibitively difficult to compute. Here, we show that a specific recurrent neural network architecture enables simple and accurate solutions in several environments. This architecture relies on three mechanisms: gating, lateral connections, and recurrent weight training. Like the optimal solution and the human brain, such networks develop internal representations of their changing environment (including estimates of the environment’s latent variables and the precision of these estimates), leverage multiple levels of latent structure, and adapt their effective learning rate to changes without changing their connection weights. Being ubiquitous in the brain, gated recurrence could therefore serve as a generic building block to predict in real-life environments.
Affiliation(s)
- Cédric Foucault
- INSERM, CEA, Université Paris-Saclay, Gif sur Yvette, France
| | | |
|