1
|
Strigo IA, Simmons AN. The Intersection of Interoception and Anticipation Related to Pain. Curr Top Behav Neurosci 2025. [PMID: 40360927 DOI: 10.1007/7854_2025_588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2025]
Abstract
This chapter focuses on the interconnectedness of bodily awareness, emotion, and time perception, focusing on the roles of interoception and anticipation, pioneered by A.D. (Bud) Craig and advanced by contemporary research. Pain, a pivotal aspect of human experience, serves as a conduit for understanding our relationship with the world. Anticipation of pain, vital for survival, influences subjective pain experiences and is modulated by factors such as physiological reactivity and contextual cues. Emotional states significantly shape pain perception, with chronic pain conditions and affective disorders characterized by dysregulated pain modulation mechanisms. We discuss a state space model for pain, wherein pain functions as a latent construct shaped by both anticipatory and contemporaneous factors. Understanding these mechanisms is critical for informing clinical interventions aimed at pain management and highlights the intersection of interoception, emotion, and pain.
Collapse
Affiliation(s)
- Irina A Strigo
- Veterans Affairs & University of California, San Francisco, San Francisco, CA, USA.
| | - Alan N Simmons
- Veterans Affairs & University of California, San Diego, San Diego, CA, USA.
| |
Collapse
|
2
|
Tohidi-Moghaddam M, Tsetsos K. The timescale and direction of influence of a third inferior alternative in human value-learning. COMMUNICATIONS PSYCHOLOGY 2025; 3:56. [PMID: 40188261 PMCID: PMC11972167 DOI: 10.1038/s44271-025-00229-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 03/11/2025] [Indexed: 04/07/2025]
Abstract
The way humans and other animals represent the values of alternatives is context-dependent, as it can be distorted by inferior alternatives that are immediately available for choice (immediate context); or that were encountered in previous episodes (temporal context). Yet, the extent to which the immediate and temporal context (co-) shape context-dependent valuation remains unclear. Here, we asked human participants (onsite: N = 30, online: N = 68) to learn the values associated with three alternatives and explicitly report these values before making binary and ternary choices among the alternatives. We show that context-dependent valuation is evident in the pre-choice value estimates and manifests equally in binary and ternary choices. Accordingly, we conclude that value representations are modulated by the temporal (and not the immediate) context. The direction and across-participants variability of this modulation cannot be captured by extant normalization theories but by a mechanism constructing values through sequential binary comparisons.
Collapse
Affiliation(s)
- Maryam Tohidi-Moghaddam
- Department of Neurophysiology and Pathophysiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
- Department of Psychology and Hamburg Center of Neuroscience, Universität Hamburg, Hamburg, Germany.
| | - Konstantinos Tsetsos
- Department of Neurophysiology and Pathophysiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
- School of Psychological Science, University of Bristol, Bristol, UK.
| |
Collapse
|
3
|
Yi Y, Kreißl MC, Speck O, Düzel E, Hämmerer D. Decoding Salience: A Functional Magnetic Resonance Imaging Investigation of Reward and Contextual Unexpectedness in Memory Encoding and Retrieval. Hum Brain Mapp 2025; 46:e70124. [PMID: 39764707 PMCID: PMC11705450 DOI: 10.1002/hbm.70124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 12/13/2024] [Accepted: 12/19/2024] [Indexed: 01/11/2025] Open
Abstract
The present study investigated the neuromodulatory substrates of salience processing and its impact on memory encoding and behaviour, with a specific focus on two distinct types of salience: reward and contextual unexpectedness. 46 Participants performed a novel task paradigm modulating these two aspects independently and allowing for investigating their distinct and interactive effects on memory encoding while undergoing high-resolution fMRI. By using advanced image processing techniques tailored to examine midbrain and brainstem nuclei with high precision, our study additionally aimed to elucidate differential activation patterns in subcortical nuclei in response to reward-associated and contextually unexpected stimuli, including distinct pathways involving in particular dopaminergic modulation. We observed a differential involvement of the ventral striatum, substantia nigra (SN) and caudate nucleus, as well as a functional specialisation within the subregions of the cingulate cortex for the two salience types. Moreover, distinct subregions within the SN in processing salience could be identified. Dorsal areas preferentially processed salience related to stimulus processing (of both reward and contextual unexpectedness), and ventral areas were involved in salience-related memory encoding (for contextual unexpectedness only). These functional specialisations within SN are in line with different projection patterns of dorsal and ventral SN to brain areas supporting attention and memory, respectively. By disentangling stimulus processing and memory encoding related to two salience types, we hope to further consolidate our understanding of neuromodulatory structures' differential as well as interactive roles in modulating behavioural responses to salient events.
Collapse
Affiliation(s)
- Yeo‐Jin Yi
- Institute of Cognitive Neurology and Dementia ResearchOtto‐von‐Guericke UniversityMagdeburgGermany
- German Center for Neurodegenerative Diseases (DZNE)MagdeburgGermany
| | - Michael C. Kreißl
- German Center for Neurodegenerative Diseases (DZNE)MagdeburgGermany
- Division of Nuclear Medicine, Department of Nuclear MedicineOtto‐von‐Guericke UniversityMagdeburgGermany
| | - Oliver Speck
- German Center for Neurodegenerative Diseases (DZNE)MagdeburgGermany
- Biomedical Magnetic Resonance, Faculty of Natural SciencesOtto‐von‐Guericke UniversityMagdeburgGermany
- Center for Behavioral Brain SciencesMagdeburgGermany
- Leibniz Institute for NeurobiologyMagdeburgGermany
| | - Emrah Düzel
- Institute of Cognitive Neurology and Dementia ResearchOtto‐von‐Guericke UniversityMagdeburgGermany
- German Center for Neurodegenerative Diseases (DZNE)MagdeburgGermany
- Center for Behavioral Brain SciencesMagdeburgGermany
- Institute of Cognitive NeuroscienceUniversity College LondonUK
| | - Dorothea Hämmerer
- Institute of Cognitive Neurology and Dementia ResearchOtto‐von‐Guericke UniversityMagdeburgGermany
- Center for Behavioral Brain SciencesMagdeburgGermany
- Institute of Cognitive NeuroscienceUniversity College LondonUK
- Department of PsychologyUniversity of InnsbruckInnsbruckAustria
| |
Collapse
|
4
|
Butz MV, Mittenbühler M, Schwöbel S, Achimova A, Gumbsch C, Otte S, Kiebel S. Contextualizing predictive minds. Neurosci Biobehav Rev 2025; 168:105948. [PMID: 39580009 DOI: 10.1016/j.neubiorev.2024.105948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 09/13/2024] [Accepted: 11/16/2024] [Indexed: 11/25/2024]
Abstract
The structure of human memory seems to be optimized for efficient prediction, planning, and behavior. We propose that these capacities rely on a tripartite structure of memory that includes concepts, events, and contexts-three layers that constitute the mental world model. We suggest that the mechanism that critically increases adaptivity and flexibility is the tendency to contextualize. This tendency promotes local, context-encoding abstractions, which focus event- and concept-based planning and inference processes on the task and situation at hand. As a result, cognitive contextualization offers a solution to the frame problem-the need to select relevant features of the environment from the rich stream of sensorimotor signals. We draw evidence for our proposal from developmental psychology and neuroscience. Adopting a computational stance, we present evidence from cognitive modeling research which suggests that context sensitivity is a feature that is critical for maximizing the efficiency of cognitive processes. Finally, we turn to recent deep-learning architectures which independently demonstrate how context-sensitive memory can emerge in a self-organized learning system constrained by cognitively-inspired inductive biases.
Collapse
Affiliation(s)
- Martin V Butz
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany.
| | - Maximilian Mittenbühler
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
| | - Sarah Schwöbel
- Cognitive Computational Neuroscience, Faculty of Psychology, TU Dresden, School of Science, Dresden 01062, Germany
| | - Asya Achimova
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
| | - Christian Gumbsch
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany; Chair of Cognitive and Clinical Neuroscience, Faculty of Psychology, TU Dresden, Dresden 01069, Germany
| | - Sebastian Otte
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany; Adaptive AI Lab, Institute of Robotics and Cognitive Systems, University of Lübeck, Ratzeburger Allee 160, Lübeck 23562, Germany
| | - Stefan Kiebel
- Cognitive Computational Neuroscience, Faculty of Psychology, TU Dresden, School of Science, Dresden 01062, Germany
| |
Collapse
|
5
|
Brochard J, Daunizeau J. Efficient value synthesis in the orbitofrontal cortex explains how loss aversion adapts to the ranges of gain and loss prospects. eLife 2024; 13:e80979. [PMID: 39652465 PMCID: PMC11627503 DOI: 10.7554/elife.80979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 11/05/2024] [Indexed: 12/12/2024] Open
Abstract
Is irrational behavior the incidental outcome of biological constraints imposed on neural information processing? In this work, we consider the paradigmatic case of gamble decisions, where gamble values integrate prospective gains and losses. Under the assumption that neurons have a limited firing response range, we show that mitigating the ensuing information loss within artificial neural networks that synthetize value involves a specific form of self-organized plasticity. We demonstrate that the ensuing efficient value synthesis mechanism induces value range adaptation. We also reveal how the ranges of prospective gains and/or losses eventually determine both the behavioral sensitivity to gains and losses and the information content of the network. We test these predictions on two fMRI datasets from the OpenNeuro.org initiative that probe gamble decision-making but differ in terms of the range of gain prospects. First, we show that peoples' loss aversion eventually adapts to the range of gain prospects they are exposed to. Second, we show that the strength with which the orbitofrontal cortex (in particular: Brodmann area 11) encodes gains and expected value also depends upon the range of gain prospects. Third, we show that, when fitted to participant's gambling choices, self-organizing artificial neural networks generalize across gain range contexts and predict the geometry of information content within the orbitofrontal cortex. Our results demonstrate how self-organizing plasticity aiming at mitigating information loss induced by neurons' limited response range may result in value range adaptation, eventually yielding irrational behavior.
Collapse
Affiliation(s)
- Jules Brochard
- Sorbonne UniversitéParisFrance
- Institut du CerveauParisFrance
- INSERM UMR S1127ParisFrance
| | - Jean Daunizeau
- Sorbonne UniversitéParisFrance
- Institut du CerveauParisFrance
- INSERM UMR S1127ParisFrance
| |
Collapse
|
6
|
DiBerardino PAV, Filipowicz ALS, Danckert J, Anderson B. Plinko: Eliciting beliefs to build better models of statistical learning and mental model updating. Br J Psychol 2024; 115:759-786. [PMID: 39096484 DOI: 10.1111/bjop.12724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 07/03/2024] [Indexed: 08/05/2024]
Abstract
Prior beliefs are central to Bayesian accounts of cognition, but many of these accounts do not directly measure priors. More specifically, initial states of belief heavily influence how new information is assumed to be utilized when updating a particular model. Despite this, prior and posterior beliefs are either inferred from sequential participant actions or elicited through impoverished means. We had participants to play a version of the game 'Plinko', to first elicit individual participant priors in a theoretically agnostic manner. Subsequent learning and updating of participant beliefs was then directly measured. We show that participants hold various priors that cluster around prototypical probability distributions that in turn influence learning. In follow-up studies, we show that participant priors are stable over time and that the ability to update beliefs is influenced by a simple environmental manipulation (i.e., a short break). These data reveal the importance of directly measuring participant beliefs rather than assuming or inferring them as has been widely done in the literature to date. The Plinko game provides a flexible and fecund means for examining statistical learning and mental model updating.
Collapse
Affiliation(s)
| | | | - James Danckert
- Department of Psychology, University of Waterloo, Waterloo, Ontario, Canada
| | - Britt Anderson
- Department of Psychology, University of Waterloo, Waterloo, Ontario, Canada
| |
Collapse
|
7
|
Strigo IA, Kadlec M, Mitchell JM, Simmons AN. Identification of group differences in predictive anticipatory biasing of pain during uncertainty: preparing for the worst but hoping for the best. Pain 2024; 165:1735-1747. [PMID: 38501988 PMCID: PMC11247452 DOI: 10.1097/j.pain.0000000000003207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 01/05/2024] [Accepted: 01/09/2024] [Indexed: 03/20/2024]
Abstract
ABSTRACT Pain anticipation during conditions of uncertainty can unveil intrinsic biases, and understanding these biases can guide pain treatment interventions. This study used machine learning and functional magnetic resonance imaging to predict anticipatory responses in a pain anticipation experiment. One hundred forty-seven participants that included healthy controls (n = 57) and individuals with current and/or past mental health diagnosis (n = 90) received cues indicating upcoming pain stimuli: 2 cues predicted high and low temperatures, while a third cue introduced uncertainty. Accurate differentiation of neural patterns associated with specific anticipatory conditions was observed, involving activation in the anterior short gyrus of the insula and the nucleus accumbens. Three distinct response profiles emerged: subjects with a negative bias towards high pain anticipation, those with a positive bias towards low pain anticipation, and individuals whose predictions during uncertainty were unbiased. These profiles remained stable over one year, were consistent across diagnosed psychopathologies, and correlated with cognitive coping styles and underlying insula anatomy. The findings suggest that individualized and stable pain anticipation occurs in uncertain conditions.
Collapse
Affiliation(s)
- Irina A. Strigo
- Emotion and Pain Laboratory, San Francisco Veterans Affairs Health Care Center, San Francisco, CA, United States
- Department of Psychiatry, University of California San Francisco, San Francisco, CA, United States
| | - Molly Kadlec
- Center for Imaging of Neurodegenerative Diseases, San Francisco Veterans Affairs Health Care Center, San Francisco, CA, United States
| | - Jennifer M. Mitchell
- Department of Psychiatry, University of California San Francisco, San Francisco, CA, United States
- Department of Neurology, University of California San Francisco, San Francisco, CA, United States
| | - Alan N. Simmons
- San Diego Veterans Affairs Health Care Center, San Diego, CA, United States
- Department of Psychiatry, University of California San Diego, San Diego, CA, United States
| |
Collapse
|
8
|
Izakson L, Gal S, Shahar M, Tavor I, Levy DJ. Similar functional networks predict performance in both perceptual and value-based decision tasks. Cereb Cortex 2023; 33:2669-2681. [PMID: 35724432 DOI: 10.1093/cercor/bhac234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 05/16/2022] [Accepted: 05/16/2022] [Indexed: 11/13/2022] Open
Abstract
There are numerous commonalities between perceptual and preferential decision processes. For instance, previous studies have shown that both of these decision types are influenced by context. Also, the same computational models can explain both. However, the neural processes and functional connections that underlie these similarities between perceptual and value-based decisions are still unclear. Hence, in the current study, we examine whether perceptual and preferential processes can be explained by similar functional networks utilizing data from the Human Connectome Project. We used resting-state functional magnetic resonance imaging data to predict performance of 2 different decision-making tasks: a value-related task (the delay discounting task) and a perceptual task (the flanker task). We then examined the existence of shared predictive-network features across these 2 decision tasks. Interestingly, we found a significant positive correlation between the functional networks, which predicted the value-based and perceptual tasks. In addition, a larger functional connectivity between visual and frontal decision brain areas was a critical feature in the prediction of both tasks. These results demonstrate that functional connections between perceptual and value-related areas in the brain are inherently related to decision-making processes across domains.
Collapse
Affiliation(s)
- Liz Izakson
- Sagol School of Neuroscience, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
- Coller School of Management, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
| | - Shachar Gal
- Sagol School of Neuroscience, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
| | - Moni Shahar
- Center of AI and Data Science, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
| | - Ido Tavor
- Sagol School of Neuroscience, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
- Strauss Center for Computational Neuroimaging, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
| | - Dino J Levy
- Sagol School of Neuroscience, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
- Coller School of Management, Tel Aviv University, Ramat Aviv, Tel Aviv 6997801, Israel
| |
Collapse
|
9
|
Barakchian Z, Vahabie AH, Nili Ahmadabadi M. Implicit Counterfactual Effect in Partial Feedback Reinforcement Learning: Behavioral and Modeling Approach. Front Neurosci 2022; 16:631347. [PMID: 35620668 PMCID: PMC9127865 DOI: 10.3389/fnins.2022.631347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 03/28/2022] [Indexed: 11/13/2022] Open
Abstract
Context remarkably affects learning behavior by adjusting option values according to the distribution of available options. Displaying counterfactual outcomes, the outcomes of the unchosen option alongside the chosen one (i.e., providing complete feedback), would increase the contextual effect by inducing participants to compare the two outcomes during learning. However, when the context only consists of the juxtaposition of several options and there is no such explicit counterfactual factor (i.e., only partial feedback is provided), it is not clear whether and how the contextual effect emerges. In this research, we employ Partial and Complete feedback paradigms in which options are associated with different reward distributions. Our modeling analysis shows that the model that uses the outcome of the chosen option for updating the values of both chosen and unchosen options in opposing directions can better account for the behavioral data. This is also in line with the diffusive effect of dopamine on the striatum. Furthermore, our data show that the contextual effect is not limited to probabilistic rewards, but also extends to magnitude rewards. These results suggest that by extending the counterfactual concept to include the effect of the chosen outcome on the unchosen option, we can better explain why there is a contextual effect in situations in which there is no extra information about the unchosen outcome.
Collapse
Affiliation(s)
- Zahra Barakchian
- Department of Cognitive Neuroscience, Institute for Research in Fundamental Sciences, Tehran, Iran
- *Correspondence: Zahra Barakchian
| | - Abdol-Hossein Vahabie
- Cognitive Systems Laboratory, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
- Department of Psychology, Faculty of Psychology and Education, University of Tehran, Tehran, Iran
| | - Majid Nili Ahmadabadi
- Cognitive Systems Laboratory, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
| |
Collapse
|
10
|
|
11
|
Palminteri S, Lebreton M. Context-dependent outcome encoding in human reinforcement learning. Curr Opin Behav Sci 2021. [DOI: 10.1016/j.cobeha.2021.06.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
12
|
Tashjian SM, Zbozinek TD, Mobbs D. A Decision Architecture for Safety Computations. Trends Cogn Sci 2021; 25:342-354. [PMID: 33674206 PMCID: PMC8035229 DOI: 10.1016/j.tics.2021.01.013] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 01/28/2021] [Accepted: 01/29/2021] [Indexed: 12/11/2022]
Abstract
Accurately estimating safety is critical to pursuing nondefensive survival behaviors. However, little attention has been paid to how the human brain computes safety. We conceptualize a model that consists of two components: (i) threat-oriented evaluations that focus on threat value, imminence, and predictability; and (ii) self-oriented evaluations that focus on the agent's experience, strategies, and ability to control the situation. Our model points to the dynamic interaction between these two components as a mechanism of safety estimation. Based on a growing body of human literature, we hypothesize that distinct regions of the ventromedial prefrontal cortex (vmPFC) respond to threat and safety to facilitate survival decisions. We suggest safety is not an inverse of danger, but reflects independent computations that mediate defensive circuits and behaviors.
Collapse
Affiliation(s)
- Sarah M Tashjian
- Humanities and Social Sciences, California Institute of Technology, Pasadena, CA 91125, USA.
| | - Tomislav D Zbozinek
- Humanities and Social Sciences, California Institute of Technology, Pasadena, CA 91125, USA
| | - Dean Mobbs
- Humanities and Social Sciences, California Institute of Technology, Pasadena, CA 91125, USA; Computation and Neural Systems, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
13
|
Miendlarzewska EA, Aberg KC, Bavelier D, Schwartz S. Prior Reward Conditioning Dampens Hippocampal and Striatal Responses during an Associative Memory Task. J Cogn Neurosci 2020; 33:402-421. [PMID: 33326326 DOI: 10.1162/jocn_a_01660] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Offering reward during encoding typically leads to better memory [Adcock, R. A., Thangavel, A., Whitfield-Gabrieli, S.,Knutson, B., & Gabrieli, J. D. E. Reward-motivated learning: Mesolimbic activation precedes memory formation. Neuron, 50, 507-517, 2006]. Whether such memory benefit persists when tested in a different task context remains, however, largely understudied [Wimmer, G. E., & Buechel, C. Reactivation of reward-related patterns from single past episodes supports memory-based decision making. Journal of Neuroscience, 36, 2868-2880, 2016]. Here, we ask whether reward at encoding leads to a generalized advantage across learning episodes, a question of high importance for any everyday life applications, from education to patient rehabilitation. Although we confirmed that offering monetary reward increased responses in the ventral striatum and pleasantness judgments for pictures used as stimuli, this immediate beneficial effect of reward did not carry over to a subsequent and different picture-location association memory task during which no reward was delivered. If anything, a trend for impaired memory accuracy was observed for the initially high-rewarded pictures as compared to low-rewarded ones. In line with this trend in behavioral performance, fMRI activity in reward (i.e., ventral striatum) and in memory (i.e., hippocampus) circuits was reduced during the encoding of new associations using previously highly rewarded pictures (compared to low-reward pictures). These neural effects extended to new pictures from same, previously highly rewarded semantic category. Twenty-four hours later, delayed recall of associations involving originally highly rewarded items was accompanied by decreased functional connectivity between the hippocampus and two brain regions implicated in value-based learning, the ventral striatum and the ventromedial PFC. We conclude that acquired reward value elicits a downward value-adjustment signal in the human reward circuit when reactivated in a novel nonrewarded context, with a parallel disengagement of memory-reward (hippocampal-striatal) networks, likely to undermine new associative learning. Although reward is known to promote learning, here we show how it may subsequently hinder hippocampal and striatal responses during new associative memory formation.
Collapse
Affiliation(s)
- Ewa A Miendlarzewska
- University of Geneva.,Campus Biotech, Geneva, Switzerland.,Montpellier Business School
| | | | | | | |
Collapse
|
14
|
Brooks HR, Sokol-Hessner P. Quantifying the immediate computational effects of preceding outcomes on subsequent risky choices. Sci Rep 2020; 10:9878. [PMID: 32555293 PMCID: PMC7303130 DOI: 10.1038/s41598-020-66502-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Accepted: 05/18/2020] [Indexed: 11/30/2022] Open
Abstract
Forty years ago, prospect theory introduced the notion that risky options are evaluated relative to their recent context, causing a significant shift in the study of risky monetary decision-making in psychology, economics, and neuroscience. Despite the central role of past experiences, it remains unclear whether, how, and how much past experiences quantitatively influence risky monetary choices moment-to-moment in a nominally learning-free setting. We analyzed a large dataset of risky monetary choices with trial-by-trial feedback to quantify how past experiences, or recent events, influence risky choice behavior and the underlying processes. We found larger recent outcomes both negatively influence subsequent risk-taking and positively influence the weight put on potential losses. Using a hierarchical Bayesian framework to fit a modified version of prospect theory, we demonstrated that the same risks will be evaluated differently given different past experiences. The computations underlying risky decision-making are fundamentally dynamic, even if the environment is not.
Collapse
Affiliation(s)
- Hayley R Brooks
- Department of Psychology, University of Denver, Denver, CO, USA
| | | |
Collapse
|
15
|
Laube C, Lorenz R, van den Bos W. Pubertal testosterone correlates with adolescent impatience and dorsal striatal activity. Dev Cogn Neurosci 2020; 42:100749. [PMID: 31942858 PMCID: PMC7242510 DOI: 10.1016/j.dcn.2019.100749] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 12/10/2019] [Accepted: 12/10/2019] [Indexed: 01/02/2023] Open
Abstract
Recent self-report and behavioral studies have demonstrated that pubertal testosterone is related to an increase in risky and impulsive behavior. Yet, the mechanisms underlying such a relationship are poorly understood. Findings from both human and rodent studies point towards distinct striatal pathways including the ventral and dorsal striatum as key target regions for pubertal hormones. In this study we investigated task-related impatience of boys between 10 and 15 years of age (N = 75), using an intertemporal choice task combined with measures of functional magnetic resonance imaging and hormonal assessment. Increased levels of testosterone were associated with a greater response bias towards choosing the smaller sooner option. Furthermore, our results show that testosterone specifically modulates the dorsal, not ventral, striatal pathway. These results provide novel insights into our understanding of adolescent impulsive and risky behaviors and how pubertal hormones are related to neural processes.
Collapse
Affiliation(s)
- Corinna Laube
- Center for Lifespan Psychology, Max Planck Institute for Human Development, Berlin, Germany.
| | - Robert Lorenz
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
| | - Wouter van den Bos
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany; Department of Developmental Psychology, University of Amsterdam, The Netherlands
| |
Collapse
|
16
|
Rigoli F, Dolan R. Better than expected: the influence of option expectations during decision-making. Proc Biol Sci 2019; 285:20182472. [PMID: 30963894 PMCID: PMC6304046 DOI: 10.1098/rspb.2018.2472] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Our choices often arise from a consideration of options presented in a sequence (e.g. the products in a supermarket row). However, whether the precise sequential order of option presentation affects decision-making remains poorly understood. A recent model of choice proposes that, in a set of options presented sequentially, those that are better than expected will be perceived as more valuable, even when options are objectively equivalent within the set. Inspired by this proposal, we devised a novel decision-making task where we manipulated the order of option presentation together with expectations about option value. Even when we compared trials that were exactly equivalent except for option order, we observed a striking preference for options that were better than expected. Our findings show that expectations about options affect which option will be favoured within a sequence, an influence which is manifested as a preference for better-than-expected options. The findings have potential practical implications, as for example they may help policymakers in devising nudge strategies that rely on ad hoc option orders.
Collapse
Affiliation(s)
- Francesco Rigoli
- 1 Department of Psychology, City, University of London , Northampton Square, London EC1 V 0HB , UK.,2 The Wellcome Trust Centre for Neuroimaging, UCL , 12 Queen Square, London WC1N 3BG , UK
| | - Raymond Dolan
- 2 The Wellcome Trust Centre for Neuroimaging, UCL , 12 Queen Square, London WC1N 3BG , UK.,3 Max Planck UCL Centre for Computational Psychiatry and Ageing Research , London WC1B 5EH , UK
| |
Collapse
|
17
|
Reference effects on decision-making elicited by previous rewards. Cognition 2019; 192:104034. [PMID: 31387053 DOI: 10.1016/j.cognition.2019.104034] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2019] [Revised: 07/27/2019] [Accepted: 07/30/2019] [Indexed: 11/23/2022]
Abstract
Substantial evidence has highlighted reference effects occurring during decision-making, whereby subjective value is not calculated in absolute terms but relative to the distribution of rewards characterizing a context. Among these, within-choice effects are exerted by options simultaneously available during choice. These should be distinguished from between-choice effects, which depend on the distribution of options presented in the past. Influential theories on between-choice effects include Decision-by-Sampling, Expectation-as-Reference and Divisive Normalization. Surprisingly, previous literature has focused on each theory individually disregarding the others. Thus, similarities and differences among theories remain to be systematically examined. Here we fill this gap by offering an overview of the state-of-the-art of research about between-choice reference effects. Our comparison of alternative theories shows that, at present, none of them is able to account for the full range of empirical data. To address this, we propose a model inspired by previous perspectives and based on a logistic framework, hence called logistic model of subjective value. Predictions of the model are analysed in detail about reference effects and risky decision-making. We conclude that our proposal offers a compelling framework for interpreting the multifaceted manifestations of between-choice reference effects.
Collapse
|
18
|
Bavard S, Lebreton M, Khamassi M, Coricelli G, Palminteri S. Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences. Nat Commun 2018; 9:4503. [PMID: 30374019 PMCID: PMC6206161 DOI: 10.1038/s41467-018-06781-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 09/26/2018] [Indexed: 11/17/2022] Open
Abstract
In economics and perceptual decision-making contextual effects are well documented, where decision weights are adjusted as a function of the distribution of stimuli. Yet, in reinforcement learning literature whether and how contextual information pertaining to decision states is integrated in learning algorithms has received comparably little attention. Here, we investigate reinforcement learning behavior and its computational substrates in a task where we orthogonally manipulate outcome valence and magnitude, resulting in systematic variations in state-values. Model comparison indicates that subjects' behavior is best accounted for by an algorithm which includes both reference point-dependence and range-adaptation-two crucial features of state-dependent valuation. In addition, we find that state-dependent outcome valuation progressively emerges, is favored by increasing outcome information and correlated with explicit understanding of the task structure. Finally, our data clearly show that, while being locally adaptive (for instance in negative valence and small magnitude contexts), state-dependent valuation comes at the cost of seemingly irrational choices, when options are extrapolated out from their original contexts.
Collapse
Affiliation(s)
- Sophie Bavard
- Laboratoire de Neurosciences Cognitives Computationnelles, Institut National de la Santé et Recherche Médicale, 29 rue d'Ulm, 75005, Paris, France
- Département d'Etudes Cognitives, Ecole Normale Supérieure, Paris, 75005, France
- Institut d'Etudes de la Cognition, Université de Paris Sciences et Lettres, Paris, 75005, France
| | - Maël Lebreton
- CREED lab, Amsterdam School of Economics, Faculty of Business and Economics, University of Amsterdam, Roetersstraat 11, Amsterdam, 1018 WB, The Netherlands
- Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, 1018 WB, The Netherlands
- Swiss Centre for Affective Sciences, University of Geneva, 24 rue du Général-Dufour, Geneva, 1205, Switzerland
| | - Mehdi Khamassi
- Institut des Systèmes Intelligents et Robotiques, Centre National de la Recherche Scientifique, 4 place Jussieu, 75005, Paris, France
- Institut des Sciences de l'Information et de leurs Interactions, Sorbonne Universités, 3 rue Michel-Ange, Paris, 75794, France
| | - Giorgio Coricelli
- Department of Economics, University of Southern California, Los Angeles, CA, 90007, USA
- Centro Mente e Cervello, Università di Trento, corso Bettini 21, Rovereto, 38068, Italy
| | - Stefano Palminteri
- Laboratoire de Neurosciences Cognitives Computationnelles, Institut National de la Santé et Recherche Médicale, 29 rue d'Ulm, 75005, Paris, France.
- Département d'Etudes Cognitives, Ecole Normale Supérieure, Paris, 75005, France.
- Institut d'Etudes de la Cognition, Université de Paris Sciences et Lettres, Paris, 75005, France.
| |
Collapse
|
19
|
Timmer MHM, Aarts E, Esselink RAJ, Cools R. Enhanced motivation of cognitive control in Parkinson's disease. Eur J Neurosci 2018; 48:2374-2384. [PMID: 30151991 PMCID: PMC6175070 DOI: 10.1111/ejn.14137] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Revised: 08/17/2018] [Accepted: 08/20/2018] [Indexed: 01/22/2023]
Abstract
Motor and cognitive deficits in Parkinson's disease (PD) have been argued to reflect motivational deficits. In prior work, however, we have shown that motivation of cognitive control is paradoxically potentiated rather than impaired in Parkinson's disease. This is particularly surprising given the fact that Parkinson's disease is often accompanied by depression, a prototypical disorder of motivation. To replicate our previous finding and assess the effects of depression, we investigated performance of PD patients with (n = 22) and without depression (history) (n = 23) and age‐matched healthy controls (n = 23) on a task specifically designed to measure the effect of reward motivation on task‐switching. We replicated previous findings by showing contrasting effects of reward motivation on task‐switching in PD patients and age‐matched healthy controls. While the promise of high versus low reward improved task‐switching in PD, it tended to impair task‐switching in age‐matched healthy controls. There were no effects of a depression (history) diagnosis in PD patients. These findings reinforce prior observations that Parkinson's disease is accompanied by enhanced incentive motivation of cognitive control and highlight the potential of incentive motivational strategies for overcoming cognitive deficits in Parkinson's disease.
Collapse
Affiliation(s)
- Monique H M Timmer
- Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, The Netherlands.,Department of Neurology and Parkinson Centre Nijmegen (ParC), Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Esther Aarts
- Department of Neurology and Parkinson Centre Nijmegen (ParC), Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Rianne A J Esselink
- Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, The Netherlands.,Department of Neurology and Parkinson Centre Nijmegen (ParC), Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Roshan Cools
- Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, The Netherlands.,Department of Psychiatry, Radboud University Medical Centre, Nijmegen, The Netherlands
| |
Collapse
|
20
|
Seitz RJ, Paloutzian RF, Angel HF. From Believing to Belief: A General Theoretical Model. J Cogn Neurosci 2018; 30:1254-1264. [DOI: 10.1162/jocn_a_01292] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Cognitive neuroscience research has begun to explore the mental processes underlying what a belief and what believing are. Recent evidence suggests that believing involves fundamental brain functions that result in meaningful probabilistic representations, called beliefs. When relatively stable, these beliefs allow for guidance of behavior in individuals and social groups. However, they are also fluid and can be modified by new relevant information, interpersonal contact, social pressure, and situational demands. We present a theoretical model of believing that can account for the formation of both empirically grounded and metaphysical beliefs.
Collapse
|
21
|
Zimmermann J, Glimcher PW, Louie K. Multiple timescales of normalized value coding underlie adaptive choice behavior. Nat Commun 2018; 9:3206. [PMID: 30097577 PMCID: PMC6086888 DOI: 10.1038/s41467-018-05507-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Accepted: 07/10/2018] [Indexed: 01/25/2023] Open
Abstract
Adaptation is a fundamental process crucial for the efficient coding of sensory information. Recent evidence suggests that similar coding principles operate in decision-related brain areas, where neural value coding adapts to recent reward history. However, the circuit mechanism for value adaptation is unknown, and the link between changes in adaptive value coding and choice behavior is unclear. Here we show that choice behavior in nonhuman primates varies with the statistics of recent rewards. Consistent with efficient coding theory, decision-making shows increased choice sensitivity in lower variance reward environments. Both the average adaptation effect and across-session variability are explained by a novel multiple timescale dynamical model of value representation implementing divisive normalization. The model predicts empirical variance-driven changes in behavior despite having no explicit knowledge of environmental statistics, suggesting that distributional characteristics can be captured by dynamic model architectures. These findings highlight the importance of treating decision-making as a dynamic process and the role of normalization as a unifying computation for contextual phenomena in choice.
Collapse
Affiliation(s)
- Jan Zimmermann
- Center for Neural Science, New York University, 4 Washington Place Room 809, New York, NY, 10003, USA.
| | - Paul W Glimcher
- Center for Neural Science, New York University, 4 Washington Place Room 809, New York, NY, 10003, USA.,Institute for the Study of Decision Making, New York University, 4 Washington Place Room 809, New York, NY, 10003, USA
| | - Kenway Louie
- Center for Neural Science, New York University, 4 Washington Place Room 809, New York, NY, 10003, USA.,Institute for the Study of Decision Making, New York University, 4 Washington Place Room 809, New York, NY, 10003, USA
| |
Collapse
|
22
|
Chen X, Holland P, Galea JM. The effects of reward and punishment on motor skill learning. Curr Opin Behav Sci 2018. [DOI: 10.1016/j.cobeha.2017.11.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
23
|
Risk preference and choice stochasticity during decisions for other people. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2018; 18:331-341. [PMID: 29549530 PMCID: PMC5889416 DOI: 10.3758/s13415-018-0572-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In several contexts, such as finance and politics, people make choices that are relevant for others but irrelevant for oneself. Focusing on decision-making under risk, we compared monetary choices made for one's own interest with choices made on behalf of an anonymous individual. Consistent with the previous literature, other-interest choices were characterized by an increased gambling propensity. We also investigated choice stochasticity, which captures how much decisions vary in similar conditions. An aspect related to choice stochasticity is how much decisions are tuned to the option values, and we found that this was higher during self-interest than during other-interest choices. This effect was observed only in individuals who reported a motivation to distribute rewards unequally, suggesting that it may (at least partially) depend on a motivation to make accurate decisions for others. Our results indicate that, during decision-making under risk, choices for other people are characterized by a decreased tuning to the values of the options, in addition to enhanced risk seeking.
Collapse
|
24
|
Martinelli C, Rigoli F, Dolan RJ, Shergill SS. Decreased value-sensitivity in schizophrenia. Psychiatry Res 2018; 259:295-301. [PMID: 29096335 DOI: 10.1016/j.psychres.2017.10.031] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/24/2017] [Revised: 09/22/2017] [Accepted: 10/21/2017] [Indexed: 11/19/2022]
Abstract
Pathophysiology in schizophrenia has been linked to aberrant incentive salience, namely the dysfunctional processing of value linked to abnormal dopaminergic activity. In line with this, recent studies showed impaired learning of value in schizophrenia. However, how value is used to guide behaviour independently from learning, as in risky choice, has rarely been examined in this disorder. We studied value-guided choice under risk in patients with schizophrenia and in controls using a task requiring a choice between a certain monetary reward, varying trial-by-trial, and a gamble offering an equal probability of getting double this certain amount or nothing. We observed that patients compared to controls exhibited reduced sensitivity to values, implying that their choices failed to flexibly adapt to the specific values on offer. Moreover, the degree of this value sensitivity inversely correlated with aberrant salience experience, suggesting that the inability to tune choice to value may be a key element of aberrant salience in the illness. Our results help clarify the cognitive mechanisms underlying improper attribution of value in schizophrenia and may thus inform cognitive interventions aimed at reinstating value sensitivity in patients.
Collapse
Affiliation(s)
- Cristina Martinelli
- Department of Psychosis Studies, Institute of Psychiatry, Psychology & Neuroscience, King's College London, 16 De Crespigny Park, SE5 8AF London, United Kingdom.
| | - Francesco Rigoli
- Wellcome Trust Centre for Neuroimaging, University College London, 12 Queen's Square, WC1N 3BG London, United Kingdom
| | - Ray J Dolan
- Wellcome Trust Centre for Neuroimaging, University College London, 12 Queen's Square, WC1N 3BG London, United Kingdom; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, WC1B 5EH London, United Kingdom
| | - Sukhwinder S Shergill
- Department of Psychosis Studies, Institute of Psychiatry, Psychology & Neuroscience, King's College London, 16 De Crespigny Park, SE5 8AF London, United Kingdom
| |
Collapse
|
25
|
Normalized value coding explains dynamic adaptation in the human valuation process. Proc Natl Acad Sci U S A 2017; 114:12696-12701. [PMID: 29133418 DOI: 10.1073/pnas.1715293114] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The notion of subjective value is central to choice theories in ecology, economics, and psychology, serving as an integrated decision variable by which options are compared. Subjective value is often assumed to be an absolute quantity, determined in a static manner by the properties of an individual option. Recent neurobiological studies, however, have shown that neural value coding dynamically adapts to the statistics of the recent reward environment, introducing an intrinsic temporal context dependence into the neural representation of value. Whether valuation exhibits this kind of dynamic adaptation at the behavioral level is unknown. Here, we show that the valuation process in human subjects adapts to the history of previous values, with current valuations varying inversely with the average value of recently observed items. The dynamics of this adaptive valuation are captured by divisive normalization, linking these temporal context effects to spatial context effects in decision making as well as spatial and temporal context effects in perception. These findings suggest that adaptation is a universal feature of neural information processing and offer a unifying explanation for contextual phenomena in fields ranging from visual psychophysics to economic choice.
Collapse
|
26
|
Rigoli F, Mathys C, Friston KJ, Dolan RJ. A unifying Bayesian account of contextual effects in value-based choice. PLoS Comput Biol 2017; 13:e1005769. [PMID: 28981514 PMCID: PMC5645156 DOI: 10.1371/journal.pcbi.1005769] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Revised: 10/17/2017] [Accepted: 09/11/2017] [Indexed: 11/18/2022] Open
Abstract
Empirical evidence suggests the incentive value of an option is affected by other options available during choice and by options presented in the past. These contextual effects are hard to reconcile with classical theories and have inspired accounts where contextual influences play a crucial role. However, each account only addresses one or the other of the empirical findings and a unifying perspective has been elusive. Here, we offer a unifying theory of context effects on incentive value attribution and choice based on normative Bayesian principles. This formulation assumes that incentive value corresponds to a precision-weighted prediction error, where predictions are based upon expectations about reward. We show that this scheme explains a wide range of contextual effects, such as those elicited by other options available during choice (or within-choice context effects). These include both conditions in which choice requires an integration of multiple attributes and conditions where a multi-attribute integration is not necessary. Moreover, the same scheme explains context effects elicited by options presented in the past or between-choice context effects. Our formulation encompasses a wide range of contextual influences (comprising both within- and between-choice effects) by calling on Bayesian principles, without invoking ad-hoc assumptions. This helps clarify the contextual nature of incentive value and choice behaviour and may offer insights into psychopathologies characterized by dysfunctional decision-making, such as addiction and pathological gambling.
Collapse
Affiliation(s)
- Francesco Rigoli
- The Wellcome Trust Centre for Neuroimaging, UCL, 12 Queen Square, London, United Kingdom
- City, University of London, Northampton Square, London, United Kingdom
| | - Christoph Mathys
- Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Karl J. Friston
- The Wellcome Trust Centre for Neuroimaging, UCL, 12 Queen Square, London, United Kingdom
| | - Raymond J. Dolan
- The Wellcome Trust Centre for Neuroimaging, UCL, 12 Queen Square, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| |
Collapse
|
27
|
Rigoli F, Chew B, Dayan P, Dolan RJ. Learning Contextual Reward Expectations for Value Adaptation. J Cogn Neurosci 2017; 30:50-69. [PMID: 28949824 DOI: 10.1162/jocn_a_01191] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Substantial evidence indicates that subjective value is adapted to the statistics of reward expected within a given temporal context. However, how these contextual expectations are learned is poorly understood. To examine such learning, we exploited a recent observation that participants performing a gambling task adjust their preferences as a function of context. We show that, in the absence of contextual cues providing reward information, an average reward expectation was learned from recent past experience. Learning dependent on contextual cues emerged when two contexts alternated at a fast rate, whereas both cue-independent and cue-dependent forms of learning were apparent when two contexts alternated at a slower rate. Motivated by these behavioral findings, we reanalyzed a previous fMRI data set to probe the neural substrates of learning contextual reward expectations. We observed a form of reward prediction error related to average reward such that, at option presentation, activity in ventral tegmental area/substantia nigra and ventral striatum correlated positively and negatively, respectively, with the actual and predicted value of options. Moreover, an inverse correlation between activity in ventral tegmental area/substantia nigra (but not striatum) and predicted option value was greater in participants showing enhanced choice adaptation to context. The findings help understanding the mechanisms underlying learning of contextual reward expectation.
Collapse
Affiliation(s)
- Francesco Rigoli
- The Wellcome Trust Centre for Neuroimaging at University College London
| | - Benjamin Chew
- The Wellcome Trust Centre for Neuroimaging at University College London.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, UK
| | - Peter Dayan
- Gatsby Computational Neuroscience Unit, University College London
| | - Raymond J Dolan
- The Wellcome Trust Centre for Neuroimaging at University College London.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, UK
| |
Collapse
|
28
|
Separate mesocortical and mesolimbic pathways encode effort and reward learning signals. Proc Natl Acad Sci U S A 2017; 114:E7395-E7404. [PMID: 28808037 DOI: 10.1073/pnas.1705643114] [Citation(s) in RCA: 88] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Optimal decision making mandates organisms learn the relevant features of choice options. Likewise, knowing how much effort we should expend can assume paramount importance. A mesolimbic network supports reward learning, but it is unclear whether other choice features, such as effort learning, rely on this same network. Using computational fMRI, we show parallel encoding of effort and reward prediction errors (PEs) within distinct brain regions, with effort PEs expressed in dorsomedial prefrontal cortex and reward PEs in ventral striatum. We show a common mesencephalic origin for these signals evident in overlapping, but spatially dissociable, dopaminergic midbrain regions expressing both types of PE. During action anticipation, reward and effort expectations were integrated in ventral striatum, consistent with a computation of an overall net benefit of a stimulus. Thus, we show that motivationally relevant stimulus features are learned in parallel dopaminergic pathways, with formation of an integrated utility signal at choice.
Collapse
|
29
|
Rigoli F, Pezzulo G, Dolan R, Friston K. A Goal-Directed Bayesian Framework for Categorization. Front Psychol 2017; 8:408. [PMID: 28382008 PMCID: PMC5360703 DOI: 10.3389/fpsyg.2017.00408] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2016] [Accepted: 03/06/2017] [Indexed: 12/03/2022] Open
Abstract
Categorization is a fundamental ability for efficient behavioral control. It allows organisms to remember the correct responses to categorical cues and not for every stimulus encountered (hence eluding computational cost or complexity), and to generalize appropriate responses to novel stimuli dependant on category assignment. Assuming the brain performs Bayesian inference, based on a generative model of the external world and future goals, we propose a computational model of categorization in which important properties emerge. These properties comprise the ability to infer latent causes of sensory experience, a hierarchical organization of latent causes, and an explicit inclusion of context and action representations. Crucially, these aspects derive from considering the environmental statistics that are relevant to achieve goals, and from the fundamental Bayesian principle that any generative model should be preferred over alternative models based on an accuracy-complexity trade-off. Our account is a step toward elucidating computational principles of categorization and its role within the Bayesian brain hypothesis.
Collapse
Affiliation(s)
- Francesco Rigoli
- The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London London, UK
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies - National Research Council Rome, Italy
| | - Raymond Dolan
- The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College LondonLondon, UK; Max Planck UCL Centre for Computational Psychiatry and Ageing ResearchLondon, UK
| | - Karl Friston
- The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London London, UK
| |
Collapse
|
30
|
Friston K, FitzGerald T, Rigoli F, Schwartenbeck P, Pezzulo G. Active Inference: A Process Theory. Neural Comput 2016; 29:1-49. [PMID: 27870614 DOI: 10.1162/neco_a_00912] [Citation(s) in RCA: 480] [Impact Index Per Article: 53.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
This article describes a process theory based on active inference and belief propagation. Starting from the premise that all neuronal processing (and action selection) can be explained by maximizing Bayesian model evidence-or minimizing variational free energy-we ask whether neuronal responses can be described as a gradient descent on variational free energy. Using a standard (Markov decision process) generative model, we derive the neuronal dynamics implicit in this description and reproduce a remarkable range of well-characterized neuronal phenomena. These include repetition suppression, mismatch negativity, violation responses, place-cell activity, phase precession, theta sequences, theta-gamma coupling, evidence accumulation, race-to-bound dynamics, and transfer of dopamine responses. Furthermore, the (approximately Bayes' optimal) behavior prescribed by these dynamics has a degree of face validity, providing a formal explanation for reward seeking, context learning, and epistemic foraging. Technically, the fact that a gradient descent appears to be a valid description of neuronal activity means that variational free energy is a Lyapunov function for neuronal dynamics, which therefore conform to Hamilton's principle of least action.
Collapse
Affiliation(s)
- Karl Friston
- Wellcome Trust Centre for Neuroimaging, UCL, London WC1N 3BG, U.K.
| | - Thomas FitzGerald
- Wellcome Trust Centre for Neuroimaging, UCL, London WC1N 3BG, U.K., and Max Planck-UCL Centre for Computational Psychiatry and Ageing Research, London WC1B 5BE, U.K.
| | - Francesco Rigoli
- Wellcome Trust Centre for Neuroimaging, UCL, London WC1N 3BG, U.K.
| | - Philipp Schwartenbeck
- Wellcome Trust Centre for Neuroimaging, UCL, London WC1N 3BG, U.K.; Max Planck-UCL Centre for Computational Psychiatry and Ageing Research, London, WC1B 5BE, U.K.; Centre for Neurocognitive Research, University of Salzburg, 5020 Salzburg, Austria; and Neuroscience Institute, Christian-Doppler-Klinik, Paracelsus Medical University Salzburg, A-5020 Salzburg, Austria
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, 00185 Rome, Italy
| |
Collapse
|