1
|
Kao CH, Feng GW, Hur JK, Jarvis H, Rutledge RB. Computational models of subjective feelings in psychiatry. Neurosci Biobehav Rev 2023; 145:105008. [PMID: 36549378 PMCID: PMC9990828 DOI: 10.1016/j.neubiorev.2022.105008] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 12/02/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022]
Abstract
Research in computational psychiatry is dominated by models of behavior. Subjective experience during behavioral tasks is not well understood, even though it should be relevant to understanding the symptoms of psychiatric disorders. Here, we bridge this gap and review recent progress in computational models for subjective feelings. For example, happiness reflects not how well people are doing, but whether they are doing better than expected. This dependence on recent reward prediction errors is intact in major depression, although depressive symptoms lower happiness during tasks. Uncertainty predicts subjective feelings of stress in volatile environments. Social prediction errors influence feelings of self-worth more in individuals with low self-esteem despite a reduced willingness to change beliefs due to social feedback. Measuring affective state during behavioral tasks provides a tool for understanding psychiatric symptoms that can be dissociable from behavior. When smartphone tasks are collected longitudinally, subjective feelings provide a potential means to bridge the gap between lab-based behavioral tasks and real-life behavior, emotion, and psychiatric symptoms.
Collapse
Affiliation(s)
- Chang-Hao Kao
- Department of Psychology, Yale University, New Haven, CT, USA.
| | - Gloria W Feng
- Department of Psychology, Yale University, New Haven, CT, USA
| | - Jihyun K Hur
- Department of Psychology, Yale University, New Haven, CT, USA
| | - Huw Jarvis
- Department of Psychology, Yale University, New Haven, CT, USA; Turner Institute for Brain and Mental Health, Monash University, Clayton, Victoria, Australia; School of Psychological Sciences, Monash University, Clayton, Victoria, Australia
| | - Robb B Rutledge
- Department of Psychology, Yale University, New Haven, CT, USA; Wellcome Centre for Human Neuroimaging, University College London, London, UK.
| |
Collapse
|
2
|
Reward influences cortical representations: Commentary on the paper "The growth of cognition: free energy minimization and the embryogenesis of cortical computation" by Wright and Bourke. Phys Life Rev 2020; 36:3-4. [PMID: 33278814 DOI: 10.1016/j.plrev.2020.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 11/17/2020] [Indexed: 11/24/2022]
|
3
|
Fumagalli R. How thin rational choice theory explains choices. STUDIES IN HISTORY AND PHILOSOPHY OF SCIENCE 2020; 83:63-74. [PMID: 32958282 DOI: 10.1016/j.shpsa.2020.03.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 01/23/2020] [Accepted: 03/18/2020] [Indexed: 06/11/2023]
Abstract
The critics of rational choice theory (RCT) frequently build on the contrast between so-called thick and thin applications of RCT to argue that thin RCT lacks the potential to explain the choices of real-world agents. In this paper, I draw on often-cited RCT applications in several decision sciences to demonstrate that despite this prominent critique there are at least two different senses in which thin RCT can explain real-world agents' choices. I then defend this thesis against the most influential objections put forward by the critics of RCT. In doing so, I explicate the implications of my thesis for the ongoing philosophical debate concerning the explanatory potential of RCT and the comparative merits of widely endorsed accounts of explanation.
Collapse
Affiliation(s)
- Roberto Fumagalli
- Department of Political Economy, King's College London, UK; Department of Philosophy, London School of Economics, UK; Behavioral Ethics Lab, University of Pennsylvania, US.
| |
Collapse
|
4
|
Neural Mechanisms of Reward Prediction Error in Autism Spectrum Disorder. AUTISM RESEARCH AND TREATMENT 2019; 2019:5469191. [PMID: 31354993 PMCID: PMC6634058 DOI: 10.1155/2019/5469191] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Revised: 03/04/2019] [Accepted: 04/23/2019] [Indexed: 01/03/2023]
Abstract
Few studies have explored neural mechanisms of reward learning in ASD despite evidence of behavioral impairments of predictive abilities in ASD. To investigate the neural correlates of reward prediction errors in ASD, 16 adults with ASD and 14 typically developing controls performed a prediction error task during fMRI scanning. Results revealed greater activation in the ASD group in the left paracingulate gyrus during signed prediction errors and the left insula and right frontal pole during thresholded unsigned prediction errors. Findings support atypical neural processing of reward prediction errors in ASD in frontostriatal regions critical for prediction coding and reward learning. Results provide a neural basis for impairments in reward learning that may contribute to traits common in ASD (e.g., intolerance of unpredictability).
Collapse
|
5
|
Abstract
What if the brain's response to reward occurs even when there is no reward? Wouldn't that be a further concern for people prone to problem gambling and other forms of addiction, like those related to eating? Electroencephalography was employed to investigate this possibility using probabilistic feedback manipulations and measures of known event-related potentials (ERPs) related to reward processing. We tested the hypothesis-that reward-based ERPs would occur even in the absence of a tangible reward and when manipulations on expectation are implicit. The well-known P300 response potential was a key focus, and was assessed in non-gambling volunteer undergraduates on a task involving experimentally-manipulated probabilities of positive or negative feedback comprising three trial types-80, 50, or 20% positive feedback. A feedback stimulus (F1) followed a guess response between two possible outcomes (implicit win/loss), and then a second feedback stimulus (F2) was presented to confirm an alleged 'win' or 'loss' (explicit win/loss). Results revealed that amplitude of the P300 in F1-locked data (implicit manipulation) was larger (more positive) on average for feedback outcomes that were manipulated to be less likely than expected. The effect is pronounced after increased time on task (later trials), even though the majority of participants were not explicitly aware of our probability manipulations. For the explicit effects in F2-locked data, no meaningful or significant effects were observed. These findings point to the existence of proposed success-response mechanisms that operate not only explicitly but also with implicit manipulations that do not involve any direct indication of a win or loss, and are not associated with tangible rewards. Thus, there seems to be a non-explicit form of perception (we call 'implicit') associated with an internal experience of wins/losses (in the absence of actual rewards or losses) that can be measured in associated brain processes. The potential significance of these findings is discussed in terms of implications for problem gambling.
Collapse
Affiliation(s)
- A Fielding
- Action Brain and Cognition Lab, Department of Psychology, University of Otago, Level 4, William James Building, 275 Leith Walk, Dunedin, 9016, New Zealand
| | - Y Fu
- Action Brain and Cognition Lab, Department of Psychology, University of Otago, Level 4, William James Building, 275 Leith Walk, Dunedin, 9016, New Zealand
| | - E A Franz
- Action Brain and Cognition Lab, Department of Psychology, University of Otago, Level 4, William James Building, 275 Leith Walk, Dunedin, 9016, New Zealand.
| |
Collapse
|
6
|
Will GJ, Rutledge RB, Moutoussis M, Dolan RJ. Neural and computational processes underlying dynamic changes in self-esteem. eLife 2017; 6. [PMID: 29061228 PMCID: PMC5655144 DOI: 10.7554/elife.28098] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Accepted: 09/06/2017] [Indexed: 12/26/2022] Open
Abstract
Self-esteem is shaped by the appraisals we receive from others. Here, we characterize neural and computational mechanisms underlying this form of social influence. We introduce a computational model that captures fluctuations in self-esteem engendered by prediction errors that quantify the difference between expected and received social feedback. Using functional MRI, we show these social prediction errors correlate with activity in ventral striatum/subgenual anterior cingulate cortex, while updates in self-esteem resulting from these errors co-varied with activity in ventromedial prefrontal cortex (vmPFC). We linked computational parameters to psychiatric symptoms using canonical correlation analysis to identify an 'interpersonal vulnerability' dimension. Vulnerability modulated the expression of prediction error responses in anterior insula and insula-vmPFC connectivity during self-esteem updates. Our findings indicate that updating of self-evaluative beliefs relies on learning mechanisms akin to those used in learning about others. Enhanced insula-vmPFC connectivity during updating of those beliefs may represent a marker for psychiatric vulnerability.
Collapse
Affiliation(s)
- Geert-Jan Will
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| | - Robb B Rutledge
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| | - Michael Moutoussis
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| | - Raymond J Dolan
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| |
Collapse
|
7
|
Rutledge RB, Moutoussis M, Smittenaar P, Zeidman P, Taylor T, Hrynkiewicz L, Lam J, Skandali N, Siegel JZ, Ousdal OT, Prabhu G, Dayan P, Fonagy P, Dolan RJ. Association of Neural and Emotional Impacts of Reward Prediction Errors With Major Depression. JAMA Psychiatry 2017; 74:790-797. [PMID: 28678984 PMCID: PMC5710549 DOI: 10.1001/jamapsychiatry.2017.1713] [Citation(s) in RCA: 101] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
IMPORTANCE Major depressive disorder (MDD) is associated with deficits in representing reward prediction errors (RPEs), which are the difference between experienced and predicted reward. Reward prediction errors underlie learning of values in reinforcement learning models, are represented by phasic dopamine release, and are known to affect momentary mood. OBJECTIVE To combine functional neuroimaging, computational modeling, and smartphone-based large-scale data collection to test, in the absence of learning-related concerns, the hypothesis that depression attenuates the impact of RPEs. DESIGN, SETTING, AND PARTICIPANTS Functional magnetic resonance imaging (fMRI) data were collected on 32 individuals with moderate MDD and 20 control participants who performed a probabilistic reward task. A risky decision task with repeated happiness ratings as a measure of momentary mood was also tested in the laboratory in 74 participants and with a smartphone-based platform in 1833 participants. The study was conducted from November 20, 2012, to February 17, 2015. MAIN OUTCOMES AND MEASURES Blood oxygen level-dependent activity was measured in ventral striatum, a dopamine target area known to represent RPEs. Momentary mood was measured during risky decision making. RESULTS Of the 52 fMRI participants (mean [SD] age, 34.0 [9.1] years), 30 (58%) were women and 32 had MDD. Of the 74 participants in the laboratory risky decision task (mean age, 34.2 [10.3] years), 44 (59%) were women and 54 had MDD. Of the smartphone group, 543 (30%) had a depression history and 1290 (70%) had no depression history; 918 (50%) were women, and 593 (32%) were younger than 30 years. Contrary to previous results in reinforcement learning tasks, individuals with moderate depression showed intact RPE signals in ventral striatum (z = 3.16; P = .002) that did not differ significantly from controls (z = 0.91; P = .36). Symptom severity correlated with baseline mood parameters in laboratory (ρ = -0.54; P < 1 × 10-6) and smartphone (ρ = -0.30; P < 1 × 10-39) data. However, participants with depression showed an intact association between RPEs and happiness in a computational model of momentary mood dynamics (z = 4.55; P < .001) that was not attenuated compared with controls (z = -0.42; P = .67). CONCLUSIONS AND RELEVANCE The neural and emotional impact of RPEs is intact in major depression. These results suggest that depression does not affect the expression of dopaminergic RPEs and that attenuated RPEs in previous reports may reflect downstream effects more closely related to aberrant behavior. The correlation between symptom severity and baseline mood parameters supports an association between depression and momentary mood fluctuations during cognitive tasks. These results demonstrate a potential for smartphones in large-scale computational phenotyping, which is a goal for computational psychiatry.
Collapse
Affiliation(s)
- Robb B. Rutledge
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, England,Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Michael Moutoussis
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, England,Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Peter Smittenaar
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Peter Zeidman
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Tanja Taylor
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Louise Hrynkiewicz
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Jordan Lam
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Nikolina Skandali
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Jenifer Z. Siegel
- Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Olga T. Ousdal
- Wellcome Trust Centre for Neuroimaging, University College London, London, England,Department of Radiology, Haukeland University Hospital, Bergen, Norway
| | - Gita Prabhu
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, England,Wellcome Trust Centre for Neuroimaging, University College London, London, England
| | - Peter Dayan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, England,Gatsby Computational Neuroscience Unit, University College London, London, England
| | - Peter Fonagy
- Developmental Neuroscience Unit, Anna Freud Centre, London, England
| | - Raymond J. Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, England,Wellcome Trust Centre for Neuroimaging, University College London, London, England
| |
Collapse
|
8
|
Fouragnan E, Queirazza F, Retzler C, Mullinger KJ, Philiastides MG. Spatiotemporal neural characterization of prediction error valence and surprise during reward learning in humans. Sci Rep 2017; 7:4762. [PMID: 28684734 PMCID: PMC5500565 DOI: 10.1038/s41598-017-04507-w] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Accepted: 05/17/2017] [Indexed: 02/01/2023] Open
Abstract
Reward learning depends on accurate reward associations with potential choices. These associations can be attained with reinforcement learning mechanisms using a reward prediction error (RPE) signal (the difference between actual and expected rewards) for updating future reward expectations. Despite an extensive body of literature on the influence of RPE on learning, little has been done to investigate the potentially separate contributions of RPE valence (positive or negative) and surprise (absolute degree of deviation from expectations). Here, we coupled single-trial electroencephalography with simultaneously acquired fMRI, during a probabilistic reversal-learning task, to offer evidence of temporally overlapping but largely distinct spatial representations of RPE valence and surprise. Electrophysiological variability in RPE valence correlated with activity in regions of the human reward network promoting approach or avoidance learning. Electrophysiological variability in RPE surprise correlated primarily with activity in regions of the human attentional network controlling the speed of learning. Crucially, despite the largely separate spatial extend of these representations our EEG-informed fMRI approach uniquely revealed a linear superposition of the two RPE components in a smaller network encompassing visuo-mnemonic and reward areas. Activity in this network was further predictive of stimulus value updating indicating a comparable contribution of both signals to reward learning.
Collapse
Affiliation(s)
- Elsa Fouragnan
- Institute of Neuroscience & Psychology, University of Glasgow, Glasgow, UK.
- Department of Experimental Psychology, University of Oxford, Oxford, UK.
| | - Filippo Queirazza
- Institute of Neuroscience & Psychology, University of Glasgow, Glasgow, UK
| | - Chris Retzler
- Institute of Neuroscience & Psychology, University of Glasgow, Glasgow, UK
- Department of Behavioural & Social Sciences, University of Huddersfield, Huddersfield, UK
| | - Karen J Mullinger
- Sir Peter Mansfield Magnetic Resonance Center, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- Birmingham University Imaging Centre, School of Psychology, University of Birmingham, Birmingham, UK
| | | |
Collapse
|
9
|
Rutledge RB, de Berker AO, Espenhahn S, Dayan P, Dolan RJ. The social contingency of momentary subjective well-being. Nat Commun 2016; 7:11825. [PMID: 27293212 PMCID: PMC4909984 DOI: 10.1038/ncomms11825] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 05/04/2016] [Indexed: 11/29/2022] Open
Abstract
Although social comparison is a known determinant of overall life satisfaction, it is not clear how it affects moment-to-moment variation in subjective emotional state. Using a novel social decision task combined with computational modelling, we show that a participant's subjective emotional state reflects not only the impact of rewards they themselves receive, but also the rewards received by a social partner. Unequal outcomes, whether advantageous or disadvantageous, reduce average momentary happiness. Furthermore, the relative impacts of advantageous and disadvantageous inequality on momentary happiness at the individual level predict a subject's generosity in a separate dictator game. These findings demonstrate a powerful social influence upon subjective emotional state, where emotional reactivity to inequality is strongly predictive of altruism in an independent task domain.
Collapse
Affiliation(s)
- Robb B. Rutledge
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK
| | - Archy O. de Berker
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK
- Sobell Department of Motor Neuroscience and Movement Disorders, University College London, London WC1N 3BG, UK
| | - Svenja Espenhahn
- Sobell Department of Motor Neuroscience and Movement Disorders, University College London, London WC1N 3BG, UK
| | - Peter Dayan
- Gatsby Computational Neuroscience Unit, University College London, London W1T 4JG, UK
| | - Raymond J. Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK
| |
Collapse
|
10
|
Eldar E, Rutledge RB, Dolan RJ, Niv Y. Mood as Representation of Momentum. Trends Cogn Sci 2015; 20:15-24. [PMID: 26545853 PMCID: PMC4703769 DOI: 10.1016/j.tics.2015.07.010] [Citation(s) in RCA: 161] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Revised: 07/21/2015] [Accepted: 07/27/2015] [Indexed: 01/24/2023]
Abstract
Experiences affect mood, which in turn affects subsequent experiences. Recent studies suggest two specific principles. First, mood depends on how recent reward outcomes differ from expectations. Second, mood biases the way we perceive outcomes (e.g., rewards), and this bias affects learning about those outcomes. We propose that this two-way interaction serves to mitigate inefficiencies in the application of reinforcement learning to real-world problems. Specifically, we propose that mood represents the overall momentum of recent outcomes, and its biasing influence on the perception of outcomes ‘corrects’ learning to account for environmental dependencies. We describe potential dysfunctions of this adaptive mechanism that might contribute to the symptoms of mood disorders. With increasing use of computational models to understand human behavior, scientists have begun to model the dynamics of subjective states such as mood. Recent data suggest that mood reflects the cumulative impact of differences between reward outcomes and expectations. Behavioral and neural findings suggest that mood biases the perception of reward outcomes such that outcomes are perceived as better when one is in a good mood relative to when one is in a bad mood. These two lines of research establish a bidirectional interaction between mood and reinforcement learning, which may play an important adaptive role in healthy behavior, and whose dysfunction might contribute to psychiatric disorders.
Collapse
Affiliation(s)
- Eran Eldar
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK.
| | - Robb B Rutledge
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK
| | - Raymond J Dolan
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK
| | - Yael Niv
- Princeton Neuroscience Institute and Psychology Department, Princeton University, Princeton, NJ 08544, USA
| |
Collapse
|
11
|
Stenner MP, Rutledge RB, Zaehle T, Schmitt FC, Kopitzki K, Kowski AB, Voges J, Heinze HJ, Dolan RJ. No unified reward prediction error in local field potentials from the human nucleus accumbens: evidence from epilepsy patients. J Neurophysiol 2015; 114:781-92. [PMID: 26019312 PMCID: PMC4533060 DOI: 10.1152/jn.00260.2015] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 05/26/2015] [Indexed: 11/22/2022] Open
Abstract
Functional magnetic resonance imaging (fMRI), cyclic voltammetry, and single-unit electrophysiology studies suggest that signals measured in the nucleus accumbens (Nacc) during value-based decision making represent reward prediction errors (RPEs), the difference between actual and predicted rewards. Here, we studied the precise temporal and spectral pattern of reward-related signals in the human Nacc. We recorded local field potentials (LFPs) from the Nacc of six epilepsy patients during an economic decision-making task. On each trial, patients decided whether to accept or reject a gamble with equal probabilities of a monetary gain or loss. The behavior of four patients was consistent with choices being guided by value expectations. Expected value signals before outcome onset were observed in three of those patients, at varying latencies and with nonoverlapping spectral patterns. Signals after outcome onset were correlated with RPE regressors in all subjects. However, further analysis revealed that these signals were better explained as outcome valence rather than RPE signals, with gamble gains and losses differing in the power of beta oscillations and in evoked response amplitudes. Taken together, our results do not support the idea that postsynaptic potentials in the Nacc represent a RPE that unifies outcome magnitude and prior value expectation. We discuss the generalizability of our findings to healthy individuals and the relation of our results to measurements of RPE signals obtained from the Nacc with other methods.
Collapse
Affiliation(s)
- Max-Philipp Stenner
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom; Department of Neurology, Otto-von-Guericke University, Magdeburg, Germany;
| | - Robb B Rutledge
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | - Tino Zaehle
- Department of Neurology, Otto-von-Guericke University, Magdeburg, Germany; Department of Behavioral Neurology, Leibniz Institute for Neurobiology, Magdeburg, Germany
| | | | - Klaus Kopitzki
- Department of Neurology, Otto-von-Guericke University, Magdeburg, Germany
| | - Alexander B Kowski
- Epilepsy-Center Berlin-Brandenburg, Department of Neurology, Charité Universitätsmedizin, Berlin, Germany; and
| | - Jürgen Voges
- Department of Behavioral Neurology, Leibniz Institute for Neurobiology, Magdeburg, Germany; Department of Stereotactic Neurosurgery, Otto-von-Guericke University, Magdeburg, Germany
| | - Hans-Jochen Heinze
- Department of Neurology, Otto-von-Guericke University, Magdeburg, Germany; Department of Behavioral Neurology, Leibniz Institute for Neurobiology, Magdeburg, Germany
| | - Raymond J Dolan
- Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| |
Collapse
|
12
|
Frydman C, Barberis N, Camerer C, Bossaerts P, Rangel A. Using Neural Data to Test A Theory of Investor Behavior: An Application to Realization Utility. THE JOURNAL OF FINANCE 2014; 69:907-946. [PMID: 25774065 PMCID: PMC4357577 DOI: 10.1111/jofi.12126] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
We use measures of neural activity provided by functional magnetic resonance imaging (fMRI) to test the "realization utility" theory of investor behavior, which posits that people derive utility directly from the act of realizing gains and losses. Subjects traded stocks in an experimental market while we measured their brain activity. We find that all subjects exhibit a strong disposition effect in their trading, even though it is suboptimal. Consistent with the realization utility explanation for this behavior, we find that activity in the ventromedial prefrontal cortex, an area known to encode the value of options during choices, correlates with the capital gains of potential trades; that the neural measures of realization utility correlate across subjects with their individual tendency to exhibit a disposition effect; and that activity in the ventral striatum, an area known to encode information about changes in the present value of experienced utility, exhibits a positive response when subjects realize capital gains. These results provide support for the realization utility model and, more generally, demonstrate how neural data can be helpful in testing models of investor behavior.
Collapse
Affiliation(s)
- Cary Frydman
- Marshall School of Business, University of Southern California
| | | | | | | | | |
Collapse
|
13
|
DeWitt E. Neuroeconomics: A Formal Test of Dopamine’s Role in Reinforcement Learning. Curr Biol 2014; 24:R321-4. [DOI: 10.1016/j.cub.2014.02.055] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
14
|
Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term. J Neurosci 2014; 34:698-704. [PMID: 24431428 DOI: 10.1523/jneurosci.2489-13.2014] [Citation(s) in RCA: 185] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Making predictions about the rewards associated with environmental stimuli and updating those predictions through feedback is an essential aspect of adaptive behavior. Theorists have argued that dopamine encodes a reward prediction error (RPE) signal that is used in such a reinforcement learning process. Recent work with fMRI has demonstrated that the BOLD signal in dopaminergic target areas meets both necessary and sufficient conditions of an axiomatic model of the RPE hypothesis. However, there has been no direct evidence that dopamine release itself also meets necessary and sufficient criteria for encoding an RPE signal. Further, the fact that dopamine neurons have low tonic firing rates that yield a limited dynamic range for encoding negative RPEs has led to significant debate about whether positive and negative prediction errors are encoded on a similar scale. To address both of these issues, we used fast-scan cyclic voltammetry to measure reward-evoked dopamine release at carbon fiber electrodes chronically implanted in the nucleus accumbens core of rats trained on a probabilistic decision-making task. We demonstrate that dopamine concentrations transmit a bidirectional RPE signal with symmetrical encoding of positive and negative RPEs. Our findings strengthen the case that changes in dopamine concentration alone are sufficient to encode the full range of RPEs necessary for reinforcement learning.
Collapse
|
15
|
Colombo M. Deep and beautiful. The reward prediction error hypothesis of dopamine. STUDIES IN HISTORY AND PHILOSOPHY OF BIOLOGICAL AND BIOMEDICAL SCIENCES 2014; 45:57-67. [PMID: 24252364 DOI: 10.1016/j.shpsc.2013.10.006] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Revised: 10/21/2013] [Accepted: 10/25/2013] [Indexed: 06/02/2023]
Abstract
According to the reward-prediction error hypothesis (RPEH) of dopamine, the phasic activity of dopaminergic neurons in the midbrain signals a discrepancy between the predicted and currently experienced reward of a particular event. It can be claimed that this hypothesis is deep, elegant and beautiful, representing one of the largest successes of computational neuroscience. This paper examines this claim, making two contributions to existing literature. First, it draws a comprehensive historical account of the main steps that led to the formulation and subsequent success of the RPEH. Second, in light of this historical account, it explains in which sense the RPEH is explanatory and under which conditions it can be justifiably deemed deeper than the incentive salience hypothesis of dopamine, which is arguably the most prominent contemporary alternative to the RPEH.
Collapse
Affiliation(s)
- Matteo Colombo
- Tilburg Center for Logic, General Ethics, and Philosophy of Science, Tilburg University, P.O. Box 90153, 5000 LE Tilburg, The Netherlands.
| |
Collapse
|
16
|
Gold BP, Frank MJ, Bogert B, Brattico E. Pleasurable music affects reinforcement learning according to the listener. Front Psychol 2013; 4:541. [PMID: 23970875 PMCID: PMC3748532 DOI: 10.3389/fpsyg.2013.00541] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Accepted: 07/31/2013] [Indexed: 01/14/2023] Open
Abstract
Mounting evidence links the enjoyment of music to brain areas implicated in emotion and the dopaminergic reward system. In particular, dopamine release in the ventral striatum seems to play a major role in the rewarding aspect of music listening. Striatal dopamine also influences reinforcement learning, such that subjects with greater dopamine efficacy learn better to approach rewards while those with lesser dopamine efficacy learn better to avoid punishments. In this study, we explored the practical implications of musical pleasure through its ability to facilitate reinforcement learning via non-pharmacological dopamine elicitation. Subjects from a wide variety of musical backgrounds chose a pleasurable and a neutral piece of music from an experimenter-compiled database, and then listened to one or both of these pieces (according to pseudo-random group assignment) as they performed a reinforcement learning task dependent on dopamine transmission. We assessed musical backgrounds as well as typical listening patterns with the new Helsinki Inventory of Music and Affective Behaviors (HIMAB), and separately investigated behavior for the training and test phases of the learning task. Subjects with more musical experience trained better with neutral music and tested better with pleasurable music, while those with less musical experience exhibited the opposite effect. HIMAB results regarding listening behaviors and subjective music ratings indicate that these effects arose from different listening styles: namely, more affective listening in non-musicians and more analytical listening in musicians. In conclusion, musical pleasure was able to influence task performance, and the shape of this effect depended on group and individual factors. These findings have implications in affective neuroscience, neuroaesthetics, learning, and music therapy.
Collapse
Affiliation(s)
- Benjamin P Gold
- Cognitive Brain Research Unit, Institute of Behavioral Sciences, University of Helsinki Helsinki, Finland ; Department of Music, Finnish Center of Excellence in Interdisciplinary Music Research, University of Jyväskylä Jyväskylä, Finland
| | | | | | | |
Collapse
|
17
|
Lee AM, Oleson EB, Diergaarde L, Cheer JF, Pattij T. Cannabinoids and value-based decision making: implications for neurodegenerative disorders. ACTA ACUST UNITED AC 2012; 2:131-138. [PMID: 23162787 DOI: 10.1016/j.baga.2012.06.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
In recent years, disturbances in cognitive function have been increasingly recognized as important symptomatic phenomena in neurodegenerative diseases, including Parkinson's Disease (PD). Value-based decision making in particular is an important executive cognitive function that is not only impaired in patients with PD, but also shares neural substrates with PD in basal ganglia structures and the dopamine system. Interestingly, the endogenous cannabinoid system modulates dopamine function and subsequently value-based decision making. This review will provide an overview of the interdisciplinary research that has influenced our understanding of value-based decision making and the role of dopamine, particularly in the context of reinforcement learning theories, as well as recent animal and human studies that demonstrate the modulatory role of activation of cannabinoid receptors by exogenous agonists or their naturally occurring ligands. The implications of this research for the symptomatology of and potential treatments for PD are also discussed.
Collapse
Affiliation(s)
- Angela M Lee
- Department of Anatomy and Neurosciences, Neuroscience Campus Amsterdam, VU university medical center, Amsterdam, the Netherlands
| | | | | | | | | |
Collapse
|
18
|
Grygolec J, Coricelli G, Rustichini A. Positive interaction of social comparison and personal responsibility for outcomes. Front Psychol 2012; 3:25. [PMID: 22371706 PMCID: PMC3283894 DOI: 10.3389/fpsyg.2012.00025] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2011] [Accepted: 01/19/2012] [Indexed: 01/23/2023] Open
Abstract
We formulate and test a model that allows sharp separation between two different ways in which environment affects evaluation of outcomes, by comparing social vs. private and personal responsibility vs. chance. In the experiment, subjects chose between two lotteries, one low-risk and one high-risk. They could then observe the outcomes. By varying the environment between private (they could observe the outcome of the chosen lottery and the outcome of the lottery they had not chosen) and social (they could observe the outcome of the lottery chosen by another subject) we can differentiate the response and brain activity following the feedback in social and private settings. The evidence suggests that envy and pride are significant motives driving decisions and outcomes evaluation, stronger than private emotions like regret and rejoice, with ventral striatum playing a key role. When we focus on the outcome evaluation stage we demonstrate that BOLD signal in ventral striatum is increasing in the difference between obtained and counterfactual payoffs. For a given difference in payoffs, striatal responses are more pronounced in social than in private environment. Moreover, a positive interaction (complementarity) between social comparison and personal responsibility is reflected in the pattern of activity in the ventral striatum. At decision stage we observe getting ahead of the Joneses effect in ventral striatum with subjective value of risk larger in social than in private environment.
Collapse
Affiliation(s)
- Jaroslaw Grygolec
- Center for Mind/Brain Sciences, University of Trento Rovereto, Italy
| | | | | |
Collapse
|
19
|
Glimcher PW. Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proc Natl Acad Sci U S A 2011; 108 Suppl 3:15647-54. [PMID: 21389268 PMCID: PMC3176615 DOI: 10.1073/pnas.1014269108] [Citation(s) in RCA: 497] [Impact Index Per Article: 38.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
A number of recent advances have been achieved in the study of midbrain dopaminergic neurons. Understanding these advances and how they relate to one another requires a deep understanding of the computational models that serve as an explanatory framework and guide ongoing experimental inquiry. This intertwining of theory and experiment now suggests very clearly that the phasic activity of the midbrain dopamine neurons provides a global mechanism for synaptic modification. These synaptic modifications, in turn, provide the mechanistic underpinning for a specific class of reinforcement learning mechanisms that now seem to underlie much of human and animal behavior. This review describes both the critical empirical findings that are at the root of this conclusion and the fantastic theoretical advances from which this conclusion is drawn.
Collapse
Affiliation(s)
- Paul W Glimcher
- Center for Neuroeconomics, New York University, New York, NY 10003, USA.
| |
Collapse
|
20
|
Choice from non-choice: predicting consumer preferences from blood oxygenation level-dependent signals obtained during passive viewing. J Neurosci 2011; 31:118-25. [PMID: 21209196 DOI: 10.1523/jneurosci.3214-10.2011] [Citation(s) in RCA: 149] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Decision-making is often viewed as a two-stage process, where subjective values are first assigned to each option and then the option of the highest value is selected. Converging evidence suggests that these subjective values are represented in the striatum and medial prefrontal cortex (MPFC). A separate line of evidence suggests that activation in the same areas represents the values of rewards even when choice is not required, as in classical conditioning tasks. However, it is unclear whether the same neural mechanism is engaged in both cases. To address this question we measured brain activation with functional magnetic resonance imaging while human subjects passively viewed individual consumer goods. We then sampled activation from predefined regions of interest and used it to predict subsequent choices between the same items made outside of the scanner. Our results show that activation in the striatum and MPFC in the absence of choice predicts subsequent choices, suggesting that these brain areas represent value in a similar manner whether or not choice is required.
Collapse
|
21
|
Abstract
Neuroimaging studies typically identify neural activity correlated with the predictions of highly parameterized models, like the many reward prediction error (RPE) models used to study reinforcement learning. Identified brain areas might encode RPEs or, alternatively, only have activity correlated with RPE model predictions. Here, we use an alternate axiomatic approach rooted in economic theory to formally test the entire class of RPE models on neural data. We show that measurements of human neural activity from the striatum, medial prefrontal cortex, amygdala, and posterior cingulate cortex satisfy necessary and sufficient conditions for the entire class of RPE models. However, activity measured from the anterior insula falsifies the axiomatic model, and therefore no RPE model can account for measured activity. Further analysis suggests the anterior insula might instead encode something related to the salience of an outcome. As cognitive neuroscience matures and models proliferate, formal approaches of this kind that assess entire model classes rather than specific model exemplars may take on increased significance.
Collapse
|
22
|
Thevarajah D, Webb R, Ferrall C, Dorris MC. Modeling the value of strategic actions in the superior colliculus. Front Behav Neurosci 2010; 3:57. [PMID: 20161807 PMCID: PMC2821176 DOI: 10.3389/neuro.08.057.2009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2009] [Accepted: 12/01/2009] [Indexed: 11/25/2022] Open
Abstract
In learning models of strategic game play, an agent constructs a valuation (action value) over possible future choices as a function of past actions and rewards. Choices are then stochastic functions of these action values. Our goal is to uncover a neural signal that correlates with the action value posited by behavioral learning models. We measured activity from neurons in the superior colliculus (SC), a midbrain region involved in planning saccadic eye movements, while monkeys performed two saccade tasks. In the strategic task, monkeys competed against a computer in a saccade version of the mixed-strategy game ”matching-pennies”. In the instructed task, saccades were elicited through explicit instruction rather than free choices. In both tasks neuronal activity and behavior were shaped by past actions and rewards with more recent events exerting a larger influence. Further, SC activity predicted upcoming choices during the strategic task and upcoming reaction times during the instructed task. Finally, we found that neuronal activity in both tasks correlated with an established learning model, the Experience Weighted Attraction model of action valuation (Camerer and Ho, 1999). Collectively, our results provide evidence that action values hypothesized by learning models are represented in the motor planning regions of the brain in a manner that could be used to select strategic actions.
Collapse
Affiliation(s)
- Dhushan Thevarajah
- Department of Physiology, Centre for Neuroscience Studies and Canadian Institutes of Health Research Group in Sensory-Motor Systems, Queen's University Kingston, ON, Canada
| | | | | | | |
Collapse
|