1. Gregorová K, Eldar E, Deserno L, Reiter AMF. A cognitive-computational account of mood swings in adolescence. Trends Cogn Sci 2024; 28:290-303. PMID: 38503636; DOI: 10.1016/j.tics.2024.02.006.
Abstract
Teenagers have a reputation for being fickle, in both their choices and their moods. This variability may help adolescents as they begin to independently navigate novel environments. Recently, however, adolescent moodiness has also been linked to psychopathology. Here, we consider adolescents' mood swings from a novel computational perspective, grounded in reinforcement learning (RL). This model proposes that mood is determined by surprises about outcomes in the environment, and how much we learn from these surprises. It additionally suggests that mood biases learning and choice in a bidirectional manner. Integrating independent lines of research, we sketch a cognitive-computational account of how adolescents' mood, learning, and choice dynamics influence each other, with implications for normative and psychopathological development.
Affiliation(s)
- Klára Gregorová
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital, Würzburg 97080, Germany; Department of Psychology, Julius-Maximilians-Universität, Würzburg 97070, Germany; German Center of Prevention Research on Mental Health, Würzburg 97080, Germany
- Eran Eldar
- Department of Psychology, Hebrew University of Jerusalem, Jerusalem 9190501, Israel; Department of Cognitive & Brain Sciences, Hebrew University of Jerusalem, Jerusalem 9190501, Israel
- Lorenz Deserno
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital, Würzburg 97080, Germany; Department of Psychology, Julius-Maximilians-Universität, Würzburg 97070, Germany; Department of Cognitive & Brain Sciences, Hebrew University of Jerusalem, Jerusalem 9190501, Israel; Department of Psychiatry and Psychotherapy, Technical University of Dresden, Dresden 01069, Germany
- Andrea M F Reiter
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital, Würzburg 97080, Germany; Department of Psychology, Julius-Maximilians-Universität, Würzburg 97070, Germany; German Center of Prevention Research on Mental Health, Würzburg 97080, Germany; Collaborative Research Centre 940 Volition and Cognitive Control, Technical University of Dresden, Dresden 01069, Germany
2. Karnick AT, Bauer BW, Capron DW. Negative mood and optimism bias: An experimental investigation of sadness and belief updating. J Behav Ther Exp Psychiatry 2024; 82:101910. PMID: 37714798; DOI: 10.1016/j.jbtep.2023.101910.
Abstract
BACKGROUND AND OBJECTIVES: Understanding how individuals integrate new information to form beliefs under changing emotional conditions is crucial to describing decision-making processes. Previous research suggests that although most people demonstrate bias toward optimistic appraisals of new information when updating beliefs, individuals with dysphoric psychiatric conditions (e.g., major depression) do not demonstrate this same bias. Despite these findings, limited research has investigated the relationship between affective states and belief updating processes. METHODS: We induced neutral and sad moods in participants and had them complete a belief-updating paradigm by estimating the likelihood of negative future events happening to them, viewing the actual likelihood, and then re-estimating their perceived likelihood. RESULTS: We observed that individuals updated their beliefs more after receiving desirable information relative to undesirable information under neutral conditions. Further, we found that individuals did not demonstrate unrealistic optimism under negative affective conditions. LIMITATIONS: This study incorporated a population of university students under laboratory conditions and would benefit from replication and extension in clinical populations and naturalistic settings. CONCLUSIONS: These findings suggest that momentary fluctuations in mood affect how individuals integrate information to form beliefs.
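The belief-updating paradigm described here is typically modeled with separate update weights for desirable and undesirable estimation errors. A minimal sketch, with illustrative weights (`lam_desirable`, `lam_undesirable`) rather than the authors' fitted values:

```python
def belief_update(prior, evidence, lam_desirable=0.6, lam_undesirable=0.3):
    """Move a risk estimate toward the base rate shown to the participant.
    Optimism bias = a larger update weight when the news is desirable
    (base rate lower than one's own estimate). Weights are illustrative."""
    error = evidence - prior
    lam = lam_desirable if error < 0 else lam_undesirable  # lower risk = good news
    return prior + lam * error

# Desirable news (risk lower than feared) moves the belief more:
after_good = belief_update(prior=40.0, evidence=20.0)  # 40 + 0.6 * (-20) = 28.0
after_bad = belief_update(prior=40.0, evidence=60.0)   # 40 + 0.3 * (+20) = 46.0
```

Under the neutral-mood condition the abstract describes, the desirable weight exceeds the undesirable one; under sad mood the asymmetry disappears (both weights roughly equal).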
Affiliation(s)
- Aleksandr T Karnick
- Department of Psychology, University of Southern Mississippi, Hattiesburg, MS, USA
- Brian W Bauer
- Department of Psychology, University of Southern Mississippi, Hattiesburg, MS, USA; Department of Psychology, University of Georgia, Athens, GA, USA
- Daniel W Capron
- Department of Psychology, University of Southern Mississippi, Hattiesburg, MS, USA
3. Aster HC, Waltmann M, Busch A, Romanos M, Gamer M, Maria van Noort B, Beck A, Kappel V, Deserno L. Impaired flexible reward learning in ADHD patients is associated with blunted reinforcement sensitivity and neural signals in ventral striatum and parietal cortex. Neuroimage Clin 2024; 42:103588. PMID: 38471434; PMCID: PMC10943992; DOI: 10.1016/j.nicl.2024.103588.
Abstract
Reward-based learning and decision-making are prime candidates for understanding symptoms of attention deficit hyperactivity disorder (ADHD). However, only limited evidence is available regarding the neurocomputational underpinnings of the alterations seen in ADHD. This concerns flexible behavioral adaptation in dynamically changing environments, which is challenging for individuals with ADHD. One previous study points to elevated choice switching in adolescent ADHD, which was accompanied by disrupted learning signals in medial prefrontal cortex. Here, we investigated young adults with ADHD (n = 17) as compared to age- and sex-matched controls (n = 17) using a probabilistic reversal learning experiment during functional magnetic resonance imaging (fMRI). The task requires continuous learning to guide flexible behavioral adaptation to changing reward contingencies. To disentangle the neurocomputational underpinnings of the behavioral data, we used reinforcement learning (RL) models, which informed the analysis of the fMRI data. ADHD patients performed worse than controls particularly in trials before reversals, i.e., when reward contingencies were stable. This pattern resulted from 'noisy' choice switching regardless of previous feedback. RL modelling showed decreased reinforcement sensitivity and enhanced learning rates for negative feedback in ADHD patients. At the neural level, this was reflected in a diminished representation of choice probability in the left posterior parietal cortex in ADHD. Moreover, modelling showed a marginal reduction of learning about the unchosen option, which was paralleled by a marginal reduction in learning signals incorporating the unchosen option in the left ventral striatum. Taken together, we show that impaired flexible behavior in ADHD is due to excessive choice switching ('hyper-flexibility'), which can be detrimental or beneficial depending on the learning environment. Computationally, this resulted from blunted sensitivity to reinforcement, for which we detected neural correlates in the attention-control network, specifically in the parietal cortex. These neurocomputational findings remain preliminary due to the relatively small sample size.
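The two model components the abstract highlights, reinforcement sensitivity and valence-specific learning rates, can be combined in a single update rule. This is an illustrative reconstruction, not the authors' code; parameter values are placeholders:

```python
def update_value(q, outcome, rho=1.0, alpha_pos=0.3, alpha_neg=0.3):
    """One Rescorla-Wagner step with a reinforcement-sensitivity weight (rho)
    scaling the outcome and valence-specific learning rates.
    Values are placeholders, not fitted estimates."""
    pe = rho * outcome - q
    return q + (alpha_pos if pe > 0 else alpha_neg) * pe

# Blunted sensitivity (small rho) compresses the learned values,
# flattening value differences and so making softmax choices noisier:
q_typical, q_blunted = 0.0, 0.0
for _ in range(200):
    q_typical = update_value(q_typical, 1.0, rho=1.0)
    q_blunted = update_value(q_blunted, 1.0, rho=0.4)
```

With repeated rewards the value estimate converges to `rho`, so a blunted agent never separates good from bad options as cleanly as a typical one.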
Affiliation(s)
- Hans-Christoph Aster
- Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University Hospital Würzburg, Würzburg, Germany
- Maria Waltmann
- Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University Hospital Würzburg, Würzburg, Germany; Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
- Anika Busch
- Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University Hospital Würzburg, Würzburg, Germany
- Marcel Romanos
- Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University Hospital Würzburg, Würzburg, Germany
- Matthias Gamer
- Department of Psychology, University of Würzburg, Würzburg, Germany
- Betteke Maria van Noort
- Department of Child and Adolescent Psychiatry, Charité University Medicine, Campus Virchow Klinikum, Berlin, Germany; MSB Medical School Berlin, Department of Psychology, Germany
- Anne Beck
- Department of Psychiatry and Neurosciences, Charité University Medicine, Berlin, Germany; Department of Psychology, Faculty of Health, Health and Medical University, Potsdam, Germany
- Viola Kappel
- Department of Child and Adolescent Psychiatry, Charité University Medicine, Campus Virchow Klinikum, Berlin, Germany
- Lorenz Deserno
- Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University Hospital Würzburg, Würzburg, Germany; Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany; Department of Psychiatry and Psychotherapy, Technische Universität Dresden, Dresden, Germany
4. Colas JT, O’Doherty JP, Grafton ST. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts. PLoS Comput Biol 2024; 20:e1011950. PMID: 38552190; PMCID: PMC10980507; DOI: 10.1371/journal.pcbi.1011950.
Abstract
Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as those of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning. With systematic comparison and falsification of computational models, human choices were tested for signatures of parallel modules representing not only an enhanced form of generalized reinforcement learning but also action bias and hysteresis. We found evidence for substantial differences in bias and hysteresis across participants, even comparable in magnitude to the individual differences in learning. Individuals who did not learn well revealed the greatest biases, but those who did learn accurately were also significantly biased. The direction of hysteresis varied among individuals as repetition or, more commonly, alternation biases persisting from multiple previous actions. Considering that these actions were button presses with trivial motor demands, the idiosyncratic forces biasing sequences of action choices were robust enough to suggest ubiquity across individuals and across tasks requiring various actions. In light of how bias and hysteresis function as a heuristic for efficient control that adapts to uncertainty or low motivation by minimizing the cost of effort, these phenomena broaden the consilient theory of a mixture of experts to encompass a mixture of expert and nonexpert controllers of behavior.
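A mixture of expert (value-based) and nonexpert (bias, hysteresis) controllers can be sketched as a softmax over summed module contributions. This is an illustrative sketch, not the authors' model; `beta`, `bias`, and `kappa` values are placeholders:

```python
import math

def choice_probs(q_values, prev_choice, beta=3.0, bias=(0.2, 0.0), kappa=0.5):
    """Softmax policy mixing three modules: learned values (scaled by beta),
    a static per-action bias, and hysteresis kappa (kappa > 0 = repetition
    bias, kappa < 0 = alternation bias). Parameter values are illustrative."""
    logits = []
    for a, q in enumerate(q_values):
        stick = kappa if a == prev_choice else 0.0
        logits.append(beta * q + bias[a] + stick)
    m = max(logits)                      # subtract max for numerical stability
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return [e / z for e in exps]
```

With equal learned values, a positive `kappa` tilts choice toward repeating the previous action, reproducing hysteresis that is independent of reward history.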
Affiliation(s)
- Jaron T. Colas
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
- John P. O’Doherty
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
- Scott T. Grafton
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
5. Le T, Oba T, Couch L, McInerney L, Li CS. Deficits in proactive avoidance and neural responses to drinking motives in problem drinkers. Res Sq [Preprint] 2024: rs.3.rs-3924584. PMID: 38405986; PMCID: PMC10889056; DOI: 10.21203/rs.3.rs-3924584/v1.
Abstract
Physical pain and negative emotions represent two distinct drinking motives that contribute to harmful alcohol use. Proactive avoidance, which can reduce problem drinking in response to these motives, appears to be impaired in problem drinkers. However, proactive avoidance and its underlying neural deficits have not been assessed experimentally. How these deficits interrelate with drinking motives to influence alcohol use also remains unclear. The current study leveraged neuroimaging data collected in forty-one problem and forty-one social drinkers who performed a probabilistic learning go/no-go task that involved proactive avoidance of painful outcomes. We characterized the regional brain responses to proactive avoidance and identified the neural correlates of drinking to avoid physical pain and negative emotions. Behavioral results confirmed problem drinkers' deficits in proactive avoidance learning rate and performance accuracy, both of which were associated with greater alcohol use. Imaging findings in problem drinkers showed that negative emotions as a drinking motive predicted attenuated right insula activation during proactive avoidance, whereas the physical pain motive predicted reduced right putamen response. These regions' activations, as well as their functional connectivity with the somatomotor cortex, also demonstrated a negative relationship with drinking severity and a positive relationship with proactive avoidance performance. Path modeling further delineated the pathways through which physical pain and negative emotions, along with alcohol use severity, influenced the neural and behavioral measures of proactive avoidance. Taken together, the current findings provide experimental evidence for proactive avoidance deficits in problem drinkers and establish the link between their neural underpinnings and alcohol misuse.
6. Zika O, Appel J, Klinge C, Shkreli L, Browning M, Wiech K, Reinecke A. Reduction of Aversive Learning Rates in Pavlovian Conditioning by Angiotensin II Antagonist Losartan: A Randomized Controlled Trial. Biol Psychiatry 2024: S0006-3223(24)00063-5. PMID: 38309320; DOI: 10.1016/j.biopsych.2024.01.020.
Abstract
BACKGROUND: Angiotensin receptor blockade has been linked to aspects of aversive learning and memory formation and to the prevention of posttraumatic stress disorder symptom development. METHODS: We investigated the influence of the angiotensin receptor blocker losartan on aversive Pavlovian conditioning using a probabilistic learning paradigm. In a double-blind, randomized, placebo-controlled design, we tested 45 (18 female) healthy volunteers during a baseline session, after application of losartan or placebo (drug session), and during a follow-up session. During each session, participants engaged in a task in which they had to predict the probability of an electrical stimulation on every trial while the true shock contingencies switched repeatedly between phases of high and low shock threat. Computational reinforcement learning models were used to investigate learning dynamics. RESULTS: Acute administration of losartan significantly reduced participants' adjustment during both low-to-high and high-to-low threat changes. This was driven by reduced aversive learning rates in the losartan group during the drug session compared with baseline. The 50-mg drug dose did not induce reduction of blood pressure or change in reaction times, ruling out a general reduction in attention and engagement. Decreased adjustment of aversive expectations was maintained at a follow-up session 24 hours later. CONCLUSIONS: This study shows that losartan acutely reduces Pavlovian learning in aversive environments, thereby highlighting a potential role of the renin-angiotensin system in anxiety development.
Affiliation(s)
- Ondrej Zika
- Max Planck Institute for Human Development, Berlin, Germany
- Judith Appel
- Behavioural Science Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
- Corinna Klinge
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Lorika Shkreli
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Michael Browning
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom; Oxford Health NHS Trust, Warneford Hospital, Oxford, United Kingdom
- Katja Wiech
- Wellcome Centre for Integrative Functional Neuroimaging, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
- Andrea Reinecke
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom; Oxford Health NHS Trust, Warneford Hospital, Oxford, United Kingdom
7. Le TM, Oba T, Couch L, McInerney L, Li CSR. The Neural Correlates of Individual Differences in Reinforcement Learning during Pain Avoidance and Reward Seeking. eNeuro 2024; 11:ENEURO.0437-23.2024. PMID: 38365840; PMCID: PMC10901196; DOI: 10.1523/eneuro.0437-23.2024.
Abstract
Organisms learn to gain reward and avoid punishment through action-outcome associations. Reinforcement learning (RL) offers a critical framework to understand individual differences in this associative learning by assessing learning rate, action bias, the Pavlovian factor (i.e., the extent to which action values are influenced by stimulus values), and the subjective impact of outcomes (i.e., motivation to seek reward and avoid punishment). Nevertheless, how these individual-level metrics are represented in the brain remains unclear. The current study leveraged fMRI in healthy humans and a probabilistic learning go/no-go task to characterize the neural correlates involved in learning to seek reward and avoid pain. Behaviorally, participants showed a higher learning rate during pain avoidance relative to reward seeking. Additionally, the subjective impact of outcomes was greater for reward trials and associated with lower response randomness. Our imaging findings showed that individual differences in learning rate and performance accuracy during avoidance learning were positively associated with activities of the dorsal anterior cingulate cortex, midcingulate cortex, and postcentral gyrus. In contrast, the Pavlovian factor was represented in the precentral gyrus and superior frontal gyrus (SFG) during pain avoidance and reward seeking, respectively. Individual variation in the subjective impact of outcomes was positively predicted by activation of the left posterior cingulate cortex. Finally, action bias was represented by the supplementary motor area (SMA) and pre-SMA, whereas the SFG played a role in restraining this action tendency. Together, these findings highlight for the first time the neural substrates of individual differences in the computational processes during RL.
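The Pavlovian factor and action bias described above are commonly formalized in go/no-go RL models as additive terms in the 'go' action weight. A hedged sketch (parameter names follow that model family; the values are illustrative, not the study's fits):

```python
import math

def p_go(q_go, q_nogo, stimulus_value, beta=2.0, go_bias=0.3, pi=0.4):
    """Probability of emitting 'go': instrumental values plus a static action
    bias and a Pavlovian term (pi) coupling stimulus value to 'go' propensity,
    so appetitive cues invigorate action and aversive cues suppress it."""
    w_go = beta * q_go + go_bias + pi * stimulus_value
    w_nogo = beta * q_nogo
    return 1.0 / (1.0 + math.exp(-(w_go - w_nogo)))
```

Holding the instrumental values fixed, an appetitive cue (`stimulus_value > 0`) raises the probability of responding and an aversive cue lowers it, which is exactly the coupling the Pavlovian factor quantifies.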
Affiliation(s)
- Thang M Le
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut 06519
- Takeyuki Oba
- Human Informatics and Interaction Research Institute, the National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba 305-8560, Japan
- Luke Couch
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut 06519
- Lauren McInerney
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut 06519
- Chiang-Shan R Li
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut 06519
- Department of Neuroscience, Yale University School of Medicine, New Haven, Connecticut 06520
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, Connecticut 06520
- Wu Tsai Institute, Yale University, New Haven, Connecticut 06510
8. Schaaf JV, Weidinger L, Molleman L, van den Bos W. Test-retest reliability of reinforcement learning parameters. Behav Res Methods 2023. PMID: 37684495; DOI: 10.3758/s13428-023-02203-4.
Abstract
It has recently been suggested that parameter estimates of computational models can be used to understand individual differences at the process level. One area of research in which this approach, called computational phenotyping, has taken hold is computational psychiatry. One requirement for successful computational phenotyping is that behavior and parameters are stable over time. Surprisingly, the test-retest reliability of behavior and model parameters remains unknown for most experimental tasks and models. The present study seeks to close this gap by investigating the test-retest reliability of canonical reinforcement learning models in the context of two often-used learning paradigms: a two-armed bandit and a reversal learning task. We tested independent cohorts for the two tasks (N = 69 and N = 47) via an online testing platform with a between-test interval of five weeks. Whereas reliability was high for personality and cognitive measures (with ICCs ranging from .67 to .93), it was generally poor for the parameter estimates of the reinforcement learning models (with ICCs ranging from .02 to .52 for the bandit task and from .01 to .71 for the reversal learning task). Given that simulations indicated that our procedures could detect high test-retest reliability, this suggests that a significant proportion of the variability must be ascribed to the participants themselves. In support of that hypothesis, we show that mood (stress and happiness) can partly explain within-participant variability. Taken together, these results are critical for current practices in computational phenotyping and suggest that individual variability should be taken into account in the future development of the field.
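Test-retest reliability of this kind is usually quantified with an intraclass correlation. A self-contained two-session ICC(2,1) (the two-way random-effects, absolute-agreement, single-measure form commonly reported) can be computed as:

```python
def icc_a1(session1, session2):
    """Two-session ICC(2,1) from the standard two-way ANOVA decomposition:
    (MSR - MSE) / (MSR + (k-1)MSE + k(MSC - MSE)/n), with k = 2 sessions."""
    n, k = len(session1), 2
    grand = (sum(session1) + sum(session2)) / (n * k)
    row_means = [(a + b) / k for a, b in zip(session1, session2)]
    col_means = [sum(session1) / n, sum(session2) / n]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for v in session1 + session2)
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n)
```

Identical sessions yield 1.0; values near zero (or below) indicate that between-session noise swamps the between-participant differences, which is the pattern the abstract reports for most RL parameter estimates.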
Affiliation(s)
- Jessica V Schaaf
- Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands
- Cognitive Neuroscience Department, Radboud University Medical Centre, Nijmegen, the Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, the Netherlands
- Laura Weidinger
- DeepMind, London, United Kingdom
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
- Lucas Molleman
- Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
- Wouter van den Bos
- Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
9. Garrett N, Sharot T. There is no belief update bias for neutral events: failure to replicate Burton et al. (2022). J Cogn Psychol 2023; 35:876-886. PMID: 38013976; PMCID: PMC10591604; DOI: 10.1080/20445911.2023.2245112.
Abstract
In a recent paper, Burton et al. claim that individuals update beliefs to a greater extent when learning that an event is less likely, compared to more likely, than expected. Here, we investigate Burton et al.'s findings. First, we show how Burton et al.'s data do not in fact support a belief update bias for neutral events. Next, in an attempt to replicate their findings, we collect a new data set employing the original belief update task design, but with neutral events. A belief update bias for neutral events is not observed. Finally, we highlight the statistical errors and confounds in Burton et al.'s design and analysis. These include mis-specifying a reinforcement learning approach to model the data and failing to follow standard sanity checks for computational model fitting, such as parameter recovery, model comparison, and out-of-sample prediction. Together, the results provide little evidence for biased updating for neutral events.
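The sanity check emphasized here, parameter recovery, means simulating data with known parameters and verifying that the fitting procedure gets them back. A minimal sketch with a Rescorla-Wagner/softmax agent on a reversal bandit; all task settings (trial counts, reward probabilities, reversal schedule) are illustrative:

```python
import math
import random

def simulate(alpha, beta, n_trials=400, seed=7):
    """Generate choices from a Rescorla-Wagner/softmax agent on a two-armed
    bandit whose good arm reverses every 50 trials (reversals help make the
    learning rate identifiable)."""
    rng = random.Random(seed)
    q = [0.0, 0.0]
    choices, rewards = [], []
    for t in range(n_trials):
        good = (t // 50) % 2
        p_reward = [0.2, 0.2]
        p_reward[good] = 0.8
        p1 = 1 / (1 + math.exp(-beta * (q[1] - q[0])))
        c = 1 if rng.random() < p1 else 0
        r = 1.0 if rng.random() < p_reward[c] else 0.0
        choices.append(c)
        rewards.append(r)
        q[c] += alpha * (r - q[c])
    return choices, rewards

def neg_log_lik(alpha, beta, choices, rewards):
    """Negative log-likelihood of the observed choices under the model."""
    q = [0.0, 0.0]
    nll = 0.0
    for c, r in zip(choices, rewards):
        p1 = 1 / (1 + math.exp(-beta * (q[1] - q[0])))
        nll -= math.log(max(p1 if c == 1 else 1 - p1, 1e-12))
        q[c] += alpha * (r - q[c])
    return nll

# Recovery check: simulate with a known learning rate, refit by grid search
# (beta held fixed here for brevity; a full check recovers both jointly).
choices, rewards = simulate(alpha=0.3, beta=5.0)
grid = [i / 100 for i in range(1, 100)]
alpha_hat = min(grid, key=lambda a: neg_log_lik(a, 5.0, choices, rewards))
```

If `alpha_hat` lands far from the generating value across many simulated agents, the task-model combination cannot support parameter-based conclusions, which is the class of failure the authors flag.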
Affiliation(s)
- Neil Garrett
- School of Psychology, University of East Anglia, Norwich, UK
- Tali Sharot
- Affective Brain Lab, Department of Experimental Psychology, University College London, London, UK
- The Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, UK
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
10. Vandendriessche H, Demmou A, Bavard S, Yadak J, Lemogne C, Mauras T, Palminteri S. Contextual influence of reinforcement learning performance of depression: evidence for a negativity bias? Psychol Med 2023; 53:4696-4706. PMID: 35726513; DOI: 10.1017/s0033291722001593.
Abstract
BACKGROUND: Value-based decision-making impairment in depression is a complex phenomenon: while some studies did find evidence of blunted reward learning and reward-related signals in the brain, others indicate no effect. Here we test whether such reward sensitivity deficits depend on the overall value of the decision problem. METHODS: We used a two-armed bandit task with two contexts: one 'rich' and one 'poor', in which both options were associated with an overall positive or negative expected value, respectively. We tested patients (N = 30) undergoing a major depressive episode and age-, gender- and socio-economically matched controls (N = 26). Learning performance and a subsequent transfer phase without feedback were analyzed to disentangle decision mechanisms from value-update mechanisms. Finally, we used computational model simulation and fitting to link behavioral patterns to learning biases. RESULTS: Control subjects showed similar learning performance in the 'rich' and the 'poor' contexts, while patients displayed reduced learning in the 'poor' context. Analysis of the transfer phase showed that the context-dependent impairment in patients generalized, suggesting that the effect of depression has to be traced to outcome encoding. Computational model-based results showed that patients displayed a higher learning rate for negative compared with positive outcomes (the opposite was true in controls). CONCLUSIONS: Our results illustrate that reinforcement learning performance in depression depends on the value of the context. We show that depressed patients have particular difficulty in contexts with an overall negative state value, which in our task is consistent with a negativity bias at the level of learning rates.
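The negativity bias at the level of learning rates can be illustrated with a Rescorla-Wagner rule that weights worse-than-expected outcomes more heavily: with `alpha_neg > alpha_pos`, value estimates are dragged downward. The values below are illustrative, not the fitted patient parameters:

```python
def run_context(outcomes, alpha_pos=0.1, alpha_neg=0.4):
    """Rescorla-Wagner updating with separate learning rates for better- and
    worse-than-expected outcomes; alpha_neg > alpha_pos yields a negativity
    bias that pulls the final value estimate toward the worst outcomes."""
    v = 0.0
    for r in outcomes:
        pe = r - v
        v += (alpha_pos if pe > 0 else alpha_neg) * pe
    return v
```

On an alternating +1/-1 outcome stream, a biased learner settles on a markedly more negative value than a symmetric learner, mirroring how a negativity bias would weigh most heavily in an overall 'poor' context.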
Affiliation(s)
- Henri Vandendriessche
- Laboratoire de Neurosciences Cognitives Computationnelles, INSERM U960, Paris, France
- Département d'Etudes Cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
- Amel Demmou
- Unité Psychiatrie Adultes, Hôpital Cochin Port Royal, Paris, France
- Sophie Bavard
- Laboratoire de Neurosciences Cognitives Computationnelles, INSERM U960, Paris, France
- Département d'Etudes Cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
- Department of Psychology, University of Hamburg, Hamburg, Germany
- Julien Yadak
- Unité Psychiatrie Adultes, Hôpital Cochin Port Royal, Paris, France
- Cédric Lemogne
- Université Paris Cité, INSERM U1266, Institut de Psychiatrie et Neurosciences de Paris, Paris, France
- Service de Psychiatrie de l'adulte, AP-HP, Hôpital Hôtel-Dieu, Paris, France
- Thomas Mauras
- Groupe Hospitalier Universitaire, GHU Paris Psychiatrie Neurosciences, Paris, France
- Stefano Palminteri
- Laboratoire de Neurosciences Cognitives Computationnelles, INSERM U960, Paris, France
- Département d'Etudes Cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
11. Harada T. Exploring the effects of risk-taking, exploitation, and exploration on divergent thinking under group dynamics. Front Psychol 2023; 13:1063525. PMID: 36743628; PMCID: PMC9890061; DOI: 10.3389/fpsyg.2022.1063525.
Abstract
This study examined the effects of risk-taking and the exploitation/exploration trade-off on divergent thinking in individuals, dyads, and triads. We adopted a simple Q-learning model to estimate risk-attitude, exploitation, and exploration parameters. The results showed that risk-taking, exploitation, and exploration did not affect divergent thinking in dyads; instead, loss aversion was negatively related to divergent thinking. In contrast, risk attitudes and the inverse temperature, the ratio between exploitation and exploration, were significant but had contrasting effects in individuals and triads. For individuals, risk-taking, exploitation, and loss aversion played a critical role in divergent thinking. For triads, risk aversion and exploration were significantly related to divergent thinking. However, the results also indicated that balancing risk with exploitation/exploration and loss aversion is critical to enhancing divergent thinking in individuals and triads when learning coherence emerges. These results can be interpreted consistently with related literature, such as odd- vs. even-numbered group dynamics, knowledge diversity in group creativity, and representational change theory in insight problem-solving.
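The estimated quantities, inverse temperature (exploitation vs. exploration) and loss aversion, can be sketched with a standard Q-learning update and softmax choice rule. Parameter values are illustrative, not the study's estimates:

```python
import math

def p_choose_b(q_a, q_b, inv_temp):
    """Softmax choice between two options: a high inverse temperature means
    exploitation (sticking with the higher-valued option), a low one means
    exploration (near-random choice)."""
    return 1 / (1 + math.exp(-inv_temp * (q_b - q_a)))

def q_update(q, outcome, alpha=0.2, loss_aversion=2.0):
    """Q-learning step in which losses are weighted more heavily than
    equivalent gains (loss aversion). Parameter values are illustrative."""
    utility = outcome if outcome >= 0 else loss_aversion * outcome
    return q + alpha * (utility - q)
```

A loss-averse learner's values react roughly twice as strongly to a -1 outcome as to a +1 outcome, and the inverse temperature then determines how sharply those value differences translate into choices.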
Collapse
|
12
|
Villano WJ, Kraus NI, Reneau TR, Jaso BA, Otto AR, Heller AS. Individual differences in naturalistic learning link negative emotionality to the development of anxiety. SCIENCE ADVANCES 2023; 9:eadd2976. [PMID: 36598977 PMCID: PMC9812386 DOI: 10.1126/sciadv.add2976] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 11/30/2022] [Indexed: 06/17/2023]
Abstract
Organisms learn from prediction errors (PEs) to predict the future. Laboratory studies using small financial outcomes find that humans use PEs to update expectations and link individual differences in PE-based learning to internalizing disorders. Because of the low-stakes outcomes in most tasks, it is unclear whether PE learning emerges in naturalistic, high-stakes contexts and whether individual differences in PE learning predict psychopathology risk. Using experience sampling to assess 625 college students' expected exam grades, we found evidence of PE-based learning and a general tendency to discount negative PEs, an "optimism bias." However, individuals with elevated negative emotionality, a personality trait linked to the development of anxiety disorders, displayed a global pessimism and learning differences that impeded accurate expectations and predicted future anxiety symptoms. A sensitivity to PEs combined with an aversion to negative PEs may result in a pessimistic and inaccurate model of the world, leading to anxiety.
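PE-based updating with an "optimism bias" of the kind described above is commonly modeled with valence-dependent learning rates; a minimal sketch (the numbers are hypothetical, not from the study):

```python
def update_expectation(expectation, outcome, alpha_pos, alpha_neg):
    """Update an expectation from a prediction error (PE), with separate
    learning rates for better- and worse-than-expected outcomes."""
    pe = outcome - expectation
    alpha = alpha_pos if pe > 0 else alpha_neg
    return expectation + alpha * pe

# An "optimistic" learner discounts negative PEs (alpha_neg < alpha_pos):
e = update_expectation(80.0, 70.0, alpha_pos=0.6, alpha_neg=0.2)
# a grade 10 points below expectation lowers it only to 78.0,
# versus 74.0 for an unbiased learner using alpha = 0.6 throughout
```

In this framing, elevated negative emotionality would correspond to a shift in these learning rates (or in baseline expectations) toward pessimism.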
Collapse
Affiliation(s)
| | - Noah I. Kraus
- Department of Psychology, University of Miami, Coral Gables, FL, USA
| | - Travis R. Reneau
- Department of Psychological and Brain Sciences, Washington University in St. Louis, St. Louis, MO, USA
| | - Brittany A. Jaso
- Center for Anxiety and Related Disorders, Boston University, Boston, MA, USA
| | - A. Ross Otto
- Department of Psychology, McGill University, Montreal, Canada
| | - Aaron S. Heller
- Department of Psychology, University of Miami, Coral Gables, FL, USA
| |
Collapse
|
13
|
Colas JT, Dundon NM, Gerraty RT, Saragosa‐Harris NM, Szymula KP, Tanwisuth K, Tyszka JM, van Geen C, Ju H, Toga AW, Gold JI, Bassett DS, Hartley CA, Shohamy D, Grafton ST, O'Doherty JP. Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T. Hum Brain Mapp 2022; 43:4750-4790. [PMID: 35860954 PMCID: PMC9491297 DOI: 10.1002/hbm.25988] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 05/20/2022] [Accepted: 06/10/2022] [Indexed: 11/12/2022] Open
Abstract
The model-free algorithms of "reinforcement learning" (RL) have gained clout across disciplines, but so too have model-based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single reward-prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Instead, the generalized RPE is efficiently relayed for bidirectional counterfactual updating of value estimates for other representations. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal-learning task with hierarchical structure that encouraged inverse generalization across both states and actions. Reflecting inference that could be true, false (i.e., overgeneralization), or absent (i.e., undergeneralization), state generalization distinguished those who learned well more so than action generalization. With high-resolution high-field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus. Factoring in generalization as a multidimensional process in value-based learning, these findings shed light on complexities that, while challenging classic RL, can still be resolved within the bounds of its core computations.
Collapse
Affiliation(s)
- Jaron T. Colas
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, USA
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, USA
| | - Neil M. Dundon
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, USA
- Department of Child and Adolescent Psychiatry, Psychotherapy, and Psychosomatics, University of Freiburg, Freiburg im Breisgau, Germany
| | - Raphael T. Gerraty
- Department of Psychology, Columbia University, New York, New York, USA
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA
- Center for Science and Society, Columbia University, New York, New York, USA
| | - Natalie M. Saragosa‐Harris
- Department of Psychology, New York University, New York, New York, USA
- Department of Psychology, University of California, Los Angeles, California, USA
| | - Karol P. Szymula
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Koranis Tanwisuth
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA
- Department of Psychology, University of California, Berkeley, California, USA
| | - J. Michael Tyszka
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA
| | - Camilla van Geen
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA
- Department of Psychology, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Harang Ju
- Neuroscience Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Arthur W. Toga
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, Los Angeles, California, USA
| | - Joshua I. Gold
- Department of Neuroscience, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Dani S. Bassett
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Neurology, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Psychiatry, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Santa Fe Institute, Santa Fe, New Mexico, USA
| | - Catherine A. Hartley
- Department of Psychology, New York University, New York, New York, USA
- Center for Neural Science, New York University, New York, New York, USA
| | - Daphna Shohamy
- Department of Psychology, Columbia University, New York, New York, USA
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA
- Kavli Institute for Brain Science, Columbia University, New York, New York, USA
| | - Scott T. Grafton
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, USA
| | - John P. O'Doherty
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, USA
| |
Collapse
|
14
|
Zamfir E, Dayan P. Interactions between attributions and beliefs at trial-by-trial level: Evidence from a novel computer game task. PLoS Comput Biol 2022; 18:e1009920. [PMID: 36155635 PMCID: PMC9536582 DOI: 10.1371/journal.pcbi.1009920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 10/06/2022] [Accepted: 08/28/2022] [Indexed: 11/19/2022] Open
Abstract
Inferring causes of the good and bad events that we experience is part of the process of building models of our own capabilities and of the world around us. Making such inferences can be difficult because of complex reciprocal relationships between attributions of the causes of particular events, and beliefs about the capabilities and skills that influence our role in bringing them about. Abnormal causal attributions have long been studied in connection with psychiatric disorders, notably depression and paranoia; however, the mechanisms behind attributional inferences and the way they can go awry are not fully understood. We administered a novel, challenging, game of skill to a substantial population of healthy online participants, and collected trial-by-trial time series of both their beliefs about skill and attributions about the causes of the success and failure of real experienced outcomes. We found reciprocal relationships that provide empirical confirmation of the attribution-self representation cycle theory. This highlights the dynamic nature of the processes involved in attribution, and validates a framework for developing and testing computational accounts of attribution-belief interactions. As part of interpreting our experiences, we spontaneously make causal attributions and use them to update our beliefs about the world, ourselves and others. This has long been a topic of interest, particularly within psychiatry. Some theories assume that people have stable “attributional styles”, others focus on the changing nature of attribution-making and on the relationships between attributions and one’s beliefs about the self, suggesting that the two are mutually connected. In this area of research, people have traditionally been asked to imagine themselves experiencing various significant life events and report on how they would interpret those, or have been exposed to artificial and highly simplified situations in the lab. 
In this work, we introduce a new task to study relationships between causal attributions and beliefs: repeatedly playing an engaging and relatively complex game of skill. We show that we can detect mutual influences between attributions and beliefs at the level of individual wins and losses. This has implications for how everyday successes and failures impact our beliefs about ourselves and our well-being. It also could help understand how our interpretations of negative experiences can spiral out of control, affecting our mental health.
Collapse
Affiliation(s)
- Elena Zamfir
- Department of Education, University of Oxford, Oxford, United Kingdom
| | - Peter Dayan
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| |
Collapse
|
15
|
Banaie Boroujeni K, Sigona MK, Treuting RL, Manuel TJ, Caskey CF, Womelsdorf T. Anterior cingulate cortex causally supports flexible learning under motivationally challenging and cognitively demanding conditions. PLoS Biol 2022; 20:e3001785. [PMID: 36067198 PMCID: PMC9481162 DOI: 10.1371/journal.pbio.3001785] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 09/16/2022] [Accepted: 08/09/2022] [Indexed: 12/02/2022] Open
Abstract
Anterior cingulate cortex (ACC) and striatum (STR) contain neurons encoding not only the expected values of actions, but also the value of stimulus features irrespective of actions. Values about stimulus features in ACC or STR might contribute to adaptive behavior by guiding fixational information sampling and biasing choices toward relevant objects, but they might also have indirect motivational functions by enabling subjects to estimate the value of putting effort into choosing objects. Here, we tested these possibilities by modulating neuronal activity in ACC and STR of nonhuman primates using transcranial ultrasound stimulation while subjects learned the relevance of objects in situations with varying motivational and cognitive demands. Motivational demand was indexed by varying gains and losses during learning, while cognitive demand was varied by increasing the uncertainty about which object features could be relevant during learning. We found that ultrasound stimulation of the ACC, but not the STR, reduced learning efficiency and prolonged information sampling when the task required averting losses and motivational demands were high. Reduced learning efficiency was particularly evident at higher cognitive demands and when subjects experienced loss of already attained tokens. These results suggest that the ACC supports flexible learning of feature values when loss experiences impose a motivational challenge and when uncertainty about the relevance of objects is high. Taken together, these findings provide causal evidence that the ACC facilitates resource allocation and improves visual information sampling during adaptive behavior.
Collapse
Affiliation(s)
- Kianoush Banaie Boroujeni
- Department of Psychology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Michelle K. Sigona
- Vanderbilt University Institute of Imaging Science, Nashville, Tennessee, United States of America
- Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Robert Louie Treuting
- Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Thomas J. Manuel
- Vanderbilt University Institute of Imaging Science, Nashville, Tennessee, United States of America
- Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Charles F. Caskey
- Vanderbilt University Institute of Imaging Science, Nashville, Tennessee, United States of America
- Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, United States of America
- Vanderbilt University Medical Center Department of Radiology and Radiological Sciences, Nashville, Tennessee, United States of America
| | - Thilo Womelsdorf
- Department of Psychology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
16
|
Banaie Boroujeni K, Watson M, Womelsdorf T. Gains and Losses Affect Learning Differentially at Low and High Attentional Load. J Cogn Neurosci 2022; 34:1952-1971. [PMID: 35802604 PMCID: PMC9830784 DOI: 10.1162/jocn_a_01885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Prospective gains and losses influence cognitive processing, but it is unresolved how they modulate flexible learning in changing environments. The prospect of gains might enhance flexible learning through prioritized processing of reward-predicting stimuli, but it is unclear how far this learning benefit extends when task demands increase. Similarly, experiencing losses might facilitate learning when they trigger attentional reorienting away from loss-inducing stimuli, but losses may also impair learning by increasing motivational costs or when negative outcomes are overgeneralized. To clarify these divergent views, we tested how varying magnitudes of gains and losses affect the flexible learning of feature values in environments that varied attentional load by increasing the number of interfering object features. With this task design, we found that larger prospective gains improved learning efficacy and learning speed, but only when attentional load was low. In contrast, expecting losses impaired learning efficacy, and this impairment was larger at higher attentional load. These findings functionally dissociate the contributions of gains and losses on flexible learning, suggesting they operate via separate control mechanisms. One mechanism is triggered by experiencing loss and reduces the ability to reduce distractor interference, impairs assigning credit to specific loss-inducing features, and decreases efficient exploration during learning. The second mechanism is triggered by experiencing gains, which enhances prioritizing reward-predicting stimulus features as long as the interference of distracting features is limited. Taken together, these results support a rational theory of cognitive control during learning, suggesting that experiencing losses and experiencing distractor interference impose costs for learning.
Collapse
Affiliation(s)
| | - Marcus Watson
- Department of Biology, Centre for Vision Research, York University, 4700 Keele Street, Toronto, Ontario M3J 1P3, Canada
| | - Thilo Womelsdorf
- Department of Psychology, Vanderbilt University, Nashville, TN 37240; Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37240
| |
Collapse
|
17
|
Nussenbaum K, Velez JA, Washington BT, Hamling HE, Hartley CA. Flexibility in valenced reinforcement learning computations across development. Child Dev 2022; 93:1601-1615. [PMID: 35596654 PMCID: PMC9831067 DOI: 10.1111/cdev.13791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Optimal integration of positive and negative outcomes during learning varies depending on an environment's reward statistics. The present study investigated the extent to which children, adolescents, and adults (N = 142 8-25 year-olds, 55% female, 42% White, 31% Asian, 17% mixed race, and 8% Black; data collected in 2021) adapt their weighting of better-than-expected and worse-than-expected outcomes when learning from reinforcement. Participants made choices across two contexts: one in which weighting positive outcomes more heavily than negative outcomes led to better performance, and one in which the reverse was true. Reinforcement learning modeling revealed that across age, participants shifted their valence biases in accordance with environmental structure. Exploratory analyses revealed strengthening of context-dependent flexibility with increasing age.
Collapse
Affiliation(s)
| | | | | | | | - Catherine A. Hartley
- Corresponding Author: Catherine A. Hartley, Department of Psychology, New York University, 6 Washington Place, Room 871A, New York, NY, 10003.
| |
Collapse
|
18
|
Michely J, Eldar E, Erdman A, Martin IM, Dolan RJ. Serotonin modulates asymmetric learning from reward and punishment in healthy human volunteers. Commun Biol 2022; 5:812. [PMID: 35962142 PMCID: PMC9374781 DOI: 10.1038/s42003-022-03690-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 07/08/2022] [Indexed: 11/15/2022] Open
Abstract
Instrumental learning is driven by a history of outcome success and failure. Here, we examined the impact of serotonin on learning from positive and negative outcomes. Healthy human volunteers were assessed twice, once after acute (single-dose), and once after prolonged (week-long) daily administration of the SSRI citalopram or placebo. Using computational modelling, we show that prolonged boosting of serotonin enhances learning from punishment and reduces learning from reward. This valence-dependent learning asymmetry increases subjects’ tendency to avoid actions as a function of cumulative failure without leading to detrimental, or advantageous, outcomes. By contrast, no significant modulation of learning was observed following acute SSRI administration. However, differences between the effects of acute and prolonged administration were not significant. Overall, these findings may help explain how serotonergic agents impact on mood disorders. Two factors can drive learning: punishment of failures and reward of successes. Serotonin induces a valence-dependent learning asymmetry, as revealed by prolonged administering of SSRIs to healthy participants in a gambling task.
Collapse
Affiliation(s)
- Jochen Michely
- Department of Psychiatry and Neurosciences, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, BIH Charité Clinician Scientist Program, Berlin, Germany; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, UK; Wellcome Centre for Human Neuroimaging, University College London, London, UK
| | - Eran Eldar
- Psychology and Cognitive Sciences Departments, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Alon Erdman
- Psychology and Cognitive Sciences Departments, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ingrid M Martin
- Wellcome Centre for Human Neuroimaging, University College London, London, UK; Institute of Cognitive Neuroscience, University College London, London, UK
| | - Raymond J Dolan
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, UK; Wellcome Centre for Human Neuroimaging, University College London, London, UK
| |
Collapse
|
19
|
Louie K. Asymmetric and adaptive reward coding via normalized reinforcement learning. PLoS Comput Biol 2022; 18:e1010350. [PMID: 35862443 PMCID: PMC9345478 DOI: 10.1371/journal.pcbi.1010350] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 08/02/2022] [Accepted: 07/01/2022] [Indexed: 11/18/2022] Open
Abstract
Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making. Reinforcement learning models are widely used to characterize reward-driven learning in biological and computational agents. Standard reinforcement learning models use linear value functions, despite strong empirical evidence that biological value representations are nonlinear functions of external rewards. Here, we examine the properties of a biologically-based nonlinear reinforcement learning algorithm employing the canonical divisive normalization function, a neural computation commonly found in sensory, cognitive, and reward coding. We show that this normalized reinforcement learning algorithm implements a simple but powerful control of how reward learning reflects relative gains and losses. This property explains diverse behavioral and neural phenomena, and suggests the importance of using biologically valid value functions in computational models of learning and decision-making.
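The key idea above, that passing reward through a divisive normalization before the RL update creates an intrinsic, tunable asymmetry in prediction-error coding, can be sketched as follows. This is an illustration of the general mechanism under assumed parameter values; the paper's exact parameterization may differ.

```python
def normalized_utility(reward, sigma):
    """Divisive normalization: a saturating, nonlinear utility function.
    sigma sets the curvature, and hence the PE asymmetry."""
    return reward / (sigma + reward)

def normalized_rl_update(value, reward, alpha, sigma):
    """Standard RL update, but on normalized rather than linear reward."""
    pe = normalized_utility(reward, sigma) - value
    return value + alpha * pe

# Symmetric objective changes produce asymmetric subjective PEs:
v = normalized_utility(10.0, sigma=10.0)        # baseline utility = 0.5
gain_pe = normalized_utility(20.0, 10.0) - v    # ~ +0.167
loss_pe = normalized_utility(0.0, 10.0) - v     # -0.5
```

Because the utility curve saturates, a reward increase of 10 moves the value estimate less than an equal-sized decrease, mimicking the behavioral asymmetry usually attributed to separate learning rates for gains and losses.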
Collapse
Affiliation(s)
- Kenway Louie
- Center for Neural Science, New York University, New York, United States of America
- Neuroscience Institute, New York University Grossman School of Medicine, New York, United States of America
| |
Collapse
|
20
|
Dahal R, MacLellan K, Vavrek D, Dyson BJ. Assessing behavioural profiles following neutral, positive and negative feedback. PLoS One 2022; 17:e0270475. [PMID: 35788745 PMCID: PMC9255737 DOI: 10.1371/journal.pone.0270475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 06/10/2022] [Indexed: 12/02/2022] Open
Abstract
Previous data suggest zero-value, neutral outcomes (draw) are subjectively assigned negative rather than positive valence. The combined observations of faster rather than slower reaction times, subsequent actions defined by shift rather than stay behaviour, reduced flexibility, and, larger rather than smaller deviations from optimal performance following draws all align with the consequences of explicitly negative outcomes such as losses. We further tested the relationships between neutral, positive and negative outcomes by manipulating value salience and observing their behavioural profiles. Despite speeded reaction times and a non-significant bias towards shift behaviour similar to losses when draws were assigned the value of 0 (Experiment 1), the degree of shift behaviour approached an approximation of optimal performance when the draw value was explicitly positive (+1). This was in contrast to when the draw value was explicitly negative (-1), which led to a significant increase in the degree of shift behaviour (Experiment 2). Similar modifications were absent when the same value manipulations were applied to win or lose trials (Experiment 3). Rather than viewing draws as neutral and valence-free outcomes, the processing cascade generated by draws produces a complex behavioural profile containing elements found in response to both explicitly positive and explicitly negative results.
Collapse
Affiliation(s)
| | | | | | - Benjamin James Dyson
- University of Alberta, Edmonton, Canada
- University of Sussex, Brighton, United Kingdom
- Toronto Metropolitan University, Toronto, Canada
| |
Collapse
|
21
|
Palminteri S, Lebreton M. The computational roots of positivity and confirmation biases in reinforcement learning. Trends Cogn Sci 2022; 26:607-621. [PMID: 35662490 DOI: 10.1016/j.tics.2022.04.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 04/13/2022] [Accepted: 04/18/2022] [Indexed: 12/16/2022]
Abstract
Humans do not integrate new information objectively: outcomes carrying a positive affective value and evidence confirming one's own prior belief are overweighed. Until recently, theoretical and empirical accounts of the positivity and confirmation biases assumed them to be specific to 'high-level' belief updates. We present evidence against this account. Learning rates in reinforcement learning (RL) tasks, estimated across different contexts and species, generally present the same characteristic asymmetry, suggesting that belief and value updating processes share key computational principles and distortions. This bias generates over-optimistic expectations about the probability of making the right choices and, consequently, generates over-optimistic reward expectations. We discuss the normative and neurobiological roots of these RL biases and their position within the greater picture of behavioral decision-making theories.
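The characteristic learning-rate asymmetry the abstract refers to is often formalized as a confirmation bias: prediction errors that confirm the choice made (positive PEs for the chosen option, negative PEs for the forgone one) are weighted more heavily. A minimal sketch of such an update, with hypothetical learning rates:

```python
def confirmatory_update(q, chosen, unchosen, r_chosen, r_unchosen,
                        alpha_conf, alpha_disconf):
    """Asymmetric value update: confirmatory PEs use alpha_conf,
    disconfirmatory PEs use the (smaller) alpha_disconf."""
    pe_c = r_chosen - q[chosen]
    pe_u = r_unchosen - q[unchosen]
    # positive PE on the chosen option confirms the choice
    q[chosen] += (alpha_conf if pe_c > 0 else alpha_disconf) * pe_c
    # positive PE on the unchosen option disconfirms it
    q[unchosen] += (alpha_disconf if pe_u > 0 else alpha_conf) * pe_u
    return q

q = confirmatory_update([0.0, 0.0], chosen=0, unchosen=1,
                        r_chosen=1.0, r_unchosen=1.0,
                        alpha_conf=0.6, alpha_disconf=0.2)
# identical outcomes, but the chosen option's value rises to 0.6
# while the forgone option's rises only to 0.2
```

Over trials, this asymmetry inflates the value of whatever is chosen, producing the over-optimistic reward expectations discussed in the abstract.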
Collapse
Affiliation(s)
- Stefano Palminteri
- Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Santé et de la Recherche Médicale, Paris, France; Département d'Études Cognitives, Ecole Normale Supérieure, Paris, France; Université de Recherche Paris Sciences et Lettres, Paris, France.
| | - Maël Lebreton
- Paris School of Economics, Paris, France; LabNIC, Department of Fundamental Neurosciences, University of Geneva, Geneva, Switzerland; Swiss Center for Affective Science, Geneva, Switzerland.
| |
Collapse
|
22
|
Dennison JB, Sazhin D, Smith DV. Decision neuroscience and neuroeconomics: Recent progress and ongoing challenges. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2022; 13:e1589. [PMID: 35137549 PMCID: PMC9124684 DOI: 10.1002/wcs.1589] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 11/28/2021] [Accepted: 12/21/2021] [Indexed: 01/10/2023]
Abstract
In the past decade, decision neuroscience and neuroeconomics have developed many new insights in the study of decision making. This review provides an overarching update on how the field has advanced in this time period. Although our initial review a decade ago outlined several theoretical, conceptual, methodological, empirical, and practical challenges, there has only been limited progress in resolving these challenges. We summarize significant trends in decision neuroscience through the lens of the challenges outlined for the field and review examples where the field has had significant, direct, and applicable impacts across economics and psychology. First, we review progress on topics including reward learning, explore-exploit decisions, risk and ambiguity, intertemporal choice, and valuation. Next, we assess the impacts of emotion, social rewards, and social context on decision making. Then, we follow up with how individual differences impact choices and new exciting developments in the prediction and neuroforecasting of future decisions. Finally, we consider how trends in decision-neuroscience research reflect progress toward resolving past challenges, discuss new and exciting applications of recent research, and identify new challenges for the field. This article is categorized under: Psychology > Reasoning and Decision Making; Psychology > Emotion and Motivation.
Collapse
Affiliation(s)
- Jeffrey B Dennison
- Department of Psychology, Temple University, Philadelphia, Pennsylvania, USA
| | - Daniel Sazhin
- Department of Psychology, Temple University, Philadelphia, Pennsylvania, USA
| | - David V Smith
- Department of Psychology, Temple University, Philadelphia, Pennsylvania, USA
| |
Collapse
|
23
|
Eckstein MK, Master SL, Dahl RE, Wilbrecht L, Collins AG. Reinforcement learning and bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal. Dev Cogn Neurosci 2022; 55:101106. [PMID: 35537273 PMCID: PMC9108470 DOI: 10.1016/j.dcn.2022.101106] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 03/01/2022] [Accepted: 03/25/2022] [Indexed: 12/02/2022] Open
Abstract
During adolescence, youth venture out, explore the wider world, and are challenged to learn how to navigate novel and uncertain environments. We investigated how performance changes across adolescent development in a stochastic, volatile reversal-learning task that uniquely taxes the balance of persistence and flexibility. In a sample of 291 participants aged 8–30, we found that in the mid-teen years, adolescents outperformed both younger and older participants. We developed two independent cognitive models, based on reinforcement learning (RL) and Bayesian inference (BI). The RL parameter for learning from negative outcomes and the BI parameters specifying participants’ mental models were closest to optimal in mid-teen adolescents, suggesting a central role in adolescent cognitive processing. By contrast, persistence and noise parameters improved monotonically with age. We distilled the insights of RL and BI using principal component analysis and found that three shared components interacted to form the adolescent performance peak: adult-like behavioral quality, child-like time scales, and developmentally unique processing of positive feedback. This research highlights adolescence as a neurodevelopmental window that can create performance advantages in volatile and uncertain environments. It also shows how detailed insights can be gleaned by using cognitive models in new ways.
|
24
|
Pupil Correlates of Decision Variables in Mice Playing a Competitive Mixed-Strategy Game. eNeuro 2022; 9:ENEURO.0457-21.2022. [PMID: 35168951 PMCID: PMC8925722 DOI: 10.1523/eneuro.0457-21.2022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 12/21/2021] [Accepted: 01/02/2022] [Indexed: 01/29/2023] Open
Abstract
In a competitive game involving an animal and an opponent, the outcome is contingent on the choices of both players. To succeed, the animal must continually adapt to competitive pressure, or else risk being exploited and lose out on rewards. In this study, we demonstrate that head-fixed male mice can be trained to play the iterative competitive game "matching pennies" against a virtual computer opponent. We find that the animals' performance is well described by a hybrid computational model that includes Q-learning and choice kernels. Comparing between matching pennies and a non-competitive two-armed bandit task, we show that the tasks encourage animals to operate at different regimes of reinforcement learning. To understand the involvement of neuromodulatory mechanisms, we measure fluctuations in pupil size and use multiple linear regression to relate the trial-by-trial transient pupil responses to decision-related variables. The analysis reveals that pupil responses are modulated by observable variables, including choice and outcome, as well as latent variables for value updating, but not action selection. Collectively, these results establish a paradigm for studying competitive decision-making in head-fixed mice and provide insights into the role of arousal-linked neuromodulation in the decision process.
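A minimal sketch of the model class this abstract describes (Q-learning combined with choice kernels, softmax action selection). This is an editorial illustration with placeholder learning rates and inverse temperatures, not the authors' implementation:

```python
import math
import random

def hybrid_update(q, ck, action, reward, alpha_q=0.3, alpha_ck=0.2):
    """One trial of a hybrid Q-learning + choice-kernel update.

    q and ck are per-action lists (modified in place). The choice kernel
    moves toward 1 for the chosen action and toward 0 for the others,
    capturing outcome-independent choice perseveration.
    """
    q[action] += alpha_q * (reward - q[action])
    for a in range(len(ck)):
        ck[a] += alpha_ck * ((1.0 if a == action else 0.0) - ck[a])
    return q, ck

def choose(q, ck, beta_q=3.0, beta_ck=1.0):
    """Softmax over a weighted sum of value and choice-kernel terms."""
    logits = [beta_q * qa + beta_ck * cka for qa, cka in zip(q, ck)]
    m = max(logits)
    weights = [math.exp(x - m) for x in logits]
    r, cum = random.random() * sum(weights), 0.0
    for a, w in enumerate(weights):
        cum += w
        if r < cum:
            return a
    return len(weights) - 1
```

Fitting beta_q and beta_ck separately is what lets such a model separate value-driven choice from habit-like repetition, the contrast exploited when comparing matching pennies with a non-competitive bandit task.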
|
25
|
Rosenbaum GM, Grassie HL, Hartley CA. Valence biases in reinforcement learning shift across adolescence and modulate subsequent memory. eLife 2022; 11:e64620. [PMID: 35072624 PMCID: PMC8786311 DOI: 10.7554/elife.64620] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 12/24/2021] [Indexed: 12/12/2022] Open
Abstract
As individuals learn through trial and error, some are more influenced by good outcomes, while others weight bad outcomes more heavily. Such valence biases may also influence memory for past experiences. Here, we examined whether valence asymmetries in reinforcement learning change across adolescence, and whether individual learning asymmetries bias the content of subsequent memory. Participants ages 8-27 learned the values of 'point machines,' after which their memory for trial-unique images presented with choice outcomes was assessed. Relative to children and adults, adolescents overweighted worse-than-expected outcomes during learning. Individuals' valence biases modulated incidental memory, such that those who prioritized worse- (or better-) than-expected outcomes during learning were also more likely to remember images paired with these outcomes, an effect reproduced in an independent dataset. Collectively, these results highlight age-related changes in the computation of subjective value and demonstrate that a valence-asymmetric valuation process influences how information is prioritized in episodic memory.
Affiliation(s)
- Gail M Rosenbaum
- Department of Psychology, New York UniversityNew YorkUnited States
| | - Hannah L Grassie
- Department of Psychology, New York UniversityNew YorkUnited States
| | - Catherine A Hartley
- Department of Psychology, New York UniversityNew YorkUnited States
- Center for Neural Science, New York UniversityNew YorkUnited States
| |
|
26
|
Eckstein MK, Master SL, Xia L, Dahl RE, Wilbrecht L, Collins AGE. The interpretation of computational model parameters depends on the context. eLife 2022; 11:e75474. [PMID: 36331872 PMCID: PMC9635876 DOI: 10.7554/elife.75474] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 09/09/2022] [Indexed: 11/06/2022] Open
Abstract
Reinforcement Learning (RL) models have revolutionized the cognitive and brain sciences, promising to explain behavior from simple conditioning to complex problem solving, to shed light on developmental and individual differences, and to anchor cognitive processes in specific brain mechanisms. However, the RL literature increasingly reveals contradictory results, which might cast doubt on these claims. We hypothesized that many contradictions arise from two commonly held assumptions about computational model parameters that are actually often invalid: that parameters generalize between contexts (e.g. tasks, models) and that they capture interpretable (i.e. unique, distinctive) neurocognitive processes. To test this, we asked 291 participants aged 8–30 years to complete three learning tasks in one experimental session, and fitted RL models to each. We found that some parameters (exploration/decision noise) showed significant generalization: they followed similar developmental trajectories, and were reciprocally predictive between tasks. Still, generalization was significantly below the methodological ceiling. Furthermore, other parameters (learning rates, forgetting) did not show evidence of generalization, and sometimes even showed opposite developmental trajectories. Interpretability was low for all parameters. We conclude that the systematic study of context factors (e.g. reward stochasticity; task volatility) will be necessary to enhance the generalizability and interpretability of computational cognitive models.
Affiliation(s)
| | - Sarah L Master
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States,Department of Psychology, New York UniversityNew YorkUnited States
| | - Liyu Xia
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States,Department of Mathematics, University of California, BerkeleyBerkeleyUnited States
| | - Ronald E Dahl
- Institute of Human Development, University of California, BerkeleyBerkeleyUnited States
| | - Linda Wilbrecht
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States,Helen Wills Neuroscience Institute, University of California, BerkeleyBerkeleyUnited States
| | - Anne GE Collins
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States,Helen Wills Neuroscience Institute, University of California, BerkeleyBerkeleyUnited States
| |
|
27
|
Lefebvre G, Summerfield C, Bogacz R. A Normative Account of Confirmation Bias During Reinforcement Learning. Neural Comput 2021; 34:307-337. [PMID: 34758486 DOI: 10.1162/neco_a_01455] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 07/26/2021] [Indexed: 11/04/2022]
Abstract
Reinforcement learning involves updating estimates of the value of states and actions on the basis of experience. Previous work has shown that in humans, reinforcement learning exhibits a confirmatory bias: when the value of a chosen option is being updated, estimates are revised more radically following positive than negative reward prediction errors, but the converse is observed when updating the unchosen option value estimate. Here, we simulate performance on a multi-armed bandit task to examine the consequences of a confirmatory bias for reward harvesting. We report a paradoxical finding: that confirmatory biases allow the agent to maximize reward relative to an unbiased updating rule. This principle holds over a wide range of experimental settings and is most influential when decisions are corrupted by noise. We show that this occurs because on average, confirmatory biases lead to overestimating the value of more valuable bandits and underestimating the value of less valuable bandits, rendering decisions overall more robust in the face of noise. Our results show how apparently suboptimal learning rules can in fact be reward maximizing if decisions are made with finite computational precision.
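The confirmatory update rule at issue can be sketched as follows, assuming full (counterfactual) feedback on both arms; the two learning rates are illustrative placeholders, not the paper's simulation settings:

```python
def confirmatory_update(q, chosen, rewards, alpha_conf=0.3, alpha_disconf=0.1):
    """Confirmation-biased update on a two-armed bandit with full feedback.

    The chosen option is updated with the high (confirmatory) rate after
    positive prediction errors and the low rate after negative ones; the
    unchosen option gets the mirror-image asymmetry.
    """
    for a in (0, 1):
        pe = rewards[a] - q[a]
        if a == chosen:
            rate = alpha_conf if pe > 0 else alpha_disconf
        else:
            rate = alpha_disconf if pe > 0 else alpha_conf
        q[a] += rate * pe
    return q
```

Averaged over noisy choices, this rule inflates the value gap between good and bad arms, which is why it can out-harvest unbiased updating when decisions are corrupted by noise.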
Affiliation(s)
- Germain Lefebvre
- MRC Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford OX3 9DU, U.K.
| | | | - Rafal Bogacz
- MRC Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford OX3 9DU, U.K.
| |
|
28
|
Oba T, Katahira K, Ohira H. A learning mechanism shaping risk preferences and a preliminary test of its relationship with psychopathic traits. Sci Rep 2021; 11:20853. [PMID: 34675294 PMCID: PMC8531311 DOI: 10.1038/s41598-021-00358-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 10/07/2021] [Indexed: 11/09/2022] Open
Abstract
People tend to avoid risk in the domain of gains but take risks in the domain of losses; this is called the reflection effect. Formal theories of decision-making have provided important perspectives on risk preferences, but how individuals acquire risk preferences through experiences remains unknown. In the present study, we used reinforcement learning (RL) models to examine the learning processes that can shape attitudes toward risk in both domains. In addition, relationships between learning parameters and personality traits were investigated. Fifty-one participants performed a learning task, and we examined learning parameters and risk preference in each domain. Our results revealed that an RL model that included a nonlinear subjective utility parameter and differential learning rates for positive and negative prediction errors exhibited better fit than other models and that these parameters independently predicted risk preferences and the reflection effect. Regarding personality traits, although the sample size may be too small for a robust test of personality traits, increased primary psychopathy scores could be linked with decreased learning rates for positive prediction error in loss conditions among participants who had low anxiety traits. The present findings not only contribute to understanding how decision-making in risky conditions is influenced by past experiences but also provide insights into certain psychiatric problems.
Affiliation(s)
- Takeyuki Oba
- Department of Psychology, Graduate School of Environmental Studies, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan.
| | - Kentaro Katahira
- Department of Psychology, Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan
| | - Hideki Ohira
- Department of Psychology, Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan
| |
|
29
|
Stewardson HJ, Sambrook TD. Reward prediction error in the ERP following unconditioned aversive stimuli. Sci Rep 2021; 11:19912. [PMID: 34620955 PMCID: PMC8497484 DOI: 10.1038/s41598-021-99408-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 09/16/2021] [Indexed: 11/15/2022] Open
Abstract
Reinforcement learning in humans and other animals is driven by reward prediction errors: deviations between the amount of reward or punishment initially expected and that which is obtained. Temporal difference methods of reinforcement learning generate this reward prediction error at the earliest time at which a revision in reward or punishment likelihood is signalled, for example by a conditioned stimulus. Midbrain dopamine neurons, believed to compute reward prediction errors, generate this signal in response to both conditioned and unconditioned stimuli, as predicted by temporal difference learning. Electroencephalographic recordings of human participants have suggested that a component named the feedback-related negativity (FRN) is generated when this signal is carried to the cortex. If this is so, the FRN should be expected to respond equivalently to conditioned and unconditioned stimuli. However, very few studies have attempted to measure the FRN's response to unconditioned stimuli. The present study attempted to elicit the FRN in response to a primary aversive stimulus (electric shock) using a design that varied reward prediction error while holding physical intensity constant. The FRN was strongly elicited, but earlier and more transiently than typically seen, suggesting that it may incorporate other processes than the midbrain dopamine system.
Affiliation(s)
- Harry J Stewardson
- School of Psychology, University of East Anglia, Norwich Business Park, NR4 7TJ, UK.
| | - Thomas D Sambrook
- School of Psychology, University of East Anglia, Norwich Business Park, NR4 7TJ, UK
| |
|
30
|
Enkhtaivan E, Nishimura J, Ly C, Cochran AL. A Competition of Critics in Human Decision-Making. Computational Psychiatry (Cambridge, Mass.) 2021; 5:81-101. [PMID: 38773993 PMCID: PMC11104313 DOI: 10.5334/cpsy.64] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 07/19/2021] [Indexed: 11/20/2022]
Abstract
Recent experiments and theories of human decision-making suggest positive and negative errors are processed and encoded differently by serotonin and dopamine, with serotonin possibly serving to oppose dopamine and protect against risky decisions. We introduce a temporal difference (TD) model of human decision-making to account for these features. Our model involves two critics, an optimistic learning system and a pessimistic learning system, whose predictions are integrated in time to control how potential decisions compete to be selected. Our model predicts that human decision-making can be decomposed along two dimensions: the degree to which the individual is sensitive to (1) risk and (2) uncertainty. In addition, we demonstrate that the model can learn about the mean and standard deviation of rewards, and provide information about reaction time despite not modeling these variables directly. Lastly, we simulate a recent experiment to show how updates of the two learning systems could relate to dopamine and serotonin transients, thereby providing a mathematical formalism to serotonin's hypothesized role as an opponent to dopamine. This new model should be useful for future experiments on human decision-making.
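The two-critic idea can be illustrated with a pair of asymmetric delta rules for a single option. The rates below are placeholders, and the full model integrates the critics in time rather than simply comparing them, so this is only a sketch of the core asymmetry:

```python
def two_critic_update(v_opt, v_pes, reward, alpha_plus=0.4, alpha_minus=0.1):
    """One update of an optimistic and a pessimistic critic for one option.

    Each critic computes its own prediction error; the optimistic critic
    weights positive errors more heavily, and the pessimistic critic the
    reverse. Combining the two estimates yields a risk-sensitive value.
    """
    pe_o = reward - v_opt
    v_opt += (alpha_plus if pe_o > 0 else alpha_minus) * pe_o
    pe_p = reward - v_pes
    v_pes += (alpha_minus if pe_p > 0 else alpha_plus) * pe_p
    return v_opt, v_pes
```

Run on stochastic rewards, the optimistic critic settles above the mean payoff and the pessimistic critic below it, so the gap between the two tracks reward variability and can implement sensitivity to risk.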
Affiliation(s)
| | - Joel Nishimura
- School of Mathematical and Natural Sciences, Arizona State University, Glendale, AZ, US
| | - Cheng Ly
- Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, VA, US
| | - Amy L. Cochran
- Department of Mathematics, University of Wisconsin, Madison, WI, US
- Department of Population Health Sciences, University of Wisconsin, Madison, WI, US
| |
|
31
|
Xia L, Master SL, Eckstein MK, Baribault B, Dahl RE, Wilbrecht L, Collins AGE. Modeling changes in probabilistic reinforcement learning during adolescence. PLoS Comput Biol 2021; 17:e1008524. [PMID: 34197447 PMCID: PMC8279421 DOI: 10.1371/journal.pcbi.1008524] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 07/14/2021] [Accepted: 05/26/2021] [Indexed: 01/17/2023] Open
Abstract
In the real world, many relationships between events are uncertain and probabilistic. Uncertainty is also likely to be a more common feature of daily experience for youth because they have less experience to draw from than adults. Some studies suggest probabilistic learning may be inefficient in youths compared to adults, while others suggest it may be more efficient in youths in mid adolescence. Here we used a probabilistic reinforcement learning task to test how youths aged 8-17 (N = 187) and adults aged 18-30 (N = 110) learn about stable probabilistic contingencies. Performance increased with age through the early twenties, then stabilized. Using hierarchical Bayesian methods to fit computational reinforcement learning models, we show that all participants' performance was better explained by models in which negative outcomes had minimal to no impact on learning. The performance increase over age was driven by (1) an increase in learning rate (i.e., a decrease in integration time scale) and (2) a decrease in noisy/exploratory choices. In mid-adolescence (age 13-15), salivary testosterone and learning rate were positively related. We discuss our findings in the context of other studies and hypotheses about adolescent brain development.
Affiliation(s)
- Liyu Xia
- Department of Mathematics, University of California Berkeley, Berkeley, California, United States of America
| | - Sarah L. Master
- Department of Psychology, New York University, New York, New York, United States of America
| | - Maria K. Eckstein
- Department of Psychology, University of California Berkeley, Berkeley, California, United States of America
| | - Beth Baribault
- Department of Psychology, University of California Berkeley, Berkeley, California, United States of America
| | - Ronald E. Dahl
- School of Public Health, University of California Berkeley, Berkeley, California, United States of America
| | - Linda Wilbrecht
- Department of Psychology, University of California Berkeley, Berkeley, California, United States of America
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, California, United States of America
| | - Anne Gabrielle Eva Collins
- Department of Psychology, University of California Berkeley, Berkeley, California, United States of America
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, California, United States of America
| |
|
32
|
Harada T. Three heads are better than two: Comparing learning properties and performances across individuals, dyads, and triads through a computational approach. PLoS One 2021; 16:e0252122. [PMID: 34138907 PMCID: PMC8211165 DOI: 10.1371/journal.pone.0252122] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 05/10/2021] [Indexed: 11/28/2022] Open
Abstract
Although it is considered that two heads are better than one, related studies argued that groups rarely outperform their best members. This study examined not only whether two heads are better than one but also whether three heads are better than two or one in the context of two-armed bandit problems where learning plays an instrumental role in achieving high performance. This research revealed that a U-shaped correlation exists between performance and group size. The performance was highest for either individuals or triads, but the lowest for dyads. Moreover, this study estimated learning properties and determined that high inverse temperature (exploitation) accounted for high performance. In particular, it was shown that group effects regarding the inverse temperatures in dyads did not generate values high enough to surpass the averages of the two group members. In contrast, triads gave rise to higher values of the inverse temperatures than the averages of their individual group members. These results were consistent with our proposed hypothesis that learning coherence is likely to emerge in individuals and triads, but not in dyads, which in turn leads to higher performance. This hypothesis is based on the classical argument by Simmel stating that while dyads are likely to involve more emotion and generate greater variability, triads are the smallest structure which tends to constrain emotions, reduce individuality, and generate behavioral convergences or uniformity because of the "two against one" social pressures. As a result, three heads or one head were better than two in our study.
Affiliation(s)
- Tsutomu Harada
- Graduate School of Business Administration, Kobe University, Kobe, Japan
| |
|
33
|
Ohta H, Satori K, Takarada Y, Arake M, Ishizuka T, Morimoto Y, Takahashi T. The asymmetric learning rates of murine exploratory behavior in sparse reward environments. Neural Netw 2021; 143:218-229. [PMID: 34157646 DOI: 10.1016/j.neunet.2021.05.030] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2021] [Revised: 04/16/2021] [Accepted: 05/26/2021] [Indexed: 11/29/2022]
Abstract
Goal-oriented behaviors of animals can be modeled by reinforcement learning algorithms. Such algorithms predict future outcomes of selected actions utilizing action values and updating those values in response to the positive and negative outcomes. In many models of animal behavior, the action values are updated symmetrically based on a common learning rate, that is, in the same way for both positive and negative outcomes. However, animals in environments with scarce rewards may have uneven learning rates. To investigate the asymmetry between learning rates for reward and non-reward, we analyzed the exploration behavior of mice in five-armed bandit tasks using a Q-learning model with differential learning rates for positive and negative outcomes. The positive learning rate was significantly higher in a scarce reward environment than in a rich reward environment, and conversely, the negative learning rate was significantly lower in the scarce environment. The positive-to-negative learning rate ratio was about 10 in the scarce environment and about 2 in the rich environment. This result suggests that when the reward probability was low, the mice tended to ignore failures and exploit the rare rewards. Computational modeling analysis revealed that the increased learning rate ratio could cause an overestimation of and perseveration on rare-rewarding events, increasing total reward acquisition in the scarce environment but disadvantaging impartial exploration.
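The core update fitted in this kind of analysis is a Q-learning rule with valence-dependent learning rates. The 10:1 ratio below mirrors the order of magnitude reported for the scarce-reward environment, though the exact values here are illustrative:

```python
def asymmetric_q_update(q, action, reward, alpha_pos=0.5, alpha_neg=0.05):
    """Q-learning step with separate learning rates for positive and
    negative prediction errors. alpha_pos/alpha_neg = 10 echoes the
    scarce-reward regime described in the abstract (placeholder values).
    """
    pe = reward - q[action]
    q[action] += (alpha_pos if pe > 0 else alpha_neg) * pe
    return q
```

With this ratio, a single rare reward lifts an arm's value sharply while a run of failures erodes it only slowly, producing the perseveration on rare-rewarding arms described above.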
Affiliation(s)
- Hiroyuki Ohta
- Department of Pharmacology, National Defense Medical College, Saitama, 359-8513, Japan.
| | | | - Yu Takarada
- Tokyo Denki University, Saitama, 350-0394, Japan
| | - Masashi Arake
- Department of Physiology, National Defense Medical College, Saitama, 359-8513, Japan
| | - Toshiaki Ishizuka
- Department of Pharmacology, National Defense Medical College, Saitama, 359-8513, Japan
| | - Yuji Morimoto
- Department of Physiology, National Defense Medical College, Saitama, 359-8513, Japan
| | | |
|
34
|
Gu Y, Liu T, Zhang X, Long Q, Hu N, Zhang Y, Chen A. The Event-Related Potentials Responding to Outcome Valence and Expectancy Violation during Feedback Processing. Cereb Cortex 2021; 31:1060-1076. [PMID: 32995836 DOI: 10.1093/cercor/bhaa274] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2019] [Revised: 08/25/2020] [Accepted: 08/25/2020] [Indexed: 11/15/2022] Open
Abstract
Feedback-related negativity (FRN) is believed to encode reward prediction error (RPE), a term describing whether the outcome is better or worse than expected. However, some studies suggest that it may reflect unsigned prediction error (UPE) instead. Some disagreement remains as to whether FRN is sensitive to the interaction of outcome valence and prediction error (PE) or merely responsive to the absolute size of PE. Moreover, few studies have compared FRN in appetitive and aversive domains to clarify the valence effect or examine PE's quantitative modulation. To investigate the impact of valence and parametric PE on FRN, we varied the prediction and feedback magnitudes within a probabilistic learning task in valence (gain and loss domains, Experiment 1) and non-valence contexts (pure digits, Experiment 2). Experiment 3 was identical to Experiment 1 except that some blocks emphasized outcome valence, while others highlighted predictive accuracy. Experiments 1 and 2 revealed a UPE encoder; Experiment 3 found an RPE encoder when valence was emphasized and a UPE encoder when predictive accuracy was highlighted. In this investigation, we demonstrate that FRN is sensitive to outcome valence and expectancy violation, exhibiting a preferential response depending on the dimension that is emphasized.
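The RPE/UPE distinction tested here reduces to the sign versus the magnitude of the same quantity:

```python
def prediction_errors(expected, outcome):
    """Signed reward prediction error (RPE) vs unsigned prediction error (UPE).

    An RPE encoder distinguishes better- from worse-than-expected outcomes;
    a UPE encoder responds to the size of the expectancy violation
    regardless of its sign.
    """
    rpe = outcome - expected
    return rpe, abs(rpe)
```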
Affiliation(s)
- Yan Gu
- Key Laboratory of Cognition and Personality of Ministry of Education, National Demonstration Center for Experimental Psychology Education (Southwest University), Faculty of Psychology, Southwest University, Chongqing 400715, China
| | - Tianliang Liu
- Key Laboratory of Cognition and Personality of Ministry of Education, National Demonstration Center for Experimental Psychology Education (Southwest University), Faculty of Psychology, Southwest University, Chongqing 400715, China
| | - Xuemeng Zhang
- Key Laboratory of Cognition and Personality of Ministry of Education, National Demonstration Center for Experimental Psychology Education (Southwest University), Faculty of Psychology, Southwest University, Chongqing 400715, China
| | - Quanshan Long
- Key Laboratory of Cognition and Personality of Ministry of Education, National Demonstration Center for Experimental Psychology Education (Southwest University), Faculty of Psychology, Southwest University, Chongqing 400715, China
| | - Na Hu
- Key Laboratory of Cognition and Personality of Ministry of Education, National Demonstration Center for Experimental Psychology Education (Southwest University), Faculty of Psychology, Southwest University, Chongqing 400715, China
| | - Yi Zhang
- Center for Brain Imaging, School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China
| | - Antao Chen
- Key Laboratory of Cognition and Personality of Ministry of Education, National Demonstration Center for Experimental Psychology Education (Southwest University), Faculty of Psychology, Southwest University, Chongqing 400715, China
| |
|
35
|
Information about action outcomes differentially affects learning from self-determined versus imposed choices. Nat Hum Behav 2020; 4:1067-1079. [PMID: 32747804 DOI: 10.1038/s41562-020-0919-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 06/26/2020] [Indexed: 11/08/2022]
Abstract
The valence of new information influences learning rates in humans: good news tends to receive more weight than bad news. We investigated this learning bias in four experiments, by systematically manipulating the source of required action (free versus forced choices), outcome contingencies (low versus high reward) and motor requirements (go versus no-go choices). Analysis of model-estimated learning rates showed that the confirmation bias in learning rates was specific to free choices, but was independent of outcome contingencies. The bias was also unaffected by the motor requirements, thus suggesting that it operates in the representational space of decisions, rather than motoric actions. Finally, model simulations revealed that learning rates estimated from the choice-confirmation model had the effect of maximizing performance across low- and high-reward environments. We therefore suggest that choice-confirmation bias may be adaptive for efficient learning of action-outcome contingencies, above and beyond fostering person-level dispositions such as self-esteem.
|
36
|
Harada T. Learning From Success or Failure? - Positivity Biases Revisited. Front Psychol 2020; 11:1627. [PMID: 32848998 PMCID: PMC7396482 DOI: 10.3389/fpsyg.2020.01627] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 06/16/2020] [Indexed: 11/18/2022] Open
Abstract
The purpose of this study was to reexamine positivity learning biases through a Q learning computation model and relate them to behavioral characteristics of exploitation and exploration. It was found that while the positivity learning biases existed in the simple asymmetric Q learning model, they completely disappeared once the time-varying nature of learning rates was incorporated. In the time-varying model, learning rates depended on the magnitudes of success and failure. The corresponding positive and negative learning rates were related to high and low performance, respectively, indicating that successes and failures were accounted for by positive and negative learning rates. Moreover, these learning rates were related to both exploitation and exploration in somewhat balanced ways. In contrast, under the constant learning parameter model, positivity biases were associated only with exploration. Therefore, the results in the time-varying model are more intuitively appealing than the simple asymmetric model. However, the statistical tests indicated that participants eclectically selected between the asymmetric learning model and its time-varying version, the frequency of which differed across participants.
Affiliation(s)
- Tsutomu Harada
- Graduate School of Business Administration, Kobe University, Kobe, Japan
| |
|
37
|
Biased belief updating and suboptimal choice in foraging decisions. Nat Commun 2020; 11:3417. [PMID: 32647271 PMCID: PMC7347922 DOI: 10.1038/s41467-020-16964-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 05/27/2020] [Indexed: 11/08/2022] Open
Abstract
Deciding which options to engage, and which to forego, requires developing accurate beliefs about the overall distribution of prospects. Here we adapt a classic prey selection task from foraging theory to examine how individuals keep track of an environment’s reward rate and adjust choices in response to its fluctuations. Preference shifts were most pronounced when the environment improved compared to when it deteriorated. This is best explained by a trial-by-trial learning model in which participants estimate the reward rate with upward vs. downward changes controlled by separate learning rates. A failure to adjust expectations sufficiently when an environment becomes worse leads to suboptimal choices: options that are valuable given the environmental conditions are rejected in the false expectation that better options will materialize. These findings offer a previously unappreciated parallel, in the serial-choice setting, to observations of asymmetric updating and the resulting biased (often overoptimistic) estimates in other domains. In some types of decision-making, people must accept or forego an option without knowing what prospects might later be available. Here, the authors reveal how a key bias (asymmetric learning from negative versus positive outcomes) emerges in this type of decision.
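A sketch of an asymmetric reward-rate tracker combined with the classic prey-model acceptance rule from foraging theory. Function names and learning rates are illustrative, not the authors' fitted model:

```python
def update_rate_estimate(rho, observed, alpha_up=0.3, alpha_down=0.1):
    """Delta-rule tracking of the environment's reward rate, with a larger
    learning rate for upward than for downward revisions (the asymmetry
    reported in the abstract; the rates themselves are placeholders)."""
    delta = observed - rho
    return rho + (alpha_up if delta > 0 else alpha_down) * delta

def engage(option_value, handling_time, rho):
    """Prey-model acceptance rule: pursue an option only if its
    profitability (value per unit handling time) beats the estimated
    background reward rate rho."""
    return option_value / handling_time > rho
```

Because downward revisions are slower, rho stays too high after the environment deteriorates, so options worth engaging under the true rate get rejected, which is the suboptimality the abstract describes.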
Collapse
|
38
|
Metha JA, Brian ML, Oberrauch S, Barnes SA, Featherby TJ, Bossaerts P, Murawski C, Hoyer D, Jacobson LH. Separating Probability and Reversal Learning in a Novel Probabilistic Reversal Learning Task for Mice. Front Behav Neurosci 2020; 13:270. [PMID: 31998088 PMCID: PMC6962304 DOI: 10.3389/fnbeh.2019.00270] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 11/27/2019] [Indexed: 11/13/2022] Open
Abstract
The exploration/exploitation tradeoff – pursuing a known reward vs. sampling from lesser known options in the hope of finding a better payoff – is a fundamental aspect of learning and decision making. In humans, this has been studied using multi-armed bandit tasks. The same processes have also been studied using simplified probabilistic reversal learning (PRL) tasks with binary choices. Our investigations suggest that protocols previously used to explore PRL in mice may prove to be beyond their cognitive capacities, with animals performing at a no-better-than-chance level. We sought a novel probabilistic learning task to improve behavioral responding in mice, whilst allowing the investigation of the exploration/exploitation tradeoff in decision making. To achieve this, we developed a two-lever operant chamber task with levers corresponding to different probabilities (high/low) of receiving a saccharin reward, reversing the reward contingencies associated with levers once animals reached a threshold of 80% responding at the high rewarding lever. We found that, unlike in existing PRL tasks, mice are able to learn and behave near optimally with 80% high/20% low reward probabilities. Altering the reward contingencies towards equality showed that some mice displayed preference for the high rewarding lever with probabilities as close as 60% high/40% low. Additionally, we show that animal choice behavior can be effectively modelled using reinforcement learning (RL) models incorporating learning rates for positive and negative prediction error, a perseveration parameter, and a noise parameter. This new decision task, coupled with RL analyses, advances access to investigate the neuroscience of the exploration/exploitation tradeoff in decision making.
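The described RL model (dual learning rates for positive and negative prediction errors, a perseveration bonus, and a lapse-style noise parameter) can be sketched as a simulated agent on the two-lever task. Parameter names and values below are illustrative assumptions, not fitted estimates from the paper.

```python
import math
import random

def simulate_two_lever(n_trials=300, p_high=0.8, p_low=0.2,
                       alpha_pos=0.4, alpha_neg=0.2, beta=5.0,
                       persev=0.3, lapse=0.1, seed=1):
    """Simulate an RL agent on a two-lever probabilistic task: softmax
    choice over Q-values plus a perseveration bonus for repeating the
    previous lever, mixed with occasional random (lapse) choices."""
    rng = random.Random(seed)
    q = [0.0, 0.0]
    last = None
    choices = []
    for _ in range(n_trials):
        logits = [beta * q[a] + (persev if a == last else 0.0) for a in (0, 1)]
        m = max(logits)
        p1 = math.exp(logits[1] - m) / (math.exp(logits[0] - m) + math.exp(logits[1] - m))
        p1 = (1 - lapse) * p1 + lapse * 0.5  # noise parameter
        a = 1 if rng.random() < p1 else 0
        reward = 1.0 if rng.random() < (p_high if a == 1 else p_low) else 0.0
        delta = reward - q[a]
        q[a] += (alpha_pos if delta > 0 else alpha_neg) * delta
        last = a
        choices.append(a)
    return choices, q
```

Under this parameterization the agent ends up choosing the high-probability lever on most late trials, mirroring the near-optimal responding the mice showed at 80%/20% contingencies; swapping `p_high` and `p_low` mid-simulation would give the reversal phase.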
Collapse
Affiliation(s)
- Jeremy A Metha
- Sleep and Cognition, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia.,Translational Neuroscience, Department of Pharmacology and Therapeutics, School of Biomedical Sciences, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, VIC, Australia.,Brain, Mind and Markets Laboratory, Department of Finance, Faculty of Business and Economics, The University of Melbourne, Parkville, VIC, Australia
| | - Maddison L Brian
- Sleep and Cognition, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia.,Translational Neuroscience, Department of Pharmacology and Therapeutics, School of Biomedical Sciences, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, VIC, Australia
| | - Sara Oberrauch
- Sleep and Cognition, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia.,Translational Neuroscience, Department of Pharmacology and Therapeutics, School of Biomedical Sciences, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, VIC, Australia
| | - Samuel A Barnes
- Department of Psychiatry, School of Medicine, University of California, San Diego, La Jolla, CA, United States
| | - Travis J Featherby
- Behavioral Core, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia
| | - Peter Bossaerts
- Brain, Mind and Markets Laboratory, Department of Finance, Faculty of Business and Economics, The University of Melbourne, Parkville, VIC, Australia
| | - Carsten Murawski
- Brain, Mind and Markets Laboratory, Department of Finance, Faculty of Business and Economics, The University of Melbourne, Parkville, VIC, Australia
| | - Daniel Hoyer
- Sleep and Cognition, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia.,Translational Neuroscience, Department of Pharmacology and Therapeutics, School of Biomedical Sciences, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, VIC, Australia.,Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, United States
| | - Laura H Jacobson
- Sleep and Cognition, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia.,Translational Neuroscience, Department of Pharmacology and Therapeutics, School of Biomedical Sciences, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, VIC, Australia
| |
Collapse
|
39
|
Nussenbaum K, Hartley CA. Reinforcement learning across development: What insights can we draw from a decade of research? Dev Cogn Neurosci 2019; 40:100733. [PMID: 31770715 PMCID: PMC6974916 DOI: 10.1016/j.dcn.2019.100733] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 10/24/2019] [Accepted: 11/04/2019] [Indexed: 01/02/2023] Open
Abstract
The past decade has seen the emergence of the use of reinforcement learning models to study developmental change in value-based learning. It is unclear, however, whether these computational modeling studies, which have employed a wide variety of tasks and model variants, have reached convergent conclusions. In this review, we examine whether the tuning of model parameters that govern different aspects of learning and decision-making processes vary consistently as a function of age, and what neurocognitive developmental changes may account for differences in these parameter estimates across development. We explore whether patterns of developmental change in these estimates are better described by differences in the extent to which individuals adapt their learning processes to the statistics of different environments, or by more static learning biases that emerge across varied contexts. We focus specifically on learning rates and inverse temperature parameter estimates, and find evidence that from childhood to adulthood, individuals become better at optimally weighting recent outcomes during learning across diverse contexts and less exploratory in their value-based decision-making. We provide recommendations for how these two possibilities - and potential alternative accounts - can be tested more directly to build a cohesive body of research that yields greater insight into the development of core learning processes.
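The two parameters this review focuses on have simple computational roles: the learning rate sets how strongly recent outcomes are weighted, and the inverse temperature sets how deterministically values drive choice. A minimal softmax sketch (illustrative only):

```python
import math

def softmax_probs(q_values, beta):
    """Softmax choice rule; beta is the inverse temperature.
    Low beta yields near-uniform, exploratory choice; high beta
    yields near-greedy, exploitative choice."""
    m = max(q_values)
    exps = [math.exp(beta * (q - m)) for q in q_values]
    z = sum(exps)
    return [e / z for e in exps]
```

On this reading, the finding that inverse temperature rises from childhood to adulthood corresponds to choice probabilities sharpening around the best option with age.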
Collapse
|
40
|
Dyson BJ, Musgrave C, Rowe C, Sandhur R. Behavioural and neural interactions between objective and subjective performance in a Matching Pennies game. Int J Psychophysiol 2019; 147:128-136. [PMID: 31730790 DOI: 10.1016/j.ijpsycho.2019.11.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 11/05/2019] [Accepted: 11/07/2019] [Indexed: 02/06/2023]
Abstract
To examine the behavioural and neural interactions between objective and subjective performance during competitive decision-making, participants completed a Matching Pennies game where win-rates were fixed within three conditions (win > lose, win = lose, win < lose) and outcomes were predicted at each trial. Using random behaviour as the hallmark of optimal performance, we observed item (heads), contingency (win-stay, lose-shift) and combinatorial (HH, HT, TH, TT) biases across all conditions. Higher-quality behaviour, represented by a reduction in combinatorial bias, was observed during high win-rate exposure. In contrast, over-optimism biases were observed only in conditions where win rates were equal to, or less than, loss rates. At a group level, a neural measure of outcome evaluation (feedback-related negativity; FRN) indexed the binary distinction between positive and negative outcome. At an individual level, increased belief in successful performance accentuated FRN amplitude differences between wins and losses. Taken together, the data suggest that objective experiences of, or subjective beliefs in, the predominance of positive outcomes may be mutual attempts to self-regulate performance during competition. In this way, increased exposure to positive outcomes (real or imagined) may help to weight the output of the more diligent and analytic System 2, relative to the impulsive and intuitive System 1.
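The contingency biases measured here (win-stay, lose-shift) have a straightforward operationalization. A sketch, with a function name of our choosing, assuming binary choice and outcome sequences:

```python
def wsls_rates(choices, outcomes):
    """Win-stay and lose-shift frequencies from choice/outcome sequences
    (outcomes[t] truthy = win). For a perfectly random player both
    frequencies should hover near 0.5."""
    win_stay = lose_shift = wins = losses = 0
    for t in range(1, len(choices)):
        if outcomes[t - 1]:
            wins += 1
            win_stay += choices[t] == choices[t - 1]
        else:
            losses += 1
            lose_shift += choices[t] != choices[t - 1]
    return (win_stay / wins if wins else float('nan'),
            lose_shift / losses if losses else float('nan'))
```

Departures of either frequency from 0.5 are exploitable by an opponent in Matching Pennies, which is why random behaviour serves as the hallmark of optimal performance in this task.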
Collapse
Affiliation(s)
- Benjamin James Dyson
- University of Alberta, Canada; University of Sussex, UK; Ryerson University, Canada.
| | | | | | | |
Collapse
|
41
|
Oba T, Katahira K, Ohira H. The Effect of Reduced Learning Ability on Avoidance in Psychopathy: A Computational Approach. Front Psychol 2019; 10:2432. [PMID: 31736830 PMCID: PMC6838140 DOI: 10.3389/fpsyg.2019.02432] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 10/14/2019] [Indexed: 02/01/2023] Open
Abstract
Individuals with psychopathy often show deficits in learning, which can have negative consequences. Several theories have been proposed to explain psychopathic behaviors, but the learning mechanisms in psychopathy are still unclear. To clarify the learning anomalies in psychopathy, we fitted reinforcement learning (RL) models to behavioral data. We conducted two experiments to examine the effect of psychopathy as a group difference (Experiment 1) and as a continuum (Experiment 2). Forty-three undergraduates (in Experiment 1) and fifty-five undergraduate and graduate students (in Experiment 2) performed a go/no-go based learning task with accompanying rewards or punishments. Although we observed no differences in learning performance among the levels of psychopathic traits, the learning rate for the positive prediction error in the loss domain was lower for those with high psychopathic traits than for those with low psychopathic traits. This finding indicates that individuals with high psychopathic traits update an action value less when they avoid a negative outcome. Our model can represent previous theories under a computational framework and provide a new perspective on impaired learning in psychopathy.
Collapse
Affiliation(s)
- Takeyuki Oba
- Department of Psychology, Graduate School of Environmental Studies, Nagoya University, Nagoya, Japan
| | - Kentaro Katahira
- Department of Psychology, Graduate School of Informatics, Nagoya University, Nagoya, Japan
| | - Hideki Ohira
- Department of Psychology, Graduate School of Informatics, Nagoya University, Nagoya, Japan
| |
Collapse
|
42
|
Howlett JR, Huang H, Hysek CM, Paulus MP. The effect of single-dose methylphenidate on the rate of error-driven learning in healthy males: a randomized controlled trial. Psychopharmacology (Berl) 2017; 234:3353-3360. [PMID: 28864865 PMCID: PMC5886350 DOI: 10.1007/s00213-017-4723-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Accepted: 08/14/2017] [Indexed: 12/30/2022]
Abstract
RATIONALE AND OBJECTIVES Norepinephrine mediates the adjustment of error-driven learning to match the rate of change of the environment, while phasic dopamine signals prediction errors. We tested the hypothesis that pharmacologic manipulation may modulate this process. METHODS We administered a single dose of methylphenidate, a norepinephrine/dopamine reuptake inhibitor, or placebo in double-blind randomized fashion to 20 healthy human males, who then performed a probabilistic learning task. Each subject was tested in two sessions, receiving methylphenidate in one session and placebo in the other, in randomized order. Task performance was quantified by the percentage of trials on which subjects chose the most likely option, while learning rate was measured using a computational model-based parameter as well as with a behavioral analogue of this parameter. RESULTS There was a substance-by-session interaction effect on behavioral learning rate and model-based learning rate, such that subjects receiving methylphenidate exhibited higher learning rates than those receiving placebo in session 1, with no difference observed in session 2, suggesting that subjects retained the increased learning rate across sessions. Higher behavioral learning rate was associated with both higher task performance and with the model-based learning rate. Higher learning rates were advantageous given the high rate of change on the task. Subjects receiving methylphenidate and placebo began the task in session 1 with a similar behavioral learning rate, but those receiving methylphenidate rapidly increased learning rate toward the optimal value, suggesting that methylphenidate accelerated the adaptation of learning rate based on the environment. CONCLUSIONS The results suggest that methylphenidate may improve disrupted probabilistic learning in disorders involving noradrenergic or dopaminergic dysfunction.
Collapse
Affiliation(s)
- Jonathon R. Howlett
- Department of Psychiatry, University of California San Diego, La Jolla, CA, 92093, USA
| | - He Huang
- Laureate Institute for Brain Research, Tulsa, OK, 74136, USA
| | - Cédric M. Hysek
- Department of Psychiatry, University of California San Diego, La Jolla, CA, 92093, USA
| | | |
Collapse
|
43
|
Heijne A, Rossi F, Sanfey AG. Why we stay with our social partners: Neural mechanisms of stay/leave decision-making. Soc Neurosci 2017; 13:667-679. [PMID: 28820016 DOI: 10.1080/17470919.2017.1370010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
How do we decide to keep interacting (e.g., stay) with a social partner or to switch (e.g., leave) to another? This paper investigated the neural mechanisms of stay/leave decision-making. We hypothesized that these decisions fit within a framework of value-based decision-making, and explored four potential mechanisms underlying a hypothesized bias to stay. Twenty-six participants underwent functional Magnetic Resonance Imaging (fMRI) while completing social and nonsocial versions of a stay/leave decision-making task. On each trial, participants chose between four alternative options, after which they received a monetary reward. Crucially, in the social condition, reward magnitude was ostensibly determined by the generosity of social partners, whereas in the nonsocial condition, reward amounts were ostensibly determined in a pre-programmed manner. Results demonstrated that participants were more likely to stay with options of relatively high expected value, with these values updated through Reinforcement Learning mechanisms and represented neurally within ventromedial prefrontal cortex. Moreover, we demonstrated that greater brain activity in ventromedial prefrontal cortex, caudate nucleus, and septo-hypothalamic regions for social versus nonsocial decisions to stay may underlie a bias towards staying with social partners in particular. These findings complement existing social psychological theories by investigating the neural mechanisms of actual stay/leave decisions.
Collapse
Affiliation(s)
- Amber Heijne
- Donders Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, the Netherlands.,Department of Cognitive Science and Education, University of Trento, Rovereto, Italy
| | - Filippo Rossi
- Institute for Neural Computation, University of California, San Diego, CA, USA
| | - Alan G Sanfey
- Donders Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, the Netherlands
| |
Collapse
|
44
|
Palminteri S, Lefebvre G, Kilford EJ, Blakemore SJ. Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing. PLoS Comput Biol 2017; 13:e1005684. [PMID: 28800597 PMCID: PMC5568446 DOI: 10.1371/journal.pcbi.1005684] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2017] [Revised: 08/23/2017] [Accepted: 07/14/2017] [Indexed: 11/18/2022] Open
Abstract
Previous studies suggest that factual learning, that is, learning from obtained outcomes, is biased, such that participants preferentially take into account positive, as compared to negative, prediction errors. However, whether or not the prediction error valence also affects counterfactual learning, that is, learning from forgone outcomes, is unknown. To address this question, we analysed the performance of two groups of participants on reinforcement learning tasks using a computational model that was adapted to test if prediction error valence influences learning. We carried out two experiments: in the factual learning experiment, participants learned from partial feedback (i.e., the outcome of the chosen option only); in the counterfactual learning experiment, participants learned from complete feedback information (i.e., the outcomes of both the chosen and unchosen option were displayed). In the factual learning experiment, we replicated previous findings of a valence-induced bias, whereby participants learned preferentially from positive, relative to negative, prediction errors. In contrast, for counterfactual learning, we found the opposite valence-induced bias: negative prediction errors were preferentially taken into account, relative to positive ones. When considering valence-induced bias in the context of both factual and counterfactual learning, it appears that people tend to preferentially take into account information that confirms their current choice. While the investigation of decision-making biases has a long history in economics and psychology, learning biases have been much less systematically investigated. This is surprising as most of the choices we deal with in everyday life are recurrent, thus allowing learning to occur and therefore influencing future decision-making. Combining behavioural testing and computational modeling, here we show that the valence of an outcome biases both factual and counterfactual learning. 
Increasing our understanding of learning biases will enable the refinement of existing models of value-based decision-making.
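The confirmation-bias pattern the two experiments converge on can be written as one update rule: for the chosen option, positive prediction errors get the larger learning rate; for the unchosen option (counterfactual feedback), negative prediction errors do. A sketch with illustrative parameter names:

```python
def confirmation_update(q_chosen, q_unchosen, r_chosen, r_unchosen,
                        alpha_conf, alpha_disconf):
    """Update both options from complete feedback, weighting
    choice-confirming prediction errors (positive for the chosen option,
    negative for the unchosen one) by the larger rate alpha_conf."""
    d_c = r_chosen - q_chosen
    d_u = r_unchosen - q_unchosen
    q_chosen += (alpha_conf if d_c > 0 else alpha_disconf) * d_c
    q_unchosen += (alpha_disconf if d_u > 0 else alpha_conf) * d_u
    return q_chosen, q_unchosen
```

With `alpha_conf > alpha_disconf`, this single scheme reproduces both reported biases: an apparent positivity bias in factual learning and an apparent negativity bias in counterfactual learning.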
Collapse
Affiliation(s)
- Stefano Palminteri
- Institute of Cognitive Neuroscience, University College London, London, United Kingdom
- Laboratoire de Neurosciences Cognitives, Institut National de la Santé et de la Recherche Médicale, Paris, France
- Département d’Études Cognitives, École Normale Supérieure, Paris, France
- Institut d’Études de la Cognition, Université de Recherche Paris Sciences et Lettres, Paris, France
| | - Germain Lefebvre
- Laboratoire de Neurosciences Cognitives, Institut National de la Santé et de la Recherche Médicale, Paris, France
- Département d’Études Cognitives, École Normale Supérieure, Paris, France
- Laboratoire d’Économie Mathématique et de Microéconomie Appliquée, Université Panthéon-Assas, Paris, France
| | - Emma J. Kilford
- Institute of Cognitive Neuroscience, University College London, London, United Kingdom
| | - Sarah-Jayne Blakemore
- Institute of Cognitive Neuroscience, University College London, London, United Kingdom
| |
Collapse
|
45
|
|
46
|
Abstract
Studies of reinforcement learning have shown that humans learn differently in response to positive and negative reward prediction errors, a phenomenon that can be captured computationally by positing asymmetric learning rates. This asymmetry, motivated by neurobiological and cognitive considerations, has been invoked to explain learning differences across the lifespan as well as a range of psychiatric disorders. Recent theoretical work, motivated by normative considerations, has hypothesized that the learning rate asymmetry should be modulated by the distribution of rewards across the available options. In particular, the learning rate for negative prediction errors should be higher than the learning rate for positive prediction errors when the average reward rate is high, and this relationship should reverse when the reward rate is low. We tested this hypothesis in a series of experiments. Contrary to the theoretical predictions, we found that the asymmetry was largely insensitive to the average reward rate; instead, the dominant pattern was a higher learning rate for negative than for positive prediction errors, possibly reflecting risk aversion.
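One simple way to express the hypothesized normative modulation is as learning rates that depend on the environment's average reward rate. The functional form and parameters below are purely illustrative, not taken from the paper:

```python
def rate_modulated_alphas(avg_reward_rate, base=0.3, gain=0.4):
    """Hypothesized modulation: in rich environments (high average
    reward rate) the learning rate for negative prediction errors
    should exceed the positive one, and vice versa in poor
    environments."""
    alpha_neg = base + gain * avg_reward_rate
    alpha_pos = base + gain * (1.0 - avg_reward_rate)
    return alpha_pos, alpha_neg
```

The reported data instead showed `alpha_neg > alpha_pos` across reward rates, i.e. behaviour resembling a fixed pessimistic (possibly risk-averse) asymmetry rather than this reward-rate-dependent modulation.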
Collapse
|
47
|
Adolescent-specific patterns of behavior and neural activity during social reinforcement learning. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2015; 14:683-97. [PMID: 24550063 DOI: 10.3758/s13415-014-0257-z] [Citation(s) in RCA: 83] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Humans are sophisticated social beings. Social cues from others are exceptionally salient, particularly during adolescence. Understanding how adolescents interpret and learn from variable social signals can provide insight into the observed shift in social sensitivity during this period. The present study tested 120 participants between the ages of 8 and 25 years on a social reinforcement learning task where the probability of receiving positive social feedback was parametrically manipulated. Seventy-eight of these participants completed the task during fMRI scanning. Modeling trial-by-trial learning, children and adults showed higher positive learning rates than did adolescents, suggesting that adolescents demonstrated less differentiation in their reaction times for peers who provided more positive feedback. Forming expectations about receiving positive social reinforcement correlated with neural activity within the medial prefrontal cortex and ventral striatum across age. Adolescents, unlike children and adults, showed greater insular activity during positive prediction error learning and increased activity in the supplementary motor cortex and the putamen when receiving positive social feedback regardless of the expected outcome, suggesting that peer approval may motivate adolescents toward action. While different amounts of positive social reinforcement enhanced learning in children and adults, all positive social reinforcement equally motivated adolescents. Together, these findings indicate that sensitivity to peer approval during adolescence goes beyond simple reinforcement theory accounts and suggest possible explanations for how peers may motivate adolescent behavior.
Collapse
|
48
|
Antipsychotic dose modulates behavioral and neural responses to feedback during reinforcement learning in schizophrenia. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2014; 14:189-201. [DOI: 10.3758/s13415-014-0261-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|