1
|
Röhlinger M, Albrecht C, Bellebaum C. The Role of the N170 in Linking Stimuli to Feedback-Effects of Stimulus Modality and Feedback Delay. Psychophysiology 2025; 62:e70050. [PMID: 40231805 PMCID: PMC11998638 DOI: 10.1111/psyp.70050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Revised: 03/13/2025] [Accepted: 03/14/2025] [Indexed: 04/16/2025]
Abstract
With increasing feedback delay, feedback processing appears to shift from the striatum to the hippocampus. In addition, higher-order sensory areas might be involved in bridging a temporal gap between stimulus and feedback by reactivating the representation of the feedback-predicting stimulus during feedback processing. We hypothesized that the feedback-locked N170, an occipito-temporal event-related potential (ERP) component linked to higher-order visual processing, is more pronounced when delayed feedback is provided for choices between visual compared to auditory stimuli. 35 subjects completed a probabilistic feedback learning task with immediate (1 s) and delayed (7 s) monetary feedback for choices between visual or auditory stimuli. Participants successfully learned to choose the more rewarding stimuli irrespective of stimulus modality. For the N170 amplitude over the right hemisphere, we found an interaction between feedback timing and the modality of the chosen stimulus. Only for delayed feedback, the N170 was more pronounced for choices between visual than auditory stimuli. Moreover, in this condition, the N170 amplitude particularly reflected the reward prediction error (PE), with larger amplitudes for positive PEs and lower amplitudes for negative PEs. This suggests that the N170 reflects feedback-locked reactivations in higher-order visual areas mediated by the reward PE. While these effects need to be studied further, we discuss the N170 as a counterpart to the feedback-related negativity (FRN) regarding interacting influences of feedback valence, feedback timing, and PE.
Collapse
Affiliation(s)
- Madita Röhlinger
- Institute for Experimental Psychology, Faculty of Mathematics and Natural SciencesHeinrich Heine University DüsseldorfDüsseldorfGermany
| | - Christine Albrecht
- Institute for Experimental Psychology, Faculty of Mathematics and Natural SciencesHeinrich Heine University DüsseldorfDüsseldorfGermany
| | - Christian Bellebaum
- Institute for Experimental Psychology, Faculty of Mathematics and Natural SciencesHeinrich Heine University DüsseldorfDüsseldorfGermany
| |
Collapse
|
2
|
Shibata K, Klar V, Fallon SJ, Husain M, Manohar SG. Working memory as a representational template for reinforcement learning. Sci Rep 2024; 14:27660. [PMID: 39532969 PMCID: PMC11557606 DOI: 10.1038/s41598-024-79119-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Accepted: 11/06/2024] [Indexed: 11/16/2024] Open
Abstract
Working memory (WM) and reinforcement learning (RL) both influence decision-making, but how they interact to affect behaviour remains unclear. We assessed whether RL is influenced by the format of visual stimuli held in WM, either feature-based or unified, object-based representations. In a pre-registered paradigm, participants learned stimulus-action combinations that provided reward through 80% probabilistic feedback. In parallel, participants retained the RL stimulus in WM and were asked to recall this stimulus after each RL choice. Crucially, the format of representation probed in WM was manipulated, with blocks encouraging either separate features or bound objects to be remembered. Incentivising a feature-based WM representation facilitated feature-based learning, shown by an improved choice strategy. This reveals a role of WM in providing sustained internal representations that are harnessed by RL, providing a framework by which these two cognitive processes cooperate.
Collapse
Affiliation(s)
- Kengo Shibata
- Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Level 6, West Wing, Oxford, OX3 9DU, UK.
| | - Verena Klar
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, UK
| | - Sean J Fallon
- School of Psychology, University of Plymouth, Plymouth, PL4 8AA, UK
| | - Masud Husain
- Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Level 6, West Wing, Oxford, OX3 9DU, UK
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, UK
| | - Sanjay G Manohar
- Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Level 6, West Wing, Oxford, OX3 9DU, UK
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, UK
| |
Collapse
|
3
|
Bonte M, Brem S. Unraveling individual differences in learning potential: A dynamic framework for the case of reading development. Dev Cogn Neurosci 2024; 66:101362. [PMID: 38447471 PMCID: PMC10925938 DOI: 10.1016/j.dcn.2024.101362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 02/02/2024] [Accepted: 03/01/2024] [Indexed: 03/08/2024] Open
Abstract
Children show an enormous capacity to learn during development, but with large individual differences in the time course and trajectory of learning and the achieved skill level. Recent progress in developmental sciences has shown the contribution of a multitude of factors including genetic variation, brain plasticity, socio-cultural context and learning experiences to individual development. These factors interact in a complex manner, producing children's idiosyncratic and heterogeneous learning paths. Despite an increasing recognition of these intricate dynamics, current research on the development of culturally acquired skills such as reading still has a typical focus on snapshots of children's performance at discrete points in time. Here we argue that this 'static' approach is often insufficient and limits advancements in the prediction and mechanistic understanding of individual differences in learning capacity. We present a dynamic framework which highlights the importance of capturing short-term trajectories during learning across multiple stages and processes as a proxy for long-term development on the example of reading. This framework will help explain relevant variability in children's learning paths and outcomes and fosters new perspectives and approaches to study how children develop and learn.
Collapse
Affiliation(s)
- Milene Bonte
- Department of Cognitive Neuroscience and Maastricht Brain Imaging Center, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, the Netherlands.
| | - Silvia Brem
- Department of Child and Adolescent Psychiatry and Psychotherapy, University Hospital of Psychiatry Zurich, University of Zurich, Switzerland; Neuroscience Center Zurich, University of Zurich and ETH Zurich, Switzerland; URPP Adaptive Brain Circuits in Development and Learning (AdaBD), University of Zurich, Zurich, Switzerland
| |
Collapse
|
4
|
Antono JE, Dang S, Auksztulewicz R, Pooresmaeili A. Distinct Patterns of Connectivity between Brain Regions Underlie the Intra-Modal and Cross-Modal Value-Driven Modulations of the Visual Cortex. J Neurosci 2023; 43:7361-7375. [PMID: 37684031 PMCID: PMC10621764 DOI: 10.1523/jneurosci.0355-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 07/30/2023] [Accepted: 08/26/2023] [Indexed: 09/10/2023] Open
Abstract
Past reward associations may be signaled from different sensory modalities; however, it remains unclear how different types of reward-associated stimuli modulate sensory perception. In this human fMRI study (female and male participants), a visual target was simultaneously presented with either an intra- (visual) or a cross-modal (auditory) cue that was previously associated with rewards. We hypothesized that, depending on the sensory modality of the cues, distinct neural mechanisms underlie the value-driven modulation of visual processing. Using a multivariate approach, we confirmed that reward-associated cues enhanced the target representation in early visual areas and identified the brain valuation regions. Then, using an effective connectivity analysis, we tested three possible patterns of connectivity that could underlie the modulation of the visual cortex: a direct pathway from the frontal valuation areas to the visual areas, a mediated pathway through the attention-related areas, and a mediated pathway that additionally involved sensory association areas. We found evidence for the third model demonstrating that the reward-related information in both sensory modalities is communicated across the valuation and attention-related brain regions. Additionally, the superior temporal areas were recruited when reward was cued cross-modally. The strongest dissociation between the intra- and cross-modal reward-driven effects was observed at the level of the feedforward and feedback connections of the visual cortex estimated from the winning model. These results suggest that, in the presence of previously rewarded stimuli from different sensory modalities, a combination of domain-general and domain-specific mechanisms are recruited across the brain to adjust the visual perception.SIGNIFICANCE STATEMENT Reward has a profound effect on perception, but it is not known whether shared or disparate mechanisms underlie the reward-driven effects across sensory modalities. In this human fMRI study, we examined the reward-driven modulation of the visual cortex by visual (intra-modal) and auditory (cross-modal) reward-associated cues. Using a model-based approach to identify the most plausible pattern of inter-regional effective connectivity, we found that higher-order areas involved in the valuation and attentional processing were recruited by both types of rewards. However, the pattern of connectivity between these areas and the early visual cortex was distinct between the intra- and cross-modal rewards. This evidence suggests that, to effectively adapt to the environment, reward signals may recruit both domain-general and domain-specific mechanisms.
Collapse
Affiliation(s)
- Jessica Emily Antono
- Perception and Cognition Lab, European Neuroscience Institute Goettingen-A Joint Initiative of the University Medical Center Goettingen and the Max-Planck-Society, Germany, Goettingen, 37077, Germany
| | - Shilpa Dang
- Perception and Cognition Lab, European Neuroscience Institute Goettingen-A Joint Initiative of the University Medical Center Goettingen and the Max-Planck-Society, Germany, Goettingen, 37077, Germany
- School of Artificial Intelligence and Data Science, Indian Institute of Technology Jodhpur, Karwar, Jodhpur 342030, India
| | - Ryszard Auksztulewicz
- Center for Cognitive Neuroscience Berlin, Free University Berlin, Berlin, 14195, Germany
| | - Arezoo Pooresmaeili
- Perception and Cognition Lab, European Neuroscience Institute Goettingen-A Joint Initiative of the University Medical Center Goettingen and the Max-Planck-Society, Germany, Goettingen, 37077, Germany
| |
Collapse
|
5
|
Pütz C, van den Berg B, Lorist MM. Dynamic modulation of neural feedback processing and attention during spatial probabilistic learning. iScience 2022; 25:104302. [PMID: 35602968 PMCID: PMC9118728 DOI: 10.1016/j.isci.2022.104302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 03/04/2022] [Accepted: 04/21/2022] [Indexed: 11/25/2022] Open
Abstract
Learned stimulus-reward associations can modulate behavior and the underlying neural processing of information. We investigated the cascade of these neurocognitive mechanisms involved in the learning of spatial stimulus-reward associations. Using electroencephalogram recordings while participants performed a probabilistic spatial reward learning task, we observed that the feedback-related negativity component was more negative in response to loss feedback compared to gain feedback but showed no modulation by learning. The late positive component became larger in response to losses as the learning set progressed but smaller in response to gains. In addition, feedback-locked alpha frequency oscillations measured over occipital sites were predictive of N2pc amplitudes—a marker of spatial attention orienting—observed on the next trial. This relationship was found to become stronger with learning set progression. Taken together, we elucidated neurocognitive dynamics underlying feedback processing during spatial reward learning, and the subsequent effects of these learned spatial stimulus-reward associations on spatial attention. We can learn which spatial location relates to the highest probability of reward Neural processing of feedback valence was not influenced by learning LPC amplitude was dynamically modulated by learning, reflecting context updating Feedback-locked alpha power was predictive of ensuing orientation of attention
Collapse
Affiliation(s)
- Celina Pütz
- Department of Experimental Psychology, University of Groningen, Grote Kruisstraat 2/1, Groningen 9712TS, the Netherlands.,Department of Neurobiology, University of Groningen, P.O. Box 11103, Groningen 9700CC, the Netherlands.,Department of Neurology, University Medical Center Groningen, Postbus 30001, Groningen 9700RB, the Netherlands
| | - Berry van den Berg
- Department of Experimental Psychology, University of Groningen, Grote Kruisstraat 2/1, Groningen 9712TS, the Netherlands
| | - Monicque M Lorist
- Department of Experimental Psychology, University of Groningen, Grote Kruisstraat 2/1, Groningen 9712TS, the Netherlands
| |
Collapse
|
6
|
Olivo D, Di Ciano A, Mauro J, Giudetti L, Pampallona A, Kubera KM, Hirjak D, Wolf RC, Sambataro F. Neural Responses of Benefiting From the Prosocial Exchange: The Effect of Helping Behavior. Front Psychol 2021; 12:606858. [PMID: 33746829 PMCID: PMC7969530 DOI: 10.3389/fpsyg.2021.606858] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 02/05/2021] [Indexed: 11/16/2022] Open
Abstract
Prosocial behavior is critical for the natural development of an individual as well as for promoting social relationships. Although this complex behavior results from gratuitous acts occurring between an agent and a recipient and a wealth of literature on prosocial behavior has investigated these actions, little is known about the effects on the recipient and the neurobiology underlying them. In this study, we used functional magnetic resonance imaging to identify neural correlates of receiving prosocial behavior in the context of real-world experiences, with different types of action provided by the agent, including practical help and effort appreciation. Practical help was associated with increased activation in a network of regions spanning across bilateral superior temporal sulcus, temporoparietal junction, temporal pole, and medial prefrontal cortex. Effort appreciation was associated with activation and increased task-modulated connectivity of the occipital cortex. Prosocial-dependent brain responses were associated with positive affect. Our results support the role of the theory of mind network and the visual cortices in mediating the positive effects of receiving gratuitous help. Moreover, they indicate that specific types of prosocial behavior are mediated by distinct brain networks, which further demonstrates the uniqueness of the psychological processes underlying prosocial actions.
Collapse
Affiliation(s)
- Daniele Olivo
- Department of Neuroscience (DNS), University of Padua, Padua, Italy.,Department of Medicine (DAME), University of Udine, Udine, Italy
| | | | - Jessica Mauro
- Department of Medicine (DAME), University of Udine, Udine, Italy
| | | | | | - Katharina M Kubera
- Center for Psychosocial Medicine, Department of General Psychiatry, Heidelberg University, Heidelberg, Germany
| | - Dusan Hirjak
- Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Robert Christian Wolf
- Center for Psychosocial Medicine, Department of General Psychiatry, Heidelberg University, Heidelberg, Germany
| | - Fabio Sambataro
- Department of Neuroscience (DNS), University of Padua, Padua, Italy.,Department of Medicine (DAME), University of Udine, Udine, Italy.,Padua Neuroscience Center, University of Padua, Padua, Italy
| |
Collapse
|
7
|
Fallon SJ. Necessary impairments? The downside of functional compensation in frontostriatal circuits in people with presymptomatic Huntingt-on's- di-sease. J Neurol Neurosurg Psychiatry 2021; 92:120-121. [PMID: 33372051 DOI: 10.1136/jnnp-2020-324678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 11/24/2020] [Indexed: 11/03/2022]
Affiliation(s)
- Sean James Fallon
- National Institute for Health Research Bristol Biomedical Research Centre, University of Bristol, Bristol BS8 2BN, UK .,Centre for Academic Mental Health, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| |
Collapse
|
8
|
Signed Reward Prediction Errors in the Ventral Striatum Drive Episodic Memory. J Neurosci 2020; 41:1716-1726. [PMID: 33334870 DOI: 10.1523/jneurosci.1785-20.2020] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 12/04/2020] [Accepted: 12/07/2020] [Indexed: 11/21/2022] Open
Abstract
Recent behavioral evidence implicates reward prediction errors (RPEs) as a key factor in the acquisition of episodic memory. Yet, important neural predictions related to the role of RPEs in episodic memory acquisition remain to be tested. Humans (both sexes) performed a novel variable-choice task where we experimentally manipulated RPEs and found support for key neural predictions with fMRI. Our results show that in line with previous behavioral observations, episodic memory accuracy increases with the magnitude of signed (i.e., better/worse-than-expected) RPEs (SRPEs). Neurally, we observe that SRPEs are encoded in the ventral striatum (VS). Crucially, we demonstrate through mediation analysis that activation in the VS mediates the experimental manipulation of SRPEs on episodic memory accuracy. In particular, SRPE-based responses in the VS (during learning) predict the strength of subsequent episodic memory (during recollection). Furthermore, functional connectivity between task-relevant processing areas (i.e., face-selective areas) and hippocampus and ventral striatum increased as a function of RPE value (during learning), suggesting a central role of these areas in episodic memory formation. Our results consolidate reinforcement learning theory and striatal RPEs as key factors subtending the formation of episodic memory.SIGNIFICANCE STATEMENT Recent behavioral research has shown that reward prediction errors (RPEs), a key concept of reinforcement learning theory, are crucial to the formation of episodic memories. In this study, we reveal the neural underpinnings of this process. Using fMRI, we show that signed RPEs (SRPEs) are encoded in the ventral striatum (VS), and crucially, that SRPE VS activity is responsible for the subsequent recollection accuracy of one-shot learned episodic memory associations.
Collapse
|
9
|
Sharp ME, Duncan K, Foerde K, Shohamy D. Dopamine is associated with prioritization of reward-associated memories in Parkinson's disease. Brain 2020; 143:2519-2531. [PMID: 32844197 DOI: 10.1093/brain/awaa182] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 04/08/2020] [Accepted: 04/16/2020] [Indexed: 01/23/2023] Open
Abstract
Patients with Parkinson's disease have reduced reward sensitivity related to dopaminergic neuron loss, which is associated with impairments in reinforcement learning. Increasingly, however, dopamine-dependent reward signals are recognized to play an important role beyond reinforcement learning. In particular, it has been shown that reward signals mediated by dopamine help guide the prioritization of events for long-term memory consolidation. Meanwhile, studies of memory in patients with Parkinson's disease have focused on overall memory capacity rather than what is versus what isn't remembered, leaving open questions about the effect of dopamine replacement on the prioritization of memories by reward and the time-dependence of this effect. The current study sought to fill this gap by testing the effect of reward and dopamine on memory in patients with Parkinson's disease. We tested the effect of dopamine modulation and reward on two forms of long-term memory: episodic memory for neutral objects and memory for stimulus-value associations. We measured both forms of memory in a single task, adapting a standard task of reinforcement learning with incidental episodic encoding events of trial-unique objects. Objects were presented on each trial at the time of feedback, which was either rewarding or not. Memory for the trial-unique images and for the stimulus-value associations, and the influence of reward on both, was tested immediately after learning and 2 days later. We measured performance in Parkinson's disease patients tested either ON or OFF their dopaminergic medications and in healthy older control subjects. We found that dopamine was associated with a selective enhancement of memory for reward-associated images, but that it did not influence overall memory capacity. Contrary to predictions, this effect did not differ between the immediate and delayed memory tests. We also found that while dopamine had an effect on reward-modulated episodic memory, there was no effect of dopamine on memory for stimulus-value associations. Our results suggest that impaired prioritization of cognitive resource allocation may contribute to the early cognitive deficits of Parkinson's disease.
Collapse
Affiliation(s)
- Madeleine E Sharp
- Department of Neurology and Neurosurgery, McGill University, Montreal, Quebec, Canada
| | - Katherine Duncan
- Department of Psychology, University of Toronto, Toronto, Canada
| | - Karin Foerde
- New York State Psychiatric Institute and Department of Psychiatry, Columbia University, New York, NY, USA
| | - Daphna Shohamy
- Department of Psychology, Columbia University, New York, NY, USA.,Zuckerman Mind, Brain, Behavior Institute, Columbia University, New York, NY, USA.,Kavli Institute for Brain Science, Columbia University, New York, NY, USA
| |
Collapse
|
10
|
Cognitive Effort Modulates Connectivity between Dorsal Anterior Cingulate Cortex and Task-Relevant Cortical Areas. J Neurosci 2020; 40:3838-3848. [PMID: 32273486 DOI: 10.1523/jneurosci.2948-19.2020] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 03/24/2020] [Accepted: 03/25/2020] [Indexed: 11/21/2022] Open
Abstract
Investment of cognitive effort is required in everyday life and has received ample attention in recent neurocognitive frameworks. The neural mechanism of effort investment is thought to be structured hierarchically, with dorsal anterior cingulate cortex (dACC) at the highest level, recruiting task-specific upstream areas. In the current fMRI study, we tested whether dACC is generally active when effort demand is high across tasks with different stimuli, and whether connectivity between dACC and task-specific areas is increased depending on the task requirements and effort level at hand. For that purpose, a perceptual detection task was administered that required male and female human participants to detect either a face or a house in a noisy image. Effort demand was manipulated by adding little (low effort) or much (high effort) noise to the images. Results showed a network of dACC, anterior insula (AI), and intraparietal sulcus (IPS) to be more active when effort demand was high, independent of the performed task (face or house detection). Importantly, effort demand modulated functional connectivity between dACC and face-responsive or house-responsive perceptual areas, depending on the task at hand. This shows that dACC, AI, and IPS constitute a general effort-responsive network and suggests that the neural implementation of cognitive effort involves dACC-initiated sensitization of task-relevant areas.SIGNIFICANCE STATEMENT Although cognitive effort is generally perceived as aversive, its investment is inevitable when navigating an increasingly complex society. In this study, we demonstrate how the human brain tailors the implementation of effort to the requirements of the task at hand. We show increased effort-related activity in a network of brain areas consisting of dorsal anterior cingulate cortex (dACC), anterior insula, and intraparietal sulcus, independent of task specifics. Crucially, we also show that effort-induced functional connectivity between dACC and task-relevant areas tracks specific task demands. These results demonstrate how brain regions specialized to solve a task may be energized by dACC when effort demand is high.
Collapse
|
11
|
Fallon SJ, Dolfen N, Parolo F, Zokaei N, Husain M. Task-irrelevant financial losses inhibit the removal of information from working memory. Sci Rep 2019; 9:1673. [PMID: 30737421 PMCID: PMC6368543 DOI: 10.1038/s41598-018-36826-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2018] [Accepted: 11/23/2018] [Indexed: 11/09/2022] Open
Abstract
The receipt of financial rewards or penalties - though task-irrelevant - may exert an obligatory effect on manipulating items in working memory (WM) by constraining a forthcoming shift in attention or reinforcing attentional shifts that have previously occurred. Here, we adjudicate between these two hypotheses by varying – after encoding- the order in which task-irrelevant financial outcomes and cues indicating which items need to be retained in memory are presented (so called retrocues). We employed a “what-is-where” design that allowed for the fractionation of WM recall into separate components: identification, precision and binding (between location and identity). Principally, valence-dependent effects were observed only for precision and binding, but only when outcomes were presented before, rather than after, the retrocue. Specifically, task-irrelevant financial losses presented before the retrocue caused a systematic breakdown in binding (misbinding), whereby the features of cued and non-cued memoranda became confused, i.e., the features that made up relevant memoranda were displaced by those of non-cued (irrelevant) items. A control experiment, in which outcomes but no cues were presented, failed to produce the same effects, indicating that the inclusion of retrocues were necessary for generating this effect. These results show that the receipt of financial penalties – even when uncoupled to performance – can prevent irrelevant information from being effectively pruned from WM. These results illustrate the importance of reward-related processing to controlling the contents of WM.
Collapse
Affiliation(s)
- Sean James Fallon
- Department of Experimental Psychology, University of Oxford, Oxford, UK.
| | - Nina Dolfen
- Department of Experimental Psychology, University of Oxford, Oxford, UK.,Motor Control & Neuroplasticity Research Group, KU Leuven, Leuven, Belgium
| | - Francesca Parolo
- Department of Experimental Psychology, University of Oxford, Oxford, UK
| | - Nahid Zokaei
- Department of Experimental Psychology, University of Oxford, Oxford, UK.,Oxford Centre for Human Brain Activity, Wellcome Centre for Integrative Neuroimaging, Department of Psychiatry, University of Oxford, Oxford, UK
| | - Masud Husain
- Department of Experimental Psychology, University of Oxford, Oxford, UK.,Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, Oxford, UK
| |
Collapse
|
12
|
Scholl J, Klein-Flügge M. Understanding psychiatric disorder by capturing ecologically relevant features of learning and decision-making. Behav Brain Res 2018; 355:56-75. [PMID: 28966147 PMCID: PMC6152580 DOI: 10.1016/j.bbr.2017.09.050] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Revised: 07/24/2017] [Accepted: 09/27/2017] [Indexed: 01/06/2023]
Abstract
Recent research in cognitive neuroscience has begun to uncover the processes underlying increasingly complex voluntary behaviours, including learning and decision-making. Partly this success has been possible by progressing from simple experimental tasks to paradigms that incorporate more ecological features. More specifically, the premise is that to understand cognitions and brain functions relevant for real life, we need to introduce some of the ecological challenges that we have evolved to solve. This often entails an increase in task complexity, which can be managed by using computational models to help parse complex behaviours into specific component mechanisms. Here we propose that using computational models with tasks that capture ecologically relevant learning and decision-making processes may provide a critical advantage for capturing the mechanisms underlying symptoms of disorders in psychiatry. As a result, it may help develop mechanistic approaches towards diagnosis and treatment. We begin this review by mapping out the basic concepts and models of learning and decision-making. We then move on to consider specific challenges that emerge in realistic environments and describe how they can be captured by tasks. These include changes of context, uncertainty, reflexive/emotional biases, cost-benefit decision-making, and balancing exploration and exploitation. Where appropriate we highlight future or current links to psychiatry. We particularly draw examples from research on clinical depression, a disorder that greatly compromises motivated behaviours in real-life, but where simpler paradigms have yielded mixed results. Finally, we highlight several paradigms that could be used to help provide new insights into the mechanisms of psychiatric disorders.
Collapse
Affiliation(s)
- Jacqueline Scholl
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3SR, United Kingdom.
| | - Miriam Klein-Flügge
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3SR, United Kingdom.
| |
Collapse
|
13
|
Garcia-Lazaro HG, Bartsch MV, Boehler CN, Krebs RM, Donohue SE, Harris JA, Schoenfeld MA, Hopf JM. Dissociating Reward- and Attention-driven Biasing of Global Feature-based Selection in Human Visual Cortex. J Cogn Neurosci 2018; 31:469-481. [PMID: 30457917 DOI: 10.1162/jocn_a_01356] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Objects that promise rewards are prioritized for visual selection. The way this prioritization shapes sensory processing in visual cortex, however, is debated. It has been suggested that rewards motivate stronger attentional focusing, resulting in a modulation of sensory selection in early visual cortex. An open question is whether those reward-driven modulations would be independent of similar modulations indexing the selection of attended features that are not associated with reward. Here, we use magnetoencephalography in human observers to investigate whether the modulations indexing global color-based selection in visual cortex are separable for target- and (monetary) reward-defining colors. To assess the underlying global color-based activity modulation, we compare the event-related magnetic field response elicited by a color probe in the unattended hemifield drawn either in the target color, the reward color, both colors, or a neutral task-irrelevant color. To test whether target and reward relevance trigger separable modulations, we manipulate attention demands on target selection while keeping reward-defining experimental parameters constant. Replicating previous observations, we find that reward and target relevance produce almost indistinguishable gain modulations in ventral extratriate cortex contralateral to the unattended color probe. Importantly, increasing attention demands on target discrimination increases the response to the target-defining color, whereas the response to the rewarded color remains largely unchanged. These observations indicate that, although task relevance and reward influence the very same feature-selective area in extrastriate visual cortex, the associated modulations are largely independent.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Jens-Max Hopf
- Otto-von-Guericke University Magdeburg.,Leibniz Institute for Neurobiology, Magdeburg
| |
Collapse
|
14
|
Anderson BA. Neurobiology of value-driven attention. Curr Opin Psychol 2018; 29:27-33. [PMID: 30472540 DOI: 10.1016/j.copsyc.2018.11.004] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2018] [Revised: 10/24/2018] [Accepted: 11/08/2018] [Indexed: 01/30/2023]
Abstract
What we pay attention to is influenced by reward learning. Converging evidence points to the idea that associative reward learning changes how visual stimuli are processed in the brain, rendering learned reward cues difficult to ignore. Behavioral evidence distinguishes value-driven attention from other established control mechanisms, suggesting a distinct underlying neurobiological process. Recently, studies have begun to explore the neural substrates of this value-driven attention mechanism. Here, I review the progress that has been made in this area, and synthesize the findings to provide an integrative account of the neurobiology of value-driven attention. The proposed account can explain both attentional capture by previously rewarded targets and the modulatory effect of reward on priming, as well as the decoupling of reward history and prior task relevance in value-driven attention.
Collapse
|
15
|
Abstract
Cognitive flexibility refers to the ability to quickly reconfigure our mind, like when we switch between different tasks. This review highlights recent evidence showing that cognitive flexibility can be conditioned by simple incentives typically known to drive lower-level learning, such as stimulus-response associations. Cognitive flexibility can also become associated with, and triggered by, bottom-up contextual cues in our environment, including subliminal cues. Therefore, we suggest that the control functions that mediate cognitive flexibility are grounded in, and guided by, basic associative learning mechanisms, and abide by the same learning principles as more low-level forms of behavior. Such a learning perspective on cognitive flexibility offers new directions and important implications for further research, theory, and applications.
Collapse
Affiliation(s)
- Senne Braem
- Department of Experimental Psychology, Ghent University, Ghent, 9000, Belgium
| | - Tobias Egner
- Department of Psychology and Neuroscience, Duke University, Durham, North Carolina, 27708, USA
| |
Collapse
|
16
|
van de Vijver I, van Driel J, Hillebrand A, Cohen MX. Interactions between frontal and posterior oscillatory dynamics support adjustment of stimulus processing during reinforcement learning. Neuroimage 2018; 181:170-181. [PMID: 29990582 DOI: 10.1016/j.neuroimage.2018.07.014] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Revised: 06/15/2018] [Accepted: 07/05/2018] [Indexed: 11/29/2022] Open
Abstract
Reinforcement learning (RL) in humans is subserved by a network of striatal and frontal brain areas. The electrophysiological signatures of feedback evaluation are increasingly well understood, but how those signatures relate to the use of feedback to guide subsequent behavioral adjustment remains unclear. One mechanism for post-feedback behavioral optimization is the modulation of sensory processing. We used source-reconstructed MEG to test whether feedback affects the interactions between sources of oscillatory activity in the learning network and task-relevant stimulus-processing areas. Participants performed a probabilistic RL task in which they learned associations between colored faces and response buttons using trial-and-error feedback. Delta-band (2-4 Hz) and theta-band (4-8 Hz) power in multiple frontal regions were sensitive to feedback valence. Low and high beta-band power (12-20 and 20-30 Hz) in occipital, parietal, and temporal regions differentiated between color and face information. Consistent with our hypothesis, single-trial power-power correlations between frontal and posterior-sensory areas were modulated by the interaction between feedback valence and the relevant stimulus characteristic (color versus identity). These results suggest that long-range oscillatory coupling supports post-feedback updating of stimulus processing.
Collapse
Affiliation(s)
- Irene van de Vijver
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands; Radboud University, Behavioural Science Institute, Nijmegen, The Netherlands.
| | - Joram van Driel
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands; Vrije Universiteit, Department of Cognitive Psychology, Amsterdam, The Netherlands
| | - Arjan Hillebrand
- Department of Clinical Neurophysiology and Magnetoencephalography Center, VU University Medical Center, Amsterdam, The Netherlands
| | - Michael X Cohen
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands
| |
Collapse
|
17
|
Meffert H, Penner E, VanTieghem MR, Sypher I, Leshin J, Blair RJR. The role of ventral striatum in reward-based attentional bias. Brain Res 2018; 1689:89-97. [DOI: 10.1016/j.brainres.2018.03.036] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Revised: 03/29/2018] [Accepted: 03/30/2018] [Indexed: 01/22/2023]
|
18
|
Abstract
Odor–reward association during appetitive learning is a fundamental process that requires multiple forms of plasticity. In the adult olfactory bulb, the continual production of newborn interneurons contributes to the functional plasticity of the system, placing the newcomers in a key position to participate in olfactory associative learning. Here, we demonstrate that adult-born neurons, but not preexisting ones, contain information about learned positive value. Moreover, specific heightening of this signal improves associative learning and odor value update and is enough in some cases to trigger behavior even without odor stimulus. Collectively, our findings show an important role of this adult-born interneuron population in odor–reward association and unveil the relevance of odor value encoding at early stages of sensory processing. Olfaction is an important sensory modality driving fundamental behaviors. During odor-dependent learning, a positive value is commonly assigned to an odorant, and multiple forms of plasticity are involved when such odor–reward associations are formed. In rodents, one of the mechanisms underlying plasticity in the olfactory bulb consists in recruiting new neurons daily throughout life. However, it is still unknown whether adult-born neurons might participate in encoding odor value. Here, we demonstrate that exposure to reward-associated odors specifically increases activity of adult-born neurons but not preexisting neurons. Remarkably, adult-born neuron activation during rewarded odor presentation heightens discrimination learning and enhances the ability to update the odor value during reversal association. Moreover, in some cases, activation of this interneuron population can trigger olfactory learning without sensory stimulation. Taken together, our results show a specific involvement of adult-born neurons in facilitating odor–reward association during adaptive learning.
Collapse
|
19
|
Hwang S, Meffert H, VanTieghem MR, White SF, Sinclair S, Bookheimer SY, Blair J. Neurodevelopmental Changes in Social Reinforcement Processing: A Functional Magnetic Resonance Imaging Study. CLINICAL PSYCHOPHARMACOLOGY AND NEUROSCIENCE 2017; 15:369-381. [PMID: 29073749 PMCID: PMC5678476 DOI: 10.9758/cpn.2017.15.4.369] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Revised: 06/24/2017] [Accepted: 09/04/2017] [Indexed: 01/10/2023]
Abstract
Objective In the current study we investigated neurodevelopmental changes in response to social and non-social reinforcement. Methods Fifty-three healthy participants including 16 early adolescents (age, 10-15 years), 16 late adolescents (age, 15-18 years), and 21 young adults (age, 21-25 years) completed a social/non-social reward learning task while undergoing functional magnetic resonance imaging. Participants responded to fractal image stimuli and received social or non-social reward/non-rewards according to their accuracy. ANOVAs were conducted on both the blood oxygen level dependent response data and the product of a context-dependent psychophysiological interaction (gPPI) analysis involving ventromedial prefrontal cortex (vmPFC) and bilateral insula cortices as seed regions. Results Early adolescents showed significantly increased activation in the amygdala and anterior insula cortex in response to non-social monetary rewards relative to both social reward/non-reward and monetary non-rewards compared to late adolescents and young adults. In addition, early adolescents showed significantly more positive connectivity between the vmPFC/bilateral insula cortices seeds and other regions implicated in reinforcement processing (the amygdala, posterior cingulate cortex, insula cortex, and lentiform nucleus) in response to non-reward and especially social non-reward, compared to late adolescents and young adults. Conclusion It appears that early adolescence may be marked by: (i) a selective increase in responsiveness to non-social, relative to social, rewards; and (ii) enhanced, integrated functioning of reinforcement circuitry for non-reward, and in particular, with respect to posterior cingulate and insula cortices, for social non-reward.
Collapse
Affiliation(s)
- Soonjo Hwang
- Department of Psychiatry, University of Nebraska Medical Center, Omaha, NE, USA
| | - Harma Meffert
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | | | - Stuart F White
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Stephen Sinclair
- Section on Affective Cognitive Neuroscience, National Institute of Mental Health, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
| | - Susan Y Bookheimer
- Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - James Blair
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| |
Collapse
|
20
|
Braem S. Conditioning task switching behavior. Cognition 2017; 166:272-276. [DOI: 10.1016/j.cognition.2017.05.037] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2017] [Revised: 05/24/2017] [Accepted: 05/29/2017] [Indexed: 01/10/2023]
|
21
|
Reward Selectively Modulates the Lingering Neural Representation of Recently Attended Objects in Natural Scenes. J Neurosci 2017. [PMID: 28630254 DOI: 10.1523/jneurosci.0684-17.2017] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Theories of reinforcement learning and approach behavior suggest that reward can increase the perceptual salience of environmental stimuli, ensuring that potential predictors of outcome are noticed in the future. However, outcome commonly follows visual processing of the environment, occurring even when potential reward cues have long disappeared. How can reward feedback retroactively cause now-absent stimuli to become attention-drawing in the future? One possibility is that reward and attention interact to prime lingering visual representations of attended stimuli that sustain through the interval separating stimulus and outcome. Here, we test this idea using multivariate pattern analysis of fMRI data collected from male and female humans. While in the scanner, participants searched for examples of target categories in briefly presented pictures of cityscapes and landscapes. Correct task performance was followed by reward feedback that could randomly have either high or low magnitude. Analysis showed that high-magnitude reward feedback boosted the lingering representation of target categories while reducing the representation of nontarget categories. The magnitude of this effect in each participant predicted the behavioral impact of reward on search performance in subsequent trials. Other analyses show that sensitivity to reward-as expressed in a personality questionnaire and in reactivity to reward feedback in the dopaminergic midbrain-predicted reward-elicited variance in lingering target and nontarget representations. Credit for rewarding outcome thus appears to be assigned to the target representation, causing the visual system to become sensitized for similar objects in the future.SIGNIFICANCE STATEMENT How do reward-predictive visual stimuli become salient and attention-drawing? In the real world, reward cues precede outcome and reward is commonly received long after potential predictors have disappeared. How can the representation of environmental stimuli be affected by outcome that occurs later in time? Here, we show that reward acts on lingering representations of environmental stimuli that sustain through the interval between stimulus and outcome. Using naturalistic scene stimuli and multivariate pattern analysis of fMRI data, we show that reward boosts the representation of attended objects and reduces the representation of unattended objects. This interaction of attention and reward processing acts to prime vision for stimuli that may serve to predict outcome.
Collapse
|
22
|
Abstract
Reward learning is known to influence the automatic capture of attention. This study examined how the rate of learning, after high- or low-value reward outcomes, can influence future transfers into value-driven attentional capture. Participants performed an instrumental learning task that was directly followed by an attentional capture task. A hierarchical Bayesian reinforcement model was used to infer individual differences in learning from high or low reward. Results showed a strong relationship between high-reward learning rates (or the weight that is put on learning after a high reward) and the magnitude of attentional capture with high-reward colors. Individual differences in learning from high or low rewards were further related to performance differences when high- or low-value distractors were present. These findings provide novel insight into the development of value-driven attentional capture by showing how information updating after desired or undesired outcomes can influence future deployments of automatic attention.
Collapse
|
23
|
Learning to be inflexible: Enhanced attentional biases in Parkinson's disease. Cortex 2016; 82:24-34. [DOI: 10.1016/j.cortex.2016.05.005] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Revised: 04/12/2016] [Accepted: 05/12/2016] [Indexed: 01/21/2023]
|
24
|
Abstract
Recent methodological and conceptual advances have led to a fundamental reappraisal of the nature of visual working memory (WM). A large corpus of evidence now suggests that there might not be a hard limit on the number of items that can be stored. Instead, WM may be better captured by a highly limited––but flexible––resource model. More resource can be allocated to prioritized items but, crucially, at a cost of reduced recall precision for other stored items. Expectations may modulate resource distribution, for example, through neural oscillations in the alpha band increasing inhibition of irrelevant cortical regions. Our understanding of the neural architecture of WM is also undergoing radical revision. Whereas the prefrontal cortex has previously dominated research endeavors, other cortical regions, such as early visual areas, are now considered to make an essential contribution, for example holding one or more items in a privileged state or “focus of attention” within WM. By contrast, the striatum is increasingly viewed as crucial in determining why and how items are gated into memory, while the hippocampus, it has controversially been argued, might be critical in the formation of temporally resilient conjunctions across features of stored items in WM.
Collapse
Affiliation(s)
- Sean James Fallon
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Nahid Zokaei
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom.,Department of Psychiatry, University of Oxford, Oxford, United Kingdom
| | - Masud Husain
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom.,Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, Oxford, United Kingdom
| |
Collapse
|
25
|
Itthipuripat S, Cha K, Rangsipat N, Serences JT. Value-based attentional capture influences context-dependent decision-making. J Neurophysiol 2015; 114:560-9. [PMID: 25995350 DOI: 10.1152/jn.00343.2015] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2015] [Accepted: 05/19/2015] [Indexed: 11/22/2022] Open
Abstract
Normative theories posit that value-based decision-making is context independent. However, decisions between two high-value options can be suboptimally biased by the introduction of a third low-value option. This context-dependent modulation is consistent with the divisive normalization of the value of each stimulus by the total value of all stimuli. In addition, an independent line of research demonstrates that pairing a stimulus with a high-value outcome can lead to attentional capture that can mediate the efficiency of visual information processing. Here we tested the hypothesis that value-based attentional capture interacts with value-based normalization to influence the optimality of decision-making. We used a binary-choice paradigm in which observers selected between two targets and the color of each target indicated the magnitude of their reward potential. Observers also had to simultaneously ignore a task-irrelevant distractor rendered in a color that was previously associated with a specific reward magnitude. When the color of the task-irrelevant distractor was previously associated with a high reward, observers responded more slowly and less optimally. Moreover, as the learned value of the distractor increased, electrophysiological data revealed an attenuation of the lateralized N1 and N2Pc responses evoked by the relevant choice stimuli and an attenuation of the late positive deflection (LPD). Collectively, these behavioral and electrophysiological data suggest that value-based attentional capture and value-based normalization jointly mediate the influence of context on free-choice decision-making.
Collapse
Affiliation(s)
- Sirawaj Itthipuripat
- Neurosciences Graduate Program, University of California, San Diego, La Jolla, California; and
| | - Kexin Cha
- Department of Psychology, University of California, San Diego, La Jolla, California
| | - Napat Rangsipat
- Department of Psychology, University of California, San Diego, La Jolla, California
| | - John T Serences
- Neurosciences Graduate Program, University of California, San Diego, La Jolla, California; and Department of Psychology, University of California, San Diego, La Jolla, California
| |
Collapse
|