1. Babiker S, Luzzi F, Mittner M, Csifcsák G. The Interplay Between Experimental Heat Pain and Noninvasive Stimulation of the Medial Prefrontal Cortex on Reinforcement Learning With Manipulated Outcome Contingencies. Eur J Neurosci 2025; 61:e70089. [PMID: 40162800 DOI: 10.1111/ejn.70089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2024] [Revised: 02/20/2025] [Accepted: 03/16/2025] [Indexed: 04/02/2025]
Abstract
Pain negatively affects several cognitive abilities, but knowledge about its effect on reinforcement learning (RL) is limited. During RL, instrumental choices can be influenced by heuristic tendencies to approach rewards or inhibit actions when facing potentially aversive events, introducing "Pavlovian bias" in behavior. Recent studies suggest that compromised outcome controllability enhances Pavlovian bias, a phenomenon that may be linked to suboptimal decision-making in learned helplessness (LH). Since LH is common in chronic pain syndromes, this study sought to establish a link between experimental heat pain (EHP), disrupted reward/loss contingencies, and RL performance in healthy adults. In addition, we investigated if intermittent theta burst stimulation (iTBS) above the medial prefrontal/dorsal anterior cingulate cortex (mPFC/dACC) alleviates the deleterious effects of EHP on choice behavior. In a preregistered, 2 × 2 between-group, double-blind study (N = 100), healthy adult participants underwent three blocks of an orthogonalized Go/NoGo task with two interleaved bouts of active or sham iTBS, and either EHP or warm skin stimulation combined with manipulated response-outcome contingency during the task. Although EHP did not impact response accuracy, it invigorated actions for rewards, reflecting enhanced Pavlovian bias. Whereas two bouts of iTBS attenuated Pavlovian tendencies, this effect was counteracted by EHP, indicating antagonistic effects of pain and iTBS-modulated mPFC activity on Pavlovian-instrumental interactions. Surprisingly, EHP and iTBS exerted largely similar effects on other latent parameters of RL (go-bias, learning rate, and exploration) in a manner that resembled LH. These findings shed light on the role of experimental pain and mPFC/dACC activity in LH-like choice behavior.
Affiliation(s)
- Samy Babiker
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
- Federica Luzzi
  - Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy
  - Center for Translational Neurophysiology of Speech and Communication, Italian Institute of Technology, Ferrara, Italy
- Matthias Mittner
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
- Gábor Csifcsák
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
2. Sequestro M, Serfaty J, Grèzes J, Mennella R. Social threat avoidance depends on action-outcome predictability. Communications Psychology 2024; 2:100. [PMID: 39462095 PMCID: PMC11512816 DOI: 10.1038/s44271-024-00152-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2024] [Accepted: 10/17/2024] [Indexed: 10/28/2024]
Abstract
Avoiding threatening individuals is pivotal for adaptation to our social environment. Yet, it remains unclear whether social threat avoidance is subtended by goal-directed processes, in addition to stimulus-response associations. To test this, we manipulated outcome predictability during spontaneous approach/avoidance decisions from avatars displaying angry facial expressions. Across three virtual reality experiments, we showed that participants avoided more often when they could predict the outcome of their actions, indicating goal-directed processes. However, above-chance avoidance rate when facing unpredictable outcomes suggested that stimulus-response associations also played a role. We identified two latent classes of participants: the "goal-directed class" showed above-chance avoidance only in the predictable condition, while the "stimulus-response class" showed no credible difference between conditions but had a higher overall avoidance rate. The goal-directed class exhibited greater cardiac deceleration in the predictable condition, associated with better value integration in decision-making. Computationally, this class had an increased drift-rate in the predictable condition, reflecting increased value estimation of threat avoidance. In contrast, the stimulus-response class showed higher responsiveness to threat, indicated by increased drift-rate for avoidance and increased muscular activity at response time. These results support the central role of goal-directed processes in social threat avoidance and reveal its physiological and computational correlates.
Affiliation(s)
- Matteo Sequestro
  - Cognitive and Computational Neuroscience Laboratory (LNC 2), Inserm U960, Department of Cognitive Studies, École Normale Supérieure, PSL University, 29 rue d'Ulm, 75005, Paris, France
- Jade Serfaty
  - Cognitive and Computational Neuroscience Laboratory (LNC 2), Inserm U960, Department of Cognitive Studies, École Normale Supérieure, PSL University, 29 rue d'Ulm, 75005, Paris, France
- Julie Grèzes
  - Cognitive and Computational Neuroscience Laboratory (LNC 2), Inserm U960, Department of Cognitive Studies, École Normale Supérieure, PSL University, 29 rue d'Ulm, 75005, Paris, France
- Rocco Mennella
  - Cognitive and Computational Neuroscience Laboratory (LNC 2), Inserm U960, Department of Cognitive Studies, École Normale Supérieure, PSL University, 29 rue d'Ulm, 75005, Paris, France
  - Laboratory of the Interactions between Cognition Action and Emotion (LICAÉ, EA2931), UFR STAPS, Université Paris Nanterre, 200 avenue de La République, 92001, Nanterre, Cedex, France
3. Zika O, Appel J, Klinge C, Shkreli L, Browning M, Wiech K, Reinecke A. Reduction of Aversive Learning Rates in Pavlovian Conditioning by Angiotensin II Antagonist Losartan: A Randomized Controlled Trial. Biol Psychiatry 2024; 96:247-255. [PMID: 38309320 DOI: 10.1016/j.biopsych.2024.01.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/16/2023] [Revised: 01/12/2024] [Accepted: 01/22/2024] [Indexed: 02/05/2024]
Abstract
BACKGROUND: Angiotensin receptor blockade has been linked to aspects of aversive learning and memory formation and to the prevention of posttraumatic stress disorder symptom development. METHODS: We investigated the influence of the angiotensin receptor blocker losartan on aversive Pavlovian conditioning using a probabilistic learning paradigm. In a double-blind, randomized, placebo-controlled design, we tested 45 (18 female) healthy volunteers during a baseline session, after application of losartan or placebo (drug session), and during a follow-up session. During each session, participants engaged in a task in which they had to predict the probability of an electrical stimulation on every trial while the true shock contingencies switched repeatedly between phases of high and low shock threat. Computational reinforcement learning models were used to investigate learning dynamics. RESULTS: Acute administration of losartan significantly reduced participants' adjustment during both low-to-high and high-to-low threat changes. This was driven by reduced aversive learning rates in the losartan group during the drug session compared with baseline. The 50-mg drug dose did not induce reduction of blood pressure or change in reaction times, ruling out a general reduction in attention and engagement. Decreased adjustment of aversive expectations was maintained at a follow-up session 24 hours later. CONCLUSIONS: This study shows that losartan acutely reduces Pavlovian learning in aversive environments, thereby highlighting a potential role of the renin-angiotensin system in anxiety development.
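The abstract does not state which learning model was fitted. As a rough illustration of what a "learning rate" means in this kind of task, the sketch below uses a plain Rescorla-Wagner update, which is an assumption here and not necessarily the authors' model. The function name, the starting estimate `p0`, and the outcome sequence are hypothetical.

```python
def rescorla_wagner(outcomes, alpha, p0=0.5):
    """Return the shock-probability estimate held before each trial.

    outcomes: sequence of 0 (no shock) or 1 (shock)
    alpha:    learning rate in [0, 1]; smaller values mean slower adjustment
    p0:       initial estimate (hypothetical default)
    """
    p = p0
    estimates = []
    for outcome in outcomes:
        estimates.append(p)
        p += alpha * (outcome - p)  # move the estimate by a fraction of the prediction error
    return estimates


# Hypothetical schedule: a low-threat phase followed by a switch to high threat.
outcomes = [0] * 10 + [1] * 10
fast = rescorla_wagner(outcomes, alpha=0.4)
slow = rescorla_wagner(outcomes, alpha=0.1)
```

With the lower learning rate, the estimate lags further behind after the low-to-high switch, which is the kind of reduced adjustment the abstract attributes to lower aversive learning rates.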
Affiliation(s)
- Ondrej Zika
  - Max Planck Institute for Human Development, Berlin, Germany
- Judith Appel
  - Behavioural Science Institute, Radboud University Nijmegen, Nijmegen, the Netherlands
- Corinna Klinge
  - Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Lorika Shkreli
  - Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Michael Browning
  - Department of Psychiatry, University of Oxford, Oxford, United Kingdom; Oxford Health NHS Trust, Warneford Hospital, Oxford, United Kingdom
- Katja Wiech
  - Wellcome Centre for Integrative Functional Neuroimaging, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
- Andrea Reinecke
  - Department of Psychiatry, University of Oxford, Oxford, United Kingdom; Oxford Health NHS Trust, Warneford Hospital, Oxford, United Kingdom
4. Colas JT, O'Doherty JP, Grafton ST. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts. PLoS Comput Biol 2024; 20:e1011950. [PMID: 38552190 PMCID: PMC10980507 DOI: 10.1371/journal.pcbi.1011950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Accepted: 02/26/2024] [Indexed: 04/01/2024] Open
Abstract
Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as those of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning. With systematic comparison and falsification of computational models, human choices were tested for signatures of parallel modules representing not only an enhanced form of generalized reinforcement learning but also action bias and hysteresis. We found evidence for substantial differences in bias and hysteresis across participants, even comparable in magnitude to the individual differences in learning. Individuals who did not learn well revealed the greatest biases, but those who did learn accurately were also significantly biased. The direction of hysteresis varied among individuals as repetition or, more commonly, alternation biases persisting from multiple previous actions. Considering that these actions were button presses with trivial motor demands, the idiosyncratic forces biasing sequences of action choices were robust enough to suggest ubiquity across individuals and across tasks requiring various actions. In light of how bias and hysteresis function as a heuristic for efficient control that adapts to uncertainty or low motivation by minimizing the cost of effort, these phenomena broaden the consilient theory of a mixture of experts to encompass a mixture of expert and nonexpert controllers of behavior.
Affiliation(s)
- Jaron T. Colas
  - Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
  - Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
  - Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
- John P. O'Doherty
  - Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
  - Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
- Scott T. Grafton
  - Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
5. Raab HA, Goldway N, Foord C, Hartley CA. Adolescents flexibly adapt action selection based on controllability inferences. Learn Mem 2024; 31:a053901. [PMID: 38527752 PMCID: PMC11000582 DOI: 10.1101/lm.053901.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/13/2023] [Accepted: 02/19/2024] [Indexed: 03/27/2024]
Abstract
From early in life, we encounter both controllable environments, in which our actions can causally influence the reward outcomes we experience, and uncontrollable environments, in which they cannot. Environmental controllability is theoretically proposed to organize our behavior. In controllable contexts, we can learn to proactively select instrumental actions that bring about desired outcomes. In uncontrollable environments, Pavlovian learning enables hard-wired, reflexive reactions to anticipated, motivationally salient events, providing "default" behavioral responses. Previous studies characterizing the balance between Pavlovian and instrumental learning systems across development have yielded divergent findings, with some studies observing heightened expression of Pavlovian learning during adolescence and others observing a reduced influence of Pavlovian learning during this developmental stage. In this study, we aimed to investigate whether a theoretical model of controllability-dependent arbitration between learning systems might explain these seemingly divergent findings in the developmental literature, with the specific hypothesis that adolescents' action selection might be particularly sensitive to environmental controllability. To test this hypothesis, 90 participants, aged 8-27, performed a probabilistic-learning task that enables estimation of Pavlovian influence on instrumental learning, across both controllable and uncontrollable conditions. We fit participants' data with a reinforcement-learning model in which controllability inferences adaptively modulate the dominance of Pavlovian versus instrumental control. Relative to children and adults, adolescents exhibited greater flexibility in calibrating the expression of Pavlovian bias to the degree of environmental controllability. These findings suggest that sensitivity to environmental reward statistics that organize motivated behavior may be heightened during adolescence.
Affiliation(s)
- Hillary A Raab
  - Department of Psychology, New York University, New York, New York 10003, USA
- Noam Goldway
  - Department of Psychology, New York University, New York, New York 10003, USA
- Careen Foord
  - Center for Neural Science, New York University, New York, New York 10003, USA
- Catherine A Hartley
  - Department of Psychology, New York University, New York, New York 10003, USA
  - Center for Neural Science, New York University, New York, New York 10003, USA
6. Payzan-LeNestour E, Doran J. Craving money? Evidence from the laboratory and the field. Science Advances 2024; 10:eadi5034. [PMID: 38215199 PMCID: PMC10786414 DOI: 10.1126/sciadv.adi5034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2023] [Accepted: 12/14/2023] [Indexed: 01/14/2024]
Abstract
Continuing to gamble despite harmful consequences has plagued human life in many ways, from loss-chasing in problem gamblers to reckless investing during stock market bubbles. Here, we propose that these anomalies in human behavior can sometimes reflect Pavlovian perturbations on instrumental behavior. To show this, we combined key elements of Pavlovian psychology literature and standard economic theory into a single model. In it, when a gambling cue such as a gaming machine or a financial asset repeatedly delivers a good outcome, the agent may start engaging with the cue even when the expected value is negative. Next, we transported the theoretical framework into an experimental task and found that participants behaved like the agent in our model. Last, we applied the model to the domain of real-world financial trading and discovered an asset-pricing anomaly suggesting that market participants are susceptible to the purported Pavlovian bias.
Affiliation(s)
- James Doran
  - University of New South Wales Business School, UNSW Sydney, Kensington NSW 2052, Australia
7. Abdul Raheem AK, Dhannoon BN. A Novel Deep Learning Model for Drug-drug Interactions. Curr Comput Aided Drug Des 2024; 20:666-672. [PMID: 38804324 DOI: 10.2174/0115734099265663230926064638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/04/2023] [Revised: 07/29/2023] [Accepted: 08/16/2023] [Indexed: 05/29/2024]
Abstract
INTRODUCTION: Drug-drug interactions (DDIs) can lead to adverse events and compromised treatment efficacy, which emphasizes the need for accurate prediction and understanding of these interactions. METHODS: In this paper, we propose a novel approach for DDI prediction using two separate message-passing neural network (MPNN) models, each focused on one drug in a pair. By capturing the unique characteristics of each drug and their interactions, the proposed method aims to improve the accuracy of DDI prediction. The outputs of the individual MPNN models are combined to integrate the information from both drugs and their molecular features. Evaluating the proposed method on a comprehensive dataset, we demonstrate its superior performance with an accuracy of 0.90, an area under the curve (AUC) of 0.99, and an F1-score of 0.80. These results highlight the effectiveness of the proposed approach in accurately identifying potential drug-drug interactions. RESULTS: The use of two separate MPNN models offers a flexible framework for capturing drug characteristics and interactions, contributing to our understanding of DDIs. The findings of this study have significant implications for patient safety and personalized medicine, with the potential to optimize treatment outcomes by preventing adverse events. CONCLUSION: Further research and validation on larger datasets and real-world scenarios are necessary to explore the generalizability and practicality of this approach.
Affiliation(s)
- Ali K Abdul Raheem
  - Department of Software, College of Information Technology, University of Babylon, Hillah, Babil, Iraq
  - University of Warith Al-Anbiyaa, Karbala, Iraq
- Ban N Dhannoon
  - Department of Computer Science, College of Science, Al-Nahrain University, Baghdad, Iraq
8. Queirazza F, Steele JD, Krishnadas R, Cavanagh J, Philiastides MG. Functional Magnetic Resonance Imaging Signatures of Pavlovian and Instrumental Valuation Systems during a Modified Orthogonalized Go/No-go Task. J Cogn Neurosci 2023; 35:2089-2109. [PMID: 37788326 DOI: 10.1162/jocn_a_02062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/05/2023]
Abstract
Motivational (i.e., Pavlovian) values interfere with instrumental responding and can lead to suboptimal decision-making. In humans, task-based neuroimaging studies have only recently started illuminating the functional neuroanatomy of Pavlovian biasing of instrumental control. To provide a mechanistic understanding of the neural dynamics underlying the Pavlovian and instrumental valuation systems, analysis of neuroimaging data has been informed by computational modeling of conditioned behavior. Nonetheless, because of collinearities in Pavlovian and instrumental predictions, previous research failed to tease out hemodynamic activity that is parametrically and dynamically modulated by coexistent Pavlovian and instrumental value expectations. Moreover, neural correlates of Pavlovian to instrumental transfer effects have so far only been identified in extinction (i.e., in the absence of learning). In this study, we devised a modified version of the orthogonalized go/no-go paradigm, which introduced Pavlovian-only catch trials to better disambiguate trial-by-trial Pavlovian and instrumental predictions in both sexes. We found that hemodynamic activity in the ventromedial pFC covaried uniquely with the model-derived Pavlovian value expectations. Notably, modulation of neural activity encoding for instrumental predictions in the supplementary motor cortex was linked to successful action selection in conflict conditions. Furthermore, hemodynamic activity in regions pertaining to the limbic system and medial pFC was correlated with synergistic Pavlovian and instrumental predictions and improved conditioned behavior during congruent trials. Altogether, our results provide new insights into the functional neuroanatomy of decision-making and corroborate the validity of our variant of the orthogonalized go/no-go task as a behavioral assay of the Pavlovian and instrumental valuation systems.
9. Kim H, Hur JK, Kwon M, Kim S, Zoh Y, Ahn WY. Causal role of the dorsolateral prefrontal cortex in modulating the balance between Pavlovian and instrumental systems in the punishment domain. PLoS One 2023; 18:e0286632. [PMID: 37267307 PMCID: PMC10237433 DOI: 10.1371/journal.pone.0286632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2022] [Accepted: 05/19/2023] [Indexed: 06/04/2023] Open
Abstract
Previous literature suggests that a balance between Pavlovian and instrumental decision-making systems is critical for optimal decision-making. Pavlovian bias (i.e., approaching reward-predictive stimuli and avoiding punishment-predictive stimuli) often contrasts with the instrumental response. Although recent neuroimaging studies have identified brain regions that may be related to Pavlovian bias, including the dorsolateral prefrontal cortex (dlPFC), it is unclear whether a causal relationship exists. Therefore, we investigated whether upregulation of the dlPFC using transcranial direct current stimulation (tDCS) would reduce Pavlovian bias. In this double-blind study, participants were assigned to the anodal or the sham group; they received stimulation over the right dlPFC for 3 successive days. On the last day, participants performed a reinforcement learning task known as the orthogonalized go/no-go task; this was used to assess each participant's degree of Pavlovian bias in reward and punishment domains. We used computational modeling and hierarchical Bayesian analysis to estimate model parameters reflecting latent cognitive processes, including Pavlovian bias, go bias, and choice randomness. Several computational models were compared; the model with separate Pavlovian bias parameters for reward and punishment domains demonstrated the best model fit. When using a behavioral index of Pavlovian bias, the anodal group showed significantly lower Pavlovian bias in the punishment domain, but not in the reward domain, compared with the sham group. In addition, computational modeling showed that the Pavlovian bias parameter in the punishment domain was lower in the anodal group than in the sham group, which is consistent with the behavioral findings. The anodal group also showed a lower go bias and choice randomness, compared with the sham group. These findings suggest that anodal tDCS may lead to behavioral suppression or change in Pavlovian bias in the punishment domain, which will help to improve comprehension of the causal neural mechanism.
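The abstract names the parameters (Pavlovian bias split by domain, go bias, choice randomness) but not the equations. The sketch below shows one commonly used way such parameters enter a Go/NoGo choice rule; it is an assumed formulation for illustration, not the authors' fitted model. All names (`p_go`, `pav_rew`, `pav_pun`, `noise`) and the example values are hypothetical.

```python
import math


def p_go(q_go, q_nogo, v_stim, go_bias, pav_rew, pav_pun, noise):
    """Probability of a Go response under an assumed bias-augmented choice rule.

    q_go, q_nogo: learned instrumental action values
    v_stim:       learned stimulus value (positive for reward cues, negative for punishment cues)
    go_bias:      constant added to the Go action weight
    pav_rew/pun:  Pavlovian bias weights for the reward and punishment domains
    noise:        choice randomness in [0, 1], mixing in uniform random responding
    """
    pav = pav_rew if v_stim > 0 else pav_pun   # separate bias per domain
    w_go = q_go + go_bias + pav * v_stim       # stimulus value pushes Go up or down
    w_nogo = q_nogo
    p_softmax = 1.0 / (1.0 + math.exp(-(w_go - w_nogo)))
    return (1.0 - noise) * p_softmax + noise / 2.0


# Hypothetical values: equal action values, so only the biases differ between cues.
win_cue = p_go(0.0, 0.0, 1.0, go_bias=0.2, pav_rew=0.5, pav_pun=0.5, noise=0.1)
loss_cue = p_go(0.0, 0.0, -1.0, go_bias=0.2, pav_rew=0.5, pav_pun=0.5, noise=0.1)
loss_cue_low_bias = p_go(0.0, 0.0, -1.0, go_bias=0.2, pav_rew=0.5, pav_pun=0.1, noise=0.1)
```

Under this formulation, a smaller punishment-domain weight means punishment cues suppress Go responses less, which is the direction of the group difference the abstract reports for the anodal group.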
Affiliation(s)
- Hyeonjin Kim
  - Department of Psychology, Seoul National University, Seoul, Korea
- Jihyun K. Hur
  - Department of Psychology, Yale University, New Haven, Connecticut, United States of America
- Mina Kwon
  - Department of Psychology, Seoul National University, Seoul, Korea
- Soyeon Kim
  - Department of Psychology, Seoul National University, Seoul, Korea
- Yoonseo Zoh
  - Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
- Woo-Young Ahn
  - Department of Psychology, Seoul National University, Seoul, Korea
  - Department of Brain and Cognitive Sciences, Seoul National University, Seoul, Korea
10. Sedlinská T, Bolte L, Melsæter E, Mittner M, Csifcsák G. Transcranial direct-current stimulation enhances Pavlovian tendencies during intermittent loss of control. Front Psychiatry 2023; 14:1164208. [PMID: 37229391 PMCID: PMC10203223 DOI: 10.3389/fpsyt.2023.1164208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/12/2023] [Accepted: 04/13/2023] [Indexed: 05/27/2023] Open
Abstract
Introduction: Pavlovian bias is an innate motivational tendency to approach rewards and remain passive in the face of punishment. The relative reliance on Pavlovian valuation has been found to increase when the perceived control over environmental reinforcers is compromised, leading to behavior resembling learned helplessness (LH). Methods: Sixty healthy young adults underwent a Go-NoGo reinforcement learning task and received anodal high-definition transcranial direct current stimulation (HD-tDCS) over the medial prefrontal/dorsal anterior cingulate cortex in our randomized, double-blind, sham-controlled study. Furthermore, we evaluated changes in cue-locked mid-frontal theta power derived from simultaneous electroencephalography (EEG). We hypothesized that active stimulation would reduce Pavlovian bias during manipulation of outcome controllability, and the effect would be accompanied by stronger mid-frontal theta activity, representing arbitration between choice strategies in favor of instrumental relative to Pavlovian valuation. Results: We found a progressive decrease in Pavlovian bias during and after loss of control over feedback. Active HD-tDCS counteracted this effect while not affecting the mid-frontal theta signal. Discussion: The results were at odds with our hypotheses but also with previous findings reporting LH-like patterns during and after loss of control without brain stimulation. The discrepancy may be related to different protocols for the controllability manipulation. We argue that the subjective evaluation of task controllability is crucial in mediating the balance between Pavlovian and instrumental valuation during reinforcement learning and that the medial prefrontal/dorsal anterior cingulate cortex is a key region in this respect. These findings have implications for understanding the behavioral and neural underpinnings of LH in humans.
Affiliation(s)
- Terezie Sedlinská
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
  - Psychiatric University Hospital, Zürich, Switzerland
- Lara Bolte
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
- Eirik Melsæter
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
- Matthias Mittner
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
- Gábor Csifcsák
  - Department of Psychology, UiT The Arctic University of Norway, Tromsø, Norway
11. Peng Z, He L, Wen R, Verguts T, Seger CA, Chen Q. Obsessive-compulsive disorder is characterized by decreased Pavlovian influence on instrumental behavior. PLoS Comput Biol 2022; 18:e1009945. [PMID: 36215326 PMCID: PMC9584381 DOI: 10.1371/journal.pcbi.1009945] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 02/23/2022] [Revised: 10/20/2022] [Accepted: 09/28/2022] [Indexed: 02/05/2023] Open
Abstract
Obsessive-compulsive disorder (OCD) is characterized by uncontrollable repetitive actions thought to rely on abnormalities within fundamental instrumental learning systems. We investigated cognitive and computational mechanisms underlying Pavlovian biases on instrumental behavior in both clinical OCD patients and healthy controls using a Pavlovian-Instrumental Transfer (PIT) task. PIT is typically evidenced by increased responding in the presence of a positive (previously rewarded) Pavlovian cue, and reduced responding in the presence of a negative cue. Thirty OCD patients and thirty-one healthy controls completed the Pavlovian Instrumental Transfer test, which included instrumental training, Pavlovian training for positive, negative and neutral cues, and a PIT phase in which participants performed the instrumental task in the presence of the Pavlovian cues. Modified Rescorla-Wagner models were fitted to participants' trial-by-trial data to estimate the underlying computational mechanisms and quantify individual differences during training and transfer stages. Bayesian hierarchical methods were used to estimate free parameters and compare the models. Behavioral and computational results indicated a weaker Pavlovian influence on instrumental behavior in OCD patients than in healthy controls, especially for negative Pavlovian cues. Our results contrast with the enhanced PIT effects reported for substance use disorders, another set of disorders characterized by compulsivity. A possible reason for the reduced PIT in OCD may be impairment in using the contextual information provided by the cues to appropriately adjust behavior, especially to inhibit responding when a negative cue is present. This study provides deeper insight into our understanding of deficits in OCD from the perspective of Pavlovian influences on instrumental behavior and may have implications for OCD treatment modalities focused on reducing compulsive behaviors.
Affiliation(s)
- Ziwen Peng
- Key Laboratory of Brain, Cognition and Education Sciences, Ministry of Education, Guangzhou, China
- School of Psychology, Center for Studies of Psychological Application, and Guangdong Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, China
- Department of Child Psychiatry, Shenzhen Kangning Hospital, Shenzhen University School of Medicine, Shenzhen, China
- Luning He
- School of Psychology, Center for Studies of Psychological Application, and Guangdong Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, China
- Rongzhen Wen
- School of Psychology, Center for Studies of Psychological Application, and Guangdong Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, China
- Tom Verguts
- Department of Experimental Psychology, Ghent University, Ghent, Belgium
- Carol A. Seger
- School of Psychology, Center for Studies of Psychological Application, and Guangdong Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, China
- Department of Psychology, Colorado State University, Colorado, United States of America
- * E-mail: (CS); (QC)
- Qi Chen
- Key Laboratory of Brain, Cognition and Education Sciences, Ministry of Education, Guangzhou, China
- School of Psychology, Center for Studies of Psychological Application, and Guangdong Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, China
- * E-mail: (CS); (QC)
12
Scholz V, Hook RW, Kandroodi MR, Algermissen J, Ioannidis K, Christmas D, Valle S, Robbins TW, Grant JE, Chamberlain SR, den Ouden HEM. Cortical dopamine reduces the impact of motivational biases governing automated behaviour. Neuropsychopharmacology 2022; 47:1503-1512. [PMID: 35260787 PMCID: PMC9206002 DOI: 10.1038/s41386-022-01291-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 09/30/2021] [Accepted: 02/02/2022] [Indexed: 11/09/2022]
Abstract
Motivations shape our behaviour: the promise of reward invigorates, while in the face of punishment, we hold back. Abnormalities of motivational processing are implicated in clinical disorders characterised by excessive habits and loss of top-down control, notably substance and behavioural addictions. Striatal and frontal dopamine have been hypothesised to play complementary roles in the respective generation and control of these motivational biases. However, while dopaminergic interventions have indeed been found to modulate motivational biases, these previous pharmacological studies used regionally non-selective pharmacological agents. Here, we tested the hypothesis that frontal dopamine controls the balance between Pavlovian, bias-driven automated responding and instrumentally learned action values. Specifically, we examined whether selective enhancement of cortical dopamine either (i) enables adaptive suppression of Pavlovian control when biases are maladaptive; or (ii) non-specifically modulates the degree of bias-driven automated responding. Healthy individuals (n = 35) received the catechol-o-methyltransferase (COMT) inhibitor tolcapone in a randomised, double-blind, placebo-controlled cross-over design, and completed a motivational Go NoGo task known to elicit motivational biases. In support of hypothesis (ii), tolcapone globally decreased motivational bias. Specifically, tolcapone improved performance on trials where the bias was unhelpful, but impaired performance in bias-congruent conditions. These results indicate a non-selective role for cortical dopamine in the regulation of motivational processes underpinning top-down control over automated behaviour. The findings have direct relevance to understanding neurobiological mechanisms underpinning addiction and obsessive-compulsive disorders, as well as highlighting a potential trans-diagnostic novel mechanism to address such symptoms.
Affiliation(s)
- Vanessa Scholz
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, Centre of Mental Health, University of Würzburg, Margarete-Höppel-Platz 1, 97080, Würzburg, Germany
- Roxanne W. Hook
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Mojtaba Rostami Kandroodi
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
- School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran
- Johannes Algermissen
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
- Konstantinos Ioannidis
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Cambridgeshire & Peterborough NHS Foundation Trust, Cambridge, UK
- Department of International Health, Care and Public Health Research Institute, Maastricht University, Maastricht, The Netherlands
- David Christmas
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Cambridgeshire & Peterborough NHS Foundation Trust, Cambridge, UK
- Stephanie Valle
- Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, IL, USA
- Trevor W. Robbins
- Department of Psychology, and Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, UK
- Jon E. Grant
- Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, IL, USA
- Samuel R. Chamberlain
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Department of Psychiatry, Faculty of Medicine, University of Southampton, Southampton, UK
- Southern Health NHS Foundation Trust, Southampton, UK
- Hanneke E. M. den Ouden
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
13
Csifcsák G, Bjørkøy J, Kuyateh S, Reithe H, Mittner M. Transcranial Direct Current Stimulation above the Medial Prefrontal Cortex Facilitates Decision-Making following Periods of Low Outcome Controllability. eNeuro 2021; 8:ENEURO.0041-21.2021. [PMID: 34433576 PMCID: PMC8425969 DOI: 10.1523/eneuro.0041-21.2021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/29/2021] [Revised: 08/05/2021] [Accepted: 08/09/2021] [Indexed: 11/21/2022]
Abstract
Recent studies suggest that choice behavior in reinforcement learning tasks is shaped by the level of outcome controllability. In particular, Pavlovian bias (PB) seems to be enhanced under low levels of control, manifesting in approach tendencies toward rewards and response inhibition when facing potential losses. The medial prefrontal cortex (mPFC) has been implicated both in evaluating outcome controllability and in the recruitment of cognitive control (CC) to suppress maladaptive PB during reinforcement learning. The current study tested whether high-definition transcranial direct current stimulation (HD-tDCS) above the mPFC of healthy humans can influence PB, and counteract the previously documented, deleterious behavioral effects of low outcome controllability on decision-making. In a preregistered, between-group, double-blind study (N = 103 adults, both sexes), we tested the interaction between controllability and HD-tDCS on parameters of choice behavior in a Go/NoGo task. Relative to sham stimulation, HD-tDCS resulted in more robust performance improvement following reduced control, an effect that was more pronounced in appetitive trials. In addition, we found evidence for weaker PB when HD-tDCS was administered during low controllability over outcomes. Computational modeling revealed that parameter estimates of learning rate and choice randomness were modulated by controllability, HD-tDCS and their interaction. Overall, these results highlight the potential of our HD-tDCS protocol for interfering with choice arbitration under low levels of control, resulting in more adaptive behavior.
Affiliation(s)
- Gábor Csifcsák
- Department of Psychology, UiT The Arctic University of Norway, Tromsø 9037, Norway
- Jorunn Bjørkøy
- Department of Psychology, UiT The Arctic University of Norway, Tromsø 9037, Norway
- Sarjo Kuyateh
- Department of Psychology, UiT The Arctic University of Norway, Tromsø 9037, Norway
- Haakon Reithe
- Department of Psychology, UiT The Arctic University of Norway, Tromsø 9037, Norway
- Matthias Mittner
- Department of Psychology, UiT The Arctic University of Norway, Tromsø 9037, Norway