1
|
Guo R, Teng JF, Wang YT, Yao J, Li X, Wu B, Sui JF, Long JH, Ou ZZ, He ZQ, Hu XQ, Liu SL. The parietal association cortex and its projections to the dorsal striatum are involved in histaminergic and nonhistaminergic itch processing. Brain Res Bull 2025; 226:111352. [PMID: 40274076 DOI: 10.1016/j.brainresbull.2025.111352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2025] [Revised: 04/20/2025] [Accepted: 04/22/2025] [Indexed: 04/26/2025]
Abstract
Itch is an unpleasant sensation accompanied by the urge to scratch. The act of scratching not only alleviates the itch but also activates the reward circuitry, inducing a pleasurable sensation that can perpetuate further scratching. Therefore, scratching can be characterized as both a goal-directed behavior and a reward-motivated behavior. As a key hub for sensorymotor integration and information processing in goal-directed tasks, the specific role of the posterior parietal cortex (PPC) in modulating itch remains to be elucidated. Using immunofluorescence and calcium-signal fiber photometric recordings, we found that neurons in the parietal association cortex (PtA), a subregion of the PPC, were activated during acute itch. Pharmacogenetic experiments demonstrated that both nonselective inhibition of neurons in the PtA and selective inhibition of pyramidal neuron activity in the PtA reduced the experimental itch-scratching behavior induced by subcutaneous injections of 5-HT and compound 48/80. The PtA projects to the dorsal striatum (DS), a critical component of the brain's reward circuitry, and inhibition of this pathway also diminished experimental itch-scratching behavior. Therefore, this study demonstrated for the first time that the PtA may be involved in the regulation of the goal-directed behavior of scratching of histaminergic and nonhistaminergic itch through projections to the DS.
Collapse
Affiliation(s)
- Rui Guo
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Jun-Fei Teng
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Ya-Ting Wang
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Juan Yao
- Experimental Center of Basic Medicine, Army Medical University, Chongqing 400038, China
| | - Xuan Li
- Experimental Center of Basic Medicine, Army Medical University, Chongqing 400038, China
| | - Bing Wu
- Experimental Center of Basic Medicine, Army Medical University, Chongqing 400038, China
| | - Jian-Feng Sui
- Experimental Center of Basic Medicine, Army Medical University, Chongqing 400038, China
| | - Jun-Hui Long
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Zu-Zhen Ou
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Zhi-Qiang He
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Xue-Qiang Hu
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China
| | - Shu-Lei Liu
- Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing 400020, China.
| |
Collapse
|
2
|
Chase J, Li JJ, Lin WC, Tai LH, Castro F, Collins AGE, Wilbrecht L. Genetic changes linked to two different syndromic forms of autism enhance reinforcement learning in adolescent male but not female mice. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.15.633099. [PMID: 39868311 PMCID: PMC11760717 DOI: 10.1101/2025.01.15.633099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/28/2025]
Abstract
Autism Spectrum Disorder (ASD) is characterized by restricted and repetitive behaviors and social differences, both of which may manifest, in part, from underlying differences in corticostriatal circuits and reinforcement learning. Here, we investigated reinforcement learning in mice with mutations in either Tsc2 or Shank3, both high-confidence ASD risk genes associated with major syndromic forms of ASD. Using an odor-based two-alternative forced choice (2AFC) task, we tested adolescent mice of both sexes and found male Tsc2 and Shank3B heterozygote (Het) mice showed enhanced learning performance compared to their wild type (WT) siblings. No gain of function was observed in females. Using a novel reinforcement learning (RL) based computational model to infer learning rate as well as policy-level task engagement and disengagement, we found that the gain of function in males was driven by an enhanced positive learning rate in both Tsc2 and Shank3B Het mice. The gain of function in Het males was absent when mice were trained with a probabilistic reward schedule. These findings in two ASD mouse models reveal a convergent learning phenotype that shows similar sensitivity to sex and environmental uncertainty. These data can inform our understanding of both strengths and challenges associated with autism, while providing further evidence that sex and experience of uncertainty modulate autism-related phenotypes. Significance Statement Reinforcement learning is a foundational form of learning that is widely used in behavioral interventions for autism. Here, we measured reinforcement learning in adolescent mice carrying genetic mutations linked to two different syndromic forms of autism. We found that males showed strengths in reinforcement learning compared to their wild type siblings, while females showed no differences. This gain of function in males was no longer observed when uncertainty was introduced into the reward schedule for correct choices. These findings support a model in which diverse genetic changes interact with sex to generate common phenotypes underlying autism. Our data further support the idea that autism risk genes may produce strengths as well as challenges in behavioral function.
Collapse
Affiliation(s)
- Juliana Chase
- Department of Neuroscience, University of California, Berkeley, Berkeley, CA, 94720
| | - Jing-Jing Li
- Department of Neuroscience, University of California, Berkeley, Berkeley, CA, 94720
| | - Wan Chen Lin
- Department of Neuroscience, University of California, Berkeley, Berkeley, CA, 94720
| | - Lung-Hao Tai
- Department of Neuroscience, University of California, Berkeley, Berkeley, CA, 94720
| | - Fernanda Castro
- Current address: Cellular & Molecular Pharmacology, University of California, San Francisco, Mission Bay, CA 94143
- Department of Psychology, University of California, Berkeley, Berkeley, CA, 94720
| | - Anne GE Collins
- Department of Psychology, University of California, Berkeley, Berkeley, CA, 94720
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, 94720
| | - Linda Wilbrecht
- Department of Neuroscience, University of California, Berkeley, Berkeley, CA, 94720
- Department of Psychology, University of California, Berkeley, Berkeley, CA, 94720
| |
Collapse
|
3
|
Tang H, Bartolo R, Averbeck BB. Ventral frontostriatal circuitry mediates the computation of reinforcement from symbolic gains and losses. Neuron 2024; 112:3782-3795.e5. [PMID: 39321792 PMCID: PMC11581918 DOI: 10.1016/j.neuron.2024.08.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 07/12/2024] [Accepted: 08/28/2024] [Indexed: 09/27/2024]
Abstract
Reinforcement learning (RL), particularly in primates, is often driven by symbolic outcomes. However, it is usually studied with primary reinforcers. To examine the neural mechanisms underlying learning from symbolic outcomes, we trained monkeys on a task in which they learned to choose options that led to gains of tokens and avoid choosing options that led to losses of tokens. We then recorded simultaneously from the orbitofrontal cortex (OFC), ventral striatum (VS), amygdala (AMY), and mediodorsal thalamus (MDt). We found that the OFC played a dominant role in coding token outcomes and token prediction errors. The other areas contributed complementary functions, with the VS coding appetitive outcomes and the AMY coding the salience of outcomes. The MDt coded actions and relayed information about tokens between the OFC and VS. Thus, the OFC leads the processing of symbolic RL in the ventral frontostriatal circuitry.
Collapse
Affiliation(s)
- Hua Tang
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20814, USA.
| | - Ramon Bartolo
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20814, USA; Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bethesda, MD 20814, USA
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20814, USA.
| |
Collapse
|
4
|
Woo JH, Costa VD, Taswell CA, Rothenhoefer KM, Averbeck BB, Soltani A. Contribution of amygdala to dynamic model arbitration under uncertainty. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.13.612869. [PMID: 39314420 PMCID: PMC11419134 DOI: 10.1101/2024.09.13.612869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Intrinsic uncertainty in the reward environment requires the brain to run multiple models simultaneously to predict outcomes based on preceding cues or actions, commonly referred to as stimulus- and action-based learning. Ultimately, the brain also must adopt appropriate choice behavior using reliability of these models. Here, we combined multiple experimental and computational approaches to quantify concurrent learning in monkeys performing tasks with different levels of uncertainty about the model of the environment. By comparing behavior in control monkeys and monkeys with bilateral lesions to the amygdala or ventral striatum, we found evidence for dynamic, competitive interaction between stimulus-based and action-based learning, and for a distinct role of the amygdala. Specifically, we demonstrate that the amygdala adjusts the initial balance between the two learning systems, thereby altering the interaction between arbitration and learning that shapes the time course of both learning and choice behaviors. This novel role of the amygdala can account for existing contradictory observations and provides testable predictions for future studies into circuit-level mechanisms of flexible learning and choice under uncertainty.
Collapse
|
5
|
Stoll FM, Rudebeck PH. Preferences reveal dissociable encoding across prefrontal-limbic circuits. Neuron 2024; 112:2241-2256.e8. [PMID: 38640933 PMCID: PMC11223984 DOI: 10.1016/j.neuron.2024.03.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 12/04/2023] [Accepted: 03/19/2024] [Indexed: 04/21/2024]
Abstract
Individual preferences for the flavor of different foods and fluids exert a strong influence on behavior. Most current theories posit that preferences are integrated with other state variables in the orbitofrontal cortex (OFC), which is thought to derive the relative subjective value of available options to guide choice behavior. Here, we report that instead of a single integrated valuation system in the OFC, another complementary one is centered in the ventrolateral prefrontal cortex (vlPFC) in macaques. Specifically, we found that the OFC and vlPFC preferentially represent outcome flavor and outcome probability, respectively, and that preferences are separately integrated into value representations in these areas. In addition, the vlPFC, but not the OFC, represented the probability of receiving the available outcome flavors separately, with the difference between these representations reflecting the degree of preference for each flavor. Thus, both the vlPFC and OFC exhibit dissociable but complementary representations of subjective value, both of which are necessary for decision-making.
Collapse
Affiliation(s)
- Frederic M Stoll
- Nash Family Department of Neuroscience, Lipschultz Center for Cognitive Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Peter H Rudebeck
- Nash Family Department of Neuroscience, Lipschultz Center for Cognitive Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| |
Collapse
|
6
|
Kobayashi K, Kable JW. Neural mechanisms of information seeking. Neuron 2024; 112:1741-1756. [PMID: 38703774 DOI: 10.1016/j.neuron.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 01/30/2024] [Accepted: 04/08/2024] [Indexed: 05/06/2024]
Abstract
We ubiquitously seek information to make better decisions. Particularly in the modern age, when more information is available at our fingertips than ever, the information we choose to collect determines the quality of our decisions. Decision neuroscience has long adopted empirical approaches where the information available to decision-makers is fully controlled by the researchers, leaving neural mechanisms of information seeking less understood. Although information seeking has long been studied in the context of the exploration-exploitation trade-off, recent studies have widened the scope to investigate more overt information seeking in a way distinct from other decision processes. Insights gained from these studies, accumulated over the last few years, raise the possibility that information seeking is driven by the reward system signaling the subjective value of information. In this piece, we review findings from the recent studies, highlighting the conceptual and empirical relationships between distinct literatures, and discuss future research directions necessary to establish a more comprehensive understanding of how individuals seek information as a part of value-based decision-making.
Collapse
Affiliation(s)
- Kenji Kobayashi
- Department of Psychology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Joseph W Kable
- Department of Psychology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
7
|
Tang H, Bartolo-Orozco R, Averbeck BB. Ventral frontostriatal circuitry mediates the computation of reinforcement from symbolic gains and losses. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.03.587097. [PMID: 38617219 PMCID: PMC11014508 DOI: 10.1101/2024.04.03.587097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Reinforcement learning (RL), particularly in primates, is often driven by symbolic outcomes. However, it is usually studied with primary reinforcers. To examine the neural mechanisms underlying learning from symbolic outcomes, we trained monkeys on a task in which they learned to choose options that led to gains of tokens and avoid choosing options that led to losses of tokens. We then recorded simultaneously from the orbitofrontal cortex (OFC), ventral striatum (VS), amygdala (AMY), and the mediodorsal thalamus (MDt). We found that the OFC played a dominant role in coding token outcomes and token prediction errors. The other areas contributed complementary functions with the VS coding appetitive outcomes and the AMY coding the salience of outcomes. The MDt coded actions and relayed information about tokens between the OFC and VS. Thus, OFC leads the process of symbolic reinforcement learning in the ventral frontostriatal circuitry.
Collapse
|
8
|
Pereira-Obilinovic U, Hou H, Svoboda K, Wang XJ. Brain mechanism of foraging: Reward-dependent synaptic plasticity versus neural integration of values. Proc Natl Acad Sci U S A 2024; 121:e2318521121. [PMID: 38551832 PMCID: PMC10998608 DOI: 10.1073/pnas.2318521121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 01/16/2024] [Indexed: 04/02/2024] Open
Abstract
During foraging behavior, action values are persistently encoded in neural activity and updated depending on the history of choice outcomes. What is the neural mechanism for action value maintenance and updating? Here, we explore two contrasting network models: synaptic learning of action value versus neural integration. We show that both models can reproduce extant experimental data, but they yield distinct predictions about the underlying biological neural circuits. In particular, the neural integrator model but not the synaptic model requires that reward signals are mediated by neural pools selective for action alternatives and their projections are aligned with linear attractor axes in the valuation system. We demonstrate experimentally observable neural dynamical signatures and feasible perturbations to differentiate the two contrasting scenarios, suggesting that the synaptic model is a more robust candidate mechanism. Overall, this work provides a modeling framework to guide future experimental research on probabilistic foraging.
Collapse
Affiliation(s)
- Ulises Pereira-Obilinovic
- Center for Neural Science, New York University, New York, NY10003
- Allen Institute for Neural Dynamics, Seattle, WA98109
| | - Han Hou
- Allen Institute for Neural Dynamics, Seattle, WA98109
| | - Karel Svoboda
- Allen Institute for Neural Dynamics, Seattle, WA98109
| | - Xiao-Jing Wang
- Center for Neural Science, New York University, New York, NY10003
| |
Collapse
|
9
|
Nougaret S, Ferrucci L, Ceccarelli F, Sacchetti S, Benozzo D, Fascianelli V, Saunders RC, Renaud L, Genovesio A. Neurons in the monkey frontopolar cortex encode learning stage and goal during a fast learning task. PLoS Biol 2024; 22:e3002500. [PMID: 38363801 PMCID: PMC10903959 DOI: 10.1371/journal.pbio.3002500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 02/29/2024] [Accepted: 01/17/2024] [Indexed: 02/18/2024] Open
Abstract
The frontopolar cortex (FPC) is, to date, one of the least understood regions of the prefrontal cortex. The current understanding of its function suggests that it plays a role in the control of exploratory behaviors by coordinating the activities of other prefrontal cortex areas involved in decision-making and exploiting actions based on their outcomes. Based on this hypothesis, FPC would drive fast-learning processes through a valuation of the different alternatives. In our study, we used a modified version of a well-known paradigm, the object-in-place (OIP) task, to test this hypothesis in electrophysiology. This paradigm is designed to maximize learning, enabling monkeys to learn in one trial, which is an ability specifically impaired after a lesion of the FPC. We showed that FPC neurons presented an extremely specific pattern of activity by representing the learning stage, exploration versus exploitation, and the goal of the action. However, our results do not support the hypothesis that neurons in the frontal pole compute an evaluation of different alternatives. Indeed, the position of the chosen target was strongly encoded at its acquisition, but the position of the unchosen target was not. Once learned, this representation was also found at the problem presentation, suggesting a monitoring activity of the synthetic goal preceding its acquisition. Our results highlight important features of FPC neurons in fast-learning processes without confirming their role in the disengagement of cognitive control from the current goals.
Collapse
Affiliation(s)
- Simon Nougaret
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
| | - Lorenzo Ferrucci
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
| | - Francesco Ceccarelli
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
- PhD program in Behavioral Neuroscience, Sapienza University of Rome, Rome, Italy
| | - Stefano Sacchetti
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
| | - Danilo Benozzo
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
| | - Valeria Fascianelli
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
| | - Richard C. Saunders
- Laboratory of Neuropsychology, National Institute of Mental Health, Bethesda, Maryland, United States of America
| | - Luc Renaud
- Institut de Neurosciences de la Timone, UMR7289, Centre National de la Recherche Scientifique and Aix-Marseille Université, Marseille, France
| | - Aldo Genovesio
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
| |
Collapse
|
10
|
Cisler JM, Dunsmoor JE, Fonzo GA, Nemeroff CB. Latent-state and model-based learning in PTSD. Trends Neurosci 2024; 47:150-162. [PMID: 38212163 PMCID: PMC10923154 DOI: 10.1016/j.tins.2023.12.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 12/18/2023] [Accepted: 12/18/2023] [Indexed: 01/13/2024]
Abstract
Post-traumatic stress disorder (PTSD) is characterized by altered emotional and behavioral responding following a traumatic event. In this article, we review the concepts of latent-state and model-based learning (i.e., learning and inferring abstract task representations) and discuss their relevance for clinical and neuroscience models of PTSD. Recent data demonstrate evidence for brain and behavioral biases in these learning processes in PTSD. These new data potentially recast excessive fear towards trauma cues as a problem in learning and updating abstract task representations, as opposed to traditional conceptualizations focused on stimulus-specific learning. Biases in latent-state and model-based learning may also be a common mechanism targeted in common therapies for PTSD. We highlight key knowledge gaps that need to be addressed to further elaborate how latent-state learning and its associated neurocircuitry mechanisms function in PTSD and how to optimize treatments to target these processes.
Collapse
Affiliation(s)
- Josh M Cisler
- Department of Psychiatry and Behavioral Sciences, University of Texas at Austin, Austin, TX, USA; Institute for Early Life Adversity Research, University of Texas at Austin, Austin, TX, USA.
| | - Joseph E Dunsmoor
- Department of Psychiatry and Behavioral Sciences, University of Texas at Austin, Austin, TX, USA; Institute for Early Life Adversity Research, University of Texas at Austin, Austin, TX, USA
| | - Gregory A Fonzo
- Department of Psychiatry and Behavioral Sciences, University of Texas at Austin, Austin, TX, USA; Institute for Early Life Adversity Research, University of Texas at Austin, Austin, TX, USA
| | - Charles B Nemeroff
- Department of Psychiatry and Behavioral Sciences, University of Texas at Austin, Austin, TX, USA; Institute for Early Life Adversity Research, University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
11
|
Giarrocco F, Costa VD, Basile BM, Pujara MS, Murray EA, Averbeck BB. Motor System-Dependent Effects of Amygdala and Ventral Striatum Lesions on Explore-Exploit Behaviors. J Neurosci 2024; 44:e1206232023. [PMID: 38296647 PMCID: PMC10860650 DOI: 10.1523/jneurosci.1206-23.2023] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/17/2023] [Accepted: 11/21/2023] [Indexed: 02/02/2024] Open
Abstract
Deciding whether to forego immediate rewards or explore new opportunities is a key component of flexible behavior and is critical for the survival of the species. Although previous studies have shown that different cortical and subcortical areas, including the amygdala and ventral striatum (VS), are implicated in representing the immediate (exploitative) and future (explorative) value of choices, the effect of the motor system used to make choices has not been examined. Here, we tested male rhesus macaques with amygdala or VS lesions on two versions of a three-arm bandit task where choices were registered with either a saccade or an arm movement. In both tasks we presented the monkeys with explore-exploit tradeoffs by periodically replacing familiar options with novel options that had unknown reward probabilities. We found that monkeys explored more with saccades but showed better learning with arm movements. VS lesions caused the monkeys to be more explorative with arm movements and less explorative with saccades, although this may have been due to an overall decrease in performance. VS lesions affected the monkeys' ability to learn novel stimulus-reward associations in both tasks, while after amygdala lesions this effect was stronger when choices were made with saccades. Further, on average, VS and amygdala lesions reduced the monkeys' ability to choose better options only when choices were made with a saccade. These results show that learning reward value associations to manage explore-exploit behaviors is motor system dependent and they further define the contributions of amygdala and VS to reinforcement learning.
Collapse
Affiliation(s)
- Franco Giarrocco
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| | - Vincent D Costa
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
- Division of Neuroscience, Oregon National Primate Research Center, Beaverton 97006, OR
| | - Benjamin M Basile
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
- Department of Psychology, Dickinson College, Carlisle 17013, PA
| | - Maia S Pujara
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| | - Elisabeth A Murray
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| |
Collapse
|
12
|
Campbell EM, Singh G, Claus ED, Witkiewitz K, Costa VD, Hogeveen J, Cavanagh JF. Electrophysiological Markers of Aberrant Cue-Specific Exploration in Hazardous Drinkers. COMPUTATIONAL PSYCHIATRY (CAMBRIDGE, MASS.) 2023; 7:47-59. [PMID: 38774639 PMCID: PMC11104413 DOI: 10.5334/cpsy.96] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 06/28/2023] [Indexed: 05/24/2024]
Abstract
Background Hazardous drinking is associated with maladaptive alcohol-related decision-making. Existing studies have often focused on how participants learn to exploit familiar cues based on prior reinforcement, but little is known about the mechanisms that drive hazardous drinkers to explore novel alcohol cues when their value is not known. Methods We investigated exploration of novel alcohol and non-alcohol cues in hazardous drinkers (N = 27) and control participants (N = 26) during electroencephalography (EEG). A normative computational model with two free parameters was fit to estimate participants' weighting of the future value of exploration and immediate value of exploitation. Results Hazardous drinkers demonstrated increased exploration of novel alcohol cues, and conversely, increased probability of exploiting familiar alternatives instead of exploring novel non-alcohol cues. The motivation to explore novel alcohol stimuli in hazardous drinkers was driven by an elevated relative future valuation of uncertain alcohol cues. P3a predicted more exploratory decision policies driven by an enhanced relative future valuation of novel alcohol cues. P3b did not predict choice behavior, but computational parameter estimates suggested that hazardous drinkers with enhanced P3b to alcohol cues were likely to learn to exploit their immediate expected value. Conclusions Hazardous drinkers did not display atypical choice behavior, different P3a/P3b amplitudes, or computational estimates to novel non-alcohol cues-diverging from previous studies in addiction showing atypical generalized explore-exploit decisions with non-drug-related cues. These findings reveal that cue-specific neural computations may drive aberrant alcohol-related decision-making in hazardous drinkers-highlighting the importance of drug-relevant cues in studies of decision-making in addiction.
Collapse
Affiliation(s)
- Ethan M. Campbell
- Department of Psychology & Psychology Clinical Neuroscience Center, University of New Mexico, US
| | - Garima Singh
- Department of Psychology & Psychology Clinical Neuroscience Center, University of New Mexico, US
| | - Eric D. Claus
- Department of Biobehavioral Health, Pennsylvania State University, US
| | - Katie Witkiewitz
- Department of Psychology & Psychology Clinical Neuroscience Center, University of New Mexico, US
| | - Vincent D. Costa
- Division of Neuroscience, Oregon National Primate Research Center, US
| | - Jeremy Hogeveen
- Department of Psychology & Psychology Clinical Neuroscience Center, University of New Mexico, US
| | - James F. Cavanagh
- Department of Psychology & Psychology Clinical Neuroscience Center, University of New Mexico, US
| |
Collapse
|
13
|
McNally GP, Jean-Richard-Dit-Bressel P, Millan EZ, Lawrence AJ. Pathways to the persistence of drug use despite its adverse consequences. Mol Psychiatry 2023; 28:2228-2237. [PMID: 36997610 PMCID: PMC10611585 DOI: 10.1038/s41380-023-02040-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 03/10/2023] [Accepted: 03/15/2023] [Indexed: 04/01/2023]
Abstract
The persistence of drug taking despite its adverse consequences plays a central role in the presentation, diagnosis, and impacts of addiction. Eventual recognition and appraisal of these adverse consequences is central to decisions to reduce or cease use. However, the most appropriate ways of conceptualizing persistence in the face of adverse consequences remain unclear. Here we review evidence that there are at least three pathways to persistent use despite the negative consequences of that use. A cognitive pathway for recognition of adverse consequences, a motivational pathway for valuation of these consequences, and a behavioral pathway for responding to these adverse consequences. These pathways are dynamic, not linear, with multiple possible trajectories between them, and each is sufficient to produce persistence. We describe these pathways, their characteristics, brain cellular and circuit substrates, and we highlight their relevance to different pathways to self- and treatment-guided behavior change.
Collapse
Affiliation(s)
- Gavan P McNally
- School of Psychology, UNSW Sydney, Sydney, NSW, 2052, Australia.
| | | | - E Zayra Millan
- School of Psychology, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Andrew J Lawrence
- Florey Institute of Neuroscience and Mental Health, Parkville, VIC, 3010, Australia
- Florey Department of Neuroscience and Mental Health, University of Melbourne, Melbourne, VIC, 3010, Australia
| |
Collapse
|
14
|
Hogeveen J, Medalla M, Ainsworth M, Galeazzi JM, Hanlon CA, Mansouri FA, Costa VD. What Does the Frontopolar Cortex Contribute to Goal-Directed Cognition and Action? J Neurosci 2022; 42:8508-8513. [PMID: 36351824 PMCID: PMC9665930 DOI: 10.1523/jneurosci.1143-22.2022] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 08/30/2022] [Accepted: 08/31/2022] [Indexed: 11/17/2022] Open
Abstract
Understanding the unique functions of different subregions of primate prefrontal cortex has been a longstanding goal in cognitive neuroscience. Yet, the anatomy and function of one of its largest subregions (the frontopolar cortex) remain enigmatic and underspecified. Our Society for Neuroscience minisymposium Primate Frontopolar Cortex: From Circuits to Complex Behaviors will comprise a range of new anatomic and functional approaches that have helped to clarify the basic circuit anatomy of the frontal pole, its functional involvement during performance of cognitively demanding behavioral paradigms in monkeys and humans, and its clinical potential as a target for noninvasive brain stimulation in patients with brain disorders. This review consolidates knowledge about the anatomy and connectivity of frontopolar cortex and provides an integrative summary of its function in primates. We aim to answer the question: what, if anything, does frontopolar cortex contribute to goal-directed cognition and action?
Collapse
Affiliation(s)
- Jeremy Hogeveen
- Department of Psychology & Psychology Clinical Neuroscience Center, University of New Mexico, Albuquerque, NM 87131
| | - Maria Medalla
- Department of Anatomy & Neurobiology, Boston University, Boston, MA 02118
| | - Matthew Ainsworth
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom, OX2 6GG
| | - Juan M Galeazzi
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom, OX2 6GG
| | - Colleen A Hanlon
- Department of Cancer Biology
- Department of Physiology and Pharmacology, Wake Forest School of Medicine, Winston-Salem, NC 27101
| | - Farshad Alizadeh Mansouri
- Department of Physiology, Monash Biomedicine Discovery Institute, Clayton Victoria, 3800, Australia
- ARC Centre for Integrative Brain Function, Monash University, Clayton Victoria, 3800, Australia
| | - Vincent D Costa
- Division of Neuroscience, Oregon National Primate Research Center, Beaverton, OR 97006
| |
Collapse
|
15
|
Garfield JBB, Piccoli LR, Whelan D, Staiger PK, Reynolds J, Piercy H, Lubman DI, Verdejo-Garcia A, Manning V. The effect of approach bias modification during alcohol withdrawal treatment on craving, and its relationship to post-treatment alcohol use in a randomised controlled trial. Drug Alcohol Depend 2022; 239:109621. [PMID: 36087564 DOI: 10.1016/j.drugalcdep.2022.109621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 08/25/2022] [Accepted: 08/31/2022] [Indexed: 12/15/2022]
Abstract
BACKGROUND Approach bias modification (ApBM) for alcohol use disorder helps prevent relapse, yet the psychological mechanisms underlying its efficacy remain unclear. Alcohol craving predicts relapse and appears to be related to the biased processing of alcohol stimuli which is reduced by ApBM. However, there is little research examining whether ApBM reduces alcohol craving. METHODS In a randomised controlled trial testing the effect of 4 ApBM sessions (vs. sham training) on post-treatment alcohol use in 300 alcohol withdrawal inpatients, we administered the Alcohol Craving Questionnaire - Short Form - Revised (ACQ-SF-R) pre and post-training and at 2-week, 3, 6 and 12-month follow ups; and a cue-induced craving measure pre and post training. RESULTS Groups did not significantly differ in terms of declines in ACQ-SF-R total scores (p = .712) or cue-induced craving (p = .841) between the first and last training session, nor in terms of ACQ-SF-R scores at follow-ups (p = .509). However, the ACQ-SF-R Expectancy subscale, which assesses craving based on anticipated positive reinforcement from alcohol, was significantly lower in the ApBM group than in controls following training (p = .030), although the group x time interaction for this subscale was non-significant (p = .062). Post-intervention Expectancy scores mediated only a small portion of ApBM's effect on post-discharge alcohol use (14% in intention-to-treat analysis, p = .046; 15% in per-protocol analysis, p = .020). CONCLUSIONS ApBM does not appear to have robust, sustained effects on alcohol craving. Reduced craving is unlikely to account for ApBM's relapse prevention effects. However, further research on whether ApBM's effects are related to devaluation of alcohol reward expectancy is warranted. TRIAL REGISTRATION Australian New Zealand Clinical Trials Registry Identifier: ACTRN12617001241325.
Collapse
Affiliation(s)
- Joshua B B Garfield
- Monash Addiction Research Centre, Eastern Health Clinical School, Monash University, Melbourne, Australia; Turning Point, Eastern Health, Melbourne, Australia.
| | - Lara R Piccoli
- Turner Institute for Brain and Mental Health and School of Psychological Sciences, Monash University, Melbourne, Australia.
| | - Danielle Whelan
- Monash Addiction Research Centre, Eastern Health Clinical School, Monash University, Melbourne, Australia; Turning Point, Eastern Health, Melbourne, Australia.
| | - Petra K Staiger
- School of Psychology, Deakin University, Geelong, Australia; Centre for Drug Use, Addictive and Antisocial Behaviour Research, Deakin University, Geelong, Australia.
| | - John Reynolds
- Alfred Health and Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia.
| | - Hugh Piercy
- Monash Addiction Research Centre, Eastern Health Clinical School, Monash University, Melbourne, Australia; Turning Point, Eastern Health, Melbourne, Australia.
| | - Dan I Lubman
- Monash Addiction Research Centre, Eastern Health Clinical School, Monash University, Melbourne, Australia; Turning Point, Eastern Health, Melbourne, Australia.
| | - Antonio Verdejo-Garcia
- Monash Addiction Research Centre, Eastern Health Clinical School, Monash University, Melbourne, Australia; Turner Institute for Brain and Mental Health and School of Psychological Sciences, Monash University, Melbourne, Australia.
| | - Victoria Manning
- Monash Addiction Research Centre, Eastern Health Clinical School, Monash University, Melbourne, Australia; Turning Point, Eastern Health, Melbourne, Australia.
| |
Collapse
|
16
|
Janssen M, LeWarne C, Burk D, Averbeck BB. Hierarchical Reinforcement Learning, Sequential Behavior, and the Dorsal Frontostriatal System. J Cogn Neurosci 2022; 34:1307-1325. [PMID: 35579977 PMCID: PMC9274316 DOI: 10.1162/jocn_a_01869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
To effectively behave within ever-changing environments, biological agents must learn and act at varying hierarchical levels such that a complex task may be broken down into more tractable subtasks. Hierarchical reinforcement learning (HRL) is a computational framework that provides an understanding of this process by combining sequential actions into one temporally extended unit called an option. However, there are still open questions within the HRL framework, including how options are formed and how HRL mechanisms might be realized within the brain. In this review, we propose that the existing human motor sequence literature can aid in understanding both of these questions. We give specific emphasis to visuomotor sequence learning tasks such as the discrete sequence production task and the M × N (M steps × N sets) task to understand how hierarchical learning and behavior manifest across sequential action tasks as well as how the dorsal cortical-subcortical circuitry could support this kind of behavior. This review highlights how motor chunks within a motor sequence can function as HRL options. Furthermore, we aim to merge findings from motor sequence literature with reinforcement learning perspectives to inform experimental design in each respective subfield.
Collapse
Affiliation(s)
| | | | - Diana Burk
- National Institute of Mental Health, Bethesda, MD
| | | |
Collapse
|
17
|
Abstract
Ancestors of macaques and humans separated into distinct lineages 25 million years ago. Despite this long separation, Hogeveen et al. (2022) show, in this issue of Neuron, that they mediate the explore-exploit tradeoff, which must be managed by any agent adapting to a dynamic environment, using similar computational and neural mechanisms.
Collapse
|
18
|
Hogeveen J, Mullins TS, Romero JD, Eversole E, Rogge-Obando K, Mayer AR, Costa VD. The neurocomputational bases of explore-exploit decision-making. Neuron 2022; 110:1869-1879.e5. [PMID: 35390278 PMCID: PMC9167768 DOI: 10.1016/j.neuron.2022.03.014] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 12/11/2021] [Accepted: 03/10/2022] [Indexed: 02/04/2023]
Abstract
Flexible decision-making requires animals to forego immediate rewards (exploitation) and try novel choice options (exploration) to discover if they are preferable to familiar alternatives. Using the same task and a partially observable Markov decision process (POMDP) model to quantify the value of choices, we first determined that the computational basis for managing explore-exploit tradeoffs is conserved across monkeys and humans. We then used fMRI to identify where in the human brain the immediate value of exploitative choices and relative uncertainty about the value of exploratory choices were encoded. Consistent with prior neurophysiological evidence in monkeys, we observed divergent encoding of reward value and uncertainty in prefrontal and parietal regions, including frontopolar cortex, and parallel encoding of these computations in motivational regions including the amygdala, ventral striatum, and orbitofrontal cortex. These results clarify the interplay between prefrontal and motivational circuits that supports adaptive explore-exploit decisions in humans and nonhuman primates.
Collapse
Affiliation(s)
- Jeremy Hogeveen
- Department of Psychology, University of New Mexico, Albuquerque, NM 87131, USA; Psychology Clinical Neuroscience Center, University of New Mexico, Albuquerque, NM 87131, USA.
| | - Teagan S Mullins
- Department of Psychology, University of New Mexico, Albuquerque, NM 87131, USA; Psychology Clinical Neuroscience Center, University of New Mexico, Albuquerque, NM 87131, USA
| | - John D Romero
- Department of Psychology, University of New Mexico, Albuquerque, NM 87131, USA; Psychology Clinical Neuroscience Center, University of New Mexico, Albuquerque, NM 87131, USA
| | - Elizabeth Eversole
- Department of Psychology, University of New Mexico, Albuquerque, NM 87131, USA; Psychology Clinical Neuroscience Center, University of New Mexico, Albuquerque, NM 87131, USA
| | - Kimberly Rogge-Obando
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37235, USA
| | - Andrew R Mayer
- Department of Psychology, University of New Mexico, Albuquerque, NM 87131, USA; Department of Psychiatry & Behavioral Sciences, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA; Department of Neurology, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA; The Mind Research Network/Lovelace Biomedical Research Institute, Pete & Nancy Domenici Hall, Albuquerque, NM 87106, USA
| | - Vincent D Costa
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR 97239, USA; Division of Neuroscience, Oregon National Primate Research Center, Beaverton, OR 97006, USA.
| |
Collapse
|