1
|
Rios A, Fujita K, Isomura Y, Sato N. Adaptive circuits for action and value information in rodent operant learning. Neurosci Res 2025; 214:62-68. [PMID: 39341460 DOI: 10.1016/j.neures.2024.09.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Revised: 09/18/2024] [Accepted: 09/19/2024] [Indexed: 10/01/2024]
Abstract
Operant learning is a behavioral paradigm where animals learn to associate their actions with consequences, adapting their behavior accordingly. This review delves into the neural circuits that underpin operant learning in rodents, emphasizing the dynamic interplay between neural pathways, synaptic plasticity, and gene expression changes. We explore the cortico-basal ganglia circuits, highlighting the pivotal role of dopamine in modulating these pathways to reinforce behaviors that yield positive outcomes. We include insights from recent studies, which reveals the intricate roles of midbrain dopamine neurons in integrating action initiation and reward feedback, thereby enhancing movement-related activities in the dorsal striatum. Additionally, we discuss the molecular diversity of striatal neurons and their specific roles in reinforcement learning. The review also covers advances in transcriptome analysis techniques, such as single-cell RNA sequencing, which have provided deeper insights into the gene expression profiles associated with different neuronal populations during operant learning.
Collapse
Affiliation(s)
- Alain Rios
- Department of Physiology and Cell Biology, Tokyo Medical and Dental University (TMDU), Japan.
| | - Kyohei Fujita
- Department of Physiology and Cell Biology, Tokyo Medical and Dental University (TMDU), Japan
| | - Yoshikazu Isomura
- Department of Physiology and Cell Biology, Tokyo Medical and Dental University (TMDU), Japan.
| | - Nobuya Sato
- Department of Psychological Sciences Kwansei Gakuin University, Japan.
| |
Collapse
|
2
|
Faust TW, Mohebi A, Berke JD. Reward expectation and receipt differentially modulate the spiking of accumbens D1+ and D2+ neurons. Curr Biol 2025; 35:1285-1297.e3. [PMID: 40020662 PMCID: PMC11968066 DOI: 10.1016/j.cub.2025.02.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 10/21/2024] [Accepted: 02/04/2025] [Indexed: 03/03/2025]
Abstract
The nucleus accumbens (NAc) helps govern motivation to pursue reward. Two distinct sets of NAc projection neurons-expressing dopamine D1 vs. D2 receptors-are thought to promote and suppress motivated behaviors, respectively. However, support for this conceptual framework is limited: in particular, the spiking patterns of these distinct cell types during motivated behavior have been largely unknown. Using optogenetic tagging, we recorded the spiking of identified D1+ and D2+ neurons in the NAc core as unrestrained rats performed an operant task in which motivation to initiate work tracks recent reward rate. D1+ neurons preferentially increased firing as rats initiated trials and fired more when reward expectation was higher. By contrast, D2+ cells preferentially increased firing later in the trial, especially in response to reward delivery-a finding not anticipated from current theoretical models. Our results provide new evidence for the specific contribution of NAc D1+ cells to self-initiated approach behavior and will spur updated models of how D2+ cells contribute to learning.
Collapse
Affiliation(s)
- T W Faust
- Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - A Mohebi
- Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - J D Berke
- Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Neuroscience Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, San Francisco, CA 94158, USA; Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
3
|
Zhu T, Areshenkoff CN, De Brouwer AJ, Nashed JY, Flanagan JR, Gallivan JP. Contractions in human cerebellar-cortical manifold structure underlie motor reinforcement learning. J Neurosci 2025; 45:e2158242025. [PMID: 40101964 PMCID: PMC12044045 DOI: 10.1523/jneurosci.2158-24.2025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Revised: 02/12/2025] [Accepted: 03/06/2025] [Indexed: 03/20/2025] Open
Abstract
How the brain learns new motor commands through reinforcement involves distributed neural circuits beyond known frontal-striatal pathways, yet a comprehensive understanding of this broader neural architecture remains elusive. Here, using human functional MRI (N = 46, 27 females) and manifold learning techniques, we identified a low-dimensional neural space that captured the dynamic changes in whole-brain functional organization during a reward-based trajectory learning task. By quantifying participants' learning rates through an Actor-Critic model, we discovered that periods of accelerated learning were characterized by significant manifold contractions across multiple brain regions, including areas of limbic and hippocampal cortex, as well as the cerebellum. This contraction reflected enhanced network integration, with notably stronger connectivity between several of these regions and the sensorimotor cerebellum correlating with higher learning rates. These findings challenge the traditional view of the cerebellum as solely involved in error-based learning, supporting the emerging view that it coordinates with other brain regions during reinforcement learning.Significance Statement This study reveals how distributed brain systems, including the cerebellum and hippocampus, alter their functional connectivity to support motor learning through reinforcement. Using advanced manifold learning techniques on functional MRI data, we examined changes in regional connectivity during reward-based learning and their relationship to learning rate. For several brain regions, we found that periods of heightened learning were associated with increased cerebellar connectivity, suggesting a key role for the cerebellum in reward-based motor learning. These findings challenge the traditional view of the cerebellum as solely involved in supervised (error-based) learning and add to a growing rodent literature supporting a role for cerebellar circuits in reward-driven learning.
Collapse
Affiliation(s)
- Tianyao Zhu
- Center for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada.
| | - Corson N Areshenkoff
- Center for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada
- Department of Psychology, Queen's University, Kingston, Ontario, Canada
| | - Anouk J De Brouwer
- Center for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada
| | - Joseph Y Nashed
- Center for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada
| | - J Randall Flanagan
- Center for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada
- Department of Psychology, Queen's University, Kingston, Ontario, Canada
| | - Jason P Gallivan
- Center for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada.
- Department of Psychology, Queen's University, Kingston, Ontario, Canada
- Department of Biomedical and Molecular Sciences, Queen's University, Kingston, Ontario, Canada
| |
Collapse
|
4
|
Kim CM, Chow CC, Averbeck BB. Neural dynamics of reversal learning in the prefrontal cortex and recurrent neural networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.09.14.613033. [PMID: 39372802 PMCID: PMC11451584 DOI: 10.1101/2024.09.14.613033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/08/2024]
Abstract
In probabilistic reversal learning, the choice option yielding reward with higher probability switches at a random trial. To perform optimally in this task, one has to accumulate evidence across trials to infer the probability that a reversal has occurred. We investigated how this reversal probability is represented in cortical neurons by analyzing the neural activity in the prefrontal cortex of monkeys and recurrent neural networks trained on the task. We found that in a neural subspace encoding reversal probability, its activity represented integration of reward outcomes as in a line attractor model. The reversal probability activity at the start of a trial was stationary, stable and consistent with the attractor dynamics. However, during the trial, the activity was associated with task-related behavior and became non-stationary, thus deviating from the line attractor. Fitting a predictive model to neural data showed that the stationary state at the trial start serves as an initial condition for launching the non-stationary activity. This suggested an extension of the line attractor model with behavior-induced non-stationary dynamics. The non-stationary trajectories were separable indicating that they can represent distinct probabilistic values. Perturbing the reversal probability activity in the recurrent neural networks biased choice outcomes demonstrating its functional significance. In sum, our results show that cortical networks encode reversal probability in stable stationary state at the start of a trial and utilize it to initiate non-stationary dynamics that accommodates task-related behavior while maintaining the reversal information.
Collapse
|
5
|
Halbe E, Jamieson A, Bergmann M, Mehren A, Harrison BJ, Davey CG, Stöcker T, Philipsen A, Lux S. Neural Correlates and Sex-Specific Effects of Affectively Driven Processes Underlying Decision-Making in Adult ADHD. Brain Behav 2025; 15:e70215. [PMID: 40021563 PMCID: PMC11870793 DOI: 10.1002/brb3.70215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 11/26/2024] [Accepted: 12/07/2024] [Indexed: 03/03/2025] Open
Abstract
AIMS Adults with attention-deficit/hyperactivity disorder (ADHD) often exhibit heightened risk-taking behavior due to disadvantageous decision-making. This study investigates the influence of preceding unconscious and affectively driven processes on this behavior, with a focus on sex-specific effects. METHODS Functional magnetic resonance imaging was used to examine neural activity during the anticipation phase of decision-making in 18 individuals with ADHD (10 females and 8 males) and 20 healthy controls (10 females and 10 males) using a modified version of the Balloon Analogue Risk Task. RESULTS During the anticipation of decision-making, individuals with ADHD exhibited reduced activation in the right precuneus and the right superior frontal gyrus compared to healthy controls. Sex-specific effects were exclusively observed within the ADHD group, showing increased neural activity in females compared to males in areas including the dorsolateral prefrontal cortex, left insula, right caudate, right cuneus, and precuneus. CONCLUSION These findings indicate altered neural activity in adult patients with ADHD with sex-specific differences during the anticipation of a risky decision. The study underscores the importance of the right precuneus and superior frontal gyrus in relation to metacognitive functioning and interoceptive awareness. However, further research is needed to explore the interplay of unconscious processes during decision-making in ADHD.
Collapse
Affiliation(s)
- Eva Halbe
- Department of Psychiatry and PsychotherapyUniversity of BonnBonnGermany
| | - Alec Jamieson
- Department of PsychiatryThe University of MelbourneMelbourneAustralia
| | - Moritz Bergmann
- Department of Psychiatry and PsychotherapyUniversity of BonnBonnGermany
| | - Aylin Mehren
- Department of Psychiatry and PsychotherapyUniversity of BonnBonnGermany
- Department of Cognitive Neuroscience, Donders Institute for Brain, Cognition and BehaviorRadboud University Nijmegen Medical CentreNijmegenThe Netherlands
| | - Ben J. Harrison
- Department of PsychiatryThe University of MelbourneMelbourneAustralia
| | | | - Tony Stöcker
- German Center for Neurodegenerative Diseases (DZNE)BonnGermany
| | | | - Silke Lux
- Department of Psychiatry and PsychotherapyUniversity of BonnBonnGermany
| |
Collapse
|
6
|
Calangiu I, Kollmorgen S, Reppas J, Mante V. Prospective and retrospective representations of saccadic movements in primate prefrontal cortex. Cell Rep 2025; 44:115289. [PMID: 39946232 DOI: 10.1016/j.celrep.2025.115289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 11/24/2024] [Accepted: 01/17/2025] [Indexed: 02/28/2025] Open
Abstract
The dorso-lateral prefrontal cortex (dlPFC) contributes to flexible, goal-directed behaviors. However, a coherent picture of dlPFC function is lacking, as its activity is often studied only in relation to a few events within a fully learned behavioral task. Here we obtain a comprehensive description of dlPFC activity across different task epochs, saccade types, tasks, and learning stages. We consistently observe the strongest modulation of neural activity in relation to a retrospective representation of the most recent saccade. Prospective, planning-like activity is limited to task-related, delayed saccades directly eligible for a reward. The link between prospective and retrospective representations is highly structured, potentially reflecting a hard-wired feature of saccade responses. Only prospective representations are modulated by the recent behavioral history, but neither representation is modulated by day-to-day behavioral improvements. The dlPFC thus combines tightly linked flexible and rigid representations with a dominant contribution from retrospective signals maintaining the memory of past actions.
Collapse
Affiliation(s)
- Ioana Calangiu
- Institute of Neuroinformatics and Neuroscience Center Zurich, University of Zurich and ETH Zurich, Zurich, Switzerland.
| | - Sepp Kollmorgen
- Institute of Neuroinformatics and Neuroscience Center Zurich, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - John Reppas
- Department of Neurobiology and Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Valerio Mante
- Institute of Neuroinformatics and Neuroscience Center Zurich, University of Zurich and ETH Zurich, Zurich, Switzerland.
| |
Collapse
|
7
|
Xue R, Li J, Yang H. The hemispheric differences in prefrontal function of Internet game disorder and non-Internet game disorder: an activation likelihood estimation meta-analysis. Cereb Cortex 2025; 35:bhae493. [PMID: 39756429 DOI: 10.1093/cercor/bhae493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 12/05/2024] [Accepted: 12/12/2024] [Indexed: 01/07/2025] Open
Abstract
This study explored the differences in brain activation between individuals with and without Internet gaming disorder (IGD) through activation likelihood estimation analysis. In total, 39 studies were included based on the inclusion and exclusion criteria by searching the literature in the PubMed and Web of Science databases, as well as reading other reviews. The analysis revealed that the activated brain regions in IGD were the right inferior frontal gyrus, left cingulate gyrus, and left lentiform nucleus. In comparison, the activated brain regions in non-IGD were the left middle frontal, left inferior frontal, left anterior cingulate, left precentral, and right precentral gyri. The results of the present study on differences in activation further confirm existing theoretical hypotheses. Future studies should explore hemispheric differences in prefrontal brain function between IGD and non-IGD.
Collapse
Affiliation(s)
- Rui Xue
- Key Research Base of Humanities and Social Sciences of the Ministry of Education, Academy of Psychology and Behavior, Tianjin Normal University, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
- Faculty of Psychology, Tianjin Normal University, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
| | - Jiaqi Li
- Key Research Base of Humanities and Social Sciences of the Ministry of Education, Academy of Psychology and Behavior, Tianjin Normal University, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
- Faculty of Psychology, Tianjin Normal University, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
| | - Haibo Yang
- Key Research Base of Humanities and Social Sciences of the Ministry of Education, Academy of Psychology and Behavior, Tianjin Normal University, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
- Faculty of Psychology, Tianjin Normal University, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
- Tianjin Key Laboratory of Student Mental Health and Intelligence Assessment, No. 393 Binshuixi Road, Xiqing District, Tianjin 300387, China
| |
Collapse
|
8
|
Chung YS, van den Berg B, Roberts KC, Bagdasarov A, Woldorff MG, Gaffrey MS. Electrical brain activations in preadolescents during a probabilistic reward-learning task reflect cognitive processes and behavior strategies. Front Hum Neurosci 2025; 19:1460584. [PMID: 39949988 PMCID: PMC11821623 DOI: 10.3389/fnhum.2025.1460584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2024] [Accepted: 01/13/2025] [Indexed: 02/16/2025] Open
Abstract
Both adults and children learn through feedback to associate environmental events and choices with reward, a process known as reinforcement learning (RL). However, tasks to assess RL-related neurocognitive processes in children have been limited. This study validated a child version of the Probabilistic Reward Learning task in preadolescents (8-12 years) while recording event-related-potential (ERPs), focusing on: (1) reward-feedback sensitivity (frontal Reward-related Positivity, RewP), (2) late attention-related responses to feedback (parietal P300), and (3) attentional shifting toward favored stimuli (N2pc). Behaviorally, as expected, preadolescents could learn stimulus-reward outcome associations, but with varying performance levels. Poor learners showed greater RewP amplitudes compared to good learners. Learning strategies (i.e., Win-Lose-Stay-Shift) were reflected by feedback-elicited P300 amplitudes. Lastly, attention shifted toward to-be-chosen stimuli, as evidenced by the N2pc, but not toward more highly rewarded stimuli as in adults. These findings provide novel insights into the neural processes underlying RL in preadolescents.
Collapse
Affiliation(s)
- Yu Sun Chung
- Department of Psychology and Neuroscience, Duke University, Durham, NC, United States
- Department of Psychology, Kean University, Union, NJ, United States
| | - Berry van den Berg
- Experimental Psychology, University of Groningen, Groningen, Netherlands
| | - Kenneth C. Roberts
- Center for Cognitive Neuroscience, Departments of Psychiatry, Psychology and Neuroscience, and Neurobiology, Duke University, Durham, NC, United States
| | - Armen Bagdasarov
- Department of Psychology and Neuroscience, Duke University, Durham, NC, United States
| | - Marty G. Woldorff
- Department of Psychology and Neuroscience, Duke University, Durham, NC, United States
- Center for Cognitive Neuroscience, Departments of Psychiatry, Psychology and Neuroscience, and Neurobiology, Duke University, Durham, NC, United States
| | - Michael S. Gaffrey
- Department of Psychology and Neuroscience, Duke University, Durham, NC, United States
- Children’s Wisconsin, Milwaukee, WI, United States
- Division of Pediatric Psychology and Developmental Medicine, Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI, United States
| |
Collapse
|
9
|
Myers CE, Perskaudas R, Reddy V, Dave CV, Keilp JG, King A, Rodriguez K, Hill LS, Miller R, Interian A. Negative valuation of ambiguous feedback may predict near-term risk for suicide attempt in Veterans at high risk for suicide. Front Psychiatry 2025; 15:1492332. [PMID: 39949497 PMCID: PMC11821650 DOI: 10.3389/fpsyt.2024.1492332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Accepted: 12/20/2024] [Indexed: 02/16/2025] Open
Abstract
Background Learning from feedback - adapting behavior based on reinforcing and punishing outcomes - has been implicated in numerous psychiatric disorders, including substance misuse, post-traumatic stress disorder, and depression; an emerging literature suggests it may also play a role in suicidality. This study examined whether a feedback-based learning task with rewarding, punishing and ambiguous outcomes, followed by computational modeling, could improve near-term prospective prediction of suicide attempt in a high-risk sample. Method Veterans (N=60) at high-risk for suicide were tested on a task of reward- and punishment-based learning, at multiple sessions across a one-year period. Each session was coded according to whether the participant had (1) an actual suicide attempt (ASA); (2) another suicide-related event (OtherSE) such as suicidal behavior or suicidal ideation-related hospital admission (but not an ASA); or (3) neither (noSE) in the next 90 days. Computational modeling was used to estimate latent cognitive variables including learning rates from positive and negative outcomes, and the subjective value of ambiguous feedback. Results Optimal responding on the reward-based trials was positively associated with upcoming ASA, and remained predictive even after controlling for other standard clinical variables such as current suicidal ideation severity and prior suicide attempts. Computational modeling revealed that patients with upcoming ASA tended to view ambiguous outcomes as similar to weak punishment, while OtherSE and noSE both tended to view the ambiguous outcome as similar to weak reward. Differences in the reinforcement value of the neutral outcome remained predictive for ASA even after controlling for current suicidal ideation and prior suicide attempts. Conclusion A reinforcement learning task with ambiguous neutral outcomes may provide a useful tool to help predict near-term risk of ASA in at-risk patients. While most individuals interpret ambiguous feedback as mildly reinforcing (a "glass half full" interpretation), those with upcoming ASA tend to view it as mildly punishing (a "glass half empty" interpretation). While the current results are based on a very small sample with relatively few ASA events, and require replication in a larger sample, they provide support for the role of negative biases in feedback-based learning in the cognitive profile of suicide risk.
Collapse
Affiliation(s)
- Catherine E. Myers
- Research Service, VA New Jersey Health Care System, Department of Veterans Affairs, East Orange, NJ, United States
- Department of Pharmacology, Physiology and Neuroscience, New Jersey Medical School, Rutgers University, Newark, NJ, United States
| | - Rokas Perskaudas
- Mental Health and Behavioral Services, VA New Jersey Health Care System, Department of Veterans Affairs, Lyons, NJ, United States
- War Related Illness and Injury Study Center (WRIISC), East Orange, NJ, United States
| | - Vibha Reddy
- Research Service, VA New Jersey Health Care System, Department of Veterans Affairs, East Orange, NJ, United States
| | - Chintan V. Dave
- Research Service, VA New Jersey Health Care System, Department of Veterans Affairs, East Orange, NJ, United States
- Center for Pharmacoepidemiology and Treatment Science, Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, United States
| | - John G. Keilp
- Department of Psychiatry, Columbia University College of Physicians and Surgeons, New York State Psychiatric Institute, New York, NY, United States
| | - Arlene King
- Mental Health and Behavioral Services, VA New Jersey Health Care System, Department of Veterans Affairs, Lyons, NJ, United States
| | - Kailyn Rodriguez
- Research Service, VA New Jersey Health Care System, Department of Veterans Affairs, East Orange, NJ, United States
- Department of Psychology, Rutgers University School of Arts and Sciences, Piscataway, NJ, United States
| | - Lauren St. Hill
- Mental Health and Behavioral Services, VA New Jersey Health Care System, Department of Veterans Affairs, Lyons, NJ, United States
| | - Rachael Miller
- Mental Health and Behavioral Services, VA New Jersey Health Care System, Department of Veterans Affairs, Lyons, NJ, United States
| | - Alejandro Interian
- Mental Health and Behavioral Services, VA New Jersey Health Care System, Department of Veterans Affairs, Lyons, NJ, United States
- Department of Psychiatry, Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, United States
| |
Collapse
|
10
|
Xu T, Zhang L, Zhou F, Fu K, Gan X, Chen Z, Zhang R, Lan C, Wang L, Kendrick KM, Yao D, Becker B. Distinct neural computations scale the violation of expected reward and emotion in social transgressions. Commun Biol 2025; 8:106. [PMID: 39838081 PMCID: PMC11751440 DOI: 10.1038/s42003-025-07561-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Accepted: 01/15/2025] [Indexed: 01/23/2025] Open
Abstract
Traditional decision-making models conceptualize humans as adaptive learners utilizing the differences between expected and actual rewards (prediction errors, PEs) to maximize outcomes, but rarely consider the influence of violations of emotional expectations (emotional PEs) and how it differs from reward PEs. Here, we conducted a fMRI experiment (n = 43) using a modified Ultimatum Game to examine how reward and emotional PEs affect punishment decisions in terms of rejecting unfair offers. Our results revealed that reward relative to emotional PEs exerted a stronger prediction to punishment decisions. On the neural level, the left dorsomedial prefrontal cortex (dmPFC) was strongly activated during reward receipt whereas the emotions engaged the bilateral anterior insula. Reward and emotional PEs were also encoded differently in brain-wide multivariate patterns, with a more sensitive neural signature observed within fronto-insular circuits for reward PE. We further identified a fronto-insular network encompassing the left anterior cingulate cortex, bilateral insula, left dmPFC and inferior frontal gyrus that encoded punishment decisions. In addition, a stronger fronto-insular pattern expression under reward PE predicted more punishment decisions. These findings underscore that reward and emotional violations interact to shape decisions in complex social interactions, while the underlying neurofunctional PEs computations are distinguishable.
Collapse
Affiliation(s)
- Ting Xu
- Faculty of Psychology, Southwest University, Chongqing, China
- Key Laboratory of Cognition and Personality, Ministry of Education, Chongqing, China
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Lei Zhang
- Centre for Human Brain Health, School of Psychology, University of Birmingham, Birmingham, UK
- Institute for Mental Health, School of Psychology, University of Birmingham, Birmingham, UK
| | - Feng Zhou
- Faculty of Psychology, Southwest University, Chongqing, China
- Key Laboratory of Cognition and Personality, Ministry of Education, Chongqing, China
| | - Kun Fu
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Xianyang Gan
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhiyi Chen
- Faculty of Psychology, Southwest University, Chongqing, China
- Key Laboratory of Cognition and Personality, Ministry of Education, Chongqing, China
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China
| | - Ran Zhang
- Faculty of Psychology, Southwest University, Chongqing, China
- Key Laboratory of Cognition and Personality, Ministry of Education, Chongqing, China
| | - Chunmei Lan
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Lan Wang
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Keith M Kendrick
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Dezhong Yao
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Benjamin Becker
- Department of Psychology, The University of Hong Kong, Hong Kong, China.
- State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong, China.
| |
Collapse
|
11
|
Abdullaeva BS, Abdullaev D, Djuraeva L, Sagdullaeva DK, Kholikov A. Applications of Behavioral Economics and Neuroeconomics in Mental Health. IRANIAN JOURNAL OF PSYCHIATRY 2025; 20:93-101. [PMID: 40093521 PMCID: PMC11904746 DOI: 10.18502/ijps.v20i1.17404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 09/24/2024] [Accepted: 09/26/2024] [Indexed: 03/19/2025]
Abstract
Objective: The integration of behavioral economics and neuroeconomics into mental health offers innovative perspectives on understanding and addressing psychological disorders. This overview aims to synthesize current knowledge and explore the implications of these interdisciplinary approaches in the context of mental health. Method : In this narrative review, we summarized the current evidence regarding the applications of behavioral economics and neuroeconomics approaches in the field of mental health. Results: Behavioral economics and neuroeconomics provide valuable insights into the cognitive and emotional processes underlying mental health disorders, such as irrational decision-making, impulsivity, and self-control issues. Concepts such as loss aversion, temporal discounting, and framing effects inform the development of innovative interventions and policy initiatives. Behavioral economic interventions, including nudges, incentives, and commitment devices, show promise in promoting treatment adherence, reducing risky behaviors, and enhancing mental well-being. Neuroeconomics contributes by identifying neural markers predictive of treatment response and relapse risk, paving the way for personalized treatment approaches. Conclusion: The integration of behavioral economics and neuroeconomics into mental health research and practice holds significant potential for improving the understanding of psychological disorders and developing more effective, personalized interventions. Further research is needed to elucidate the mechanisms of action, optimize intervention strategies, and address ethical considerations associated with these approaches in mental health settings.
Collapse
Affiliation(s)
| | - Diyorjon Abdullaev
- Department of Scientific Affairs, Vice-Rector for Scientific Affairs, Urganch State Pedagogical Institute, Urgench, Uzbekistan
| | - Laylo Djuraeva
- Department of Innovation and Sciences, New Uzbekistan University, Tashkent, Uzbekistan
- The State Conservatory of Uzbekistan, Tashkent, Uzbekistan
| | - Dilfuza Karimullaevna Sagdullaeva
- Department of Uzbek Language and Classical Eastern Literature, Faculty of Classical Eastern Philology, International Islamic Academy of Uzbekistan, Tashkent, Uzbekistan
| | - Azam Kholikov
- Department of Mother Language and Teaching Methodology in Primary Education, Tashkent State Pedagogical University, Tashkent, Uzbekistan
| |
Collapse
|
12
|
Mugan U, Hoffman SL, Redish AD. Environmental complexity modulates information processing and the balance between decision-making systems. Neuron 2024; 112:4096-4114.e10. [PMID: 39476843 DOI: 10.1016/j.neuron.2024.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 08/12/2024] [Accepted: 10/03/2024] [Indexed: 12/21/2024]
Abstract
Behavior in naturalistic scenarios occurs in diverse environments. Adaptive strategies rely on multiple neural circuits and competing decision systems. However, past studies of rodent decision making have largely measured behavior in simple environments. To fill this gap, we recorded neural ensembles from hippocampus (HC), dorsolateral striatum (DLS), and dorsomedial prefrontal cortex (dmPFC) while rats foraged for food under changing rules in environments with varying topological complexity. Environmental complexity increased behavioral variability, lengthened HC nonlocal sequences, and modulated action caching. We found contrasting representations between DLS and HC, supporting a competition between decision systems. dmPFC activity was indicative of setting this balance, in particular predicting the extent of HC non-local coding. Inactivating mPFC impaired short-term behavioral adaptation and produced long-term deficits in balancing decision systems. Our findings reveal the dynamic nature of decision-making systems and how environmental complexity modulates their engagement with implications for behavior in naturalistic environments.
Collapse
Affiliation(s)
- Ugurcan Mugan
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
| | - Samantha L Hoffman
- University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
| | - A David Redish
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA.
| |
Collapse
|
13
|
Moneta N, Grossman S, Schuck NW. Representational spaces in orbitofrontal and ventromedial prefrontal cortex: task states, values, and beyond. Trends Neurosci 2024; 47:1055-1069. [PMID: 39547861 DOI: 10.1016/j.tins.2024.10.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Revised: 10/16/2024] [Accepted: 10/17/2024] [Indexed: 11/17/2024]
Abstract
The orbitofrontal cortex (OFC) and ventromedial-prefrontal cortex (vmPFC) play a key role in decision-making and encode task states in addition to expected value. We review evidence suggesting a connection between value and state representations and argue that OFC / vmPFC integrate stimulus, context, and outcome information. Comparable encoding principles emerge in late layers of deep reinforcement learning (RL) models, where single nodes exhibit similar forms of mixed-selectivity, which enables flexible readout of relevant variables by downstream neurons. Based on these lines of evidence, we suggest that outcome-maximization leads to complex representational spaces that are insufficiently characterized by linear value signals that have been the focus of most prior research on the topic. Major outstanding questions concern the role of OFC/ vmPFC in learning across tasks, in encoding of task-irrelevant aspects, and the role of hippocampus-PFC interactions.
Collapse
Affiliation(s)
- Nir Moneta
- Institute of Psychology, Universität Hamburg, 20146 Hamburg, Germany; Einstein Center for Neurosciences Berlin, Charité Universitätsmedizin Berlin, 10117, Berlin, Germany.
| | - Shany Grossman
- Institute of Psychology, Universität Hamburg, 20146 Hamburg, Germany.
| | - Nicolas W Schuck
- Institute of Psychology, Universität Hamburg, 20146 Hamburg, Germany; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, Berlin, 14195 Berlin, Germany.
| |
Collapse
|
14
|
Noel JP, Zhang R, Pitkow X, Angelaki DE. Dorsolateral prefrontal cortex drives strategic aborting by optimizing long-run policy extraction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.28.625897. [PMID: 39651243 PMCID: PMC11623693 DOI: 10.1101/2024.11.28.625897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Real world choices often involve balancing decisions that are optimized for the short-vs. long-term. Here, we reason that apparently sub-optimal single trial decisions in macaques may in fact reflect long-term, strategic planning. We demonstrate that macaques freely navigating in VR for sequentially presented targets will strategically abort offers, forgoing more immediate rewards on individual trials to maximize session-long returns. This behavior is highly specific to the individual, demonstrating that macaques reason about their own long-run performance. Reinforcement-learning (RL) models suggest this behavior is algorithmically supported by modular actor-critic networks with a policy module not only optimizing long-term value functions, but also informed of specific state-action values allowing for rapid policy optimization. The behavior of artificial networks suggests that changes in policy for a matched offer ought to be evident as soon as offers are made, even if the aborting behavior occurs much later. We confirm this prediction by demonstrating that single units and population dynamics in macaque dorsolateral prefrontal cortex (dlPFC), but not parietal area 7a or dorsomedial superior temporal area (MSTd), reflect the upcoming reward-maximizing aborting behavior upon offer presentation. These results cast dlPFC as a specialized policy module, and stand in contrast to recent work demonstrating the distributed and recurrent nature of belief-networks.
Collapse
|
15
|
Tang H, Bartolo R, Averbeck BB. Ventral frontostriatal circuitry mediates the computation of reinforcement from symbolic gains and losses. Neuron 2024; 112:3782-3795.e5. [PMID: 39321792 PMCID: PMC11581918 DOI: 10.1016/j.neuron.2024.08.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 07/12/2024] [Accepted: 08/28/2024] [Indexed: 09/27/2024]
Abstract
Reinforcement learning (RL), particularly in primates, is often driven by symbolic outcomes. However, it is usually studied with primary reinforcers. To examine the neural mechanisms underlying learning from symbolic outcomes, we trained monkeys on a task in which they learned to choose options that led to gains of tokens and avoid choosing options that led to losses of tokens. We then recorded simultaneously from the orbitofrontal cortex (OFC), ventral striatum (VS), amygdala (AMY), and mediodorsal thalamus (MDt). We found that the OFC played a dominant role in coding token outcomes and token prediction errors. The other areas contributed complementary functions, with the VS coding appetitive outcomes and the AMY coding the salience of outcomes. The MDt coded actions and relayed information about tokens between the OFC and VS. Thus, the OFC leads the processing of symbolic RL in the ventral frontostriatal circuitry.
Collapse
Affiliation(s)
- Hua Tang
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20814, USA.
| | - Ramon Bartolo
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20814, USA; Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bethesda, MD 20814, USA
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20814, USA.
| |
Collapse
|
16
|
Blair RJR, Bashford-Largo J, Dominguez A, Dobbertin M, Blair KS, Bajaj S. Using machine learning to determine a functional classifier of reward responsiveness and its association with adolescent psychiatric symptomatology. Psychol Med 2024:1-10. [PMID: 39552378 DOI: 10.1017/s003329172400240x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/19/2024]
Abstract
BACKGROUND Machine learning (ML) has developed classifiers differentiating patient groups despite concerns regarding diagnostic reliability. An alternative strategy, used here, is to develop a functional classifier (hyperplane) (e.g. distinguishing the neural responses to received reward v. received punishment in typically developing (TD) adolescents) and then determine the functional integrity of the response (reward response distance from the hyperplane) in adolescents with externalizing and internalizing conditions and its associations with symptom clusters. METHODS Two hundred and ninety nine adolescents (mean age = 15.07 ± 2.30 years, 117 females) were divided into three groups: a training sample of TD adolescents where the Support Vector Machine (SVM) algorithm was applied (N = 65; 32 females), and two test groups- an independent sample of TD adolescents (N = 39; 14 females) and adolescents with a psychiatric diagnosis (major depressive disorder (MDD), generalized anxiety disorder (GAD), attention deficit hyperactivity disorder (ADHD) & conduct disorder (CD); N = 195, 71 females). RESULTS SVM ML analysis identified a hyperplane with accuracy = 80.77%, sensitivity = 78.38% and specificity = 88.99% that implicated feature neural regions associated with reward v. punishment (e.g. nucleus accumbens v. anterior insula cortices). Adolescents with externalizing diagnoses were significantly less likely to show a normative and significantly more likely to show a deficient reward response than the TD samples. Deficient reward response was associated with elevated CD, MDD, and ADHD symptoms. CONCLUSIONS Distinguishing the response to reward relative to punishment in TD adolescents via ML indicated notable disruptions in this response in patients with CD and ADHD and associations between reward responsiveness and CD, MDD, and ADHD symptom severity.
Collapse
Affiliation(s)
- Robert James Richard Blair
- Child and Adolescent Mental Health Center, Copenhagen University Hospital - Mental Health Services CPH, Copenhagen, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Johannah Bashford-Largo
- Child and Family Translational Research Center, Boys Town National Research Hospital, Boys Town, NE, USA
- Center for Brain, Biology, and Behavior, University of Nebraska-Lincoln, Lincoln, NE, USA
| | - Ahria Dominguez
- Clinical Health, Emotion, and Neuroscience (CHEN) Laboratory, Department of Neurological Sciences, College of Medicine, University of Nebraska Medical Center (UNMC), Omaha, NE, USA
| | - Matthew Dobbertin
- Child and Adolescent Psychiatric Inpatient Center, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Karina S Blair
- Child and Adolescent Psychiatric Inpatient Center, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Sahil Bajaj
- Department of Cancer Systems Imaging, Division of Diagnostic Imaging, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
17
|
Aloi J, Crum KI, Blair KS, Zhang R, Bashford-Largo J, Bajaj S, Hwang S, Averbeck BB, Tottenham N, Dobbertin M, Blair RJR. Childhood neglect is associated with alterations in neural prediction error signaling and the response to novelty. Psychol Med 2024; 54:1-9. [PMID: 39445510 PMCID: PMC11578899 DOI: 10.1017/s0033291724002411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 07/17/2024] [Accepted: 07/25/2024] [Indexed: 10/25/2024]
Abstract
BACKGROUND One in eight children experience early life stress (ELS), which increases risk for psychopathology. ELS, particularly neglect, has been associated with reduced responsivity to reward. However, little work has investigated the computational specifics of this disrupted reward response - particularly with respect to the neural response to Reward Prediction Errors (RPE) - a critical signal for successful instrumental learning - and the extent to which they are augmented to novel stimuli. The goal of the current study was to investigate the associations of abuse and neglect, and neural representation of RPE to novel and non-novel stimuli. METHODS One hundred and seventy-eight participants (aged 10-18, M = 14.9, s.d. = 2.38) engaged in the Novelty task while undergoing functional magnetic resonance imaging. In this task, participants learn to choose novel or non-novel stimuli to win monetary rewards varying from $0 to $0.30 per trial. Levels of abuse and neglect were measured using the Childhood Trauma Questionnaire. RESULTS Adolescents exposed to high levels of neglect showed reduced RPE-modulated blood oxygenation level dependent response within medial and lateral frontal cortices particularly when exploring novel stimuli (p < 0.05, corrected for multiple comparisons) relative to adolescents exposed to lower levels of neglect. CONCLUSIONS These data expand on previous work by indicating that neglect, but not abuse, is associated with impairments in neural RPE representation within medial and lateral frontal cortices. However, there was no association between neglect and behavioral impairments on the Novelty task, suggesting that these neural differences do not necessarily translate into behavioral differences within the context of the Novelty task.
Collapse
Affiliation(s)
- Joseph Aloi
- Department of Psychiatry, Indiana University School of Medicine, Indianapolis, IN, USA
- Adolescent Behavioral Health Research Program, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Kathleen I. Crum
- Department of Psychiatry, Indiana University School of Medicine, Indianapolis, IN, USA
- Adolescent Behavioral Health Research Program, Indiana University School of Medicine, Indianapolis, IN, USA
- Department of Neuroscience, Medical University of South Carolina, Charleston, SC, USA
| | - Karina S. Blair
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Ru Zhang
- Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Johannah Bashford-Largo
- Child and Family Translational Research Center, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Sahil Bajaj
- Department of Cancer Systems Imaging, Division of Diagnostic Imaging, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Soonjo Hwang
- Department of Psychiatry, University of Nebraska Medical Center, Omaha, NE, USA
| | - Bruno B. Averbeck
- Section on Learning and Decision Making, Laboratory of Neuropsychology, National Institute of Mental Health, Bethesda, MD, USA
| | - Nim Tottenham
- Department of Psychology, Columbia University, New York, NY, USA
| | - Matthew Dobbertin
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - R. James R. Blair
- Child and Adolescent Mental Health Centre, Mental Health Services, Capital Region of Denmark, Copenhagen, Denmark, USA
| |
Collapse
|
18
|
Moskovitz T, Miller KJ, Sahani M, Botvinick MM. Understanding dual process cognition via the minimum description length principle. PLoS Comput Biol 2024; 20:e1012383. [PMID: 39423224 PMCID: PMC11534269 DOI: 10.1371/journal.pcbi.1012383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 11/04/2024] [Accepted: 08/01/2024] [Indexed: 10/21/2024] Open
Abstract
Dual-process theories play a central role in both psychology and neuroscience, figuring prominently in domains ranging from executive control to reward-based learning to judgment and decision making. In each of these domains, two mechanisms appear to operate concurrently, one relatively high in computational complexity, the other relatively simple. Why is neural information processing organized in this way? We propose an answer to this question based on the notion of compression. The key insight is that dual-process structure can enhance adaptive behavior by allowing an agent to minimize the description length of its own behavior. We apply a single model based on this observation to findings from research on executive control, reward-based learning, and judgment and decision making, showing that seemingly diverse dual-process phenomena can be understood as domain-specific consequences of a single underlying set of computational principles.
Collapse
Affiliation(s)
- Ted Moskovitz
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Google DeepMind, London, United Kingdom
| | - Kevin J. Miller
- Google DeepMind, London, United Kingdom
- Department of Ophthalmology, University College London, London, United Kingdom
| | - Maneesh Sahani
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
| | - Matthew M. Botvinick
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Google DeepMind, London, United Kingdom
| |
Collapse
|
19
|
Woo JH, Costa VD, Taswell CA, Rothenhoefer KM, Averbeck BB, Soltani A. Contribution of amygdala to dynamic model arbitration under uncertainty. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.13.612869. [PMID: 39314420 PMCID: PMC11419134 DOI: 10.1101/2024.09.13.612869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Intrinsic uncertainty in the reward environment requires the brain to run multiple models simultaneously to predict outcomes based on preceding cues or actions, commonly referred to as stimulus- and action-based learning. Ultimately, the brain also must adopt appropriate choice behavior using reliability of these models. Here, we combined multiple experimental and computational approaches to quantify concurrent learning in monkeys performing tasks with different levels of uncertainty about the model of the environment. By comparing behavior in control monkeys and monkeys with bilateral lesions to the amygdala or ventral striatum, we found evidence for dynamic, competitive interaction between stimulus-based and action-based learning, and for a distinct role of the amygdala. Specifically, we demonstrate that the amygdala adjusts the initial balance between the two learning systems, thereby altering the interaction between arbitration and learning that shapes the time course of both learning and choice behaviors. This novel role of the amygdala can account for existing contradictory observations and provides testable predictions for future studies into circuit-level mechanisms of flexible learning and choice under uncertainty.
Collapse
|
20
|
Miller JA, Constantinidis C. Timescales of learning in prefrontal cortex. Nat Rev Neurosci 2024; 25:597-610. [PMID: 38937654 DOI: 10.1038/s41583-024-00836-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/03/2024] [Indexed: 06/29/2024]
Abstract
The lateral prefrontal cortex (PFC) in humans and other primates is critical for immediate, goal-directed behaviour and working memory, which are classically considered distinct from the cognitive and neural circuits that support long-term learning and memory. Over the past few years, a reconsideration of this textbook perspective has emerged, in that different timescales of memory-guided behaviour are in constant interaction during the pursuit of immediate goals. Here, we will first detail how neural activity related to the shortest timescales of goal-directed behaviour (which requires maintenance of current states and goals in working memory) is sculpted by long-term knowledge and learning - that is, how the past informs present behaviour. Then, we will outline how learning across different timescales (from seconds to years) drives plasticity in the primate lateral PFC, from single neuron firing rates to mesoscale neuroimaging activity patterns. Finally, we will review how, over days and months of learning, dense local and long-range connectivity patterns in PFC facilitate longer-lasting changes in population activity by changing synaptic weights and recruiting additional neural resources to inform future behaviour. Our Review sheds light on how the machinery of plasticity in PFC circuits facilitates the integration of learned experiences across time to best guide adaptive behaviour.
Collapse
Affiliation(s)
- Jacob A Miller
- Wu Tsai Institute, Yale University, New Haven, CT, USA
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - Christos Constantinidis
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, USA.
- Neuroscience Program, Vanderbilt University, Nashville, TN, USA.
- Department of Ophthalmology and Visual Sciences, Vanderbilt University Medical Center, Nashville, TN, USA.
| |
Collapse
|
21
|
Nasti L, Vecchiato G, Heuret P, Rowe NP, Palladino M, Marcati P. A Reinforcement Learning approach to study climbing plant behaviour. Sci Rep 2024; 14:18222. [PMID: 39107370 PMCID: PMC11303795 DOI: 10.1038/s41598-024-62147-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 05/14/2024] [Indexed: 08/10/2024] Open
Abstract
A plant's structure is the result of constant adaptation and evolution to the surrounding environment. From this perspective, our goal is to investigate the mass and radius distribution of a particular plant organ, namely the searcher shoot, by providing a Reinforcement Learning (RL) environment, that we call Searcher-Shoot, which considers the mechanics due to the mass of the shoot and leaves. We uphold the hypothesis that plants maximize their length, avoiding a maximal stress threshold. To do this, we explore whether the mass distribution along the stem is efficient, formulating a Markov Decision Process. By exploiting this strategy, we are able to mimic and thus study the plant's behavior, finding that shoots decrease their diameters smoothly, resulting in an efficient distribution of the mass. The strong accordance between our results and the experimental data allows us to remark on the strength of our approach in the analysis of biological systems traits.
Collapse
Affiliation(s)
- Lucia Nasti
- Gran Sasso Science Institute, L'Aquila, Italy.
| | | | - Patrick Heuret
- AMAP, Univ Montpellier, CIRAD, CNRS, INRAe, IRD, Montpellier, France
| | - Nicholas P Rowe
- AMAP, Univ Montpellier, CIRAD, CNRS, INRAe, IRD, Montpellier, France
| | - Michele Palladino
- Gran Sasso Science Institute, L'Aquila, Italy
- DISIM, Department of Information Engineering, Computer Science and Mathematics, University of L'Aquila, Via Vetoio, 67100, L'Aquila, Italy
| | | |
Collapse
|
22
|
Scott DN, Mukherjee A, Nassar MR, Halassa MM. Thalamocortical architectures for flexible cognition and efficient learning. Trends Cogn Sci 2024; 28:739-756. [PMID: 38886139 PMCID: PMC11305962 DOI: 10.1016/j.tics.2024.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 05/12/2024] [Accepted: 05/13/2024] [Indexed: 06/20/2024]
Abstract
The brain exhibits a remarkable ability to learn and execute context-appropriate behaviors. How it achieves such flexibility, without sacrificing learning efficiency, is an important open question. Neuroscience, psychology, and engineering suggest that reusing and repurposing computations are part of the answer. Here, we review evidence that thalamocortical architectures may have evolved to facilitate these objectives of flexibility and efficiency by coordinating distributed computations. Recent work suggests that distributed prefrontal cortical networks compute with flexible codes, and that the mediodorsal thalamus provides regularization to promote efficient reuse. Thalamocortical interactions resemble hierarchical Bayesian computations, and their network implementation can be related to existing gating, synchronization, and hub theories of thalamic function. By reviewing recent findings and providing a novel synthesis, we highlight key research horizons integrating computation, cognition, and systems neuroscience.
Collapse
Affiliation(s)
- Daniel N Scott
- Department of Neuroscience, Brown University, Providence, RI, USA; Robert J. and Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI, USA.
| | - Arghya Mukherjee
- Department of Neuroscience, Tufts University School of Medicine, Boston, MA, USA
| | - Matthew R Nassar
- Department of Neuroscience, Brown University, Providence, RI, USA; Robert J. and Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI, USA
| | - Michael M Halassa
- Department of Neuroscience, Tufts University School of Medicine, Boston, MA, USA; Department of Psychiatry, Tufts University School of Medicine, Boston, MA, USA.
| |
Collapse
|
23
|
Vassiliadis P, Beanato E, Popa T, Windel F, Morishita T, Neufeld E, Duque J, Derosiere G, Wessel MJ, Hummel FC. Non-invasive stimulation of the human striatum disrupts reinforcement learning of motor skills. Nat Hum Behav 2024; 8:1581-1598. [PMID: 38811696 PMCID: PMC11343719 DOI: 10.1038/s41562-024-01901-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 04/23/2024] [Indexed: 05/31/2024]
Abstract
Reinforcement feedback can improve motor learning, but the underlying brain mechanisms remain underexplored. In particular, the causal contribution of specific patterns of oscillatory activity within the human striatum is unknown. To address this question, we exploited a recently developed non-invasive deep brain stimulation technique called transcranial temporal interference stimulation (tTIS) during reinforcement motor learning with concurrent neuroimaging, in a randomized, sham-controlled, double-blind study. Striatal tTIS applied at 80 Hz, but not at 20 Hz, abolished the benefits of reinforcement on motor learning. This effect was related to a selective modulation of neural activity within the striatum. Moreover, 80 Hz, but not 20 Hz, tTIS increased the neuromodulatory influence of the striatum on frontal areas involved in reinforcement motor learning. These results show that tTIS can non-invasively and selectively modulate a striatal mechanism involved in reinforcement learning, expanding our tools for the study of causal relationships between deep brain structures and human behaviour.
Collapse
Affiliation(s)
- Pierre Vassiliadis
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland
- Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium
| | - Elena Beanato
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland
| | - Traian Popa
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland
| | - Fabienne Windel
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland
| | - Takuya Morishita
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland
| | - Esra Neufeld
- Foundation for Research on Information Technologies in Society, Zurich, Switzerland
| | - Julie Duque
- Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium
| | - Gerard Derosiere
- Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium
- Lyon Neuroscience Research Center, Impact Team, Inserm U1028, CNRS UMR5292, Lyon 1 University, Bron, France
| | - Maximilian J Wessel
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland
- Department of Neurology, University Hospital Würzburg, Würzburg, Germany
| | - Friedhelm C Hummel
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, École Polytechnique Fédérale de Lausanne (EPFL), Geneva, Switzerland.
- Defitech Chair of Clinical Neuroengineering, Neuro-X Institute, EPFL Valais, Clinique Romande de Réadaptation, Sion, Switzerland.
- Clinical Neuroscience, University of Geneva Medical School, Geneva, Switzerland.
| |
Collapse
|
24
|
Chung YS, van den Berg B, Roberts KC, Bagdasarov A, Woldorff MG, Gaffrey MS. Electrical brain activations in preadolescents during a probabilistic reward-learning task reflect cognitive processes and behavioral strategy. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.10.16.562326. [PMID: 37905129 PMCID: PMC10614771 DOI: 10.1101/2023.10.16.562326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Both adults and children learn through feedback which environmental events and choices are associated with higher probability of reward, an ability thought to be supported by the development of fronto-striatal reward circuits. Recent developmental studies have applied computational models of reward learning to investigate such learning in children. However, tasks and measures effective for assaying the cascade of reward-learning neural processes in children have been limited. Using a child-version of a probabilistic reward-learning task while recording event-related-potential (ERP) measures of electrical brain activity, this study examined key processes of reward learning in preadolescents (8-12 years old; n=30), namely: (1) reward-feedback sensitivity, as measured by the early-latency, reward-related, frontal ERP positivity, (2) rapid attentional shifting of processing toward favored visual stimuli, as measured by the N2pc component, and (3) longer-latency attention-related responses to reward feedback as a function of behavioral strategies (i.e., Win-Stay-Lose-Shift), as measured by the central-parietal P300. Consistent with our prior work in adults, the behavioral findings indicate preadolescents can learn stimulus-reward outcome associations, but at varying levels of performance. Neurally, poor preadolescent learners (those with slower learning rates) showed greater reward-related positivity amplitudes relative to good learners, suggesting greater reward-feedback sensitivity. We also found attention shifting towards to-be-chosen stimuli, as evidenced by the N2pc, but not to more highly rewarded stimuli as we have observed in adults. Lastly, we found the behavioral learning strategy (i.e., Win-Stay-Lose-Shift) reflected by the feedback-elicited parietal P300. These findings provide novel insights into the key neural processes underlying reinforcement learning in preadolescents.
Collapse
|
25
|
Combrisson E, Basanisi R, Gueguen MCM, Rheims S, Kahane P, Bastin J, Brovelli A. Neural interactions in the human frontal cortex dissociate reward and punishment learning. eLife 2024; 12:RP92938. [PMID: 38941238 PMCID: PMC11213568 DOI: 10.7554/elife.92938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024] Open
Abstract
How human prefrontal and insular regions interact while maximizing rewards and minimizing punishments is unknown. Capitalizing on human intracranial recordings, we demonstrate that the functional specificity toward reward or punishment learning is better disentangled by interactions compared to local representations. Prefrontal and insular cortices display non-selective neural populations to rewards and punishments. Non-selective responses, however, give rise to context-specific interareal interactions. We identify a reward subsystem with redundant interactions between the orbitofrontal and ventromedial prefrontal cortices, with a driving role of the latter. In addition, we find a punishment subsystem with redundant interactions between the insular and dorsolateral cortices, with a driving role of the insula. Finally, switching between reward and punishment learning is mediated by synergistic interactions between the two subsystems. These results provide a unifying explanation of distributed cortical representations and interactions supporting reward and punishment learning.
Collapse
Affiliation(s)
- Etienne Combrisson
- Institut de Neurosciences de la Timone, Aix Marseille UniversitéMarseilleFrance
| | - Ruggero Basanisi
- Institut de Neurosciences de la Timone, Aix Marseille UniversitéMarseilleFrance
| | - Maelle CM Gueguen
- Univ. Grenoble Alpes, Inserm, U1216, Grenoble Institut NeurosciencesGrenobleFrance
| | - Sylvain Rheims
- Department of Functional Neurology and Epileptology, Hospices Civils de Lyon and University of LyonLyonFrance
| | - Philippe Kahane
- Univ. Grenoble Alpes, Inserm, U1216, CHU Grenoble Alpes, Grenoble Institut NeurosciencesGrenobleFrance
| | - Julien Bastin
- Univ. Grenoble Alpes, Inserm, U1216, Grenoble Institut NeurosciencesGrenobleFrance
| | - Andrea Brovelli
- Institut de Neurosciences de la Timone, Aix Marseille UniversitéMarseilleFrance
| |
Collapse
|
26
|
Nick Q, Gale DJ, Areshenkoff C, De Brouwer A, Nashed J, Wammes J, Zhu T, Flanagan R, Smallwood J, Gallivan J. Reconfigurations of cortical manifold structure during reward-based motor learning. eLife 2024; 12:RP91928. [PMID: 38916598 PMCID: PMC11198988 DOI: 10.7554/elife.91928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/26/2024] Open
Abstract
Adaptive motor behavior depends on the coordinated activity of multiple neural systems distributed across the brain. While the role of sensorimotor cortex in motor learning has been well established, how higher-order brain systems interact with sensorimotor cortex to guide learning is less well understood. Using functional MRI, we examined human brain activity during a reward-based motor task where subjects learned to shape their hand trajectories through reinforcement feedback. We projected patterns of cortical and striatal functional connectivity onto a low-dimensional manifold space and examined how regions expanded and contracted along the manifold during learning. During early learning, we found that several sensorimotor areas in the dorsal attention network exhibited increased covariance with areas of the salience/ventral attention network and reduced covariance with areas of the default mode network (DMN). During late learning, these effects reversed, with sensorimotor areas now exhibiting increased covariance with DMN areas. However, areas in posteromedial cortex showed the opposite pattern across learning phases, with its connectivity suggesting a role in coordinating activity across different networks over time. Our results establish the neural changes that support reward-based motor learning and identify distinct transitions in the functional coupling of sensorimotor to transmodal cortex when adapting behavior.
Collapse
Affiliation(s)
- Qasem Nick
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Psychology, Queen’s UniversityKingstonCanada
| | - Daniel J Gale
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
| | - Corson Areshenkoff
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Psychology, Queen’s UniversityKingstonCanada
| | - Anouk De Brouwer
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
| | - Joseph Nashed
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Medicine, Queen's UniversityKingstonCanada
| | - Jeffrey Wammes
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Psychology, Queen’s UniversityKingstonCanada
| | - Tianyao Zhu
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
| | - Randy Flanagan
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Psychology, Queen’s UniversityKingstonCanada
| | - Jonny Smallwood
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Psychology, Queen’s UniversityKingstonCanada
| | - Jason Gallivan
- Centre for Neuroscience Studies, Queen’s UniversityKingstonCanada
- Department of Psychology, Queen’s UniversityKingstonCanada
- Department of Biomedical and Molecular Sciences, Queen’s UniversityKingstonCanada
| |
Collapse
|
27
|
Roswandowitz C, Kathiresan T, Pellegrino E, Dellwo V, Frühholz S. Cortical-striatal brain network distinguishes deepfake from real speaker identity. Commun Biol 2024; 7:711. [PMID: 38862808 PMCID: PMC11166919 DOI: 10.1038/s42003-024-06372-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 05/22/2024] [Indexed: 06/13/2024] Open
Abstract
Deepfakes are viral ingredients of digital environments, and they can trick human cognition into misperceiving the fake as real. Here, we test the neurocognitive sensitivity of 25 participants to accept or reject person identities as recreated in audio deepfakes. We generate high-quality voice identity clones from natural speakers by using advanced deepfake technologies. During an identity matching task, participants show intermediate performance with deepfake voices, indicating levels of deception and resistance to deepfake identity spoofing. On the brain level, univariate and multivariate analyses consistently reveal a central cortico-striatal network that decoded the vocal acoustic pattern and deepfake-level (auditory cortex), as well as natural speaker identities (nucleus accumbens), which are valued for their social relevance. This network is embedded in a broader neural identity and object recognition network. Humans can thus be partly tricked by deepfakes, but the neurocognitive mechanisms identified during deepfake processing open windows for strengthening human resilience to fake information.
Collapse
Affiliation(s)
- Claudia Roswandowitz
- Cognitive and Affective Neuroscience Unit, Department of Psychology, University of Zurich, Zurich, Switzerland.
- Phonetics and Speech Sciences Group, Department of Computational Linguistics, University of Zurich, Zurich, Switzerland.
- Neuroscience Centre Zurich, University of Zurich and ETH Zurich, Zurich, Switzerland.
| | - Thayabaran Kathiresan
- Centre for Neuroscience of Speech, University Melbourne, Melbourne, Australia
- Redenlab, Melbourne, Australia
| | - Elisa Pellegrino
- Phonetics and Speech Sciences Group, Department of Computational Linguistics, University of Zurich, Zurich, Switzerland
| | - Volker Dellwo
- Phonetics and Speech Sciences Group, Department of Computational Linguistics, University of Zurich, Zurich, Switzerland
| | - Sascha Frühholz
- Cognitive and Affective Neuroscience Unit, Department of Psychology, University of Zurich, Zurich, Switzerland
- Neuroscience Centre Zurich, University of Zurich and ETH Zurich, Zurich, Switzerland
- Department of Psychology, University of Oslo, Oslo, Norway
| |
Collapse
|
28
|
Borra E, Ballestrazzi G, Biancheri D, Caminiti R, Luppino G. Involvement of the claustrum in the cortico-basal ganglia circuitry: connectional study in the non-human primate. Brain Struct Funct 2024; 229:1143-1164. [PMID: 38615290 PMCID: PMC11147942 DOI: 10.1007/s00429-024-02784-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 03/04/2024] [Indexed: 04/15/2024]
Abstract
The claustrum is an ancient telencephalic subcortical structure displaying extensive, reciprocal connections with much of the cortex and receiving projections from thalamus, amygdala, and hippocampus. This structure has a general role in modulating cortical excitability and is considered to be engaged in different cognitive and motor functions, such as sensory integration and perceptual binding, salience-guided attention, top-down executive functions, as well as in the control of brain states, such as sleep and its interhemispheric integration. The present study is the first to describe in detail a projection from the claustrum to the striatum in the macaque brain. Based on tracer injections in different striatal regions and in different cortical areas, we observed a rough topography of the claustral connectivity, thanks to which a claustral zone projects to both a specific striatal territory and to cortical areas involved in a network projecting to the same striatal territory. The present data add new elements of complexity of the basal ganglia information processing mode in motor and non-motor functions and provide evidence for an influence of the claustrum on both cortical functional domains and cortico-basal ganglia circuits.
Collapse
Affiliation(s)
- Elena Borra
- Unità di Neuroscienze, Dipartimento di Medicina e Chirurgia, Università di Parma, 43100, Parma, Italy.
| | - Gemma Ballestrazzi
- Unità di Neuroscienze, Dipartimento di Medicina e Chirurgia, Università di Parma, 43100, Parma, Italy
| | - Dalila Biancheri
- Unità di Neuroscienze, Dipartimento di Medicina e Chirurgia, Università di Parma, 43100, Parma, Italy
| | - Roberto Caminiti
- Neuroscience and Behaviour Laboratory, Istituto Italiano di Tecnologia (IIT), 00161, Rome, Italy
| | - Giuseppe Luppino
- Unità di Neuroscienze, Dipartimento di Medicina e Chirurgia, Università di Parma, 43100, Parma, Italy
| |
Collapse
|
29
|
Parishar P, Rajagopalan M, Iyengar S. Changes in the dopaminergic circuitry and adult neurogenesis linked to reinforcement learning in corvids. Front Neurosci 2024; 18:1359874. [PMID: 38808028 PMCID: PMC11130420 DOI: 10.3389/fnins.2024.1359874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 04/29/2024] [Indexed: 05/30/2024] Open
Abstract
The caudolateral nidopallium (NCL, an analog of the prefrontal cortex) is known to be involved in learning, memory, and discrimination in corvids (a songbird), whereas the involvement of other brain regions in these phenomena is not well explored. We used house crows (Corvus splendens) to explore the neural correlates of learning and decision-making by initially training them on a shape discrimination task followed by immunohistochemistry to study the immediate early gene expression (Arc), a dopaminoceptive neuronal marker (DARPP-32, Dopamine- and cAMP-regulated phosphoprotein, Mr 32 kDa) to understand the involvement of the reward pathway and an immature neuronal marker (DCX, doublecortin) to detect learning-induced changes in adult neurogenesis. We performed neuronal counts and neuronal tracing, followed by morphometric analyses. Our present results have demonstrated that besides NCL, other parts of the caudal nidopallium (NC), avian basal ganglia, and intriguingly, vocal control regions in house crows are involved in visual discrimination. We have also found that training on the visual discrimination task can be correlated with neurite pruning in mature dopaminoceptive neurons and immature DCX-positive neurons in the NC of house crows. Furthermore, there is an increase in the incorporation of new neurons throughout NC and the medial striatum which can also be linked to learning. For the first time, our results demonstrate that a combination of structural changes in mature and immature neurons and adult neurogenesis are linked to learning in corvids.
Collapse
|
30
|
Tang H, Bartolo-Orozco R, Averbeck BB. Ventral frontostriatal circuitry mediates the computation of reinforcement from symbolic gains and losses. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.03.587097. [PMID: 38617219 PMCID: PMC11014508 DOI: 10.1101/2024.04.03.587097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Reinforcement learning (RL), particularly in primates, is often driven by symbolic outcomes. However, it is usually studied with primary reinforcers. To examine the neural mechanisms underlying learning from symbolic outcomes, we trained monkeys on a task in which they learned to choose options that led to gains of tokens and avoid choosing options that led to losses of tokens. We then recorded simultaneously from the orbitofrontal cortex (OFC), ventral striatum (VS), amygdala (AMY), and the mediodorsal thalamus (MDt). We found that the OFC played a dominant role in coding token outcomes and token prediction errors. The other areas contributed complementary functions with the VS coding appetitive outcomes and the AMY coding the salience of outcomes. The MDt coded actions and relayed information about tokens between the OFC and VS. Thus, OFC leads the process of symbolic reinforcement learning in the ventral frontostriatal circuitry.
Collapse
|
31
|
Pietrzak M, Yngve A, Hamilton JP, Asratian A, Gauffin E, Löfberg A, Gustavson S, Persson E, Capusan AJ, Leggio L, Perini I, Tinghög G, Heilig M, Boehme R. Ghrelin decreases sensitivity to negative feedback and increases prediction-error related caudate activity in humans, a randomized controlled trial. Neuropsychopharmacology 2024; 49:1042-1049. [PMID: 38409282 PMCID: PMC11039644 DOI: 10.1038/s41386-024-01821-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 02/01/2024] [Accepted: 02/04/2024] [Indexed: 02/28/2024]
Abstract
The stomach-derived hormone ghrelin plays not only a role in feeding, starvation, and survival, but it has been suggested to also be involved in the stress response, in neuropsychiatric conditions, and in alcohol and drug use disorders. Mechanisms related to reward processing might mediate ghrelin's broader effects on complex behaviors, as indicated by animal studies and mostly correlative human studies. Here, using a within-subject double-blind placebo-controlled design with intravenous ghrelin infusion in healthy volunteers (n = 30), we tested whether ghrelin alters sensitivity to reward and punishment in a reward learning task. Parameters were derived from a computational model of participants' task behavior. The reversal learning task with monetary rewards was performed during functional brain imaging to investigate ghrelin effects on brain signals related to reward prediction errors. Compared to placebo, ghrelin decreased punishment sensitivity (t = -2.448, p = 0.021), while reward sensitivity was unaltered (t = 0.8, p = 0.43). We furthermore found increased prediction-error related activity in the dorsal striatum during ghrelin administration (region of interest analysis: t-values ≥ 4.21, p-values ≤ 0.044). Our results support a role for ghrelin in reward processing that extends beyond food-related rewards. Reduced sensitivity to negative outcomes and increased processing of prediction errors may be beneficial for food foraging when hungry but could also relate to increased risk taking and impulsivity in the broader context of addictive behaviors.
Collapse
Affiliation(s)
- Michal Pietrzak
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Department of Psychiatry, Linköping University Hospital, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Adam Yngve
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - J Paul Hamilton
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
- Department of Medical and Biological Psychology, University of Bergen, Bergen, 5007, Norway
| | - Anna Asratian
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Emelie Gauffin
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Department of Psychiatry, Linköping University Hospital, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Andreas Löfberg
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Department of Psychiatry, Linköping University Hospital, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Sarah Gustavson
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Department of Psychiatry, Linköping University Hospital, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Emil Persson
- Division of Economics, Department of Management and Engineering, Linköping University, Linköping, 58183, Sweden
| | - Andrea J Capusan
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Department of Psychiatry, Linköping University Hospital, Linköping, 58183, Sweden
| | - Lorenzo Leggio
- Section on Clinical Psychoneuroendocrinology and Neuropsychopharmacology, Translational Addiction Medicine Branch, National Institute on Drug Abuse Intramural Research Program and National Institute on Alcohol Abuse and Alcoholism, Division of Intramural Clinical and Biological Research, National Institutes of Health, Baltimore, MD, 21224, USA
| | - Irene Perini
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Gustav Tinghög
- Division of Economics, Department of Management and Engineering, Linköping University, Linköping, 58183, Sweden
- National Center for Health Care Priority Setting, Department of Health Medicine and Caring Sciences, Linköping University, 58183, Linköping, Sweden
| | - Markus Heilig
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden
- Department of Psychiatry, Linköping University Hospital, Linköping, 58183, Sweden
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden
| | - Rebecca Boehme
- Center for Social and Affective Neuroscience, Department of Biomedical and Clinical Sciences, Linköping University, Linköping, 58183, Sweden.
- Center for Medical Imaging and Visualization, Linköping University, Linköping, 58183, Sweden.
| |
Collapse
|
32
|
Colas JT, O’Doherty JP, Grafton ST. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts. PLoS Comput Biol 2024; 20:e1011950. [PMID: 38552190 PMCID: PMC10980507 DOI: 10.1371/journal.pcbi.1011950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 02/26/2024] [Indexed: 04/01/2024] Open
Abstract
Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning. With systematic comparison and falsification of computational models, human choices were tested for signatures of parallel modules representing not only an enhanced form of generalized reinforcement learning but also action bias and hysteresis. We found evidence for substantial differences in bias and hysteresis across participants-even comparable in magnitude to the individual differences in learning. Individuals who did not learn well revealed the greatest biases, but those who did learn accurately were also significantly biased. The direction of hysteresis varied among individuals as repetition or, more commonly, alternation biases persisting from multiple previous actions. Considering that these actions were button presses with trivial motor demands, the idiosyncratic forces biasing sequences of action choices were robust enough to suggest ubiquity across individuals and across tasks requiring various actions. In light of how bias and hysteresis function as a heuristic for efficient control that adapts to uncertainty or low motivation by minimizing the cost of effort, these phenomena broaden the consilient theory of a mixture of experts to encompass a mixture of expert and nonexpert controllers of behavior.
Collapse
Affiliation(s)
- Jaron T. Colas
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
| | - John P. O’Doherty
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
| | - Scott T. Grafton
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
| |
Collapse
|
33
|
Wu K, Lo YT, Cavaleri J, Bergosh M, Ipe J, Briggs RG, Jann KB, Murray SB, Mason XL, Liu CY, Lee DJ. Neuromodulation of Eating Disorders: A Review of Underlying Neural Network Activity and Neuromodulatory Treatments. Brain Sci 2024; 14:200. [PMID: 38539589 PMCID: PMC10968923 DOI: 10.3390/brainsci14030200] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 02/16/2024] [Accepted: 02/20/2024] [Indexed: 11/11/2024] Open
Abstract
Eating disorders are a group of psychiatric conditions that involve pathological relationships between patients and food. The most prolific of these disorders are anorexia nervosa, bulimia nervosa, and binge eating disorder. The current standard of care involves psychotherapy, pharmacotherapy, and the management of comorbid conditions, with nutritional rehabilitation reserved for severe cases of anorexia nervosa. Unfortunately, many patients often fail to respond, leaving a concerning treatment gap between the current and requisite treatments for eating disorders. To better understand the neurobiology underlying these eating disorders, investigations have been undertaken to characterize the activity of various neural networks, primarily those activated during tasks of executive inhibition, reward processing, and self-reference. Various neuromodulatory techniques have been proposed to stimulate these networks with the goal of improving patients' BMI and mental health. The aim of this review is to compile a comprehensive summarization of the current literature regarding the underlying neural connectivity of anorexia nervosa, bulimia nervosa, and binge eating disorder as well as the numerous neuromodulatory modalities that have been investigated. Importantly, we aimed to summarize the most significant clinical trials to date as well as to provide an updated assessment of the role of deep brain stimulation, summarizing numerous recently published clinical studies that have greatly contributed to the literature. In this review, we found therapeutic evidence for transcranial magnetic stimulation and transcranial direct current stimulation in treating individuals suffering from anorexia nervosa, bulimia nervosa, and binge eating disorder. We also found significant evidence for the role of deep brain stimulation, particularly as an escalatory therapy option for the those who failed standard therapy. Finally, we hope to provide promising directions for future clinical investigations.
Collapse
Affiliation(s)
- Kevin Wu
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
| | - Yu Tung Lo
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
- Department of Neurosurgery, National Neuroscience Institute, Singapore 308433, Singapore
| | - Jonathon Cavaleri
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
| | - Matthew Bergosh
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
| | - Jennifer Ipe
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
| | - Robert G. Briggs
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
| | - Kay B. Jann
- USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA;
| | - Stuart B. Murray
- Department of Psychiatry and Behavioral Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA;
| | - Xenos L. Mason
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
- Department of Neurology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Charles Y. Liu
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
- USC Neurorestoration Center, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90033, USA
| | - Darrin J. Lee
- Department of Neurological Surgery, Keck School of Medicine, University of Southern California, Los Angeles, CA 900033, USA; (Y.T.L.); (J.C.); (M.B.); (J.I.); (R.G.B.); (X.L.M.); (C.Y.L.); (D.J.L.)
- USC Neurorestoration Center, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90033, USA
| |
Collapse
|
34
|
Wise T, Emery K, Radulescu A. Naturalistic reinforcement learning. Trends Cogn Sci 2024; 28:144-158. [PMID: 37777463 PMCID: PMC10878983 DOI: 10.1016/j.tics.2023.08.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/23/2023] [Accepted: 08/24/2023] [Indexed: 10/02/2023]
Abstract
Humans possess a remarkable ability to make decisions within real-world environments that are expansive, complex, and multidimensional. Human cognitive computational neuroscience has sought to exploit reinforcement learning (RL) as a framework within which to explain human decision-making, often focusing on constrained, artificial experimental tasks. In this article, we review recent efforts that use naturalistic approaches to determine how humans make decisions in complex environments that better approximate the real world, providing a clearer picture of how humans navigate the challenges posed by real-world decisions. These studies purposely embed elements of naturalistic complexity within experimental paradigms, rather than focusing on simplification, generating insights into the processes that likely underpin humans' ability to navigate complex, multidimensional real-world environments so successfully.
Collapse
Affiliation(s)
- Toby Wise
- Department of Neuroimaging, King's College London, London, UK.
| | - Kara Emery
- Center for Data Science, New York University, New York, NY, USA
| | - Angela Radulescu
- Center for Computational Psychiatry, Icahn School of Medicine at Mt. Sinai, New York, NY, USA
| |
Collapse
|
35
|
Giarrocco F, Costa VD, Basile BM, Pujara MS, Murray EA, Averbeck BB. Motor System-Dependent Effects of Amygdala and Ventral Striatum Lesions on Explore-Exploit Behaviors. J Neurosci 2024; 44:e1206232023. [PMID: 38296647 PMCID: PMC10860650 DOI: 10.1523/jneurosci.1206-23.2023] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/17/2023] [Accepted: 11/21/2023] [Indexed: 02/02/2024] Open
Abstract
Deciding whether to forego immediate rewards or explore new opportunities is a key component of flexible behavior and is critical for the survival of the species. Although previous studies have shown that different cortical and subcortical areas, including the amygdala and ventral striatum (VS), are implicated in representing the immediate (exploitative) and future (explorative) value of choices, the effect of the motor system used to make choices has not been examined. Here, we tested male rhesus macaques with amygdala or VS lesions on two versions of a three-arm bandit task where choices were registered with either a saccade or an arm movement. In both tasks we presented the monkeys with explore-exploit tradeoffs by periodically replacing familiar options with novel options that had unknown reward probabilities. We found that monkeys explored more with saccades but showed better learning with arm movements. VS lesions caused the monkeys to be more explorative with arm movements and less explorative with saccades, although this may have been due to an overall decrease in performance. VS lesions affected the monkeys' ability to learn novel stimulus-reward associations in both tasks, while after amygdala lesions this effect was stronger when choices were made with saccades. Further, on average, VS and amygdala lesions reduced the monkeys' ability to choose better options only when choices were made with a saccade. These results show that learning reward value associations to manage explore-exploit behaviors is motor system dependent and they further define the contributions of amygdala and VS to reinforcement learning.
Collapse
Affiliation(s)
- Franco Giarrocco
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| | - Vincent D Costa
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
- Division of Neuroscience, Oregon National Primate Research Center, Beaverton 97006, OR
| | - Benjamin M Basile
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
- Department of Psychology, Dickinson College, Carlisle 17013, PA
| | - Maia S Pujara
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| | - Elisabeth A Murray
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda 20892-4415, MD
| |
Collapse
|
36
|
Liuzzi L, Pine DS, Fox NA, Averbeck BB. Changes in Behavior and Neural Dynamics across Adolescent Development. J Neurosci 2023; 43:8723-8732. [PMID: 37848282 PMCID: PMC10727120 DOI: 10.1523/jneurosci.0462-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 08/28/2023] [Accepted: 09/19/2023] [Indexed: 10/19/2023] Open
Abstract
Adolescence is an important developmental period, during which substantial changes occur in brain function and behavior. Several aspects of executive function, including response inhibition, improve during this period. Correspondingly, structural imaging studies have documented consistent decreases in cortical and subcortical gray matter volume, and postmortem histologic studies have found substantial (∼40%) decreases in excitatory synapses in prefrontal cortex. Recent computational modeling work suggests that the change in synaptic density underlie improvements in task performance. These models also predict changes in neural dynamics related to the depth of attractor basins, where deeper basins can underlie better task performance. In this study, we analyzed task-related neural dynamics in a large cohort of longitudinally followed subjects (male and female) spanning early to late adolescence. We found that age correlated positively with behavioral performance in the Eriksen Flanker task. Older subjects were also characterized by deeper attractor basins around task related evoked EEG potentials during specific cognitive operations. Thus, consistent with computational models examining the effects of excitatory synaptic pruning, older adolescents showed stronger attractor dynamics during task performance.SIGNIFICANCE STATEMENT There are well-documented changes in brain and behavior during adolescent development. However, there are few mechanistic theories that link changes in the brain to changes in behavior. Here, we tested a hypothesis, put forward on the basis of computational modeling, that pruning of excitatory synapses in cortex during adolescence changes neural dynamics. We found, consistent with the hypothesis, that variability around event-related potentials shows faster decay dynamics in older adolescent subjects. The faster decay dynamics are consistent with the hypothesis that synaptic pruning during adolescent development leads to stronger attractor basins in task-related neural activity.
Collapse
Affiliation(s)
- Lucrezia Liuzzi
- Emotion and Development Branch, National Institute of Mental Health, Bethesda, 20892, MD
| | - Daniel S Pine
- Emotion and Development Branch, National Institute of Mental Health, Bethesda, 20892, MD
| | - Nathan A Fox
- Department of Human Development and Quantitative Methodology, University of Maryland, College Park, MD 20742
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, Bethesda, 20892, MD
| |
Collapse
|
37
|
Crum KI, Aloi J, Blair KS, Bashford-Largo J, Bajaj S, Zhang R, Hwang S, Schwartz A, Elowsky J, Filbey FM, Dobbertin M, Blair RJ. Latent profiles of substance use, early life stress, and attention/externalizing problems and their association with neural correlates of reinforcement learning in adolescents. Psychol Med 2023; 53:7358-7367. [PMID: 37144406 PMCID: PMC10625649 DOI: 10.1017/s0033291723000971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
BACKGROUND Adolescent substance use, externalizing and attention problems, and early life stress (ELS) commonly co-occur. These psychopathologies show overlapping neural dysfunction in the form of reduced recruitment of reward processing neuro-circuitries. However, it is unclear to what extent these psychopathologies show common v. different neural dysfunctions as a function of symptom profiles, as no studies have directly compared neural dysfunctions associated with each of these psychopathologies to each other. METHODS In study 1, a latent profile analysis (LPA) was conducted in a sample of 266 adolescents (aged 13-18, 41.7% female, 58.3% male) from a residential youth care facility and the surrounding community to investigate substance use, externalizing and attention problems, and ELS psychopathologies and their co-presentation. In study 2, we examined a subsample of 174 participants who completed the Passive Avoidance learning task during functional magnetic resonance imaging to examine differential and/or common reward processing neuro-circuitry dysfunctions associated with symptom profiles based on these co-presentations. RESULTS In study 1, LPA identified profiles of substance use plus rule-breaking behaviors, attention-deficit hyperactivity disorder, and ELS. In study 2, the substance use/rule-breaking profile was associated with reduced recruitment of reward processing and attentional neuro-circuitries during the Passive Avoidance task (p < 0.05, corrected for multiple comparisons). CONCLUSIONS Findings indicate that there is reduced responsivity of striato-cortical regions when receiving outcomes on an instrumental learning task within a profile of adolescents with substance use and rule-breaking behaviors. Mitigating reward processing dysfunction specifically may represent a potential intervention target for substance-use psychopathologies accompanied by rule-breaking behaviors.
Collapse
Affiliation(s)
- Kathleen I Crum
- Department of Psychiatry, Indiana University School of Medicine, Indianapolis, IN, USA
- Division of Neuroimaging, Department of Neuroscience, Medical University of South Carolina, Charleston, SC, USA
| | - Joseph Aloi
- Department of Psychiatry, Indiana University School of Medicine, Indianapolis, IN, USA
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Karina S Blair
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Johannah Bashford-Largo
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Sahil Bajaj
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Ru Zhang
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - Soonjo Hwang
- Department of Psychiatry, University of Nebraska Medical Center, Omaha, NE, USA
| | - Amanda Schwartz
- Department of Psychology, University of North Dakota, Grand Forks, ND, USA
| | - Jaimie Elowsky
- Department of Psychology, University of Nebraska - Lincoln, Lincoln, NE, USA
| | - Francesca M Filbey
- Center for BrainHealth, School of Behavioral and Brain Sciences, University of Texas at Dallas, Dallas, TX, USA
| | - Matthew Dobbertin
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| | - R James Blair
- Center for Neurobehavioral Research, Boys Town National Research Hospital, Boys Town, NE, USA
| |
Collapse
|
38
|
Shin WG, Jyung M, Choi I, Sul S. Perceived financial well-being and its association with frontostriatal functional connectivity, real-life anticipatory experiences, and everyday happiness. Sci Rep 2023; 13:18739. [PMID: 37907524 PMCID: PMC10618479 DOI: 10.1038/s41598-023-44001-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 10/03/2023] [Indexed: 11/02/2023] Open
Abstract
Perceived financial well-being (FWB) is an important aspect of life that can affect one's attitude toward future experiences and happiness. However, the relationship between FWB, anticipatory experiences, and happiness, and the brain's functional architecture underlying this relationship remain unknown. Here, we combined an experience sampling method, multilevel modeling, and functional neuroimaging to identify the neural correlates of FWB and their associations with real-world anticipatory experiences and everyday happiness. Behaviorally, we found that individuals with greater FWB felt more positive and more interested when they expected positive events to occur, which in turn resulted in increased everyday happiness. Furthermore, the level of FWB was significantly associated with the strength of functional connectivity (FC) between the nucleus accumbens (NAc) and ventromedial prefrontal cortex (vmPFC) and the local coherence within the vmPFC. The frontostriatal FC and local coherence within the vmPFC were further predictive of everyday happiness via the anticipatory response involving interestedness during positive expectations. Our findings suggest that individual differences in FWB could be reflected in the functional architecture of brain's reward system that may contribute to shaping positive anticipatory experiences and happiness in daily life.
Collapse
Affiliation(s)
- Won-Gyo Shin
- Social Neuroscience Laboratory, Department of Psychology, Pusan National University, 2 Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan, 46241, Republic of Korea
| | - Mina Jyung
- Department of Psychology, Seoul National University, Seoul, Republic of Korea
| | - Incheol Choi
- Department of Psychology, Seoul National University, Seoul, Republic of Korea
| | - Sunhae Sul
- Social Neuroscience Laboratory, Department of Psychology, Pusan National University, 2 Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan, 46241, Republic of Korea.
| |
Collapse
|
39
|
Ricciardi L, Apps M, Little S. Uncovering the neurophysiology of mood, motivation and behavioral symptoms in Parkinson's disease through intracranial recordings. NPJ Parkinsons Dis 2023; 9:136. [PMID: 37735477 PMCID: PMC10514046 DOI: 10.1038/s41531-023-00567-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 08/07/2023] [Indexed: 09/23/2023] Open
Abstract
Neuropsychiatric mood and motivation symptoms (depression, anxiety, apathy, impulse control disorders) in Parkinson's disease (PD) are highly disabling, difficult to treat and exacerbated by current medications and deep brain stimulation therapies. High-resolution intracranial recording techniques have the potential to undercover the network dysfunction and cognitive processes that drive these symptoms, towards a principled re-tuning of circuits. We highlight intracranial recording as a valuable tool for mapping and desegregating neural networks and their contribution to mood, motivation and behavioral symptoms, via the ability to dissect multiplexed overlapping spatial and temporal neural components. This technique can be powerfully combined with behavioral paradigms and emerging computational techniques to model underlying latent behavioral states. We review the literature of intracranial recording studies investigating mood, motivation and behavioral symptomatology with reference to 1) emotional processing, 2) executive control 3) subjective valuation (reward & cost evaluation) 4) motor control and 5) learning and updating. This reveals associations between different frequency specific network activities and underlying cognitive processes of reward decision making and action control. If validated, these signals represent potential computational biomarkers of motivational and behavioural states and could lead to principled therapy development for mood, motivation and behavioral symptoms in PD.
Collapse
Affiliation(s)
- Lucia Ricciardi
- Neurosciences Research Centre, Molecular and Clinical Sciences Research Institute, St George's University of London, London, UK.
| | - Matthew Apps
- Centre for Human Brain Health, School of Psychology, University of Birmingham, Birmingham, UK
| | - Simon Little
- Movement Disorders and Neuromodulation Centre, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
40
|
Taswell CA, Janssen M, Murray EA, Averbeck BB. The motivational role of the ventral striatum and amygdala in learning from gains and losses. Behav Neurosci 2023; 137:268-280. [PMID: 37141014 PMCID: PMC10363235 DOI: 10.1037/bne0000558] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
The ventral striatum (VS) and amygdala are two structures often implicated as essential structures for learning. The literature addressing the contribution of these areas to learning, however, is not entirely consistent. We propose that these inconsistencies are due to learning environments and the effect they have on motivation. To differentiate aspects of learning from environmental factors that affect motivation, we ran a series of experiments with varying task factors. We compared monkeys (Macaca mulatta) with VS lesions, amygdala lesions, and unoperated controls on reinforcement learning (RL) tasks that involve learning from both gains and losses as well as from deterministic and stochastic schedules of reinforcement. We found that for all three groups, performance varied by experiment. All three groups modulated their behavior in the same directions, to varying degrees, across the three experiments. This behavioral modulation is why we find deficits in some experiments, but not others. The amount of effort animals exhibited differed depending on the learning environment. Our results suggest that the VS is important for the amount of effort animals will give in rich deterministic and relatively leaner stochastic learning enivornments. We also showed that monkeys with amygdala lesions can learn stimulus-based RL in stochastic environments and environments with loss and conditioned reinforcers. These results show that learning environments shape motivation and that the VS is essential for distinct aspects of motivated behavior. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
Collapse
Affiliation(s)
- Craig A Taswell
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health
| | - Miriam Janssen
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health
| | - Elisabeth A Murray
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health
| |
Collapse
|
41
|
Wesley MJ, Lile JA. Combining noninvasive brain stimulation with behavioral pharmacology methods to study mechanisms of substance use disorder. Front Neurosci 2023; 17:1150109. [PMID: 37554294 PMCID: PMC10405288 DOI: 10.3389/fnins.2023.1150109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 07/06/2023] [Indexed: 08/10/2023] Open
Abstract
Psychotropic drugs and transcranial magnetic stimulation (TMS) are effective for treating certain psychiatric conditions. Drugs and TMS have also been used as tools to explore the relationship between brain function and behavior in humans. Combining centrally acting drugs and TMS has proven useful for characterizing the neural basis of movement. This combined intervention approach also holds promise for improving our understanding of the mechanisms underlying disordered behavior associated with psychiatric conditions, including addiction, though challenges exist. For example, altered neocortical function has been implicated in substance use disorder, but the relationship between acute neuromodulation of neocortex with TMS and direct effects on addiction-related behaviors is not well established. We propose that the combination of human behavioral pharmacology methods with TMS can be leveraged to help establish these links. This perspective article describes an ongoing study that combines the administration of delta-9-tetrahydrocannabinol (THC), the main psychoactive compound in cannabis, with neuroimaging-guided TMS in individuals with problematic cannabis use. The study examines the impact of the left dorsolateral prefrontal cortex (DLPFC) stimulation on cognitive outcomes impacted by THC intoxication, including the subjective response to THC and the impairing effects of THC on behavioral performance. A framework for integrating TMS with human behavioral pharmacology methods, along with key details of the study design, are presented. We also discuss challenges, alternatives, and future directions.
Collapse
Affiliation(s)
- Michael J. Wesley
- Department of Behavioral Science, College of Medicine, University of Kentucky, Lexington, KY, United States
- Department of Psychiatry, College of Medicine, University of Kentucky, Lexington, KY, United States
- Department of Psychology, College of Arts and Sciences, University of Kentucky, Lexington, KY, United States
| | - Joshua A. Lile
- Department of Behavioral Science, College of Medicine, University of Kentucky, Lexington, KY, United States
- Department of Psychiatry, College of Medicine, University of Kentucky, Lexington, KY, United States
- Department of Psychology, College of Arts and Sciences, University of Kentucky, Lexington, KY, United States
| |
Collapse
|
42
|
Kim H, Hur JK, Kwon M, Kim S, Zoh Y, Ahn WY. Causal role of the dorsolateral prefrontal cortex in modulating the balance between Pavlovian and instrumental systems in the punishment domain. PLoS One 2023; 18:e0286632. [PMID: 37267307 PMCID: PMC10237433 DOI: 10.1371/journal.pone.0286632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 05/19/2023] [Indexed: 06/04/2023] Open
Abstract
Previous literature suggests that a balance between Pavlovian and instrumental decision-making systems is critical for optimal decision-making. Pavlovian bias (i.e., approach toward reward-predictive stimuli and avoid punishment-predictive stimuli) often contrasts with the instrumental response. Although recent neuroimaging studies have identified brain regions that may be related to Pavlovian bias, including the dorsolateral prefrontal cortex (dlPFC), it is unclear whether a causal relationship exists. Therefore, we investigated whether upregulation of the dlPFC using transcranial current direct stimulation (tDCS) would reduce Pavlovian bias. In this double-blind study, participants were assigned to the anodal or the sham group; they received stimulation over the right dlPFC for 3 successive days. On the last day, participants performed a reinforcement learning task known as the orthogonalized go/no-go task; this was used to assess each participant's degree of Pavlovian bias in reward and punishment domains. We used computational modeling and hierarchical Bayesian analysis to estimate model parameters reflecting latent cognitive processes, including Pavlovian bias, go bias, and choice randomness. Several computational models were compared; the model with separate Pavlovian bias parameters for reward and punishment domains demonstrated the best model fit. When using a behavioral index of Pavlovian bias, the anodal group showed significantly lower Pavlovian bias in the punishment domain, but not in the reward domain, compared with the sham group. In addition, computational modeling showed that Pavlovian bias parameter in the punishment domain was lower in the anodal group than in the sham group, which is consistent with the behavioral findings. The anodal group also showed a lower go bias and choice randomness, compared with the sham group. These findings suggest that anodal tDCS may lead to behavioral suppression or change in Pavlovian bias in the punishment domain, which will help to improve comprehension of the causal neural mechanism.
Collapse
Affiliation(s)
- Hyeonjin Kim
- Department of Psychology, Seoul National University, Seoul, Korea
| | - Jihyun K. Hur
- Department of Psychology, Yale University, New Haven, Connecticut, United States of America
| | - Mina Kwon
- Department of Psychology, Seoul National University, Seoul, Korea
| | - Soyeon Kim
- Department of Psychology, Seoul National University, Seoul, Korea
| | - Yoonseo Zoh
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Woo-Young Ahn
- Department of Psychology, Seoul National University, Seoul, Korea
- Department of Brain and Cognitive Sciences, Seoul National University, Seoul, Korea
| |
Collapse
|
43
|
Aquino TG, Cockburn J, Mamelak AN, Rutishauser U, O'Doherty JP. Neurons in human pre-supplementary motor area encode key computations for value-based choice. Nat Hum Behav 2023; 7:970-985. [PMID: 36959327 PMCID: PMC10330469 DOI: 10.1038/s41562-023-01548-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 02/02/2023] [Indexed: 03/25/2023]
Abstract
Adaptive behaviour in real-world environments requires that choices integrate several variables, including the novelty of the options under consideration, their expected value and uncertainty in value estimation. Here, to probe how integration over decision variables occurs during decision-making, we recorded neurons from the human pre-supplementary motor area (preSMA), ventromedial prefrontal cortex and dorsal anterior cingulate. Unlike the other areas, preSMA neurons not only represented separate pre-decision variables for each choice option but also encoded an integrated utility signal for each choice option and, subsequently, the decision itself. Post-decision encoding of variables for the chosen option was more widely distributed and especially prominent in the ventromedial prefrontal cortex. Our findings position the human preSMA as central to the implementation of value-based decisions.
Collapse
Affiliation(s)
- Tomas G Aquino
- Computation and Neural Systems, Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA.
- Department of Neurosurgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
| | - Jeffrey Cockburn
- Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
| | - Adam N Mamelak
- Department of Neurosurgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Ueli Rutishauser
- Computation and Neural Systems, Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
- Department of Neurosurgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - John P O'Doherty
- Computation and Neural Systems, Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
- Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
| |
Collapse
|
44
|
Ren P, Hou G, Ma M, Zhuang Y, Huang J, Tan M, Wu D, Luo G, Zhang Z, Rong H. Enhanced putamen functional connectivity underlies altered risky decision-making in age-related cognitive decline. Sci Rep 2023; 13:6619. [PMID: 37095127 PMCID: PMC10126002 DOI: 10.1038/s41598-023-33634-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 04/16/2023] [Indexed: 04/26/2023] Open
Abstract
Risky decision-making is critical to survival and development, which has been compromised in elderly populations. However, the neural substrates of altered financial risk-taking behavior in aging are still under-investigated. Here we examined the intrinsic putamen network in modulating risk-taking behaviors of Balloon Analogue Risk Task in healthy young and older adults using resting-state fMRI. Compared with the young group, the elderly group showed significantly different task performance. Based on the task performance, older adults were further subdivided into two subgroups, showing young-like and over-conservative risk behaviors, regardless of cognitive decline. Compared with young adults, the intrinsic pattern of putamen connectivity was significantly different in over-conservative older adults, but not in young-like older adults. Notably, age-effects on risk behaviors were mediated via the putamen functional connectivity. In addition, the putamen gray matter volume showed significantly different relationships with risk behaviors and functional connectivity in over-conservative older adults. Our findings suggest that reward-based risky behaviors might be a sensitive indicator of brain aging, highlighting the critical role of the putamen network in maintaining optimal risky decision-making in age-related cognitive decline.
Collapse
Affiliation(s)
- Ping Ren
- Department of Geriatric Psychiatry, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China.
| | - Gangqiang Hou
- Department of Radiology, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China
| | - Manxiu Ma
- Queensland Brain Institute, University of Queensland, St. Lucia, QLD, Australia
| | - Yuchuan Zhuang
- Department of Electrical and Computer Engineering, University of Rochester, Rochester, NY, USA
| | - Jiayin Huang
- Department of Geriatric Psychiatry, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China
| | - Meiling Tan
- Department of Geriatric Psychiatry, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China
| | - Donghui Wu
- Department of Geriatric Psychiatry, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China
| | - Guozhi Luo
- Department of Geriatric Psychiatry, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China
| | - Zhiguo Zhang
- School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, Guangdong, China
| | - Han Rong
- Department of Psychiatry, Shenzhen Mental Health Center/Shenzhen Kangning Hospital, Shenzhen, Guangdong, China.
| |
Collapse
|
45
|
Xu T, Zhou X, Kanen JW, Wang L, Li J, Chen Z, Zhang R, Jiao G, Zhou F, Zhao W, Yao S, Becker B. Angiotensin blockade enhances motivational reward learning via enhancing striatal prediction error signaling and frontostriatal communication. Mol Psychiatry 2023; 28:1692-1702. [PMID: 36810437 DOI: 10.1038/s41380-023-02001-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/23/2023]
Abstract
Adaptive human learning utilizes reward prediction errors (RPEs) that scale the differences between expected and actual outcomes to optimize future choices. Depression has been linked with biased RPE signaling and an exaggerated impact of negative outcomes on learning which may promote amotivation and anhedonia. The present proof-of-concept study combined computational modeling and multivariate decoding with neuroimaging to determine the influence of the selective competitive angiotensin II type 1 receptor antagonist losartan on learning from positive or negative outcomes and the underlying neural mechanisms in healthy humans. In a double-blind, between-subjects, placebo-controlled pharmaco-fMRI experiment, 61 healthy male participants (losartan, n = 30; placebo, n = 31) underwent a probabilistic selection reinforcement learning task incorporating a learning and transfer phase. Losartan improved choice accuracy for the hardest stimulus pair via increasing expected value sensitivity towards the rewarding stimulus relative to the placebo group during learning. Computational modeling revealed that losartan reduced the learning rate for negative outcomes and increased exploitatory choice behaviors while preserving learning for positive outcomes. These behavioral patterns were paralleled on the neural level by increased RPE signaling in orbitofrontal-striatal regions and enhanced positive outcome representations in the ventral striatum (VS) following losartan. In the transfer phase, losartan accelerated response times and enhanced VS functional connectivity with left dorsolateral prefrontal cortex when approaching maximum rewards. These findings elucidate the potential of losartan to reduce the impact of negative outcomes during learning and subsequently facilitate motivational approach towards maximum rewards in the transfer of learning. This may indicate a promising therapeutic mechanism to normalize distorted reward learning and fronto-striatal functioning in depression.
Collapse
Affiliation(s)
- Ting Xu
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.,MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Xinqi Zhou
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.,MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Jonathan W Kanen
- Department of Psychology, University of Cambridge, Cambridge, UK.,Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, UK
| | - Lan Wang
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.,MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Jialin Li
- Max Planck School of Cognition, Leipzig, Germany
| | - Zhiyi Chen
- Faculty of Psychology, Southwest University, Chongqing, China.,Key Laboratory of Cognition and Personality, Ministry of Education, Chongqing, China
| | - Ran Zhang
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.,MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Guojuan Jiao
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.,MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Feng Zhou
- Faculty of Psychology, Southwest University, Chongqing, China.,Key Laboratory of Cognition and Personality, Ministry of Education, Chongqing, China
| | - Weihua Zhao
- MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Shuxia Yao
- MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Benjamin Becker
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China. .,MOE Key Laboratory for Neuroinformation, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China.
| |
Collapse
|
46
|
Fornaro S, Vallesi A. Functional connectivity abnormalities of brain networks in obsessive–compulsive disorder: a systematic review. CURRENT PSYCHOLOGY 2023. [DOI: 10.1007/s12144-023-04312-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
Abstract
Obsessive-compulsive disorder (OCD) is characterized by cognitive abnormalities encompassing several executive processes. Neuroimaging studies highlight functional abnormalities of executive fronto-parietal network (FPN) and default-mode network (DMN) in OCD patients, as well as of the prefrontal cortex (PFC) more specifically. We aim at assessing the presence of functional connectivity (FC) abnormalities of intrinsic brain networks and PFC in OCD, possibly underlying specific computational impairments and clinical manifestations. A systematic review of resting-state fMRI studies investigating FC was conducted in unmedicated OCD patients by querying three scientific databases (PubMed, Scopus, PsycInfo) up to July 2022 (search terms: “obsessive–compulsive disorder” AND “resting state” AND “fMRI” AND “function* *connect*” AND “task-positive” OR “executive” OR “central executive” OR “executive control” OR “executive-control” OR “cognitive control” OR “attenti*” OR “dorsal attention” OR “ventral attention” OR “frontoparietal” OR “fronto-parietal” OR “default mode” AND “network*” OR “system*”). Collectively, 20 studies were included. A predominantly reduced FC of DMN – often related to increased symptom severity – emerged. Additionally, intra-network FC of FPN was predominantly increased and often positively related to clinical scores. Concerning PFC, a predominant hyper-connectivity of right-sided prefrontal links emerged. Finally, FC of lateral prefrontal areas correlated with specific symptom dimensions. Several sources of heterogeneity in methodology might have affected results in unpredictable ways and were discussed. Such findings might represent endophenotypes of OCD manifestations, possibly reflecting computational impairments and difficulties in engaging in self-referential processes or in disengaging from cognitive control and monitoring processes.
Collapse
|
47
|
Anticevic A, Halassa MM. The thalamus in psychosis spectrum disorder. Front Neurosci 2023; 17:1163600. [PMID: 37123374 PMCID: PMC10133512 DOI: 10.3389/fnins.2023.1163600] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 03/21/2023] [Indexed: 05/02/2023] Open
Abstract
Psychosis spectrum disorder (PSD) affects 1% of the world population and results in a lifetime of chronic disability, causing devastating personal and economic consequences. Developing new treatments for PSD remains a challenge, particularly those that target its core cognitive deficits. A key barrier to progress is the tenuous link between the basic neurobiological understanding of PSD and its clinical phenomenology. In this perspective, we focus on a key opportunity that combines innovations in non-invasive human neuroimaging with basic insights into thalamic regulation of functional cortical connectivity. The thalamus is an evolutionary conserved region that forms forebrain-wide functional loops critical for the transmission of external inputs as well as the construction and update of internal models. We discuss our perspective across four lines of evidence: First, we articulate how PSD symptomatology may arise from a faulty network organization at the macroscopic circuit level with the thalamus playing a central coordinating role. Second, we discuss how recent animal work has mechanistically clarified the properties of thalamic circuits relevant to regulating cortical dynamics and cognitive function more generally. Third, we present human neuroimaging evidence in support of thalamic alterations in PSD, and propose that a similar "thalamocortical dysconnectivity" seen in pharmacological imaging (under ketamine, LSD and THC) in healthy individuals may link this circuit phenotype to the common set of symptoms in idiopathic and drug-induced psychosis. Lastly, we synthesize animal and human work, and lay out a translational path for biomarker and therapeutic development.
Collapse
Affiliation(s)
- Alan Anticevic
- School of Medicine, Yale University, New Haven, CT, United States
- *Correspondence: Alan Anticevic,
| | - Michael M. Halassa
- Department of Neuroscience, Tufts University School of Medicine, Boston, MA, United States
- Michael M. Halassa,
| |
Collapse
|
48
|
Emanuel A, Eldar E. Emotions as computations. Neurosci Biobehav Rev 2023; 144:104977. [PMID: 36435390 PMCID: PMC9805532 DOI: 10.1016/j.neubiorev.2022.104977] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 10/26/2022] [Accepted: 11/22/2022] [Indexed: 11/26/2022]
Abstract
Emotions ubiquitously impact action, learning, and perception, yet their essence and role remain widely debated. Computational accounts of emotion aspire to answer these questions with greater conceptual precision informed by normative principles and neurobiological data. We examine recent progress in this regard and find that emotions may implement three classes of computations, which serve to evaluate states, actions, and uncertain prospects. For each of these, we use the formalism of reinforcement learning to offer a new formulation that better accounts for existing evidence. We then consider how these distinct computations may map onto distinct emotions and moods. Integrating extensive research on the causes and consequences of different emotions suggests a parsimonious one-to-one mapping, according to which emotions are integral to how we evaluate outcomes (pleasure & pain), learn to predict them (happiness & sadness), use them to inform our (frustration & content) and others' (anger & gratitude) actions, and plan in order to realize (desire & hope) or avoid (fear & anxiety) uncertain outcomes.
Collapse
Affiliation(s)
- Aviv Emanuel
- Department of Psychology, Hebrew University of Jerusalem, Jerusalem 9190501, Israel; Department of Cognitive and Brain Sciences, Hebrew University of Jerusalem, Jerusalem 9190501, Israel.
| | - Eran Eldar
- Department of Psychology, Hebrew University of Jerusalem, Jerusalem 9190501, Israel; Department of Cognitive and Brain Sciences, Hebrew University of Jerusalem, Jerusalem 9190501, Israel.
| |
Collapse
|
49
|
Pearce AL, Fuchs BA, Keller KL. The role of reinforcement learning and value-based decision-making frameworks in understanding food choice and eating behaviors. Front Nutr 2022; 9:1021868. [PMID: 36483928 PMCID: PMC9722736 DOI: 10.3389/fnut.2022.1021868] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 11/04/2022] [Indexed: 11/23/2022] Open
Abstract
The obesogenic food environment includes easy access to highly-palatable, energy-dense, "ultra-processed" foods that are heavily marketed to consumers; therefore, it is critical to understand the neurocognitive processes the underlie overeating in response to environmental food-cues (e.g., food images, food branding/advertisements). Eating habits are learned through reinforcement, which is the process through which environmental food cues become valued and influence behavior. This process is supported by multiple behavioral control systems (e.g., Pavlovian, Habitual, Goal-Directed). Therefore, using neurocognitive frameworks for reinforcement learning and value-based decision-making can improve our understanding of food-choice and eating behaviors. Specifically, the role of reinforcement learning in eating behaviors was considered using the frameworks of (1) Sign-versus Goal-Tracking Phenotypes; (2) Model-Free versus Model-Based; and (3) the Utility or Value-Based Model. The sign-and goal-tracking phenotypes may contribute a mechanistic insight on the role of food-cue incentive salience in two prevailing models of overconsumption-the Extended Behavioral Susceptibility Theory and the Reactivity to Embedded Food Cues in Advertising Model. Similarly, the model-free versus model-based framework may contribute insight to the Extended Behavioral Susceptibility Theory and the Healthy Food Promotion Model. Finally, the value-based model provides a framework for understanding how all three learning systems are integrated to influence food choice. Together, these frameworks can provide mechanistic insight to existing models of food choice and overconsumption and may contribute to the development of future prevention and treatment efforts.
Collapse
Affiliation(s)
- Alaina L. Pearce
- Social Science Research Institute, Pennsylvania State University, University Park, PA, United States
- Department of Nutritional Sciences, Pennsylvania State University, University Park, PA, United States
| | - Bari A. Fuchs
- Department of Nutritional Sciences, Pennsylvania State University, University Park, PA, United States
| | - Kathleen L. Keller
- Social Science Research Institute, Pennsylvania State University, University Park, PA, United States
- Department of Nutritional Sciences, Pennsylvania State University, University Park, PA, United States
- Department of Food Science, Pennsylvania State University, University Park, PA, United States
| |
Collapse
|
50
|
Colas JT, Dundon NM, Gerraty RT, Saragosa‐Harris NM, Szymula KP, Tanwisuth K, Tyszka JM, van Geen C, Ju H, Toga AW, Gold JI, Bassett DS, Hartley CA, Shohamy D, Grafton ST, O'Doherty JP. Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T. Hum Brain Mapp 2022; 43:4750-4790. [PMID: 35860954 PMCID: PMC9491297 DOI: 10.1002/hbm.25988] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 05/20/2022] [Accepted: 06/10/2022] [Indexed: 11/12/2022] Open
Abstract
The model-free algorithms of "reinforcement learning" (RL) have gained clout across disciplines, but so too have model-based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single reward-prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Instead, the generalized RPE is efficiently relayed for bidirectional counterfactual updating of value estimates for other representations. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal-learning task with hierarchical structure that encouraged inverse generalization across both states and actions. Reflecting inference that could be true, false (i.e., overgeneralization), or absent (i.e., undergeneralization), state generalization distinguished those who learned well more so than action generalization. With high-resolution high-field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus. Factoring in generalization as a multidimensional process in value-based learning, these findings shed light on complexities that, while challenging classic RL, can still be resolved within the bounds of its core computations.
Collapse
Affiliation(s)
- Jaron T. Colas
- Department of Psychological and Brain SciencesUniversity of CaliforniaSanta BarbaraCaliforniaUSA
- Division of the Humanities and Social SciencesCalifornia Institute of TechnologyPasadenaCaliforniaUSA
- Computation and Neural Systems Program, California Institute of TechnologyPasadenaCaliforniaUSA
| | - Neil M. Dundon
- Department of Psychological and Brain SciencesUniversity of CaliforniaSanta BarbaraCaliforniaUSA
- Department of Child and Adolescent Psychiatry, Psychotherapy, and PsychosomaticsUniversity of FreiburgFreiburg im BreisgauGermany
| | - Raphael T. Gerraty
- Department of PsychologyColumbia UniversityNew YorkNew YorkUSA
- Zuckerman Mind Brain Behavior Institute, Columbia UniversityNew YorkNew YorkUSA
- Center for Science and SocietyColumbia UniversityNew YorkNew YorkUSA
| | - Natalie M. Saragosa‐Harris
- Department of PsychologyNew York UniversityNew YorkNew YorkUSA
- Department of PsychologyUniversity of CaliforniaLos AngelesCaliforniaUSA
| | - Karol P. Szymula
- Department of BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Koranis Tanwisuth
- Division of the Humanities and Social SciencesCalifornia Institute of TechnologyPasadenaCaliforniaUSA
- Department of PsychologyUniversity of CaliforniaBerkeleyCaliforniaUSA
| | - J. Michael Tyszka
- Division of the Humanities and Social SciencesCalifornia Institute of TechnologyPasadenaCaliforniaUSA
| | - Camilla van Geen
- Zuckerman Mind Brain Behavior Institute, Columbia UniversityNew YorkNew YorkUSA
- Department of PsychologyUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Harang Ju
- Neuroscience Graduate GroupUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Arthur W. Toga
- Laboratory of Neuro ImagingUSC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern CaliforniaLos AngelesCaliforniaUSA
| | - Joshua I. Gold
- Department of NeuroscienceUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Dani S. Bassett
- Department of BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Department of Electrical and Systems EngineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Department of NeurologyUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Department of PsychiatryUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Department of Physics and AstronomyUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Santa Fe InstituteSanta FeNew MexicoUSA
| | - Catherine A. Hartley
- Department of PsychologyNew York UniversityNew YorkNew YorkUSA
- Center for Neural ScienceNew York UniversityNew YorkNew YorkUSA
| | - Daphna Shohamy
- Department of PsychologyColumbia UniversityNew YorkNew YorkUSA
- Zuckerman Mind Brain Behavior Institute, Columbia UniversityNew YorkNew YorkUSA
- Kavli Institute for Brain ScienceColumbia UniversityNew YorkNew YorkUSA
| | - Scott T. Grafton
- Department of Psychological and Brain SciencesUniversity of CaliforniaSanta BarbaraCaliforniaUSA
| | - John P. O'Doherty
- Division of the Humanities and Social SciencesCalifornia Institute of TechnologyPasadenaCaliforniaUSA
- Computation and Neural Systems Program, California Institute of TechnologyPasadenaCaliforniaUSA
| |
Collapse
|