1
|
Suzuki T, Joho D, Kakeyama M. Purposive decision-making task in mice using touchscreen operant apparatus. Neurosci Res 2024; 200:34-40. [PMID: 37758027 DOI: 10.1016/j.neures.2023.09.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 09/19/2023] [Accepted: 09/22/2023] [Indexed: 10/03/2023]
Abstract
Purposive decision-making, based on sensory input and memory, is a component of executive functioning. Evaluating executive functioning is crucial for understanding neuropsychiatric disorders and brain injuries. However, there's a lack of mouse tests for this purpose. To address this, we developed a novel touchscreen task to assess purposive decision-making in mice. In the present task, the mice had to touch the correct window (left or right), with a visual stimulus as a cue for decision-making. The mice gradually acquired a relationship between the visual stimuli and the action they should take. Each mouse made the correct choice more than 80% of the time based on the visual cue and memory and knowledge of themselves. We could clearly determine when the mice saw the visual cue. The present task offers a valuable tool for investigating the neural mechanisms behind decision-making.
Collapse
Affiliation(s)
- Takeru Suzuki
- Laboratory for Environmental Brain Sciences, Graduate School of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama 359-1192, Japan
| | - Daisuke Joho
- Laboratory for Environmental Brain Sciences, Graduate School of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama 359-1192, Japan
| | - Masaki Kakeyama
- Laboratory for Environmental Brain Sciences, Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama 359-1192, Japan.
| |
Collapse
|
2
|
Schamiloglu S, Wu H, Zhou M, Kwan AC, Bender KJ. Dynamic Foraging Behavior Performance Is Not Affected by Scn2a Haploinsufficiency. eNeuro 2023; 10:ENEURO.0367-23.2023. [PMID: 38151324 PMCID: PMC10755640 DOI: 10.1523/eneuro.0367-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/23/2023] [Accepted: 11/14/2023] [Indexed: 12/29/2023] Open
Abstract
Dysfunction in the gene SCN2A, which encodes the voltage-gated sodium channel Nav1.2, is strongly associated with neurodevelopmental disorders including autism spectrum disorder and intellectual disability (ASD/ID). This dysfunction typically manifests in these disorders as a haploinsufficiency, where loss of one copy of a gene cannot be compensated for by the other allele. Scn2a haploinsufficiency affects a range of cells and circuits across the brain, including associative neocortical circuits that are important for cognitive flexibility and decision-making behaviors. Here, we tested whether Scn2a haploinsufficiency has any effect on a dynamic foraging task that engages such circuits. Scn2a +/- mice and wild-type (WT) littermates were trained on a choice behavior where the probability of reward between two options varied dynamically across trials and where the location of the high reward underwent uncued reversals. Despite impairments in Scn2a-related neuronal excitability, we found that both male and female Scn2a +/- mice performed these tasks as well as wild-type littermates, with no behavioral difference across genotypes in learning or performance parameters. Varying the number of trials between reversals or probabilities of receiving reward did not result in an observable behavioral difference, either. These data suggest that, despite heterozygous loss of Scn2a, mice can perform relatively complex foraging tasks that make use of higher-order neuronal circuits.
Collapse
Affiliation(s)
- Selin Schamiloglu
- Neuroscience Graduate Program, University of California, San Francisco, CA 94158
- Center for Integrative Neuroscience, Department of Neurology, University of California, San Francisco, CA 94158
| | - Hao Wu
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511
| | - Mingkang Zhou
- Neuroscience Graduate Program, University of California, San Francisco, CA 94158
- Center for Integrative Neuroscience, Department of Neurology, University of California, San Francisco, CA 94158
| | - Alex C Kwan
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY 14853
| | - Kevin J Bender
- Center for Integrative Neuroscience, Department of Neurology, University of California, San Francisco, CA 94158
| |
Collapse
|
3
|
Blackwell KT, Doya K. Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks. PLoS Comput Biol 2023; 19:e1011385. [PMID: 37594982 PMCID: PMC10479916 DOI: 10.1371/journal.pcbi.1011385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 09/05/2023] [Accepted: 07/25/2023] [Indexed: 08/20/2023] Open
Abstract
A major advance in understanding learning behavior stems from experiments showing that reward learning requires dopamine inputs to striatal neurons and arises from synaptic plasticity of cortico-striatal synapses. Numerous reinforcement learning models mimic this dopamine-dependent synaptic plasticity by using the reward prediction error, which resembles dopamine neuron firing, to learn the best action in response to a set of cues. Though these models can explain many facets of behavior, reproducing some types of goal-directed behavior, such as renewal and reversal, require additional model components. Here we present a reinforcement learning model, TD2Q, which better corresponds to the basal ganglia with two Q matrices, one representing direct pathway neurons (G) and another representing indirect pathway neurons (N). Unlike previous two-Q architectures, a novel and critical aspect of TD2Q is to update the G and N matrices utilizing the temporal difference reward prediction error. A best action is selected for N and G using a softmax with a reward-dependent adaptive exploration parameter, and then differences are resolved using a second selection step applied to the two action probabilities. The model is tested on a range of multi-step tasks including extinction, renewal, discrimination; switching reward probability learning; and sequence learning. Simulations show that TD2Q produces behaviors similar to rodents in choice and sequence learning tasks, and that use of the temporal difference reward prediction error is required to learn multi-step tasks. Blocking the update rule on the N matrix blocks discrimination learning, as observed experimentally. Performance in the sequence learning task is dramatically improved with two matrices. These results suggest that including additional aspects of basal ganglia physiology can improve the performance of reinforcement learning models, better reproduce animal behaviors, and provide insight as to the role of direct- and indirect-pathway striatal neurons.
Collapse
Affiliation(s)
- Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason University, Fairfax, Virginia, United States of America
| | - Kenji Doya
- Neural Computation Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| |
Collapse
|
4
|
Russman Block S, Norman LJ, Zhang X, Mannella KA, Yang H, Angstadt M, Abelson JL, Himle JA, Taylor SF, Fitzgerald KD. Resting-State Connectivity and Response to Psychotherapy Treatment in Adolescents and Adults With OCD: A Randomized Clinical Trial. Am J Psychiatry 2023; 180:89-99. [PMID: 36475374 PMCID: PMC10956516 DOI: 10.1176/appi.ajp.21111173] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
OBJECTIVE Cortical-subcortical hyperconnectivity related to affective-behavioral integration and cortical network hypoconnectivity related to cognitive control have been demonstrated in obsessive-compulsive disorder (OCD); the study objective was to examine whether these connectivity patterns predict treatment response. METHODS Adolescents (ages 12-17) and adults (ages 24-45) were randomly assigned to 12 sessions of exposure and response prevention (ERP) or stress management therapy (SMT), an active control. Before treatment, resting-state connectivity of ventromedial prefrontal cortical (vmPFC), cingulo-opercular, frontoparietal, and subcortical regions was assessed with functional MRI. OCD severity was assessed with the Yale-Brown Obsessive Compulsive Scale before, during, and after treatment. Usable fMRI and longitudinal symptom data were obtained from 116 patients (68 female; 54 adolescents; 60 medicated). RESULTS ERP produced greater decreases in symptom scores than SMT. ERP was selectively associated with less vmPFC-subcortical (caudate and thalamus) connectivity in both age groups and primarily in unmedicated participants. Greater symptom improvement with both ERP and SMT was associated with greater cognitive-control (cingulo-opercular and frontoparietal) and subcortical (putamen) connectivity across age groups. Developmental specificity was observed across ERP and SMT treatments, such that greater improvements with ERP than SMT were associated with greater frontoparietal-subcortical (nucleus accumbens) connectivity in adolescents but greater connectivity between frontoparietal regions in adults. Comparison of response-predictive connections revealed no significant differences compared with a matched healthy control group. CONCLUSIONS The results suggest that less vmPFC-subcortical connectivity related to affect-influenced behavior may be important for ERP engagement, whereas greater cognitive-control and motor circuit connectivity may generally facilitate response to psychotherapy. Finally, neural predictors of treatment response may differ by age.
Collapse
Affiliation(s)
- Stefanie Russman Block
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Luke J Norman
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Xiaoxi Zhang
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Kristin A Mannella
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Huan Yang
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Mike Angstadt
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - James L Abelson
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Joseph A Himle
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Stephan F Taylor
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| | - Kate D Fitzgerald
- Department of Psychiatry (Russman Block, Norman, Zhang, Mannella, Angstadt, Abelson, Himle, Taylor, Fitzgerald) and School of Social Work (Himle), University of Michigan, Ann Arbor; Changzhi Medical College, Changzhi, China (Zhang); Department of Psychiatry, Second Xiangya Hospital, Central South University, Changsha, China (Yang); Department of Psychiatry, Columbia University, and New York State Psychiatric Institute, New York (Fitzgerald)
| |
Collapse
|
5
|
Yamanaka K, Waki H. Conditional Regulation of Blood Pressure in Response to Emotional Stimuli by the Central Nucleus of the Amygdala in Rats. Front Physiol 2022; 13:820112. [PMID: 35721563 PMCID: PMC9198497 DOI: 10.3389/fphys.2022.820112] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 04/12/2022] [Indexed: 11/17/2022] Open
Abstract
Humans and animals can determine whether a situation is favorable to them and act accordingly. For this, the autonomic tuning of the cardiovascular system to supply energy to active skeletal muscles through the circulatory system is as important as motor control. However, how the autonomic cardiovascular responses are regulated in dynamically changing environments and the neuronal mechanisms underlying these responses remain unclear. To resolve these issues, we recorded the blood pressure and heart rate of head-restrained rats during dynamically changing appetitive and aversive classical conditioning tasks. The rats displayed various associations between conditioned stimuli and unconditioned stimuli in appetitive (sucrose water), neutral (no outcome), and aversive (air puff) blocks. The blood pressure and heart rate in the appetitive block gradually increased in response to the reward-predicting cue and the response to the actual reward vigorously increased. The reward-predictive response was significantly higher than the responses obtained in the neutral and aversive condition blocks. To investigate whether the reward-predictive pressor response was caused by orofacial movements such as anticipatory licking behavior, we separately analyzed high- and low-licking trials. The conditioned pressor response was observed even in trials with low-licking behaviors. Blood pressure and heart rate responses to the air puff-predicting cue in the aversive block were not significantly different from the responses in the neutral block. The conditioned blood pressure response rapidly changed with condition block switching. Furthermore, to examine the contribution of the amygdala as an emotion center to these conditioned responses, we bilaterally microinjected a GABAA receptor agonist, muscimol, into the central nucleus of the amygdala. Pharmacological inactivation of the central nucleus of the amygdala significantly decreased the reward-predictive pressor responses. These results suggest that the blood pressure is adaptively and rapidly regulated by emotional conditioned stimuli and that the central nucleus of the amygdala participates in regulating the pressor response in dynamically changing situations.
Collapse
Affiliation(s)
- Ko Yamanaka
- Department of Physiology, Graduate School of Health and Sports Science, Juntendo University, Inzai, Japan
| | - Hidefumi Waki
- Department of Physiology, Graduate School of Health and Sports Science, Juntendo University, Inzai, Japan.,Institute of Health and Sports Science and Medicine, Juntendo University, Inzai, Japan
| |
Collapse
|
6
|
Ohta H, Satori K, Takarada Y, Arake M, Ishizuka T, Morimoto Y, Takahashi T. The asymmetric learning rates of murine exploratory behavior in sparse reward environments. Neural Netw 2021; 143:218-229. [PMID: 34157646 DOI: 10.1016/j.neunet.2021.05.030] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2021] [Revised: 04/16/2021] [Accepted: 05/26/2021] [Indexed: 11/29/2022]
Abstract
Goal-oriented behaviors of animals can be modeled by reinforcement learning algorithms. Such algorithms predict future outcomes of selected actions utilizing action values and updating those values in response to the positive and negative outcomes. In many models of animal behavior, the action values are updated symmetrically based on a common learning rate, that is, in the same way for both positive and negative outcomes. However, animals in environments with scarce rewards may have uneven learning rates. To investigate the asymmetry in learning rates in reward and non-reward, we analyzed the exploration behavior of mice in five-armed bandit tasks using a Q-learning model with differential learning rates for positive and negative outcomes. The positive learning rate was significantly higher in a scarce reward environment than in a rich reward environment, and conversely, the negative learning rate was significantly lower in the scarce environment. The positive to negative learning rate ratio was about 10 in the scarce environment and about 2 in the rich environment. This result suggests that when the reward probability was low, the mice tend to ignore failures and exploit the rare rewards. Computational modeling analysis revealed that the increased learning rates ratio could cause an overestimation of and perseveration on rare-rewarding events, increasing total reward acquisition in the scarce environment but disadvantaging impartial exploration.
Collapse
Affiliation(s)
- Hiroyuki Ohta
- Department of Pharmacology, National Defense Medical College, Saitama, 359-8513, Japan.
| | | | - Yu Takarada
- Tokyo Denki University, Saitama, 350-0394, Japan
| | - Masashi Arake
- Department of Physiology, National Defense Medical College, Saitama, 359-8513, Japan
| | - Toshiaki Ishizuka
- Department of Pharmacology, National Defense Medical College, Saitama, 359-8513, Japan
| | - Yuji Morimoto
- Department of Physiology, National Defense Medical College, Saitama, 359-8513, Japan
| | | |
Collapse
|
7
|
Trombetta-Lima M, Assis-Ribas T, Cintra RC, Campeiro JD, Guerreiro JR, Winnischofer SMB, Nascimento ICC, Ulrich H, Hayashi MAF, Sogayar MC. Impact of Reck expression and promoter activity in neuronal in vitro differentiation. Mol Biol Rep 2021; 48:1985-1994. [PMID: 33619662 DOI: 10.1007/s11033-021-06175-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 01/20/2021] [Indexed: 02/07/2023]
Abstract
Reck (REversion-inducing Cysteine-rich protein with Kazal motifs) tumor suppressor gene encodes a multifunctional glycoprotein which inhibits the activity of several matrix metalloproteinases (MMPs), and has the ability to modulate the Notch and canonical Wnt pathways. Reck-deficient neuro-progenitor cells undergo precocious differentiation; however, modulation of Reck expression during progression of the neuronal differentiation process is yet to be characterized. In the present study, we demonstrate that Reck expression levels are increased during in vitro neuronal differentiation of PC12 pheochromocytoma cells and P19 murine teratocarcinoma cells and characterize mouse Reck promoter activity during this process. Increased Reck promoter activity was found upon induction of differentiation in PC12 cells, in accordance with its increased mRNA expression levels in mouse in vitro models. Interestingly, Reck overexpression, prior to the beginning of the differentiation protocol, led to diminished efficiency of the neuronal differentiation process. Taken together, our findings suggest that increased Reck expression at early stages of differentiation diminishes the number of neuron-like cells, which are positive for the beta-3 tubulin marker. Our data highlight the importance of Reck expression evaluation to optimize in vitro neuronal differentiation protocols.
Collapse
Affiliation(s)
- Marina Trombetta-Lima
- Núcleo de Terapia Celular e Molecular (NUCEL), Faculdade de Medicina, Universidade de São Paulo (USP), Rua Pangaré, 100 (Cidade Universitária), São Paulo, SP, 05360-130, Brazil
| | - Thais Assis-Ribas
- Núcleo de Terapia Celular e Molecular (NUCEL), Faculdade de Medicina, Universidade de São Paulo (USP), Rua Pangaré, 100 (Cidade Universitária), São Paulo, SP, 05360-130, Brazil
| | - Ricardo C Cintra
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo (USP), São Paulo, SP, 05508-000, Brazil
| | - Joana D Campeiro
- Departamento de Farmacologia, Escola Paulista de Medicina (EPM), Universidade Federal de São Paulo (UNIFESP), Rua 3 de Maio 100, Ed INFAR, 3º andar, São Paulo, SP, 04044-020, Brazil
| | - Juliano R Guerreiro
- Faculdade de Farmácia, Universidade Paulista (UNIP), São Paulo, SP, 05347-020, Brazil
| | - Sheila M B Winnischofer
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Paraná (UFPR), Curitiba, PR, 81531-990, Brazil
- Departamento de Biologia Celular, Universidade Federal do Paraná (UFPR), Curitiba, PR, 81531-990, Brazil
| | - Isis C C Nascimento
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo (USP), São Paulo, SP, 05508-000, Brazil
| | - Henning Ulrich
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo (USP), São Paulo, SP, 05508-000, Brazil
| | - Mirian A F Hayashi
- Departamento de Farmacologia, Escola Paulista de Medicina (EPM), Universidade Federal de São Paulo (UNIFESP), Rua 3 de Maio 100, Ed INFAR, 3º andar, São Paulo, SP, 04044-020, Brazil.
| | - Mari C Sogayar
- Núcleo de Terapia Celular e Molecular (NUCEL), Faculdade de Medicina, Universidade de São Paulo (USP), Rua Pangaré, 100 (Cidade Universitária), São Paulo, SP, 05360-130, Brazil.
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo (USP), São Paulo, SP, 05508-000, Brazil.
| |
Collapse
|
8
|
Bari BA, Cohen JY. Dynamic decision making and value computations in medial frontal cortex. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2021; 158:83-113. [PMID: 33785157 DOI: 10.1016/bs.irn.2020.12.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Dynamic decision making requires an intact medial frontal cortex. Recent work has combined theory and single-neuron measurements in frontal cortex to advance models of decision making. We review behavioral tasks that have been used to study dynamic decision making and algorithmic models of these tasks using reinforcement learning theory. We discuss studies linking neurophysiology and quantitative decision variables. We conclude with hypotheses about the role of other cortical and subcortical structures in dynamic decision making, including ascending neuromodulatory systems.
Collapse
Affiliation(s)
- Bilal A Bari
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD, United States
| | - Jeremiah Y Cohen
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD, United States.
| |
Collapse
|
9
|
Enomoto K, Matsumoto N, Inokawa H, Kimura M, Yamada H. Topographic distinction in long-term value signals between presumed dopamine neurons and presumed striatal projection neurons in behaving monkeys. Sci Rep 2020; 10:8912. [PMID: 32488042 PMCID: PMC7265398 DOI: 10.1038/s41598-020-65914-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 05/12/2020] [Indexed: 12/15/2022] Open
Abstract
Nigrostriatal dopamine (DA) projections are anatomically organized along the dorsolateral-ventromedial axis, conveying long-term value signals to the striatum for shaping actions toward multiple future rewards. The present study examines whether the topographic organization of long-term value signals are observed upon activity of presumed DA neurons and presumed striatal projection neurons (phasically active neurons, PANs), as predicted based on anatomical literature. Our results indicate that DA neurons in the dorsolateral midbrain encode long-term value signals on a short timescale, while ventromedial midbrain DA neurons encode such signals on a relatively longer timescale. Activity of the PANs in the dorsal striatum is more heterogeneous for encoding long-term values, although significant differences in long-term value signals were observed between the caudate nucleus and putamen. These findings suggest that topographic DA signals for long-term values are not simply transferred to striatal neurons, possibly due to the contribution of other projections to the striatum.
Collapse
Affiliation(s)
- Kazuki Enomoto
- Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto, 602-8566, Japan.,Center for Information and Neural Networks, National Institute of Information and Communications Technology, Osaka, 565-0871, Japan.,Brain Science Institute, Tamagawa University, Machida, Tokyo, 194-8610, Japan
| | - Naoyuki Matsumoto
- Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto, 602-8566, Japan.,Division of Food and Health Sciences, Faculty of Environmental and Symbiotic Sciences, Prefectural University of Kumamoto, Kumamoto, 862-8502, Japan
| | - Hitoshi Inokawa
- Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto, 602-8566, Japan
| | - Minoru Kimura
- Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto, 602-8566, Japan.,Brain Science Institute, Tamagawa University, Machida, Tokyo, 194-8610, Japan
| | - Hiroshi Yamada
- Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto, 602-8566, Japan. .,Graduate School of Comprehensive Human Sciences, University of Tsukuba, 1-1-1 Tenno-dai, Tsukuba, Ibaraki, 305-8577, Japan. .,Transborder Medical Research Center, University of Tsukuba, 1-1-1 Tenno-dai, Tsukuba, Ibaraki, 305-8577, Japan. .,Division of Biomedical Science, Faculty of Medicine, University of Tsukuba, 1-1-1 Tenno-dai, Tsukuba, Ibaraki, 305-8577, Japan.
| |
Collapse
|
10
|
Bari BA, Grossman CD, Lubin EE, Rajagopalan AE, Cressy JI, Cohen JY. Stable Representations of Decision Variables for Flexible Behavior. Neuron 2019; 103:922-933.e7. [PMID: 31280924 DOI: 10.1016/j.neuron.2019.06.001] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 05/03/2019] [Accepted: 05/31/2019] [Indexed: 12/25/2022]
Abstract
Decisions occur in dynamic environments. In the framework of reinforcement learning, the probability of performing an action is influenced by decision variables. Discrepancies between predicted and obtained rewards (reward prediction errors) update these variables, but they are otherwise stable between decisions. Although reward prediction errors have been mapped to midbrain dopamine neurons, it is unclear how the brain represents decision variables themselves. We trained mice on a dynamic foraging task in which they chose between alternatives that delivered reward with changing probabilities. Neurons in the medial prefrontal cortex, including projections to the dorsomedial striatum, maintained persistent firing rate changes over long timescales. These changes stably represented relative action values (to bias choices) and total action values (to bias response times) with slow decay. In contrast, decision variables were weakly represented in the anterolateral motor cortex, a region necessary for generating choices. Thus, we define a stable neural mechanism to drive flexible behavior.
Collapse
Affiliation(s)
- Bilal A Bari
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Cooper D Grossman
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Emily E Lubin
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Adithya E Rajagopalan
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Jianna I Cressy
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Jeremiah Y Cohen
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
| |
Collapse
|
11
|
Monitoring and Updating of Action Selection for Goal-Directed Behavior through the Striatal Direct and Indirect Pathways. Neuron 2018; 99:1302-1314.e5. [PMID: 30146299 DOI: 10.1016/j.neuron.2018.08.002] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Revised: 06/29/2018] [Accepted: 08/01/2018] [Indexed: 12/28/2022]
Abstract
The basal ganglia play key roles in adaptive behaviors guided by reward and punishment. However, despite accumulating knowledge, few studies have tested how heterogeneous signals in the basal ganglia are organized and coordinated for goal-directed behavior. In this study, we investigated neuronal signals of the direct and indirect pathways of the basal ganglia as rats performed a lever push/pull task for a probabilistic reward. In the dorsomedial striatum, we found that optogenetically and electrophysiologically identified direct pathway neurons encoded reward outcomes, whereas indirect pathway neurons encoded no-reward outcome and next-action selection. Outcome coding occurred in association with the chosen action. In support of pathway-specific neuronal coding, light activation induced a bias on repeat selection of the same action in the direct pathway, but on switch selection in the indirect pathway. Our data reveal the mechanisms underlying monitoring and updating of action selection for goal-directed behavior through basal ganglia circuits.
Collapse
|