1
|
Carter F, Cossette MP, Trujillo-Pisanty I, Pallikaras V, Breton YA, Conover K, Caplan J, Solis P, Voisard J, Yaksich A, Shizgal P. Does phasic dopamine release cause policy updates? Eur J Neurosci 2024; 59:1260-1277. [PMID: 38039083 DOI: 10.1111/ejn.16199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/21/2023] [Accepted: 11/01/2023] [Indexed: 12/03/2023]
Abstract
Phasic dopamine activity is believed to both encode reward-prediction errors (RPEs) and to cause the adaptations that these errors engender. If so, a rat working for optogenetic stimulation of dopamine neurons will repeatedly update its policy and/or action values, thus iteratively increasing its work rate. Here, we challenge this view by demonstrating stable, non-maximal work rates in the face of repeated optogenetic stimulation of midbrain dopamine neurons. Furthermore, we show that rats learn to discriminate between world states distinguished only by their history of dopamine activation. Comparison of these results to reinforcement learning simulations suggests that the induced dopamine transients acted more as rewards than RPEs. However, pursuit of dopaminergic stimulation drifted upwards over a time scale of days and weeks, despite its stability within trials. To reconcile the results with prior findings, we consider multiple roles for dopamine signalling.
Collapse
Affiliation(s)
- Francis Carter
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Montreal Institute for Learning Algorithms, Université de Montréal, Montreal, Quebec, Canada
| | | | - Ivan Trujillo-Pisanty
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Department of Psychology, Langara College, Vancouver, British Columbia, Canada
| | | | | | - Kent Conover
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Jill Caplan
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Pavel Solis
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Jacques Voisard
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Alexandra Yaksich
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Peter Shizgal
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| |
Collapse
|
2
|
Bolkan SS, Stone IR, Pinto L, Ashwood ZC, Iravedra Garcia JM, Herman AL, Singh P, Bandi A, Cox J, Zimmerman CA, Cho JR, Engelhard B, Pillow JW, Witten IB. Opponent control of behavior by dorsomedial striatal pathways depends on task demands and internal state. Nat Neurosci 2022; 25:345-357. [PMID: 35260863 PMCID: PMC8915388 DOI: 10.1038/s41593-022-01021-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 01/21/2022] [Indexed: 11/27/2022]
Abstract
A classic view of the striatum holds that activity in direct and indirect pathways oppositely modulates motor output. Whether this involves direct control of movement, or reflects a cognitive process underlying movement, remains unresolved. Here we find that strong, opponent control of behavior by the two pathways of the dorsomedial striatum depends on the cognitive requirements of a task. Furthermore, a latent state model (a hidden Markov model with generalized linear model observations) reveals that-even within a single task-the contribution of the two pathways to behavior is state dependent. Specifically, the two pathways have large contributions in one of two states associated with a strategy of evidence accumulation, compared to a state associated with a strategy of repeating previous choices. Thus, both the demands imposed by a task, as well as the internal state of mice when performing a task, determine whether dorsomedial striatum pathways provide strong and opponent control of behavior.
Collapse
Affiliation(s)
- Scott S Bolkan
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Iris R Stone
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Lucas Pinto
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Zoe C Ashwood
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | | | - Alison L Herman
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Priyanka Singh
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Akhil Bandi
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Julia Cox
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | | | - Jounhong Ryan Cho
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Ben Engelhard
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Jonathan W Pillow
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
- Department of Psychology, Princeton University, Princeton, NJ, USA.
| | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
- Department of Psychology, Princeton University, Princeton, NJ, USA.
| |
Collapse
|